NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome / Metatranscriptome Family F080463

Overview

Basic Information
Family ID F080463
Family Type Metagenome / Metatranscriptome
Number of Sequences 115
Average Sequence Length 196 residues
Representative Sequence MESNVVIIPVDAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLRNTTPDSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRRFAGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSAPGTPLTKSEIKGARASWLFQRFGGRPPEGSR
Number of Associated Samples 102
Number of Associated Scaffolds 115

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 46.09 %
% of genes near scaffold ends (potentially truncated) 40.87 %
% of genes from short scaffolds (< 2000 bps) 78.26 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.32
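
As a rough cross-check, the percentages above can be converted back into gene counts. The sketch below assumes each percentage is computed over all 115 family members, which the page does not state explicitly; the names `FAMILY_SIZE` and `percent_to_count` are illustrative and not part of NMPFamsDB.

```python
# Family size as reported under "Number of Sequences".
FAMILY_SIZE = 115

# Percentages copied from the Quality Assessment block.
quality_percentages = {
    "valid RBS motif": 46.09,
    "near scaffold end": 40.87,
    "short scaffold (< 2000 bps)": 78.26,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of genes for a reported percentage.

    Assumption: the percentage was taken over `total` genes.
    """
    return round(percent * total / 100)


gene_counts = {
    label: percent_to_count(percent)
    for label, percent in quality_percentages.items()
}

for label, count in gene_counts.items():
    print(f"{label}: about {count} of {FAMILY_SIZE} genes")
```

Under that assumption the three values correspond to 53, 47 and 90 genes, respectively.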

Hidden Markov Model logo (rendered with Skylign; image not included)

Most Common Taxonomy
Group Bacteria (56.522 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (45.217 % of family members)
Environment Ontology (ENVO) Unclassified (54.783 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (67.826 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 22.08%, β-sheet: 24.24%, Coil/Unstructured: 53.68%

Predicted 3D Structure

pTM-score: 0.32

Low Quality Model:

This family has a low-confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
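
The cutoff quoted in the note can be written as a one-line check. This is a minimal sketch that uses only the 0.7 threshold and the 0.32 score reported on this page; the function name `is_low_confidence` is hypothetical and does not reflect how NMPFamsDB implements the rule.

```python
# Threshold quoted in the "Low Quality Model" note (pTM < 0.7).
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM score falls below the cutoff."""
    return ptm_score < cutoff


# pTM score reported for family F080463.
family_ptm = 0.32
label = "low confidence" if is_low_confidence(family_ptm) else "higher confidence"
print(f"F080463 (pTM = {family_ptm}): {label}")
```

A score exactly at the cutoff is not flagged, because the note uses a strict inequality.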



Gene Neighborhood

Neighboring Pfam domains

Pfam ID    Name    % Frequency in 115 Family Scaffolds
PF10431    ClpB_D2-small    21.74
PF00413    Peptidase_M10    3.48
PF13582    Reprolysin_3    2.61
PF12680    SnoaL_2    2.61
PF06996    T6SS_TssG    1.74
PF08238    Sel1    0.87
PF00496    SBP_bac_5    0.87
PF12802    MarR_2    0.87
PF14108    ABA4-like    0.87

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name    Functional Category    % Frequency in 115 Family Scaffolds
COG5549    Predicted Zn-dependent protease    Posttranslational modification, protein turnover, chaperones [O]    3.48
COG3520    Predicted component of the type VI protein secretion system    Intracellular trafficking, secretion, and vesicular transport [U]    1.74



Phylogeny

NCBI Taxonomy

Name    Rank    Taxonomy    Distribution
All Organisms    root    All Organisms    57.39 %
Unclassified    root    N/A    42.61 %

Associated Scaffolds


Scaffold    Taxonomy    Length
3300001661|JGI12053J15887_10365857    All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Halobacteria → Haloferacales → Haloferacaceae → Haloferax → Haloferax sulfurifontis    696
3300005327|Ga0070658_10589452    All Organisms → cellular organisms → Bacteria    963
3300005329|Ga0070683_101796567    All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → unclassified Acinetobacter → Acinetobacter sp. ANC 3862    589
3300005336|Ga0070680_100110877    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales    2284
3300005339|Ga0070660_101084469    All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → unclassified Acinetobacter → Acinetobacter sp. ANC 3862    678
3300005344|Ga0070661_100548205    All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → unclassified Acinetobacter → Acinetobacter sp. ANC 3862    930
3300005467|Ga0070706_100087866    All Organisms → cellular organisms → Bacteria    2882
3300005577|Ga0068857_100141631    All Organisms → cellular organisms → Bacteria → Proteobacteria    2174
3300005995|Ga0066790_10172160    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Propionibacteriales → Nocardioidaceae → Nocardioides → unclassified Nocardioides → Nocardioides sp. Iso805N    926
3300006047|Ga0075024_100006557    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    4683
3300006172|Ga0075018_10067298    All Organisms → cellular organisms → Bacteria    1529
3300006237|Ga0097621_101289758    Not Available    689
3300006864|Ga0066797_1029704    All Organisms → cellular organisms → Bacteria    1919
3300007255|Ga0099791_10062692    All Organisms → cellular organisms → Bacteria → Proteobacteria    1676
3300007265|Ga0099794_10008329    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium    4300
3300007788|Ga0099795_10152888    Not Available    946
3300009038|Ga0099829_10048212    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp.    3142
3300009088|Ga0099830_10182004    All Organisms → cellular organisms → Bacteria → Proteobacteria    1635
3300009090|Ga0099827_10164614    All Organisms → cellular organisms → Bacteria → Proteobacteria    1824
3300009143|Ga0099792_10199733    All Organisms → cellular organisms → Bacteria → Proteobacteria    1136
3300009143|Ga0099792_10266508    Not Available    1004
3300009174|Ga0105241_10730033    Not Available    907
3300009545|Ga0105237_10006130    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. WSM1417    13439
3300009551|Ga0105238_10523080    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales    1189
3300010159|Ga0099796_10120265    Not Available    1008
3300010401|Ga0134121_13033488    Not Available    517
3300010877|Ga0126356_10771129    Not Available    671
3300011270|Ga0137391_10231951    All Organisms → cellular organisms → Bacteria → Proteobacteria    1602
3300012096|Ga0137389_10204541    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    1645
3300012096|Ga0137389_10335075    All Organisms → cellular organisms → Bacteria → Proteobacteria    1285
3300012189|Ga0137388_10395918    All Organisms → cellular organisms → Bacteria → Proteobacteria    1278
3300012200|Ga0137382_10100074    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae    1911
3300012202|Ga0137363_10143724    All Organisms → cellular organisms → Bacteria → Proteobacteria    1867
3300012202|Ga0137363_10668032    Not Available    879
3300012203|Ga0137399_10524526    All Organisms → cellular organisms → Bacteria → Proteobacteria    994
3300012205|Ga0137362_10196376    All Organisms → cellular organisms → Bacteria → Proteobacteria    1736
3300012207|Ga0137381_10266354    Not Available    1490
3300012210|Ga0137378_10524284    Not Available    1092
3300012212|Ga0150985_103525067    Not Available    1664
3300012357|Ga0137384_10486284    Not Available    1014
3300012360|Ga0137375_10131168    Not Available    2484
3300012361|Ga0137360_10027443    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales    3895
3300012361|Ga0137360_10050929    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    2983
3300012362|Ga0137361_10601915    Not Available    1007
3300012469|Ga0150984_102447958    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales    1132
3300012469|Ga0150984_113805334    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    2622
3300012683|Ga0137398_10420513    Not Available    911
3300012685|Ga0137397_10379778    All Organisms → cellular organisms → Bacteria → Proteobacteria    1052
3300012923|Ga0137359_10721879    Not Available    867
3300012924|Ga0137413_10079258    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    1992
3300012927|Ga0137416_10094953    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium    2228
3300012927|Ga0137416_11766770    Not Available    565
3300012929|Ga0137404_10591460    Not Available    997
3300012929|Ga0137404_10802819    Not Available    855
3300012930|Ga0137407_10179156    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae    1890
3300012930|Ga0137407_10278403    All Organisms → cellular organisms → Bacteria → Proteobacteria    1524
3300012944|Ga0137410_10048745    All Organisms → cellular organisms → Bacteria → Proteobacteria    3012
3300013297|Ga0157378_11225383    Not Available    790
3300014969|Ga0157376_11754092    Not Available    656
3300015051|Ga0137414_1136966    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium    5572
3300015053|Ga0137405_1156863    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phreatobacteraceae → Phreatobacter → Phreatobacter stygius    1590
3300015242|Ga0137412_10003867    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    11953
3300015264|Ga0137403_10047583    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales    4439
3300015264|Ga0137403_10193295    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae    1972
3300015371|Ga0132258_11215458    Not Available    1905
3300015371|Ga0132258_11890343    Not Available    1503
3300018052|Ga0184638_1019726    All Organisms → cellular organisms → Bacteria    2378
3300018066|Ga0184617_1064467    Not Available    962
3300018075|Ga0184632_10066874    All Organisms → cellular organisms → Bacteria    1562
3300018078|Ga0184612_10010694    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium    4636
3300018468|Ga0066662_12304592    Not Available    566
3300019789|Ga0137408_1406244    Not Available    855
3300020004|Ga0193755_1061178    All Organisms → cellular organisms → Bacteria → Proteobacteria    1224
3300020170|Ga0179594_10014798    All Organisms → cellular organisms → Bacteria → Proteobacteria    2259
3300020170|Ga0179594_10100081    Not Available    1042
3300020170|Ga0179594_10186525    All Organisms → cellular organisms → Bacteria → Proteobacteria    775
3300021086|Ga0179596_10141819    All Organisms → cellular organisms → Bacteria → Proteobacteria    1135
3300024330|Ga0137417_1208917    All Organisms → cellular organisms → Bacteria → Proteobacteria    1161
3300025464|Ga0208076_1018343    Not Available    1204
3300025494|Ga0207928_1003192    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales    3013
3300025509|Ga0208848_1056606    Not Available    831
3300025625|Ga0208219_1082132    Not Available    749
3300025910|Ga0207684_10152434    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. AUGA SZCCT0283    1989
3300025911|Ga0207654_10209871    Not Available    1286
3300025919|Ga0207657_11235672    Not Available    567
3300025920|Ga0207649_11336055    Not Available    567
3300025921|Ga0207652_11263469    Not Available    641
3300025929|Ga0207664_10782146    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    859
3300025932|Ga0207690_10394440    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    1102
3300026078|Ga0207702_10289765    Not Available    1550
3300026294|Ga0209839_10116963    Not Available    900
3300026359|Ga0257163_1037509    Not Available    766
3300026482|Ga0257172_1037081    Not Available    885
3300026507|Ga0257165_1012328    All Organisms → cellular organisms → Bacteria → Proteobacteria    1359
3300026515|Ga0257158_1047760    Not Available    785
3300026538|Ga0209056_10675941    Not Available    521
3300026551|Ga0209648_10154061    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales    1811
3300026920|Ga0208575_1002953    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium    1962
3300027546|Ga0208984_1051099    Not Available    881
3300027655|Ga0209388_1121661    All Organisms → cellular organisms → Bacteria → Proteobacteria    744
3300027671|Ga0209588_1005932    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium japonicum    3522
3300027846|Ga0209180_10009656    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales    4957
3300027882|Ga0209590_10097323    All Organisms → cellular organisms → Bacteria → Proteobacteria    1751
3300027915|Ga0209069_10008616    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    5018
3300028536|Ga0137415_10065831    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales    3487
3300028673|Ga0257175_1009127    All Organisms → cellular organisms → Bacteria → Proteobacteria    1459
3300028792|Ga0307504_10080471    Not Available    999
3300028828|Ga0307312_10500567    Not Available    802
3300031708|Ga0310686_103457324    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    763
3300031938|Ga0308175_102482503    Not Available    580
3300031939|Ga0308174_10296296    Not Available    1273
3300031996|Ga0308176_10301679    Not Available    1560
3300032074|Ga0308173_10431193    Not Available    1167
3300032074|Ga0308173_10996142    Not Available    778
3300033180|Ga0307510_10134864    Not Available    2130
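
The scaffold identifiers listed above appear to combine a numeric prefix and a scaffold name separated by `|`, and the prefixes coincide with values given as Taxon OID under Associated Samples (for example 3300001661). The sketch below splits identifiers on that assumption and counts scaffolds per prefix; the helper name `split_scaffold_id` is hypothetical, and the three identifiers are copied from the listing.

```python
from collections import Counter

# A few identifiers copied from the Associated Scaffolds listing.
scaffold_ids = [
    "3300001661|JGI12053J15887_10365857",
    "3300009143|Ga0099792_10199733",
    "3300009143|Ga0099792_10266508",
]


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<prefix>|<scaffold name>' into its two parts.

    Assumption: the text before the first '|' is the sample's Taxon OID.
    """
    taxon_oid, _, scaffold_name = scaffold_id.partition("|")
    return taxon_oid, scaffold_name


# Count how many family scaffolds share each prefix.
scaffolds_per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)

for taxon_oid, count in scaffolds_per_sample.items():
    print(f"{taxon_oid}: {count} scaffold(s)")
```

Several prefixes occur more than once in the listing, which is consistent with the page reporting 115 scaffolds from 102 samples.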




Environmental Properties

Associated Habitat Types

Habitat    Taxonomy    Distribution
Vadose Zone Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil    45.22%
Corn Rhizosphere    Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere    5.22%
Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil    4.35%
Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil    4.35%
Groundwater Sediment    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment    3.48%
Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil    3.48%
Arctic Peat Soil    Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil    3.48%
Corn Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere    3.48%
Watersheds    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds    2.61%
Soil    Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil    2.61%
Corn Rhizosphere    Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere    2.61%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil    1.74%
Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil    1.74%
Corn, Switchgrass And Miscanthus Rhizosphere    Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere    1.74%
Arabidopsis Rhizosphere    Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere    1.74%
Corn Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere    1.74%
Miscanthus Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere    1.74%
Avena Fatua Rhizosphere    Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere    1.74%
Terrestrial Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil    0.87%
Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil    0.87%
Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil    0.87%
Agricultural Soil    Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil    0.87%
Avena Fatua Rhizosphere    Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere    0.87%
Miscanthus Rhizosphere    Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere    0.87%
Ectomycorrhiza    Host-Associated → Plants → Roots → Unclassified → Unclassified → Ectomycorrhiza    0.87%
Boreal Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil    0.87%



Associated Samples

Taxon OID    Sample Name    Habitat Type
3300001661    Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)    Environmental
3300005327    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C1-3 metaG    Host-Associated
3300005329    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG    Environmental
3300005336    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG    Environmental
3300005339    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG    Host-Associated
3300005344    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-3 metaG    Host-Associated
3300005467    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG    Environmental
3300005577    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2    Host-Associated
3300005995    Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-050    Environmental
3300006047    Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013    Environmental
3300006172    Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014    Environmental
3300006237    Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)    Host-Associated
3300006864    Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 3 DNA2013-193    Environmental
3300007255    Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1    Environmental
3300007265    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1    Environmental
3300007788    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2    Environmental
3300009038    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG    Environmental
3300009088    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG    Environmental
3300009090    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG    Environmental
3300009143    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2    Environmental
3300009174    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG    Host-Associated
3300009545    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG    Host-Associated
3300009551    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG    Host-Associated
3300010159    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3    Environmental
3300010401    Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1    Environmental
3300010877    Boreal forest soil eukaryotic communities from Alaska, USA - W3-2 Metatranscriptome (Eukaryote Community Metatranscriptome)    Environmental
3300011270    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG    Environmental
3300012096    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG    Environmental
3300012189    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG    Environmental
3300012200    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG    Environmental
3300012202    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG    Environmental
3300012203    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG    Environmental
3300012205    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG    Environmental
3300012207    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG    Environmental
3300012210    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG    Environmental
3300012212    Combined assembly of Hopland grassland soil    Host-Associated
3300012357    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG    Environmental
3300012360    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG    Environmental
3300012361    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG    Environmental
3300012362    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG    Environmental
3300012469    Combined assembly of Soil carbon rhizosphere    Host-Associated
3300012683    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG    Environmental
3300012685    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG    Environmental
3300012923    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG    Environmental
3300012924    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)    Environmental
3300012927    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)    Environmental
3300012929    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)    Environmental
3300012930    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)    Environmental
3300012944    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)    Environmental
3300013297    Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG    Host-Associated
3300014969    Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaG    Host-Associated
3300015051    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (PacBio error correction)    Environmental
3300015053    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)    Environmental
3300015242    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)    Environmental
3300015264    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)    Environmental
3300015371    Combined assembly of cpr5 and col0 rhizosphere and soil    Host-Associated
3300018052    Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2    Environmental
3300018066    Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1    Environmental
3300018075    Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1    Environmental
3300018078    Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coex    Environmental
3300018468    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111    Environmental
3300019789    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)    Environmental
3300020004    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2    Environmental
3300020170    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)    Environmental
3300021086    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)    Environmental
3300024330    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)    Environmental
3300025464    Arctic peat soil from Barrow, Alaska - NGEE Surface sample 210-3 shallow-072012 (SPAdes)    Environmental
3300025494    Arctic peat soil from Barrow, Alaska - NGEE Surface sample 210-2 shallow-072012 (SPAdes)    Environmental
3300025509    Arctic peat soil from Barrow, Alaska - NGEE Surface sample 210-1 shallow-072012 (SPAdes)    Environmental
3300025625    Arctic peat soil from Barrow, Alaska - NGEE Surface sample 53-2 shallow-072012 (SPAdes)    Environmental
3300025910    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)    Environmental
3300025911    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG (SPAdes)    Host-Associated
3300025919    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes)    Host-Associated
3300025920    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-3 metaG (SPAdes)    Host-Associated
3300025921    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes)    Environmental
3300025929    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)    Environmental
3300025932    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-3 metaG (SPAdes)    Host-Associated
3300026078    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 (SPAdes)    Host-Associated
3300026294    Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-050 (SPAdes)    Environmental
3300026359    Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-A    Environmental
3300026482    Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-B    Environmental
3300026507    Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-B    Environmental
3300026515    Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-A    Environmental
3300026538    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)    Environmental
3300026551    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)    Environmental
3300026920    Forest soil microbial communities from Willamette National Forest, Oregon, USA, amended with Nitrogen - NN397 (SPAdes)    Environmental
3300027546    Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M3 (SPAdes)    Environmental
3300027655    Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)    Environmental
3300027671    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)    Environmental
3300027846    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)    Environmental
3300027882    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)    Environmental
3300027915    Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)    Environmental
3300028536    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)    Environmental
3300028673    Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-B    Environmental
3300028792    Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S    Environmental
3300028828    Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202    Environmental
3300031708    FICUS49499 Metagenome Czech Republic combined assembly    Environmental
3300031938    Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1    Environmental
3300031939    Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.P.R2    Environmental
3300031996    Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R2    Environmental
3300032074Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.P.R1EnvironmentalOpen in IMG/M
3300033180Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 12_EMHost-AssociatedOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12053J15887_103658571 3300001661 Forest Soil HDYRANVRRKEGDLVIIPVDAHLLTLTISTPDIVERPVRVGRETHMQRERVMRSRDIPTTRDAASLASSFKEVNKIFGAADIEFRLQNTTTDPIEAPSGSEAVDDRDFLMLASRFPMKNAVSLLLLHRFKGAEGGASVEKLGVCAVDDNSPDTALAHEFGHLLGLEHQGDIRDLMNAGLSPPGTPLTAREISAARASALAKRFAPRSSP*
Ga0070658_105894521 3300005327 Corn Rhizosphere MERHVVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGT
Ga0070683_1017965671 3300005329 Corn Rhizosphere LTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN*
Ga0070680_1001108774 3300005336 Corn Rhizosphere VVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN*
Ga0070660_1010844691 3300005339 Corn Rhizosphere PEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN*
Ga0070661_1005482051 3300005344 Corn Rhizosphere VVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDTSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN*
Ga0070706_1000878662 3300005467 Corn, Switchgrass And Miscanthus Rhizosphere MDSDVVIIPVDAHLLTLTISKFEIIERPVSVGSTTHMQRERVQRKSEIQTQRTAATLMVSFTAVNKIFRAADIEFRLRNTTSESAEAPKGSEALDDEGFFMLASGFPMNNKVVSLLLVRKFAGSEGGASAEELGVCAVGDSSPDTALAHEFGHLLGLEHQGDIRDLMNPGLSAPGTPLTGSEIVAARASPLARRFGAPR*
Ga0068857_1001416312 3300005577 Corn Rhizosphere VVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVGAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN*
Ga0066790_101721601 3300005995 Soil MRSRRSHANYDFGTLAESPWMENDVGNVVIPVDAHLLTLTISKFKIVERPVHVGRETRMQRERLQTKDEIPTQRSKASLVLSFAEVNKIFAAADIEFRLRNVTSEPVEAPKGAEALDDEGLLMLAKDYPMKDAVSLLLVRRFAGSEGGASKEELGVCAVGDTSPDTALAHEFGHLLGLDHQGDIRDLMNPGLSPPGTPLTAGEIRDARKSRLAKRFGARASQGPS*
Ga0075024_1000065574 3300006047 Watersheds VVIIPVDAHLLTLTISTPEIVERPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFSAADIEFRLRKTTSDSVEAPKGSEALDDNGLYMLAAGFPMNDAVSLLLVRRFAGSEGGASVEKLGVCAVGDSSPDTALAHEFGHLLGLEHQGDIRDLMNPGLSAPGTPLTSSEITDVRASRLFQRFGGRPSEGSH*
Ga0075018_100672982 3300006172 Watersheds VVVIPVDAHLLTLTISTPQIVERPVRVGRETRMQRERIQTTREIKTQRSAASLTLSFVEVNKIFGAADIEFRLRNTTPESVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRRFAGSEGGASKEKLGVCAIGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSPPGTPLTSSEITGARASWLFQRFGGRPSKGSS*
Ga0097621_1012897581 3300006237 Miscanthus Rhizosphere VVIIPVDAHLLTLTITKAEIVERPIRVGRETRMQRERVQTKREIQTQRSKASLALSFAEVNKVFGAADIEFRLRNTTPEVAEAPKGSEALDDEGFLMLAKDFPMTDAVSLLLVRRFVGSEGGASEEKLGVCAVGDSSPDTALAHEFGHLLALGHQGDIRDLMNRGLSPPGTPLTA
Ga0066797_10297042 3300006864 Soil MRSRRSHANYDFGTLAESPWMENDVGNVVIPVDAHLLTLTISKFKIVERPVHVGRETRMQRERLQSKDEIPTQRSKASLVLSFAEVNKIFAAADIEFRLRNVTSEPVEAPKGAEALDDEGLLMLAKDYPMKDAVSLLLVRRFAGSEGGASKEELGVCAVGDTSPDTALAHEFGHLLGLDHQGDIRDLMNPGLSPPGTPLTAGEIRDARKSRLAKRFGARASQGPS*
Ga0099791_100626922 3300007255 Vadose Zone Soil MESDVVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTIRSAASLISSFKEVNKIFGAADIEFRLQHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTKKEIGNVRKSPLARQFGGRPSKG
Ga0099794_100083292 3300007265 Vadose Zone Soil MESDVVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTIRSAASLISSFKEVNKIFGAADIEFRLQHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTKKEIGDVRKSPLARQFGGRPSKG*
Ga0099795_101528882 3300007788 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLRNTTSDSVEAPKGSEALDDEGFLMLAAGFPMSNAVSLLLVRRFVGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSAPGTPLTKSEVKGARASWLFQRFGGRPSERSR*
Ga0099829_100482122 3300009038 Vadose Zone Soil MESNVVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTTRSAASLISSFKEVNKIFGAADIEFRLRYTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASKKELRVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTRKEIGDVRKSPLARQFGGRPSKG*
Ga0099830_101820042 3300009088 Vadose Zone Soil MESNVVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTTRSAASLISSFKEVNKIFGAADIEFRLRYTTSDLVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASKKELRVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTRKEIGDVRKSPLARQFGGRPSKG*
Ga0099827_101646141 3300009090 Vadose Zone Soil MESDLVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFTAVNKIFRAADIEFRLRNTSSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARASPLARRFGASR*
Ga0099792_101997332 3300009143 Vadose Zone Soil MESNVVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTTRSAASLISSFKEVNKIFGAADIEFRLRYTTSDLVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELRVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTRKEIGDVRKSPLARQFGGRPSKG*
Ga0099792_102665081 3300009143 Vadose Zone Soil MESDVVIIPVDAHLLTLTISTPEIVMRPVRMGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLRNTTSDSVEAPKGSEALDDEGFLMLAAGFPMSNAVSLLLVRRFVGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSAPGTPLTKSEIKGARASWLFQRFGGRPSERSR*
Ga0105241_107300331 3300009174 Corn Rhizosphere MITIPVDAHLLTLTITTYKIVARPVRVGRETRMQRERIEAKDEFPTRRSAASLRLSFMEANKSFAAAGIEFQLRNVSSESAEAPKGSDALDDEGFLMLAKQFPMSDAVSLLLVRRFGGKEGGASKKELGVCAVGDVASAAALAHEFGHLLELEHQGDIRDLMNPSLSPDGTPLTAPEMETARRSKLATRFGARPT*
Ga0105237_100061309 3300009545 Corn Rhizosphere MITIPVDAHLLTLTITTYKIVERPVRVGRETRMQRERIEAKDEFPTRRSAASLRLSFMEANKSFAAAGIEFQLRNVSSESAEAPKGSDALDDEGFLMLAKQFPMSDAVSLLLVRRFGGKEGGASKKELGVCAVGDVASAAALAHEFGHLLELEHQGDIRDLMNPSLSPDGTPLTAPEMETARRSKLATRFGARPT*
Ga0105238_105230802 3300009551 Corn Rhizosphere MERHVVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN*
Ga0099796_101202651 3300010159 Vadose Zone Soil MESDVVIIPVDAHLLTLTISTPEIVMRPVRMGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKVFGAADIEFRLRNTTSGSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRRFAGSEGGASNEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSAPG
Ga0134121_130334881 3300010401 Terrestrial Soil VIIPVDAHLLTLTISKAEIVERPVRVGRETRMQRERIQTKREIPTQRSKASLTLSFVEVNKIFGAADIEFRLRNVTSEPAEAPKGSEALDDEGFLMLAKDFPMTDAVSLLLVRRFAGSEGGASEEKLGVCAVGDSSPDTALAHEFGHLLGLDHQGDIRDLMNPGLSPPGTPL
Ga0126356_107711291 3300010877 Boreal Forest Soil ESDVVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTIRSADSLISSFKEVNKIFGAADIQFRLRHTTSDPVEAPKGSEALDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTRKEIGDVRKSPLARQFGGRPSRG*
Ga0137391_102319512 3300011270 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVSVGSTTHMQRERVQRKSEIQTQRTAASLMVSFTAVSKIFRAADIDFRLRNTSSESVEAPKGSEALDDEGFLMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSADTALAHEFGHLLGLAHQGDIRDLMNRGLSVPGAPLTRSEIATARASPLARRFGVSR*
Ga0137389_102045412 3300012096 Vadose Zone Soil MESDVVVIPVEAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLRNTTSGSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRRFAGSEGGASNEKLGVCAVGDSSPDTALAHAFGHLLGLDHQGDIRDLMNPGLSAPGTPLTSSEITGVRASRLFQRFGGRPSKDSR*
Ga0137389_103350752 3300012096 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVSVGSTTHIQRERVQRKNEIQTQRTAASLMVSFTAVSKIFRAADIDFRLRNTSSESVEAPKGSEALDDEGFLMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSADTALAHEFGHLLGLEHQGDIRDLMNRGLSVPGAPLTRSEIATARASPLARRFGVSR*
Ga0137388_103959181 3300012189 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVSVGSTTHMQRERVQRKSEIQTQRTAASLMVSFTAVSKIFRAADIDFRLRNTSSESVEAPKGSEALDDEGFLMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSADTALAHEFGHLLGLAHQGDIRDLMNRGLSVPGAPLTRGEIATARASPL
Ga0137382_101000742 3300012200 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVRVGRETRMQRERIHRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLLKTTSASVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRKFAGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSAPGTPLTKSEIKGARASWLFQRFGGRPPEGSR*
Ga0137363_101437243 3300012202 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFMAVNKIFRAADIEFRLRNTTSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARASPLARRFGATR*
Ga0137363_106680321 3300012202 Vadose Zone Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETHMQRERIQRTREIKTIRSADSLISSFKEVNKIFGAADIEFRLRNTTSDPVEAPKGSEDLDDEGFLMLAKDFPMRNAVSLLLVRRFVGSEGGASKKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTKKEIGDVRKSPLARQFGGRPSKG*
Ga0137399_105245262 3300012203 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFMAVNKIFRAADIEFRLRNTSSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELSVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARAS
Ga0137362_101963762 3300012205 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFMAVNKIFRAADIEFRLRNTTSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARASPLARRFGASR*
Ga0137381_102663542 3300012207 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLLKTTPDSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRSFAGSEGGASKEKLGVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNRGLSAPGTPLTSSEIKGARASWLFQRFGGRPSEGSR*
Ga0137378_105242842 3300012210 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLRKTTSESVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRRFTGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMSRGLSAPGTPLTKSEIKGARASWLFQRFGGRPSEGSR*
Ga0150985_1035250674 3300012212 Avena Fatua Rhizosphere MPAPFPTAHTQGEWILVIIPVDAHLLTLTISTPEIVERQVRQGRETHVQRERVMRSREIPTTRDAASLASSFKEVNKIFGAAAIEFRLRNTTTDPIEAPNGSEAIDDPGFLMMATKFPMNNAVSLLLFHRFKGAEGGASLEKLSVCAVDDYSPDTALAHEFGHLLGLEHQGDIRDLMNPGLSPPGTPLTAREIAAARKSALAKRFAARSS*
Ga0137384_104862842 3300012357 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLRKTTSESVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRRFTGSEGGASKEKLGVCAVGDGSPETALAHEFGHLLVLEHQGDIRDLMNRGLSAPGTPLTSSEIKGARASWLFQRFGGRPSEGSR*
Ga0137375_101311683 3300012360 Vadose Zone Soil ETHMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLRKTTSESVEAPKGSEALDDEGFLMLAAGFPMNDAVSLLLVRRFAGSEGGASKEKLGVCAIGDSSPDTALAHEFGHLLGLEHQGDIRDLMSRGLSAPGTPLTKSEIRGARASWLFQRFGGRPSERSR*
Ga0137360_100274431 3300012361 Vadose Zone Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETHMQRERIQRTREVKTIRSAASLISSFKEVNKIFGAADIEFRLRHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASKKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTRKEIGDVRKSPLARRFGGRPSKG*
Ga0137360_100509293 3300012361 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFMAVNKIFRAADIEFRLRNTTSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLKRREIVAARASPLARRFGASR*
Ga0137361_106019151 3300012362 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFTAVNKIFRAADIEFRLRNTTSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARASPLARRFGASR*
Ga0150984_1024479582 3300012469 Avena Fatua Rhizosphere VEGDVVIIPVDAHLLTLTISKFEIVERPVQVGRETRMQRERIQSKREIQTQRSKASLALSFIEVNKIFGAADIEFRLRNVTSEPAEAPKGSEALDDEGFLMLAKDFPMTDAVSLLLVRRFAGAEGGASEEKLGVCAVGDTSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSPPGTPLTPREIADARRSRLAKRFGARPAQDPG*
Ga0150984_1138053343 3300012469 Avena Fatua Rhizosphere MPAPFPTAHTQGEWILVIIPVDAHLLTLTISTPEIVERQVRQGRETHVQRERVMRSREIPTTRDAASLASSFKEVNKIFGAAAIEFRLRNTTTDPIEAPNGSEAIDDPGFLMLATKFPMNNAVSLLLFHRFKGAEGGASLEKLSVCAVDDYSPDTALAHEFGHLLGLEHQGDIRDLMNPGLSPPGTPLTAREIAAARKSALAKRFAARSS*
Ga0137398_104205132 3300012683 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVHVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLRNTTSDSVEAPKGSEALDDEGFLMLAAGFPMSNAVSLLLVRRFVGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMSRGLSAPGTPLTKSEIKGARASWLFQRFGGRPSEGYR*
Ga0137397_103797781 3300012685 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFMAVNKIFRAADIEFRLRNTTSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELSVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARASPLARRFGASR*
Ga0137359_107218791 3300012923 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVHVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLLKTTPDSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRSFAGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMSRGLSAPGTPLTKSEIKGARASWLFQRFGGRPSERSR*
Ga0137413_100792582 3300012924 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAASLMVSFTAVSKIFRAADIDFRLRNTSSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELSVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARASPLARRFGASR*
Ga0137416_100949533 3300012927 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFMAVNKIFRAADIEFRLRNTTSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELSVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSE
Ga0137416_117667701 3300012927 Vadose Zone Soil SDVVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTIRSAASLISSFKEVNKIFGAADIEFRLQHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTQKEIGDVRKSPL
Ga0137404_105914601 3300012929 Vadose Zone Soil AHLLTLTISKFEIIERPVSVGSTTHIQRERVQRKNEIQTQRTAATLMVSFTTVNKIFRAADIEFRLRNTSSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARASPLARRFGASR*
Ga0137404_108028191 3300012929 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLLKTTPDSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRKFAGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMSRGLSAPGTPLTKSEIKGARASWLFQRFGGRPPERSR*
Ga0137407_101791562 3300012930 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVHVGRETRMQRERVQRTREIKTQRSAASLTLSFMEVNKIFGAADIEFRLLKTTPDSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRSFAGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMSRGLSAPGTPLTKSEIKGARASWLFQRFGGRPSEGSR*
Ga0137407_102784031 3300012930 Vadose Zone Soil MEINVVIIPVDAHLLTLTISTPEIVEHPVRVGRETHMQRERIQRTREIKTIRSAASLISSFKEVNKIFGAADIEFRLRHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPG
Ga0137410_100487453 3300012944 Vadose Zone Soil MESNVVIIPVGAHLLTLTISTPEIVEHPVRVGRETHMQRERIQRTREIKTIRSAASLISSFKEVNKIFGAADMEFRLRHTTSDPVEAPNGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPASRFPARP*
Ga0157378_112253831 3300013297 Miscanthus Rhizosphere LENDVVIIPVDAHLLTLTISKAEIVERPVRVGHETRMQRERVQTKREIQTQRSKASLTLSFVEVNKVFSAADIEFRLRNVTCEPAEAPKGSEALDDEGFLMLAKDFPMTDAVSLLLVRRFAGSEGGASEEKLGVCAVGDSSPDTALAHEFGHLLGLAHQGDIRDLMNPGLSPPGTPLTPREIADARKSRLAKRFGARPTQDPG*
Ga0157376_117540921 3300014969 Miscanthus Rhizosphere MQRERIQTKRDIPTQRSKASLTLSFVEMNKIFGAADIEFRLRNVTSEPAEAPKGSEALDDEGFLMLAKDFPMTDAVSLLLVRRFAGSEGGASEEKLGVCAVGDSSPDTALAHEFGHLLGLAHQGDIRDLMNPGLSPPGTPLTASEIADARKSKLAKRFGARPPQDPG*
Ga0137414_11369662 3300015051 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLRNTTPDSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRSFAGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSAPGTPLTKSEIKGARASWLFQRFGGRPPEGSR*
Ga0137405_11568632 3300015053 Vadose Zone Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETHMQRERIQRTREVKTIRSAASLISSFKEVNKIFGAADIEFRLRHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLMLVRRFVGSEGGASKKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRGFDEPRPRGSRRALDAKGN*
Ga0137412_100038672 3300015242 Vadose Zone Soil MESNVVIIPVDAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLRNTTPDSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRRFAGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSAPGTPLTKSEIKGARASWLFQRFGGRPPEGSR*
Ga0137403_100475832 3300015264 Vadose Zone Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETHMQRERIQRTREIKTIRSADSLISSFKEVNKIFGAADIEFRLRNTTSDPVEAPKGSEDLDDEGFLMLAKDFPMRNAVSLLLVRRFVGSEGGASKKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTRKEIGDVRKSPLARRFGGRPSKG*
Ga0137403_101932952 3300015264 Vadose Zone Soil MESNVVIIPVEAHLLTLTISTPEIVMRPVHVGRETRMQRERVQRTREIKTQRSAASLTLSFMEVNKIFGAADIEFRLLKTTPDSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRSFAGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMSRGLSAPGTPLTKSEIKGARASWLFQRFGGRPPERSR*
Ga0132258_112154582 3300015371 Arabidopsis Rhizosphere VDAHLLTLTIKTPEIVERPVRVGKTTHMQRERIQRTREIATKRNAASLRSSFMEVNKIFGAADIEFRLRNVTSEPVEAPKGSEALDDEGFLMLAKDFPMSDAVSLLLVRQFEGSEGGASKEELGVCAVGDGSPDTAIAHEFGHLLGLEHQGDIRDLMNRGLSPPGTPLTAREIADARKSRLAKRFGARPPQDPR*
Ga0132258_118903431 3300015371 Arabidopsis Rhizosphere MERVVVVIPVDAHLLTLTISTPEIVQRPVRVGRETRMQRERVMRTREIPTTRNAASLALSFKEVNKIFGAAEIEFRLRNTTTEPIEAPKGAEVLDDDGFFMLASSFPMKNAVSLLLVRRFAGSEGGASVEKLGVCAVGDNSPDTALAHEFGHLLGLEHQGDIRDLMNPGLSPPGTPLTAREISDARASRLARRFGGPTSP*
Ga0184638_10197262 3300018052 Groundwater Sediment DAHLLTLTISTPQIVERPVRVGRETRMQRERIQRTREIQTQRSAASLILSFMEVNKIFRAADIEFRLRKTTSESVEAPKGSEALDDDGFYMLASGFPMNKAVSLLLVRRFAGSEGGASVEKLGVCAVGDSSPDTALAHEFGHLLGLEHQGDIRDLMNPGLSAPGTPLTSREITAARASRLVQQFGGRPSEGSR
Ga0184617_10644672 3300018066 Groundwater Sediment MESNVVIIPVEAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLLKTTSASVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRKFAGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMSRGLSAPGTPLTKSEIKGARASWLFQRFGGRPSERSR
Ga0184632_100668742 3300018075 Groundwater Sediment VNCPINRSNAILFQVGARRTSQHDAEALPSRNVPENEASEACNRLTRNSCMESEVVIIPVDAHLLTLTISTPQIVERPVRVGRETRMQRERIQRTREIQTQRSAASLILSFMEVNKIFGAADIEFRLRKTTSESVEAPKGSEALDDDGFYMLASGFPMNKDVSLLLVRRFAGSEGGASIEKLGVCAVGDSSPDTALAHEFGHLLGLEHQGDIRDLMNPGLSAPGTPLTSREITAARASRLVQQFGGRPSEGSR
Ga0184612_100106941 3300018078 Groundwater Sediment VVIIPVDAHLLTLTISTPQIVERPVRVGRETRMQRERIQRTREIQTQRSAASLILSFMEVNKIFGAADIEFRLRKTTSESVEAPKGSEALDDDGFYMLASGFPMNKAVSLLLVRRFAGSEGGASVEKLGVCAVGDSSPDTALAHEFGHLLGLEHQGDIRDLMNPGLSAPGTPLTSREITAARASRLVQQFGGRPSEGSR
Ga0066662_123045921 3300018468 Grasslands Soil YCVGLMRLIDGDRYGFHHRIIREMDLVIIPVDAHLLTLTISTPEIVERQIRVGKETHIQRERVMRSREIPTTRDAASLASSFKEVNKVFGAADIEFRLRNTTTDPIEAPNGSECVDDPGFLMLASKFPMNNAVSLLLIHRFKGDEGGASVEKLAVCAVDDNSPDTALAHELGNLLGLKHQGDISDLMN
Ga0137408_14062442 3300019789 Vadose Zone Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETHMQRERIQRTREIKTIRSAASLISSFKEVNKIFGAADIEFRLRHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRVVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTKKEIGDVRKSPLARQFGGRPSKG
Ga0193755_10611782 3300020004 Soil VVVIPVDAHLLTLTISTPQIVERPVRVGRETRMQRERIQTTREIKTQRSAASLTLSFVEVNKIFGAADIEFRLRNTTPESVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRKFAGSEGGASKEKLGVCAIGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSPPGTPLTSSEITGARASWLFQRFGGRPSKGSS
Ga0179594_100147985 3300020170 Vadose Zone Soil VVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFMAVNKIFRAADIEFRLRNTTSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELSVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARASPLARRFGASR
Ga0179594_101000812 3300020170 Vadose Zone Soil MEIVVVIIPVEAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFGAADIEFRLLKTTSDSVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRSFAGSEGGASKEKLGVCAVGDGSPDTALAHEFGHLLGLEHQGDIRDLMSRGLSAPGTPLTKSEIKGARASWLFQRFGGRPSEGSR
Ga0179594_101865252 3300020170 Vadose Zone Soil MWSSCRWTPNLLTLTISTPEIVEHPVRVGRETHMQRERIQRTREVKTIRSAASLISSFKEVNKIFGAADIEFRLRHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFAGSEGGASKKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTKKEIGDVRKSPLARQFGGRPSKG
Ga0179596_101418191 3300021086 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVSVGSTTHMQRERVQRKSEIQTQRTAASLMVSFTAVSKIFRAADIDFRLRNTSSESVEAPKGSEALDDEGFLMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSADTALAHEFGHLLGLEHQGDIRDLMNRGLSVPGAPLTRSEIATARASPLARRFGASR
Ga0137417_12089171 3300024330 Vadose Zone Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTIRSAASLISSFKEVNKIFGAADIEFRLQHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHRGDIRDLMNPGLAV
Ga0208076_10183431 3300025464 Arctic Peat Soil MESVVTNVVIPVDAHLLTLTISKFKIVERPVQVGRETRMQRERLQTKDEIPTQRSKASLTLSFAEVNKIFAAADIEFRLRNVTSEPAEAPKGSEALDDEGFLMLAKDFPMKDGISLLLVRRFVGSEGGASKEELGVCAVGDSSPDTALIHEFGHLLGLGGCLPDANAHRVGEREHARERRRTDPAARRVDDPGERPRV
Ga0207928_10031923 3300025494 Arctic Peat Soil MESVVTNVVIPVDAHLLTLTISKFKIVERPVQVGRETRMQRERLQTKDEIPTQRSKASLTLSFAEVNKIFAAADIEFRLRNVTSEPAEAPKGSEALDDEGFLMLAKDFPMKDGISLLLVRRFVGSEGGASKEELGVCAVGDSSPDTALIHEFGHLLGLGHQGDIRDLMNRGLSAPGTPLTAREINDARKSRLAQRFGARPPQAPG
Ga0208848_10566062 3300025509 Arctic Peat Soil MESVVTNVVIPVDAHLLTLTISKFKIVERPVQVGRETRMQRERLQTKDEIPTQRSKASLTLSFAEVNKIFAAADIEFRLRNVTSEPAEAPKGSEALDDEGFLMLAKDFPMKDGISLLLVRRFVGSEGGASKEELGVCAVGDSSPDTALIHEFGHLLGLGHQGDIRDLMNRGLSAPGTPLTAREIND
Ga0208219_10821321 3300025625 Arctic Peat Soil MESVVANVVIPVDAHLLTLTISKFKIVERPVRVGRETRMQRERLQTKDEIPTQRSKASLTLSFAEVNKIFAAADIEFRLRNVTSEPAEAPKGSEALDDEGFLMLAKDFPMKDGISLLLVRRFVGSEGGASKEELGVCAVGDSSPDTALIHEFGHLLGLGHQGDIRDLMNRGLS
Ga0207684_101524342 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere HLLTLTISKFEIIERPVSVGSTTHMQRERVQRKSEIQTQRTAATLMVSFTAVNKIFRAADIEFRLRNTTSESAEAPKGSEALDDEGFFMLASGFPMNNKVVSLLLVRKFAGSEGGASAEELGVCAVGDSSPDTALAHEFGHLLGLEHQGDIRDLMNPGLSAPGTPLTGSEIVAARASPLARRFGAPR
Ga0207654_102098712 3300025911 Corn Rhizosphere MITIPVDAHLLTLTITTYKIVERPVRVGRETRMQRERIEAKDEFPTRRSAASLRLSFMEANKSFAAAGIEFQLRNVSSESAEAPKGSDALDDEGFLMLAKQFPMSDAVSLLLVRRFGGKEGGASKKELGVCAVGDVASAAALAHEFGHLLELEHQGDIRDLMNPSLSPDGTPLTAPEMETARRSKLATRFGARPT
Ga0207657_112356721 3300025919 Corn Rhizosphere TLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN
Ga0207649_113360551 3300025920 Corn Rhizosphere VVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDTSDNAIAHEFGHLLGLQHQGDIRDLMNKGLS
Ga0207652_112634691 3300025921 Corn Rhizosphere SMERHVVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN
Ga0207664_107821462 3300025929 Agricultural Soil VVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGL
Ga0207690_103944402 3300025932 Corn Rhizosphere VVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN
Ga0207702_102897651 3300026078 Corn Rhizosphere VLSYQSMERHVVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN
Ga0209839_101169631 3300026294 Soil MRSRRSHANYDFGTLAESPWMENDVGNVVIPVDAHLLTLTISKFKIVERPVHVGRETRMQRERLQTKDEIPTQRSKASLVLSFAEVNKIFAAADIEFRLRNVTSEPVEAPKGAEALDDEGLLMLAKDYPMKDAVSLLLVRRFAGSEGGASKEELGVCAVGDTSPDTALAHEFGHLLGLDHQGDIRDLMNPGLSPPGTPLTAGEIRDARKSRLAKRFGARASQGPS
Ga0257163_10375092 3300026359 Soil STPEIVEHPVRVGRETRMQRERIQRTREIKTTRSAASLISSFKEVNKIFGAADIEFRLRYTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRQFVGSEGGASKKELSVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTQKEIGDVRKSPLARRFGGRPSKG
Ga0257172_10370811 3300026482 Soil GSETRMQRERIQRTREIKTTRSAASLISSFKEVNKIFGAADIEFRLRYTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRQFVGSEGGASKKELSVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTQKEIGDVRKSPLARRFGGRPSKG
Ga0257165_10123281 3300026507 Soil VVIIPVDAHRLTMTISTPEIVEHPVRVGRETRMQRERIQRTREIKTIRSAASLISSFKEVNKIFGAADIEFRLQHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTKKEIGDVRKSPLARQFGGRPSKG
Ga0257158_10477601 3300026515 Soil NLVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTTRSAASLISSFKEVNKIFGAADIEFRLRYTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRQFVGSEGGASKKELSVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTQKEIGDVRKSPLARRFGGRPSKG
Ga0209056_106759411 3300026538 Soil NSCMENDVVIIPVDAHLLTLTISTPEIVERPVRVGRETRMQRERIQRMREIKTTRSAASLTESFKEVNKIFGAADIEFRLRHTTADPVEAPKGSEDLDDEGFLMLAKAFPMKNGVSLPLVHRFAGSEGGASKKELGVCAVGDAANDTSLAHEFGHLLWLEHQGDIRDLMNPGL
Ga0209648_101540612 3300026551 Grasslands Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTTRSAASLISSFKEVNKIFGAADIEFRLRYTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASKKELSVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPRLAVPGAPLTRKEIGDVRKSPFARRFGGRPSKG
Ga0208575_10029532 3300026920 Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETHMQRERIQRTREIKTTRSAASLISSFKEVNKIFGAADIEFRLRYTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASKKELSVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTRKEIGDVRKSPFARQFGGRPSKGSR
Ga0208984_10510991 3300027546 Forest Soil LRHDYRANVRRKEGDLVIIPVDAHLLTLTISTPDIVERPVRVGRETHMQRERVMRSRDIPTTRDAASLASSFKEVNKIFGAADIEFRLQNTTTDPIEAPSGSEAVDDRDFLMLASRFPMKNAVSLLLLHRFKGAEGGASVEKLGVCAVDDNSPDTALAHEFGHLLGLEHQGDIRDLMNAGLSPPGTPLTAREISAARASALAKRFAPRSSP
Ga0209388_11216611 3300027655 Vadose Zone Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTIRSAASLISSFKEVNKIFGAADIEFRLQHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTKKEIGNVRKSPLARQFGGHPSKG
Ga0209588_10059323 3300027671 Vadose Zone Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTIRSAASLISSFKEVNKIFGAADIEFRLQHTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTKKEIGDVRK
Ga0209180_100096564 3300027846 Vadose Zone Soil VVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTTRSAASLISSFKEVNKIFGAADIEFRLRYTTSDLVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASKKELRVCAVGDDANDTSLAHEFGHLLWLDHQGDIRDLMNPGLAVPGAPLTRKEIGDVRKSPLARQFGGRPSKG
Ga0209590_100973232 3300027882 Vadose Zone Soil MESDVVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFTAVNKIFRAADIEFRLRNTSSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELGVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARASPLARRFGASR
Ga0209069_100086163 3300027915 Watersheds VVIIPVDAHLLTLTISTPEIVERPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFMEVNKIFSAADIEFRLRKTTSDSVEAPKGSEALDDNGFYMLAAGFPMNDAVSLLLVRRFAGSEGGASVEKLGVCAVGDSSPDTALAHEFGHLLGLEHQGDIRDLMNPGLSAPGTPLTSSEITDVRASRLFQRFGGRPSEGSH
Ga0137415_100658315 3300028536 Vadose Zone Soil VVIIPVDAHLLTLTISKFEIIERPVRVGSTAHMQRERVQRKSEIQTQRTAATLMVSFMAVNKIFRAADIEFRLRNTTSESAEAPKGSEALDDEGFFMLASGFPMNNNAVSLLLVRKFAGSEGGASAEELSVCAVGDSSPDTALAHEFGHLLSLEHQGDIRDLMNPGLSAPGTPLTRSEIAAARASPLARRFGATR
Ga0257175_10091271 3300028673 Soil MESDVVIIPVDAHLLTLTISTPEIVEHPVRVGRETRMQRERIQRTREIKTTRSAASLISSFKEVNKIFGAADIEFRLRYTTSDPVEAPKGSEDLDDEGFLMLAKDFPMKNAVSLLLVRRFVGSEGGASAKELGVCAVGDDANDTSLAHEFGHLLGLDHQGDIRDLMNPGLAVPGAPL
Ga0307504_100804712 3300028792 Soil VVIIPVDAHLLTLTISTPQIVERPVRVGRETRMQRERIQTTREIKTQRSAASLTLSFVEVNKIFGAADIEFRLRNTTPESVEAPKGSEALDDEGFLMLAAGFPMNNAVSLLLVRKFAGSEGGASKEKLGVCAIGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLS
Ga0307312_105005671 3300028828 Soil PRNPNFSEPSAAAMQTAKAQRPANASPETRMESDVVVIPVDAHLLTLTISTPEIVVRPVRVGRETRMQRERIQRTREIQTQRSAASLTLSFIEVNKIFGAADIEFRLRKTTPASVEAPKASEALDDEGFLMLAAGFPMNNAVSLLLVRRFAGSEGGASKEKLGVCAIGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSAPGTPLTSSEIAGVRASWLFQRFGGRPSKDSR
Ga0310686_1034573241 3300031708 Soil QGHTSHVQHERIPCQQAFATNRNIASLAALLLEANKAFSAADIEFRLRNTTSDSVEAPKESEALDDEGFYVLAAKFPMNNAVSLLLVHRFAGAEGGASAEKLGVCAIPDDAPATALAHEFGHLLGLAHQGDIRDLMNPGLSPPGTALTPREIAAARASALALRFGGPPPDQKGGGS
Ga0308175_1024825031 3300031938 Soil DAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPKGLEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSQTAN
Ga0308174_102962961 3300031939 Soil DAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPKGLEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN
Ga0308176_103016791 3300031996 Soil RHCRYQSMERHVVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPKGLEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSQTAN
Ga0308173_104311932 3300032074 Soil VVTILVDAHLLTLTISTPEIVQRAVRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPKGLEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSQTAN
Ga0308173_109961421 3300032074 Soil VVTILVDAHLLTLTISTPDIVQRPIRVGRETRMQRERVMRTREIPTTRHAGSLMASFTEVNKIFAAADIEFKLRNWAKEPIEAPNGSEKLDDDGFLMLANSFPMKTAVNLLLVHRFQGSEGGASVEKLGVCAVGDDSSDNAIAHEFGHLLGLQHQGDIRDLMNKGLSAPGTPLTAREIADARASRLFKLFSSQQSPSAN
Ga0307510_101348642 3300033180 Ectomycorrhiza VVVIPVDAHLLTLTISTPEIVMRPVRVGRETRMQRERIQRTREIKTQRSAASLTLSFTEVNKIFGAADIEFRLRNTTSESVEAPKGSETLDDEGFLMLAAGFPMNNAVSLLLVRRFAGSEGGASKEKLGVCAIGDGSPDTALAHEFGHLLGLEHQGDIRDLMNRGLSPPGTPLTSSEITDVRASRLFQRFGGRPSEGSR


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice