NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F060679

Metagenome / Metatranscriptome Family F060679

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F060679
Family Type Metagenome / Metatranscriptome
Number of Sequences 132
Average Sequence Length 183 residues
Representative Sequence GDSPADAWARYTASPESGTGRVSVPPVLTFVHRADVPKAPETVTQSALQQQAQLLDAHRRVEERLGVVQREFAESKREVDASLAAARAEMQTALSSVAEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVVTASQELSASSAKLEETMRQLSASLTGQLKELANRLDTIQGKVSNLK
Number of Associated Samples 122
Number of Associated Scaffolds 132

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 1.27 %
% of genes near scaffold ends (potentially truncated) 59.85 %
% of genes from short scaffolds (< 2000 bps) 53.79 %
Associated GOLD sequencing projects 114
AlphaFold2 3D model prediction Yes
3D model pTM-score0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (59.091 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(10.606 % of family members)
Environment Ontology (ENVO) Unclassified
(34.091 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(37.121 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Mixed Signal Peptide: No Secondary Structure distribution: α-helix: 71.50%    β-sheet: 0.00%    Coil/Unstructured: 28.50%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.41
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 132 Family Scaffolds
PF00005ABC_tran 9.09
PF00664ABC_membrane 5.30
PF08598Sds3 0.76
PF00873ACR_tran 0.76



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms59.85 %
UnclassifiedrootN/A40.15 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000891|JGI10214J12806_10208411All Organisms → cellular organisms → Bacteria1519Open in IMG/M
3300000956|JGI10216J12902_117587425All Organisms → cellular organisms → Bacteria595Open in IMG/M
3300002886|JGI25612J43240_1009368All Organisms → cellular organisms → Bacteria1425Open in IMG/M
3300002886|JGI25612J43240_1022387All Organisms → cellular organisms → Bacteria933Open in IMG/M
3300002907|JGI25613J43889_10222629All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus influenzae509Open in IMG/M
3300003994|Ga0055435_10046445All Organisms → cellular organisms → Bacteria1034Open in IMG/M
3300004025|Ga0055433_10042993All Organisms → cellular organisms → Bacteria892Open in IMG/M
3300004062|Ga0055500_10040430All Organisms → cellular organisms → Bacteria906Open in IMG/M
3300005295|Ga0065707_10269042All Organisms → cellular organisms → Bacteria1076Open in IMG/M
3300005328|Ga0070676_10508533All Organisms → cellular organisms → Bacteria856Open in IMG/M
3300005334|Ga0068869_100009502All Organisms → cellular organisms → Bacteria6313Open in IMG/M
3300005343|Ga0070687_101033679All Organisms → cellular organisms → Bacteria597Open in IMG/M
3300005438|Ga0070701_10067713All Organisms → cellular organisms → Bacteria1900Open in IMG/M
3300005440|Ga0070705_101329538All Organisms → cellular organisms → Bacteria597Open in IMG/M
3300005459|Ga0068867_100758324All Organisms → cellular organisms → Bacteria862Open in IMG/M
3300005546|Ga0070696_100584203All Organisms → cellular organisms → Bacteria899Open in IMG/M
3300005549|Ga0070704_101183383All Organisms → cellular organisms → Bacteria697Open in IMG/M
3300005719|Ga0068861_100286510All Organisms → cellular organisms → Bacteria1421Open in IMG/M
3300006041|Ga0075023_100022675All Organisms → cellular organisms → Bacteria1767Open in IMG/M
3300006844|Ga0075428_101240542All Organisms → cellular organisms → Bacteria785Open in IMG/M
3300006845|Ga0075421_100270393All Organisms → cellular organisms → Bacteria2077Open in IMG/M
3300007004|Ga0079218_13871816All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Paracoccus → unclassified Paracoccus → Paracoccus sp. 228509Open in IMG/M
3300009090|Ga0099827_10122677All Organisms → cellular organisms → Bacteria2095Open in IMG/M
3300009090|Ga0099827_10504182All Organisms → cellular organisms → Bacteria1040Open in IMG/M
3300009147|Ga0114129_11086090All Organisms → cellular organisms → Bacteria1003Open in IMG/M
3300009553|Ga0105249_10582166All Organisms → cellular organisms → Bacteria1172Open in IMG/M
3300009795|Ga0105059_1006535All Organisms → cellular organisms → Bacteria1049Open in IMG/M
3300009803|Ga0105065_1040474All Organisms → cellular organisms → Bacteria631Open in IMG/M
3300009812|Ga0105067_1007148All Organisms → cellular organisms → Bacteria1370Open in IMG/M
3300009821|Ga0105064_1094004All Organisms → cellular organisms → Bacteria609Open in IMG/M
3300010396|Ga0134126_10922340All Organisms → cellular organisms → Bacteria981Open in IMG/M
3300010399|Ga0134127_10292625All Organisms → cellular organisms → Bacteria1564Open in IMG/M
3300010403|Ga0134123_10775028All Organisms → cellular organisms → Bacteria949Open in IMG/M
3300011271|Ga0137393_10167926All Organisms → cellular organisms → Bacteria1841Open in IMG/M
3300011406|Ga0137454_1006659All Organisms → cellular organisms → Bacteria1232Open in IMG/M
3300012040|Ga0137461_1029173All Organisms → cellular organisms → Bacteria1433Open in IMG/M
3300012202|Ga0137363_10105812All Organisms → cellular organisms → Bacteria2147Open in IMG/M
3300012203|Ga0137399_10049520All Organisms → cellular organisms → Bacteria3040Open in IMG/M
3300012349|Ga0137387_11250494All Organisms → cellular organisms → Bacteria522Open in IMG/M
3300012360|Ga0137375_10055164All Organisms → cellular organisms → Bacteria4300Open in IMG/M
3300012685|Ga0137397_10335085All Organisms → cellular organisms → Bacteria1127Open in IMG/M
3300012927|Ga0137416_10233063All Organisms → Viruses → Predicted Viral1495Open in IMG/M
3300012930|Ga0137407_10367095All Organisms → cellular organisms → Bacteria1328Open in IMG/M
3300012931|Ga0153915_10440542All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1481Open in IMG/M
3300012931|Ga0153915_10501812All Organisms → cellular organisms → Bacteria1387Open in IMG/M
3300013306|Ga0163162_10728464All Organisms → cellular organisms → Bacteria1112Open in IMG/M
3300014884|Ga0180104_1003621All Organisms → cellular organisms → Bacteria3272Open in IMG/M
3300014968|Ga0157379_11104146All Organisms → cellular organisms → Bacteria760Open in IMG/M
3300018053|Ga0184626_10065508All Organisms → cellular organisms → Bacteria1528Open in IMG/M
3300018053|Ga0184626_10078125All Organisms → cellular organisms → Bacteria1397Open in IMG/M
3300018063|Ga0184637_10612927All Organisms → cellular organisms → Bacteria613Open in IMG/M
3300018071|Ga0184618_10050839All Organisms → cellular organisms → Bacteria1506Open in IMG/M
3300018078|Ga0184612_10073296All Organisms → cellular organisms → Bacteria1786Open in IMG/M
3300018422|Ga0190265_11527287All Organisms → cellular organisms → Bacteria781Open in IMG/M
3300019255|Ga0184643_1362906All Organisms → cellular organisms → Bacteria861Open in IMG/M
3300019879|Ga0193723_1059660All Organisms → cellular organisms → Bacteria1112Open in IMG/M
3300022534|Ga0224452_1098513All Organisms → cellular organisms → Bacteria892Open in IMG/M
3300022534|Ga0224452_1100988All Organisms → cellular organisms → Bacteria882Open in IMG/M
3300022694|Ga0222623_10142078All Organisms → cellular organisms → Bacteria935Open in IMG/M
3300025580|Ga0210138_1020241All Organisms → cellular organisms → Bacteria1399Open in IMG/M
3300025900|Ga0207710_10576662All Organisms → cellular organisms → Bacteria587Open in IMG/M
3300025904|Ga0207647_10394913All Organisms → cellular organisms → Bacteria780Open in IMG/M
3300025910|Ga0207684_10699718All Organisms → cellular organisms → Bacteria861Open in IMG/M
3300025918|Ga0207662_11135335All Organisms → cellular organisms → Bacteria556Open in IMG/M
3300025965|Ga0210090_1020501All Organisms → cellular organisms → Bacteria903Open in IMG/M
3300026095|Ga0207676_11076048All Organisms → cellular organisms → Bacteria794Open in IMG/M
3300026285|Ga0209438_1005767All Organisms → cellular organisms → Bacteria4205Open in IMG/M
3300026480|Ga0257177_1014598All Organisms → cellular organisms → Bacteria1071Open in IMG/M
3300026507|Ga0257165_1005627All Organisms → cellular organisms → Bacteria1809Open in IMG/M
3300027645|Ga0209117_1029734All Organisms → cellular organisms → Bacteria1712Open in IMG/M
3300027949|Ga0209860_1015859All Organisms → cellular organisms → Bacteria1009Open in IMG/M
3300028381|Ga0268264_10232465All Organisms → cellular organisms → Bacteria1703Open in IMG/M
3300028884|Ga0307308_10373514All Organisms → cellular organisms → Bacteria683Open in IMG/M
3300031820|Ga0307473_10448802All Organisms → cellular organisms → Bacteria858Open in IMG/M
3300032174|Ga0307470_10650514All Organisms → cellular organisms → Bacteria796Open in IMG/M
3300032180|Ga0307471_100585577All Organisms → cellular organisms → Bacteria1273Open in IMG/M
3300032180|Ga0307471_101123862All Organisms → cellular organisms → Bacteria951Open in IMG/M
3300033486|Ga0316624_10573433All Organisms → cellular organisms → Bacteria977Open in IMG/M
3300033513|Ga0316628_103816396All Organisms → cellular organisms → Bacteria540Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil10.61%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment7.58%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.82%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands6.06%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand6.06%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil4.55%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.55%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil3.79%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil3.79%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.79%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment3.03%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.03%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere3.03%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment2.27%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil2.27%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere2.27%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere2.27%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere2.27%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands1.52%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment1.52%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere1.52%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.52%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.52%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere1.52%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.52%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.52%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.52%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.76%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.76%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.76%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil0.76%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil0.76%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.76%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.76%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.76%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.76%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.76%
Switchgrass, Maize And Mischanthus LitterEngineered → Solid Waste → Grass → Composting → Unclassified → Switchgrass, Maize And Mischanthus Litter0.76%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2170459021Litter degradation NP4EngineeredOpen in IMG/M
3300000891Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300002886Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cmEnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300003994Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004025Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004062Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300005183Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1EnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005328Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaGHost-AssociatedOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005343Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaGEnvironmentalOpen in IMG/M
3300005438Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005543Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaGHost-AssociatedOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005719Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2Host-AssociatedOpen in IMG/M
3300005876Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_401EnvironmentalOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300007004Agricultural soil microbial communities from Utah to study Nitrogen management - NC CompostEnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300009795Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50EnvironmentalOpen in IMG/M
3300009803Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50EnvironmentalOpen in IMG/M
3300009808Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50EnvironmentalOpen in IMG/M
3300009812Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60EnvironmentalOpen in IMG/M
3300009816Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10EnvironmentalOpen in IMG/M
3300009821Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011406Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT539_2EnvironmentalOpen in IMG/M
3300011419Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT357_2EnvironmentalOpen in IMG/M
3300011436Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT642_2EnvironmentalOpen in IMG/M
3300011441Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT513_2EnvironmentalOpen in IMG/M
3300012040Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT746_2EnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017994Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2EnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019255Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019999Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a1EnvironmentalOpen in IMG/M
3300020063Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT730_16_1Ra (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025551Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025580Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025900Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025904Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025907Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025918Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025965Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300026095Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026361Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-BEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026469Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-BEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300027273Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027949Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M
3300033500Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF7AN SIP fractionEnvironmentalOpen in IMG/M
3300033501Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF12FN SIP fractionEnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300033813Sediment microbial communities from East River floodplain, Colorado, United States - 30_j17EnvironmentalOpen in IMG/M
3300034257Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
4NP_003893502170459021Switchgrass, Maize And Mischanthus LitterYTPPSANPGMSGALAFLDAADGRLVVLPGDSPADAWSRYTTSPESGPGGVSVPPVLTFVHRADVAKAPETVTRSGLQEQRAVRTSLAALETQVGIVQREVAESIAATKVEADARADIQKALASLSEDLAAVRKFMLQTAQLGWLNQELNVENASEIRKVAVASQELSASSARLEESL
JGI10214J12806_1020841113300000891SoilAPHAAAIHAAIDRSGNVGALAFLDAKDGRLVVLPGDNPSDAWSRHARSPESGTAPVSVPPVLTFVHRADVPKAPETVTRSALQQQQAVAALEPQVRRLEEQLGLVQRDLAESVAAAKRETDARAAMQTALNSLSEDLAAVRKFVLQTAQLGWLNHELVVENASSVRKAATASQELTASSERLEETLRQLSKSLAGQLKELASRLDTIQGKVSSLK*
JGI10216J12902_11758742513300000956SoilDTRDRALVPHAAAIQAAIRQSGNVGALAFLDAKDGSLVVLPGDSPADAWARYATSPESATGRVSVPAVVTFVYRADVPKAPETVTQNLLQQQQAFRRSLAALETELRDADRNTEQRLGIVQRELAESIAATKQETDRSLALVRADVQKALSSLAEELDSARKFMLQTAQLGWLNHELNVENAGGIRKVAAASQELTAN
JGI25612J43240_100936833300002886Grasslands SoilPESGTGRVSLPPVLTFVHRADVPQAPETVTLSVLQQQAQLLDAHRRVAERLDLVQGELAASKREVDASLAAARAEMQTALSSVAEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVVTASQELSASSAKLEETMRQLSGSLAGQLKELSSRLDSIQGKVSNLK*
JGI25612J43240_102238713300002886Grasslands SoilIRQSGHAGALALLDAQDGRLVVLPGDSPADAWARYTASPESGTGRVSLPPVLTFVHRADVPQAPETVTLSVLQQQAQLLDAHRRVAERLDLVQGELAASKREVDASLAAARAEMQTALSSVAEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVVTASQELSASSAKLEETMRQLSGSLAGQLKELSSRLDSIQGKVSNLK*
JGI25613J43889_1022262913300002907Grasslands SoilVLTFVHRADVPQAPETVTLSVLQQQAQLLDAHRRVAERLDLVQGELAASKREVDASLAAARAEMQTALSSVAEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVVTASQELSASSAKLEETMRQLSGSLAGQLKELSSRLDSIQGKVSNLK*
Ga0055435_1004644513300003994Natural And Restored WetlandsTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRSALQQQQALAVLETELRHIEAQLGIVQRELAESIAAAKREAVARADMQTALSSLHEDLATVRKFMLQTAQLGWLNHELVVENASSMRKVATASQEMSASSERLEETMRQLSKSLAGQLKDLANRLDTIQGKVSSLK*
Ga0055437_1001588533300004009Natural And Restored WetlandsLDAQDGRLVVLPGDSPADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRSALQQQQALAVLETELRHIEAQLGIVQRELAESIAAAKREAVARADMQTALSSLHEDLATVRKFMLQTAQLGWLNHELVVENASSMRKVATASQELSASSERLEETMRQLSKSLAGQLKDLANRLDTIQGKVSSLK*
Ga0055433_1004299313300004025Natural And Restored WetlandsYTTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRSALQQQQALAVLETELRHIEAQLGIVQRELAESIAAAKREAVARADMQTALSSLHEDLATVRKFMLQTAQLGWLNHELVVENASSMRKVATASQEMSASSERLEETMRQLSKSLAGQLKDLANRLDTIQGKVSSLK*
Ga0055500_1004043013300004062Natural And Restored WetlandsAWARYTTSPESETGRVSVPPVLTFVHRADVPKTPETVTRSVLQQQQAVAALATELRDAHRRVEEQLGIVQGELADSIAATKQEAAARADMQTALTSLSEELATVRKFMLQTAQLGWLNHELNVENASGIGKMATASQELSASSERLEETLRQLSKSLAAQLKELANRLDTIQGKVSSLK*
Ga0062593_10302356113300004114SoilRALAPHAAAIHAAIGQSGKAGALAFLDARDGRLIVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHRADVPKAPETVTQGVLQQQAQLRDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASS
Ga0068993_1000230443300005183Natural And Restored WetlandsLAFLDAQDGRLVVLPGDSPADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRSALQQQQALAVLETELRHIEAQLGIVQRELAESIAAAKREAVARADMQTALSSLHEDLATVRKFMLQTAQLGWLNHELVVENASSMRKVATASQEMSASSERLEETMRQLSKSLAGQLKDLANRLDTIQGKVSSLK*
Ga0065707_1026904223300005295Switchgrass RhizosphereAIHAAIDQSGNVGALAFLDAKDGRLVVLPGDSPADAWSRHARSPESGTGPVSVPPVLTFVHRADVPKAPETVIRSALQQQQAVAALEPQVRRLEEQLGVVQRDLAESVAVAKRETDARAAMQTALSSLSEDLAAVRKFVLQTAQLGWLNHELVVENASSVRKAATASQELTASSERLEETLRQLSKSLAGQLKELANRLDTIQGKVSSLK*
Ga0065707_1049481123300005295Switchgrass RhizosphereAIGQSGKAGALAFLDARDGRLIVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHRADVPKAPETVTRVVLQQQAQLGDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASSAKLEETLRQLSESLAGQLKELASRLDTIRGKVSSLK*
Ga0070676_1050853323300005328Miscanthus RhizospherePVLTFVHRADVPKAPETVTRVVLQQQAQLGDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASSAKLEETLRQLSESLAGQLKELASRLDTIRGKVSSLK*
Ga0068869_10000950213300005334Miscanthus RhizosphereAFLDARDGRLIVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHRADVPKAPETVTRVVLQQQAQLGDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASSAKLEETLRQLSESLAGQLKELASRLDTIRGKVSSLK*
Ga0070687_10103367913300005343Switchgrass RhizosphereGTIGALAFLDAKDGTLVVLPGDSPADAWARYATSPESGTGRVSVPAVLTFVYRADVLKAPELVTQNLLQQQQAFRRSLTAFETELRDADRRTEQRLGIVQREQRELAEAIAATKQETDRSLAVVRADVQTALGSLAEELDSARKFMLQTAQLGWLNHELNVENAGGIRRVATASQELAANSARLADTMRQLSENLAGQ
Ga0070701_1006771333300005438Corn, Switchgrass And Miscanthus RhizosphereAIQAAIRQSGTIGALAFLDAKDGTLVVLPGDSPADAWARYATSPESGTGRVSVPAVLTFVYRADVLKAPELVTQNLLQQQQAFRRSLTAFETELRDADRRTEQRLGIVQREQRELAEAIAATKQETDRSLAVVRADVQTALGSLAEELDSARKFMLQTAQLGWLNHELNVENAGGIRRVATASQELAANSARLADTMRQLSENLAGQLKELAGRLDAIQGLVTNAK*
Ga0070705_10132953813300005440Corn, Switchgrass And Miscanthus RhizosphereLAFLDAKDGRLVVLPGDSPADAWARYATSPESGTGRVSVPAVLTFVYRADVPKAPETVTHTLLQQQQAFRRSLTALETELRDADQRTEQRLGIVQRELAESIAATKQETDKALTAARADMQKELSSLAEDLAAARKFMLQTAQLGWLNHELNVENANGIRKVATASQELTASSARLEDTMRQLSDSLASQLKELANRLD
Ga0070708_10218349213300005445Corn, Switchgrass And Miscanthus RhizosphereDAWAGYTTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRGVLQEQQALRTAVAALETQLGIVQRELAESIGATKREAAARADMQTALSSLSEDLAAVRKFMLQTAQLGWLNHELNVENATGIRKVATASQELSASAARLEETMHQLSESLAGQLKELANRLDTIQGKVS
Ga0068867_10075832413300005459Miscanthus RhizosphereLAFLDAVDGNLIVLPGDSPADAWARYATSPESGTGRVSVPAVVTFVYRADVPKAPETVTQNILQQQQAFRRSLAALETELRDADRSTEQRLGIVQRELAESSAATKQETERSLALVRADMQKALSSLAEELDSARKFMLQTAQLGWLNHELNVENASGIRKVAAASQELTANSARLADTMRQLSESLASQLKELANRLDAIQGLVTNIK*
Ga0070699_10140376113300005518Corn, Switchgrass And Miscanthus RhizosphereSGTGRVSVPPVLTFVHRADVPKAPETVTRGVLQQQQALRTAVAALETQLGIVQRELAESIGATKREAAARADMQTALSSLSEDLAAVRKFMLQTAQLGWLNHELNVENATGIRKVATASQELSASAARLEETMHQLSESLAGQLKELANRLDTIQGKVSSLK*
Ga0070672_10201947413300005543Miscanthus RhizosphereFLDARDGRLIVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHRADVPKAPETVTQGVLQQQAQLRDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASSAKLEETLRQLSESLAGQL
Ga0070695_10185773013300005545Corn, Switchgrass And Miscanthus RhizosphereWSRHARSPESGTAPVSVPPVLTFVHRADVPKAPETVTRSALQQQQAVAALEPQVRRLEEQLGVVQRDLAESVAAAKRETDARAAMQTALNSLSEDLAAVRKFVLQTAQLGWLNHELVVENASSVRKAATASQELTASSERLEETLRQLSKSLAGQLKELANRLDTIQG
Ga0070696_10058420323300005546Corn, Switchgrass And Miscanthus RhizospherePPVLTFVHRADVPKAPETVTTSALQQQGALRSSLAALETALRDEHRTLEERLGTVQRELAESKQAADASLAAARADLQTSLSSLAEDLAAVRKFMLQTAQLGWLNHELNVENATGIRKVATASQELSASSARLEETMRQLSGTLAGQLKELAKRLDTIQGKVNKLK*
Ga0070696_10188247313300005546Corn, Switchgrass And Miscanthus RhizospherePVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEERLSMVQSELAESKREADASLTGARADMQAALSSLAEDLAAVRKFMLQTAQLGSLNHELNVETETGLRKVATASQELSASSARLEDIMRELPERLAGQLKELANRLDTIQVRSVASSEADDARRRRRETMSAYPD
Ga0070704_10118338313300005549Corn, Switchgrass And Miscanthus RhizosphereGALIALPGDSPADAWARYTTSPDSGTGRVSVPAVVTFVYRADVPKAPETVTQTLLQQQQAFRKSLAALETELRDADRNTEQRLGSIQRELAESIGTTRQETERSLALVRADVQKALSSLAEDLDSARKFMLQTAQLGWLNHELNVENASGIRKVAAASQELTANSARLADTMRQLSESLASQLKELANRLDTIQGLVSTVK*
Ga0068861_10028651013300005719Switchgrass RhizosphereARDGRLIVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHRADVPKAPETVTRVVLQQQAQLGDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASSAKLEETVRQLSESLAGQLKELASRLDTIRGKVSSLK*
Ga0075300_107571213300005876Rice Paddy SoilVHRADVPKAPETVTRSVLQQQQALRTSVAALETQVGIVQRELAESIAATKVEADARADIQKALTSLSEDLAAVRKFMLQTAQLGWLNQDLNVENASEIRKVAAASQELSASSARLEESLRQLSGSLAGQLKELAHRLDTIHGKVNSPK*
Ga0075023_10002267533300006041WatershedsNVGALAFLDAKDGRLIVLPGDSSADAWSRYIASPEGETGRVSVPPVLTFVHRADVPKAPETIPRSVLQQEAQLRDAWRRIEERLSIVQSDLAESKREADASLTGARADMHAALSSLAEDLAAVRKFMLQTAQLGWLNHELTVENETGIRKVATASQELSASSARLEDLMRELPKRLAGQLEELANRLDSIQGKVSSLK*
Ga0075428_10124054223300006844Populus RhizosphereIRQSGNVGALAFLDARDGRLIVLPGDSPGDAWARYNASPESGTGRVSVPAVVTFLHRADIPEAPETVTRSLLQEQQALRTSVAALETELRDGHRRLAERLDSVQRELAESIAATKQETDTSLAATRADMQAALSSLAEDLGTVRKVVLQTAQLGWLNHELNVENASGLRKVATASQELSASSARLEEVLRQLSESLTGQLKELAKRLDSIQSKVSSLK*
Ga0075421_10027039333300006845Populus RhizosphereAAAIHAAISQSGNVGALAFLDAKDGRLVVLPGDSPADAWSRHTTSPESGTGPVSVPPVLTFVHRADVPKAPETMTRSALQQQQAVAALEPQVRRLEEQLGVVQRDLAESIAAAKRETDARAAMQTALSSLSEDLAAVRKFMLQTAQLGWLNHELVVENASSVRKAATASQELTASSERLEETLRQLSKSLAGQLKELANRLDTIQGKVSSLK*
Ga0075431_10115200013300006847Populus RhizosphereVLPGDSPADAWSRHTTSPESGTGPVSVPPVLTFVHRADVPKAPETMTRSALQQQQAVAALEPQVRRLEEQLGVVQRDLAESIAAAKRETDARAAMQTALSSLSEDLAAVRKFMLQTAQLGWLNHELVVENASSVRKAATASQELTASSERLEETLRQLSKSLAGQLKELANRLDTIQGKVSSLK*
Ga0079218_1387181613300007004Agricultural SoilHTAPPESGTGRVSAPPVLTYVHRADIPKAPETITRSVLQQEGALRVSVAALQSELSEAHRRIEQRLGLVQRELAESIAAAKREADTSAATARADLQKALSSFSEDLGAVRKFMLQTAQLGWLNQELIVENANAVRKVATASQELAASSAKLEETMRVLSETLAGQLKQL
Ga0099791_1062668313300007255Vadose Zone SoilQSGTAGALTFLDAQDRRLVVLPGDDPADAWARYTRSPESTTGGVSVPAVVTFVHRADVPKAPETVTSSVLQEQYVLKTSVAALETELRDAVRRVEERLGIVQRELAESIAATKQETDASLGTARADIQRTLSSLAGDLAAVRQFMLQTAQLGWLNHELIVENASGMRKVATASQE
Ga0099829_1047394723300009038Vadose Zone SoilVVLPGDSPADAWARYATSPESGTGRVSMPAVVSFVYRADVPKAPETVTHSVLQQQQALRTSLAALETELHDAHRRTEQRLDIVQRELAESIAATKQETDRSLAVARADVQKALSSLAEDLDSARKFMLRTAQLGWLNHELNVENASGIRKVATASQELTANSARLADTMRQLSESLARQLKELANRLDTIQGMVSNTK*
Ga0099827_1012267733300009090Vadose Zone SoilSVPPVLTFVHRVDVPKAPETVTQSVLQQQAQLLDAHRRVAERLDIVQREFGESKREVDASLAAARAEMQTALSSFAEDLTTVRKFVLQTAQLGWLNHELTVENATGIRKVVTTSQELSASSAKLEETMRQLSASLAGQLKELANRLDTIQGKVSSLK*
Ga0099827_1050418213300009090Vadose Zone SoilRLVVLPGDDPADAWARYTRSPQSTTGGVSVPAVVTFVHRADVPKAPETVTSSVLQQQQVLKTSVAALETELRDAVRRVEERLGIVQRELAESIAATKQETDASLGTARADIQRTLSSLAGDLAAVRQFMLQTAQLGWLNHELIVENASGIRKVATASQELAASSAKLEETMRQLSETLAGQLKELAHRLDTIQGKVSSLK*
Ga0114129_1108609013300009147Populus RhizosphereAGTVGALAFRDAKDGHLVVLPGDTPADAWARYTASPESRTGRVSVPVVVVFVHRADVPKAPEAVTQSALQQEQALRTSVAALETELRDAYRRTEERLSLAQRELAESIAATKQETDKALAAARADMQKELSSLAEDLAAARKFMLQTAQLGWLNHELNVENASGIRKVATASQELTASSARLEDTMRQLSDSLASQLKELANRLDTIQAKASSLK*
Ga0105242_1305480913300009176Miscanthus RhizospherePEGETGRASAPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEERLSIVQSELAESKREADASLTGARADMQAALSSLAEDLAAVRKFMLQTAQLGSLNHELNVETETGLRKVATASQELSASSARLEEIMRELPERLAGQLKELANRLDTIRGKVSSLK*
Ga0105249_1058216623300009553Switchgrass RhizospherePAEAWSRYIASPEGETGRASAPPVLTFVHRADVPKAPETVTQGVLQQQAQLRDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASSAKLEETLRQLSESLAGQLKELASRLDTIRGKVSSLK*
Ga0105059_100653523300009795Groundwater SandRSVPPVLTFVHRADIPKAPETVTRSVLQQQQAQFLDAHRRIEERLGIVQRELAESNREAAALQTALRSLSEELAAVRKFMLQTAQLGWLNHELNVENATGSRKGATASQELSASSARLEETMRQLSESLAGQLKELATRLDTIQGKVSNLK*
Ga0105065_104047413300009803Groundwater SandAIRQSGNAGALAFLDAKDGRLVVLPGDSSADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPKAPEPVTRSVLQQQQALAALGTELRDAHRRIEEQLVIVQRELAESNREAAALQTALRSLSEELAAVRKFMLQTAQLGWLNHELNVENASDIRKVATASQELSASSARLEETMRQLSESLAGQLKELADRLDTIQSKVGSLK*
Ga0105071_104187423300009808Groundwater SandDAWSRYIASPEGETGRVSVPPVLTFVHRADIPKAPETVTRSVLQQQQAQFLDAHRRIEERLGIVERELAESIAATKREAAARADMQTALRSLAEDLAAVRKFMLQTAQLGWLNHELTVENASGIRKLATAGQELSASSARLEETMRQLSGSLAGQLKELANRLDTIQGKVSSLK*
Ga0105067_100714833300009812Groundwater SandHADAIHAAIGQSGTVGALAFLDARDGGLVVLPGDSPGDAWARYTALPESGRGRRSVPPVLTFVHRADVPKAPETVTRGVLQQQAQLRDAYRRFEEQLGTVQRELAESIAATKREAVARADMQTALTSLSEELATVRKFMLQTAQLGWLNHELNVENASDIRKVATASQELSASSARLEETMRHLSGNLAGQLKELATRLDTIQGKVSSLK*
Ga0105076_108834923300009816Groundwater SandVSVPPVLTFVHRADVPKAPEPVTRSVLQQQQALAALGTELRDAHRRIEEQLVIVQRELAESNREAAALQTALRSLSEELAAVRKFMLQTAQLGWLNHELVVENASSMRKVATASLELSASSERLEETMRQLSKSLAGQLKELANRLDTIQGKVSSLK*
Ga0105064_109400413300009821Groundwater SandRALASHADAIHSTIRQSGNVGALAFLDAKDGRLIVLPGDSSADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPRAPETVTRSVLQQQQALRTSVAALETQLGIVQRELAESNAATKGAADARADMQKALSSLSEDLAAVRKFMLQTAQLGWLNHELNVENATGIRKVATASQELSASSARLEDTLRQLSESLGSQLKELAAR
Ga0134126_1092234023300010396Terrestrial SoilYTMSPENGTGRVSMPPVLTFVYRADVPKAPETIPRSVLQQQAQLRDAWRRIEERLSIVQSELAESKREADASLTGARADMYAALSSLAEDLAAVRKFILQTAQLGSLNHELNVETETGLRKVATASQELSASSARLEEIMRELPERLAGQLKELANRLDTIRGKVSSLK*
Ga0134124_1296061813300010397Terrestrial SoilKVGALVFLDATDSHLVVLPGESPADAWARYTMLPEHGTGRVSMPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEERLSIVQSELAESKREADASLTGARADMHAALSSLAEDLDSARKFMLQTAQLGWLNHELNVENASGIRRVATASQELTANSARLADAMRQL
Ga0134127_1029262513300010399Terrestrial SoilAFLDAKDGTLVVLPGDSPADAWARYATSPESGTGRVSVPAVLTFVYRADVLKAPELVTQNLLQQQQAFRRSLTAFETELRDADRRTEQRLGIVQREQRELAEAIAATKQETDRSLAVVRADVQTALGSLAEELDSARKFMLQTAQLGWLNHELNVENAGGIRRVATASQELAANSARLADTMRQLSENLAGQLKELAGRLDAIQNKIQNIK*
Ga0134122_1015863533300010400Terrestrial SoilLVVLPGDSPADAWARHIAAPESGAASAEPPPVLTFVHRADVPKAPELVTRTVLQQQHALRTSVAALETELREAHQRIEERFGLVQRELTDSITAAKQQADVSQAAARADIQKALSALSEDLAAARKFMLQTAQLGWLNHELTIENAGGIRKVAAASQELSASAAKLEETMRQLSETLAAQLKDLANRLDNIQGKVSSLK*
Ga0134123_1077502823300010403Terrestrial SoilADAWARYTASPESGTGRVSVPPVLTFVHRADVPKAPETVTQSGLQQQAQLLDAHRRVEERLDIVQREFAESKREVDASLAAARAEMQTALSSLAEDLTTVRKFVLQTAQLGWLNHELVVENASSVRKAATASQELTASSERLEETLRQLSKSLAGQLKELANRLDTIQGKVSSLK*
Ga0105246_1206271013300011119Miscanthus RhizosphereARDGRLVVLPGDSPADAWARHIAAPESGAASAEPPPVLTFVHRADVPKAPEVVTRTVLQQQHALRTSVAALETELREAHQRIEERFGLVQRELTDSITAAKQQADVSQAAARADIQKALSALSDDLAAARKFMLQTAQLGWLNHELTIENAGGIRKVAAASQELTANSARLADTMRQLSESLA
Ga0137393_1016792613300011271Vadose Zone SoilGGALAFLDAKDSHLVVLPGDSPADAWPRYTMLPENGTGRVSMPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEEQLSIVQSEFAESKREADASLTGARADMQAALSSLAEDLAAVRKFMLQTARLGWLNHELNVETETGLRKGATASQELSASSARLEEIMRELPKRLAGQLKELANRLDTIQGKVSSLK*
Ga0137454_100665923300011406SoilAPHAAAIHAAIGQSGNAGALAFLDAKDGRLIVLPGDSPADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRGALQQQAQLRDTYRRFEEQLGTAQRELTESIAATKREADARAAMQTALASLAEDLAAVRKFMLQTAQLGWLNHELNVENATAIRKMATASQELAASSVRLEETLRQLSESLAGQLKELANRLDTIQGKVSSFK*
Ga0137446_117947313300011419SoilDSRLVVLPGDSSADAWSRYTSSPESGTGRVSVPPVLTFVHRADVPKAPETITRSLLQQQQALAALETELRDAHRRIEGQLGIVQRELAESNREAAALQTALRSLAEELATVRKFMLQTAQLGWLNHELNVENATGIRKMATASQELSASSVRLEETMRQLSESLAGQLKE
Ga0137458_115574113300011436SoilIVLPGDSPADAWGRYTTSPESGTGLASVPPVLTFVHRADVPKAPETVTRGVLQQQAQLRDSYRRFEEQLGIVQRGLAESIAATKREADARADMQTALTGLSEELATVRKFMLQTAQLGWLNHELVVENATGIRKLGAASQELSASSVKLEETLRQVAESLAGQLKELANRLDTIQGKVSSLK*
Ga0137452_124985013300011441SoilGDSPADAWGRYTTSPESGTGLASVPPVLTFVHRADVPKAPDVVTRSALLQQQALRAAVATLESELRDAHRRIEERFAVVQRQLAESIAAAKQEADVSQSAARADVQRALGSLSEDLAAARKFMLQTAQLGWLNHDLTLENASGIRKVATASQELSASAAKLEETMRQLSETLAGQLKDLATRLDNIQGKVGSLK*
Ga0137461_102917333300012040SoilDAKDGRLVVLPGDSSADAWSRYTTSPESGTGRVSAPPVLTFVHRADVPKAPETITRSLLQQQQALAALETELRDAHRRIEEQLGIVQRELAESNREAAALQTAVRSLSEELATVRKFMLQTAQLGWLNHELTVENAIGIRRVVTTSQELSASSERLEETIRQLSKSLAAQLKELASRLDTIQGKVSSLK*
Ga0137363_1010581213300012202Vadose Zone SoilLDAKDGHLVVLPGDSPADAWARYAASPESGTSPVSVPVVVTFVYRTDVPKAPETVTQSALQQHQAQLTSMAAFASELRRIEERLGLVQRELAESIAATKRERADMQAALSSLSEDLATVRKFMLQTAQLGWLNHELTVENASGIRKMATASQELTASSAKLEDTLRQLSESLASQLKELAARLGAIQGKIQNLK*
Ga0137399_1004952033300012203Vadose Zone SoilLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEERLSIVQSELAESKREADASLTGARADVQAALSSLAEDLAAVRKFMLQTAQLGSLNHELNVETETGLRTVATASQELSASSARLEEIMRELPKRLAGQLKELANRLDTIQGKVSSLK*
Ga0137399_1088956023300012203Vadose Zone SoilSGHVGALVLLDAKDGRLVVLPGDSPADAWARYTASPESGAGRVSVPPVLTFVHRADVPKAPETVTQSVLQQQAQLLDAHRRVEERLDIVQREFAESKREVDASLAAARAEMQTALSSVAEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVVTTSQELSASSAKLEETMRQLSASLAGQLKELANRLDTIQGKVSNLK*
Ga0137387_1125049413300012349Vadose Zone SoilMGALAFLDAKDGHLVVLPGDSPADAWERYTASPENRTSPVSVPVVVTFVYRTDVPKAPETVTQSGLQQQQALRTSVAAELQRLEERLGLVQRDLAATKQDTDKVLADMRALAEDLAAVRKFMLQTAQLGWLNHE
Ga0137386_1126601313300012351Vadose Zone SoilFLDAKDGRLVVLPGDTPADAWARYTASPESGTGRVSVPVVVVFVHRADIPKAPEPVTQSALQQQQALRTSVAALETELRDAYRRTEERLSLVQRELAESVAATKQETDRSLAAARADMQKELSSLAQDLAAARKFMLQTAQLGWLNHELNVENASGIRKVATASQELTASS
Ga0137375_1005516443300012360Vadose Zone SoilWARYTASPESGTGRVSMPPVLTFVHRADVPKAPETVTLSVLQQQAQLLDAHRRVAERLDIVQRELAESKREVDASLAAARADMQTALSSVAEDLATVRKFMLQTAQLGWLNHELTVENATGIRKVVTVSQELSASSAKLEETMRQLSGSLAGQLKELASRLDTIQGKVSSLNK*
Ga0137397_1033508513300012685Vadose Zone SoilLTFVHRADVPKAPETVTQSVLQQQAQLLDAHRRVAERLDIVQREFAESKREVDASLAAARAEMQTALSSLAEDLTTVRKFALQTAQLGWLNHELTVENATGIRKVVTTSQELSASSAKLEETMRQLSASLAGQLKELANRLDTIQGKVSNLK*
Ga0137416_1023306313300012927Vadose Zone SoilSPADAWARYIMLPESGTGRVSVPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEEQLSIVQSEFAESKREADASLTGARADMQAALSSLAEDLAAVRKFMLQTAQLGWLNHELNVETETGLRKGATASQELSASSARLEEIMRELPKRLAGQLKELANRLDSIQGKVSSLK*
Ga0137407_1036709533300012930Vadose Zone SoilVVLPGDSPADAWARYTASPESATGRVSVPPVLTFVHRADVPKAPETVTQSGLQQQAQLLDAHRRVEERLDIVQREFAESKREVDASLAAARAEMQTALSSLAEDLTTVRKFVLQTAQLGWLNHELTVENATGIRKVVTTSQELSASSAKLEETMRQLSASLAGQLKELANRLDTIQGKVSNLK*
Ga0153915_1044054213300012931Freshwater WetlandsLTFVHRADVPKAPETVTRSGLQQQQALRTSVAALETQVGIVQRELAESIAATKGEADARADMQKALTSLSEDLAAVRKFMLQTAQLGWLNQELNVENASDIRKVAAASQELSASSARLEESLRQLSGSLAGQLKELAHRLDTIHGKIGSPK*
Ga0153915_1050181213300012931Freshwater WetlandsADAIHAAISQSGHVGALAFLDATDGRLVVLPGDSVADAWSRYTTSPESETGRVSVPPVLTFVHRADVPKAPETVTRSVLQQQQALRTSLAALETQLGVVQRELAESIAATKGEADARADMQKALTSLSEDLAAVRKFMLQTAQLGWLNHELNLENASDIRKVAAASQELSATSARLEETLRQLSGSLAGQLKELANRLDTIHGKVSSPK*
Ga0163162_1072846413300013306Switchgrass RhizosphereDGRLIVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHRADVPKAPETVTRVVLQQQAQLGDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASSAKLEETLRQLSESLAGQLKELASRLDTIRGKVSSLK*
Ga0180104_100362133300014884SoilADAWSRYTTSPESATGRGSVPPVLTFVHRADVPKAPETVTRSVLQQQAQLHDAYRRFAEQLGTVQRELAESIAATKREADARANMQTALTSLSEELATVRKFILQTAQLGWLNHELNVENATGIRKLGTASQEVSASSVRLEETMRQLSESLAGQLKELANRLDTIQGKVSSLK*
Ga0157379_1110414623300014968Switchgrass RhizosphereSAPPVLTFVHRADVPKAPETVTRVVLQQQAQLGDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASSAKLEETLRQLSESLAGQLKELASRLDTIRGKVSSLK*
Ga0187825_1017323713300017930Freshwater SedimentSPADAWSRYTTSPESGPGGVSVPPVLTFVHRADVAKAPETVTRSVLEQQQALRTSLAALETQVGIVQRGLAESIAATKGEADARADIQKALTSLSEDLAAVRKFMLQTAQLGWLNQELNVENASEIRKVAAASQELSASSARLEESLRQLSASLAGQLKELTHRLDIIHGKASSPK
Ga0187822_1029922513300017994Freshwater SedimentDAIHAAIGQSGRAGALAFLDAADGRLVVLPGDSPADAWSRYTTSPESGPGGVSVPPVLTFVHRADVPKAPETVTRSVLEQQQALRTSLAALETQVGIVQRGLAESIAATKGEADARADIQKALTSLSEDLAAVRKFMLQTAQLGWLNQELNVENAIEIRKVAAASQELSASAARLEESLRQLSASLAGQ
Ga0184610_126628413300017997Groundwater SedimentKDGRLVVLPGDSSADAWSRYTSSPESGTGRVSVPPVLTFVHRADVPKAPETMTRSLLQQQQALAALETELRDAHRRIEEQVGIVQRQLAESNREAAALQTALRSLSEDLAAVRKFMLQTAQLGWLNHELTVENASGIRKAATASQELSASSARLEETMLQLSKSLAGQLKELANRLDTIQGKVSSLK
Ga0184604_1009806723300018000Groundwater SedimentVLPGDTPADAWVRYTASPESGTGLVSVPVVVVFVHRADIPKAPEAVTQSALQQEQTLRTSVAALETELREAYRRIEERLSLAQRELAEAVAATKQAVAARADMQKELSSLAEDLAAARKFMLQTAQLGWLNHELNLENASGIRKVATASQELAASSARLEQTMRQLSDSLASQLKELANRLDTIQEKASSLK
Ga0184626_1006550813300018053Groundwater SedimentADAIQTATRESGNLGALAFLDAKDGRLVVLPGDTPADAWARYTASPESGTGRVSVPVVVVFVHRADIPKSPEAVTLSALQQQQALRTSVATLETELRDAYRRTEARLSLVQRELAESVAATKQEAAARADMQKELSSLAQDLAAARKFMLQTAQLGWLNHELNVENASGIRKVATASQELTASSARLEQTMRQLSDSLASQLKELANRLDTIQDKASTLK
Ga0184626_1007812513300018053Groundwater SedimentAIHAAIRQSGNVGALAFLDAKDGRLVVLPGDSSADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRSVLQQRQALETELRDAHRRIEEQLSIVQRALAESTAATKREADARADIQTALSSLSEELATVRKFMLQTAQLGWLNHELTVENASGIRRVVTTSQELSASSERLEETIRQLAKSLAAQLKELASRLDTIQGKVSSLK
Ga0184637_1061292713300018063Groundwater SedimentQSGNVGALAFLDAKDGRLVVLPGDSSADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRSVLQQQHAQLRDAYRRIEERLGIVQRELAEAKREADASLAAARADMQTALSSLAEDLAAVRKFMLQTAQLGWLNHELNVENASGIRKVATASQELSASSARLEETMRQLSGSLAAQLKELASRLDTIQGKVSSLK
Ga0184618_1005083913300018071Groundwater SedimentSQSGHVGALALLDAKDGRLVVLPGDSPADAWARYTASPESGAGRVSVPPVLTFVHRADVPKAPETVTQSVLQQQAQLLDAHRRVAERLDIVQREFAESKREVDTSLAAARAEMQTALSSLAEDLTTVRKFVLQTAQLGWLNHELAVENATGIRKVVTTSQELSASSAKLEETMRQLSASLAGQLKELANRLDTIQGKVSSFK
Ga0184609_1012607623300018076Groundwater SedimentLPGDTPADAWARYTASPESGTGRVSVPVVVVFVHRADIPKAPEPVTQGALQQQQALRTSVAALETELRDGYRRTEARLSLVQRELAESVAATKQEAAARADMQKELSSLAQDLAAARKFMLQTAQLGWLNHELNVENASGIRKVATASQELTASSARLEQTMRQLSDSLASQLKELANRLDTIQDKASTLK
Ga0184612_1007329613300018078Groundwater SedimentLAFLDAKDGRLVVLPGDSSADAWARYTTSPESGTDRVSVPPVLTFVHRADVPKAPETVTRSVLQQRQVLETELRDAHRRIEEQLGIVQRALAESTAATKREADARADMQTALSSLAEELATVRKFMLQTAQLGWLNHELTVENASGIRRVVTTSQELSASSERLEETIRQLAKSLAAQLKELASRLDTIQGKVSSLK
Ga0184639_1044527823300018082Groundwater SedimentLPGDSPADAWARYTTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRSLLQQQQALAALETELRDAHRRIEERLGIVQRELAESNREAAALQTALRSLSEDLAAVRKFMLQTAQLGWLNHELTVENASGIRKAATASQELSASSARLEETMLQLSKSLAGQLKELANRLDTIQGKVSSLK
Ga0184629_1002256113300018084Groundwater SedimentVGALAFLDAKDGRLVVLPGDSSADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPKAPEIVTRSVLQQRQALETELRDGHQRIEEQLGIVQRELAESTAATKREAGARADIQTALSSLSEELATVRKFMLQTAQLGWLNHELTVENASGIRRVVTTSQELSASSERLEETIRQLAKSLAAQLKELASRLDTIQGKVSSLK
Ga0190265_1152728723300018422SoilVHRADVPRAPETVTRSALQQQQALTAMANELRDAHRRIEEQVGTVQRELAESVAATKREAAARADMQTALTALSEELATVRKFMLQTAQLGWLNHELSVENATGIGKVATASQELSASSERMEETLRQLSKSLAGQLKDLATRLDTIQGKVSSLK
Ga0190272_1088120823300018429SoilRLVVLPGDGPADAWSRYATSPESATGRVSVPPVLTFVHRADVPQAPETVTRSVLQQQQALAALATELRDAHRRIEEQLVVVQRELAESVAATKREADARAVMQTALISLSEELATVRKFMLQTAQLGWLNHELNVENASGIGKVATASRELSASSERLEETLRQLSKSLAGQLKELANRLDTIQGKVSSLK
Ga0184643_136290613300019255Groundwater SedimentRLVVLPGDSPADAWARYTASPESGAGRVSVPPVLTFVHRADVPKAPETVTQSVLQQQAQLLDAHRRVAERLDIVQREFAESKREVDTSLAAARAEMQTALSSLAEDLTNVRKFVLQTAQLGWLNHELAVENATGIRKVVTTSQELSASSAKVEETMRQLSASLAGQLKELANRLDTIQGKVSSFK
Ga0193723_105966013300019879SoilGDSPADAWARYTASPESGTGRVSVPPVLTFVHRADVPKAPETVTQSALQQQAQLLDAHRRVEERLGVVQREFAESKREVDASLAAARAEMQTALSSVAEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVVTASQELSASSAKLEETMRQLSASLTGQLKELANRLDTIQGKVSNLK
Ga0193718_111432713300019999SoilKDGRLVVLPGDSPADAWARYTASPESGAGRVSVPPVLTFVHRADVPKAPETVTQSVLQQQAQLLDAHRRVAERLDVVQREFAESKREVDASLAAARAEMQTALSSLAEDLTNVRKFVLQTAQLGWLNHELAVENATGIRKVVTTSQELSASSAKLEETMRQLSASLAGQLKELANRLDT
Ga0180118_136931813300020063Groundwater SedimentAGALAFLDADDGRLIVLPGDSPADAWARYTASTEGGTGRASPPPVLTFVHRADVPKAPETVTRGVLQQQAQLHDAYRRFAEQLGTVQRELAESIAATKREADARANMQTALTSLSEELATVRKFMLQTAQLGWLNHELNVENATGIRKLGTASQEVSASSVRLEETMRQLSESLAGQLKELANRLDTIQGKVSSLK
Ga0210382_1038780613300021080Groundwater SedimentSPADAWARYTASPESGAGRVSVPPVLTFVHRADVPKAPETVTQSVLQQQAQLLDAHRRVAERLDIVQREFAESKREVDASLAAARAEMQTALSSLAEDLTNVRKFVLQTAQLGWLNHELAVENATGIRKVVTTSQELSASSAKVEETMRQLSASLAGQLKELANRLDTIQGKVSSFK
Ga0224452_109851323300022534Groundwater SedimentFVHRADVPKAPETVTRSVLQQRQALETELRDGHRRIEEQLGIVQRELAESTAATKREADARADMQTALSSLSEELATVRKFMLQTAQLGWLNHELTVENASGIRRVVTTSQELSASSERLEETIRQLAKSLAAQLKELASRLDTIQGKVSTLK
Ga0224452_110098813300022534Groundwater SedimentDSPADAWARYTASPESGAGRVSVPPVLTFVHRADVPKAPETVTQSVLQQQAQLLDAHRRVAERLDIVQREFAESKREVDASLAAARAEMQTALSSLAEDLTTVRKFVLQTAQLGWLNHELAVENATGIRKVVTTSQELSASSAKLEETMRQLSASLAGQLKELANRLDTIQGKVSSFK
Ga0222623_1014207813300022694Groundwater SedimentPGDSSADAWSRYTSSPESGTGRVSVPPVLTFVHRADVPKAPETVTRSVLQQRQALETELRDGHRRIEEQLGIVQRELAESTAATKREADARADMQTALSSLSEELATVRKFMLQTAQLGWLNHELTVENASGIRRVVTTSQELSASSERLEETIRQLAKSLAAQLKELASRLDTIQGKVSTLK
Ga0222622_1079425413300022756Groundwater SedimentSVPPVLTFVHRADVPKAPETVTQSGLQQQAQLLDAHRRVAERLDIVQREFAESKREVDASLAAARAEMQTALSSLAEDLTTVRKFVLQTAQLGWLNHELAVENATGIRKVVTTSQELSASSAKLEETMRQLSASLAGQLKELANRLDTIQGKVSSFK
Ga0210131_109830813300025551Natural And Restored WetlandsTTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRSALQQQQALAVLETELRHIEAQLGIVQRELAESIAAAKREAVARADMQTALSSLHEDLATVRKFMLQTAQLGWLNHELVVENASSMRKVATASQEMSASSERLEETMRQLSKSLAGQLKDLANRLDTIQGKVSSL
Ga0210138_102024133300025580Natural And Restored WetlandsAIHAAISQSGKVGALAFLDAQDGRLVVLPGDSPADAWSRYTTSPESGTGRVSVPPVLTFVHRADVPKAPETVTRSALQQQQALAVLETELRHIEAQLGIVQRELAESIAAAKREAVARADMQTALSSLHEDLATVRKFMLQTAQLGWLNHELVVENASSMRKVATASQEMSASSERLEETMRQLSKSLAGQLKDLANRLDTIQGKVSSLK
Ga0207710_1057666213300025900Switchgrass RhizosphereSGSLGGLAFLDAKDSRLVVLPGDSPADAWARYATSPESGAGRVSVPAVLTFVYRADIPKAPETVTQNLLEQQQVFRRSLAALETELRDADRRTEQRVGIVQRELAESIAATKLDTDRSLAVVRADMQKALTSLAEELESARKFMLQTAQLGWLNQELIVENASGIRRVATASQELTANSAKLADTMRQLSESLAT
Ga0207647_1039491323300025904Corn RhizosphereIGALAFLDAKDGTLVVLPGDSPADAWARYATSPESGTGRVSVPAVLTFVYRADVLKAPELVTQNLLQQQQAFRRSLTAFETELRDADRRTEQRLGIVQREQRELAEAIAAAKQETDRSLAVVRADVQTALGSLAEELDSARKFMLQTAQLGWLNHELAVENATGIRKVVTTSQELSASSAKLEETMRQLSASLAGQLKELANRLDTIQGKVSSFK
Ga0207645_1063048013300025907Miscanthus RhizosphereLIVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHRADVPKAPETVTRVVLQQQAQLGDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASSAKLEETLRQLSESLAGQLKELASRLDTIRGKVSSLK
Ga0207684_1069971823300025910Corn, Switchgrass And Miscanthus RhizosphereGRVSVPPVLTFVHRADVPKAPETVTRGVLQQQQALRTAVAALETQLGIVRRELAESIGATKREAAARADMQTALSSLSEDLAAVRKFMLQTAQLGWLNHELNVENATGIRKVATASQELSASAARLEETMHQLSESLAGQLKELANRLDTIQGKVSSLK
Ga0207662_1113533513300025918Switchgrass RhizosphereVSHGAAIQAAIRQSGTIGALAFLDAKDGTLVVLPGDSPADAWARYATSPESGTGRVSVPAVLTFVYRADVLKAPELVTQNLLQQQQAFRRSLTAFETELRDADRRTEQRLGIVQREQRELAEAIAATKQETDRSLAVVRADVQTALGSLAEELDSARKFMLQTAQLGWLNHELILENAGGIRRVA
Ga0207712_1174129213300025961Switchgrass RhizosphereVLPGDSPAEAWSRYIASPEGETGRASAPPVLTFVHRADVPKAPETVTQGVLQQQAQLRDAYQRFEEQLGTVQRELAASIAATKREADARADMQTALTALSEELATVRKFMLQIAQLGWLNHELVVENATGIRKLGAASQELSASSAKLEETLRQLSESLAGQLKELASRLDTIRGKVSSL
Ga0210090_102050123300025965Natural And Restored WetlandsAWARYTTSPESETGRVSVPPVLTFVHRADVPKTPETVTRSVLQQQQAVAALATELRDAHRRVEEQLGIVQGELADSIAATKQEAAARADMQTALTSLSEELATVRKFMLQTAQLGWLNHELNVENASGIGKMATASQELSASSERLEETLRQLSKSLAAQLKELANRLDTIQGKVSSLK
Ga0207676_1107604823300026095Switchgrass RhizosphereKDSRLVVLPGDSPADAWARYATSPESGAGRVSVPAVLTFVYRADIPKAPETVTQNLLEQQQVFRRSLAALETELRDADRRTEQRVGIVQRELAESIAATKLDTDRSLAVVRADMQKALTSLAEELESARKFMLQTAQLGWLNQELIVENASGIRRVATASQELTANSAKLADTMRQLSESLATQLKELANRLDAIQGSVSNVK
Ga0209438_100576753300026285Grasslands SoilGDSPADAWARYTASPESGTGRVSLPPVLTFVHRADVPQAPETVTLSVLQQQAQLLDAHRRVAERLDLVQGELAASKREVDASLAAARAEMQTALSSVAEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVVTASQELSASSAKLEETMRQLSGSLAGQLKELSSRLDSIQGKVSNLK
Ga0257176_104605223300026361SoilPADAWARYTMLPENGTGRVSMPPVLTFVHRADVPKAPEAIPRSVLQQQAQLRDAWRRIEERLSIVQSELAESTREADASLTGARADMHAALSSLAEDLAAVRKFMLQTAQLGSLNHELNVETETGLRTVATASQELSASSARLEEIMRELPERLAGQLKELANRLDTIQGKVSSLK
Ga0257171_104074423300026377SoilWARYTASSESGTGRVSVPPVLTFVHRADVPKAPETVTQSALQQQAQLLDAHRRVEERLGVVQREFAESKREVDASLAAARAEMQTALSSVAEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVVTASQELSASSAKLEETMRQLSASLTGQLKELANRLDTIQGKVSNLK
Ga0257169_105687923300026469SoilAWARYTASPESGTGRVSVPPVLTFVHRADVPKAPETVTQSALQQQAQLLDAHRRVEERLGVVQREFAESKREVDASLAAARAEMQTALSSVAEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVVTASQELSASSAKLEETMRQLSASLTGQLKELANRLDTIQGKVSNLK
Ga0257177_101459823300026480SoilGTGRVSVPPVLTFVHRADVPKAPETVTQSALQQQAQLLDAHRRVEERLGVVQREFAESKREVDASLAAARAEMQTALSSVAEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVVTASQELSASSAKLEETMRQLSASLTGQLKELANRLDTIQGKVSNLK
Ga0257165_100562713300026507SoilSMPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEEQLSIVQSEFAESKREADASLTGARADMQAALSSLAEDLAAVRKFMLQTAQLGSLNHELNVETETGLRTVATASQELSASSARLEEIMRELPERLAGQLKELANRLDTIQGKVSSLK
Ga0209886_106301413300027273Groundwater SandALPESGRGRRSVPPVLTFVHRADIPKAPETVTRSVLQQQQAQFLDAHRRIEERLGIVERELAESIAATKREAAARADMQTALRSLAEDLAAVRKFMLQTAQLGWLNHELTVENASGIRRVATASQELSASSERLEETIRQLAKSLAAQLKELASRLDTIQGKVSSLK
Ga0209117_102973413300027645Forest SoilALVFLDAKDSHLVVLPGDSPADAWARYIMLPESGTGRVSVPPVLTFVHRADVPKAPETITGSVLQQQAQLRDARRRIEERLSIVQSELAESKREADASLAGARADMQAALSSLAEDLAAVRKFMLQTAQLGWLNHELNVENETGIRKVATASQELSASSARVEETMRQLSGSLAGQLKELATRLDTIQGKVSSLK
Ga0209860_101585913300027949Groundwater SandSSADAWSRYIASPEGETGRVSVPPVLTFVHRADVPKAPETVTRSVLQQQQALRTSMTALETQLGIVQRELAESIAATKRGADARADMQTALTSLSEDLAAVRKFMLQTAQLGWLNHELNVENAAGIRKVATASQELAASSVRLEETMRQLSESLAGQLKELAKRLDTIQGKVSSLR
Ga0268264_1023246513300028381Switchgrass RhizosphereDSPADAWARYATSPESGTGRVSVPAVVTFVYRADVPKAPETVTQNILQQQQAFRRSLAALETELRDADRSTEQRLGIVQRELAESSAATKQETERSLALVRADMQKALSSLAEELDSARKFMLQTAQLGWLNHELNVENASGIRKVAAASQELTANSARLADTMRQLSESLASQLKELANRLDTIQGLVSTVK
Ga0307296_1057979613300028819SoilLLDAKDGRLVVLPGDSPADAWARYTASAENGTGRVSVPVVVVFVHRADIPKAPEAVTQSALQQEQTLRTSVAALETELREAYRRIEERLSLAQRELAEAVAATKQAVAARADMQKELSSLAEDLAAARKFMLQTAQLGWLNHELNLENASGIRKVATASQELASSSARLEQTMRQLSDSLASQLKELANRLDTIQEKASSLK
Ga0307308_1037351413300028884SoilSQSGHVGALALLDAKDGRLVVLPGDSPADAWARYTASPESGAGRVSVPPVLTFVHRADVPKAPETVTQSVLQQQAQLLDAHRRVAERLDIVQREFAESKREVDTSLAAARAEMQTALSSLAEDLTNVRKFVLQTAQLGWLNHELAVENATGIRKVVTTSQELSASSAKLEETMRQLSASLAGQLKELANRLDTIQGKVSSFK
(restricted) Ga0255310_1010317923300031197Sandy SoilYIASPEGETGRVSVPPVLTFVHRADVPKAPETVTRGVLQQQAQLRDAYRRFEEQLGIVQRELAESIAATKRGADARADMQTALTSLSEDLAAVRKFMLQTAQLGWLNHELNVENASGMRKVATASQELSASSARLEETMRQLSESLAGQLKELANRLDTIQGKVSNLK
(restricted) Ga0255310_1020972813300031197Sandy SoilLPGDSPADAWSRYTTSPESGPGGVSVPPVLTFVHRADVAKAPETVTRSVLQQQQALRTSLAALETQVGIVQREVAESIAATKVEADARADIQKALTSLSEDLAAVRKFMLQTAQLGWLNQESNVENASEIRKVAAASQELSASSARLEESLRQLSGSLAGQLKELTHRLETIHGKVSSPK
Ga0307469_1225492613300031720Hardwood Forest SoilGDSPADSWARYTASPESGTGRVSEPPAVLTFVYRADVPKAPEAVTRSDLQRQQALGTSVAALETELRDAYRVVEERLGLVQRDLAASIATSKGEADASLAAARGDLQRALGLVSDDLAAVRKFMLQTAQLGWLTHELSVENAGGMRKVTTASQEMAASSAKLEETVRQLSATLAGQL
Ga0307473_1044880223300031820Hardwood Forest SoilASSPESGTGRVSVPVVVTFVHRADVPKAPEPVTRSALQEQQALRASVTALSTELGDVYQRLEERLGIVRRELAESIITTTQKTDESLAAARAATQNELKALAEELAAQRKFMLQTAQLGWLNHELNVENANGIRKVATASQELIASSARLEDTMRQLSDSLASQLKELANRLDTIQAKASSLK
Ga0307470_1065051423300032174Hardwood Forest SoilLPGESPADAWARYTMLPENGTGRVSMPPVLTFVHRADVPKAPETIPRSVLQQQAQLRDAWRRIEERLSIVQSELAESKREADASLTGARADMHAALSSLAEDLAAVRKFMLQTAQLGSLNHELNVETETGLRKVATASQELSASSARLEEIMRELPERLAGQLKELANRLDTIRGKVSSL
Ga0307471_10058557723300032180Hardwood Forest SoilIVLPGDSSADAWSRYKASPEGETGRVSVPPVLTFVHRADVPKAPETVTQGVLQQEARLRDAYRRSEEQLGTVQRELAESIAATKRQTDARADMQAALTSLSEDLATVRKFMLQTAQLGWLNHELNVENATGIRKVATASQELSASSVRLEETMRQLSESLAGQLKELAKRLDTIQGKVSSLK
Ga0307471_10112386223300032180Hardwood Forest SoilDGETGRVSVPPVLTFVHRADVPKAPETVTGSALQQQAQLRDAYRRFEEQLGTVQRELAESVAAAKRETDARTDLQAALTSLSEELATVRKFMLQTAQLGWLNHELNVENATGMRKVAAASQELSASSARLEETLRQLSESLTAQLKELANRLDTIQGKVSSLK
Ga0326726_1012006213300033433Peat SoilEAWSRYTTSPESGTGRVAVPPVLTFVHRADVPKAPETVTRSVLQQQQAWRTSVAALETQLGLVQRELAESIAATKGEAAARTDMQTALSSLSEELAAVRKFMLQTAQLGWLNHELNVENASDVRKVATASQELSASSARLEETMRQLSESLAGQLKELATRLDTIQGKVSSLK
Ga0316624_1057343323300033486SoilRLVVLPGDSPADAWSRYTTSPESGPGGVSVPPVLTFVHRADVPQAPETVTRSVLQQQQALRTSVAALETQLGIVQRELAESIAATKGEADARADMQKALTSLSEDLAAVRKFMLQTAQLGWLNQELNVENASDIRKVAAASQELSASSARLEESLRQLSGSLAGQLKELAHRLDTIHGKIGSPK
Ga0326730_106746813300033500Peat SoilNAGALVFLDAKDGRLVVLPGDSSAEAWSRYTTSPESGTGRVAVPPVLTFVHRADVPKAPETVTRSVLQQQQAWRTSVAALETQLGLVQRELAESIAATKGEAAARTDMQTALSSLSEELAAVRKFMLQTAQLGWLNHELNVENASDVRKVATASQELSASSARLEETMRQLSESLAGQLKELATRLDTIQGKVSSLK
Ga0326732_106594713300033501Peat SoilAFLDAKDGRLVVLPGDSSAEAWSRYTTSPESGTGRVAVPPVLTFVHRADVPKAPETVTRSVLEQQQALRTSVAALETRLGLVQRELAESIAATKGEAAARTDMQTALSSLSEELAAVRKFMLQTAQLGWLNHELNVENASDIRKAAAASQELSASSARLEETMRQLSESLAGQLKELATRLDTIQGKVSSFK
Ga0316628_10381639613300033513SoilRDRALASRADAIHAAISQSGHLGALAFLDAADGRLVVLPGDSPADAWSRYTTSPESGPGGVSVPPVLTFVHRADVAKAPETVTRSVLQQQEALRTSLAALETQVGIVQRGLAESMAATKGEADARADMQKALTALSEDLAAVRKFMLQTAQLGWLNQELNVENASEIRKVAAASQELSA
Ga0364928_0161886_3_5513300033813SedimentAGALAFLDAKDGRLVVLPGDSSADAWSRYTSSPESGTGRVSAPPVLTFVHRADVPKAPETVTRSVLQQRQALETELRDAHRRIEEQLGIVQRELAESTAATKREAGARADMQTALSSLSEELATVRKFMLQTAQLGWLNHELTVENASGIRRVVTTSQELSASSERLEETIRQLAKSLAAQLK
Ga0370495_0246329_1_5823300034257Untreated Peat SoilVAIHAAIGQSGNAGALAFLDAKDGRLIVLPGDSPADAWSRYVASPEGETGRGSVPPVLTFVHRADVPKAPETVTRSVLQQQQALAALATELRDAHRRIEEQLGAVQRELAESIAATKREADARADMQTALTALSEELATARKFMLQTAQLGWLNHELVVENATGMRKLATSSQELSASSGRLEETLRQLSESLA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.