NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F054648

Metagenome Family F054648

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F054648
Family Type Metagenome
Number of Sequences 139
Average Sequence Length 196 residues
Representative Sequence LERDELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Number of Associated Samples 132
Number of Associated Scaffolds 139

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 1.43 %
% of genes near scaffold ends (potentially truncated) 49.64 %
% of genes from short scaffolds (< 2000 bps) 46.04 %
Associated GOLD sequencing projects 120
AlphaFold2 3D model prediction Yes
3D model pTM-score0.86

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (68.345 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(19.425 % of family members)
Environment Ontology (ENVO) Unclassified
(35.971 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(25.180 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 68.35%    β-sheet: 0.00%    Coil/Unstructured: 31.65%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.86
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
a.118.1.1: Armadillo repeatd1wa5c_1wa50.62568
a.118.1.19: Exportin HEAT-like repeatd3ibva_3ibv0.61851
a.118.1.0: automated matchesd3vyca_3vyc0.60595
a.118.1.1: Armadillo repeatd3ea5b_3ea50.6057
a.118.17.0: automated matchesd2wzka12wzk0.60369


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 139 Family Scaffolds
PF00501AMP-binding 29.50
PF13487HD_5 6.47
PF13193AMP-binding_C 4.32
PF00873ACR_tran 1.44
PF00990GGDEF 1.44
PF13520AA_permease_2 1.44
PF01796OB_aCoA_assoc 0.72
PF02538Hydantoinase_B 0.72
PF00108Thiolase_N 0.72
PF00589Phage_integrase 0.72
PF00909Ammonium_transp 0.72

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 139 Family Scaffolds
COG0146N-methylhydantoinase B/oxoprolinase/acetone carboxylase, alpha subunitAmino acid transport and metabolism [E] 1.44
COG0004Ammonia channel protein AmtBInorganic ion transport and metabolism [P] 0.72
COG0183Acetyl-CoA acetyltransferaseLipid transport and metabolism [I] 0.72
COG1545Uncharacterized OB-fold protein, contains Zn-ribbon domainGeneral function prediction only [R] 0.72


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A68.35 %
All OrganismsrootAll Organisms31.65 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300003503|JGI26141J51220_1006370All Organisms → cellular organisms → Bacteria746Open in IMG/M
3300004058|Ga0055498_10027001All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria895Open in IMG/M
3300004156|Ga0062589_100511066All Organisms → cellular organisms → Bacteria1016Open in IMG/M
3300005181|Ga0066678_10417955All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria890Open in IMG/M
3300005341|Ga0070691_10085286All Organisms → cellular organisms → Bacteria1551Open in IMG/M
3300005345|Ga0070692_10911439All Organisms → cellular organisms → Bacteria → Terrabacteria group → Deinococcus-Thermus → Deinococci → Deinococcales → Deinococcaceae → Deinococcus → Deinococcus hopiensis608Open in IMG/M
3300005355|Ga0070671_100365449All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1232Open in IMG/M
3300005468|Ga0070707_100211968All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1888Open in IMG/M
3300005468|Ga0070707_100618869All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1045Open in IMG/M
3300005878|Ga0075297_1001117All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1831Open in IMG/M
3300005879|Ga0075295_1048683Not Available568Open in IMG/M
3300006028|Ga0070717_11192291Not Available693Open in IMG/M
3300006057|Ga0075026_100377319All Organisms → cellular organisms → Bacteria792Open in IMG/M
3300007076|Ga0075435_102040751All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Corynebacteriaceae → Corynebacterium → Corynebacterium sphenisci503Open in IMG/M
3300009089|Ga0099828_11287055Not Available647Open in IMG/M
3300009804|Ga0105063_1006046All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1183Open in IMG/M
3300009806|Ga0105081_1038295All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium661Open in IMG/M
3300010360|Ga0126372_11063805All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium825Open in IMG/M
3300010397|Ga0134124_11306252All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium749Open in IMG/M
3300010400|Ga0134122_11406293Not Available712Open in IMG/M
3300010401|Ga0134121_10019639All Organisms → cellular organisms → Bacteria → Proteobacteria5398Open in IMG/M
3300011270|Ga0137391_10832200All Organisms → cellular organisms → Bacteria759Open in IMG/M
3300011419|Ga0137446_1106578Not Available669Open in IMG/M
3300012225|Ga0137434_1001232All Organisms → cellular organisms → Bacteria → Proteobacteria1987Open in IMG/M
3300012226|Ga0137447_1016363All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1089Open in IMG/M
3300012906|Ga0157295_10370477Not Available525Open in IMG/M
3300012929|Ga0137404_12263209Not Available508Open in IMG/M
3300012957|Ga0164303_10089843All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1491Open in IMG/M
3300014870|Ga0180080_1061314Not Available638Open in IMG/M
3300014968|Ga0157379_10418518All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1234Open in IMG/M
3300017930|Ga0187825_10383105Not Available538Open in IMG/M
3300018027|Ga0184605_10070596All Organisms → cellular organisms → Bacteria1512Open in IMG/M
3300018059|Ga0184615_10383240Not Available774Open in IMG/M
3300018089|Ga0187774_10267213All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria978Open in IMG/M
3300018422|Ga0190265_10033056All Organisms → cellular organisms → Bacteria4329Open in IMG/M
3300019882|Ga0193713_1120433All Organisms → cellular organisms → Bacteria722Open in IMG/M
3300019890|Ga0193728_1225139Not Available770Open in IMG/M
3300021088|Ga0210404_10290810Not Available897Open in IMG/M
3300021476|Ga0187846_10083532All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1385Open in IMG/M
3300021560|Ga0126371_11486956All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria806Open in IMG/M
3300023072|Ga0247799_1007172All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1576Open in IMG/M
3300025922|Ga0207646_10083085All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2864Open in IMG/M
3300025922|Ga0207646_10857304All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium807Open in IMG/M
3300025931|Ga0207644_10331717All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1232Open in IMG/M
3300025957|Ga0210089_1043067Not Available573Open in IMG/M
3300026142|Ga0207698_12245560Not Available558Open in IMG/M
3300026497|Ga0257164_1010552All Organisms → cellular organisms → Bacteria1174Open in IMG/M
3300027650|Ga0256866_1143298Not Available648Open in IMG/M
3300027682|Ga0209971_1009075All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2360Open in IMG/M
3300027875|Ga0209283_10151081All Organisms → cellular organisms → Bacteria1540Open in IMG/M
3300027910|Ga0209583_10156182All Organisms → cellular organisms → Bacteria937Open in IMG/M
3300028587|Ga0247828_10485799All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria731Open in IMG/M
3300028719|Ga0307301_10291103Not Available535Open in IMG/M
3300028792|Ga0307504_10318722Not Available590Open in IMG/M
3300028792|Ga0307504_10387343Not Available547Open in IMG/M
3300028809|Ga0247824_10444849All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium756Open in IMG/M
3300028812|Ga0247825_10910847Not Available637Open in IMG/M
3300028878|Ga0307278_10068855All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1597Open in IMG/M
(restricted) 3300031197|Ga0255310_10086132All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium837Open in IMG/M
(restricted) 3300031197|Ga0255310_10148372Not Available644Open in IMG/M
3300031576|Ga0247727_10733650Not Available717Open in IMG/M
3300031716|Ga0310813_11603595Not Available608Open in IMG/M
3300031720|Ga0307469_11171528Not Available725Open in IMG/M
3300031754|Ga0307475_10452607Not Available1032Open in IMG/M
3300031949|Ga0214473_10003201All Organisms → cellular organisms → Bacteria19930Open in IMG/M
3300032180|Ga0307471_103943480Not Available524Open in IMG/M
3300032893|Ga0335069_12032260Not Available605Open in IMG/M
3300033432|Ga0326729_1001880All Organisms → cellular organisms → Bacteria4424Open in IMG/M
3300033475|Ga0310811_10499098All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1275Open in IMG/M
3300033513|Ga0316628_101023855All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1096Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil19.42%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands7.91%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil7.19%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil7.19%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.47%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere3.60%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.88%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.88%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment2.16%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment2.16%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment2.16%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.16%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil2.16%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand2.16%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment1.44%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.44%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.44%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.44%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.44%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil1.44%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.44%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands1.44%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil1.44%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.44%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.44%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.44%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere1.44%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.44%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.72%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.72%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.72%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.72%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.72%
FossillEnvironmental → Terrestrial → Soil → Fossil → Unclassified → Fossill0.72%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil0.72%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm0.72%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm0.72%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.72%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.72%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.72%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere0.72%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000891Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soilEnvironmentalOpen in IMG/M
3300003503Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S AMHost-AssociatedOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004025Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004058Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004062Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005204Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D2EnvironmentalOpen in IMG/M
3300005205Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D2EnvironmentalOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005355Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaGHost-AssociatedOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300005878Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104EnvironmentalOpen in IMG/M
3300005879Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_301EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006057Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300009804Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40EnvironmentalOpen in IMG/M
3300009806Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_50_60EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011419Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT357_2EnvironmentalOpen in IMG/M
3300011439Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT820_2EnvironmentalOpen in IMG/M
3300012035Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT338_2EnvironmentalOpen in IMG/M
3300012133Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT121_2EnvironmentalOpen in IMG/M
3300012225Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2EnvironmentalOpen in IMG/M
3300012226Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT400_2EnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012906Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S212-509R-1EnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300014269Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D1EnvironmentalOpen in IMG/M
3300014324Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D1EnvironmentalOpen in IMG/M
3300014870Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT560_16_10DEnvironmentalOpen in IMG/M
3300014873Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10DEnvironmentalOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300015170Fossil microbial communities from human bone sample from Teposcolula Yucundaa, Mexico - TP48EnvironmentalOpen in IMG/M
3300015251Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT293_16_10DEnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018059Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coexEnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018089Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019882Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2EnvironmentalOpen in IMG/M
3300019889Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2c2EnvironmentalOpen in IMG/M
3300019890Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c1EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020018Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2s2EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300020060Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2c2EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300021476Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)EnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300023072Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S151-409C-6EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025558Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025569Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025907Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025931Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025955Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025957Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025973Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300026142Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026497Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-BEnvironmentalOpen in IMG/M
3300027273Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027650Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 HiSeqEnvironmentalOpen in IMG/M
3300027682Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027873Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300028043 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MGEnvironmentalOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028587Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3EnvironmentalOpen in IMG/M
3300028711Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150EnvironmentalOpen in IMG/M
3300028719Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_182EnvironmentalOpen in IMG/M
3300028791Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028793Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159EnvironmentalOpen in IMG/M
3300028807Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186EnvironmentalOpen in IMG/M
3300028809Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48EnvironmentalOpen in IMG/M
3300028812Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031455Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_SEnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032893Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1EnvironmentalOpen in IMG/M
3300033432Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fractionEnvironmentalOpen in IMG/M
3300033475Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YCEnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300033812Sediment microbial communities from East River floodplain, Colorado, United States - 65_j17EnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI10214J12806_1187613423300000891SoilAPARGAAPPTVTPEWRAQVAALIRGRECAEARQLLDPALARGEVDGETAAFLLEICSTAVARDLWRLRRALRRGGGGEAPLEGTLEVTQVMLEAAVTETLPQERRRRVSARLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDHERAVALEQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIEQAPVR*
JGI26141J51220_100637013300003503Arabidopsis Thaliana RhizosphereIRRRELGEARGLIDSALASREMTIGSAGVLLEACSTAIARDLSRLRRAARRGSGDEAPLGALMEATRAILESDPASGLAAEPRQQAGRRLWRGHTRLGLRRWRAGDFEAAVETLFGALSVPGPGERRRRLGRDLLVRTLEDAAGQSLELIPELRGDGDRAAAVERAQRVLEHIRRARAAGVPAEDLAVAASRARQLLEHIEQTSVR*
Ga0055437_1020481313300004009Natural And Restored WetlandsKAPSRAAVTAVPPEPEAPPQEARATVTQEWRSRVAALIRGRESGEARQLLEPALERGEVDRETARFLLDVCSTALARDLSRLRRAVRPGGGDEAPLEGSLEVTRLVLEASVAEALPREQRRRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRARALEQTQRLRDHI
Ga0055433_1020290313300004025Natural And Restored WetlandsLIRGRESGEARQLLEPALERGEVDRETARFLLDVCSTALARDLSRLRRAVRPGGGDEAPLEGSLEVTRLVLEASVAEALPREQRRRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRARALEQ
Ga0055498_1002700123300004058Natural And Restored WetlandsPGGGDEAPLEGSLEVTRLALEASVAAALPREQRGRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRALALEQAQRLRDHIRRAREEGVSAEDLAVAASRARQLLDHIEQAPVR*
Ga0055500_1000122833300004062Natural And Restored WetlandsVTHEWRSRVAALIRGRESGEARQLLEAALERDEVDRETAGFLLDVCSTAVARDLSRLRRAVRPGGGDEAPLEGSLEVTRLALEASVAAALPREQRGRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRALALEQAQRLRDHIRRAREEGVSAEDLAVAASRARQLLDHIEQAPVR*
Ga0062589_10051106623300004156SoilAWRERVAGHIRRRELGEARGLIDSALASREMTIGSAGVLLEACSTAIARDLSRLRRAARRGSGDEAPLGALMEATRAILESDPASGLAAEPRQQAGRRLWRGHTRLGLRRWRAGDFEAAVETLFGALSVPGPGERRRRLGRDLLVRTLEDAAGQSLELIPELRGDGDRAAAVERAQRVLEHIRRARAAGVPAEDLAVAASRARQLLEHIEQTSVR*
Ga0062592_10124607613300004480SoilEWRARVATLIRGRESAEARQLLDPALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLAVTRVMLEAPVTRGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHTPVR*
Ga0066678_1041795523300005181SoilLAARDARPESVGFLLEVCSIATARELWRLRRALRRGVGDEASLGGALETTRLLLDSEPAAGLPSEERGRAGRRLWRGHTRLGLRRWRAGEFDDAVEALFKALAVPGLDDRRRALARDLLVRTLEDLAGQRLELIPQLLGDGDRPAALEQAQRLAAQVGRAREEGVSAEDLAVAAARARQLLEHIEHTPAR*
Ga0068997_1002270923300005204Natural And Restored WetlandsVTQEWRSRVAALIRGRESGEARQLLEAALERDEVDRETAGFLLDVCSTAVARDLSRLRRAVRPGGGDEAPLEGSLEVTRLALEASVAAALPREQRGRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRALALEQAQRLRDHIRRAREEGVSAEDLAVAASRARQLLDHIEQAPVR*
Ga0068999_1000368513300005205Natural And Restored WetlandsEVDRETAGFLLDVCSTAVARDLSRLRRAVRPGGGDEAPLEGSLEVTRLALEASVAAALPREQRGRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRALALEQAQRLRDHIRRAREEGVSAEDLAVAASRARQLLDHIEQAPVR*
Ga0070691_1003032023300005341Corn, Switchgrass And Miscanthus RhizosphereVTEEWRARVATLIRGRESAEARQLLDPALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLAVTRVMLEATVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHTPVR*
Ga0070691_1008528623300005341Corn, Switchgrass And Miscanthus RhizosphereVVELAVAAGDVTPDAAAFLLEVCSTATARELWRLRRALRRGAGDETPLGGALETARLLLDCGPAAARGPAGRRLWRGHTRLGLRRWRAGDFDAAVGTLFQALAVPGLDDRRRALARDLLVRTLEDMAGQRLELIPQPLGDGDRAAALEQAQRLLAHVRRAREEGIAAEDLAVAAARARQLLEHIEHTPVR*
Ga0070692_1091143913300005345Corn, Switchgrass And Miscanthus RhizosphereAWRERVAGHIRRRELGEARGLIDSALASREMTIGSAGVLLEACSTAIARDLSRLRRAARRGSGDEAPLGALMEATRAILESDPASGLAAEPRQQAGRRLWRGHTRLGLRRWRAGDFEAAVETLFGALSVPGAGERRRRLGRDLLVRTLEDAAGQSLELIPELRGDGDRAAAVERAQRVLEHIRRARAAGVPAEDLAVAASRA
Ga0070671_10036544913300005355Switchgrass RhizosphereVCSIASARELWRLRRATRRGTGDEAPLQGALAMTRLLLDSPPAAELREDARGRAGRRLWRGHARVGLRRWRAGDFDVAVDALFGALGVPGLDERRRTLARDLLVRTLEDMAGQRLELIPQLLGDGDRAAALEQARSLATHVARAREDGIAPEDLAVAAARARQLLENVEHDPVE*
Ga0070707_10021196813300005468Corn, Switchgrass And Miscanthus RhizosphereVVESALAAREAHPESVAFLVEVCSTATARELWRLRRALRRGAGDEAPLAGALETARVLLDSRPAAGLPAEARSRAGRRLWRGHTRLGLRRWRAGHFDQAVAALFQALAVPGLDERRRALARDLLVRTLEDMAAQRLDLIPQLLGDGDRPAALEQAQRLTAHVQRAREEGVTAESLAVAAARARQLLEHIEHTPVQ*
Ga0070707_10061886913300005468Corn, Switchgrass And Miscanthus RhizosphereTARELWRLRRALRRGAGDETPLGGALETARLLLDSEPAAGLSSAARGRAGRRLWRGQTRLGLRRWRAGEFDPAVEALFQALAVPGLDDRRRALARDLLVRTLEDMAGQRLELIPQLLGDGDRVAALGQAQRLAAHVARAREEGIAAEDLAVAAARARQLLEHIEHTPVR*
Ga0070695_10032712513300005545Corn, Switchgrass And Miscanthus RhizosphereDAPVEGEAPARGAAPPTVTPEWRAQVAALIRGRECAEARQLLDPALARGEVDGETAAFLLEICSTAVARDLWRLRRALRRGGGGEAPLEGTLEVTQVMLEAAVTETLPQERRRRVSARLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDHERAVALEQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIEQAPVR*
Ga0068860_10014528823300005843Switchgrass RhizosphereAEPPAPEPARATVTEEWRARVATLIRGRESAEARQLLDPALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLAVTRVMLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHTPVR*
Ga0075297_100111713300005878Rice Paddy SoilAERTAFLLEVCSTAIVRDLWRLRRALRRGAGEEAPLVGCMEMTGALLESSAAAGLPAAQRRRVARRLWRGHARLGLRKWRAGEFDAAAECLFRAAAGPEIDERRRRLARDLLVRTLEDLAGLSLEVIPQLLDDGDRAAALERAQRLLGHIRRARAEGVSTEDLTVSASRARQLLEHIEHTPVR*
Ga0075295_104868313300005879Rice Paddy SoilAGFLLEVCSTVIARDLWRLRRALRRGTGEEAPLVGCMEIARALLEAPVAAALPAVQRRRLTRRLWRGHTRLGLRRWRAGEFDAAAECLFQAAAGAEIDERRRRLARDLLVRTLEDLAGLSLEVIPQLLDDGDRAAALERAQRLLGHIRRARAEGVSTEDLTVSASRARQLLEHIEHTPVR
Ga0070717_1119229113300006028Corn, Switchgrass And Miscanthus RhizosphereVESALAAREAHPESVAFLVEVCSTATARELWRLRRALRRGAGDEAPLAGALETARVLLDSEPAAGLPAEARSRAGRRLWRGHTRLGLRRWRAGHFDQAVAALFQALAVPGLDERRRALARDLLVRTLEDMAAQRLDLIPQLLGDGDRPAALEQAQRLTAHVQRAREEGVTAESLAVAAARARQLLEHIEHTPVQ*
Ga0075417_1030144823300006049Populus RhizosphereFVAEEWRTRVAALIRGRECGEARELLESALERGELDQDTAGFLLDVCSTAVARDLWRLRRAVRRGGGDPAPLEGSLEVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR*
Ga0075026_10037731923300006057WatershedsAFLLEVCSTATARELWRLRRAVRRGAGDEAPLAGALETGRLLLDSESATAVPSEARGRAGRRLWRGHTRLGLRRWRAGEFDPAVEALFQALAVPGLDDRRRALARDLLARTLEDMAGQRLDLIPQLLGDGDRAAALEQAQRLAAHVGRAREEGIAAEDLAVAAARARQLLDHIEHTPVR*
Ga0075421_10124025213300006845Populus RhizospherePAEPEATAREGASALVAEEWRTRVAALIRGRECGEARELLESALERGELDQDTAGFLLDVCSTAVARDLWRLRRAVRRGGGDPAPLEGSLEVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGEPDRAAALGRAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIEQAPVR*
Ga0075431_10172091713300006847Populus RhizosphereDEARQLLEPALDRGEVDRETAAFLLDVCSTAVARDLWRLRRAVRRGGGGEAPLEASLQVSRVVLEASVVEALPPEQRGRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFQALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQPLGDHGRAAALDQAQRLLDHIRRTREEGVSTEDLAVAASRARQLLEHI
Ga0075435_10204075113300007076Populus RhizosphereESVAFLVEVCSTATARELWRLRRALRRGVGDEAPLAGALETARLLLDSEPAAGLPAEARSRAGRRLWRGHTRLGLRRWRAGHFDEAVAALFQALAVPGLDQRRRALGRDLLVRTLEDMAAQRLDLIPQLLGDGDRAAALEQAQRLTTHVQRAREEGVTAESLAVAAA
Ga0099828_1128705513300009089Vadose Zone SoilSTASRGAPPLETAAEDWRAGVATLIRRRELGEARRLIDPALASGAAGEKAVGFLLEVCSTAIARDLWRLRRGLPRGGGDDGPLAGSMETARVLLDSRPAAGLPPTLRLQVGRRLWRGHTRLGLRRWRAGEFDAAAETLFQALSVPGIDDRRRRLARDLLVRTLEDMAGQSLELIPQLLGDGDRAAALGRAQRLLTRIRRARDEGVSAEDLAVAAS
Ga0105243_1023456723300009148Miscanthus RhizosphereVTEEWRARVATLIRGRESAEARQLLDPALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLAVTRVMLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHTPVR*
Ga0105063_100604623300009804Groundwater SandRRELGEARRLIDPALASGAAGVETVGFLLEVCSTAIARDLWRLRRGLPRGGGDDAPLAGSMETARVLLDSRPATRLPASLRLRVGRRLWRGHTRLGLHRWRAGEFDAAVETLFQALSVPGIDDRRRRLARDLLVRTLEDMAGQSLEVIPQLLGDGDRAAALGRAQRLLTQIRRARDEEVSAEDLAVAASRARQLLEHIEQAPVR*
Ga0105081_103829523300009806Groundwater SandLRRGLPRGGGDDAPLAGSMETARVLLDSRPAAALPPPLRLRVGRRLWRGHTRLGLRRWRAGEFDAAVETLFQALSVPGIDDRRRRLARDLLVRTLEDMAGQSLEVIPQLLGDGDRAAALGRAQRLLTQIRRARDEEVSAEDLAVAASRARQLLEHIEQAPVR*
Ga0126372_1106380523300010360Tropical Forest SoilRAVRRSAGDEAVLVASMETTRVLLESRPAAALPAQQRRRTARRLWRGQARLGLRRWRAGDFDAAVETLFQALSVPGIDERRGRVARELLVRTLEDMAGQSLELIPQLLGEGDRAAALEQAQRLLVHIRRARDEAVSAEDLAVAASRARQLLEHIEQAPVR*
Ga0134124_1130625213300010397Terrestrial SoilRAVRRGGGDEGPLEASLAVTRVMLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHTPVR*
Ga0134122_1140629313300010400Terrestrial SoilTATARELWRLRRALRRGAGDEAPLGGALETTRLLLDSGPARGLPPEARDRAGRRLWRGQTRVGLRRWRAGEFDAAVEALFQALAVPGLDARRRALARDLLVRTLEDMAGQRLELIPQLLGDGDRAAALEQAQRLATHVGRARHEGIEAEDLAVAAARARQLLEHIEHTPAG*
Ga0134121_1001963963300010401Terrestrial SoilMKPDMAEYLLDVCSIASARELWRLRRATRRGTGDEAPLQGALAMTRLLLDSPPAAELREDARGRAGRRLWRGHARVGLRRWRAGDFDVAVDALFGALGVPGLDERRRTLARDLLVRTLEDMAGQRLELIPQLLGDGDRAAALEQARSLATHVARAREDGIAPEDLAVAAARARQLLENVEHDPVE*
Ga0137391_1083220013300011270Vadose Zone SoilPESVGFLLEVCSIATARELWRLRRALRRGVGDEASLGGALETTRLLLDSEPAAGLPSEARGRAGRRLWRGHTRLGLRRWRAGEFDDAVEALFKALAVPGLDDRRRALARDLLVRTLEDMAGQRLELIPQLLGDGDRPAALEQAQRLAAHVGRAREEGVSAEDLAVATARARQLLEHIEHTPAR*
Ga0137446_110657813300011419SoilTAVVRDLWRLRRALRRGGGGEASLEGSLQVTRVVLEASVAEALPREQRRRVSGRLWRGHTRLGLHRWRAGTFEPAVESLFRALGVRGLDERRCRLARDLLVRTLEDMAGQSLEVIPQLLGDHERAVALEQAQRLLGHIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR*
Ga0137432_123343513300011439SoilPPPELAARGAAAPIPAEGEASTRSAARATVTEEWRARVAALIRGRECGEARQLLEPALDRGDVDRETATFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGD
Ga0137445_108183113300012035SoilRQLLEPALDRGDVDRETATFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVVLEASVAEALSREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGLDERRCRLARDLLVRTLEDMAGQSLELIPQLLGDHERGAALDQAQRLLDHIRRTREEGVSAEDLAVAASRARRLLEHIGQAPVR*
Ga0137329_102622113300012133SoilAEGEVSARGAVRATVTEEWRARVAALIRGRECGEARQLLEPALDRDQVDRDTAAFLLDVCSTAVARDLWRLRRVLRGGGGGEAPLEGSLQVTRVVLEASVAEALPREQRRRVSGRLWRGHTRLGLHRWRAGTFEPAVESLFRALGVRGLDERRCRLARDLLVRTLEDMAGESLELIPQLLGDHERGAALDQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIGQSPVR*
Ga0137434_100123233300012225SoilAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPREQRQRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR*
Ga0137447_101636313300012226SoilCSTAVARDLWRLRRVLRGGGGGEAPLEGSLQVTRVMLEASVAEALPQEQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERGAALDQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIEHTPVR*
Ga0137397_1051790413300012685Vadose Zone SoilVTEEWRARVATLIRGRESAEARQLLDPALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLEVTRVVLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFATAVESLFRALGVRGIDERRHRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHALVR*
Ga0157295_1037047713300012906SoilRVAGHIRRRELGEARGLIDSALASREMTIGSAGVLLEACSTAIARDLSRLRRAARRGSGDEAPLGALMEASRAILESDPASGLAAEPRQQAGRRLWRGHTRLGLRRWRAGDFEAAVETLFGALSVPGAGERRRRLGRDLLVRTLEDAAGQSLELIPELRGDGDRAAAVERAQRV
Ga0137394_1063590213300012922Vadose Zone SoilVTEEWRARVATLIRGRESAEARQLLDPALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLEVTRVVLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFATAVESLFRALGVRGIDERRHRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHAPVR*
Ga0137404_1035722523300012929Vadose Zone SoilALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLEVTRVVLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFATAVESLFRALGVRGIDERRHRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHAPVR*
Ga0137404_1226320913300012929Vadose Zone SoilAGDQAPLEGSLEVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLEERDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR*
Ga0137407_1054078813300012930Vadose Zone SoilMGAAGPAEPEATAREGARALVAEEWRTRVAALIRGRECGEARELLESALERGELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDEAPLEGSLEVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDEHRRRLARDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR*
Ga0137407_1116092213300012930Vadose Zone SoilEAPAREGARVLVAEEWRTRVAALIRGRECGEARQLLESALERDELDRETAGFLLDVCSTAVARDLWRLRRAVRRGAGDQAPLEGSLEVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQAVGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLEDGDRAIALGQVQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR*
Ga0164303_1008984313300012957SoilARELWRLRRATRRGTGDEAPLQGALAMTRLLLDSPPAAELREDARGRAGRRLWRGHARVGLRRWRAGDFDVAVDALFGALGVPGLDERRRALARELLVRTLEDMAGQRLELIPQLLGDGDRAAALEQARSLATHVARAREDGIAPEDLAVAAARARQLLENIEHDPVE*
Ga0075302_107462513300014269Natural And Restored WetlandsVTQEWRSRVAALIRGRESGEARQLLEPALERGEVDRETARFLLDVCSTALARDLSRLRRAVRPGGGDEAPLEGSLEVTRLVLEASVAEALPREQRRRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRARAL
Ga0075352_103905313300014324Natural And Restored WetlandsPEPEAPTQEARATVTHEWRSRVAALIRGRESGEARQLLEAALERDEVDRETAGFLLDVCSTAVARDLSRLRRAVRPGGGDEAPLEGSLEVTRLALEASVAAALPREQRGRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRALALEQAQRLRDHIRRAREEGVSAEDLAVAASRARQLLDHIEQAPVR*
Ga0180080_106131423300014870SoilAVARDLWRLRRALRGGGGGEAPLEGSLQVTRVVLEASVAEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFLALGVRGLDERRYRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR*
Ga0180066_103457313300014873SoilPAEVDASARGAARATVTEEWRARVAALIRGRECGEARQLLEPALDRGEVDRETVAFLLDVCSTAVARDLWRLRRALRRGDGGEAPLEGSLQVTRVVLEASVSEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGLDERRCRLARDLLVRMLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR*
Ga0180104_123541813300014884SoilEGSARGAARATVTEEWRARVAALIRGRECGEARQLLEPALDRDQVDRDTAAFLLDVCSTAVARDLWRLRRTLRRGDGGEAPLEGSLQVTRVVLEAPVSEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDH
Ga0157379_1041851823300014968Switchgrass RhizosphereASARELWRLRRATRRGTGDEAPLQGALAMTRLLLDSPPAAELREDARGRAGRRLWRGHARVGLRRWRAGDFDVAVDALFGALGVPGLDERRRTLARDLLVRTLEDMAGQRLELIPQLLGDGDRAAALEQARSLATHVARAREDGIAPEDLAVAAARARQLLENVEHDPVE*
Ga0120098_104095623300015170FossillGEVDRETARFLVDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRAVLEASVGEALSREQRRRVSGRLWRGHTRLGLRRWRAGAFEPAVESLFQALGVRGIDERRRRLARDLVVRTLEDMAGQSLEVIPQLLGDRERAVALEQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIEQAPVR*
Ga0180070_106852113300015251SoilIRGRECGEARQLLEPALDRGDVDRETATFLLDVCSTAVARDLWRLRRTLRGGGGGEAPLEGSLQVTRVMLEASVAEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGESLEVIPQLLGEGDRAIALGQAQ
Ga0137403_1020019413300015264Vadose Zone SoilRGRESAEARQLLDPALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLEVTRVVLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFATAVESLFRALGVRGIDERRHRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHAPVR*
Ga0187825_1038310513300017930Freshwater SedimentESAVASGDMSPDRAAFLLEVCSTATARELWRLRRALRRGAGDEAPLGGALETARVLLDCAPAAGLPAEARGRMGRRLWRGHTRLGLRRWRAGDFDAAVGALFEALSVPGLDERRRALARELLVRTLEDMAGQRLELIPQPLGDGDRAAALEQAQRLLAHVRRAREEGIAAEDLAVAAAR
Ga0184605_1007059623300018027Groundwater SedimentWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0184615_1038324023300018059Groundwater SedimentFGGAPRRDPVAEDWREGVATLIRRREFGEARRLIEPALAGGAVSHDTVGFLLEACSTAIARDLWRLRRGLRRGGGDEVPLTGSMETARVLLDSRPAAGLPPTLRLRVARGLWRGHTRLGLRRWRAGEFDTAAGTLFQALSVPGIDDRRRRLARDLAVRTLEDMAGQSLELIPQLLGDRDRAAALERAQRLLTQIRRARDEGVSPENLAVAASRARQLLEHIEQAPVR
Ga0184609_1011442913300018076Groundwater SedimentPPELAAREAAAPIPAEGEASTRGAARATVTEEWRAQVAVLIRGRECGEARQLLEPALGRGEVDRDTAGFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVVLEASVAEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR
Ga0187774_1026721313300018089Tropical PeatlandDGTAFLLEVCSTAIARDLWRLRRGQRRGTGEEAPLIAAMDITQVILDSETAASVDAAQRERVARRLWRGQTRLGLRRWRAGDFEAAVDALFRALAVPGIGERRRRLARDLLVRTLEDMAGQSLELIPQLRGEGDRAAALEQAQRVLEHIRRARTEGLPAEDLAVAASRARQLLEQIEPTSVQ
Ga0190265_1003305653300018422SoilVARDLWRLRRALRRGGGDEAPLGGSLAATRLLLDAPVAQAFPREQRRRVSCRLWRGHTRLGLRRWRAGSFEPAVEVLFQAYGVHGIDDRRRRLARDLLVRALEDMAGQSLELIPQLLGDRAAALEQAERLLAHIHRAREQGVAAEDLAVAASRARQLLDHIEQAPVR
Ga0193723_100311013300019879SoilEPEASARSAAASPVEPEPPAPAPARTTVTEEWRARVATLIRGRESAEARQLLDPALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLEVTRVVLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFETAVASLFRALAVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVTLEQAQRLLDHIRRTREHGVSAEDLAVASSRARQLLDHIEHTPVR
Ga0193713_112043323300019882SoilVRRGGGDQAPLESSLEVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0193725_114359113300019883SoilLDRGDVDRETATFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPRDQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVSAEDLAVA
Ga0193743_123095413300019889SoilRATVTEEWRARVAALIRGRECGEARQLLEPALDRGDVDRETATFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLL
Ga0193728_122513913300019890SoilRPDSVGFLLEVCSTATARELWRLRRALRRGVGDETPLGGALETTRLLLDSEPAAELSPATRGRAGRRLWRGQTRLGLRRWRAGEFDAAVEALFQALAVPGLDDRRRALARDLLARTLEDMAGQRLELIPQLLGDGDRPAALEQAQRLAAHVGRAREEGIAAEDLAVAAARARQLLEHIEHTPVR
Ga0193755_106481113300020004SoilAEEWRTRVAALIRGRECGEARQLLESALERDELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLESSLEVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0193721_111891713300020018SoilTVSSPASPSPAPEAGARAMGAAGPAEPEDPAREGARAIVTVEWRTRVAALIRGRECGEARQLLESALERGELDRETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLE
Ga0193733_113499113300020022SoilEPEAAAREGSRALVAEEWRTRVAALIRGRECGEARQLLESALERDELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVILEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAVSRARQLLEH
Ga0193717_102893013300020060SoilPIVSEEWRARVAALIRGRECGEARQLLDPALARDEVDRDTAGFILDVCSTAVARDLWRLRRAVRRGGGDEAPLTSSLEMTRVVLAAPVALGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRRRLARDLLVRTLEDMAGENLEVIPQLLGEGDRAIARGQAQRLLEHIRRTRDEGVPAEDLAVAAARARQLLEHIEHTPVR
Ga0210381_1003953423300021078Groundwater SedimentLERDELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0210382_1040061713300021080Groundwater SedimentRQLLESALEWDELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALEVRGIDERRSRLARDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0210404_1029081013300021088SoilRRLIDPALAQGNMSTETAGFLVEVCSTAIARDLWRLRRALRRGGGDEAPLVASVENTHVLLDSRPAAALPAEQRLRVARRMWRGRTRLGLRRWRAGDFEAGVDALFPALAAPSIGDRRRRLVGDLLVRTLEDMAGQSLELIPQLLGDGERAAALEQAQRLLGHIRRARGHGISAEDLAVAASRARQLLEHIEQASVR
Ga0210377_1029646313300021090Groundwater SedimentPRELAAQEAAAPTPAEGEVSARGAVRATVTEEWRARVAALIRGRECGEARQLLEPALDRGDVDRETATFLLDVCSTAVVRDLWRLRRALRRGGGGEAPLEGSLQVTRVVLEASVAEALPREQRRRVSGRLWRGHTRLGLHRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDHERAVALEQAQRLLGHIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR
Ga0187846_1008353213300021476BiofilmWGEARQLIDPALAQGSMSAETASFLVEVCSTAIARDLWRLRRALRRGPTDEAPLIASVENTRVLLDSRPAATLPAEQRRRAARRMWRGYTRLGLRRWRAGSFDAAVDALFPALAIAEVGDRRHRLAGDLLVGTLEDMAGQSLELIPQLLGEGERAAALEQTQRLLVHIRRAREQGIQAEDLAVAASRARQLLEHIEQGSMR
Ga0126371_1148695623300021560Tropical Forest SoilALLDVCSSVVARDLWRLRRAVRRSAGDEAVLVASMETTRVLLESRPAAALPAQQRRHTARRLWRGQARLGLRRWRAGDFDAAVETLFQALSVPGIDERRGRVARELLVGTLEDMAGQRLELIPQLLGEGDRAAALEQAQRLLLHIRRARDEGVSPEDLAVAASRARQLLEHIEQAPVR
Ga0224452_121695513300022534Groundwater SedimentRGRECGEARQLLESALERDELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQL
Ga0222623_1013184423300022694Groundwater SedimentPELAAREAAAPIPAEGEASTWGAARATMTEEWRARVAALIRGRECGEARQLLEPALDRGDVDRETATFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR
Ga0222622_1080242713300022756Groundwater SedimentAPEAVARAMGAAGPAEPEATAREGARARVAEGWRTRVAALIRGRECGEARELLESALERGELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVILEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAV
Ga0247799_100717213300023072SoilELGEARGLIDSALASREMTIGSAGVLLEACSTAIARDLSRLRRAARRGSGDEAPLGALMEATRAILESDPASGLAAEPRQQAGRRLWRGHTRLGLRRWRAGDFEAAVETLFGALSVPGPGERRRRLGRDLLVRTLEDAAGQSLELIPELRGDGDRAAAVERAQRVLEHIRRARAAGVPAEDLAVAASRARQLLEHIEQTSVR
Ga0209640_1005132963300025324SoilEAEGSAGGGATATESEEWHARVAALIRGRECGEARQLLEPALDRGEVDRETVAFLLDVCSTAVARDLWRLRRALHRGGGGEAPLEGSLQVTRVVLEASVSEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMTGQSLELIPQLLGDHERAVALNQAQRLLDYIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR
Ga0210139_100832623300025558Natural And Restored WetlandsVTQEWRSRVAALIRGRESGEARQLLEPALERGEVDRETARFLLDVCSTALARDLSRLRRAVRPGGGDEAPLEGSLEVTRLVLEASVAEALPREQRRRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRARALEQAQRLRDHIRRTREEGVSGEDLAVAASRARQLLEHIEQAPVR
Ga0210073_111049213300025569Natural And Restored WetlandsAVTAVPPEPEAPPQEARATVTQEWRSRVAALIRGRESGEARQLLEPALERGEVDRETARFLLDVCSTALARDLSRLRRAVRPGGGDEAPLEGSLEVTRLVLEASVAEALPREQRRRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRARALEQT
Ga0207645_1003033133300025907Miscanthus RhizosphereMRRGIPGKRRGPPTAPMTGKKGLAPAGWRARVATLIRGRESAEARQLLDPALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLAVTRVMLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHTPV
Ga0207646_1008308513300025922Corn, Switchgrass And Miscanthus RhizosphereARRVVESALAAREAHPESVAFLVEVCSTATARELWRLRRALRRGAGDEAPLAGALETARVLLDSRPAAGLPAEARSRAGRRLWRGHTRLGLRRWRAGHFDQAVAALFQALAVPGLDERRRALARDLLVRTLEDMAAQRLDLIPQLLGDGDRPAALEQAQRLTAHVQRAREEGVTAESLAVAAARARQLLEHIEHTPVQ
Ga0207646_1085730423300025922Corn, Switchgrass And Miscanthus RhizosphereIDPALASGAVGPGTVEFLLEVCSTAVARDLWRLRRGLPRGGGDDGPLAGSMETTRVLLDSPPAAGLPPALRLQVGRRLWRGHTRLGLRRWRAGEFDAAAETLFQALSVPGIDDRRRRLARDLVVRTLEDMAGQSLELIPQLLGDGDRAAALGRAQRLLTRIRRARDEGVSAEELAVAASRARQLLEHIEHAPVR
Ga0207644_1033171723300025931Switchgrass RhizosphereVCSIASARELWRLRRATRRGTGDEAPLQGALAMTRLLLDSPPAAELREDARGRAGRRLWRGHARVGLRRWRAGDFDVAVDALFGALGVPGLDERRRALARELLVRTLEDMAGQRLELIPQLLGDGDRAAALEQARSLATHVARAREDGIAPEDLAVAAARARQLLENVEHDPVE
Ga0210071_102876613300025955Natural And Restored WetlandsALERDEVDRETAGFLLDVCSTAVARDLSRLRRAVRPGGGDEAPLEGSLEVTRLALEASVAAALPREQRGRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRALALEQAQRLRDHIRRAREEGVSAEDLAVAASRARQLLDHIEQAPVR
Ga0210089_104306723300025957Natural And Restored WetlandsPGGGDEAPLEGSLEVTRLALEASVAAALPREQRGRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRALALEQAQRLRDHIRRAREEGVSAEDLAVAASRARQLLDHIEQAPVR
Ga0210145_100242113300025973Natural And Restored WetlandsEVDRETAGFLLDVCSTAVARDLSRLRRAVRPGGGDEAPLEGSLEVTRLALEASVAAALPREQRGRVSGRVWRGQTRLGLRRWRAGTFETAVESLFRALGVRDIDERRRRLARDLLVRTLEDMAGQSLEVIPQLLGDRDRALALEQAQRLRDHIRRAREEGVSAEDLAVAASRARQLLDHIEQAPVR
Ga0207698_1224556013300026142Corn RhizosphereEMTIGSAGVLLEACSTAIARDLSRLRRAARRGSGDEAPLGALMEATRAILESDPASGLAAEPRQQAGRRLWRGHTRLGLRRWRAGDFEAAVETLFGALSVPGAGERRRRLGRDLLVRTLEDAAGQSLELIPELRGDGDRAAAVVRAQRVLEHIRRARAAGVPAEDLAVAASRARQLLEHIEQTSV
Ga0209438_112960613300026285Grasslands SoilMDAADPAESEAPAREGARVLVAEEWRTRVAALIRGRECGEARQLLESALERDELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLEVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQAVGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLEDGD
Ga0257171_103193613300026377SoilPIPAEGEASTRGAARATVTEEWRARVAALIRGRECGEARELLEPALDRGDVDRETATFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPRDQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR
Ga0257177_104473313300026480SoilRGRECGEARELLEPALDRGDVDRETATFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPRDQRRRVSGLLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVSAEDLAMAASRARQLLEHIGQASVR
Ga0257164_101055223300026497SoilDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPRDQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR
Ga0209886_103862123300027273Groundwater SandLERGEVDRETAAFLLDVCSTAVARDVWRLRRAVRRGGGDELPLEASLEVTRVVLEAAAAQGLPRDQRRREAGRLWRGHTRLGLRRWRAGTFEAAIESLFRALGVRGIDERRHRLARDLLVRTLEDMAGQSLELIPQLLDEGDRAIALEQAQRLLEHIRRTREEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0256866_114329813300027650SoilARDLWRLRRALGRGGGDEGPLEGSLEASRVLLEASVSEAVSRERRRRVSDRLWRGHTRLGVRRWRAGSFEASVESLFRALGVPGTDERRRRLARDLLVRTLEDMAGQSLELIPQLLGEGDRAIALGQAQRLLEHIRRAREEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0209971_100907523300027682Arabidopsis Thaliana RhizosphereAEMASEADGEPAEASRAEDPAWRERVAGHIRRRELGEARGLIDSALASREMTIGSAGVLLEACSTAIARDLSRLRRAARRGSGDEAPLGALMEATRAILESDPASGLAAEPRQQAGRRLWRGHTRLGLRRWRAGDFEAAVETLFGALSVPGPGERRRRLGRDLLVRTLEDAAGQSLELIPELRGDGDRAAAVERAQRVLEHIRRARAAGVPAEDLAVAASRARQLLEHIEQTSVR
(restricted) Ga0233416_1010150013300027799SedimentARETATEQWRDRVAALIQGRELGQARQLLEPAIAEGEVGPETTAFLLGIASAAMARDLSRLRRARRRGGGEEAPLAASLQLTGLLLGAPGAQALPPEQQRRLRGRLWRGHARLGVRRWRAGDFEAAVEALFQAFGIPELDERRRRLARDLLVRALEDMAGQSLELIPQLLGEGDRAAALEQAQRLLTHIHRAREEGLPAEDLAVAASRARQLLAHIEQAPVQ
Ga0209814_1022063523300027873Populus RhizosphereFVAEEWRTRVAALIRGRECGEARELLESALERGELDQDTAGFLLDVCSTAVARDLWRLRRAVRRGGGDPAPLEGSLEVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0209283_1015108123300027875Vadose Zone SoilFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPRDQRRRVSGLLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVSAEDLAVAASRARQLLEHIGQAPVR
Ga0209583_1015618213300027910WatershedsAAPLAAPETRPESVAFLLEVCSTATARELWRLRRAVRRGAGDEAPLAGALETGRLLLDSESATAVPSEARGRAGRRLWRGHTRLGLRRWRAGEFDPAVEALFQALAVPGLDDRRRALARDLLARTLEDMAGQRLDLIPQLLGDGDRAAALEQAQRLAAHVGRAREEGIAAEDLAVAAARARQLLDHIEHTPVR
(restricted) Ga0233417_1017920223300028043SedimentETTAFLLGIASAAMARDLSRLRRARRRGGGEEAPLAASLQLTGLLLGAPGAQALPPEQQRRLRGRLWRGHARLGVRRWRAGDFEAAVEALFQAFGIPELDERRRRLARDLLVRALEDMAGQSLELIPQLLGEGDRAAALEQAQRLLTHIHRAREEGLPAEDLAVAASRARQLLAHIEQAPVQ
Ga0268264_1013852813300028381Switchgrass RhizospherePEPKAPARSAASGPAEAEPPAPEPARATVTEEWRARVATLIRGRESAEARQLLDPALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLAVTRVMLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHTPVR
Ga0247828_1048579923300028587SoilARELWRLRRATRRGTGDEAPLEGALAMTRLLLDSPPAAELREDARGRAGRRLWRGHARVGLRRWRAGDFDVAVDALFGALGVPGLDERRRALARELLVRTLEDMAGQRLELIPQLLGDGDRAAALEQARSLATHVARAREDGIAPEDLAVAAARARQLLENIEHDPVE
Ga0307293_1011117813300028711SoilMGAVAPAEPEAAAREGSRALVAEEWRTRVAALIRGRECGEARQLLESALERDELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAVSRARQLLEHIEHTPVR
Ga0307301_1029110313300028719SoilFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALEVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0307290_1028922813300028791SoilWDELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0307504_1031872223300028792SoilRGLSRGGGDDGPLAGSMEMARALLDSRPAAGLPPTLRLQVGRRLWRGHTRLGLRRWRAGEFDAAAETLFQALSVPGIDDRRRRLARDLLVRTLEDMAGQSLELIPQLLGDGDRAAALGRAQRLLTRIHRARDEGVSAEDLAVAASRARQLLEHIEHAPVR
Ga0307504_1038734313300028792SoilESAVTIGDVSPDSAGFLLEVCSTATARELWRLRRALRRGAGDETPLGGALETARVLLDCEPAAGLPAKARGRVGRRLWRGHTRLGLRRWRAGDFDTAVGALFEALSVPGLDARRRALARDLLVRTLEDMAGQRLELIPQPLGDGDRAAALEQAQRLLAHVRRAREEGIAAEELAVAAARARP
Ga0307299_1030167913300028793SoilLERDELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0307305_1006726023300028807SoilDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVILEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAVSRARQLLEHIEHTPVR
Ga0247824_1044484913300028809SoilEQTSRMNPAVARDLWRLRRAVRRGGGDEGPLEASLAVTRVMLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHTPVR
Ga0247825_1091084713300028812SoilVRRGGGDEGPLEASLAVTRVMLEAPVTQGLPREQRRRGTGRLWRGHTRLGLRRWRAGTFETAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLEHIRRTRENGVSAEDLAVASSRARQLLDHIEHTPVR
Ga0307302_1021512823300028814SoilAEWRTRVAALIRGRECGEARELLESALERGELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALEVRGIDERRSRLTRDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0307296_1047642713300028819SoilVAEEWRTRVAALIRGRECGEARELLESALERGELDQETAGFLLDVCSTAVARDLWRLRRAVRRGGGDQAPLEGSLAVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGTDERRRRLARDLLVRTLEDMAGQSLELIPQLVGEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0307278_1006885513300028878SoilRLRRAVRRGGGDEAPLEGSLEVTRVMLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALGVRGIDEHRRRLARDLLVRTLEDMAGQSLELIPQLLEEGDRAIALGQAQRLLEHIRRTRDEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0299907_1116836413300030006SoilAEWRDRVAALIRGRESGEARQLLASALERGEVDRETAGFLLEVCSTAVARDLWRLRRALGRGGGDEGPLEGSLEASQVLLEASVSEAVSRERRRRVSDRLWRGHTRLGVRRWRAGSFEASVESLFRALGVPGTDERRRRLARDLLVRTLEDMAGQSLELIPQLLGEGDRAIALGQAQRLLEHIR
(restricted) Ga0255310_1008613223300031197Sandy SoilAGFLLDVCSTAVARDLWRLRRAVRRGGGDETPLAGSLEVTRVVLEAPVAQGLPREQRRRVTGRLWRGHTRLGLRRWRAGTFEPAVESLFQALAVRGIDERRGRLSRDLLVRTLEDMAGESLELIPQLLGESDRAAALGQAQRLLEHIRRTRDAGVSAEDLAVAASRARQLLEHIEHTPVR
(restricted) Ga0255310_1014837213300031197Sandy SoilPDGAGFLLEVCSTATARELWRLRRALRRGAGDEAPLGGALETARALLDCAPASELPAEARGRVGRRLWRGHTRLGLRRWRAGDFDPAVGALFEALSVPGLDDRRRALARDLLVRTLEDMAGQRLELIPQPLGDGDRAAALEQVQRLLAHVRRAREEGIAAEDLAVAAARAHQLLEHIEQTPVR
Ga0299913_1024369233300031229SoilEAGAPVLAAPEAPAPDGARATVTAEWRDRVAALIRGRESGEARQLLAPALERGEVDRETAGFLLEVCSTAVARDLWRLRRALGRGGGDEGPLEGSLEASQVLLEASASEAVSRERRRRVSDRLWRGHTRLGVRRWRAGSFEASMESLFRALGVPGTDERRRRLARELLVRTLEDMAGQSLELIPQLLGEGDRAIALGQAQRLLEHIRRAREEGVSAEDLAVAASRARQLLEHIEHTPVR
Ga0307505_1059429613300031455SoilEWRARVAALIRGRECEEARQLLDPALERDEVDRETAGFLLDVCSTAVARDLWRLRRAVRRGGGDEAPLASSLQVTGVVLEASVAQDLPREQRRRVTGRLWRGHTRLGLRRWRAGTFETAVESLFQALAVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLL
Ga0247727_1073365013300031576BiofilmELLVATAGEPSTASGGAPPLDTAAEDWRAGVATLIRRRELGEARRLIDPALASGAAGVEAVGFLLEVCSTAIARDLWRLRQGLPRGGGDDAPLAGSMETARVLLDSRPAARLPASLRLRVGRRLWRGHTRLGLHRWRAGEFDAAVEAVFQALSVPGIDDRRRRLARDLLVRTLEDMAGQSLEVIPQLLGDGDRAAALGRAQRLLTQIRRARDEEVSAEDLAVAASRARQLLEHIEQAPV
Ga0310813_1160359523300031716SoilRSGASARELWRLRRAQRRGAGDEAPLGGALETARALLDCEPAMGLPAEVRNRMGRRLWRGQTRLGLRRWREGDFDSAVVVLFEALAVPGLGDRRRALARDLLVRSLEDMAGQRLVLIPQPLGDGDRAAALEQAQRLLAHVRRAREEGIAAEDLAVAAARARQLLEHLEQTPVR
Ga0307469_1117152813300031720Hardwood Forest SoilVEPVEPALEMAEAVPEPEPAGAASAEDLAWRERVAGHIRRRELGEARGLIDRALASGDMTVGSAELLLEVCSTAIARDLWRLRRAARRGTGDETPLGTLMDTTRVILESEPASAVVAEQRQQAGRRLWRGHTRLGLRRWRSGDFEAAVETLSGALSVPGLDERRRRLARDLLVRTLEDAAGLSLELIPQLRGDGDRAAAV
Ga0307468_10019909723300031740Hardwood Forest SoilALERGEVDRETAGFILDVCSTAVARDLWRLRRAVRRGGGDEGPLEASLEVTRVVLEAPVTQGLPREQRRRGTGRLWRGHTRIGLRRWRAGTFETAVESLFRALAVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGESDRAVALEQAQRLLDHIRRTRENGVSAEDLAVASSRARQLLDHIEHTPVR
Ga0307475_1045260713300031754Hardwood Forest SoilAQGNMSTETAGFLVEVCSTAIARDLWRLRRALRRGGGDEAPLVASVENTHVLLDSRPAAALPAEQRLRVARRMWRGRTRLGLRRWRAGDFEAGVDALFPALAAPSIGDRRRRLVGDLLVRTLEDMAGQSLELIPQLLGDGERAAALEQAQRLLGHIRRARGHGISAEDLAVAASRARQLLEHIEQASVR
Ga0214473_10003201233300031949SoilLWRLRRGLPRGGGDDAPLAGSMETARVLLDSRPAAGLPPALRLRVGRRLWRGHTRLGLRRWRAGEFVAAVEALFQALSVPGIDDRRRRLARDLLVRTLEDMAGQSLELIPQLLGEGDRAAALGRAQRLLTQIRRARDEGVSAEDLAVAASRARQLLEHIEHAPAR
Ga0307471_10394348013300032180Hardwood Forest SoilGFLLEVCSTATARELWRLRRALRRGAGDETPLGGALETARLLLDSEPAAGLSSAARGRAGRRLWRGQTRLGLRRWRAGEFDPAVEALFQALAVPGLDDRRRALARDLLVRTLEDMAGQRLELIPQLLGDGDRLAALGQAQRLAAHVARAREEGIAAEDLAVAAARARQLLEHIE
Ga0335069_1203226013300032893SoilRRREFGEARALIDPALAAGAVTVDGAEFLLEVCSTAIARDLWRLRRALRRGAGDETALSAAMETTRLILESAPAANLAAGSRQKIGRRLWRGHTRLGLRRWRAGDFEAAVETLFQALSVPGIDERRRRLARDLLVRTLEDMAGQSLELIPQLRGEGDRAAALEQAQRVLEHIRRARADGIVAEDLAVAASRVRQLLEHIET
Ga0326729_100188043300033432Peat SoilVVESAVATGDVSPDSAGFLLEVCSTATARELWRLRRALRRGAGDETPLGGALETARVLLDCEPATGLPAEARGRVGRRLWRGHTRLGLRRWRAGDFDASVGALFEALGVPGLDDRRRALARDLLVRTLEDMAGQRLELIPQPLGDGDRAAALEQAQRLLAHVRRAREEGVAAEDLAVAAARARQLLEHIEQTPVR
Ga0310811_1049909813300033475SoilDSAGFLLEVCSTATARELWRLRRAQRRGAGDEAPLGGALETARALLDCEPAMGLPAEVRNRMGRRLWRGHTRLGLRRWREGDFDSAVVVLFEALAVPGLGDRRRALARDLLVRTLEDMAGQRLELIPQPLGDGDRAAALEQAQRLLAHVRRAREEGIAAEDLAVAAARARQLLEHLEQTPVR
Ga0316628_10102385513300033513SoilATARELWRLRRALRRGAGDETPLGGALEVARLLLDCGSAAGLPAEARGRAGRRLWRGHTRLGLRRWRAGDFDPAVGALFQALAVPGLDDRRRALARDLLVRTLEDMAGQRLELIPQPLGDGDRTAALEQAQRLLAHVRRAREEGIAAEDLAVAAARARQLLEHIEHSSVR
Ga0364926_108821_39_5753300033812SedimentVTEEWHARVAALIRGRECGEARQLLEPALDRGDVDRETATFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGIDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQ
Ga0364934_0241122_2_6763300034178SedimentMSPPPELAAREAAAPIPAEGEASTRGAARATVTEEWRARVAALIRGRECGEARQLLEPALDRGDVDRETATFLLDVCSTAVARDLWRLRRALRRGGGGEAPLEGSLQVTRVMLEASVAEALPREQRRRVSGRLWRGHTRLGLRRWRAGTFEPAVESLFRALGVRGLDERRRRLARDLLVRTLEDMAGQSLELIPQLLGDHERAVALDQAQRLLDHIRRTREEGVS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.