NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F052854

Metagenome / Metatranscriptome Family F052854

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F052854
Family Type Metagenome / Metatranscriptome
Number of Sequences 142
Average Sequence Length 209 residues
Representative Sequence MFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAVIGWQDRGNL
Number of Associated Samples 106
Number of Associated Scaffolds 142

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 60.78 %
% of genes near scaffold ends (potentially truncated) 42.25 %
% of genes from short scaffolds (< 2000 bps) 61.97 %
Associated GOLD sequencing projects 102
AlphaFold2 3D model prediction Yes
3D model pTM-score0.83

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (70.423 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil
(40.845 % of family members)
Environment Ontology (ENVO) Unclassified
(42.254 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(57.746 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 14.68%    β-sheet: 34.52%    Coil/Unstructured: 50.79%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.83
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
b.1.26.0: automated matchesd2ciob_2cio0.67304
b.3.5.1: Cna protein B-type domaind1ti6b11ti60.63728
b.3.2.1: Carboxypeptidase regulatory domaind1h8la11h8l0.61952
d.15.7.1: Immunoglobulin-binding domainsd1hz6a11hz60.61812
b.3.5.0: automated matchesd3rkpa33rkp0.59916


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 142 Family Scaffolds
PF00873ACR_tran 6.34
PF00158Sigma54_activat 1.41
PF00072Response_reg 0.70
PF04392ABC_sub_bind 0.70
PF01609DDE_Tnp_1 0.70
PF00296Bac_luciferase 0.70
PF14684Tricorn_C1 0.70
PF02321OEP 0.70
PF01569PAP2 0.70
PF04185Phosphoesterase 0.70
PF01850PIN 0.70
PF01243Putative_PNPOx 0.70

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 142 Family Scaffolds
COG1538Outer membrane protein TolCCell wall/membrane/envelope biogenesis [M] 1.41
COG5433Predicted transposase YbfD/YdcC associated with H repeatsMobilome: prophages, transposons [X] 0.70
COG5659SRSO17 transposaseMobilome: prophages, transposons [X] 0.70
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 0.70
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 0.70
COG3039Transposase and inactivated derivatives, IS5 familyMobilome: prophages, transposons [X] 0.70
COG3293TransposaseMobilome: prophages, transposons [X] 0.70
COG3385IS4 transposase InsGMobilome: prophages, transposons [X] 0.70
COG3511Phospholipase CCell wall/membrane/envelope biogenesis [M] 0.70
COG5421TransposaseMobilome: prophages, transposons [X] 0.70


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms70.42 %
UnclassifiedrootN/A29.58 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300004463|Ga0063356_101627503All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium962Open in IMG/M
3300005937|Ga0081455_10304898All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1141Open in IMG/M
3300005981|Ga0081538_10018653All Organisms → cellular organisms → Bacteria5197Open in IMG/M
3300005981|Ga0081538_10021743All Organisms → cellular organisms → Bacteria4669Open in IMG/M
3300005981|Ga0081538_10024395All Organisms → cellular organisms → Bacteria4308Open in IMG/M
3300005981|Ga0081538_10074178All Organisms → cellular organisms → Bacteria1853Open in IMG/M
3300005981|Ga0081538_10111557All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1341Open in IMG/M
3300005981|Ga0081538_10215058All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium770Open in IMG/M
3300006169|Ga0082029_1339670All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1168Open in IMG/M
3300006844|Ga0075428_100297075All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1737Open in IMG/M
3300009094|Ga0111539_10156390All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium2668Open in IMG/M
3300009156|Ga0111538_10278815All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium2116Open in IMG/M
3300009157|Ga0105092_10136928All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1356Open in IMG/M
3300009168|Ga0105104_10209850All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1059Open in IMG/M
3300010041|Ga0126312_10196532All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1409Open in IMG/M
3300010043|Ga0126380_10520528All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium918Open in IMG/M
3300010046|Ga0126384_10722607All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium884Open in IMG/M
3300010047|Ga0126382_10102309All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1854Open in IMG/M
3300010047|Ga0126382_10842911All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium786Open in IMG/M
3300010081|Ga0127457_1042716All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium798Open in IMG/M
3300010084|Ga0127461_1057679All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium669Open in IMG/M
3300010086|Ga0127496_1060024All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1106Open in IMG/M
3300010089|Ga0127454_1034019All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1095Open in IMG/M
3300010091|Ga0127485_1100150All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium929Open in IMG/M
3300010093|Ga0127490_1040101All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium761Open in IMG/M
3300010097|Ga0127501_1004094All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium783Open in IMG/M
3300010097|Ga0127501_1103721All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium550Open in IMG/M
3300010103|Ga0127500_1086375All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium825Open in IMG/M
3300010103|Ga0127500_1087288All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1017Open in IMG/M
3300010111|Ga0127491_1047000All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1067Open in IMG/M
3300010112|Ga0127458_1028799All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium917Open in IMG/M
3300010113|Ga0127444_1019221All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium776Open in IMG/M
3300010114|Ga0127460_1136336All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1227Open in IMG/M
3300010115|Ga0127495_1159215All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium819Open in IMG/M
3300010119|Ga0127452_1153588All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1079Open in IMG/M
3300010120|Ga0127451_1138950All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium758Open in IMG/M
3300010124|Ga0127498_1002224Not Available848Open in IMG/M
3300010130|Ga0127493_1112025All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium995Open in IMG/M
3300010132|Ga0127455_1025902All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1058Open in IMG/M
3300010133|Ga0127459_1032103All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium656Open in IMG/M
3300010134|Ga0127484_1079761All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium590Open in IMG/M
3300010136|Ga0127447_1127439All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1213Open in IMG/M
3300010141|Ga0127499_1218807All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium839Open in IMG/M
3300010142|Ga0127483_1197728All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1070Open in IMG/M
3300010145|Ga0126321_1161638All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1003Open in IMG/M
3300010147|Ga0126319_1528740All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium741Open in IMG/M
3300010154|Ga0127503_10149979All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium933Open in IMG/M
3300010362|Ga0126377_10376258All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1424Open in IMG/M
3300010376|Ga0126381_104188700All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium559Open in IMG/M
3300010398|Ga0126383_10989389All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium929Open in IMG/M
3300012204|Ga0137374_10507058All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium936Open in IMG/M
3300012206|Ga0137380_11032606All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium702Open in IMG/M
3300012212|Ga0150985_107737859All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1180Open in IMG/M
3300012355|Ga0137369_10328462All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1124Open in IMG/M
3300012356|Ga0137371_11206523All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium564Open in IMG/M
3300012358|Ga0137368_10055711All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium3314Open in IMG/M
3300012373|Ga0134042_1209376All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium627Open in IMG/M
3300012379|Ga0134058_1180122All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium832Open in IMG/M
3300012380|Ga0134047_1149399All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium939Open in IMG/M
3300012393|Ga0134052_1125146All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium657Open in IMG/M
3300012393|Ga0134052_1306779All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium813Open in IMG/M
3300012395|Ga0134044_1154632All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium690Open in IMG/M
3300012397|Ga0134056_1001070All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1264Open in IMG/M
3300012398|Ga0134051_1241694All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium927Open in IMG/M
3300012399|Ga0134061_1032135All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium840Open in IMG/M
3300012399|Ga0134061_1276901All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium998Open in IMG/M
3300012401|Ga0134055_1156477All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium927Open in IMG/M
3300012402|Ga0134059_1087035All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium819Open in IMG/M
3300012403|Ga0134049_1242870All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1025Open in IMG/M
3300012405|Ga0134041_1077306All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium673Open in IMG/M
3300012406|Ga0134053_1371702All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium597Open in IMG/M
3300012407|Ga0134050_1185572All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1077Open in IMG/M
3300012409|Ga0134045_1189482All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1015Open in IMG/M
3300012469|Ga0150984_107293566All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium818Open in IMG/M
3300012469|Ga0150984_109567523All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium681Open in IMG/M
3300012469|Ga0150984_122371854All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1238Open in IMG/M
3300012922|Ga0137394_10002293All Organisms → cellular organisms → Bacteria14136Open in IMG/M
3300012923|Ga0137359_10814204All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium809Open in IMG/M
3300012929|Ga0137404_10006187All Organisms → cellular organisms → Bacteria8112Open in IMG/M
3300012929|Ga0137404_10009490All Organisms → cellular organisms → Bacteria6699Open in IMG/M
3300012929|Ga0137404_10086930All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Solibacter2491Open in IMG/M
3300012929|Ga0137404_10464823All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1124Open in IMG/M
3300012930|Ga0137407_10114436All Organisms → cellular organisms → Bacteria2333Open in IMG/M
3300012930|Ga0137407_10118497All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium2296Open in IMG/M
3300015053|Ga0137405_1073589All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium2448Open in IMG/M
3300017997|Ga0184610_1094596All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium941Open in IMG/M
3300018063|Ga0184637_10519566All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium689Open in IMG/M
3300018422|Ga0190265_10678895All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1151Open in IMG/M
3300018465|Ga0190269_10285799All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium952Open in IMG/M
3300018466|Ga0190268_10295424All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium968Open in IMG/M
3300028878|Ga0307278_10036672Not Available2234Open in IMG/M
3300028878|Ga0307278_10050583All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1883Open in IMG/M
3300030006|Ga0299907_10213583All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1590Open in IMG/M
3300030620|Ga0302046_10950009All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium686Open in IMG/M
3300030903|Ga0308206_1121366All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium605Open in IMG/M
3300031092|Ga0308204_10188363All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium636Open in IMG/M
3300031092|Ga0308204_10212058All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium609Open in IMG/M
3300031548|Ga0307408_100478396All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1086Open in IMG/M
3300031740|Ga0307468_100813119All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium799Open in IMG/M
3300031901|Ga0307406_10412711All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1073Open in IMG/M
3300032180|Ga0307471_101007645All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium1000Open in IMG/M
3300032205|Ga0307472_101142842All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium740Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil40.85%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil14.79%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil9.15%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil8.45%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Unclassified → Tabebuia Heterophylla Rhizosphere6.34%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.52%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment2.82%
SoilEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil2.11%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.11%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere2.11%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere2.11%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.41%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil1.41%
Termite NestEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Termite Nest0.70%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.70%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere0.70%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.70%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300003373Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S5T2R1Host-AssociatedOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300005981Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S5T2R1Host-AssociatedOpen in IMG/M
3300006169Termite nest microbial communities from Madurai, IndiaEnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009157Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009168Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015EnvironmentalOpen in IMG/M
3300010038Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot106EnvironmentalOpen in IMG/M
3300010041Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot104AEnvironmentalOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010081Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010084Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010086Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010089Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010091Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010093Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010097Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010103Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010104Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010109Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010111Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010112Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010113Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010114Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010115Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010117Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010119Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_0_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010120Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010122Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010124Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010127Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010128Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010130Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010132Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010133Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010134Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010136Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010140Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010141Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010142Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010145Soil microbial communities from Hawaii, USA to study soil gas exchange rates - KP-HI-INT metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010147Soil microbial communities from California, USA to study soil gas exchange rates - BB-CA-RED metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010154Soil microbial communities from Willow Creek, Wisconsin, USA - WC-WI-TBF metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010905Grasslands soil microbial communities from Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012373Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_0_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012379Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012380Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012384Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012392Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012393Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012395Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012397Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012398Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012399Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012400Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012401Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012402Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012403Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012405Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012406Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012407Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012409Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012410Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018465Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 ISEnvironmentalOpen in IMG/M
3300018466Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 TEnvironmentalOpen in IMG/M
3300027743Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015 (SPAdes)EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300030903Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_369 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031092Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_367 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031548Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-3Host-AssociatedOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031901Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-2Host-AssociatedOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25407J50210_1017133313300003373Tabebuia Heterophylla RhizosphereIWLMALPAWAVEYRLQVTNIGFLNFSSYMGKASPWWAQNEPMGRLEARLDAQQFSPAAVLPGREVRLLEDPAYGGTVPARVSLLPATGRQAWATYVFDGNPGDTIAFVVRSDMAAWQEVWFVAANPGGTLRRLSMGGPGIFGHSSREVPEVSQDFLANAVDRGTFPQYVAQRAKAVDG
Ga0063356_10162750313300004463Arabidopsis Thaliana RhizosphereMALPAWAVEYRLEVTHVDALTFSSYKGKATPWWAQNEPLERLEARVDTQQLSPAAVLPGREIQLLEDPAYGGTVPTHVSQLPATRQQAWTTYIFDGRPGDTVAFMVRSDMAAWQEVWNVAANPGGQPRRLSMVSPGLFGRFWQEAPEVSQLYLANAVDRGTFPQYVAQHAKAVTGMSFVMGQGYNRLYAPDRVYMLIKLPPEPH
Ga0081455_1030489813300005937Tabebuia Heterophylla RhizosphereMKARTLVGGMFALLWLTALPAWATEYRLQVANIDDRTFSSYEGKAPSFWSQKEPMGRLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTNVSLLPATRYQDWSTYVFDANPGDTVAFEVTTDMAGWEEAMDVAADDHGTLRRLSIGGPGIFGGARGGPQGSQYFLANAVDHETFPQYVTQHAKAVDGMSVVVGQGTQPDYPDWVYVVLKLPPEPHTFQAVVGWKDHKGDRFDKKM*
Ga0081538_1001669113300005981Tabebuia Heterophylla RhizosphereLIWLMALPAWAVEYRLQVTNVDFLTFSSYMGKATPWWAQNEPLGRLEARLDAQQLSPAAVIPGREVHLLEDPAYGGTVPARVSYLPATGRQAWTTYVFDANPGDTVAFVVRSDMAAWQEVWSVAANPGGTLRRLSMAGPGLVGHFWQEVPE
Ga0081538_1001865383300005981Tabebuia Heterophylla RhizosphereLKSRTLVTGLFALLWFMALPAWAVEYRLQVSNVDFLNFSAHMGKATPWWAQNEPMGRLEARLDQQQFSPAAVLPGREVQLLEDPAYGGTVPTRVSLLPATGRQAWTTYVFDANPGDTVAFVVRSDMAAWQEVWFVGANPGGTLRRLSMAGPGLFGRFWQEVPEVSQDFLADAVDRGTFPQYVAQRAKAVDGMSLVVGQGHDTFYDPDRLYVLIKLPPEPHTFKVVIGWRDHDDRGTG*
Ga0081538_1002174363300005981Tabebuia Heterophylla RhizosphereMELIRRTLVGWIFTLIWLMALPAWAVEYRLQVTNIGFLNFSSYMGKASPWWAQNEPMGRLEARLDAQQFSPAAVLPGREVRLLEDPAYGGTVPARVSLLPATGRQAWTTYVFDGNPGDTVAFVVRSDMAAWQEVWFVAANPGGALRRLSMGAPGIFGHSSREVPEVSQDFLANAVDRGTFPQYVAQRAKAVEGMSLLVGEGHDTFYDADRVYVLLTLPPEPHTFKVVVGWRDHHNRGNDD
Ga0081538_1002439533300005981Tabebuia Heterophylla RhizosphereVNGRTRVGWLVAFIWLMAWPAWAVEYRLQVTNVDFRTFSSYMGKASPWWAQNEPMGRLETRVDAQQFSPAAVIPGREVQLLEDPAYGGTVPRRVSLLPATGRQAWTTYVFDANPGDPVVFVVRSYMAAWQEVWFVAANPGGTLRRLSMAGPGIFGRFWQEVPQVSQLLLANAVDRGTLPRWVAQRAKTVDGMAFVVGKGYDRYYAPDRLYVRTTLPSAPHTFKVVIGWRDRNNRGAD*
Ga0081538_1007417813300005981Tabebuia Heterophylla RhizosphereMKARTLVGGMFALLWLTGVPARATEYRLQVANIDDQVFASYEGKGGSFWSQKEPMGRLEARLDQNKFSGAAILPGHHVELLEDPAYGGTTPTKVSLLPATRHQDWSTYVFDANPGDTVAFVVRTDMYAWQQVWDVAADDHGTLRRLSIGGPGFFGGSREVPQVSQDFLANAADRGTFPQYVAQHAKAVDGMSFVVGQGDNPSNDPDRLYVILKLPPEPHTFQAVVGWKDNNGNRMTRRGGNN*
Ga0081538_1011155713300005981Tabebuia Heterophylla RhizosphereMKSRTLVGGIFALLWLMALPAWAAEYRLQVANIDDRLFSSYEGNGTSWWRQNEPMGRMEARLDQQKLSPAAILPGHHVELLEDPAYGGIAPTRVSLLPATRHQDWTTYVFDANPGDTVAFVVRTDVYGWQEVMDVAANPDGSFRRLSIGGPGIFGSSSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPYNDPDRVYIVLKLPPQPHTFQTVIGWQDHGNRINPRAGRF*
Ga0081538_1021505813300005981Tabebuia Heterophylla RhizosphereSYEGKGSSFWSQKEPMGRLEARLDQQKFSPAAILPGHHVELLEDPGYGGTTPTRVSLLPATGHQDWSTYVFDANPGDTVAFEVTTDMAGWEEAMDVAADQNGTLRRLSIGGPGIFGGSREVPQVSQDFLATAVDRGTFPQWVAQHAKAADGMSFVVGQGDNSVYDPDRVYIVLKLPPQPHTFKAVIGWNDRGNRMGQSAGR*
Ga0081538_1024476713300005981Tabebuia Heterophylla RhizosphereMSGARRSDLLWHEASAGLCLSAMPLRAHNYGRSIDLKGRTLVGWLFAFIWLMALPAWAMEYRLQVTNIGFLNFSAHMGKATPWWAQNEPMGRLEARLDAQQFSPAAVIPGREVQLLQDPAYGGTVPARVSYLPATGRQAWTTYVFDGNPGDTVAFVVSSYMAAWQEVWFVAANPGGALRRLSLGGPGIFGRSSREVP
Ga0082029_133967013300006169Termite NestMALPAWAVEYRLQVTNVDALTFSSYMGRATPWWRQNEPMGRLEARLDAQEFSPAAVLPGREVHLLEDPAYGGKVPGRVSRLPATGTQAWTTYVFDGEPGDSVAFIVRSDMAAWQEIWDVAANPGGVLRRLSIGGPAMFGRAWQEVPEASQAFLANAVDRGTFPQWMARRAKAIDGMSFVVGQGHDTVYDPDRLYVLLTLPPQPHTFKVVVGWRDHDDRGSG*
Ga0075428_10029707523300006844Populus RhizosphereVLNSRTFVGWILGLIWLMALPAWAVEYRLEVTNVDALTFSSYEGKATSWWAQNKPLGLLEARLDTQQFSPVAVLPGREVQLLEDPAYGGTVPAHVSQLPSTRQQAWTTYIFDGRPGDIVAFVVRSDMAAWQEVWDVAANPGGQLRRLSMASPGLFGRFWQEVPEVSQLYLAHAIDRGTFPQYVAQHAKAVDGMSFVVGQGYNRLYAPDRVYMLIKLPPEPLTFKVVIGWRDHDNRGDG*
Ga0111539_1015639013300009094Populus RhizosphereMALPAWAVEYRLEVTNVDALTFSSYEGKATSWWAQNKPLGLLEARLDTQQFSPVAVLPGREVQLLEDPAYGGTVPAHVSPLPSTRQQAWTTYIFDGRPGDTVAFVVRSDMAAWQEVWDVAANPGGQLRRLSMASPGLFGRFWQEVPEVSQLYLAHAIDRGTFPQYVAQHAKAVDGMSFVVGQGYNRLYAPDRVYMLIKLPPEPLTFKVVIGWRDHDNRGDG*
Ga0111538_1027881513300009156Populus RhizosphereMALPAWAVEYRLEVTNVDALTFSSYEGKATSWWAQNKPLGLLEARLDTQQFSPAAVLPGREVQLLEDPAYGGTVPAHVSQLPSTRQQAWTTYIFDGRPGDTVAFVVRSDMAAWQEVWDVAANPGGQLRRLSMASPGLFGRFWQEVPEVSQLYLAHAIDRGTFPQYVAQHAKAVDGMSFVVGQGYNRLYAPDRVYMLIKLPPEPLTFKVVIGWRDHDNRGDG*
Ga0105092_1013692813300009157Freshwater SedimentRTLVGWIFAFIWLMALPAWAVEYRLQVTNVGFLNFSSYMGKATPWWAQNEPMGRLEARLDAQQFSPAAVLPGREVQLLEDPAYGGTVPARISRLPATRQQAWTTYVFEGKPGDTVAFVVRSDMAAWQEVWFVAANPGGTLRRLSMGGPGIFGHSSREVPQVSQDFLANAVDRGTFPQYVAQHAKPVDGMSLVVGEGHNTFYDPDRVYVLLKLPPDPHTFKVVVGWRDHDNRGNDD*
Ga0105104_1009676723300009168Freshwater SedimentMALPAWAVEYRLQVTNVDFLTFSSYMGQDTPWWGQHEPMGRLEARLDAQQFAPAAVVPGREVHLLEDPAYEGTVPARVSQFPAAGRQAWTTYVFDANPGDTVAFVVRSAMAAWQEVWFVAANPGGTLRRLSMAGPGIFGRFWQEVLEVSQDFLAHAVDRGTLPQ*
Ga0105104_1020985013300009168Freshwater SedimentFLNFSSYMGKATPWWAQNEPMGRLEARLDAQQFSPAAVLPGREVQLLEDPAYGGTVPARISRLPATRQQAWTTYVFEGKPGDTVAFVVRSDMAAWQEVWFVAANPGGTLRRLSMGGPGIFGHSSREVPQVSQDFLANAVDRGTFPQYVAQHAKPVDGMSLVVGEGHNTFYDPDRVYVLLTLPPDPHTFKVVVGWRDHDNRGNDD*
Ga0126315_1055076523300010038Serpentine SoilMALPAWAAEYRLQVTNIGFLNFSSYMGRATPWWAQNESMGRLEGRLEAQQFTPAAVIPGREVQLLEDSPYGGTVPARISLLPATRQQAWTAYVFDGRPGDTVPFVVRSDMAAWQEIWDVASNPGGTLRRLSMAGPGIFGHFWQEVPEVSQDFLAN
Ga0126312_1019653213300010041Serpentine SoilMALPAWAVEYRLQVTNVGFLNFSSYMWKATPWWAQNEPMGRLEARLDAQQFSPAADIPGREVQLLEDPTYGGTVPARVSLLPATRQQAWTTYVFDGKPGDTVAFVVRSDMAAWQEIWQVAANPGGQLRRLSMAGPGIFGRFWQEVPEVSQLFLANAVDRGTFPQYVAQHAKAVDGMSLVVGQGYDTFYAPDRVYILIKLPPEPHTFKVVIGWRDHDNRGNE*
Ga0126380_1052052813300010043Tropical Forest SoilMKSRTLVGGIFALLWLTALPAWAVEYRLQVANIDDEIFASYEGKAPSFWSMKEPMGRLEARLDDNQFSRAAILPGHRVELVQDPAYGGITPTKVARLPATRHQDWTTYVFDANPGDTVVFVVRPDIAAWEDAEDVASDVKCTFRRLSIGGPGFFGGSREVPQVDQEFLANAADRGTFPQWVAEHAKALDGMAVVVGQGENTQAYPDRVYLVLKLPAQPH
Ga0126380_1124131713300010043Tropical Forest SoilMELIRRTLIGWIFASIWLMALPAWAVEYRLQVTNIGFLNFSSYMGKATSWWAQNEPMGRLEARLDAQQFSPAAVIPGREVQLLQDPAYGGTVPARVSYLPATGRQAWTTYVFDGNPGDTVAFVVRSYMAAWQEVWFVAANPGGALRRLSMGGPSIFGRSSREVPEVSQDFLANAVDRGTFPQYVAQRA
Ga0126384_1072260713300010046Tropical Forest SoilLKRRTLVGSIFAVIWLVALPAWAVEYRLEVTNMDALTFSSHMGKATPWWHQNEPMGGLEARLDAMQFPTAAVIPGREVHLLDDPAYGGTVPTRVSLLPATGKQAWTMYVWDGKPGDIVAFAVTSDMAAWQEVWDVAANPGGALRRLSIGGPSIFGHAWQEIPKVSQEFLANAVDRGTFTQWVAQHAKVVDGMSFVVGQGHNTFYSPDRVYVFVKLPPEPHTFKVVIGWKDHNDRGTG*
Ga0126384_1131021913300010046Tropical Forest SoilMFALIWLTALPAWAVEYRLQVANIDDRTFSSYEGKASSWTSEKEPMGRLEARLDQERFSPTAILPGHHVELLEDPAYGGTTPTRVSLLPATGHQDWTTYVFDANPGDTVAFVVRTDMDAWQEVMDVAADAQGTFRRLSIGGPGLFGGSLEVPEVSQDF
Ga0126382_1010230913300010047Tropical Forest SoilMFMLLWFTALPAWATEYRLQVANIGDQVFASYEGKAKSFWSQKEPMGRLEARLDDNHFSRAAILPGHHMELLEDPAYGGVTPTKVSLLPATRYQDWSTYVFDANPGDTVVFVVRTDMDAWQRVENVAADDHGTLRRLSIGGPGFFGGSREVPQVPQEFLANAADRGTFPQYVAQHAKTVDGMSFVVGQGLNPDYDPDQVYITIKLPPEPHTFQAVIGWQDRIGDRMNQNDAR*
Ga0126382_1068281213300010047Tropical Forest SoilMKSRTLVGGIFALLWLTALPAWAVEYRLQVANIDDETFASYEGKVSSFWSMKEPMGRLEARLDDNRFSRAAILPGHRVELVQDPAYGGITPTKVARLPATRHQDWTTYVFDANPGDTVVFVVRPDIAAGEEAEDVASDVKCGFRRLSIGGPGFFGGSREVPQVDQE
Ga0126382_1084291113300010047Tropical Forest SoilGFLNFSSYEGKATPWWAQNEPMGRLEARLDAQQFSPAAVIPGREVQLLEDPAYGGTVPARVSLLPATRQQAWSTYVFDGKPGDTVAFVVKSYMAAWQEIWQVAANPGGQLRRLSMAGPGIFRRFWQEVPEVSQLFLANAVDRGTFPQYVAQHAKAVDGMSLVVGQGYDTFYAPDRLYILLKLPPEPHTFKVVIGWRDHDNRSNK*
Ga0127457_104271613300010081Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGRLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVVFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFV
Ga0127461_105767913300010084Grasslands SoilLLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNR
Ga0127496_106002413300010086Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVAFVVRTDMAAWQKVWDVAADANGTLRRLSIGGPGIFGGSREVPQVAQEFLANAVDRGTFPQYVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAQPHTFQTAIGWQDRGNRMNQGNGGM*
Ga0127454_103401913300010089Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAGGGNN*
Ga0127485_110015013300010091Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVVFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQPHTFQTAIGWQDRGNRMNQ
Ga0127490_104010113300010093Grasslands SoilMFALLWLTALPAWATEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQPHTFQTAIGWQDRGNRMN
Ga0127501_100409413300010097Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQP
Ga0127501_110372113300010097Grasslands SoilTFSSYEGKAPSWTSMKEPMGRLEARLDDNQFSRTAILPGHHVELLEDPAYGGTTPTRVSLLPATRHQDWSTYVFDANPGDTVAFVVRTDMVAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQP
Ga0127500_108637513300010103Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVVFVVRTDMVAWQEVIDVAADANGTFRQLSIGGPGIFGGSREVPEVSQDFLANAVDRGTFPQYVAQRAKAVDGMSFVVGRGNNSEDDADRVYITLKLPPQPHTFKAVIGWHDRQGDRMEQSD
Ga0127500_108728813300010103Grasslands SoilMFALLWLTALPAWATEYRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPLTFQAAIGWQDRGNLTNQDEAGGGNN*
Ga0127446_108912313300010104Grasslands SoilMFALLWLTALPAWATEYRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWV
Ga0127497_107115913300010109Grasslands SoilMFALLWLTALPAWATEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNQFSRAAILPGHHVEMLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDR
Ga0127491_104700013300010111Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAHSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQPHTFQTAIGWQDRGNLTNQDEAGGGK*
Ga0127458_102879913300010112Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVVFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSREVPQVAQEFLANAVDRGTFPQYVAQHAKAVDGMSFVVGQGDNPRYDPDRVYIVLKLPPQPHTFQTVIGWKDRGNL
Ga0127444_101922113300010113Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAQPHTFQMAIGWQDRGN
Ga0127460_104870113300010114Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFPQWVA
Ga0127460_113633613300010114Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGRLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQPHTFQTAIGWQDRGNRMNQGNGGM*
Ga0127495_115921513300010115Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLADNQFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDE
Ga0127449_101589813300010117Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDHGTFPQWVAQHAKA
Ga0127452_115358813300010119Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQPHTFQTAIGWQDRGNRMNQGNGGM*
Ga0127451_113895013300010120Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGSSREVPQVTQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAG
Ga0127488_106993913300010122Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVARTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDHGTFPQW
Ga0127498_100222413300010124Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVVFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGSSREVPQVTQEFLANAVDRGTFPQYVAQHAKAVDGMSFVVGQGDNPRYDPDRVYIVLKL
Ga0127489_110958513300010127Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAK
Ga0127486_107755013300010128Grasslands SoilMFALLWLTALPAWATEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAV
Ga0127493_111202513300010130Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAGGGNN*
Ga0127455_102590213300010132Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVVFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNPDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAGG
Ga0127459_103210313300010133Grasslands SoilRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNPDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAGGGNN
Ga0127484_107976113300010134Grasslands SoilSMKEPMGRLEARLDDNHFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVVFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNRMNQGNGGM*
Ga0127447_112743913300010136Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAGGGK*
Ga0127456_116232513300010140Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVAFVVRTDMAAWQKVWDVAADANGTLRRLSIGGPGIFG
Ga0127499_104024013300010141Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVVFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFPQWVAQHA
Ga0127499_121880713300010141Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGN
Ga0127483_119772813300010142Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEA
Ga0126321_116163823300010145SoilMFALLWLTALPTWAAEYRLEYRLQVANIDDQVFASYEGKASSFWSQDQPMGRLAARLDDNQFSRAAILPGHHVELLEDPAYGGITPTRVSLLPATRYQDWSTFVFDANPGDTVAFVVRTDMYAWQQVEDVGANVDGTLRRLSIGGPGIFGGSREVPQVSQDFLANAVDRRTFPQYVAQRAKAVDGMSFVVGQGDNPRYDPDRLYVLLKLPPEPHTFKVLVGWKDRGNLIHQSGGQG
Ga0126319_152874013300010147SoilNMKARTLVGGMFALLWLTALPAWAVEYRLQVANIDDQTFSSYEGKGNSFWSQKEPMGRLEARLDDNHFSRTAILPGHHIELLEDPAYGGTTPTKVSLLPATRHQDWSTYVFDANPGDTVAFVVRTDMVAWQQAVDVAADANGTLRRLSIGGPGIFGKSSREVPQVAEEFLANAVDRGTFPQYVAQHAKAVDGMSFVVGQGDNPRYDPDRVYIVLKLPPQPHTFQTVIGWRDRGNLRNQDEGGGDN*
Ga0127503_1014997913300010154SoilMFALLWLTALPAWAAEYRLQVANIDDRTFSSYQGKAPSWTSMKEPMGRLEARLDNTQFSNAAILPGHHVELLEDPAYGGTTPTRVSLLPATGHQDWSTYVFDANPGDTVAFVVRTDMYAWQEVMDVAADANGTLRRLSIGGPGIFGKSSREVPQVAEEFLANTVDRGTFPQYVAQHARAVDGMSFVVGQGDNPRYDPDRVYIVLKLPSQPHTFQTVIGWKDRGNLMNQDQGG
Ga0126377_1037625823300010362Tropical Forest SoilMFAFIWLMALPAWAVEYRLQVTNVGFLNFSSYMGKATPWWAQNEPMGRLEARLDAQQFSPAAVLPGREVQLLEDPAYGGTVPARISLLLATRQQAWTTYVFDGKPGDTVAFVVRSDMAAWQEVWFVAANPGGTLRRLSMAGPGIFGRFWQDIPQVSQDFLANAVDRGTFPQYVAQHAKPVDGMSLVVGEGHNTFYDPDRVYVLLKLPPEPHTFKVVVGWRDHDNRGNDD*
Ga0126381_10418870013300010376Tropical Forest SoilALTFSSHMGKATPWWHQNEPMGGLEARLDAMQFPTAAVIPRREVHLLDDPAYGGTVPTRVSLLPATGKQAWTMYVWDGKPGDIVAFAVTSDMAAWQEVWDVAANPGGALRRLSIGGPSIFGHAWQEIPKVSQEFLANAVDRGTFTQWVAQHAKVVDGMSFVVGQGHNTFYSPDRVYVFVKLPPEPH
Ga0126383_1009208343300010398Tropical Forest SoilMFALIWLTALPAWAVEYRLQVANIDDRTFSSYEGKASSWTSEKEPMGRLEARLDQERFSPAAILPGHHVELLEDPAYGGTTPTRVSLLPATGHQDWTTYVFDANPGDTVAFVVRTDMDAWQEVMDVAADAQGTFRRLSIGGPGLFGGSREVPEVSQDFLANAVD
Ga0126383_1098938923300010398Tropical Forest SoilLKRRTLVGSIFAVIWLVALPAWAVEYRLEVTNMDALTFSSHMGKATPWWHQNEPMGGLEARLDAMQFPTAAVIPGREVHLLDDPAYGGTVPTRVSLLPATGKQAWTMYVWDGKPGDIVAFAVTSDMAAWQEVWDVAANPGGALRRLSIGGPSIFGHAWQEIPKVSQEFLANAVDRGTFTQWVAQHAKVVDGMSFVVGQGHNTFYSPD
Ga0138112_104439213300010905Grasslands SoilMKARTLVGGMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFP
Ga0137374_1050705813300012204Vadose Zone SoilVKRRTLVEWIFVLIWLLALPAWAVEYRLQVTNLDYLNFSSYLENATSSWRQNEPMERLEKRLDDMKFPPAAVIPGREVQLLEDPAYGGKMPARVSLLPATGRQAWTTFVWDGNPGDTVAFVVKSDMAAWQEVWDVAANPGGTLRRLSIGGPSIFGHPWQEVPEVSQSFLANAVDRGTFSQWVARKAKAVDGMSVVVGQGHNT
Ga0137362_1113605713300012205Vadose Zone SoilMKEPMGRLEARLDNNHFSNAAILPGHHVELLEDPAYGGTTPTRVSLLPATRYQDWSTYVFDANPGDTVAFVVRTDMAAWQKVWDVAADANGTLRRLSIGGPGIFGKSSREVPQVAEDFLANAVDRGTFPQYVAQHAKAIDGMSLLVGQGDNPRYDPDRVYIVLKLPAQPHTFQTVIGWKDRGNLMNQDEGAGDN*
Ga0137380_1103260613300012206Vadose Zone SoilMERLETRLDDMKFPPAAVIPGREVHLLEDPAYGGKAPARVSLLPATGRQAWTTFVWDGNPGDTLTFVVTSEMAAWQEVWAVAANPGGTLRRLSIGGPAIFGHPWQEVPEVSQDFLANAVDRGTFTQWVARNAKAVDGMSVVVGQGHNTFYDPDRVYVLSTLPPEPHTFKVVIGWRDHDDRGSG*
Ga0137376_1156880013300012208Vadose Zone SoilLLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSREVPQVSQDFLANAVDHGTFPQWVAQHAKAVDGMSFV
Ga0150985_10773785913300012212Avena Fatua RhizosphereMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKAPSWTSMKEPMGRLEARLDNNQFSNAAILPGHHVELLEDPAYGGTTPTRVSLLPATQYQDWSTYVFDANPGDTVAFVVRTDMAAWQEATDVAADTNGTLRRLSIGGPGIFGGSREVPQVAQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYIVLKLPPQPHTFQAAIGWQNNQGDLMNHGQDR*
Ga0137369_1032846213300012355Vadose Zone SoilMPQRAHHHRRKIELTRRTLVGWIFAFIWLMALPAWAAEYRLQVTNVGFLNFSSYMGKATPWWAQNEPMGRLEARLEAQQFSPATVIPGREVQLLEDPAYGGTVPARISLLPATRQQAWTTYVFDGKPRDSVAFVVRSDMAAWQEIWDVAANPGGTLRRLSMAGPGIFGHFWQEVPEVSQDFLANAVDRGTFPQYVAQRAKTVDGMSFVVGQGHDTFYDADRLYVLITLPPEPHTFKVVIGWRDHNNRGDG*
Ga0137371_1120652313300012356Vadose Zone SoilLQVANIDDRTFSSYEGKASSWTSEKEPMGRLEARLDQEQFSPAAILPGHHVELLEDPAYGGTTPTRVSLLPATGHQDWTTYVFDAKPGDTVAFVVRTDMAAWQEVRDVAADANGTLRRLSIGGPGIFGKSSREVPQVADDFLANAVDRGTFPQYVAQHAKAVDGMSFVVGQGDNPRYDPDRVYIVLK
Ga0137368_1005571113300012358Vadose Zone SoilAWAVEYRLQVTNLDYLNFSSYLENATSSWRQNEPMERLETRLDEMKFPSAAVIPGREVQLLEDPAYGGNMPARVSLLPATGRQAWTTFVWDGNPGDTVAFVVKSDMAAWQEVWDVAANPGGTLRRLSIGGPSIFGHPWQEVPEVSQSFLANAVDRGTFSQWVARKAKAVDGMSVVVGQGHNTFYAPDRAYVLIKLPPEPRTFKVVIGWRDHDNRGSG*
Ga0137368_1028295313300012358Vadose Zone SoilVKRRTLVEWIFVLIWLLALPAWAVEYRLQVTNLDYLNFSSYLENATSSWRQNEPMERLEKRLDDMKFPPAAVIPGREVQLLEDPASGGRMPIRLSMLEANGRPGWTRFVWAGTPGDNVAFVVKSDMAAWQEVWDVAANPGGTLRRLSIGGPSTFAHPWQEVPEVSQS
Ga0134042_120937613300012373Grasslands SoilEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGN
Ga0134058_118012213300012379Grasslands SoilALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLQDPAYGGTTPTRVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAGGEIIERWSHRPPGRWDRILLATRAGA*
Ga0134047_114939913300012380Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNPDYDPDRVYITLKLPAEPHTFQAAIGWQDRGN
Ga0134036_104142213300012384Grasslands SoilLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVVFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQH
Ga0134043_113092613300012392Grasslands SoilMFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAISWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKASMACPSWSVKGT
Ga0134052_112514613300012393Grasslands SoilRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAGGGK*
Ga0134052_130677913300012393Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQG
Ga0134044_115463213300012395Grasslands SoilFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDHGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEA
Ga0134056_100107013300012397Grasslands SoilMFALLWLTALPAWATEYRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAGGGK*
Ga0134051_124169413300012398Grasslands SoilFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLEDPAYGGTTPTRVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYYPDRVYITLKLPAQPHTFQTAIGWQDRGNRMNQGNGGM*
Ga0134061_103213513300012399Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQPHTFQT
Ga0134061_127690113300012399Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAVIGWQDRGNL
Ga0134048_129201013300012400Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGGSRQVPQVS
Ga0134055_115647723300012401Grasslands SoilALPAWATEYRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGLTVELLQDPAYGCTTRLHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNPDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAGGGNN*
Ga0134055_134597813300012401Grasslands SoilMKSRTLVGGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANA
Ga0134059_108703513300012402Grasslands SoilGIFALLWLTALPAWAAEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGSSREVPQVTQEFLANAVDRGTFPQWLAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQPHTFQTAIGWQDRGNRMNQGNGGM*
Ga0134049_124287013300012403Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQPHTFQTAIGWQDRGNRMNQGNGGM*
Ga0134041_107730613300012405Grasslands SoilLPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANTVDRGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQAAIGWQDRGNLTNQDEAGG
Ga0134053_109681213300012406Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMGRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFG
Ga0134053_137170213300012406Grasslands SoilFALLWLTALPAWAVEYRLQVANIDDETFASYEGKAPSFWSMKEPMGRLEARLDDNQFSRAAILPGHRVELVQDPAYGGITPTKVARLPATRHQDWTTYVFDANPGDTVVFVVRADMVAWEDAEDVASDVKCTFRRLSIGGPGFFGGSREVPQVSQDFLANAAERGTFPQWVAQHAKSLDGMSFVVGQGENTQAYPDRV
Ga0134050_118557213300012407Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGNNPDYDPDRVYITLKLPAQPHTFQTAIGWQDRGNRMNQ
Ga0134045_118948213300012409Grasslands SoilMFALLWLTALPAWAAEYRLQVANMDDRTFSSYEGKANSWTSMKEPMRRLEARLDDNHFSRAAILPGHHVELLQDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVTFVVRTDMAAWQQAVDVAADANGALRRLSIGGPGIFGGSREVPQVSQDFLANAVDHGTFPQWVAQHAKAVDGMSFVVGQGNNSDYDPDRVYITLKLPAEPHTFQTAIGWQDRGNRMNQGNGGM*
Ga0134060_118597713300012410Grasslands SoilMKSRTFVGGIFALLWLTALPAWAVEYRLQVANIDDETFASYEGKAPSFWSMKEPMGRLEARLDDNQFSRAAILPGHRVELVQDPAYGGITPTKVARLPATRHQDWTTYVFDANPGDTVVFVVRADMVAWEDAEDVASDVKCTFRRLSIGGPGFFGGSQEVPQVSQTSSPTPPSEARSRSGWRS
Ga0150984_10729356613300012469Avena Fatua RhizosphereMGRLEARLDAQQFSPAAVIPGREVQLLEDPAYGGTVPTRISLLPATRQQAWTTYVFDAKPGDTVAFVVRSDMAAWQEIWDVAANPGGSLQRLSMAGPGIFGRFWQDIPQVSQDFLANAVDRGTFPQYVAQRAKPVDGMSFVVGEGHDTFYDPDRLYVLLTLPPEPHTFKVVIGWRDHDNRGNDD*
Ga0150984_10956752313300012469Avena Fatua RhizosphereAVEYRLQVANMDDRTFSSYEGKANSWTSQKEPMGNLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTHVSLLPATRHQDWSTYVFDANPGDTVVFVVRTDMAAWQQAVDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYITLKLPAQPHTFQTAIGWQDRGNRMNQGNGGM*
Ga0150984_12237185423300012469Avena Fatua RhizosphereMKSRTLVGGIFALLWLMALPAWAVEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGRLEARLDQQKLSPTAVLPGHHVELLEDPAYGGTTPTRVSLLPATRQQDWSTYVFDANPGDTVAFVVRTDMVAWQQVMDVAADANGTLQRLSIGGPGIFGNSSREVPEVAQEFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYILLKLPPQPHTFQTVIGWQDRGNRMNQRAGR*
Ga0137358_1084190213300012582Vadose Zone SoilMTYRTLIVGLFASLWLLALPAWAMEYQLEVTNVDFLTLSSYMGKATPWWHQNEPLGRLEARLDAQQFSHPAVLPGRQVHLLEDPKYGGTVPTRVSLLPATGKQAWTTYVWDGKPGDTVVFDVTSDMTAWQEIWQVAANPEGTLRRLSIGGPALLGHPWQEVPEVSQLF
Ga0137394_1000229353300012922Vadose Zone SoilMFALIWLMALPAWAVEYRLQVTNVGFLNFSSYMGKATPWWAQNEPMGRLEARLDTQQFSPAAVLPGREVQLLEDPAYGGTVPARISLLPATRQQAWTTYVFDGKPGDTVAFVVRSDMAAWQEVWFVAANPGGTLRRLSMAGPGIFGRFWQEVPEVSQDFLANAVDRGTFPQYVAQRAKAVDGMSLLVGEGHDTFYDADRVYVLLKLPPEPHTFKVVVGWRDHDNRGNDD*
Ga0137394_1048222623300012922Vadose Zone SoilMKEPMGRLEARLDDNQFSRATVLPGHHVELLEDPAYGGTTPTRVSLLPATRHQDWSTYVFDANPGDTVAFVVRTDMVAWQQAVDVAADANGTLRRLSIGGPGIFGSSREVPQVTQEFLANAVDRGTFPQWLAQHAKAVDGMSFVVGQGDNPRYDPDRVYIVLKLPPQPHTFQSVIGWRDRGNLRNQDEGGGDN*
Ga0137359_1081420413300012923Vadose Zone SoilMFALLWLTALPAWATEYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGRLEARLDNNQFSNAAILPGHHVELLEDPAYGGTVPTRVSLLPATRHQDWTTYVFDANPGDTVAFVVRTDMVAWQQVEDVVANTDGTFRRLSIGGPGIFGNSSREVPGVSQGFLATAVDRGTFAQWVAQHAKTVGGMSLAVGEGHDTSYDPDRVYVLLKLPPEPHTFKVVVGWRDHGNQGNDD*
Ga0137404_1000618743300012929Vadose Zone SoilMPPRAHHHRRKIELTRRTLVGWIFAFIWLMALPAWAVEYRLQVTNVGFLNFSAYMGKATPWWAQNEPMGRLEARLDTQQFSPAAVLPGREVQLLEDPVYGGTVPAHISLLPATRQQAWTTYVFDGKPGDTVAFVVRSDMAAWQEIWDVAANPGGTLRRLSMAGPGIFGHFWQEVPEVSQDFLANAVDRGTFPQYVAQRAKRVDGMSFVVGQGHDTFYDADRLYVLVTLPPEPHTFKVVIGWRDHDNRGDD*
Ga0137404_1000949023300012929Vadose Zone SoilMKSRTLVGGIFALLWLTALPAWAVEYRLQVANIDDETFASYEGKAPSFWSMKEPMGRLEARLDDNQFSRAAILPGHRVELVQDPVYGGITPTKVARLPATRHQDWTTYVFDANPGDTVVFVVRADMVAWEDAEDVASDVKCTFRRLSIGGPGFFGGSREVPQVSQDFLANTAERGTFPQWVAQHAKSLDGMSFVVGQGENTQAYPDRVYIVLKLPPEPHTFQAVIGWA*
Ga0137404_1008693013300012929Vadose Zone SoilMAFSAWAVEYRFQVANINDQLFASYEGQGSSFWSQKAPMNRLEARLDQQQFSRAAILPGHHVELLEDPAYGGTVPTRVSLLPATRHQDWTTYVFDANPGDTVAFVVRTDMVAWQQVEDVVANTDGTFRRLSIGGPGIFGNSSREVPGVSQGFLATAVDRGTFAQWVAQHAKTVGGMSLAVGEGHDTSYDPDRVYVLLKLPPEPHTFKVVVGWRDHGNQGNDD*
Ga0137404_1046482313300012929Vadose Zone SoilMKSRTLVGGIFALLWLTALPAWAVEYRLQVANMDDRTFSSYEGKAPSWTSMKEPMGHLEARLDDNQLSRAAILPGHHVELLQDPAYGGTTPTKVSLLPATRYQDWSTYVFDANPGDTVAFVVRTDMAAWQKVWDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQYVAQHAKAIDGMSLLVGQGDNPRYDPDRVYIVLKLPAQPHTFQTVIGWKDRGNLMNQDEGAGDN*
Ga0137407_1011443623300012930Vadose Zone SoilMFALLWLTALPAWAAQYRLQVANIDDRTFSSYEGKAPSWTSMKEPMGRLEARLDDNQFSRAAILPGHHVELLEDPAYGGTTPTNVSLLPATRHQDWSTYVFDANPGDTVAFVVRTDMYAWQKVWDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQYVAQHAKAVDGMSFVVGQGDNPEYDPDRVYITLKLPPQPHTFQAVVGWQDRQGGRIDAR*
Ga0137407_1011849713300012930Vadose Zone SoilMAFSAWAVEYRFQVANINDQLFASYEGQGSSFWSQKAPMNRLEARLDQQQFSRAAILPGHHVELLEDPAYGGTVPTRVSLLPATRHQDWTTYVFDANPGDTVAFVVRTDMVAWQQVEDVVANTDGTFRRLSIGGPGIFGNSSREVPGVSQAFLATAVDRGTFAQWVAQHAKTVGGMSLAVGEGHDTSYDPDRVYVLLKLPPEPHTFKVVVGWRDHGNQGNDD*
Ga0137407_1090368213300012930Vadose Zone SoilMKEPMGRLEARLDNNQFSNAAILPGHHVELLEDPAYGGTTPTKVSLLPATRYQDWSTYVFDANPGDTVAFVVRTDMAAWQKVWDVAADANGTLRRLSIGGPGIFGNSSREVPQVAQEFLANAVDRGTFPQYVAQHAKAIDGMSLLVGQGDNPRYDPDRVYIVLKLPAQPHTFQTVIGWKDRGNLMNQDEGAGDN*
Ga0126375_1063155813300012948Tropical Forest SoilMKSRTLVGGIFSLLWLMALPAWAVEYRLQVANIDDETFASYEGKAPSFLSMKEPMGRLEARLDDNQFSRAAILPGHRVELVQDPAYGGITPTKVARLPATRHQDWTTYVFDANPGDTVVFVVRPDIAAWEDAEDVASDVKCTFRRLSIGGPGFFGGSREVPQVDQEFL
Ga0137405_107358933300015053Vadose Zone SoilMPPRAHHHRRKIELTRRTLVGWIFAFIWLMALPAWAVEYRLQVTNVGFLNFSAYMGKATPWWAQNEPMGRLEARLDMQQFSPAAVLPGREVQLLEDPVYGGTVPAHISLLPATRQQAWTTYVFDGKPGDTVAFVVRSDMAAWQEIWDVAANPGGTLRRLSMAGPGIFGHFWQEVPEVSQDFLANAVDRGTFPQYVAQRAKRVDGMSFVVGQGHDTFYDADRLYVLVTLPPEPHTFKVVIGWRDHDNRGDD*
Ga0137403_1020184513300015264Vadose Zone SoilMFALLWLTALPAWATEYRLQVANIDDQTFSSYEGKAPSWTSMKEPMGRLEARLDNNQFSNAAILPGHHVELLEDPAYGGTTPTKVSLLPATRYQDWSTYVFDANPGDTVAFVVRTDMVAWQQVMDVAAEANGTLRRLSIGGPGIFGGSREVP
Ga0184610_109459613300017997Groundwater SedimentMALPAWAVEYRLQVTNVDFLNFSSYMGKATPWWAQNEPMGRLEARLDAGQFSLAAVLPGREVQLLQDPVYGGTVPARVSRLPATGRQAWTTYVFDANPGDTVAFVVRSDMAAWQEVWFVGANPGSTLRRLSMAGPGIFGRFWQEVPEVSQGFLANAVDRGTFPQYVAQRAKAVDGMSFVVGEGHDTFYDPDRLYVLLTLPPEPHTFKVVVGWRDHDDRGDG
Ga0184637_1051956613300018063Groundwater SedimentMKARTLVGGIFALIWLMALPAWAVEYRLEVTNLDDQVFASYEGNGTSWWSQNEPMGRLEARLNQQQFSPAAVLPGHHVELLEDPAYGGTVPTRVSLLPATGRQAWTTYVFDANPGDTVAFVVRTDMIAWQEVMDVAASDNGTFRRLSIGGPGFFGGSREVPEVSQGFLANAVDRGTFPQYVARRAKAVEGMSLVVGEGHDTAY
Ga0190265_1067889513300018422SoilMKARTLVGGILALTWLMALPAWAVEYRLEVTNLDDQVFSSYEGNGTSWWSQNEPMGRLEARLNQQQFSPAAVLPGHHVELLEDPAYGGTVPTRVSLLPATGRQAWTTYVFDANPGDTVAFVVRTDMIAWQEVMDVAANDHGTFRRLSIGGPGFFEGSREVPEVSQGFLANAVDRGTFPQYVAQRAKAVDGMSLVVGEGHAYYDPDRLYVLVTLPPEPHTFKVVVGWR
Ga0190272_1172165213300018429SoilMKARTLVGGMFALLWLTALPAWATEYRLQVANIDDQVFSSYEGKGNSFWSQKEPMGRLEARLDNDHFSNAAILPGHHVDLLEDPGYGGTTPTNVSLLPATGDQDWSTYVFDGNPGDTVAFVVRTYMYAWQEAMDVAADDHGMLRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQWVAQHAKAVDGMS
Ga0190269_1028579913300018465SoilMELIRRTLVGWMFAFIWLMTLPAWAVEYRLQVTNVGFLNFSSYMGKATPWWAQNEPMGRLEARLDAQQFSPAAVLPGREVQLLEDPAYGGTVPARISLLPATRQQAWTTYVFDGNPGDTVAFVVRSDMAAWQEVWFVAANPGGTLRRLSMAGPGIFGHFWQEVPQVSQLFLANAVDRGTFPQYVAQHAKAVDGMSLVVGQGYDTFYAPDRVYILIKLPPERHTFK
Ga0190268_1029542413300018466SoilMELRRRTLVGWLCAFIWLMALPAWAVEYRLQVTNVGFLNFSSYMGKATPWWAQNEPMGRLEARLDAQQFSPAAVLPGREVQLLEDPAYGGTVPARISLLPATRQQAWTTYVFDGNPGDTVAFVVRSDMAAWQEVWFVAANPGGTLRRLSMGGPGIFGHSSREVPEVSQDFLANAVDRGMFPQYVAQRAKAVDGLSLLVGEGH
Ga0209593_1005575533300027743Freshwater SedimentYRTLVGGLLACIWLMALPAWAVEYRLQVTNVDFLTFSSYMGQDTPWWGQHEPMGRLEARLDAQQFAPAAVVPGREVHLLEDPAYEGTVPARVSQFPAAGRQAWTTYVFDANPGDTVAFVVRSAMAAWQEVWFVAANPGGTLRRLSMAGPGIFGRFWQEVLEVSQDFLAHAVDRGTLPQ
Ga0307278_1002966833300028878SoilMNYRTLIVELFASLWLLALPAWAREYWLEVTNVDFLTLSSYMGKATPWWHQNEPLGRLEGRLDAQQFSPAAVLPGRQVHLLEDPKYGGTVPTRVSLLPATGKQAWTTYVWDGKPGDTVVFDVTSDMAAWQEIWQVAANPEGTLRR
Ga0307278_1003667223300028878SoilMKARTLIGGMFALLWLTALPTWAAEYRLQVANMDDRTFSSYEGQANSWTSQKEPMGRLEARLDDNQFSRATILPGHHVELLEDPAYGGTTPTRVSLLPATRQQDWSTYVFDANPGDTVAFVVRTDMAGWQEAMDVAADDHGTLRRLSIGGLGIFGGSREVPQVSQYFLANAVDHGTFPQWVAQHAKAVDGMSFVVGQGNLPNYPDRLYVVLKLPPEPHTFQAVVGWRDRKGDRMDKKM
Ga0307278_1005058323300028878SoilMDFAFIWLMALPAWAAEYRLQVTNIGFLNFSSYMGKATPWWAQNEPMGRLEARLDAQQFSPAAVIPGREVQLLEDPVYGGTVPARISLLPAHRQQAWTTYVFDGRPGDTVAFGVRSDMAAWQEVWDVAANPGGTLRRLSMAGPGIFGHFWQEVPEVSQDFLANAVDRGTFPQYVAQHAKAVDGMSFVVGEGHDTFYDADRLYVLITLPPEPHTFKVVIGWRDHDNRGDG
Ga0299907_1021358323300030006SoilTALPTWAAEYRLEYRLQVANIDDQVFASYEGKASSFWSQDQPMGRLAARLDDNQFSRAAILPGHHVELLEDPAYGGITPTRVSLLPATRYQDWSTFVFDANPGDTVAFVVRTDMYAWQQVEDVGANVDGTLRRLSIGGPGIFGGSREVPQVSQDFLANAVDRGTFPQYVAQRAKAVDGMSFVVGQGDNPRYDPDRLYVLLKLPPEPHTFKVVVGWKDRGNLIHQSGGQGD
Ga0268386_1070616913300030619SoilMKARTLIGGMFALLWLTALPTWAAEYRLEYRLQVANIDDQVFASYEGKASSFWSQDQPMGRLAARLDDNQFSRAAILPGHHVELLEDPAYGGITPTRVSLLPATRYQDWSTFVFDANPGDTVAFVVRTDMYAWQQVEDVGANVDGTLRRLSIGGPGIFGGSREVPQVSQDFLANA
Ga0302046_1095000913300030620SoilVGGIFALIWLMALPAWAVEYRLEVTNLDTQVFASYEGNGTSWWSQNEPMGRLEARLNQQQFSPAAVLPGHHVELLEDPAYGGTVPTRVSLLPATGRQAWTTYVFDANPGDTVAFVVRTDMIAWQEVMDVAASDNGTFRRLSIGGPGFFGGSREVPEVSQAFLAHAVDRGTFPQYVARRAKAVDGMSLVVGGGIVPTKRRARSG
Ga0308206_112136613300030903SoilFSSYEGKAPSWTSMKEPMGRLEARLDNSRFSNAAILPGHHVELLEDPAYGGTTPTKVSLLPATGHQDWSTYVFDANPGDTVAFVVRTDMFAWQQVWDVAADTNGTLRRLSIGGPGIFGKSSREVPQVAEDFLANAVDRGTFPQYVAQHAKAVDGMSFVVGQGDNPQYDPDRVYIVLKLPPQPHTFQAAIGWQNNQGDLMNQ
Ga0308204_1018836313300031092SoilNIDDRIFSSYEGTGSSFWSQKELMGRLEARLDQQQFSPAAILPGHHVELLEDPAYGGTTPTRVSLLPATGYQDWSTYVFDANPGDTVAFVVRTDMAAWQEVRDVAADANGTLRRLSIGGPGIFGKSSREVPQVAEEFLANAVDRGTFPQYVAQHAKPVDGMSFVVGQGDNPRYDPDRVYIVLKLPPQPHTFQTVIGWRDRGNLRNQDEGGG
Ga0308204_1021205813300031092SoilRTFSAYEGKAPSWTSMKEPMGRLEARLDNNQFSNAAILPGHHVVLLEDPAYGGTTPTRVSLLPATQYQDWSTYVFDANPGDTVAFVVRTDMAAWQQATDVAADTNGTLRRLSIGGPGIFGGSREVPQVAQDFLANAVDRGTFPQWVAQHAKAVDGMSFVVGQGDNPQYDPDRVYIVLKLPPQPHTFQAAIGWQNNQGDLMNQ
Ga0307408_10047839613300031548RhizosphereLTSRTLVGWIFAFIWLMALPAWAVEYRLEVTNIDALTFSSYMGKASPWWAQNEPMGRLEARLDQQQFSPAAVIPGREIQLLEDPAYGGTTPTHVSLLPATRQQVWTTYVFDANPGDTVAFVVKSDMAAWQEIWDVAANPGGTLRRLSMAGPGIFGRFWQEVPEVSQDFLANAVDRGTFPQYVAKRAKAVDGMSLVVG
Ga0307468_10081311913300031740Hardwood Forest SoilMKSRTLVGGIFALLWLTALPAWAVEYRLQVANIDDETFASYEGKAPSFWSMKEPMGRLEARLDNSRFSNAAILPGHHVELLEDPAYGGTTPTRVSLLPATGYQDWSTYVFDANPGDTVAFVVRTDMYAWQEVMDVAADANGTLRRLSIGGPGIFGGSREVPQVAEDFLANAVDRGTFPQYVAQHAKAVDGMSFVVGQGNDLNYPDRVYVLL
Ga0307406_1041271123300031901RhizosphereMALPAWAVEYRLEVTNIDALTFSSYMGKASPWWAQNEPMGRLEARLDQQQFSPAAVIPGREIQLLEDPAYGGTTPTHVSLLPATRQQVWTTYVFDANPGDTVAFVVKSDMAAWQEIWDVAANPGGTLRRLSMAGPGIFGRFWQEVPEVSQDFLANAVDRGTFPQYVAKRAKAVDGMSLVVGQGHDTFYDPDRLYVLIKLPSEPHTFKVVLGWRDHDNRGDG
Ga0307406_1207051813300031901RhizosphereRTLVGWIFAFIWLMALPAWAAEYRLQVTNIGFLNFSSYMGKATPWWAQNEPMERLEARLETQQLSPAAVLPGREVQLREDPAYGRTVPARISLLPATRQQAWTAYVFDGRPGDTVPFVVRSDMAAWQEIWDVASNPGGTLRRLSMAGPGIFGHFWQEVPEVSQDFLANA
Ga0307471_10100764523300032180Hardwood Forest SoilGGIFALLWLTALPAWAVEYRLQVANIDDETFASYEGKAPSFWSMKEPMGRLEARLDDNQFSRAAILPGHRVELVQDPAYGGITPTKVARLPATRHQDWTTYVFDANPGDTVVFVVRADMVAWEDAEDVASDVKCTFRRLSIGGPGFFEGSREVPQVSQDFLANAAERGTFPQWVAQHAKSLDGMSFVVGQGENTQAYPDRVYIVLKLPPEPHTFQAVIGWA
Ga0307471_10420051013300032180Hardwood Forest SoilLVGWIFALIWLMALPAWAVEYRLQVTNVGFLNFSSYMGKATPWWAQNEPMGRLEARLDAQQFSPAAVLPGREVQLLEDPAYGGTVPARISLLPATRQQAWTTYVFDGKPGDTVAFVVRSDMAAWQEVWFVAANPGGTLRRLSMAGPGIFGRFWQEVPEVSQDFLANAVD
Ga0307472_10075192313300032205Hardwood Forest SoilMKARTLVGGIFVLLWLMALPAWAMEYRLDVTNLDDRLFSSYEKNGTSWWSQKEPIGRLEARLDQQQFSPAAVLPGHHVELLEDPAYGGTTPTHVSLLPATRQQAWTTYVFDGNPGDTVAFVVRTDMAAWQEVEDVAANPDGTFRRLSIGGPGIFGNSSREVPEVSQDFLANAVDRGTFP
Ga0307472_10114284213300032205Hardwood Forest SoilMKSRTLVGGIFALLWLTALPAWAVEYRLQVANIDDETFASYEGKAPSFWSMKEPMGRLEARLDDNQFSRAAILPGHRVELVQDPAYGGITPTKVARLPATRHQDWTTYVFDANPGDTVVFVVRADMVAWEDAEDVASDVKCTFRRLSIGGPGFFGGSREVPQVSQDFLANAIERGTFPQWVAQHAKSLDGMSFVVGQGENTQAYPDRVYIVLK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.