NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F070914

Metagenome / Metatranscriptome Family F070914

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F070914
Family Type Metagenome / Metatranscriptome
Number of Sequences 122
Average Sequence Length 311 residues
Representative Sequence VEPGTEEAGLDTSAPCTPEDTVAYAAQAPSTSERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD
Number of Associated Samples 96
Number of Associated Scaffolds 122

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 36.97 %
% of genes near scaffold ends (potentially truncated) 70.49 %
% of genes from short scaffolds (< 2000 bps) 75.41 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score0.39

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (97.541 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(19.672 % of family members)
Environment Ontology (ENVO) Unclassified
(31.967 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(59.016 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 20.06%    β-sheet: 2.87%    Coil/Unstructured: 77.08%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.39
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 122 Family Scaffolds
PF00355Rieske 9.02
PF03807F420_oxidored 4.10
PF00005ABC_tran 1.64
PF03401TctC 0.82
PF13531SBP_bac_11 0.82
PF03118RNA_pol_A_CTD 0.82
PF07690MFS_1 0.82
PF04392ABC_sub_bind 0.82
PF01520Amidase_3 0.82
PF04972BON 0.82
PF02482Ribosomal_S30AE 0.82
PF00162PGK 0.82
PF028262-Hacid_dh_C 0.82
PF04402SIMPL 0.82
PF07859Abhydrolase_3 0.82
PF02518HATPase_c 0.82

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 122 Family Scaffolds
COG01263-phosphoglycerate kinaseCarbohydrate transport and metabolism [G] 0.82
COG0202DNA-directed RNA polymerase, alpha subunit/40 kD subunitTranscription [K] 0.82
COG0657Acetyl esterase/lipaseLipid transport and metabolism [I] 0.82
COG0860N-acetylmuramoyl-L-alanine amidaseCell wall/membrane/envelope biogenesis [M] 0.82
COG1544Ribosome-associated translation inhibitor RaiATranslation, ribosomal structure and biogenesis [J] 0.82
COG2859Outer membrane channel-forming protein BP26/OMP28, SIMPL familyCell wall/membrane/envelope biogenesis [M] 0.82
COG2968Uncharacterized conserved protein YggE, contains kinase-interacting SIMPL domainFunction unknown [S] 0.82
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 0.82
COG3181Tripartite-type tricarboxylate transporter, extracytoplasmic receptor component TctCEnergy production and conversion [C] 0.82
COG3471Predicted secreted (periplasmic) proteinFunction unknown [S] 0.82


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms97.54 %
UnclassifiedrootN/A2.46 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000580|AF_2010_repII_A01DRAFT_1039455All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria732Open in IMG/M
3300000597|AF_2010_repII_A1DRAFT_10001806All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria5576Open in IMG/M
3300005167|Ga0066672_10503319All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria787Open in IMG/M
3300005172|Ga0066683_10349156All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria918Open in IMG/M
3300005174|Ga0066680_10107767All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1711Open in IMG/M
3300005332|Ga0066388_102379223All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria960Open in IMG/M
3300005363|Ga0008090_10067172All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2645Open in IMG/M
3300005363|Ga0008090_10245595All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1598Open in IMG/M
3300005445|Ga0070708_100330017All Organisms → cellular organisms → Bacteria1437Open in IMG/M
3300005467|Ga0070706_100404993All Organisms → cellular organisms → Bacteria1270Open in IMG/M
3300005552|Ga0066701_10203161All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1215Open in IMG/M
3300005554|Ga0066661_10069778All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2050Open in IMG/M
3300005556|Ga0066707_10256824All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1141Open in IMG/M
3300005559|Ga0066700_10457401All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria895Open in IMG/M
3300005764|Ga0066903_100516649All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2031Open in IMG/M
3300005764|Ga0066903_100555897All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1970Open in IMG/M
3300006028|Ga0070717_10098532All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2478Open in IMG/M
3300007255|Ga0099791_10043518All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1997Open in IMG/M
3300009012|Ga0066710_100839170All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1411Open in IMG/M
3300009792|Ga0126374_10273007All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1117Open in IMG/M
3300010046|Ga0126384_10166099All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1715Open in IMG/M
3300010046|Ga0126384_10647410All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria931Open in IMG/M
3300010046|Ga0126384_10824791All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria832Open in IMG/M
3300010358|Ga0126370_10292927All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1286Open in IMG/M
3300010359|Ga0126376_10114491All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2088Open in IMG/M
3300010360|Ga0126372_10307279All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1398Open in IMG/M
3300010361|Ga0126378_10328451All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1637Open in IMG/M
3300010361|Ga0126378_10483329All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1353Open in IMG/M
3300010362|Ga0126377_10493202All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1255Open in IMG/M
3300010366|Ga0126379_10147835All Organisms → cellular organisms → Bacteria2184Open in IMG/M
3300010366|Ga0126379_10267205All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1696Open in IMG/M
3300010366|Ga0126379_10826459All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1026Open in IMG/M
3300010376|Ga0126381_100905708All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1270Open in IMG/M
3300010376|Ga0126381_101293289All Organisms → cellular organisms → Bacteria1054Open in IMG/M
3300010376|Ga0126381_101971308All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria842Open in IMG/M
3300010398|Ga0126383_10631021All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1145Open in IMG/M
3300011270|Ga0137391_10094148All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium2591Open in IMG/M
3300011270|Ga0137391_10148123All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium2044Open in IMG/M
3300012200|Ga0137382_10181179All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1441Open in IMG/M
3300012202|Ga0137363_10082348All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2401Open in IMG/M
3300012205|Ga0137362_10105755All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium2375Open in IMG/M
3300012207|Ga0137381_10906412All Organisms → cellular organisms → Bacteria762Open in IMG/M
3300012208|Ga0137376_10030045All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales4300Open in IMG/M
3300012210|Ga0137378_10289240All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1524Open in IMG/M
3300012210|Ga0137378_10539910All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1074Open in IMG/M
3300012211|Ga0137377_10063303All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales3449Open in IMG/M
3300012356|Ga0137371_10285650All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1286Open in IMG/M
3300012357|Ga0137384_10289357All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1364Open in IMG/M
3300012361|Ga0137360_10084553All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2389Open in IMG/M
3300012361|Ga0137360_10206862All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1592Open in IMG/M
3300012362|Ga0137361_10094828All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium2583Open in IMG/M
3300012362|Ga0137361_10098110All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2543Open in IMG/M
3300012362|Ga0137361_10297290All Organisms → cellular organisms → Bacteria1477Open in IMG/M
3300012363|Ga0137390_10198361All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1994Open in IMG/M
3300012685|Ga0137397_10180870All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1566Open in IMG/M
3300012911|Ga0157301_10126178All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria786Open in IMG/M
3300012923|Ga0137359_10110714All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2429Open in IMG/M
3300012923|Ga0137359_10175030All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1916Open in IMG/M
3300012948|Ga0126375_10553478All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria869Open in IMG/M
3300012955|Ga0164298_10198250All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1171Open in IMG/M
3300012957|Ga0164303_10352412All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria890Open in IMG/M
3300012971|Ga0126369_10428828All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1366Open in IMG/M
3300013297|Ga0157378_11150708All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria814Open in IMG/M
3300016341|Ga0182035_10278103All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1363Open in IMG/M
3300016341|Ga0182035_10402077All Organisms → cellular organisms → Bacteria1151Open in IMG/M
3300016341|Ga0182035_10671670All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria900Open in IMG/M
3300016357|Ga0182032_10279538All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1310Open in IMG/M
3300016357|Ga0182032_10350821All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1181Open in IMG/M
3300016357|Ga0182032_10873918All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria763Open in IMG/M
3300016371|Ga0182034_10286631All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1314Open in IMG/M
3300016371|Ga0182034_10384777All Organisms → cellular organisms → Bacteria1147Open in IMG/M
3300016371|Ga0182034_10634804All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria904Open in IMG/M
3300016387|Ga0182040_10320474All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1192Open in IMG/M
3300016404|Ga0182037_10272248All Organisms → cellular organisms → Bacteria1349Open in IMG/M
3300016422|Ga0182039_10329845All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1274Open in IMG/M
3300016445|Ga0182038_10435323All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1107Open in IMG/M
3300018433|Ga0066667_10418015All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1088Open in IMG/M
3300018468|Ga0066662_10174796All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1661Open in IMG/M
3300021478|Ga0210402_10830150All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria850Open in IMG/M
3300021560|Ga0126371_10151986All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium2377Open in IMG/M
3300021560|Ga0126371_10220084All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2002Open in IMG/M
3300021560|Ga0126371_11612389All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria775Open in IMG/M
3300025910|Ga0207684_10223479All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1625Open in IMG/M
3300025915|Ga0207693_10350741All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1155Open in IMG/M
3300025922|Ga0207646_10417925All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1211Open in IMG/M
3300026319|Ga0209647_1024109All Organisms → cellular organisms → Bacteria → Proteobacteria3695Open in IMG/M
3300026551|Ga0209648_10113989All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2193Open in IMG/M
3300027874|Ga0209465_10065861All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1751Open in IMG/M
3300028828|Ga0307312_10042913All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2677Open in IMG/M
3300028884|Ga0307308_10143747All Organisms → cellular organisms → Bacteria1142Open in IMG/M
3300031226|Ga0307497_10079594All Organisms → cellular organisms → Bacteria1229Open in IMG/M
3300031545|Ga0318541_10101666All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1544Open in IMG/M
3300031573|Ga0310915_10104092All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1921Open in IMG/M
3300031719|Ga0306917_10189389All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1552Open in IMG/M
3300031720|Ga0307469_11172928All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria725Open in IMG/M
3300031744|Ga0306918_10465704All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria988Open in IMG/M
3300031748|Ga0318492_10044702All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2048Open in IMG/M
3300031771|Ga0318546_10356575All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1018Open in IMG/M
3300031833|Ga0310917_10266098All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1156Open in IMG/M
3300031835|Ga0318517_10050398All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1742Open in IMG/M
3300031860|Ga0318495_10144814All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1071Open in IMG/M
3300031890|Ga0306925_10201745All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2145Open in IMG/M
3300031890|Ga0306925_10437668All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1398Open in IMG/M
3300031893|Ga0318536_10223258All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria959Open in IMG/M
3300031910|Ga0306923_10124471All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2929Open in IMG/M
3300031910|Ga0306923_10485195All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1399Open in IMG/M
3300031942|Ga0310916_10502171All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1033Open in IMG/M
3300031946|Ga0310910_10354743All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1159Open in IMG/M
3300031954|Ga0306926_10867229All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1083Open in IMG/M
3300031981|Ga0318531_10189562All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria926Open in IMG/M
3300032001|Ga0306922_11277665All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria743Open in IMG/M
3300032041|Ga0318549_10168165All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria980Open in IMG/M
3300032059|Ga0318533_10314294All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1138Open in IMG/M
3300032076|Ga0306924_11194438All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria823Open in IMG/M
3300032094|Ga0318540_10320682All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria748Open in IMG/M
3300032180|Ga0307471_101964894All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria733Open in IMG/M
3300032261|Ga0306920_100227137All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2785Open in IMG/M
3300032261|Ga0306920_101328142All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1033Open in IMG/M
3300033289|Ga0310914_10158213All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2003Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil19.67%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil18.85%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil18.03%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil13.11%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil5.74%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.92%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.92%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil4.10%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil3.28%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.46%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.64%
Tropical Rainforest SoilEnvironmental → Terrestrial → Soil → Unclassified → Tropical Rainforest → Tropical Rainforest Soil1.64%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.82%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.82%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000580Forest soil microbial communities from Amazon forest - 2010 replicate II A01EnvironmentalOpen in IMG/M
3300000597Forest soil microbial communities from Amazon forest - 2010 replicate II A1EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005363Tropical rainforest soil microbial communities from the Amazon Forest, Brazil, analyzing deforestation - Metatranscriptome F II A100 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012911Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S088-202R-2EnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300016341Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170EnvironmentalOpen in IMG/M
3300016357Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000EnvironmentalOpen in IMG/M
3300016371Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172EnvironmentalOpen in IMG/M
3300016387Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176EnvironmentalOpen in IMG/M
3300016404Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082EnvironmentalOpen in IMG/M
3300016422Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111EnvironmentalOpen in IMG/M
3300016445Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026319Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027874Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300031226Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_SEnvironmentalOpen in IMG/M
3300031545Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26EnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031719Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031744Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)EnvironmentalOpen in IMG/M
3300031748Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f22EnvironmentalOpen in IMG/M
3300031771Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19EnvironmentalOpen in IMG/M
3300031833Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178EnvironmentalOpen in IMG/M
3300031835Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f21EnvironmentalOpen in IMG/M
3300031860Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f25EnvironmentalOpen in IMG/M
3300031890Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2)EnvironmentalOpen in IMG/M
3300031893Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f28EnvironmentalOpen in IMG/M
3300031910Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)EnvironmentalOpen in IMG/M
3300031942Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176EnvironmentalOpen in IMG/M
3300031946Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172EnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300031981Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f25EnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300032041Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f22EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032094Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f25EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
AF_2010_repII_A01DRAFT_103945513300000580Forest SoilAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEAREAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIA
AF_2010_repII_A1DRAFT_1000180643300000597Forest SoilMRVFASTMLIIGAVLSAFATPFAGQCVFADPVEPGTEEAGLDTSAPCTPEDTVAYAAQAPSTSERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEAREAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD*
Ga0063356_10235001513300004463Arabidopsis Thaliana RhizospherePGYTMTRQGPERAIGRLHPEFVNRLAGAIAEARSAGLTFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTQASLLWHEIAARHGVICPYGPYNPMEWNHCQPTWVKIILPENPLRETVTVDGPISLEGMFEIGYSLITTSGTAIEPAVDPPAHFFKQPQARTAMPADPRTTSAANMRTAHNQTTKSVHPVLDRYAMRWPPISGGKVGWPKGVPRIAKFDDDPLKLNSRASLRTSSSQSILIIANSTIGRSTTRSALERQHQ
Ga0066672_1050331913300005167SoilTATPGYTMMRQGPERAIGRLHPEFINRLAAAIAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIAASRSVDGPTVDLPAHFFKPLQMPVEPRITPAANIKIAGNRRAVQTGRTVLNAGATKPARLSGGKLGWPNGVPRIANLDEEPRNANSAPKSRPSPPKPFALPRLIGLRAAGR
Ga0066683_1034915613300005172SoilMRIFASGMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQ
Ga0066680_1010776713300005174SoilMRIFASGMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPTSSARSRTLMLRPIVMPRIIAATTRAD*
Ga0066388_10237922313300005332Tropical Forest SoilTVEGGLDTSARCNPEDIAVHAPQPSSTSEWPCPADTREPGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAENPLRETVTADGPISLEGMFEVGSSVIAASGSAGASSADPPAHFFNPPRAGAAMPVEQRLTAAASIKPAYERGRSQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRTLLLQPIVMPRMIAAMAQGR*
Ga0008090_1006717223300005363Tropical Rainforest SoilETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEAREAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSVFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD*
Ga0008090_1024559513300005363Tropical Rainforest SoilMRVFASTMLMIGAALSAFTTPFPGQSVFADSVEPATVEAELDTSALCNPEDIAVHAPQPSSTSERPCPADTREPDETARARAYLIATATPGYTMTRQGPERAIGWLHPEFVKRLAAAIAEARGAGLLFAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGAPGTPSALLWHEIAARNGVICPYGPYNLLEWNHCQPTWVKIIFADNPLRETVTADGPISLEDMFEVGHSLIAASGTVGAPTGDPPAHFFKPPQARPAMAIDTKTANSRITPKTTLSTLDGYTMRWPLLSGGRVGWPKGVPRIAMLDDETRSSGSRAANRRFSPPSLTSIPLMPNKEVPNKSTSPWGVQLVGGLS
Ga0070708_10033001713300005445Corn, Switchgrass And Miscanthus RhizosphereMRALASAMLMIAAMFTGFATPFAGQVFADSVEAGTVEAGLDTSELCNPDNTYVPQPDAPSGRPCPAHVGETDETARARAYLIATATPGYTMTRQGPERAIERLHPEFVKRLAAAIAEARGAGMPMAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPYSLLEWNHCQPTWVKIILADDPLRETVTADGPITLGGMFELGGSVIAASGSLSGETAGPPQLFKPPQLRTTMPVEQRITAATSIKTAQSWRSPHPDRSIFNRRAMKRPQLSVGKLGWPNGVPRIAILDAELRKSMSSARSRISVLKPIVMPRITVAIM
Ga0070706_10040499323300005467Corn, Switchgrass And Miscanthus RhizosphereMLMIAAMFTRFATPFAGQVFADSVEAGTVEAGLDTSELCNPDNIYVPQPAAPSGRPCPAHVGETDETARARAYLIATATPGYTMTRQGPERAIERLHPEFVKRLAAAIAEARGAGMPMAGIFSAYRPPAFGVGGFADKFYSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPYSLLEWNHCQPTWVKIILADDPLRETVTADGPITLGGMFELGGSVIAASGSLSGETAGPPQLFKPPQLRTTMPVEQRITAATSIKTAQAWRSPHPDRSIFNRRAMKRPQLSVGKLGWPNGVPRIAILDAELRKSMSSARSRISVLKPIVMPRITVAIMPPESNQR*
Ga0066701_1020316123300005552SoilATCNSEDIVVYAPQPSSARESPCPADAREVDETARARAYLIATATPGYTMMRQGPERAIGRLHPEFVNRLAAALAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIAASRSVDGPTADLPAHFFKPLQMPVEPRITPAANIKIAGNRRAVQTGRTVLNAGASKPARLSGGKLGWPNGVPRIANLDEEPRNANSAPKTRPSPPKPLAMPRLIGLTAAGQIALR*
Ga0066661_1006977853300005554SoilMRIFASGMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQTD
Ga0066707_1025682413300005556SoilMRIFASGMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIAN
Ga0066700_1045740113300005559SoilDSVEPRTVEGGLNTSARCNPEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPKS
Ga0066903_10051664923300005764Tropical Forest SoilMRAFASATLIIGVVLGAFATPSAGQGVSADPVEPRTVEGGLDTSARCNPEDTAVHAPQPSSTSERPCPADTREPGETARARSYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPSFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAENPLRETVTADGPISLEGMFEVGSSMIAASGSAGASSADPPAHFFNPPRAGAAMPVEQRLTASIKPAYERGRSQTDRSIFDSRAVKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRTLLLQPIVMPWILRLWRKGDRQAGPIAALSTSTI*
Ga0066903_10055589723300005764Tropical Forest SoilMRAFASAMLMIGGVLGAFATPSAGQELFADSVGQQIVEAGRNTSASCNPEDIVIYGPQLPSASERPCPAHTREADETARARLYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPAFGVGGFSDKFQSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPHNALEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGVSSADSPTHFFKPPQARTAMPVEQRLTAVAFIKTAYEGGRSRIDRSIFDGRAMKSPFPVGKSGWPKGVPRIANLADEPRRPKSSAKNRMLLLKPIVMPRMIAAISQGR*
Ga0070717_1009853223300006028Corn, Switchgrass And Miscanthus RhizosphereMRALASAMLMIAAMFTRFATPFAGQVFADSVEAGTVEAGLDTSELCNPDNIYVPQPAAPSGRPCPAHVGETDETARARAYLIATATPGYTMTRQGPERAIERLHPEFVKRLAAAIAEARGAGMPMAGIFSAYRPPAFGVGGFADKFYSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPYSLLEWNHCQPTWVKIILADDPLRETVTADGPITLEGMFELGGSVIAASGSLSGETAGPPQLFKPPQLRTTMPVEQRITAATSIKTAQAWRSPHSDRSIFNRRAMKRPQLSVGKLGWPNGVPRIAILDAELRKSMSSARSRISVLKPIVMPRITVAIMPPESNQR*
Ga0099791_1004351813300007255Vadose Zone SoilMLSAFASPLSGQRVLADSGEPVTLEAGHGTPATCNSEDIVVYAPQPSSARESPCPADAREVDETARARAYLIATATPGYTMMRQGPERAIGRLHPEFVNRLAAALAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIVASGSVDGPTADLPAHFLKPLQMPVEPWITPAANIKIAGNRRAVQTGRSVLSAGSTKPARLSGGKLGWPNGVPRIADLDEEPRNANSAPKSRPSPPKPFALPRLIGLRAAGRTDLLH*
Ga0066710_10083917013300009012Grasslands SoilMRIFASGMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQTDRSIFDSRAMKSPVLCRQVWLA
Ga0126374_1027300713300009792Tropical Forest SoilQPSSTSERPCPADTREPGETARARAYLIATATPGYTMTRQGPERAIGRLHPEFVNRLAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKLIFAEDPLRETVTVDGPISLEGMFEVGSSVIAASSSVGAKNAYPPTHFFYPPRVGAAMPGKRRLTAAAFIQTAYERGRSQTDRSIFDSRAMKSPFSVGKSGWPEGVPRIANLEGEPRRPKSSAKNRMLLLKPIVMPRMIAAMGQGR*
Ga0126384_1016609913300010046Tropical Forest SoilVRAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPRVGAAMPVEQRLTATALIQTAYEGGRSHQTNRSIFDSRAMKGPLSVGKSGWPKGVPRIANLADEPRRPNSSAKNHTLLLKPIVMPRIAAMAQGR*
Ga0126384_1064741013300010046Tropical Forest SoilMRVFASTMLTIGAVLSAFTTPFAGQCVFADPVEPGTEEAGLDTSAPCTPEDTVAYAAQAPSTSERPCPAHTREAVETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARKAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAGDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQT
Ga0126384_1082479113300010046Tropical Forest SoilLTIGAVLSAFTNPFAGQWVFADSVEPGTVEAGLDTSARCNPEDIAVHAPQPSSTSERPCPAHTREAGETARARAYLIVTATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVRLILAEDPLRETVTADGPISLQGMFEVGYSVIAASSSVGGPTGDPPAHSAIEQRIIAAANIRTAYERGPPQTYRSIFGSRV
Ga0126370_1029292713300010358Tropical Forest SoilMRVFASVMLMIGGVLSAYASPSDGQGVFADSVEPGTVEAGLDTSARCNPEDIAVYGPQLPPTNDRQCPAHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLVGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSGVGASNAGPPTHLFNPPRVVEQRLTATAFIQTAYEGGRSHQTNRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPNSPAKNRTLLLKPIVMPRMIAAMAQGR*
Ga0126376_1011449123300010359Tropical Forest SoilMRVFASVMLMIGGALSAYASPSDGQGVFADSVEPRAVEAGLDTSARCNPEDIAVYGPQLPPTNDRQCPAHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSGVGASNAGPPTHLFNPPRVVEQRLTATAFVQTAYEGGRSHQTNRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPNSPAKNRTLLLKPIVMPRMIAAMAQGR*
Ga0126372_1030727913300010360Tropical Forest SoilMRAFASTMLTIGAVLSAFTNPFAGQWVFADSVEPGTVEAGLDTSARCNPEDIAVHAPQPSSTSERPCPADTREPGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKLILAEDPLRETVTVDGPISLEGMFEVGSSVIAASSSVGAKNAYPPTHFFYPPRVGAAMPGKRRLTAAAFIQTAYERGRSQTDRSIFDSRAMKSPFSVGKSGWPEGVPRIANLEGEPRRPKSSAKNRMLLLKPIVMPRMIAAMGQGR*
Ga0126378_1032845113300010361Tropical Forest SoilMRAFASTMLTIGAVLSAFTNPFAGQWVFADSVEPGTVEAELDTSALCNPEDIAVHASQPSSTSERPCPADTREPGETARARAYLIATATPGYTMTRQGPERAIGRLHPEFVNRLAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVRLILAEDPLRETVTADGPISLQGMFEVGYSVIAASSSVGGLTGDSPAHSAMEQRITATANIRTAYERGPPQTYRSIFDSRVKKSPRLYVDKLGWPKGVPRIANLEGEPRRPKSSATSRILLKPIVMPRMIGTTTRPKVTGARQAGPGPT*
Ga0126378_1048332923300010361Tropical Forest SoilMLIIGAVLGAFATPSARQGVFADSVEPRTVEGGLDTSARCNPEDIMVYGPQLPSTSEEPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDITGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKVILAEDALRETVTADGPISLEGMFEVGSSVIAASSSAGASSADPPAHFFNPPRAGAARPVEQRLTAAASIKPAYERGRSQTDRSIFDSRAMKNPFSVSKSGWPKGVPRIANLADEPRRPKSSAKNRTLLLQPIVMPRMAAMAQGR*
Ga0126377_1049320213300010362Tropical Forest SoilMLIIGAVLGAFATPSARQGVFADSVEPRTVEGGLDTSARCNPEDIMVYGPQLPSTSEEPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKLILAEDPLRETVTVDGPISLEGMFEVGSSVIAASSSVGAKNAYPPTHFFYPPRVGAAMPGERRLTAAAFIQTAYERGRSQTDRSIFDSRAMKSPFSVGKSGWPEGVPRIANLEGEPRRPKSSAKNRMLLLKPIVMPRMIAAM
Ga0126379_1014783513300010366Tropical Forest SoilYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSVFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD*
Ga0126379_1026720523300010366Tropical Forest SoilMRVFASVMLMIGGVLSAYASPSAGQGVFADSVEPGTVEAGLDTSARCNPEDIAVYGPQLPPTNDRQCPAHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAVGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLDGMFEVGSSVIAASSSVGASNADPPTHLFNPPRVGAAMPVEQRLTATAFIQAAYEGGRSHQTNRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPNSPANRTLLLKPIVMPRMIAAMAQGR*
Ga0126379_1082645913300010366Tropical Forest SoilFTTPFPGQWVFADSVEPATVEAELDTSALCNLEDIAVHAPQPSSTSERPCPADTREPDETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRRAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVRLILAEDPLRETVTADGPISLQGMFEVGYSVIAASSSVGGLTGDSPAHSAMEQRITAAANIRTAYERGPPQTYRSIFDSRVKKSPRLYVDKLGWPKGVPRIANLEGEPRPPKSSATSILLKPIVMPRMIGTTTRPKVTGARQAGPGPT*
Ga0126381_10090570823300010376Tropical Forest SoilMLTIGAVLSAFTNPFAGQWVFADSVEPGTVEAGLDTSARCNPEDIAVHAPQPSSTSERPCPADTREPGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRRAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKLIFAEDPLRETVTVDGPISLEGMFEVGSSVIAASSSVGAKNAYPPTHFFNPPRVGAAMPGEHRLTAAAFIQTAYERGRSQTDRFIFDSRAMKSPFSVGKSGWPKGVPRIANLEGEPRRPKSSAKNRILVLKPIVMPRMIAAMGQ
Ga0126381_10129328913300010376Tropical Forest SoilMRVFASVMLMIGGALSAYASPSDGQGVFADSVEPRAVEAGLDTSARCNPEDIAVYGPQLPPPSDRPCPAHAGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAVGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHETAARHGVICPYGPHNLLEWNYCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPRVGAAMPVEQRLTATAFIQAAYEGGRSHQTNRSIFDSRAMKSPFSVGKSGWPKGVPRIAN
Ga0126381_10197130813300010376Tropical Forest SoilMLIIGAVLGAFATPSARQGVFADSVEPRTVEGGLDTSARCNPEDIMVYGPQLPSTSEEPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDITGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKVILAEDALRETVTADGPISLEGMFEVGSSVIAASSSAGASSADPPAHFFNPPRAGAARPVEQRLTA
Ga0126383_1063102113300010398Tropical Forest SoilCNPEDIAVHAPQPSSTSERPCPADTREPGETARARAYLIATATPGYTMTRQGPERAIGRLHPEFVNRLAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKLILAEDPLRETVTVDGPISLEGMFEVGSSVIAASSSVGAKNAYPPTHFFNPPRVGAAMPGEHRLTAAAFIQTAYERGRSQTDRSIFDSRAMKSPFSVGKSGWPEGVPRIANLEGEPRRPKSSAKNRMLLLKPIVMPRMIAAMGQGR*
Ga0137391_1009414813300011270Vadose Zone SoilMGAVLSAFVASSAGQWVLADSIEPVTLETENGTPAICNSDDISVYVPQPSSTSESPCPAHARETGETARARAYLIATATPGYTMTRQGPERAIGRLHPQFVNRLAAAIAEARGAGLLFAGVFSAYRPPAFGVGGFVDKFHSLHTYGLAVDVTGIGGPGTPEALLWHEIAARHGVICPYGPHNLVEWNHCQPTRVKIIVAENPLRETVTADGPISLEDMFEVGYSLIAGSGGIDGPSADPPAHFFKPPLARPAILEPRINVAATIKTAPNRRVSQTDRSVFNGGPIQRPRLSGDKLGWPKGVAPIVDLSDQPRGQTSPAKSRRLPPKPIVMPRIMADYGAGRK*
Ga0137391_1014812313300011270Vadose Zone SoilMRTFASGMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPCPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTAAAFIKTAYEGGRSQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRMLSLKPIVMPRMIAAIAQGR*
Ga0137382_1018117923300012200Vadose Zone SoilMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAALHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQ
Ga0137363_1008234823300012202Vadose Zone SoilMRTFASGMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPCPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRMLSLKPIVMPRMIAAIAQGR*
Ga0137362_1010575523300012205Vadose Zone SoilMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRMLSLKPIVMPRMIAAIAQGR*
Ga0137381_1090641213300012207Vadose Zone SoilRAIGRLHPEFVNRLAAALAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIAASRSVDGPTVDLPAHFFKPLQMPVEPRITPAANIKIAGNRRAVQTGRTVLNAGATKPARLSGGKLGWPNGVPRIANLDEEPRNANSAPKTRPSPPKPLAMPRLIGLTAAGQIALR*
Ga0137376_1003004513300012208Vadose Zone SoilMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPP
Ga0137378_1028924023300012210Vadose Zone SoilMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQTDRSIFDGRAMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRMLSLKPIVMPRMIAAIAQGR*
Ga0137378_1053991013300012210Vadose Zone SoilMTLEAGHGTPATCNSEDIVVYAPQPASARESPCPADAREADETARARAYLIATATPGYTMMRQGPERAIGRLHPEFINRLAAAIAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIAASRSVDGPTADLPAHFFKPLQMPVEPRITPAANIKIAGNRRAVQTGRTVLNAGATKPARLSGGKLGWPNGVPRIANLDEEPRNANSAPKTRPSPPKPAMPRLIGLTAAGQIALR*
Ga0137377_1006330343300012211Vadose Zone SoilMTLEAGHGTPATCNSEDIVVYAPQPSSARESPCPADAREVDETARARAYLIATATPGYTMMRQGPERAIGRLHPEFVNRLAAAIAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIAASGSVDGPTADLPAHFLKPLQMPVEPWITPAANIKIAGNRRAVQTGRTVLNAGATKPARLSGGKLGW
Ga0137371_1028565013300012356Vadose Zone SoilMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIGRLHPEFVNRLAAALAEARAAGLPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIASRHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGFSVIAASSSVGAPSADPPTHFFNPPRVGAAMPVEQRL
Ga0137384_1028935713300012357Vadose Zone SoilVDETARARAYLIATATPGYTMMRQGPERAIGRLHPEFINRLAAAIAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWQEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIAASRSVDGPTVDLPAHFFKPLQMPVEPRITPAANIKIAGNRRAVQTGRTVLNAGATKPARLSGGKLGWPNGVPRIANLDEEPRNANSAPKTRPSPPKPLAMPRLIGLTAAGQIALR*
Ga0137360_1008455313300012361Vadose Zone SoilMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPYPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRMLSLKPIVMPRMIAAIAHGR*
Ga0137360_1020686223300012361Vadose Zone SoilMTLEAGHGTPATCNSEDIVVYAPQPSSARESPCPADAREADETARARAYLIATATPGYTMMRQGPERAIGRLHPEFVNRLAAALAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIAASRSVDGPTADLLAHFFKPLQMPVEPRITPAANIKIAGNRRAVQTGRTVLNAGATKPARLSGGKLGWPNGVPRIANLDEEPRNANSAPKTRPSPPKPLAMPRLIGLTAAGQIALR*
Ga0137361_1009482833300012362Vadose Zone SoilMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPCPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTGAAFIKTAYEGGRSQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRMLSLKPIVMPRMIAAIAQGR*
Ga0137361_1009811033300012362Vadose Zone SoilMTLEAGHGTPATCNSEDIVVYAPQPSSARESPCPADAREADETARARAYLIATATPGYTMMRQGPERAIGRLHPEFINRLAAAIAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIVASGSVDGPTADLPAHFLKPLQMPVEPWITPAANIKIAGNRRAVQTGRSVLSAGSTKPARLSGGKLGWPNGVPRIADLDEEPRNANSAPKSRPSPPKPFALPRLIGLRAAGRTDLLR*
Ga0137361_1029729013300012362Vadose Zone SoilETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHSVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSNVGAPTADPPAHFFKPPQARTAMPVEQRVTAAANIKTAYTSGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPTSSARSRTLMLRPIVMPRIIAATTRAD*
Ga0137390_1019836123300012363Vadose Zone SoilMLMIGGVLSAFATPSAGQGVFADSVEPRIVEAGRNTSARCNAEDIAVYGPQLPSTSERPCPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHFFNPPRPRAAMPVEQRLTAAAFIKTAYEGGRSQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRMLSLKPIVMPRMIAAIAQGR*
Ga0137358_1024031613300012582Vadose Zone SoilPRGAAYPSQAALSDASRSDPSTPAALSTPARVSDGLLFFQPNPGSEPAPRTAAELAEARAYLIETASPGYTMTLQGPEVAIGRLHPEFAVRLASAIREARSAGLAFAGVFSAYRPPAFGIGGFSDKFNSLHTYGLAVDMHGIGSPGSPEAQLWHQIAAKSGVVCPYGPRARTEWNHCQPTSVKIILAENPLRETVTADGPISLQGMFEVGYSVIAASSSVDAPTGDPPAHSAMEQRITAPANIRTAYERKRPQTYRSIFDSRAKKSSRLYVDNWPRGVPRIANLDDEPRRPKSSATSRTPLKPIVMPRMIVTTARPKVTGARQAGPGST*
Ga0137397_1018087023300012685Vadose Zone SoilMLSAFASPLSGQRVLADSGEPVTLEAGHGTPATCNSEDIVVYAPEPSSARQSPCPADAREAGETARARAYLIATATPGYTMMRQGPERAIGRLHPEFVNRLAAAIAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIVASGSVDGPTADLPAQFFKPLQMPVEPWITPAANIKIAGNRRAVQTGRPVLDAGSTKPARLSGGKLGWPNGVPRIANLDEEPRNANSAPKSRPSPPKSFALPRLIGLTAAGRTDLLR*
Ga0157301_1012617813300012911SoilLIETATPGYTMTRQGPERAIGRLHPEFVNRLAGAIAEARSAGLTFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTQASLLWHEIAARHGVICPYGPYNPMEWNHCQPTWVKIILPENPLRETVTVDGPISLEGMFEIGYSLITTSGTAIEPAVDPPAHFFKQPQARTAMPADPRTTSAANMRTAHNQTTKSVHPVLDRYAMRWPPISGGKVGWPKGVPRIAKFDDDPLKLNSRA*
Ga0137359_1011071423300012923Vadose Zone SoilMTLEAGHGTPATCNSEDIVVYAPQPSSARESPCPADAREADETARARAYLIATATPGYTMMRQGPERAIGRLHPEFVNRLAAALAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSFALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIAASRSVDGPTADLPAHFFKPLQMPVEPRITPAANIKIAGNRRAVQTGRTVLNAGATKPARLSGGKLGWPNGVPRIANLDEEPRNANSAPKTRPSPPKPLAMPRLIGLTAAGQIALR*
Ga0137359_1017503013300012923Vadose Zone SoilMLMIAAVFTAFATPFAGQRVFADSVEAGTVEAGLDTSELCNPDNIVVYAPQPAAPSGRPCPAHAGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVKRLAAAIAEARGAGMPMAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPYSLLEWNHCQPTRVKIILAEAPLRETVTADGPITLEGMFELGYSVIAASSSLRGETDGPPQQLFKPPQLRTTMPVEQRITAAAS
Ga0126375_1055347813300012948Tropical Forest SoilKEWDRMRVFASVMLMIGGVLSAYASPSAGQGVFADSVEPRAVEAGLDTSARCNPEDIAVYGPQLPPTSDRPCPAHAGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDALRETVTADGPISLEGMFEVGSSVIAASSGVGASNAGPPTHLFNPPRVVEQRLTATAFIQTAYEGGRSHQTNR
Ga0164298_1019825013300012955SoilGIVVEPQSFATSEGRCPPHASDTNRTARARAYLIETATPGYTMTRQGPERAIGRLHPEFVNRLAGAIAEARSAGLTFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTQASLLWHEIAARHGVICPYGPYNPMEWNHCQPTWVKIILPENPLRETVTVDGPISLEGMFEIGYSLITTSGTAIEPAVDPPAHFFKQPQARTAMPADPRTTSAANMRTAHNQTTKSVHPVLDRYAMRWPPISGGKVGWPKGVPRIAKFDDDPLKLNSRASLRTSSS*
Ga0164303_1035241213300012957SoilGTCPAYAIEANPTARARAFLIETATPGYTMTRQGPERAIGRLHPEFVNRLAAAIAEARGAGLTSAGIFSAYRPPAFGVGGFVDKFHSLHTYGLAVDVTGIGAPGTPTALLWHEIAARHGVICPYGPYNPMEWNHCQPTWVKIILPENPLRETVTTDGPITLEGMFEVGYSLITTAGSAGGSTADPPAHFFKQPQAWTAMPADPRIASSANRKTAPNQTTPKSVLPDRYAMRWPPISGGKVAWPKGVPRIAKLYDEPLKLNSRAALGTSSPKPILIPRVMAARSNTTIPKSAELAKT
Ga0126369_1042882813300012971Tropical Forest SoilHAPQPSSTSERPCPADTREPGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKLILAEDPLRETVTVDGPISLEGMFEVGSSVIAASSSVGAKNAYPPTHFFYPPRVGAAMPGKRRLTAAAFIQTAYERGRSQTDRSIFDSRAMKSPFSVGKSGWPEGVPRIANLEGEPRRPKSSAKNRMLLLKLIVMPRMIAAMGQGR*
Ga0157378_1115070813300013297Miscanthus RhizosphereRAYLIETATPGYTMTRQGPERALGRLHPEFVNRLAGAIAEARSAGLTFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTQASLLWHEIAARHGVICPYGPYNPMEWNHCQPTWVKIILPENPLRETVTVDGPISLEGMFEIGYSLITTSGTAIEPAVDPPAHFFKQPQARTAMPADPRTTSAANMRTAHNQTTKSVHPVLDRYAMRWPPISGGKVGWPKGVPRIAKFDDDPLKLNSRASLRTSSSQSILIIANSTIGRST
Ga0182035_1027810313300016341SoilHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVYGPTAAPPAYFFRPPQARTAMPLEQRITAAANIKTAYASGPRQTYRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGSRTVMLRPIVMPRIIAATMRAD
Ga0182035_1040207713300016341SoilGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD
Ga0182035_1067167013300016341SoilGETARARAYLIATATPGYTMTRQGPERSIERLHPEFVNRLAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIIFAEDPLRDTVTADGPISLEGMFEVGSSVIAASSSVGASGVDPPTHFFNPPRAGTAIPVEQRPTAAYERGRSQTDRSIFDSRVMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRTLLLQPIVMPRMIAAMAQGR
Ga0182032_1027953813300016357SoilKEWDRMRVFASVMLMIGGVLSAYASPSDGQGVFADSVEPRTVEAGLDTSARCNPEDIAVYGPQLPPTSDRPCPAHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPRVGAAMPVEQRLTATALIQTAYEGGRLRQTTRSIFDSRAMKGPFSVGKSGWPKGVPRIANLIPRRPNSSAKNHPLLLKPIVMPRIAAMAQGR
Ga0182032_1035082113300016357SoilRPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTDYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD
Ga0182032_1087391813300016357SoilPGTEEAGLDTSAPCTPEDTVAYAAQAPSTSERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVDGPTAAPPAYFFRLPQARTAMPLEQRITAAANIKTAYASGPRQT
Ga0182034_1028663113300016371SoilETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVDGPTAAPPAYFFRLPQARTAMPLEQRITAAANIKTAYASGPRQTDRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGRQ
Ga0182034_1038477713300016371SoilETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD
Ga0182034_1063480413300016371SoilMIGGVLSAYASPSAGQGVFADSVEPGTVEAGLDTSARCNPEDIAVYGPQLPPTSDRPCPAHAGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPRVGAAMPVEQRLTATALIQTAYEGGRLRQTTRSIFDSRAMKGPFSVGKSGWP
Ga0182040_1032047413300016387SoilSTSERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD
Ga0182037_1027224813300016404SoilGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVDGPTAAPPAYFFRPPQARTAMPLEQRITAAANIKTAYASGPRQTDRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGSHTVMLRPIVMPRIIAAAMRAD
Ga0182039_1032984513300016422SoilPGTEEAGLDTSAPCTPEDTVAYAAQAPSTSERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRLKSSARSRTSVLRPIVMPRIIAATTRAD
Ga0182038_1043532313300016445SoilAEKLDLSIRCPLYPQKRTNRRRLDSSALCQKRTQRRRARSSVPVNGWTVGIPFLAGGVQAPRLAELLKASTVGLCSGKAKEWDRMRALASAMLMIGGVLSAYASPSDGQGVFADSVEPRTVEAGLDTSARCNPEDIAVYGPQLPPTSDRPCPAHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPRVGAAMPVVHRLTATSLIQTAYEGGRS
Ga0066667_1041801513300018433Grasslands SoilLSAFASPLSGQRVLADSGEPMTLEAGHGTPATCNSEDIVVYAPQPSSARESPCPADAREADETARARAYLIATATPGYTMMRQGPERAIGRLHPEFVNRLAAALAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIAASRSVDGPTVDLPAHFFKPLQMPVEPRITPAANIKIAGNRRAVQTGRTVLNAGATKPARLSGGKLGWPNGVPRIANLDEEPRNANSAPKTRPSPPKPLAMPRLIGLTAAGQIALR
Ga0066662_1017479623300018468Grasslands SoilGMRPTRLATLIAGAMLSAFASPLSGQRVLADSGEPVTLEAGHGTPATCNSEDIVVYAPQPSSARESPCPADAREADETARARAYLIATATPGYTMMRQGPERAIGRLHPEFINRLAAAIAEARGAGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAPGTSYALIWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIIPAENPLRETVTSDGPISLEGMFEVGYSLIVASGSVDGPTADLPAHFLKPLQMPVEPWITPAANIKIAGNRRAVQTGRSVLSAGSTKPARLSGGKLGWPNGVPRIANLDEEPRNANSAPKTRPSPPKPLAMPRLIGLTAAGQIALR
Ga0210402_1083015013300021478SoilDRASGSAPRTEAEIAEARAYLIETASPGYTMTLQTPEVAIGRLNPEFAVRLASAIREARSAGLSFAGVFSAYRPPAFGVGGFSDKFNSLHTYGLAVDMHGIGSPGSSEAQLWHQIAAKNGVVCPYGPRNRAEWNHCQPTTVKIILADNPLRETVTLKGPINLQGMFEVGDSVIAASSSVGAPTGDPPAHSAMEQRITAPANIRTAYERKRPQTYRSIFDSRAKKSSRLYVDNWPRGVPRIANLDDEPRRPKSSATSRTPLKPIVMPRMIVTTARPKVTGARQA
Ga0126371_1015198633300021560Tropical Forest SoilMRVFASTMLMIGAALSAFTTPFPGQSVFADSVEPATVEAELDTSALCNPEDIAVHAPQPSSTSERPCPADTREPDETARARAYLIATATPGYTMTRQGPERAIGWLHPEFVKRLAAAIAEARGAGLLFAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGAPGTPSALLWHEIAARNGVICPYGPYNLLEWNHCQPTWVKIIFADNPLRETVTADGPISLEDMFEVGHSLIAASGTVGAPTGDPPAHFFKPPQARPAMAIDTKTANSRITPKTTLSSLDGYTMRWPLLSGGRVGWPKGVPRIAMLDDETRSSGSRAANRRFSPP
Ga0126371_1022008423300021560Tropical Forest SoilGLDTSARCNPEDIAVHAPQPSLTSERPRPADTREPGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEAREAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSVFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD
Ga0126371_1161238913300021560Tropical Forest SoilGLDTSARCNPEDIAVHAPQPSSTSERPCPADTREPGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRRAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVRLILAEDPLRETVTADGPISLQGMFEVGYSVIAASSSVGGLTGDSPAHSAMEQRITAAANIRTAYERGPPQTYRPIFDSRVKKSPRLYVDKL
Ga0207684_1022347923300025910Corn, Switchgrass And Miscanthus RhizosphereMRALASAMLMIAAMFTRFATPFAGQVFADSVEAGTVEAGLDTSELCNPDNIYVPQPAAPSGRPCPAHVGETDETARARAYLIATATPGYTMTRQGPERAIERLHPEFVKRLAAAIAEARGAGMPMAGIFSAYRPPAFGVGGFADKFYSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPYSLLEWNHCQPTWVKIILADDPLRETVTADGPITLEGMFELGGSVIAASGSLSGETAGPPQLFKPPQLRTTMPVEQRITAATSIKTAQAWRSPHPDRSIFNRRAMKRPQLSVGKLGWPNGVPRIAILDAELRKSMSSARSRISVLKPIVMPRITVAIMPPESNQR
Ga0207693_1035074113300025915Corn, Switchgrass And Miscanthus RhizosphereSPHAAISGASRSERSIPAASSIPARVSDELLVDPASGSAPRTEAEITEARAYLVETASPGYTMTLQGPELAIGRLNPEFAVRLASAIREARSAGLSFAGVFSAYRPPAFGVGGFSDKFNSLHTYGLAVDMHGIGSPGSSEAQLWHQIAAKNGVVCPYGPRNRAEWNHCQPTTVKIILADNPLRETVTLKGPINLQGMFEVGDSVIAASSSVGAPTGDPPAHSAMEQRITAPANIRTAYERKRPQTYRSIFDSRAKKSSRLYVDNWPRGVPRIANLDDEPRRSKSSAASRTPLKPIVMPRMIVTTARPKVTGARQAGPGST
Ga0207646_1041792513300025922Corn, Switchgrass And Miscanthus RhizosphereMRALASAMLMIAAMFTRFATPFAGQVFADSVEAGTVEAGLDTSELCNPDNIYVPQPAAPSGRPCPAHVGETDETARARAYLIATATPGYTMTRQGPERAIERLHPEFVKRLAAAIAEARGAGMPMAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPYSLLEWNHCQPTWVKIILADDPLRETVTADGPITLEGMFELGGSVIAASGSLSGETAGPPQLFKPPQLRTTMPVEQRITAATSIKTAQAWRSPHSDRSIFNRRAMKRPQLSVGKLGWPNGVPRIAIL
Ga0209647_102410933300026319Grasslands SoilMRPFGLPTLIMGAVLSAFVASSAGQWVLADSIEPVTVEAENGTPAICNSDDLSVYVPQLPSTSERPCPAHTREADETARARAYLIATATPGYTMTRQGPERAIERLHPEFANRLAAAIAEARAAGLPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPGSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTRFFNPPRPGAAMPVEHRLTAAFIKTAYEGERSQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIADLADEPRRPKSSAKNRM
Ga0209648_1011398913300026551Grasslands SoilMRPFGLPTLIMGAVLSAFVASSAGQWVLADSIEPVTLETENGTPAICNSDDISVYVPQPSSTSESPCPAHARETGETARARAYLIATATPGYTMTRQGPERAIGRLHPQFVNRLAAAIAEARGAGLLFAGVFSAYRPPAFGVGGFVDKFHSLHTYGLAVDVTGIGGPGTPEALLWHEIAARHGVICPYGPHNLVEWNHCQPTRVKIIVAENPLRETVTADGPISLEDMFEVGYSLIAGSGGIDGPSADPPAHFFKPPLARPAILEPRINVAATIKTAPNRRVSQTDRSVFNGGPIQRPRLSGDKLGWPKGVAPIVDLSDQPRGQTSPAKSRRLPPKPIVMPRIMADYSAGRK
Ga0209465_1006586113300027874Tropical Forest SoilMRVFASTMLMIGAVLSAFTTPFPGQWVLADSVEPGTEEAGLDTSALCNPEDIAVHAPQPSSTSERPCPADTREPGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEAREAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFRPPQARTAMPLEQRITSAANIKTAYERGPPQTDRSIFDSRAMKSPFSVGKSGWPKGVPRIANLADEPRRPNSPANRTLLLKPIVMPRMIAAMGQGR
Ga0307312_1004291333300028828SoilMQTFRLSTLTALTMLAAFASRFPGQRVLADPVEPVTLERAEGLQAICNSEDIVVYGSQPPSASPCPVNAKDAGETARARAYLIATATPGYTMMRQGPERAIGRLHPEFVNRLAGAIAAARGSGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAAGTPAALLWHDIATRHRLICPYGPYDLGEWNHCQPTRVKIIFANDPLRETVTADGPISLESMFEIGYSLIAASDSTAGPSADSTTHFFGPSQPRSLMPVEVRLNPAASIATAYRWRAPQNTRPIFNPGPTKPQPSVRKLVVWPKGVPRIAILEQEPRGLKSRPTSVILPMKPLAMPQILAGYSPGRK
Ga0307308_1014374723300028884SoilTLERTEGLQAICNSEDIVVYGSQPPSASPCPVNAKDAGETARARAYLIATATPGYTMMRQGPERAIGRLHPEFVNRLAGAIAAARGSGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAAGTPAALLWHDIATRHRLICPYGPYDLGEWNHCQPTRVKIIFANDPLRETVTADGPISLESMFEIGYSLIAASDSTAGPSADSTTHFFGPSQPRSLMPVEVRLNPAASIATAYRWRAPQNTRPIFNPGPTKPQPSVRKLVVWPKGVPRIAILEQEPRGLKSRPTSVILPMKPLAMPQILAGYSPGRK
Ga0307497_1007959413300031226SoilFRLSTLTALTMLAAFASRFPGQRVLADPVEPVTLERAEGLQAICNSEDIVVYGSQPPSASPCPVNAKDAGETARARAYLIATATPGYTMMRQGPERAIGRLHPEFVNRLAGAIAAARGSGLPFAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGAAGTPAALLWHDIATRHRLICPYGPYDLGEWNHCQPTRVKIIFANDPLRETVTADGPISLESMFEIGYSLIAASDSTAGPSADSTTHFFGPSQPRSLMPVEVRLNPAASIATAYRWRAPQNTRPIFNPGPTKPQPSVRKLVVWPKGVPRIAILEQEPRGLKSRPTSVILPMKPLAMPQILAGYSPGRK
Ga0318541_1010166613300031545SoilTAVLSAFATPFAGQCVFADPVEPGTEEAGLDTSALCNPEDTVVYAPQAPSISERPCPVHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVDGPTAAPPAYFFRLPQARTAMPLEQRITAAANIKTAYASGPRQTDRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGSHTVMLRPIVMPRIIAATMRAD
Ga0310915_1010409223300031573SoilVEPGTEEAGLDTSAPCTPEDTVAYAAQAPSTSERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD
Ga0306917_1018938923300031719SoilMGGRRAFRSSAGGVQSPRLAELLKASTVGLCSGKAKEWDRMRALASAMLMIGGVLSAYASPSDGQGVFADSVEPRTVEAGLDTSARCNPEDIAVYGPQLPPTSDRPCPAHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVSADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPRVGAAMPVEQRLTATALIQTAYEGGRLRQTTRSIFDSRAMKGPFSVGKSGWPKGVPRIA
Ga0307469_1117292813300031720Hardwood Forest SoilVVDRASGSAPRTEAEIAEARAYMIETASPGYTMTLQTPEVAIGRLNPEFAVRLASAIREARSAGLSFAGVFSAYRPPAFGVGGFSDKFNSLHTYGLAVDMHGIGSPGSSEAQLWHQIAAKNGVVCPYGPRNRAEWNHCQPTTVKIILADNPLRETVTLKGPINLQGMFEVGDSVIAASSSVGAPTGDPPAHSAMEQRITAPANIRTAYERKRPQTYRSIFDSRAKKSSRLYVDNWPRGVP
Ga0306918_1046570413300031744SoilCNPEDTVVYAPQAPSISERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVYGPTAAPPAYFFRPPQARTAMPLEQRITAAANIKTAYASGPRQTDRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGSHTVMLRPIVMPRIIAATMRAD
Ga0318492_1004470223300031748SoilVEPGTEEAGLDTSAPCTPEDTVAYAAQAPSTSERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAALSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD
Ga0318546_1035657513300031771SoilTVGLCSGKAKEWDRMRALASAMLMIGGVLSAYASASAGQGVFADSVEPRIVEAGLDTSARCNPEDIAVYGSQLPPTSERPCPEHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVSADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPRVGAAMPVEQRLTATALIQTAYEGGRLRQTTRSIFDSRAMKGPFSVGKSGWPKGVPRIANLIPRRPN
Ga0310917_1026609813300031833SoilELLKSSTVELCRRKAKEWDRMRAFASAMLMIGGVLSAYASPSDGQGVFADSVEPRTVEAGLDTSARCNPEDIAVYGPQLPPTSDRPCPAHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLDGMFEVGSSVIAASSSVGASNADPPTHLFNPPRVGAAMPVEQRLTATALIQTAYEGGRLRQTTRSIFDSRAMKGPFSVGKSGWPKGVPRIANLIPRRPNSSAKNHPLLLKPIVMPRIAAMAQGR
Ga0318517_1005039823300031835SoilMRVFASTMLIIGAVLSAFATPFAGQCVFADPVEPGTEEAGLDTSAPCTPEDTVAYAAQAPSTSERPCPAHTREGGETARARAYLIATATPGYTMTRQGPQRAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD
Ga0318495_1014481413300031860SoilEPGTEEAGLDTSAPCTPEDTVAYAAQAPSTSERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDSLRETVTADGPISLQDMFEVGYSVIAASSSVGAPTAHPPAHFFKLPQARTAMPVEQRITAAANIKTAYASGPPQTDRSIFDSRAMKSPRLSVGKLGWPKGVPRIANLDDEPRRPKSSARSRTSVLRPIVMPRIIAATTRAD
Ga0306925_1020174523300031890SoilMGGRRAFRSSAGGVQSPRLAELLKASTVGLCSGKAKEWDRMRALASAMLMIGGVLSAYASASAGQGVFADSVEPRIVEAGLDTSARCNPEDIAVYGSQLPPTSERPCPEHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVSADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPRVGAAMPVEQRLTATALIQTAYEGGRLRQTTRSIFDSRAMKGPFSVGKSGWPKGVPRIANLIPRRPNSSAKNHPLLLKPIVMPRIAAMAQGR
Ga0306925_1043766813300031890SoilPSISERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVDGPTAAPPAYFFRPPQARTAMPLEQRITAAANIKTAYASGPRQTDRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGSHTVMLRPIVMPRIIAATMRAD
Ga0318536_1022325813300031893SoilRTVEAGLDTSARCNPEDIAVYGPQLPPTSDRPCPAHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPPAVAAMPVEQRPTAAYERGRSQTDRSIFDSRVMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRTLLLQPIVMPRMIAAMAQGR
Ga0306923_1012447143300031910SoilCPAHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFTDKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVSADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPPAVAAMPVEQRPTAAYERGRSQTDRSIFDSRVMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRTLLLQPIVMPRMIAAMAQGR
Ga0306923_1048519523300031910SoilMIGAVLGAFTTPFAGQWVFADSVEPGTAEAEFDTSALCSSEDIAVYAPQPSSTSERPCPAQTSEADEAARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGLPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGRPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKLILAEDPLRETVTVDGPISLEGMFEVGSSVIAASSSVGAKNAYPPTHFSNPPRVGAAMPGEHRLTAAALIQTAYERGRPQTDRSIFDSRYEKPVLCRQVWLAE
Ga0310916_1050217123300031942SoilMRVFASAMLTIEAVLSAFTTPFAGQWVFADPVEPGIEEAGLDTSALCNPEDTVVYAPQAPSTSERPCPAHPREVAETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARGAGLPFAGIFSAYRPPVFGVGGFADKFHSLHTYGLAVDVTGIGAPGTPSALLWHEIAARNGVICPYGPYNPLEWNHCQPTWVKIILSDNPLRETVTADGPISLEDMFEVGYSLIAASGTGGMSTGDPPAQFFKPPQARPAMAIYPQAKIANNRITPKTTLSTSDGYTMRWPPL
Ga0310910_1035474323300031946SoilVEAGLDTSARCNPEDIAVHAPQPSSTSERPCPADTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIIFAEDPLRDTVTADGPISLEGMFEVGSSVIAASSSVGASSADPPTHFFNPPRAGTAIPVEQRPTAAAFIRTAYEDGRLQTDRSIFDSRAMKSLFSVGKSGWPKGVPRIANLADEPRRPKSSVKNRTLSLKPIVMPRMIAAMAQGR
Ga0306926_1086722913300031954SoilVLSAFATPFAGQCVFADPVEPGTEEAGLDTSALCNPEDTVVYAPQAPSISERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVYGPTAAPPAYFFRPPQARTAMPLEQRITAAANIKTAYASGPRQTYRS
Ga0318531_1018956213300031981SoilCNPEDTVVYAPQAPSISERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVDGPTAAPPAYFFRLPQARTAMPLEQRITAAANIKTAYASGPRQTDRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGSHTVMLRPIVMPRIIAATMRAD
Ga0306922_1127766513300032001SoilPCPVHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVDGPTAAPPAYFFRLPQARTAMPLEQRITAAANIKTAYASGPRQTDRSIFDSRAMKSPRLFVGKLGWPKEV
Ga0318549_1016816513300032041SoilLDTSALCNPEDTVVYAPQAPSISERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVDGPTAAPPAYFFRLPQARTAMPLEQRITAAANIKTAYASGPRQTDRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGSHTVMLRPIVMPRIIAAAMRAD
Ga0318533_1031429413300032059SoilPQAPSISERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVDGPTAAPPAYFFRLPQARTAMPLEQRITAAANIKTAYASGPRQTYRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGSRTVMLRPIVMPRIIAATMRAD
Ga0306924_1119443813300032076SoilATPFAGQCVFADPVEPGTEEAGLDTSALCNPEDTVVYAPQAPSISERPCPAHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVYGPTAAPPAYFFRPPQARTAMPLEQRITAAANIKTAYASGPRQTYRSIFD
Ga0318540_1032068213300032094SoilATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVYGPTAAPPAYFFRPPQARTAMPLEQRITAAANIKTAYASGPRQTDRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGSHTVML
Ga0307471_10196489413300032180Hardwood Forest SoilYLIETASPGYTMTLQGPEVAIGRLHPEFAVRLESAIREARSAGLPFAGVFSAYRPPAFGIGGFSDKFNSLHTYGLAVDMHGIGSPGSPEAQLWHQIAAKNGLVCPYGPRARTEWNHCQPTSVKIILAENPLRETVRADGPISLQVMFEVGYSVIVASSSVGAPTGDPPAHSAMEQRITAPAKIRTAYERRRPQTYGSIFDSRVKESSRLYVDNWPKAVPRIANLDDEPRRPKSSATSRTLLKPI
Ga0307472_10020787023300032205Hardwood Forest SoilPSKTAMSPGYTIASSPHAAISGASRSERSIPAASSIPARVSDELLVDPASGSAPRTEAEITEARAYLVETASPGYTMTLQGPELAIGRLNPEFAIRLASAIREARSAGLSFAGVFSAYRPPAFGVGGFSDKFNSLHTYGLAVDMHGIGSPGSSEAQLWHQIAAKNGVVCPYGPRNRAEWNHCQPTTVKIILADNPLRETVTLKGPINLQGMFEVGDSVIAASSSVGAPTGDPPAHSAMEQRITAPANIRTAYERKRPQTYRSIFDSRAKKSSRLYVDNWPRGVPRIANLDDEPRRPKSSATSRTPLKPIVMPRMIVTTARPKVTGARQAGPGST
Ga0306920_10022713743300032261SoilSPRLAELLKASTVGLCSGKAKEWDRMRALASAMLMIGGVLSAYASASAGQGVFADSVEPRIVEAGLDTSARCNPEDIAVYGSQLPPTSERPCPEHTGEADETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAITEARAAGLSLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGSPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTWVKIILAEDPLRETVTADGPISLEGMFEVGSSVIAASSSVGASNADPPTHLFNPPPAVAAMPVEQRPTAAYERGRSQTDRSIFDSRVMKSPFSVGKSGWPKGVPRIANLADEPRRPKSSAKNRTLLLQPIVMPRMIAAMAQGR
Ga0306920_10132814213300032261SoilRPCPVHTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARAAGMPLAGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPHNLLEWNHCQPTRVKIILAEDPLRETVTADGPISLQDMFEVGYSVIAASSSVDGPTAAPPAYFFRPPQARTAMPLEQRITAAANIKTAYASGPRQTDRSIFDSRAMKSPRLFVGKLGWPKEVPRIANLDDEPRRPKSSAGSHTVMLRPIVMPRIIAATMRAD
Ga0310914_1015821313300033289SoilCNPEDIAVHAPQPSSTSERPCPADTREAGETARARAYLIATATPGYTMTRQGPERAIERLHPEFVNRLAAAIAEARGAGLPSVGIFSAYRPPAFGVGGFADKFHSLHTYGLAVDVTGIGGPGTPSSLLWHEIAARHGVICPYGPYNLLEWNHCQPTWVKIIFAEDPLRDTVTADGPISLEGMFEVGSSVIAASSSVGASSADPPTHFFNPPRAGTAIPVEQRPTAAAFIRTAYEDGRLQTDRSIFDSRAMKSLFSVGKSGWPKGVPRIANLADEPRRPKSSVKNRTLSLKPIVMPRMIAAMAQGR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.