NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F038103


Overview

Basic Information
Family ID F038103
Family Type Metagenome / Metatranscriptome
Number of Sequences 166
Average Sequence Length 120 residues
Representative Sequence MPERNGAETRRSGRVTLRVPLKIYEPGSNNRFLGEEAYAVKVSLWGGLIAFGAAVDRDQKLFVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCQSQTRQYAN
Number of Associated Samples 113
Number of Associated Scaffolds 166

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 76.51 %
% of genes near scaffold ends (potentially truncated) 28.31 %
% of genes from short scaffolds (< 2000 bps) 65.66 %
Associated GOLD sequencing projects 102
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.60

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Bacteria (53.012 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (30.723 % of family members)
Environment Ontology (ENVO) Unclassified (43.976 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (54.217 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 3.33%, β-sheet: 32.67%, Coil/Unstructured: 64.00%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.60

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
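
The confidence rule stated above (pTM < 0.7 marks a low confidence model) and the four pLDDT bands from the viewer legend can be expressed as a small check. This is a minimal sketch, not the database's own code: the function names are hypothetical, and treating each band's upper bound as inclusive is an assumption.

```python
# Threshold taken from the page text: pTM < 0.7 is a low confidence model.
PTM_THRESHOLD = 0.7

# Upper bounds of the pLDDT bands listed in the legend (0-50, 51-70, 71-90, 91-100).
# Assumption: a score equal to an upper bound belongs to the lower band.
PLDDT_BANDS = [(50.0, "0-50"), (70.0, "51-70"), (90.0, "71-90"), (100.0, "91-100")]


def is_low_confidence(ptm: float) -> bool:
    """Return True when the pTM-score falls below the page's 0.7 cut-off."""
    return ptm < PTM_THRESHOLD


def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to its legend band."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    for upper, label in PLDDT_BANDS:
        if score <= upper:
            return label
    raise AssertionError("unreachable: score was validated above")


if __name__ == "__main__":
    # F038103 reports pTM-score 0.60, so it is flagged as low confidence.
    print(is_low_confidence(0.60))
    print(plddt_band(85.2))
```

With the value reported for this family (0.60), the check agrees with the "Low Quality Model" label shown on the page.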



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 166 Family Scaffolds
PF07238 | PilZ | 6.02
PF02371 | Transposase_20 | 3.01
PF13620 | CarboxypepD_reg | 2.41
PF13460 | NAD_binding_10 | 1.81
PF07883 | Cupin_2 | 1.81
PF00072 | Response_reg | 1.20
PF00106 | adh_short | 1.20
PF00196 | GerE | 1.20
PF01546 | Peptidase_M20 | 1.20
PF00990 | GGDEF | 1.20
PF13505 | OMP_b-brl | 1.20
PF07690 | MFS_1 | 1.20
PF04366 | Ysc84 | 1.20
PF01925 | TauE | 0.60
PF13727 | CoA_binding_3 | 0.60
PF01037 | AsnC_trans_reg | 0.60
PF14110 | DUF4282 | 0.60
PF05593 | RHS_repeat | 0.60
PF05299 | Peptidase_M61 | 0.60
PF02621 | VitK2_biosynth | 0.60
PF03544 | TonB_C | 0.60
PF09900 | DUF2127 | 0.60
PF11154 | DUF2934 | 0.60
PF06537 | DHOR | 0.60
PF00589 | Phage_integrase | 0.60
PF00180 | Iso_dh | 0.60
PF16640 | Big_3_5 | 0.60
PF01381 | HTH_3 | 0.60
PF00239 | Resolvase | 0.60
PF01799 | Fer2_2 | 0.60
PF13424 | TPR_12 | 0.60
PF14329 | DUF4386 | 0.60
PF16861 | Carbam_trans_C | 0.60
PF12697 | Abhydrolase_6 | 0.60
PF00083 | Sugar_tr | 0.60
PF06262 | Zincin_1 | 0.60
PF13365 | Trypsin_2 | 0.60
PF02770 | Acyl-CoA_dh_M | 0.60
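
The frequencies above are percentages of the 166 family scaffolds, so each can be turned back into an approximate scaffold count. The sketch below assumes the percentages were rounded to two decimals from whole-number counts; the helper name `scaffold_count` is hypothetical.

```python
FAMILY_SCAFFOLDS = 166  # "Number of Associated Scaffolds" for family F038103


def scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Recover a whole-number scaffold count from a rounded percentage."""
    return round(percent / 100 * total)


# A few rows copied from the Pfam neighborhood table.
pfam_freq = {
    "PilZ": 6.02,
    "Transposase_20": 3.01,
    "CarboxypepD_reg": 2.41,
    "NAD_binding_10": 1.81,
    "Response_reg": 1.20,
    "TauE": 0.60,
}

counts = {name: scaffold_count(pct) for name, pct in pfam_freq.items()}

if __name__ == "__main__":
    for name, n in counts.items():
        print(f"{name}: {n} of {FAMILY_SCAFFOLDS} scaffolds")
```

Read this way, the most frequent neighbor (PilZ, 6.02 %) corresponds to roughly 10 scaffolds, and every 0.60 % row to a single scaffold.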

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 166 Family Scaffolds
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 3.01
COG2930 | Lipid-binding SYLF domain, Ysc84/FYVE family | Lipid transport and metabolism [I] | 1.20
COG0308 | Aminopeptidase N, contains DUF3458 domain | Amino acid transport and metabolism [E] | 0.60
COG0730 | Sulfite exporter TauE/SafE/YfcA and related permeases, UPF0721 family | Inorganic ion transport and metabolism [P] | 0.60
COG0810 | Periplasmic protein TonB, links inner and outer membranes | Cell wall/membrane/envelope biogenesis [M] | 0.60
COG1427 | Chorismate dehydratase (menaquinone biosynthesis, futalosine pathway) | Coenzyme transport and metabolism [H] | 0.60
COG1960 | Acyl-CoA dehydrogenase related to the alkylation response protein AidB | Lipid transport and metabolism [I] | 0.60
COG1961 | Site-specific DNA recombinase SpoIVCA/DNA invertase PinE | Replication, recombination and repair [L] | 0.60
COG2452 | Predicted site-specific integrase-resolvase | Mobilome: prophages, transposons [X] | 0.60
COG3209 | Uncharacterized conserved protein RhaS, contains 28 RHS repeats | General function prediction only [R] | 0.60
COG3488 | Uncharacterized conserved protein with two CxxC motifs, DUF1111 family | General function prediction only [R] | 0.60
COG3824 | Predicted Zn-dependent protease, minimal metalloprotease (MMP)-like domain | Posttranslational modification, protein turnover, chaperones [O] | 0.60
COG3975 | Predicted metalloprotease, contains C-terminal PDZ domain | General function prediction only [R] | 0.60



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 53.01 %
Unclassified | root | N/A | 46.99 %


Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001593|JGI12635J15846_10165566All Organisms → cellular organisms → Bacteria1499Open in IMG/M
3300002245|JGIcombinedJ26739_100116142All Organisms → cellular organisms → Bacteria2517Open in IMG/M
3300002681|Ga0005471J37259_111348Not Available541Open in IMG/M
3300002907|JGI25613J43889_10010083All Organisms → cellular organisms → Bacteria2613Open in IMG/M
3300002910|JGI25615J43890_1031782Not Available873Open in IMG/M
3300002914|JGI25617J43924_10041880All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1654Open in IMG/M
3300004082|Ga0062384_100005587All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4408Open in IMG/M
3300004091|Ga0062387_101112447Not Available613Open in IMG/M
3300004132|Ga0058902_1213904Not Available504Open in IMG/M
3300004631|Ga0058899_11979444All Organisms → cellular organisms → Bacteria → Acidobacteria2308Open in IMG/M
3300005439|Ga0070711_100700166All Organisms → cellular organisms → Bacteria → Acidobacteria853Open in IMG/M
3300005445|Ga0070708_100244569All Organisms → cellular organisms → Bacteria → Acidobacteria1685Open in IMG/M
3300005445|Ga0070708_101389745Not Available655Open in IMG/M
3300005467|Ga0070706_100407348All Organisms → cellular organisms → Bacteria1266Open in IMG/M
3300005468|Ga0070707_100134925All Organisms → cellular organisms → Bacteria → Acidobacteria2401Open in IMG/M
3300005518|Ga0070699_102211620Not Available502Open in IMG/M
3300005536|Ga0070697_100115351All Organisms → cellular organisms → Bacteria2243Open in IMG/M
3300005602|Ga0070762_10255752Not Available1090Open in IMG/M
3300005610|Ga0070763_10479745Not Available709Open in IMG/M
3300005921|Ga0070766_10040547Not Available2581Open in IMG/M
3300005921|Ga0070766_11008184Not Available573Open in IMG/M
3300005921|Ga0070766_11269818Not Available510Open in IMG/M
3300006086|Ga0075019_10731347Not Available627Open in IMG/M
3300006102|Ga0075015_100410425Not Available766Open in IMG/M
3300007258|Ga0099793_10001604All Organisms → cellular organisms → Bacteria → Acidobacteria7310Open in IMG/M
3300007258|Ga0099793_10008692All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae3853Open in IMG/M
3300007258|Ga0099793_10012889All Organisms → cellular organisms → Bacteria3285Open in IMG/M
3300007258|Ga0099793_10183431All Organisms → cellular organisms → Bacteria → Acidobacteria1000Open in IMG/M
3300007265|Ga0099794_10164536Not Available1129Open in IMG/M
3300009038|Ga0099829_10277500All Organisms → cellular organisms → Bacteria1371Open in IMG/M
3300009088|Ga0099830_10747436Not Available806Open in IMG/M
3300009089|Ga0099828_10073105All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2908Open in IMG/M
3300009089|Ga0099828_10077496All Organisms → cellular organisms → Bacteria → Acidobacteria2830Open in IMG/M
3300010329|Ga0134111_10444932Not Available561Open in IMG/M
3300010359|Ga0126376_10340963All Organisms → cellular organisms → Bacteria → Acidobacteria1324Open in IMG/M
3300010366|Ga0126379_13273564Not Available542Open in IMG/M
3300010376|Ga0126381_101364423Not Available1025Open in IMG/M
3300010859|Ga0126352_1193051Not Available699Open in IMG/M
3300011120|Ga0150983_11401104Not Available750Open in IMG/M
3300011269|Ga0137392_10600147Not Available913Open in IMG/M
3300011271|Ga0137393_10648123Not Available905Open in IMG/M
3300011271|Ga0137393_11679360Not Available524Open in IMG/M
3300012096|Ga0137389_10000501All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium21548Open in IMG/M
3300012096|Ga0137389_10190847Not Available1701Open in IMG/M
3300012189|Ga0137388_10157091All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2019Open in IMG/M
3300012203|Ga0137399_11289580Not Available614Open in IMG/M
3300012205|Ga0137362_10326392Not Available1329Open in IMG/M
3300012361|Ga0137360_10214174All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1566Open in IMG/M
3300012361|Ga0137360_11907974Not Available500Open in IMG/M
3300012362|Ga0137361_10348343All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1359Open in IMG/M
3300012363|Ga0137390_10025804All Organisms → cellular organisms → Bacteria → Acidobacteria5531Open in IMG/M
3300012582|Ga0137358_10022319All Organisms → cellular organisms → Bacteria4089Open in IMG/M
3300012683|Ga0137398_10602738Not Available760Open in IMG/M
3300012685|Ga0137397_11139985Not Available565Open in IMG/M
3300012918|Ga0137396_10013769All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae5038Open in IMG/M
3300012918|Ga0137396_10166123All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1613Open in IMG/M
3300012918|Ga0137396_10249083All Organisms → cellular organisms → Bacteria → Acidobacteria1310Open in IMG/M
3300012918|Ga0137396_10329688Not Available1129Open in IMG/M
3300012925|Ga0137419_10389384Not Available1087Open in IMG/M
3300012927|Ga0137416_11513451Not Available610Open in IMG/M
3300012930|Ga0137407_11087332Not Available758Open in IMG/M
3300012944|Ga0137410_10073664All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis2481Open in IMG/M
3300012951|Ga0164300_10855941Not Available570Open in IMG/M
3300012957|Ga0164303_10035531All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Peregrinibacteria → Candidatus Peribacteria → unclassified Candidatus Peribacteria → Candidatus Peribacteria bacterium2090Open in IMG/M
3300012957|Ga0164303_10367198Not Available876Open in IMG/M
3300012958|Ga0164299_10690795All Organisms → cellular organisms → Bacteria → Acidobacteria712Open in IMG/M
3300012961|Ga0164302_10663283Not Available766Open in IMG/M
3300012986|Ga0164304_10724753All Organisms → cellular organisms → Bacteria → Acidobacteria758Open in IMG/M
3300014501|Ga0182024_10005741All Organisms → cellular organisms → Bacteria27626Open in IMG/M
3300014501|Ga0182024_11158082Not Available908Open in IMG/M
3300014969|Ga0157376_13091318Not Available504Open in IMG/M
3300015054|Ga0137420_1089623All Organisms → cellular organisms → Bacteria1903Open in IMG/M
3300020199|Ga0179592_10000448All Organisms → cellular organisms → Bacteria → Acidobacteria16365Open in IMG/M
3300020199|Ga0179592_10056767All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1784Open in IMG/M
3300020579|Ga0210407_10012861All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6223Open in IMG/M
3300020579|Ga0210407_10014959All Organisms → cellular organisms → Bacteria5768Open in IMG/M
3300020579|Ga0210407_10224113Not Available1463Open in IMG/M
3300020579|Ga0210407_10280299All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1300Open in IMG/M
3300020579|Ga0210407_10363443All Organisms → cellular organisms → Bacteria → Acidobacteria1132Open in IMG/M
3300020580|Ga0210403_10000123All Organisms → cellular organisms → Bacteria88589Open in IMG/M
3300020580|Ga0210403_10000628All Organisms → cellular organisms → Bacteria36205Open in IMG/M
3300020580|Ga0210403_10002618All Organisms → cellular organisms → Bacteria16174Open in IMG/M
3300020580|Ga0210403_10014208All Organisms → cellular organisms → Bacteria → Acidobacteria6417Open in IMG/M
3300020580|Ga0210403_10014395All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6372Open in IMG/M
3300020580|Ga0210403_10022782All Organisms → cellular organisms → Bacteria5002Open in IMG/M
3300020580|Ga0210403_10032282All Organisms → cellular organisms → Bacteria4186Open in IMG/M
3300020580|Ga0210403_10167794All Organisms → cellular organisms → Bacteria → Acidobacteria1798Open in IMG/M
3300020580|Ga0210403_10488028Not Available1001Open in IMG/M
3300020580|Ga0210403_11028916Not Available643Open in IMG/M
3300020581|Ga0210399_10000726All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae26031Open in IMG/M
3300020581|Ga0210399_10024082All Organisms → cellular organisms → Bacteria4835Open in IMG/M
3300020581|Ga0210399_10183600All Organisms → cellular organisms → Bacteria → Acidobacteria1737Open in IMG/M
3300020581|Ga0210399_10662111All Organisms → cellular organisms → Bacteria → Acidobacteria860Open in IMG/M
3300020583|Ga0210401_10069863All Organisms → cellular organisms → Bacteria3308Open in IMG/M
3300021170|Ga0210400_10064831All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2852Open in IMG/M
3300021170|Ga0210400_10706773Not Available829Open in IMG/M
3300021171|Ga0210405_10004160All Organisms → cellular organisms → Bacteria → Acidobacteria13727Open in IMG/M
3300021178|Ga0210408_10663700Not Available823Open in IMG/M
3300021180|Ga0210396_10005151All Organisms → cellular organisms → Bacteria12684Open in IMG/M
3300021180|Ga0210396_10047318All Organisms → cellular organisms → Bacteria3947Open in IMG/M
3300021181|Ga0210388_11655501Not Available530Open in IMG/M
3300021401|Ga0210393_10074318Not Available2686Open in IMG/M
3300021403|Ga0210397_10254127Not Available1277Open in IMG/M
3300021403|Ga0210397_10713076Not Available772Open in IMG/M
3300021405|Ga0210387_10115764Not Available2259Open in IMG/M
3300021405|Ga0210387_11068601Not Available705Open in IMG/M
3300021406|Ga0210386_11436105Not Available577Open in IMG/M
3300021407|Ga0210383_10030880All Organisms → cellular organisms → Bacteria4480Open in IMG/M
3300021407|Ga0210383_10990213Not Available713Open in IMG/M
3300021432|Ga0210384_10084026All Organisms → cellular organisms → Bacteria2862Open in IMG/M
3300021477|Ga0210398_10023123All Organisms → cellular organisms → Bacteria5277Open in IMG/M
3300021477|Ga0210398_11140428Not Available618Open in IMG/M
3300021478|Ga0210402_10019388All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae5835Open in IMG/M
3300021478|Ga0210402_11100578All Organisms → cellular organisms → Bacteria → Acidobacteria722Open in IMG/M
3300021479|Ga0210410_10001752All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae19970Open in IMG/M
3300021479|Ga0210410_10687533Not Available904Open in IMG/M
3300021560|Ga0126371_11027446Not Available965Open in IMG/M
3300022523|Ga0242663_1111986Not Available555Open in IMG/M
3300022532|Ga0242655_10074669All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium886Open in IMG/M
3300022532|Ga0242655_10200461Not Available610Open in IMG/M
3300022722|Ga0242657_1251373Not Available507Open in IMG/M
3300022724|Ga0242665_10001830All Organisms → cellular organisms → Bacteria3161Open in IMG/M
3300024330|Ga0137417_1385066All Organisms → cellular organisms → Bacteria2306Open in IMG/M
3300025910|Ga0207684_10508858All Organisms → cellular organisms → Bacteria1032Open in IMG/M
3300026320|Ga0209131_1006790All Organisms → cellular organisms → Bacteria7305Open in IMG/M
3300026376|Ga0257167_1058465Not Available598Open in IMG/M
3300026377|Ga0257171_1074492Not Available596Open in IMG/M
3300026551|Ga0209648_10198348All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1535Open in IMG/M
3300027562|Ga0209735_1115073Not Available586Open in IMG/M
3300027643|Ga0209076_1001592All Organisms → cellular organisms → Bacteria → Acidobacteria4696Open in IMG/M
3300027643|Ga0209076_1092040All Organisms → cellular organisms → Bacteria → Acidobacteria861Open in IMG/M
3300027655|Ga0209388_1149733All Organisms → cellular organisms → Bacteria → Acidobacteria660Open in IMG/M
3300027660|Ga0209736_1017329All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2239Open in IMG/M
3300027671|Ga0209588_1029847All Organisms → cellular organisms → Bacteria1740Open in IMG/M
3300027862|Ga0209701_10094371All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium1875Open in IMG/M
3300027889|Ga0209380_10147716Not Available1373Open in IMG/M
3300027898|Ga0209067_10719330Not Available579Open in IMG/M
3300027910|Ga0209583_10151966All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium948Open in IMG/M
3300028047|Ga0209526_10087648All Organisms → cellular organisms → Bacteria2191Open in IMG/M
3300028047|Ga0209526_10999360Not Available500Open in IMG/M
3300028536|Ga0137415_10018345All Organisms → cellular organisms → Bacteria → Acidobacteria6943Open in IMG/M
3300028536|Ga0137415_10069141All Organisms → cellular organisms → Bacteria → Acidobacteria3395Open in IMG/M
3300028906|Ga0308309_11442322Not Available589Open in IMG/M
3300028906|Ga0308309_11862742Not Available508Open in IMG/M
3300029636|Ga0222749_10397472Not Available730Open in IMG/M
3300030596|Ga0210278_1150752Not Available571Open in IMG/M
3300030730|Ga0307482_1188274Not Available621Open in IMG/M
3300031231|Ga0170824_114903874Not Available580Open in IMG/M
3300031715|Ga0307476_10003478All Organisms → cellular organisms → Bacteria9406Open in IMG/M
3300031715|Ga0307476_11356269Not Available518Open in IMG/M
3300031716|Ga0310813_12292118Not Available512Open in IMG/M
3300031718|Ga0307474_10132874All Organisms → cellular organisms → Bacteria → Acidobacteria1872Open in IMG/M
3300031720|Ga0307469_11294754Not Available692Open in IMG/M
3300031740|Ga0307468_101875823Not Available570Open in IMG/M
3300031753|Ga0307477_10002695All Organisms → cellular organisms → Bacteria → Acidobacteria13561Open in IMG/M
3300031753|Ga0307477_10050383All Organisms → cellular organisms → Bacteria2864Open in IMG/M
3300031753|Ga0307477_10203884Not Available1375Open in IMG/M
3300031754|Ga0307475_10225328All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1501Open in IMG/M
3300031754|Ga0307475_10446675Not Available1039Open in IMG/M
3300031754|Ga0307475_11026242Not Available647Open in IMG/M
3300031754|Ga0307475_11237469Not Available580Open in IMG/M
3300031823|Ga0307478_11180423All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium638Open in IMG/M
3300031962|Ga0307479_10086632All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3033Open in IMG/M
3300031962|Ga0307479_10102349All Organisms → cellular organisms → Bacteria2783Open in IMG/M
3300031962|Ga0307479_10332040All Organisms → cellular organisms → Bacteria → Acidobacteria1503Open in IMG/M
3300032121|Ga0316040_121884Not Available501Open in IMG/M
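
Each scaffold identifier in the table above has the form `<number>|<scaffold name>`. The sketch below splits an identifier into its two parts and counts scaffolds per sample. It assumes the numeric prefix is the Taxon OID used in the Associated Samples table (the values match, e.g. 3300001593); the type and function names are hypothetical.

```python
from collections import Counter
from typing import NamedTuple


class ScaffoldId(NamedTuple):
    taxon_oid: str       # assumed to be the sample's Taxon OID
    scaffold_name: str   # scaffold name within that sample


def parse_scaffold_id(raw: str) -> ScaffoldId:
    """Split 'taxon_oid|scaffold_name' into its two components."""
    taxon_oid, sep, name = raw.partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldId(taxon_oid, name)


# Identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300001593|JGI12635J15846_10165566",
    "3300005921|Ga0070766_10040547",
    "3300005921|Ga0070766_11008184",
    "3300005921|Ga0070766_11269818",
]

# Number of family scaffolds contributed by each sample.
per_sample = Counter(parse_scaffold_id(s).taxon_oid for s in scaffold_ids)

if __name__ == "__main__":
    for oid, n in per_sample.items():
        print(oid, n)
```

Grouping by the prefix in this way is consistent with the page reporting 166 scaffolds from 113 samples: several samples contribute more than one scaffold.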




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 30.72%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 25.90%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 10.24%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 6.02%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.82%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 4.82%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.82%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.01%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 2.41%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.41%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 1.20%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 1.20%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.60%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.60%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.60%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 0.60%
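
The habitat lineages above can be rolled up to a coarser level to show how strongly the family is tied to terrestrial soil. The following is a minimal sketch that sums the listed percentages by the first two lineage levels; the helper `top_levels` is hypothetical, and the totals inherit the rounding of the published values.

```python
from collections import defaultdict

# (lineage, % of family members) copied from the habitat table above.
habitats = [
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 30.72),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 25.90),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 10.24),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 6.02),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 4.82),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil", 4.82),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 4.82),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 3.01),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds", 2.41),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil", 2.41),
    ("Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil", 1.20),
    ("Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost", 1.20),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil", 0.60),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil", 0.60),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere", 0.60),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil", 0.60),
]


def top_levels(lineage: str, depth: int = 2) -> str:
    """Keep only the first `depth` levels of an arrow-separated lineage."""
    parts = [part.strip() for part in lineage.split("→")]
    return " → ".join(parts[:depth])


totals = defaultdict(float)
for lineage, pct in habitats:
    totals[top_levels(lineage)] += pct

if __name__ == "__main__":
    for group, pct in sorted(totals.items(), key=lambda kv: -kv[1]):
        print(f"{group}: {pct:.2f}%")
```

Changing `depth` gives coarser or finer groupings, for example `depth=3` separates Sediment from Freshwater within the Aquatic rows.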




Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002681Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF120 (Metagenome Metatranscriptome, Counting Only)EnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300002910Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300004082Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3EnvironmentalOpen in IMG/M
3300004091Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004132Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF240 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004631Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300006086Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013EnvironmentalOpen in IMG/M
3300006102Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010859Boreal forest soil eukaryotic communities from Alaska, USA - C5-5 Metatranscriptome (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012951Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MGEnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300014501Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014969Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaGHost-AssociatedOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021477Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022523Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022532Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022722Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-12-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026376Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-BEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027562Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027660Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300027898Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030596Metatranscriptome of forest soil microbial communities from Boreal Montmorency Forest, Quebec, Canada - FO135-VCO085SO (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030730Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032121Soil microbial communities from Bohemian Forest, Czech Republic - CSA3 metaT (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1016556623300001593Forest SoilMPCRNEAGTRRSGRVTLRVPLRIYEPGSNKRFLVEEAFALKVSLWGGLVALRAAVNREQKLFLVNQATGETAESKVAYLGPMQLSGRRLRLVAIEFLKPSPGFWGLAFPTVDPSRSQTRPYAKVGSTDN*
JGIcombinedJ26739_10011614233300002245Forest SoilMPGRNGAEIRRSGRVTLRVPLKIFEPGSNKRFLVAEASAVKVSLWGGIIALGAAVNRDQKLFLLNQATGETAESKVAYLGPMQLGGRRLRLVAIEFLRPSPGFWGLSFPTVDSCRSQSRQYAKVGSGN*
Ga0005471J37259_11134813300002681Forest SoilMPERNGDETRRSGRVTLRVPLRIYERGSNNRFLGEEAYAVQVSLWGGLIAFGAAVERHQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRPHAN*
JGI25613J43889_1001008313300002907Grasslands SoilMPERNRAETRRSGRVTLRVPLKIYERGSNKPFLVEEAYSVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCQSQTRQYAN*
JGI25615J43890_103178223300002910Grasslands SoilMPLGTIEDCGPVVDSTCTYRPVTHPRSGPMPERNGAETRRSGRVTLRVPLKIYERDSNNRFLDEEAYAVNVSLWGGLIAFGAAVDRDQKLFLFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLEPSPGFWGVVFPSCSRAQRT*
JGI25617J43924_1004188013300002914Grasslands SoilMPERNGAETRRSGRVTLRVPLKIYEPGSNNRFLGEEAYAVKVSLWGGLIAFGAAVDRDQKLLVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTH
Ga0062384_10000558723300004082Bog Forest SoilMSGRNGTETRRSCRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRHQKLLLVNQATGEIAESQIVYLGPMHLGGRRLRLVAIEFLKASPGFWSIGFPSAVPCRVRAANHSN*
Ga0062387_10111244713300004091Bog Forest SoilMSGRNGTETRRSGRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRYQKLLLVNQATGESAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPSAVPCRARAANHSN*
Ga0058902_121390413300004132Forest SoilLPRSEDMLERNGAETRRSGRVTLRVPLKIYERGSNNRLLGEEAYAVQVSLWGGLIAFGAAVERHQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRPHAN*
Ga0058899_1197944423300004631Forest SoilMPGRKTGAETRRSGRVTLCVPLKIYEPDSNKYFLVEEASAVKVSLWGGLIALSVAVTPDQKLLIANQATGETAESQIVYLRPMEPSGKLNLVAIEFLKPSPGFWGV
Ga0070711_10070016623300005439Corn, Switchgrass And Miscanthus RhizosphereMPGRNGTKTRRSGRVTLRVPLRIYEPASNQRFLVEEASAVKVSLWGGLVALRTAVNPNQKLFLANQATGETAESKVVYLGPMQLGGKRLRLVAVEFLRPSPAFWGMVFPTVDPCRSQT
Ga0070708_10024456913300005445Corn, Switchgrass And Miscanthus RhizosphereMPERNGAETRRSGRVTLRVPLKVYERGSNKPSLVEEAYSVKVSLWGGLIAFGAAVDRDQKLFVSNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVGPCRSQTGRYSN*
Ga0070708_10138974513300005445Corn, Switchgrass And Miscanthus RhizospherePLKIYEPGSNKRFLVEEASAVKVSLWGGLIALRTAVNQDQKLSVVNQATGETAESKVVYLGPTQLSGGLRLVAIEFLRSSPDFWGMVFPAVDPFRSPTTRYAKAGSSHN*
Ga0070706_10040734813300005467Corn, Switchgrass And Miscanthus RhizosphereMPGRNGAETRRSGRVTLRVPLKIYEPGSNKRFLVEEASAVKVSLWGGLIALRTAVNQDQKLSVVNQATGETAESKVVYLGPTQLSGGLRLVAIEFLRSSPDFWG
Ga0070707_10013492523300005468Corn, Switchgrass And Miscanthus RhizosphereMPGRNGAETRRSGRVTLRVPLKIYEPGSNKRFLVEEASAVKVSLWGGLIALRTAVNQDQKLSVVNQATGETAESKVVYLGPTQLSGGLRLVAIEFLRSSPDFWGMVFPAVDPFRSPTTRYAKAGSSHN*
Ga0070699_10221162013300005518Corn, Switchgrass And Miscanthus RhizosphereMNRAQNNRFLVEEAYAVKVSLWGGLIALRTAVNKDQKLSVVNQATGETADSKVVHLGPMQLGGGLRLVGIEFLRSSPDFWGMVFPTVDPCRTQTRLCAKTGSSYN*
Ga0070697_10011535133300005536Corn, Switchgrass And Miscanthus RhizosphereMPERNGAETRRSGRVTLRVPLKIYERDSNKSSSSLIEEAYSVKVSLWGGLIAFGAVVDRDQKLLMLNQATGEMAESQVIYLRPMRLAGGLGMVAIKFLKPSPGFWGVVFPTADPCRSKTGQYAN*
Ga0070762_1025575223300005602SoilMLGRNGAETRRSGRATLRVPLRIYEPGSNNRFIVEEAYSVKVSLWGGLIALRTAVTKNQKLSMVNQATGETADSKVVHLGPMQLGGGLRLVAVEFLRSSPAFWGMVFPTVDPCRTPTTRYAKAGSSHN*
Ga0070763_1047974523300005610SoilMPGRKTGAETRRSGRVTLCVPLKIYEPDSNKYFLVEEASAVKVSLWGGLIALSVAVTPDQKLLIANQATGETAESQIVYLRPMEPSGKLNLVAIEFLKPSPGFWGVKFPTVDPCRSQTRQYPN*
Ga0070766_1004054743300005921SoilGRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRHQKLLLVNQATGETAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPSAVPCRVRAANHSN*
Ga0070766_1100818413300005921SoilMSGRNGTETRRSGRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRYQKLLLVNQATGESAESQIVYLGPMHLGGRRLRLVAIEFLKPS
Ga0070766_1126981813300005921SoilRNGAETRRSGRATLRVPLRIYEPGSNNRFIVEEAYSVKVSLWGGLIALRTAVTKDQKLSMVNEATGETADSKVVHLGPMQLGGGLRLVAVEFLRSSPAFWGMVFPTVDPCRTPTTRYAKAGSSHN*
Ga0075019_1073134713300006086WatershedsAETRCSGRVTLRVPLKIYEPDSDRYFLIEETCAVKVSLWGGLIALSAAVDRDQKLLVANQATGETAESQVVYLKPMELSGRLNLVAIEFLKPSPSFWGVNFPTVDPSRSQTMEYAN*
Ga0075015_10041042513300006102WatershedsMPERNGAETRRSGRVTLRVPLKIYEPDSDRYFLIEETCAVKVSLWGGLIALSAAVDRDQKLLVANQATGETAESQVVYLKPMELSGRLNLVAIEFLKPSPSFWGVNFPTVDPSRSQTMEYAN*
Ga0099793_1000160443300007258Vadose Zone SoilMPERNRAETRRSGRVTLRVPLKIYERGSNRPFLVEEAYSVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCQSQTRQYAN*
Ga0099793_1000869233300007258Vadose Zone SoilMPERTGAETRRSGRVTLRVPLKIYERGSNNRFLGEEAYAVKVSLWGGLIAFGTAVDRDQKLFVSNQATGEIAESQVVYLRPMRLAGRLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTTRSLASRDR*
Ga0099793_1001288943300007258Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERDSNNRFLEEEAYAVNVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLEPSPGFWGVVFPSCSRAQRT*
Ga0099793_1018343123300007258Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERGSNSRFLDEEAYAVSVSLWGGLIALGAALDRDQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRQYAN*
Ga0099794_1016453623300007265Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYEPGSNNRFLGEEAYAVKVSLWGGLIAFGAAVDRDQKLFVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTH
Ga0099829_1027750023300009038Vadose Zone SoilMPERNRAETRRSGRVTLRVPLKIYEPGSNNRFLGEEAYAVKVSLWGGLIAFGAAVDRDQKLLVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTH
Ga0099830_1074743623300009088Vadose Zone SoilMPERNRAETRRSGRVTLRVALKIYERGSNKPFLVEEAYAVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVNPCRSQTRQYAN*
Ga0099828_1007310563300009089Vadose Zone SoilMPERNRAETRRTGRVTLRVALKIYERGSNKPFLVEEAYAVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCRSQTGQYAN*
Ga0099828_1007749633300009089Vadose Zone SoilVTLRVPLKIYEPGSNNRFLGEEAYAVKVSLWGGLIAFGAAVDRDQELFVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTH*
Ga0134111_1044493213300010329Grasslands SoilMPERGGAETRRSSRVTLRVPLKIYERGSGGLIALGATVDRDQKPFVFNQATGETAESEVLYLRPMRLAGGLGMVAIKFLKPSPGFWGVAFPTVDPCRSQTREYPN*
Ga0126376_1034096333300010359Tropical Forest SoilMAGRSGTEARRSSRLTLRVTLKIYQPDSNNRFLLEEAYAVKVSLWGGLIALRTAVHPGQRLSLLNRATGATAESKVVYLGPMHSGARLTRLVAIEFLKSSPDFWSVVFPRLTVPAPKPSTIQTRISRVEYRR*
Ga0126379_1327356413300010366Tropical Forest SoilETRRSGRLTLRVTLKIYQPDANNRSLVEETSAVKVSLWGGLIALRTAVHPGQRLSLLNRATGETAESKVVYLGPMRSSARLTRLVAIEFLKPSPSFWGVVFPRLTVTTTGTSTFQTRSTGRGS*
Ga0126381_10136442313300010376Tropical Forest SoilVAGRSGTEARRSGRLTLRVTLKIYQPDSNNRFLIEEAYAVKVSLWGGLIALRTAVHPGQRLSLLNRATGATAESKVVYLGPMHSGARLTRLVAIEFLKSSPDFWSVVFPRLTVPAPKPSTIQTRISRVEYRR*
Ga0126352_119305123300010859Boreal Forest SoilVTLRVPLKIYEPGSNKRSLVEEASAVKVSLWGGLVPLKATVNRDQKLFLVNQATGETAESKVAYLGPMQLGGRRLRLVAIEFLRPSPGFWGLVFPAVEAGRSHTTQYAH*
Ga0150983_1140110413300011120Forest SoilVKVSLWGGLIALRTAVTKDQKLSMVNEATGETADSKVVHLGPMQLGGGLRLVAVEFLRSSPAFWGMVFPTVDPCRTPTTRYAKAGSSHN*
Ga0137392_1060014713300011269Vadose Zone SoilVTLRVPLKIYEPGSNNRFLGEEAYAVKVSLWGGLIAFGAAVDRDQKLLVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTH*
Ga0137393_1064812313300011271Vadose Zone SoilVTLRVPLKIYERGSNKPFLVEEAYAVKVSLWGGLIAFGAAVDRDQKLLVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTH*
Ga0137393_1167936013300011271Vadose Zone SoilGPMPERNGAETRRSGRVPLRVPLKIYERGSNNRFLVEEAYAVNVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCRSQTGQYAN*
Ga0137389_10000501133300012096Vadose Zone SoilMPERNRAETRRSGRVTLRVALKIYERGSNKPFLVEEAYAVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCRSQTGQYAN*
Ga0137389_1019084713300012096Vadose Zone SoilMPGRNEAGTRRSGRVTLRVPLRIYEPGSNKRFLVEEASALKVSLWGGLVVLRATVSRDQKLFLVNQATGESAESKVAYLGPMQLGGRRLRLVAIEFLKPSPGFWGLAFPTVDPSRSQAKQYAH*
Ga0137388_1015709123300012189Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERGSNKPSLVEEAYSVKVSLWGGLIAFGAAVDRDQKLFVSNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVGPCRSRTGQYSN*
Ga0137399_1128958013300012203Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYEPGSNNGFHVEEAYAVRVSLWGGLIAFGAALDRDKKLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDACRSQTR
Ga0137362_1032639213300012205Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERDSNNRFLEEEAYAVNVSLWGGLIAFGAALDRDKKLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFMKPSPGFWGVAFPKVDPRQSQTGQYAN*
Ga0137360_1021417413300012361Vadose Zone SoilMPEGNGAETRRSGRVTLRVPLKIYEEGSNNPFLVEAYSVKVSLWGGLIAFGATVDRDQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLEPSPGFWGVVFPSCSRAQRT*
Ga0137360_1190797413300012361Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERDSNKPALVEEAYSVKVSLWGGLIALGAVVDQDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSAGFWGVVFPTVDPCRSQT
Ga0137361_1034834313300012362Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERDSNKPALVEEAYSVKVSLWGGLIALGAVVDQDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSAGFWGVVFPTVDPCRSQTGKYAN*
Ga0137390_1002580443300012363Vadose Zone SoilMPERNGAETRRSGRVPLRVPLKIYERGSNNRFLVEEAYAVNVSLWGGLIAFVAAVDRDQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVNPCRSQTRQYAN*
Ga0137358_1002231923300012582Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERDSNNRFLEEEAYAVNVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLEPSPGFWGVVFPSSSRAQRT*
Ga0137398_1060273823300012683Vadose Zone SoilMPEGNRAETRRSGRVTLRVPLKIYEEGSNNPFLVEAYSVKVSLWGGLIAFGATVDRDQKLFVFNQATGEIAESQIIYLRPMRLAGGLGLVAIKFMKPSPGFWGVAF
Ga0137397_1113998513300012685Vadose Zone SoilMPERNGAEARRSGRVTLRVPLKIYERDSNNRFLDEEAYAVNVSLWGGLIAFGAAVDRDQKLFLFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLEPSPGFW
Ga0137396_1001376923300012918Vadose Zone SoilMPERTGAETRRSGRVTLRVPLKIYERGSNNRFLGEEAYAVKVSLWGGLIAFGTAVDRDQKLFVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTTRSLASRDR*
Ga0137396_1016612333300012918Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERDSNNRFLEEEAYAVNVSLWGGLIAFGAAVDRHQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLEPSPGFWGVVFPSCSRAQRT*
Ga0137396_1024908333300012918Vadose Zone SoilPLKIYERGSNSRFLDEEAYAVSVSLWGGLIALGAALDRDQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRQYAN*
Ga0137396_1032968823300012918Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERGSNEPSLVEEAYSVKVSLWGGLIAFGAAVDRDKKLFVFNQATGEIVESQVIYLRPMRLAGGLGLVAIKFLEPSPGFWGVAFPTVDPCRSQTGQYAN*
Ga0137419_1038938413300012925Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYEPGSNNRFLGEEAYAVKVSLWGGLIAFGTAVDRDQKLFVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTH
Ga0137416_1151345113300012927Vadose Zone SoilMPGRNGQETRRSGRVTLRVPLRIYEPGESKWFLVEEASALKVSLWGGLVALKAPVKRDQELFLVNQATGQTAESKVTYLGPMHLEGRRLRLVAIEFLRPSPDFWGLGFPAVDLSPSQPI*
Ga0137407_1108733213300012930Vadose Zone SoilMPERNGAGTRRSGRVTLRVPLKINERGSNQPFLIEEAYSVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYVRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTARAAS*
Ga0137410_1007366413300012944Vadose Zone SoilMPERNGAEARRSGRVTLRVPLKIYERDSNNRFLDEEAYAVNVSLWGGLIAFGAAVDRDQKLFLFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLEPSPGFWGVVFPSCSRAQRT*
Ga0164300_1085594113300012951SoilMLGRTVTKTRRSGRVTLRVPLKIFEPGSNRRFLVEEASALKVSLWGGLIAISAAVTLNQKLFLANQATGETAESMVVYLGPMQLGGRRLRLVAIEFLRPSPGFWGM
Ga0164303_1003553113300012957SoilKTRRSGRVTLRVPLRIYEPGSNQRFLVEEASAVKVSLWGGLVALRTAVNPNQKLFLANQATGETAESKVVYLGPMQLGGKRLRLVAVEFLRPSPAFWGMVFPKVDPCRSQTGENGNQHPRARLHHDEQNDA*
Ga0164303_1036719823300012957SoilMPERSGAETRRSGRVTLRVPLKIYERDSNKSSLVEEAYSVKVSLWGGLIAFGAVVDRDQKLFMLNQATGEIAESQVIYLRPMRLAGGLGMVAIKFLKPSPGFWGVVFPTVDPCRSKTGQHAN*
Ga0164299_1069079513300012958SoilRLRLPDRSYEPIPARNSSSERVMPGRNGTKTRRSGRVTLRVPLKIFEPGSNRRFLVEEASAVKVSLWGGLVALRTAVNPNQKLFLANQATGETAESKVVYLGPMQLGGKRLRLVAVEFLRPSPAFWGMVFPTVDPCRSQTGENGNQHPRARLHHDEQNDA*
Ga0164302_1066328323300012961SoilMPGRNGTKTRRSGRVTLRVPLRIYEPGSNQRFLVEEASAVKVSLWGGLVALRTAVNPNQKLFLANQATGETAESKVVYLGPMQLGGKRLRLVAVEFLRPSPAFWGMVFPKVDPCRSQTGENGNQHPRARLHHDEQNDA*
Ga0164304_1072475313300012986SoilSNQRFLVEEASAVKVSLWGGLVALRTAVNPNQKLFLANQATGETAESKVVYLGPMQLGGKRLRLVAVEFLRPSPAFWGMVFPKVDPCRSQTGENGNQHPRARLHHDEQNDA*
Ga0182024_1000574113300014501PermafrostRRSGRVTLRVPLKIYEPGSNNRFLVEEAYSVKVNLWGGLIALRTAVTKDQKLSMVNQATGETADSKVVHLGPMHLGGELRLVAIEFLRSSPDFWGMVFPTGDPCRAPTTRYAKAGSSHN*
Ga0182024_1115808213300014501PermafrostMLSRNGAETRRSGRVTLRVPLRIYEPGSNNRFLVEEAYSVKVSLWGGLIALRTAVTKDQKLSMVNQATGEAADSKVVHLGPMQLGGGLRLVAIEFLRSSPDFWGMVFPRVDPCRAPTTLYAKAGSSHT*
Ga0157376_1309131813300014969Miscanthus RhizosphereMLGRTVTKTRRSGHVTLRVPLKLFEPGSNRRFLVEEASALKVSLWGGLIAISAAVNLNQKLLLANQATGETAESMVVYLGPMQLGGRRLRLVAIEFLRPSPGFGGMVFPTPDPWRSQTGGNGNQNSRAR
Ga0137420_108962323300015054Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERDSNNRFLEEEAYAVNVSLWGGLIAFGAAVDRDQKLFLFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLEPSPGFWGVVFPSCSRAQRT*
Ga0179592_10000448223300020199Vadose Zone SoilMPERNRAETRRSGRVTLRVPLKIYERGSNKPFLVEEAYSVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCQSQTRQYAN
Ga0179592_1005676723300020199Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERDSNNRFLDEEAYAVNVSLWGGLIAFGAAVDRDQKLFLFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLEPSPGFWGVVFPSCSRAQRT
Ga0210407_1001286153300020579SoilMPERNGAETRRSGRVTLRVPLRIYERGSNIRSLGEEAYAVKVSLWGGLIAFGAAVERDQTLFVFNQATGEIVESRVVYLRPMRLAGGVGLVAIKFLKPSPGFWGVVFPTVDSSRSQTKQYAN
Ga0210407_1001495943300020579SoilMPERNGAETRRSGRVTLRVSLRIYERGSNNRFLDEEAYAVKVSLWGGLIAFGAALERDQTLFVFNQATGEIAESQVVYLGPMGLAGGLGLVAIKFLKPSPGFWGVVFPTVGPSRSTRQYA
Ga0210407_1022411323300020579SoilMLQRNGAATRRSGRVTLRVPVKIYERGSNKHFFVEEAYAVQVSLWGGLIAFGAAVDQDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVDFPTVDPCRSPTGQNAN
Ga0210407_1028029923300020579SoilMPSRNGAGPRRSGRVTLRVPLRIYEPGSNKRFLVEEASALKVSLWGGLVALRAAVNRDQRLFLVNQATGETAESKVAYLGPMQLGGRRLRLVAVEFLRPSPGFWGLAFPTIDPSRPQTRQYAH
Ga0210407_1036344313300020579SoilMPGRKTGAETRRSGRVTLRVPLKVYEPDSNKYFLVEEACAVKVSLWGGLIALDAAVNRDQKLLIANQATGETAESQVVYLRPMEPNGKLNLVAIEFLNPSPGFWGVKFPMVDLCRSQARQYAN
Ga0210403_10000123363300020580SoilMPERNGAETRRSGRVTLRVPLKIYERGPNNRFLDEEAYAVNVSLWGGLIAFGAALERNQTLFLFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPSFWGVVFPTVDPSRSQTRQYANEKGSDAESPKV
Ga0210403_1000062873300020580SoilMPERNGAETRRSGRVTLRVPLKIYERGPNNRFLDEEAYAVNVSLWGGLIAFGAALERNQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPSFWGVVFPTVDPSRSQTRQYAN
Ga0210403_1000261883300020580SoilMPCRNEAGTRRSGRVTLRVPLRIYEPGSNKRFLVEEASALKVSLWGGLVALRVAVNREQKLFLVNQATGETAESKVAYLGPMHLGGRRLRVVAIEFLKPSPGFWGLAFPTVDSNRSQSRQFAH
Ga0210403_1001420873300020580SoilMLQRNGAATRRSGRVTLRVPVKIYERGSNKHFFVEEAYAVQVSLWGGLIAFGAAVDQDQKLFVFNQATGEIAESQVIYLRPMRLAGGAALVAIKFLKPSPGFWGVVFPAVDPCRSQTGQFAN
Ga0210403_1001439573300020580SoilMPCRNEATTRRSGRVTLRVPLRIYEPGSNKRFLVEEASALKVSLWGGLVALRVAVNREQKLFLVNQATGETAESKVAYLGPMQLGGRRLRLVAIEFLRPSPGFWGLAFPTVDPSRSQTKQYAH
Ga0210403_1002278253300020580SoilMPERNGAETRRSGRVTLRVPLKIYERGSNNRFLDEEAYAVNVSLWGGLIAFGAALQRDQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPSFWGVVFPTVDPSRSQTRQHAN
Ga0210403_1003228253300020580SoilMPERNGDETRRSGRVTLRVPLRIYERGSNNRFLGEEAYAVQVSLWGGLIAFGAAVERHQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRPHAN
Ga0210403_1016779443300020580SoilMPERNRAETRRSGRVTLRVPLKIYEPGSNKPFLVEEAYAVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQIIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCRSQTGQYAN
Ga0210403_1048802813300020580SoilMLGRNGAETRRSGRATLRVPLRIYEPGSNNRFIVEEAYSVKVSLWGGLIALRTAVTKNQKLSMVNQATGETADSKVVDLGPMLLGGGLRLVAIEFLRSSPDFWGMVFPTVDPCRTPTTRYAKAGSSHN
Ga0210403_1102891623300020580SoilMPERNRAETRRSGRVTLRVPLKIYEPSSNKSFLLEEAYSVKVSLCGGLIAFGAAVDRDQKLFVFNQATGEIAESRVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVDFPTVDPCRSPTGQNAN
Ga0210399_10000726233300020581SoilMPERNGPETRRSGRVTLRVPLKIYERGSNHRFLDDEAYAVNVSLWGGLIAFGAALERDQTLYVFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLKPSPGFWGVVFPTVDPSRSQTRQYAN
Ga0210399_1002408273300020581SoilMLGRNGAETRRSGRATLRVPLRIYEPGSNNRFIVEEAYSVKVSLWGGLIALRTAVTKDQKLSMVNQATGETADSKVVHLGPMLLGGRLRLVAIEFLRSSPDFWGMVFPTVDPCRTPTTRYAKAGSSHN
Ga0210399_1018360023300020581SoilMPSRNGAGPRRSGRVTLRVPLRIYEPGSNKRFLVEEASALKVSLWGGLVALRAAVNRDQRLFLVNQATGETAESKVAYLGPMQLGGSRLRLVAIEFLRPSPGFWGLAFPTVDPSRSQTKQYAH
Ga0210399_1066211123300020581SoilMPGRKTGAETRRSGRVTLCVPLKIYEPDSNKYFLVEEASAVKVSLWGGLIALSVAVTPDQKLLIANQATGETAESQIVYLRPMEPSGKLNLVAIEFLKPSPGFWGVKFPTVDPCRSQTRQYPN
Ga0210401_1006986313300020583SoilMLGRNGAETRRSGRATLRVPLRIYEPGSNNRFIVEEAYSVKVSLWGGLIALRTAVTKDQKLSMVNEATGETADSKVVHLGPMQLGGGLRLVAVEFLRSSPAFWGMVFPTVDPCRTPTTRYAKAGSSHN
Ga0210400_1006483123300021170SoilMPERNRAETRRSGRVTLRVPLKIYEPSSNKSFLLEEAYSVKVSLCGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVDFPTVDPCRSPTGQNAN
Ga0210400_1070677313300021170SoilPERNRAETRRSGRVTLRVPLKIYEPGSNKPFLVEEAYAVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQIIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCRSQTGQYA
Ga0210405_10004160153300021171SoilMPERNGAETRRSGRVTLRVPLKIYERGSNNRFLDEEAYAVNVSLWGGLIAFGAALQRDQTLFVFNQATGEIAESQVVYLRPLRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQAKQHAN
Ga0210408_1066370023300021178SoilMPGRNGTKTRRSGRVTLRVPLRIYEPGSNQRFLVEEASAVKVSLWGGLVALRTAVNPNQKLFLANQATGETAESKVVYLGPMQLGGKRLRLVAVEFLRPSPAFWGMVFPKVDPCRSQTGENGNQHPRARLHHDEQNDA
Ga0210396_1000515183300021180SoilMPERNGAEFRRSGRVTLRVPLKIYERGSNNRFLDEEAYAVTVSLWGGLIAFGAALERDQTLYVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRQHAN
Ga0210396_1004731813300021180SoilMPERNGAETRRSGRVTLRVPLKIYERGPNNRFLDEEAYAVNVSLWGGLIAFGAALERDQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLKPSPSFWGVVFPTVDPSRSQTRQHAN
Ga0210388_1165550113300021181SoilMSGRNGTETRRSGRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRYQKLLLVNQATGESAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPS
Ga0210393_1007431873300021401SoilSGRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRHQKLLLVNQATGETAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPSAVPCRARTANHSN
Ga0210397_1025412713300021403SoilMSGRNGTETRRSGRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRYQKLLLVNQATGETAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPSAVPCRVRAANHSN
Ga0210397_1071307613300021403SoilMLGRNGAETRRSGRATLRVPLRIYEPGSNNRFIVEEAYSVKVSLWGGLIALRTAVTKNQKLSMVNQATGETADSKVVHLGPMLLGGRLRLVAIEFLRSSPDFWGMVFPTVDPCRTPTTRYAKAGSSHN
Ga0210387_1011576453300021405SoilMSGRNGTETRRSGRVTLRVPVKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRHQKLLLVNQATGETAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPSAVPCRARTANHSN
Ga0210387_1106860123300021405SoilMSGRNGTETRRSGRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRYQKLLLVNQATGESAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPSAVPCRARVANHSN
Ga0210386_1143610513300021406SoilPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRHQKLLLVNQATGETAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPSAVPCRVRAANHSN
Ga0210383_1003088023300021407SoilMSGRNGTETRRSGRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRHQKLLLVNQATGETAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPSAVPCRARTANHSN
Ga0210383_1099021313300021407SoilRATLRVPLRIYEPGSNNRFIVEEAYSVKVSLWGGLIALRTAVTKDQKLSMVNEATGETADSKVVHLGPMQLGGGLRLVAVEFLRSSPAFWGMVFPTVDPCRTPTTRYAKAGSSHN
Ga0210384_1008402623300021432SoilMPERNGAEFRRSGRVTLRVPLKIYERGSNNRFLDEEAYAVTVSLWGGLIAFGAALERDQTLYVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDSSRSQAKQHAN
Ga0210398_1002312333300021477SoilMPGRNGEGIRRSGRVTLRVPLKVYEPGSNKRFLVEEACALKVSLWGGLVCLKTTVNRAQKLFLVNQATGESAESRVAYLGPMHLGGRRLRLVAIEFLRPSPNFWGLAFPTIDPSRTRSAHYSH
Ga0210398_1114042813300021477SoilMSGRNGTETRRSGRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRHQKLLLVNQATGETAESQIVYLGPMHLGGRRLRLVAIEFLKPSPG
Ga0210402_1001938833300021478SoilMPERNGAETRRSGRVTLRVPLRIYERGSNNRLLGEEAYAVQVSLWGGLIAFAAAVERHQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRPCAN
Ga0210402_1110057823300021478SoilIYEPGSNQRFLVEEASAVKVSLWGGLVALRTAVNPNQKLFLANQATGETAESKVVYLGPMQLGGKRLRLVAVEFLRPSPAFWGMVFPKVDPCRSQTGENGNQHPRARLHHDEQNDA
Ga0210410_10001752183300021479SoilMPERNGAETRRSGRVTLRVPLKIYERGSNNRFLDEEAYAVTVSLWGGLIAFGAALERDQTLYVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPSFWGVVFPTVDPSRSQTRQHAN
Ga0210410_1068753313300021479SoilMPGRNEAGTRRSGRVTLRVPLRIYEPGSNKRFLVEEASALKVSLWGGLVALRVAVNREQKLFLVNQATGETAESKVAYLGPMQLGGRRLRLVAIEFLKLSPGFWGLAFPTVDPSRSQTRQYAH
Ga0126371_1102744613300021560Tropical Forest SoilMPGKSGEETRRSGRLTLRVTLKIYQPDANNRSLVEETSAVKVSLWGGLIALRTAVHPGQRLSLLNRATGETAESKVVYLGPMRSSARLTRLVAIEFLKPSPGFWGVVFPRLTVTTTGTSTFQTRSTGRGS
Ga0242663_111198613300022523SoilMPERNGAETRRSGRVTLRVPLKIYERGPNNRFLDEEAYAVNVSLWGGLIAFGAALERNQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPSFWGVVFPTVDPSRSQTRQHANEKGSDAESPKV
Ga0242655_1007466913300022532SoilSGAMPERNGAETRRSGRVTLRVPLKIYERGPNNRFLDEEAYAVNVSLWGGLIAFGAALERNQTLFLFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPSFWGVVFPTVDPSRSQTRQHAN
Ga0242655_1020046123300022532SoilMPGRNEAGTRRSGRVTLRVPLRIYEPGSNKRFLVEEAFALKVSLWGGLVALRAAVNREQKLFLVNQATGETAESKVAYLGPMQLGGRRLRLVAIEFLRPSPGFWGLAFPTVDPSRSQTKQYAH
Ga0242657_125137313300022722SoilPPRSGVMPERNGDETRRSGRVTLRVPLRIYERGSNNRFLGEEAYAVQVSLWGGLIAFGAAVERHQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRPHAN
Ga0242665_1000183033300022724SoilMPERNGAEFRRSGRVTLRVPLKIYERGSNNRFLDEEAYAVTVSLWGGLIAFGAALERDQTLYVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPSFWGVVFPTVDSSRSQAKQHAN
Ga0137417_138506623300024330Vadose Zone SoilMPERNGAETRRSGRVTLRVPLKIYERGSNNRFRDEEAYAVTVSLWGGLIAFGAAVGRDQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWVWSFPRSMHADLRPGSMPPRITTGRAPS
Ga0207684_1050885813300025910Corn, Switchgrass And Miscanthus RhizosphereMPGRNGAETRRSGRVTLRVPLKIYEPGSNKRFLVEEASAVKVSLWGGLIALRTAVNQDQKLSVVNQATGETAESKVVYLGPTQLSGGLRLVAIEFLRSSPDFWGM
Ga0209131_1006790103300026320Grasslands SoilMPERNGAETRRSGRVTLRVPLKIYERDSNNRFLEEEAYAVNVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAQSQVVYLRPMRLAGGLGLAAIKFLEPSPGFWGVVFPSCSRAQRT
Ga0257167_105846513300026376SoilMPERNGAETRRSGRVTLRVPLKIYEPGSNNRFLGEEAYAVKVSLWGGLIAFGAAVDRDQKLFVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCQSQTRQYAN
Ga0257171_107449213300026377SoilMPERNGAETRRSARVTLRVPLKIHEPGSNKPSLVEEAYSVKVSLWGGLIAFGAAVDRDQKLFVSNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCRSQTGQYAN
Ga0209648_1019834823300026551Grasslands SoilVPERNGAETRRSGRVTLRVPLKIYEPGSNNRFLGEEAYAVKVSLWGGLIAFGAAVDRDQKLLVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTH
Ga0209735_111507313300027562Forest SoilMPCRNEAGTRRSGRVTLRVPLRIYEPGSNKRFLVEEAAALKVSLWGGLVVLRATVSRDQKLFLVNQATGESAESKVAYLGPMQLGGRRLRLVAVEFLKPSPGFWGLAFPTVDPSRSQTRQYAH
Ga0209076_10015922 3300027643 Vadose Zone Soil MPERTGAETRRSGRVTLRVPLKIYERGSNNRFLGEEAYAVKVSLWGGLIAFGTAVDRDQKLFVSNQATGEIAESQVVYLRPMRLAGRLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTTRSLASRDR
Ga0209076_10920401 3300027643 Vadose Zone Soil TLRVPLKIYERGSNSRFLDEEAYAVSVSLWGGLIALGAALDRDQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRQYAN
Ga0209388_11497332 3300027655 Vadose Zone Soil ETRRSGRVTLRVPLKIYERGSNKPFLVEEAYSVKVSLWGGLIAFGAAVDRDQKLLVSNQATGEIAESQVIYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPCQSQTRQYAN
Ga0209736_10173293 3300027660 Forest Soil MPCRNEAGTRRSGRVTLRVPLRIYEPGSNKRFLVEEAFALKVSLWGGIVALRVAVNREQKLFLVNQATGEAAESKVAYLGPMQLSGRRLRLVAIEFLKPSPGFWGLAFPTVDPSRSQTRPYAKVGSTDN
Ga0209588_10298472 3300027671 Vadose Zone Soil MPERNGAETRRSGRVTLRVPLKIYERDSNNRFLEEEAYAVNVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLEPSPGFWGVVFPSCSRAQRT
Ga0209701_100943712 3300027862 Vadose Zone Soil LPERNGAETRRSGRVTLRVPLKIYEPGSNNRFLGEEAYAVKVSLWGGLIAFGAAVDRDQKLLVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTH
Ga0209380_101477161 3300027889 Soil NGTETRRSGRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRHQKLLLVNQATGETAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPSAVPCRVRAANHSN
Ga0209067_107193302 3300027898 Watersheds AETRCSGRVTLRVPLKIYEPDSDRYFLIEETCAVKVSLWGGLIALSAAVDRDQKLLVANQATGETAESQVVYLKPMELSGRLNLVAIEFLKPSPSFWGVNFPTVDPSRSQTMEYAN
Ga0209583_101519661 3300027910 Watersheds MPVRNGAETRRSGRVTLRVPLKIYEPDSNKYFLIEEACAVKVSLWGGLIALGAAVDRGQKLLIANQATGETAESQVVYLKPMELSGRLMLVAIEFLKPSPGFWGVDFPTVDPYRFQTREYAN
Ga0209526_100876483 3300028047 Forest Soil MPGRNGAEIRRSGRVTLRVPLKIFEPGSNKRFLVAEASAVKVSLWGGIIALGAAVNRDQKLFLLNQATGETAESKVAYLGPMQLGGRRLRLVAIEFLRPSPGFWGLSFPTVDSCRSQSRQYAKVGSGN
Ga0209526_109993601 3300028047 Forest Soil MLGRNGAETRRSGRVTLRVPLKIFEPGSNKRFLVEEASAVKVSLWGGLIALRTAVNQDQKLSVVNQATGETAESKVVYLGPIQLSGGLRLVAIEFLRPSPDFWGMVFPAVDPCRSLPTRYAKAGSSQN
Ga0137415_100183453 3300028536 Vadose Zone Soil MPERNGAETRRSGRVTLRVPLKIYERGSNSRFLDEEAYAVSVSLWGGLIALGAALDRDQKLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRQYAN
Ga0137415_100691412 3300028536 Vadose Zone Soil MPERTGAETRRSGRVTLRVPLKIYERGSNNRFLGEEAYAVKVSLWGGLIAFGTAVDRDQKLFVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPRSIHDARPAVTTRSLASRDR
Ga0308309_114423221 3300028906 Soil MPERNGAETRRSGRVTLRVPLKIYERGSNNRFLDEEAYAVTVSLWGGLIAFGAALERDQTLYVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPSFWGVVFPTVDSSRSQAKQHAN
Ga0308309_118627422 3300028906 Soil GRVTLRVPLKIFEPNTNKRFLVEEAYSLKVSLWGGLIALRAPVNRHQKLLLVNQATGETAESQIVYLGPMHLGGRRLRLVAIEFLKPSPGFWSIGFPSAVPCRVRAANHSN
Ga0222749_103974721 3300029636 Soil MLGRNGTETRRSGRVTLRVPLKIFEPGSDKRFLVEEASAVKVSLWGGLIALRTAVNRDQKLVVVNQATGETAESQVVYLGPIQLSGGLRLVAIEFLRLSPDFWGMVFPTDNPCRSQITRHAKAGSSHN
Ga0210278_11507522 3300030596 Soil ERNGDETRRSGRVTLRVPLRIYERGSNNRFLGEEAYAVQVSLWGGLIAFGAAVERHQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRQYAN
Ga0307482_11882741 3300030730 Hardwood Forest Soil GAETRRSARMTLRVPLKIYERGSSSRNRFLDEEAYAVKVSLWGGLIAFGAPVERDQTLYVFNQATGEIAESQVVYLRPMRLAGGLGLVGIKFLKPSPGFWGVDFPTLDPCRAQAKQQAAN
Ga0170824_1149038741 3300031231 Forest Soil MPGRNGAGARRSGRVTLRVPLKIYEPGENKWFLVEEASALKVSLWGGLVALKAPVKRNQELFLVNQATGQTAESKVTYLGPMHVEGRRLRLVAIEFLRPSPDFWGLGFPVVD
Ga0307476_100034782 3300031715 Hardwood Forest Soil MAERNGAETRRSARMTLRVPLKIYERGSSSRNRFLDEEAYAVKVSLWGGLIAFGAPVERDQTLYVFNQATGEIAESQVVYLRPMRLAGGLGLVGIKFLKPSPGFWGVDFPTLDPCRAQAKQQAAN
Ga0307476_113562691 3300031715 Hardwood Forest Soil RRSGRVTLRVPLKIYERGSNSRLLDEEAYAVNVSLWGGLIAFGAALERDQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRQYAN
Ga0310813_122921181 3300031716 Soil PLKIFEPGSNRRFLVEQASALKVSLWGGLIAISALVNLNQKLFLANQATGETEQSMVVYLGPMQLGGRRLRLVAIEFLRPSPGFWGMVFPTPDPWRSQTGGNGNQNPRARTAP
Ga0307474_101328741 3300031718 Hardwood Forest Soil MPERNGAETRRSGRVTLRVPLKIYEPGSKNGVHVEEAYAVRVSLWGGLIAFGAAVDRDKKLFVFNQATGEIAESQVVYLRPMRLAGGVGLVAIKFLKPSPGFWGVVFPTVA
Ga0307469_112947541 3300031720 Hardwood Forest Soil MPERNGAETRRSGRVTLRVPLKINERGSNEPFLIEEAYSVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYVRPMRLAGGLGLVAIKFLKPSPSFWGVVFPTVRAAS
Ga0307468_1018758231 3300031740 Hardwood Forest Soil MPERNGAETRRSGRVTLRVPLKINERGSNEPFLIEEAYSVKVSLWGGLIAFGAAVDRDQKLFVFNQATGEIAESQVIYVRPMRLAGGLGLVAIKFLKPSPSFWGVVFPTV
Ga0307477_100026951 3300031753 Hardwood Forest Soil MPERNGAETRRSGRVTLRVPLKIYERGSNSRLLDEEAYAVNVSLWGGLIAFGAALERDQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWG
Ga0307477_100503836 3300031753 Hardwood Forest Soil MPGRNGTQTRRSGRVTLRVSLRIYEPGSNNRFLVEEAYSVKVSLWGGLIALRSAVNKDQKLSMVNQATGETADSKVVHLGPTQLSGGLRLIAIEFLRPSPDFWGMVFPEVDPCRSQIIQSAKAGSSHN
Ga0307477_102038841 3300031753 Hardwood Forest Soil MPEKNGTGTRRSGRATLRVPLKIYERGSDKPLLVEEAYAVRLSLWGGLIAFETAVERDQKLFVFNQATGEIAESRIVYLRPMQLDGRHRLAAIEFLRPSPGFWGVDFPKCDPCQSRATGPTSRAELAQRSNGV
Ga0307475_102253282 3300031754 Hardwood Forest Soil MPERNEAEIRRSGRVTLHVPLKIYERGPNNRFIDEEAYAVKVSLWGGLIAFGAAVDRDEKLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPKADPCRSQSRQYAN
Ga0307475_104466751 3300031754 Hardwood Forest Soil MPDRNGAETRRSGRVTLRVSLRIYERGSNNRFLDEEAYAVKVSLWGGLIAFGAALERDQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVGPSRSTG
Ga0307475_110262421 3300031754 Hardwood Forest Soil MPERNGAETRRSGRVTLRVPLKIYEPGSKNGVHVEEAYAVRVSLWGGLIAFGAAVDRDKKLFVFNQATGEIAESQVVYLRPMRLAGGVGLVAIKFLKPSPG
Ga0307475_112374691 3300031754 Hardwood Forest Soil MPERNGPEARRSGRVTLRVPLKIYERGSNHRFLDEEAYAVNVSLWGGLIAFGAAVERNQTLYLFNQATGEIAESQVVYLRPMRLAGGLGLAAIRFLNPSPAFWGVVFPTVDPSRSQTGSMPTRIPTGRAPS
Ga0307478_111804232 3300031823 Hardwood Forest Soil RNGPETRRSGRVTLRVPLKIYERGSNHRFLDDEAYAVNVSLWGGLIAFGAALERDQTLYVFNQATGEIAESQVVYLRPMRLAGGLGLAAIKFLKPSPGFWGVVFPTVDPSRSQTRQYAN
Ga0307479_100866323 3300031962 Hardwood Forest Soil MPERNGVEIRRSGRVTLHVPLKIYERGPNNRSLGEEAYAVKVSLWGGLIAFGAAVERNQTLFVFNQATGEIAESQVVYLRPMRLAGGVGLVAIKFLKPSPGFWGVVFPTVDSSRSQTRQYAN
Ga0307479_101023492 3300031962 Hardwood Forest Soil MPERNGAETRRSGRVTLRVPLKIYERGSNNRSSGEEAYAVSVSLWGGLIAFGAALGRDQKLFVFNQATGEIAESQVVYLRPMRLAGGVGLVAIKFLKPSPGFWGVVFPTVDPSRSQTKAACQLESQRGRVPS
Ga0307479_103320403 3300031962 Hardwood Forest Soil MPEKNGAETRRSGRVTLRVPLKIYERGSTNRFLGEEAYAVKVSLWGGLIAFGAAVDRGQKLFVSNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLEPSPGFWGVVFPRSIHDARPAVTTRSLASRDR
Ga0316040_1218841 3300032121 Soil SGDMPERNGDETRRSGRVTLRVPLRIYERGSNNRFLGEEAYAVQVSLWGGLIAFGAAVERHQTLFVFNQATGEIAESQVVYLRPMRLAGGLGLVAIKFLKPSPGFWGVVFPTVDPSRSQTRPHAN




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice