NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F084680

Overview

Basic Information
Family ID F084680
Family Type Metagenome / Metatranscriptome
Number of Sequences 112
Average Sequence Length 268 residues
Representative Sequence MIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Number of Associated Samples 86
Number of Associated Scaffolds 112

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 61.26 %
% of genes near scaffold ends (potentially truncated) 54.46 %
% of genes from short scaffolds (< 2000 bps) 69.64 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.42

Most Common Taxonomy
Group Unclassified (57.143 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (41.964 % of family members)
Environment Ontology (ENVO) Unclassified (38.393 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (67.857 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 3.95%, β-sheet: 23.36%, Coil/Unstructured: 72.70%
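
As a quick sanity check on the secondary structure distribution reported above, the three percentages should add up to roughly 100%. The sketch below is only an illustration: the dictionary keys, the function name `check_distribution`, and the 0.05 rounding tolerance are assumptions and are not part of NMPFamsDB.

```python
# Percentages copied from the "Secondary Structure distribution" entry above.
# The key names are illustrative, not database field names.
distribution = {
    "alpha_helix": 3.95,
    "beta_sheet": 23.36,
    "coil_unstructured": 72.70,
}


def check_distribution(dist, tolerance=0.05):
    """Return (total, dominant_class); raise if the total strays from 100%.

    The tolerance is an assumed allowance for two-decimal rounding.
    """
    total = sum(dist.values())
    if abs(total - 100.0) > tolerance:
        raise ValueError(f"percentages sum to {total:.2f}, expected about 100")
    dominant = max(dist, key=dist.get)
    return total, dominant


total, dominant = check_distribution(distribution)
print(f"total = {total:.2f}%, dominant class = {dominant}")
```

For this family the dominant class is coil/unstructured, which fits the low model confidence reported in the 3D structure section.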

Predicted 3D Structure

pTM-score: 0.42

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
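
The rule in the note above (a model counts as low confidence when pTM < 0.7) can be written as a small check. This is a minimal sketch: the function name `model_quality` and the "low"/"high" labels are hypothetical, and only the 0.7 cutoff comes from the page.

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted in the note above


def model_quality(ptm, threshold=LOW_CONFIDENCE_PTM):
    """Label a predicted model by its pTM score (labels are illustrative)."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM is expected to lie in the range [0, 1]")
    return "low" if ptm < threshold else "high"


# Family F084680 reports a pTM-score of 0.42.
print(model_quality(0.42))
```

With the reported score of 0.42 the check returns "low", which matches the page's statement that this model was not screened against SCOPe or PDB.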


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 112 Family Scaffolds
PF03743 | TrbI | 18.75
PF00155 | Aminotran_1_2 | 8.04
PF02739 | 5_3_exonuc_N | 6.25
PF08240 | ADH_N | 1.79
PF12704 | MacB_PCD | 1.79
PF00248 | Aldo_ket_red | 0.89
PF02771 | Acyl-CoA_dh_N | 0.89
PF12697 | Abhydrolase_6 | 0.89
PF13442 | Cytochrome_CBB3 | 0.89
PF08281 | Sigma70_r4_2 | 0.89
PF03062 | MBOAT | 0.89
PF00107 | ADH_zinc_N | 0.89
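
The frequency column above can be converted back into approximate scaffold counts. The sketch below assumes that each percentage is taken over the 112 family scaffolds named in the column header and that the page rounds to two decimals; the function name `scaffold_count` is hypothetical.

```python
FAMILY_SCAFFOLDS = 112  # from the column header above

# A few rows copied from the Pfam table (Pfam ID -> % frequency).
pfam_frequency = {
    "PF03743": 18.75,
    "PF00155": 8.04,
    "PF02739": 6.25,
    "PF08240": 1.79,
    "PF00248": 0.89,
}


def scaffold_count(percent, total=FAMILY_SCAFFOLDS):
    """Recover the nearest whole number of scaffolds from a rounded percentage."""
    return round(percent * total / 100)


for pfam_id, percent in pfam_frequency.items():
    print(pfam_id, scaffold_count(percent))
```

Under these assumptions, the lowest listed frequency (0.89 %) corresponds to a single scaffold, so the rows at that level are each supported by one observation.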

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 112 Family Scaffolds
COG2948 | Type IV secretory pathway, VirB10 component | Intracellular trafficking, secretion, and vesicular transport [U] | 18.75
COG0258 | 5'-3' exonuclease Xni/ExoIX (flap endonuclease) | Replication, recombination and repair [L] | 6.25
COG1960 | Acyl-CoA dehydrogenase related to the alkylation response protein AidB | Lipid transport and metabolism [I] | 0.89


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 57.14 %
All Organisms | root | All Organisms | 42.86 %

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002245|JGIcombinedJ26739_100093357All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2811Open in IMG/M
3300002245|JGIcombinedJ26739_100180203All Organisms → cellular organisms → Bacteria1999Open in IMG/M
3300002245|JGIcombinedJ26739_100644448All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis937Open in IMG/M
3300004082|Ga0062384_100281978Not Available1023Open in IMG/M
3300004092|Ga0062389_100005392All Organisms → cellular organisms → Bacteria7704Open in IMG/M
3300004092|Ga0062389_100159739Not Available2141Open in IMG/M
3300004092|Ga0062389_100389156Not Available1508Open in IMG/M
3300004092|Ga0062389_101114402Not Available975Open in IMG/M
3300004135|Ga0058884_1363691All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter678Open in IMG/M
3300004139|Ga0058897_11156954Not Available1059Open in IMG/M
3300004479|Ga0062595_100905195Not Available744Open in IMG/M
3300004631|Ga0058899_12001391Not Available871Open in IMG/M
3300004635|Ga0062388_100047996All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2762Open in IMG/M
3300005332|Ga0066388_101307398Not Available1251Open in IMG/M
3300005334|Ga0068869_100097676All Organisms → cellular organisms → Bacteria2218Open in IMG/M
3300005339|Ga0070660_100359265Not Available1200Open in IMG/M
3300005434|Ga0070709_10613862Not Available839Open in IMG/M
3300005440|Ga0070705_100287901Not Available1171Open in IMG/M
3300005459|Ga0068867_101046474Not Available742Open in IMG/M
3300005598|Ga0066706_10031319All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter3398Open in IMG/M
3300006172|Ga0075018_10194733Not Available958Open in IMG/M
3300006176|Ga0070765_101300833All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium KBS 89686Open in IMG/M
3300006852|Ga0075433_10211294All Organisms → cellular organisms → Bacteria1724Open in IMG/M
3300006893|Ga0073928_10000394All Organisms → cellular organisms → Bacteria103544Open in IMG/M
3300006904|Ga0075424_100893288Not Available948Open in IMG/M
3300009137|Ga0066709_100742655Not Available1416Open in IMG/M
3300009553|Ga0105249_10115245All Organisms → cellular organisms → Bacteria2546Open in IMG/M
3300011120|Ga0150983_11506392All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis1117Open in IMG/M
3300011120|Ga0150983_11761458Not Available1241Open in IMG/M
3300011120|Ga0150983_12354261Not Available1121Open in IMG/M
3300011120|Ga0150983_12921494Not Available1526Open in IMG/M
3300011120|Ga0150983_14544152Not Available934Open in IMG/M
3300011120|Ga0150983_16548131Not Available1051Open in IMG/M
3300012189|Ga0137388_10300584Not Available1471Open in IMG/M
3300012582|Ga0137358_10015759All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4793Open in IMG/M
3300013296|Ga0157374_10316446Not Available1546Open in IMG/M
3300013306|Ga0163162_10025945All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis5792Open in IMG/M
3300014501|Ga0182024_10000434All Organisms → cellular organisms → Bacteria107287Open in IMG/M
3300015371|Ga0132258_10916228All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2213Open in IMG/M
3300018431|Ga0066655_10373908Not Available938Open in IMG/M
3300018482|Ga0066669_10377361All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis1190Open in IMG/M
3300020579|Ga0210407_10175260All Organisms → cellular organisms → Bacteria1661Open in IMG/M
3300020579|Ga0210407_10376165All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis1111Open in IMG/M
3300020581|Ga0210399_10000211All Organisms → cellular organisms → Bacteria46988Open in IMG/M
3300020581|Ga0210399_10712512Not Available824Open in IMG/M
3300020583|Ga0210401_10002228All Organisms → cellular organisms → Bacteria → Acidobacteria21635Open in IMG/M
3300020583|Ga0210401_10018180All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6780Open in IMG/M
3300021168|Ga0210406_10270669Not Available1389Open in IMG/M
3300021168|Ga0210406_10402018Not Available1097Open in IMG/M
3300021170|Ga0210400_10494046Not Available1010Open in IMG/M
3300021178|Ga0210408_10705992Not Available794Open in IMG/M
3300021180|Ga0210396_10119909All Organisms → cellular organisms → Bacteria → Acidobacteria2374Open in IMG/M
3300021181|Ga0210388_10198373All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis1756Open in IMG/M
3300021401|Ga0210393_10443120All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1060Open in IMG/M
3300021403|Ga0210397_10104465All Organisms → cellular organisms → Bacteria1919Open in IMG/M
3300021406|Ga0210386_10010009All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis7473Open in IMG/M
3300021407|Ga0210383_10089349All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2586Open in IMG/M
3300021420|Ga0210394_10091690All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2627Open in IMG/M
3300021420|Ga0210394_10889348Not Available775Open in IMG/M
3300021432|Ga0210384_10082836All Organisms → cellular organisms → Bacteria2884Open in IMG/M
3300021432|Ga0210384_10784068Not Available850Open in IMG/M
3300021433|Ga0210391_10061487All Organisms → cellular organisms → Bacteria2979Open in IMG/M
3300021433|Ga0210391_10344410Not Available1169Open in IMG/M
3300021478|Ga0210402_10082133All Organisms → cellular organisms → Bacteria2868Open in IMG/M
3300021478|Ga0210402_10488957All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis1143Open in IMG/M
3300021479|Ga0210410_10021344All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis5597Open in IMG/M
3300021559|Ga0210409_10039481All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4486Open in IMG/M
3300022498|Ga0242644_1007960Not Available901Open in IMG/M
3300022504|Ga0242642_1033107Not Available755Open in IMG/M
3300022507|Ga0222729_1019635Not Available790Open in IMG/M
3300022522|Ga0242659_1004765Not Available1695Open in IMG/M
3300022523|Ga0242663_1013611Not Available1134Open in IMG/M
3300022527|Ga0242664_1033778Not Available873Open in IMG/M
3300022531|Ga0242660_1068863Not Available811Open in IMG/M
3300022531|Ga0242660_1072221Not Available797Open in IMG/M
3300022533|Ga0242662_10033413Not Available1252Open in IMG/M
3300022533|Ga0242662_10116892Not Available777Open in IMG/M
3300022557|Ga0212123_10000827All Organisms → cellular organisms → Bacteria94039Open in IMG/M
3300022557|Ga0212123_10080441All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2753Open in IMG/M
3300022712|Ga0242653_1010988Not Available1127Open in IMG/M
3300022718|Ga0242675_1019104Not Available943Open in IMG/M
3300022720|Ga0242672_1038312All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Terracidiphilus → Terracidiphilus gabretensis760Open in IMG/M
3300022724|Ga0242665_10011481Not Available1833Open in IMG/M
3300022724|Ga0242665_10095492Not Available874Open in IMG/M
3300022726|Ga0242654_10079967Not Available990Open in IMG/M
3300022726|Ga0242654_10088497Not Available953Open in IMG/M
3300025961|Ga0207712_10478610Not Available1061Open in IMG/M
3300026095|Ga0207676_10949632Not Available845Open in IMG/M
3300026548|Ga0209161_10270960Not Available859Open in IMG/M
3300027610|Ga0209528_1087243Not Available690Open in IMG/M
3300027908|Ga0209006_10092183All Organisms → cellular organisms → Bacteria2691Open in IMG/M
3300028047|Ga0209526_10005425All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis8689Open in IMG/M
3300028047|Ga0209526_10246932Not Available1222Open in IMG/M
3300028381|Ga0268264_10007826All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis8893Open in IMG/M
3300030937|Ga0138302_1136105Not Available924Open in IMG/M
3300030946|Ga0075379_10987712Not Available903Open in IMG/M
3300030998|Ga0073996_11986309Not Available874Open in IMG/M
3300031057|Ga0170834_100189650All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis1279Open in IMG/M
3300031128|Ga0170823_16855120Not Available1253Open in IMG/M
3300031231|Ga0170824_112329593Not Available1119Open in IMG/M
3300031590|Ga0307483_1010223Not Available815Open in IMG/M
3300031720|Ga0307469_10047526All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2668Open in IMG/M
3300031720|Ga0307469_10586150Not Available994Open in IMG/M
3300031753|Ga0307477_10400377Not Available940Open in IMG/M
3300031823|Ga0307478_10203489Not Available1594Open in IMG/M
3300031962|Ga0307479_10015241All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae7213Open in IMG/M
3300031962|Ga0307479_10047826All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4111Open in IMG/M
3300032174|Ga0307470_10008201All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4211Open in IMG/M
3300032180|Ga0307471_100260994All Organisms → cellular organisms → Bacteria1793Open in IMG/M
3300032515|Ga0348332_12971387Not Available1203Open in IMG/M
3300032756|Ga0315742_10918462Not Available850Open in IMG/M



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 41.96%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 14.29%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 8.04%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 5.36%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 3.57%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 2.68%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.68%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 1.79%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.79%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.79%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.79%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.79%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.79%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.79%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.89%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.89%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 0.89%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.89%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 0.89%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.89%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.89%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.89%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.89%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.89%

Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300004082Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3EnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004135Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF204 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004139Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF230 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004631Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004635Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005339Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaGHost-AssociatedOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006172Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300014501Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021433Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022498Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-32-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022504Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-2-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022507Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-27-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022522Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-11-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022523Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022527Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022531Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022532Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022533Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022557Paint Pots_combined assemblyEnvironmentalOpen in IMG/M
3300022712Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-32-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022718Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022720Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022726Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026095Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027610Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027908Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300030937Forest soil microbial communities from Spain - ITS-tags Site 9-Mixed-thinned forest site A4_MS_spring Metatranscriptome (Eukaryote Community Metatranscriptome) (version 2)EnvironmentalOpen in IMG/M
3300030946Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - OA7 EcM (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030998Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-3A (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031128Oak Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031590Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032515FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data)EnvironmentalOpen in IMG/M
3300032756Forest Soil Metatranscriptomics Site 2 Humus Litter Mineral Combined AssemblyEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_10009335713300002245Forest SoilLAGALMIALVVLTASAQDPPDVQRTAKPDSNHAPAPVIAPLGAGTAFNASLVETLDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVQFDKATTKDGQEVILNAGIQALAVSGVAPMPSNTPKSRVNGSAAVPVVEDGAGSAESSDPLVVSTLYQEPRSTLRPPPKAALTPEGEFTSDGLFSPDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPNADSDAPASSSTDLDPQ*
JGIcombinedJ26739_10018020323300002245Forest SoilMMKLAASLVLLGALVGPGASAQAPSLLPPSAKKELTPHAPPATVVAPLGIGTAFNAFLDDSLDTRKTKAADPITAEVAEDVTYERSTIFPKGTKIMGHVVRVTSGGRGRAGCAIFVQFDKAILKDGQEVVLNAGIQALAVGTVAPLQDMDTPKNDMETAPHALPVEDNSSNPAANSGALVVSTTYEAPRNALRAPLAVAPVAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLLSAKKNMHLDSGTRLLLVVQPPPSADADPTNSLDLDQIQ*
JGIcombinedJ26739_10064444813300002245Forest SoilAVMVSRFCGSQPSRFPKGGRVMIKFAASLLFXGALLVLTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDTVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGHGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ*
Ga0062384_10028197823300004082Bog Forest SoilMIKFAASLIFAGALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGEIVTAEVSEDVSYQRCIVFPKGTKVTGHVVRVTSGGRGSAGSAVFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKAKSTAPQAVPVVEDDSESVASSDAVVVSTLYQAPRSTLRTPLTPGPASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLS
Ga0062389_10000539233300004092Bog Forest SoilMMKFTASLVFAGALLVLGANAQTPPDLQRSSKSEAKTEAPPAPVIAPLGAGTTFNASLDDTLDTRKSKAGDVVTAETAEDVSYEQCLIFPKGTQIVGHVVRVTPGGRGRGGSAIFVQFDKAMMKDGQEVTLNAGIQALAAAGVAPMPSDTASKSLDAATRQVPVQESSTGAEVTSGALVVSTIYEAPRTTLRPPLAGPLVAEGEFKSDGLFTPESKGAFGRPDLKVYTPTSEGSHGTVLLSAKKNMHLDGGTHLLLVVQPPPTAGESDAAGPLDLDPN*
Ga0062389_10015973923300004092Bog Forest SoilMIKFAASLIFAGALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGEIVTAEVSEDVSYQRCIVFPKGTKVTGHVVRVTSGGRGSAGSAVFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKAKSTAPQAVPVVEDDSESVASSDAVVVSTLYQAPRSTLRTPLTPGPASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHIDAGTHLLLVVQPPPTADSAAPSSSSTNLDPQ*
Ga0062389_10038915613300004092Bog Forest SoilMCDAAVMVWVFAVPSLSGFPRGGRVMIKFAASLLFAGSLLASAANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLEDTLDTRKAKAGDVVTAEASEDVSYQRCVIFPKGTKISGHVVRVTSGGHGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGSVAPMPSATPKNKSAAPQAVPVAEDDSELVASSDAVVVSTLYQQPRSTLRAPLTPGPASEGEFTSDGLFSPGSKGAFGRPDLKIYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPSNADSETPSSSSTNLGPQ*
Ga0062389_10111440223300004092Bog Forest SoilAPLGVGTAFNASLEDTLDTRKTKAGDIVTAEASEDVSYQRCVIFPKGTKITGHVVRVTSGGRGSAGSAIFIQFDKATVKDGQEVILNAGIQALAVGGVAPMPAATPKSKSTAQQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGPASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPSNADSETPASSPTNLNPQ*
Ga0058884_136369113300004135Forest SoilQAPAPVIAPLGVGTAFNAALEDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKIAGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKSKSTAPQAEPVVEDGSGSVASSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPAAD
Ga0058897_1115695413300004139Forest SoilLSFAVPSLSGFPKGGRVMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTAGSVAPSSSSTNLDPQ*
Ga0062595_10090519513300004479SoilAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDSVVAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGGRGKAGSAIFVQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTNSPKSAAPHRLPVENNTGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDRGTHLLLVVQPPPTDEPEGSSASSLDLDPQ*
Ga0058899_1200139113300004631Forest SoilMMRLMGSLVLMGALVVPGASGQAPSLLQPSGKKETKAQTPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPVTAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDADVPKSLDTAPHAVPVEDNSTGAAASNATVVSTIYESPRTALRTPLAAAPLAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLVSAKKNMHIDSGTRLLLVVQPPPSGDSDAANPLDLDLSQ*
Ga0062388_10004799613300004635Bog Forest SoilKSEAKTEAPPAPVIAPLGAGTTFNASLDDTLDTRKSKAGDVVTAETAEDVSYEQCLIFPKGTQIVGHVVRVTPGGRGRGGSAIFVQFDKAMMKDGQEVTLNAGIQALAAAGVAPMPSDTASKSLDAATRQVPVQESSTGAEVTSGPLVVSTIYEAPRTTLRPPLAGPLVAEGEFKSDGLFTPESKGAFGRPDLKVYTPTSEGSHGTVLLSAKKNMHLDGGTHLLLVVQPPPTAGESDAAGPLDLDPN*
Ga0066388_10130739823300005332Tropical Forest SoilLAVGTVFNAVLGDTLDARKTRAGDPISAEVAEDVSYERATIFPKGTKITGHVVRVTSGAHGKAGSAMFVQFDKATMKDGQEVMLNAGIQALAVTAIAPMPSSADSAKSAGSRTLPVDENAVGAQSSDDALVVSTIYERPGTTLRTPLAPAPAAEGEFNSDGLFSPESKGAFGRPDLKIYTPTSEGSHGTVLLSGRKNLHLESGTHLLLVVQPPPTNEPEGSSAGSLNLDPQ*
Ga0068869_10009767623300005334Miscanthus RhizosphereMKSAASLLLAVALVLVLSASAQTPPNLQNSAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVAAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGAGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTDSPKSAAPHRLPVENNTGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ*
Ga0070660_10035926513300005339Corn RhizosphereGTVFNALLSDTLDTRKTRAGDPVAAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGAGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTDSPKSAAPHRLPVENNTGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ*
Ga0070709_1061386213300005434Corn, Switchgrass And Miscanthus RhizosphereVGTVFNAMLGDTLDARKTRAGDAISAEVAEDVSYERATIFPKGTKLTGHVVRVTSGARGKTGSAIFVQFDKATMKDGQEVMLNAGIQALAVAAIAPMPSSTDSAKNGGSRPLPVDENAVGAESSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSDGLFSPESKGAFGRPDLKIYTPTSEGSHGTVLLSAHKNLHLDSGTHLLLVVQPPPTDEPEGSSASSLNLDPQ*
Ga0070705_10028790123300005440Corn, Switchgrass And Miscanthus RhizosphereALIDRRGSSVDFPGLGGFPEVREDASMKSAASLLLAVALVLVLSASAQTPPNLQNSAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVAAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGAGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTDSPKSAAPHRLPVENNTGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ*
Ga0068867_10104647413300005459Miscanthus RhizosphereKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVAAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGAGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTDSPKSAAPHRLPVENNTGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ*
Ga0066706_1003131933300005598SoilMKSAASLLLAGALVFVLNASAQTPPNLQNSAKTDLAAQPPAPVIAPLGVGTAFNASLSDTLDTRKTRAGDAVTAEIAEDVSYERCVVLPKGTKVEGHVVRVTSGGRGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDSDSPKSAAPHRLPVEDNTGGTGTGKDALVSTIYEKPRRTLRTPLPPVPAAEGEFTSDGLFSPESKGAFGRPDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVIQPPPTDEPEGSSANSLDLDPH*
Ga0075018_1019473323300006172WatershedsPPAPVIAPLGIGTTFNANLEDTLDTRKTRAGDPITAEISEDVSYERCMILPKGTKVTGHIVRVTSGGRGRAGSAIFVMFDKAMMKDGQEVMLNAGIQALATGPVASLPAETEASKNLSAGPHSMPVDDTTGGTAVASDALVVSTIYDAPRTTLRPPMAPAPAAEGEVGPDGLFTPESKGAFGRPDLKVYTPTSEGSHGTVLLSTRKNLHLDSGTHLLLVIQPPPSGEADPAASSLDLDPQ
Ga0070765_10130083313300006176SoilLKFSTSLLAGALMIALVVLTASAQDPPDVQRTAQPDSNHAPAPVIAPLGAGTAFNASLVETLDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVQFDKATTKDGQEVILNAGIQALAVSSVAPMPSNTPKNRVNGPAGVPVVEEGAGSAESSDPLVVSTLYQEPRSTLRPPPKAALTPEGEFTSDGLFSPDSKGAFGRPDLKVYTP
Ga0075433_1021129413300006852Populus RhizosphereMKSAASLLLAVALVLVLTASAQTPPNLQNSAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVVAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGGRGKAGSAIFVQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTNSPKSAAPHRLPVENNPGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ*
Ga0073928_10000394413300006893Iron-Sulfur Acid SpringMIKFAASLLFTGALLVLTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDTVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGHGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ*
Ga0075424_10089328813300006904Populus RhizosphereASLLLAVALVLVLTASAQTPPNLQNSAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVVAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGGRGKAGSAIFVQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTNSPKSAAPHRLPVENNPGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ*
Ga0066709_10074265513300009137Grasslands SoilVFVLNASAQTPPNLQNSAKTDLAAQPPAPVIAPLGVGTAFNASLSDTLDTRKTRAGDAVTAEIAEDVSYERCVVLPKGTKVEGHVVRVTSGGRGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDSDSPKSAAPHRLPVEDNTGGTGTGKDALVSTIYEKPRRTLRTPLTPVPAAEGEFTSDGLFSPESKGAFGRPDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVIQPPPTDEPEGSSANSLDLDPH*
Ga0105249_1011524513300009553Switchgrass RhizosphereGSSVDFPGLGGFPEVREDASMKSAASLLLAVALVLVLSASAQTPPNLQNSAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVAAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGAGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTDSPKSAAPHRLPVENNTGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ*
Ga0150983_1150639213300011120Forest SoilMGSLVLMGALVVPGASGQAPSLLQPSGKKETKAQTPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPVTAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDADVPKSLDTAPHAVPVEDNSTGAAASNATVVSTIYESPRTALRTPLAAAPLAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLVSAKKNMHIDSGTR
Ga0150983_1176145823300011120Forest SoilMMKFAASLLFAGALLVLTANAQNPPDIQPSAKTDSTQAPAAVIAPLGVGTAFNASLVDTLDTRKLKAGDPVTAETSEDVSYQRCMIFPKGTKIVGHVVRVTSGGRAGSAIFIQFDKAIVKDGQDVILSAGIQALAVGAVAPMPSGAPKTGATSAQVPVVQDSAGSKESSDALVVSTLYQEPRSTLRPPVVRGLESAGEFNSDGLFTSDSKGAFGRPDLKVYTPTSEGSHGSVLVSSRKNMHLDAGTHLLLVVQPPPKVDSEAPVSAPTDLAPQ*
Ga0150983_1235426113300011120Forest SoilMKSAASILLAGALVFVLRAEAQTPPNLQSSKTGNNVQTPPAPVIAPLGVGTVFNAALEGTLDTRKTHAGDAVVAETMEDVTYERCMVFPKGTKIMGHVVRVTSGARGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGTVAPMPSEADAPKATSPHALAVEDTSARTASSSDALVVSTIYEAPRTTLRTPLTPAPAAEGEFTSAGLFSPESKGAFGRPDLKIYTPTSEGSHGTVLLSARKIMHLDSGTHLLLVVQPPPTDESEATGTGSLDLDPQ*
Ga0150983_1292149423300011120Forest SoilMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTAGSVAPSSSSTNLDPQ*
Ga0150983_1454415213300011120Forest SoilMLKFSTSLLAGALMIALVVLTASAQDPPDVQRTAQPDSNHAPAPVIAPLGAGTAFNASLVETLDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVQFDKATTKDGQEVILNAGIQALAVSSVAPMPSNTPKSRVNGPAAVPVVKDGAGSAESSDPLVVSTLYQEPRSTLRPPLKAALTPEGEFTSDGLFSPDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPKADSDAPASSSTDLDPQ*
Ga0150983_1654813123300011120Forest SoilVDCSGLDGFPKVREDTSMKSAAALFLAGSLVFALGANAQTQQTPPSIQINGTQTDLKAQPPAPVIAPLAVGTVFNAMLGDALDARKTRAGDAISAEVAEDVSYERATIFPKGTKLTGHVVRVTSGARGKTGSAIFVQFDKATMKDGQEVMLNAGIQALAVAAIAPMPSSTDSAKNGGSRPLPVDENAVGAESSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSDGLFSPESKGAFGRPDLKIYTPTSEGSHGTVLLSAHKNLHLDSGTHLLLVVQPPPTDEPEGSSASSLNLDPQ*
Ga0137388_1030058423300012189Vadose Zone SoilMMKLTASLLLIGALVVPGASGQAPSLLQPSGKKEIKAQTPPAAVVAPLCVGTAFNASLDETLDTRKTRAGDPVTAEVAEDVTYERSTIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDADVPKNMESAPHAVPVEDNSSSSAGSNATVVSTIYESPRTTLRTPLAAAPVAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLLSAKKNLHIDSGTRLLLVVQPPPSGDSDAANPLDLDLNQ*
Ga0137358_1001575943300012582Vadose Zone SoilMMKFAASLLFAGALLVLTANAQNPPDIQPSAKTDSTQAPAAVIAPLGVGTAFNASLIDALDTRKLKAGDPVTAETSEDVSYQRCMIFPKGTKIVGHVVRVTSGRRGSAGSAIFIQFDKAIVKDGQDVILSAGIQALAVGAVAPMPSGAPKTGATSAQVPVVQDSAGSKESSDALVVSTLYQEPRSTVRPPVVRGLESAGQFNSDGLFTADSKGAFGRPDLKVYTPTSEGSHGSVLVSSRKNM
Ga0157374_1031644623300013296Miscanthus RhizosphereMKSAASLLLAVALVLVLSASAQTPPNLQNSAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVVAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGGRGKAGSAIFVQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTNSPKSAAPHRLPVENNPGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPDSKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ*
Ga0163162_1002594553300013306Switchgrass RhizosphereMKSAASLLLAVALVLVLSASAQTPPNLQNSAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVAAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGAGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTDSPKSAAPHRLPVEINTGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ*
Ga0182024_10000434803300014501PermafrostMIKVAVNLLLTIVTLVTIAVFGLTLSAQTPPDIQKTAKIGGNEASAPVIAPLGVGTAFNALLDDTLDTRKTKAGDSFTAETAEDVSYQRCMIFPKGTKIVGHVVRASAGGHGHAGSAIFVQFDKATTKDGQEVILNAGIQALAVAGVAPMSAASPKTGNAAPVEPIVDESASSAPPGDALVVSTLYPGQRSALDSQAPQGQFTSNGLFSENSKGAFGRPDMKVYTPTSEGSHGTVLLSSRKNMHLDAGTHLLLVVQPPPNADPVAPGSSTTDLDPQ*
Ga0132258_1091622823300015371Arabidopsis RhizosphereMIKFTAGLLILGATMAMAANAQNPPEIQRSSKTELTAQTPPAPVIAPLGIGTTFNATLDDTLDTRKTRAGDTVTAEIAEDVSYERCMIFPKGTKVTGHVVRVTSAGRGKAGSAIFMQFDKAMMKDGQEVTLHAGIQALAAAPLATDLAKSAASVPHPLPVEENSADNPGNSVTAGSPLIVSTVYESPKETLRPPMTPAPAALGEFNSDGLFTPESKGAFGRPDLKVYTPTSDGSHGTVLLSVRKNMHLDAGTHLLLVVQPPPSGDSDAGATSLDLDPQ*
Ga0066655_1037390813300018431Grasslands SoilMKSATSLLLAGALVFVLNASAQTPPNLQNSAKTDLAAQPPAPVIAPLGVGTAFNASLSDTLDTRKTRAGDAVTAEIAEDVSYERCVVLPKGTKVEGHVVRVTSGGRGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDSDSPKSAAPYRLPVEDNTSGTGTGKDALVSTIYEKPRRTLRTPLTPVPAAEGEFTSDGLFSPESKGAFGRPDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVIQPPPTDEPE
Ga0066669_1037736123300018482Grasslands SoilMKSAASLLLAGALVFVLNASAQTPPNLQNSAKTDLAAQPPAPVIAPLGVGTAFNASLSDTLDTRKTRAGDAVTAEIAEDVSYERCVVLPKGTKVEGHVVRVTSGGRGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDSDSPKSAAPHRLPVEDNTGGTGTGKDALVSTIYEKPRRTLRTPLPPVPAAEGEFTSDGLFSPESKGAFGRPDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVIQPPPTDEPEGSSANSLDLD
Ga0210407_1017526023300020579SoilMLPEWCLSFAVPSLSGFPKGGRVMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTAD
Ga0210407_1037616523300020579SoilMMRLMGSLVLMGALVVPGASGQAPSLLQPSGKKETKAQTPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPVTAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDADVPKSLDTAPHAVPVEDNSTGAAAGNATVVSTIYESPRTTLRTPLAAAPLAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGS
Ga0210399_10000211143300020581SoilMMKFAASVLFAGALLVLTANAQNPPDLQPATKTDSSQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDSVTAETVEDVSYQRCVIFPKGTKIVGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQDVILNAGIQALAVGAVAPMPLGTPKSEPAGAQAVPIVEDGAASAESSDALVVSTLYQEPRTTVRPPLMPGLAATGEFTSAGLFSLDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNLHLDAGTHLLLVVQPPPNADSEAPNSSSSDLDPQ
Ga0210399_1071251213300020581SoilASGQAPSLLQPSGKKETKVQAPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPITAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDTDAPKSLDTAPHAVPVEDNSAGAAGNNATVVSTIYESPRTSLRTPLAAAPVAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLVSAKKNMHIDNGTRLLLVVQPPPSGDSDAANPLDLDLSQ
Ga0210401_1000222863300020583SoilMIKFAASLLFAGALLALKANAQNPPDLQPSAKSSDVTQAPAPVIAPLGVGTAFNASLEDTLDTRKTKAGDIVTAEASEDVSYQRCVIFPKGTKITGHVVRVTSGGHGSAGSAIFIQFDKATVKEQEVILNAGIQALAVGAVSPMPSATPKTKSAAPQAVPVVEDGSGSVASSDAVVVSTIYQEQRNGLRAPLAPSPASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPSNADSETPTSSPTNLNPQ
Ga0210401_1001818023300020583SoilMLPEWCLSFAVPSLSGFPKGGRVMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0210406_1027066923300021168SoilFAASLLFASALMVLTAGAQNPPDLQQSAKTDSTEAPAAVIAPLGVGTAFNASLDDTLDTRKTKAGALVTAETSEDVTYQRCLIFPKGTKIVGHVVRVTAGGRGRAGSAIFIQFDKASVKDGQDVILNAGIQALAVGAVAPMPSSSPKDGATSTPAVPVLQDSAGSPESSDALVVSTLYQEPRSTLRPPRLRGLAPEGGLNSEGLFTADSKGAFGRPDLKIYTPTSEGSRGSVLVSSRKNMHLDAGTHLLLVVQPPSNRDVEAHSSSPTDPDPQ
Ga0210406_1040201823300021168SoilMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0210400_1049404623300021170SoilRIMLRLMGSLVLIGTLVVPGASGQAPSLLQPSGKKETKVQAPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPITAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDTDAPKSLDTAPHAVPVEDNSAGAAGNNATVVSTIYESPRTSLRTPLAAAPVAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLVSAKKNMHIDNGTRLLLVVQPPPSGDSDAANPLDLDLSQ
Ga0210408_1070599213300021178SoilMMRLMGSLVLMGALVVPGASGQAPSLLQPSGKKETKAQTPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPVTAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDADVPKSLDTAPHAVPVEDNSTGAAASNATVVSTIYESPRTTLRTPLAAAPLAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLVSAKKNMHIDSGTRLLLVVQP
Ga0210396_1011990933300021180SoilILNYLGVVGMSLIETKQIGYVLVRAVQKCNTAVMVSRFCGSQPSRFPKGGRVMMKFAASLLFAGALLVLTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDAVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0210388_1019837323300021181SoilMMKFAASLLLGALLVLTANAQTPPDIQRSSKTDPSQAPAPVIAPLGVGTAFNASLEDTLDTRKTKAGDLVTAETAEDVSYQQCIIFPKGTKIVGHIVRVTSGGRGRAGSAIFIQFDKATIKDGQEVILNAGIQALAVGAVAPMPGTSSKSGTVASQAVPVVDESAGTAASSDALVVSTLYQEPRSTLQPPLTAPSAAEGEFTSNGLFSLDSKGAFGRPDLKVYTPTSEGSRGTVLVSSKKNMHLDAGTHLLLVIQPPPSADPGAPGSSSTDLDPQ
Ga0210393_1044312023300021401SoilMIKFAASLLFAGVLLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDTVTAEASQDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVEDGQEVILNAGIQALAVGTVSPMPSATPKNKSTAPEAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGATSEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGS
Ga0210397_1010446523300021403SoilMIKFAASLLFAGALLALTANAQNPPDLQPSAKSSDVTQAPAPVIAPLGVGTAFNASLEDTLDTRKTKAGDIVTAEASEDVSYQRCVIFPKGTKITGHVVRVTSGGHGSAGSAIFIQFDKATVKEQEVILNAGIQALAVGAVSPMPSATPKTKSAAPQAVPVVEDGSGSVASSDAVVVSTIYQEQRNSLRAPLAPSPASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0210386_1001000943300021406SoilMVWFVPCRSAMLPEWCLSFAVPSLSGFPKGGRVMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0210383_1008934923300021407SoilMIKFAASLLFAGALLALTANAQNPPDLQPSAKSSDVTQAPAPVIAPLGVGTAFNASLEDTLDTRKTKAGDIVTAEASEDVSYQRCVIFPKGTKITGHVVRVTSGGHGSAGSAIFIQFDKATVKEQEVILNAGIQALAVGAVSPMPSATPKTKSAAPQAVPVVEDGSGSVASSDAVVVSTIYQEQRNSLRAPLAPSPASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPSNADSETPTSSPTNLNPQ
Ga0210394_1009169013300021420SoilNPPDLQPSAKSSDVTQAPAPVIAPLGVGTAFNASLEDTLDTRKTKAGDIVTAEASEDVSYQRCVIFPKGTKITGHVVRVTSGGHGSAGSAIFIQFDKATVKEQEVILNAGIQALAVGAVSPMPSATPKTKSAAPQAVPVVEDGSGSVASSDAVVVSTIYQEQRNGLRAPLAPSPASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPSNADSETPTSSPTNLNPQ
Ga0210394_1088934813300021420SoilMIKFAASLLFAGALLALTANAQNPPDLQPSAKSDATQAPAPVIAPLGVGTAFNATLDDTLDSRKTKAGDVVTAEASEDVSYQRCVIFPKGTKIAGQVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGAVAPMPSATTKSKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQQPRSTLRAPLTPSPASEGEFTSDGLFYPGSKGAFGRPDLKVYTPTSEGSHGTV
Ga0210384_1008283633300021432SoilMLRLMGSLVLIGTLVVPGASGQAPSLLQPSGKKETKVQAPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPITAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDTDAPKSLDTAPHAVPVEDNSAGAAGNNATVVSTIYESPRTSLRTPLAAAPVAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLVSAKKNMHIDNGTRLLLVVQPPPSGDSDAANPLDLDLSQ
Ga0210384_1078406813300021432SoilMMLKFSTSLLAGALMIALLVLTASAQDPPDVQRTAQPDSNHAPAPVIAPLGAGTAFNASLVEALDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVQFDKATTKDGQEVILNAGIQALAVSSVAPMPSNTPKSRVNGPAAVPVVKDGAGSAESSDPLVVSTLYQEPRNTLRPPLKTALTPEGEFTSDGLFSPDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHL
Ga0210391_1006148733300021433SoilMIKFAASLLFAGALLALTAHAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDAVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSAAPKSKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGATSEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0210391_1034441023300021433SoilMIKFAASLLFAGALLALTANAQNPPDLQPSAKSSDVTQAPAPVIAPLGVGTAFNASLEDTLDTRKTKAGDIVTAEASEDVSYQRCVIFPKGTKITGHVVRVTSGGHGSAGSAIFIQFDKATVKEQEVILNAGIQALAVGAVSPMPSATPKTKSAAPQALPVVEDGSGSVASSDAVVVSTIYQEQRNSLRAPLAPSPASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPSNADSETPTSSPTNLNPQ
Ga0210402_1008213333300021478SoilMMKFAASVLFAGALLVLTANAQNPPDLQPATKTDSSQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDSVTAETVEDVSYQRCVIFPKGTKIVGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQDVILNAGIQALAVGAVAPMPLGTPKSEPAGAQAVPIVEDGAASAESSDALVVSTLYQEPRTTVRPPLMPGLAATGEFTSAGLFSLDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNLHLDA
Ga0210402_1048895713300021478SoilMMRLMGSLVLMGALVVPGASGQAPSLLQPSGKKETKAQTPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPVTAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDADVPKSLDTAPHAVPVEDNSTGAAAGNATVVSTIYESPRTTLRTPLAAAPLAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLV
Ga0210410_1002134433300021479SoilMRLMGSLVLMGALVVPGASGQAPSLLQPSGKKETKAQTPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPVTAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDADVPKSLDTAPHAVPVEDNSTSAAASNATVVSTIYESPRTTLRTPLAAAPLAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLVSAKKNMHIDSGTRLLLVVQPPPSGDSDAANPLDLDLSQ
Ga0210409_1003948133300021559SoilMLKFSTSLLAGALMIALVVLTASAQDPPDVQRTAQPDSNHAPAPVIAPLGAGTAFNASLVETLDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVQFDKATTKDGQEVILNAGIQALAVSSVAPMPSNTPKSRVNGPAAVPVVKDGAGSAESSDPLVVSTLYQEPRSTLRPPLKAALTPEGEFTSDGLFSPDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPKADSDAPASSSTDLDPQ
Ga0242644_100796013300022498SoilCLSFAVPSLSGFPKGGRVMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0242642_103310713300022504SoilMMLKFSTSLLAGALMIALLVLTASAQDPPDVQRTAQPDSNHAPAPVIAPLGAGTAFNASLVEALDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVQFDKATTKDGQEVILNAGIQALAVSSVAPMPSNTPKSRVNGPAAVPVVKDGAGSAESSDPLVVSTLYQEPRNTLRPPLKTALTPEGEFTSDGLFSPDSKGAFGRPDLKVYTPTSEGSH
Ga0222729_101963513300022507SoilLVLTANAQNPPDLQPATKTDSSQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDSVTAETVEDVSYQRCVIFPRGTKIVGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQDVILNAGIQALAVGAVAPMPLGTPKSEPAGAQAVPIVEDGAASAESSDALVVSTLYQEPRTTVRPPLMPGLAATGEFTSAGLFSLDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNLHLDAGTHLLLVVQPPPNADSEAPNSSSSDLDPQ
Ga0242659_100476523300022522SoilCLSFAVPSLSGFPKGGRVMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0242663_101361123300022523SoilMKFAARVPFAGALLVLTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDAVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0242664_103377813300022527SoilVPSWFRAPVDRLLRFSSPAAKKETKAQTPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPVTAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDADVPKSLDTAPHAVPVEDNSTGAAASNATVVSTIYESPRTTLRTPLAAAPLAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLVSAKKNMHIDSGTRLLLVVQPPPSGDSDAANPLDLDLSQ
Ga0242660_106886313300022531SoilMMLKFSTSLLAGALMIALLVLTASAQDPPDVQRTAQPDSNHAPAPVIAPLGAGTAFNASLVEALDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVLFDKATTKDGQEVILNAGIQALAVSSVAPMPSNTPKSRVNGPAAVPVVKDGAGSAESSDPLVVSTLYQEPRNTLRPPLKTALTPEGEFTSDGLFSPDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLL
Ga0242660_107222113300022531SoilLTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQLPPTADSVAPSSSSTNLDPQ
Ga0242655_1011031213300022532SoilMMRLMGSLVLMGALVVPGASGQAPSLLQPSGKKETKAQTPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPVTAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDTDAPKSLDTAPHAVPVEDNSAGAAGNNATVVSTIYESPRTSLRTPLAAAPVAEGEFTSDGLFTPDSKGAFGRPDLKVYTP
Ga0242662_1003341313300022533SoilCLSFAVPSLSGFPKGGRVMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDAVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKSKSTAPQAEPVVEDDSESVVSSDAVVVSTLYQAPRSTLRTPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0242662_1011689213300022533SoilIAPLGIGTTFNANLEETLDTRKTRAGDPISAEISEDVNYERCMILPKGTKVTGHIVRVTSGAHGRTGSAIFVMFDKAMLKDGQEVMLNAGIQALATGPVASLPESSKSLNAGPHSVPVDDTTGGTAAASDALVVSTIYEAPRATIRPPMTQAPATEGSVGPDGLFTPESKGAIGRPDLKVYTPTSEGSHGTVLLSTRKNLRLDSGTHLLLVVQPPPTGDADPAAASSLDLDPQ
Ga0212123_10000827393300022557Iron-Sulfur Acid SpringMIKFAASLLFTGALLVLTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDTVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGHGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0212123_1008044113300022557Iron-Sulfur Acid SpringVQRAVKTDSNHAPAPVIAPLGAGTAFNASLVDTLDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVQFDKATTKDGQEVILNAGIQALAVSGVAPMPSNAPKSKVNGPAAVPVVEDGSVSTESNDPLVVSTLYQEPRSALRPPLKAALTPEGEFTSDGLFSPDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPNADSDAPGSSTTDLDPQ
Ga0242653_101098813300022712SoilLLVLTANAQNPPDLQPSAKSDVAQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDAVTAEASEDVSYQRCVVFPKGTKITGHVVRVTSSGRGSAGSAIFIQFDKANVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0242675_101910413300022718SoilMMRLMGSLVLMGALVVPGASGQAPSLLQPSGKKETKAQTPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPVTAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDADVPKSLDTAPHAVPVEDNSTGAAAGHATVVSTIYESPRTTLRTPLAAAPLAEGEFTSDGLCTPDSKGAFGRPDLKVYTPTSDGSHGTVLVSAKKNMHIDSGTRLLLVVQPPPSGDSDAANPLDLDLSQ
Ga0242672_103831213300022720SoilAPLGVGTAFNATLDDTLDTRKTKAGDTVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVSPMPSATPKNKSTAPEAVPVVEDDSESVVSSDAVVVSTLYQSPRSTLRAPLAPGSVSEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0242665_1001148133300022724SoilCLSFAVPSLSGFPKGGRVMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATLKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0242665_1009549213300022724SoilMMLKFSTSLLAGALMIALLVLTASAQDPPDVQRTAQPDSNHAPAPVIAPLGAGTAFNASLVEALDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVQFDKATTKDGQEVILNAGIQALAVSSVAPMPSNTPKSRVNGPAAVPVVKDGAGSAESSDPLVVSTLYQEPRSTLRPPLKAALTPEGEFTSDGLFSPDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPKADSDAPASSSTDL
Ga0242654_1007996713300022726SoilLCVGTAFNASLDESLDTRKTRAGDPITAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDTDAPKSLDTAPHAVPVEDNSAGAAGNNATVVSTIYESPRTSLRTPLAAAPVAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLVSAKKNMHIDNGTRLLLVVQPPPSGDSDAANPLDLDLSQ
Ga0242654_1008849713300022726SoilMMRLMGSLVLMGALVVPGASGQAPSLLQPSGKKETKAQTPPAAVVAPLCVGTAFNASLDESLDTRKTRAGDPVTAEVAEDVTYERSIIFPKGTKIVGHVVRVTSGGRGRAGSAIFVQFDKATLKDGQEVILNAGIQALAVGAVAPLTPDADVPKSLDTAPHAVPVEDNSTGAAAGNATVVSTIYESPRTTLRTPLAAAPLAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVWFRRRRTCISIAARACYWWCSRRRPETRTPRTPWTWI
Ga0207712_1047861013300025961Switchgrass RhizosphereAEAVGRVVEPWFEEWVQEAPERLLRHSIANHSATVLNSAMVTICSASAQTPPNLQNSAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVAAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGAGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTDSPKSAAPHRLPVENNTGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ
Ga0207676_1094963213300026095Switchgrass RhizosphereMKSAASLLLAVALVLVLSASAQTPPNLQNSAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVAAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGAGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTDSPKSAAPHRLPVENNTGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLL
Ga0209161_1027096013300026548SoilAGALVFVLNASAQTPPNLQNSAKTDLAAQPPAPVIAPLGVGTAFNASLSDTLDTRKTRAGDAVTAEIAEDVSYERCVVLPKGTKVEGHVVRVTSGGRGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDSDSPKSAAPHRLPVEDNTGGTGTGKDALVSTIYEKPRRTLRTPLPPVPAAEGEFTSDGLFSPESKGAFGRPDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVIQPPPTDEPEGSSANSLDLDPH
Ga0209528_108724313300027610Forest SoilKSLTGLLAGALMIALVVLTASAQDPPDVQRTAKPDSNHAPAPVIAPLGAGTAFNASLVETLDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVQFDKATTKDGQEVILNAGIQALAVSGVAPMPSNTPKSRVNGSAAVPVVEDGAGSAESSDPLVVSTLYQEPRSALRPPLKAALTPEGEFTSDGLFSPESKGAFGRPDLKVYTPTS
Ga0209006_1009218333300027908Forest SoilMIKFAASLLFAGALLVLTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDTVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVSPMPSATPKNKSTAPEAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGATSEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSH
Ga0209526_1000542533300028047Forest SoilMMKLAASLVLLGALVGPGASAQAPSLLPPSAKKELTPHAPPATVVAPLGIGTAFNAFLDDSLDTRKTKAADPITAEVAEDVTYERSTIFPKGTKIMGHVVRVTSGGRGRAGCAIFVQFDKAILKDGQEVVLNAGIQALAVGTVAPLQDMDTPKNDMETAPHALPVEDNSSNPAANSGALVVSTTYEAPRNALRAPLAVAPVAEGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLLSAKKNMHLDSGTRLLLVVQPPPSADADPTNSLDLDQIQ
Ga0209526_1024693223300028047Forest SoilMMKFAASLFFAGALLVLGAGAQTPPDLQQNAKTQAKDPTPPAPVIAPLGVGTAFNASLGDTLDTRKTRTGDAVTAETAEDVSYERSVIFPKGTKVIGHIVRVSSGGRGRAGSAIFIQFDKALLNDGQEVILNAGIQALAVGAVAPMPSDADAPKSLNRPPHAMPVEDNTTGATSTDALVVSTIYEAPRATVRAPLAAPLVAEGEFTSGGLFTPESKGAFGRPDVKVYTPTSEGSHGTVLLSTRKNMHLDGGTRLLLVVQPPPSGDSDAPNSLDLDPNQ
Ga0268264_1000782633300028381Switchgrass RhizosphereMKSAASLLLAVALVLVLSASAQTPPNLQNSAKTDLTAQPPAPVIAPLGVGTVFNALLSDTLDTRKTRAGDPVAAEIAEDVSYERCVVFPKGTKVEGHVVRVNSGAGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGAIAPMPSDTDSPKSAAPHRLPVENNTGGSGTSNDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSEGLFSPESKGAFGRLDLKIYTPTSEGSHGTVLLSARKNMRLDSGTHLLLVVQPPPTDEPEGSSASSLDLDPQ
Ga0138302_113610513300030937SoilEWCLSFAVPSLSGFPKGGRVMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0075379_1098771213300030946SoilMIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNAALEDMLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKENMHLDAGPHLLLVVQPPP
Ga0073996_119863091	3300030998	Soil	MKKFAVRLLMFGATMALGVMSTTGARAQNPPEIQRASKSESKAQTPPARVIAPLGIGTTFNATLSDSLDTRKTRAGDLVTAEIAEDVSYEQCMIFPKGTKVTGHIVRVTSAGRGKAGSAIFVQFDKAVMKDGQEVMLNAGIQALAANAVTASAEAGAAKSLANGPRTVPVEDNSGSTPTVSDALIVSTIYDAPRTTLRAPMAPAPVAEGEFGSDGLFTPESKGAFGRPDLKVYTPTSEGSHGTVLLSARKNMHLDGGTHLLLVVQPPPSGDSDPASAT
Ga0170834_1001896502	3300031057	Forest Soil	MIKFAVSLLFAAALLALTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNASLDDTLDTRKTKAGDIVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNL
Ga0170823_168551202	3300031128	Forest Soil	MKSAATLFLAGSLVFVLGAKAQTQPTPPSIQTNSAQTDLKAQTPPAPVIAPLAVGTVFNAVLGDTLDAGKTRAGDLISAEVAEDVSYERATVFPKGTKLTGHVVRVTSGAHGKAGSAIFVQFDKATMKDGQEVMLNAGIQALAVGAVAPMPSSADSAKNSGPRTLPVDENAAGAESSDDALVVSTIYERPRTTLRTPLTPAPAAEGEFTSNGLFSPESKGAFGRPDLRIYTPTSEGSHGTVLLTARKNLHLDSGTHLLLVVQPPPTDEPEGSSASSLNLDPQ
Ga0170824_1123295931	3300031231	Forest Soil	MMKFAATLLFAGALLVLTANAQNPPDIQRSTKTDSTQAPAAVIAPLGVGTAFNASLVDTLDTRKLKAGDPVTAETSEDVSYQRCMIFPKGTKIVGHVVRVTSGGHGRAGSAIFIQFDKASVKDGQDVILNAGIQALAVGAVAPMPSGTPKPGATSGQVPVAQDSAGSKESSDALVVSTLYQEPRGTLRPPVVRGLESAGEFNSNGLFTADSKGAFGRPDLKVYTPTSEGSHGSVLVSSRKNMHLDAGTHLLLVVQPPPNVDSEAPVSAPTDLDPQ
Ga0307483_10102231	3300031590	Hardwood Forest Soil	MKFAASLFFAGALLVLTANAQNPPDIQPSAKTDSTQAPAAVIAPLGVGTAFNASLIDTLDTRKLKAGDPVTAETSEDVSYQRCMIFPKGTKIVGHVVRVTSGGRRRAGSAIFIQFDKAIVKDGQDVILSAGIQALAVGAVAPMPSGAPKTGATSAQVPVVQDSAGSKESSDALVVSTLYQEPRSTLRPPVVRGLESAGQFNSDGLFTSDSKGAFGRPDLKVYTPTSEGSHGSVLVSSRKNMHLDAGTHLLL
Ga0307469_100475262	3300031720	Hardwood Forest Soil	MKSAASILLAGALVFVLRAEAQTPPNLQSSKTGNNVQTPPAPVIAPLGVGTVFNAALEGTLDTRKTHAGDAVVAETMEDVTYERCMVFPKGTKIMGHVVRVTSGARGKAGSAIFIQFDKAMMKDGQEVMLNAGIQALAVGTVAPMPSEADAPKATSPHALAVEDTSARTASSSDALVVSTIYEAPRTTLRTPLTPAPAAEGEFTSAGLFSPESKGAFGRPDLKIYTPTSEGSHGTVLLSARKIMHLDSGTHLLLVVQPPPTDESEATGTGSLDLDPQ
Ga0307469_105861502	3300031720	Hardwood Forest Soil	MMKFAASLLFAGALLVLTANAQNPPDIQRSTKTDSTQAPAAVIAPLGVGTAFNASLVDTLDTRKLKAGDPVTAETSEDVSYQRCMIFPKGTKIVGHVVRVTSGGRGRTGSAIFIQFDKASVKDGQDVILNAGIQALAVGAVAPMPSGAPKTGATSAQVPVVQDSAGSKESSDALVVSTLYQEPRSTLRPPVVRGLESAGEFNSNGLFTADSKGAFGRPDLKVYTPTSEGSHGSVLVSSRKNMHLDAGTHLLLVVQPPPNVDSEAPVSAPTDLDPQ
Ga0307477_104003771	3300031753	Hardwood Forest Soil	MMRFAASLALAGAFLVSGTHAQNPPDLQRTGKSEAKSAELPPAPVIAPLGAGTAFNASLDETLDTRKSRAGDTVTAETAEDVSYQRCLIFPKGTKIVGHIVRVTSGGRGKAGSAIFVQFDKAMMKDGQEVILNAGIQALAVVGIAPVPSDAEKRLDAATRQLPVQDSSANSATSDALVVSTIYESPRTTLRPPLGAPLVTEGEFTSDGLFTPESKGAFGRPDLKVYTPTSEGSHGTVLLSAKKNMHLDGGTHLLLVVQPPPPTGESDMTGPLDLDPNQ
Ga0307478_102034891	3300031823	Hardwood Forest Soil	VMIKFAASLLFAGALLALTAHAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDTVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGRGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGAASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPTADSVAPSSSSTNLDPQ
Ga0307479_100152415	3300031962	Hardwood Forest Soil	MLKFSTSLLAGALMIALLVLTASAQDPPDVQRTAQPDSNHAPAPVIAPLGAGTAFNASLVEALDSRKTKAGDLVSAETVEDVAYQQCTIFPKGTKITGHVVRVTSGGRGQTGSAIFVQFDKATTKDGQEVILNAGIQALAVSSVAPMPSNTPKSRVNGPAAVPVVKDGAGSAESSDPLVVSTLYQEPRNTLRPPLKTALTPEGEFTSDGLFSPDSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPPKADSDAPASSSTDLDPQ
Ga0307479_100478263	3300031962	Hardwood Forest Soil	MMKFAASLLFAGALLVLTANAQNPPDIQPFAKTDSTQAPAAVIAPLGVGTAFNASLVDTLDTRKLKAGDPVTAETSEDVSYQRCMIFPKGTKIVGHVVRVTSGGRAGSAIFIQFDKAIVKDGQDVILSAGIQALAVGAVAPMPAGAPKTGATSAQVPVVQDSAGSKESSDALVVSTLYQEPRSTLRPPVVRGLESAGEFNSDGLFTSDSKGAFGRPDLKVYTPTSEGSHGSVLVSSRKNMHLDAGTHLLLVVQPPPKVDSEAPVSAPTDLAPQ
Ga0307470_100082013	3300032174	Hardwood Forest Soil	MMKFAATLLLAGALLVLTANAQNPPDIQRSAKTDSTQAPAAVIAPLGVGTAFNASLVDTLDTRKLKAGDPVTAETSEDVSYQRCMIFPKGTKIVGHVVRVTSGGHGRAGSAIFIQFDKASVKDGQDVILNAGIQALAVGAVAPMPSGTPKTGATSTQAVPVVQDSAGSTQSSDALVVSTLYQEPRSTLRPPVVRGLESAGEFNSNGLFTADSKGAFGRPDLKVYTPTSEGSHGSVLVSSRKNMHLDAGTHLLLVVQPPPNVDSEAPVSAPTDLDPQ
Ga0307471_1002609942	3300032180	Hardwood Forest Soil	MMKFAASLLFAGALLVLTANAQNPPDIQRSAKTDSTQAPAAVIAPLGVGTAFNASLVDTLDTRKLKAGDPVTAETSEDVSYQRCMIFPKGTKIVGHVVRVTSGGHGRAGSAIFIQFDKASVKDGQDVILNAGIQALAVGAVAPMPSGTPKTGATSTQAVPVVQDSAGSTQSSDALVVSTLYQEPRSTLRPPVVRGLESAGEFNSNGLFTADSKGAFGRPDLKVYTPTSEGSHGSVLVSSRKNMHLDAGTHLLLVVQPPPNVDSEAPVSAPTDLDPQ
Ga0348332_129713872	3300032515	Plant Litter	MKMKLTAGLLFAGALLVLGAGAQTPPDLAASAKKQVTREQTPPAPVVAPLCIGTAFNASLDDTLDTRKTRAGDPVTAEVTEDVNYERSVIFPKGTKVVGHVVRVTSGGRGRAGSAIFIQFDKAILKDGQEVILNAGIQALAVGTIAPLPSDADTPKNLESAPHAVPVEDNAGNPAAGSDALVVSTIYEAPRSVLRAPLGPAPVAQGEFTSDGLFTPDSKGAFGRPDLKVYTPTSDGSHGTVLLSAKKTMHLDAGTHLLLVVQPPPTGETESANPLDLDLQ
Ga0315742_109184621	3300032756	Forest Soil	MIKFAASLLFTGALLVLTANAQNPPDLQPSAKSDVTQAPAPVIAPLGVGTAFNATLDDTLDTRKTKAGDTVTAEASEDVSYQRCIVFPKGTKITGHVVRVTSGGHGGAGSAIFIQFDKATVKDGQEVILNAGIQALAVGTVAPMPSATPKNKSTAPQAVPVVEDDSESVVSSDAVVVSTLYQAPRSTLRAPLTPGSASEGEFTSDGLFSPGSKGAFGRPDLKVYTPTSEGSHGTVLLSSKKNMHLDAGTHLLLVVQPPTADSVASS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice