NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F040352

Metagenome / Metatranscriptome Family F040352

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F040352
Family Type Metagenome / Metatranscriptome
Number of Sequences 162
Average Sequence Length 180 residues
Representative Sequence MKDLLKPLAIAGVAFLGCVANGTAQVQQSITFSLTVYDQSDTVVRTLRVSTKDVIENLAGTNVPGGKLWLVMPTDPSPDGNGTIGAFLRVTDSHGNIIVDTTTDTFNIYQTAFSQTSARTYAWNQFSLAFGGLGAELYGTATWSKSSRSPGGQGSFHCSVSGHCALGGVTSGEDPCTGSISGGAPKPAS
Number of Associated Samples 106
Number of Associated Scaffolds 161

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 46.88 %
% of genes near scaffold ends (potentially truncated) 57.41 %
% of genes from short scaffolds (< 2000 bps) 81.48 %
Associated GOLD sequencing projects 93
AlphaFold2 3D model prediction Yes
3D model pTM-score0.79

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (74.691 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(22.222 % of family members)
Environment Ontology (ENVO) Unclassified
(41.975 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(58.025 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 6.91%    β-sheet: 44.70%    Coil/Unstructured: 48.39%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.79
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
f.4.4.2: OMPT-liked1k24a_1k240.56
f.4.1.1: OMPA-liked1p4ta_1p4t0.56
f.4.1.4: OMPA-liked5b5eo_5b5e0.54
f.4.3.2: Porinsd1a0tp_1a0t0.53
b.61.6.1: YceI-liked1wuba_1wub0.52


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 161 Family Scaffolds
PF01253SUI1 3.11
PF01894UPF0047 2.48
PF12951PATR 1.86
PF00072Response_reg 1.24
PF13287Fn3_assoc 1.24
PF08530PepX_C 1.24
PF13306LRR_5 0.62
PF13277YmdB 0.62
PF00196GerE 0.62
PF13432TPR_16 0.62
PF13426PAS_9 0.62
PF12344UvrB 0.62
PF02837Glyco_hydro_2_N 0.62
PF00903Glyoxalase 0.62
PF02481DNA_processg_A 0.62
PF01435Peptidase_M48 0.62
PF02836Glyco_hydro_2_C 0.62
PF13360PQQ_2 0.62
PF00271Helicase_C 0.62
PF01795Methyltransf_5 0.62
PF00289Biotin_carb_N 0.62
PF01120Alpha_L_fucos 0.62
PF00248Aldo_ket_red 0.62
PF13884Peptidase_S74 0.62
PF08780NTase_sub_bind 0.62
PF05957DUF883 0.62
PF11376DUF3179 0.62
PF08240ADH_N 0.62
PF00884Sulfatase 0.62
PF10518TAT_signal 0.62

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 161 Family Scaffolds
COG0023Translation initiation factor 1 (eIF-1/SUI1)Translation, ribosomal structure and biogenesis [J] 3.11
COG0432Thiamin phosphate synthase YjbQ, UPF0047 familyCoenzyme transport and metabolism [H] 2.48
COG0758Predicted Rossmann fold nucleotide-binding protein DprA/Smf involved in DNA uptakeReplication, recombination and repair [L] 1.24
COG2936Predicted acyl esteraseGeneral function prediction only [R] 1.24
COG3250Beta-galactosidase/beta-glucuronidaseCarbohydrate transport and metabolism [G] 1.24
COG027516S rRNA C1402 N4-methylase RsmHTranslation, ribosomal structure and biogenesis [J] 0.62
COG3669Alpha-L-fucosidaseCarbohydrate transport and metabolism [G] 0.62
COG4575Membrane-anchored ribosome-binding protein ElaB, inhibits growth in stationary phase, YqjD/DUF883 familyTranslation, ribosomal structure and biogenesis [J] 0.62


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A74.69 %
All OrganismsrootAll Organisms25.31 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2088090014|GPIPI_16588502Not Available1259Open in IMG/M
2088090014|GPIPI_17157619All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales9079Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101384806Not Available847Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101503271All Organisms → cellular organisms → Bacteria → Proteobacteria2223Open in IMG/M
3300000955|JGI1027J12803_100396042All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium7160Open in IMG/M
3300000955|JGI1027J12803_100645612Not Available2575Open in IMG/M
3300003321|soilH1_10251021All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes1967Open in IMG/M
3300003321|soilH1_10286502All Organisms → cellular organisms → Bacteria3338Open in IMG/M
3300003321|soilH1_10382725All Organisms → cellular organisms → Bacteria1347Open in IMG/M
3300004114|Ga0062593_100007161All Organisms → cellular organisms → Bacteria4860Open in IMG/M
3300004114|Ga0062593_100497205Not Available1129Open in IMG/M
3300004114|Ga0062593_101487189Not Available729Open in IMG/M
3300004114|Ga0062593_102048735Not Available637Open in IMG/M
3300004153|Ga0063455_100009577All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → unclassified Caulobacteraceae → Caulobacteraceae bacterium2170Open in IMG/M
3300004153|Ga0063455_101379631Not Available542Open in IMG/M
3300004157|Ga0062590_102637899Not Available534Open in IMG/M
3300004463|Ga0063356_100305783All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1975Open in IMG/M
3300004479|Ga0062595_100068298Not Available1731Open in IMG/M
3300004479|Ga0062595_100177895All Organisms → cellular organisms → Bacteria1283Open in IMG/M
3300004479|Ga0062595_102103304Not Available549Open in IMG/M
3300004480|Ga0062592_101035122Not Available753Open in IMG/M
3300004643|Ga0062591_100205977Not Available1450Open in IMG/M
3300004643|Ga0062591_100949417Not Available812Open in IMG/M
3300004643|Ga0062591_100999557Not Available795Open in IMG/M
3300004803|Ga0058862_10970170Not Available635Open in IMG/M
3300005178|Ga0066688_10138070Not Available1519Open in IMG/M
3300005329|Ga0070683_100499229Not Available1162Open in IMG/M
3300005329|Ga0070683_101285687Not Available703Open in IMG/M
3300005329|Ga0070683_101743476All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Isosphaerales → Isosphaeraceae → Singulisphaera → Singulisphaera acidiphila599Open in IMG/M
3300005330|Ga0070690_100057568All Organisms → cellular organisms → Bacteria2495Open in IMG/M
3300005332|Ga0066388_104444058Not Available714Open in IMG/M
3300005332|Ga0066388_108041159Not Available527Open in IMG/M
3300005338|Ga0068868_100071247All Organisms → cellular organisms → Bacteria2772Open in IMG/M
3300005340|Ga0070689_101713694Not Available572Open in IMG/M
3300005526|Ga0073909_10051495Not Available1496Open in IMG/M
3300005526|Ga0073909_10221899All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas → Pseudomonas putida group → Pseudomonas putida827Open in IMG/M
3300005526|Ga0073909_10623188Not Available534Open in IMG/M
3300005535|Ga0070684_100337908All Organisms → cellular organisms → Bacteria1384Open in IMG/M
3300005561|Ga0066699_10500838Not Available869Open in IMG/M
3300005576|Ga0066708_10779473All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfovibrionales → Desulfovibrionaceae → Desulfovibrio → Desulfovibrio desulfuricans601Open in IMG/M
3300005577|Ga0068857_100008516All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia8868Open in IMG/M
3300005598|Ga0066706_10389910All Organisms → cellular organisms → Bacteria1108Open in IMG/M
3300005618|Ga0068864_102195759Not Available558Open in IMG/M
3300005764|Ga0066903_104684696Not Available728Open in IMG/M
3300005764|Ga0066903_108289691Not Available531Open in IMG/M
3300006163|Ga0070715_10949562Not Available532Open in IMG/M
3300006237|Ga0097621_100188634Not Available1784Open in IMG/M
3300006797|Ga0066659_10784826Not Available786Open in IMG/M
3300006854|Ga0075425_100216706Not Available2202Open in IMG/M
3300006954|Ga0079219_11334701Not Available634Open in IMG/M
3300007265|Ga0099794_10055670All Organisms → cellular organisms → Bacteria1912Open in IMG/M
3300009012|Ga0066710_102946877Not Available665Open in IMG/M
3300009098|Ga0105245_10679382Not Available1062Open in IMG/M
3300009101|Ga0105247_11101946Not Available626Open in IMG/M
3300009137|Ga0066709_100402892All Organisms → cellular organisms → Bacteria1898Open in IMG/M
3300009143|Ga0099792_10399294Not Available841Open in IMG/M
3300009545|Ga0105237_12383555Not Available539Open in IMG/M
3300009553|Ga0105249_12992602Not Available543Open in IMG/M
3300009553|Ga0105249_13030167Not Available539Open in IMG/M
3300009792|Ga0126374_10187504Not Available1295Open in IMG/M
3300010147|Ga0126319_1258761Not Available756Open in IMG/M
3300010159|Ga0099796_10369665Not Available622Open in IMG/M
3300010360|Ga0126372_11654641Not Available680Open in IMG/M
3300010362|Ga0126377_11494030Not Available749Open in IMG/M
3300010397|Ga0134124_12998884Not Available516Open in IMG/M
3300010397|Ga0134124_13213812Not Available501Open in IMG/M
3300010399|Ga0134127_10817205Not Available982Open in IMG/M
3300010401|Ga0134121_10877125Not Available869Open in IMG/M
3300010401|Ga0134121_11320915Not Available728Open in IMG/M
3300010403|Ga0134123_12723347Not Available562Open in IMG/M
3300011119|Ga0105246_11078638Not Available732Open in IMG/M
3300011119|Ga0105246_11888352Not Available573Open in IMG/M
3300011120|Ga0150983_11165956Not Available514Open in IMG/M
3300011120|Ga0150983_13187985Not Available656Open in IMG/M
3300012200|Ga0137382_11033589Not Available589Open in IMG/M
3300012200|Ga0137382_11245697Not Available527Open in IMG/M
3300012201|Ga0137365_10929855Not Available633Open in IMG/M
3300012202|Ga0137363_10097263All Organisms → cellular organisms → Bacteria2229Open in IMG/M
3300012205|Ga0137362_10715534Not Available860Open in IMG/M
3300012205|Ga0137362_11727069Not Available513Open in IMG/M
3300012208|Ga0137376_10675773Not Available891Open in IMG/M
3300012211|Ga0137377_11608139Not Available573Open in IMG/M
3300012212|Ga0150985_103762716Not Available585Open in IMG/M
3300012212|Ga0150985_107291826Not Available759Open in IMG/M
3300012212|Ga0150985_108837087Not Available721Open in IMG/M
3300012212|Ga0150985_115794995Not Available810Open in IMG/M
3300012212|Ga0150985_117659489Not Available743Open in IMG/M
3300012351|Ga0137386_10482373Not Available893Open in IMG/M
3300012353|Ga0137367_10357426Not Available1039Open in IMG/M
3300012359|Ga0137385_10688662Not Available854Open in IMG/M
3300012361|Ga0137360_10051558All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → unclassified Nitrospira → Nitrospira bacterium SG8_35_42966Open in IMG/M
3300012361|Ga0137360_11488302Not Available581Open in IMG/M
3300012361|Ga0137360_11873152Not Available506Open in IMG/M
3300012469|Ga0150984_101664631Not Available3433Open in IMG/M
3300012469|Ga0150984_102772170All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1763Open in IMG/M
3300012469|Ga0150984_103428386Not Available809Open in IMG/M
3300012469|Ga0150984_106323959Not Available557Open in IMG/M
3300012469|Ga0150984_111033637Not Available753Open in IMG/M
3300012469|Ga0150984_115504661Not Available540Open in IMG/M
3300012469|Ga0150984_117229462Not Available887Open in IMG/M
3300012469|Ga0150984_119825436Not Available632Open in IMG/M
3300012469|Ga0150984_120387714Not Available869Open in IMG/M
3300012683|Ga0137398_10367902Not Available975Open in IMG/M
3300012683|Ga0137398_10531019Not Available810Open in IMG/M
3300012685|Ga0137397_10016238Not Available5207Open in IMG/M
3300012685|Ga0137397_10016238Not Available5207Open in IMG/M
3300012917|Ga0137395_10349241Not Available1054Open in IMG/M
3300012922|Ga0137394_10094137All Organisms → cellular organisms → Bacteria2517Open in IMG/M
3300012922|Ga0137394_10120804Not Available2217Open in IMG/M
3300012922|Ga0137394_10239704Not Available1550Open in IMG/M
3300012922|Ga0137394_10426842All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1129Open in IMG/M
3300012925|Ga0137419_10572305Not Available905Open in IMG/M
3300012929|Ga0137404_10727991Not Available898Open in IMG/M
3300012929|Ga0137404_10959288Not Available781Open in IMG/M
3300012930|Ga0137407_11480941Not Available646Open in IMG/M
3300012944|Ga0137410_10638751Not Available883Open in IMG/M
3300012960|Ga0164301_11228896Not Available603Open in IMG/M
3300012989|Ga0164305_11735089Not Available562Open in IMG/M
3300013297|Ga0157378_10430017Not Available1306Open in IMG/M
3300013297|Ga0157378_11641681Not Available689Open in IMG/M
3300013307|Ga0157372_11515801All Organisms → cellular organisms → Bacteria772Open in IMG/M
3300014326|Ga0157380_11158574Not Available815Open in IMG/M
3300014969|Ga0157376_10361394Not Available1392Open in IMG/M
3300015241|Ga0137418_10176146All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Sinobacteraceae → Nevskia → Nevskia soli1866Open in IMG/M
3300015371|Ga0132258_10008169All Organisms → cellular organisms → Bacteria21299Open in IMG/M
3300015371|Ga0132258_10011056All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales18833Open in IMG/M
3300015371|Ga0132258_10378043All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → unclassified Verrucomicrobia subdivision 3 → Verrucomicrobia subdivision 3 bacterium3512Open in IMG/M
3300015371|Ga0132258_11147335All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1964Open in IMG/M
3300015371|Ga0132258_11258203Not Available1870Open in IMG/M
3300015371|Ga0132258_13058765All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium1158Open in IMG/M
3300015373|Ga0132257_100698875Not Available1260Open in IMG/M
3300015374|Ga0132255_100292122All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula → Pedosphaera parvula Ellin5142344Open in IMG/M
3300016404|Ga0182037_12086852Not Available510Open in IMG/M
3300018433|Ga0066667_11235884Not Available651Open in IMG/M
3300018468|Ga0066662_10706673Not Available963Open in IMG/M
3300019888|Ga0193751_1234096Not Available582Open in IMG/M
3300019890|Ga0193728_1069498Not Available1676Open in IMG/M
3300019996|Ga0193693_1000552All Organisms → cellular organisms → Bacteria7731Open in IMG/M
3300020140|Ga0179590_1097535Not Available790Open in IMG/M
3300021168|Ga0210406_10719822Not Available767Open in IMG/M
3300021168|Ga0210406_11299449Not Available524Open in IMG/M
3300021411|Ga0193709_1000001All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia296769Open in IMG/M
3300024288|Ga0179589_10039322All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1719Open in IMG/M
3300025905|Ga0207685_10736863Not Available539Open in IMG/M
3300025916|Ga0207663_10544202Not Available907Open in IMG/M
3300025924|Ga0207694_11259888Not Available625Open in IMG/M
3300025927|Ga0207687_10549143Not Available969Open in IMG/M
3300025934|Ga0207686_10862735Not Available729Open in IMG/M
3300025944|Ga0207661_10480468Not Available1134Open in IMG/M
3300025961|Ga0207712_11743263Not Available558Open in IMG/M
3300025961|Ga0207712_12054582Not Available511Open in IMG/M
3300026116|Ga0207674_10024579All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium6434Open in IMG/M
3300027821|Ga0209811_10117325Not Available968Open in IMG/M
3300027903|Ga0209488_10176302All Organisms → cellular organisms → Bacteria → Proteobacteria1614Open in IMG/M
3300031057|Ga0170834_100110103All Organisms → cellular organisms → Bacteria793Open in IMG/M
3300031058|Ga0308189_10061228Not Available1082Open in IMG/M
3300031720|Ga0307469_11712070Not Available606Open in IMG/M
3300032893|Ga0335069_10047946All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula5639Open in IMG/M
3300033412|Ga0310810_10049967All Organisms → cellular organisms → Bacteria5097Open in IMG/M
3300033412|Ga0310810_10114093Not Available3201Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil22.22%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil7.41%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil6.79%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil5.56%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere5.56%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere4.94%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil3.70%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere3.70%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere3.09%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere3.09%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere3.09%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.47%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.47%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil2.47%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.85%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil1.85%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.85%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.85%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.23%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.23%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.23%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere1.23%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere1.23%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.23%
SoilEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil0.62%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.62%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.62%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.62%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.62%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.62%
Host-AssociatedHost-Associated → Human → Digestive System → Large Intestine → Fecal → Host-Associated0.62%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.62%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.62%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.62%
Miscanthus RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere0.62%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.62%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.62%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.62%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300003321Sugarcane bulk soil Sample H1EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004153Grasslands soil microbial communities from Hopland, California, USA (version 2)EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300004803Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-2 (Metagenome Metatranscriptome)Host-AssociatedOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005329Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaGEnvironmentalOpen in IMG/M
3300005330Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaGEnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005338Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2Host-AssociatedOpen in IMG/M
3300005340Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaGEnvironmentalOpen in IMG/M
3300005526Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1EnvironmentalOpen in IMG/M
3300005535Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.2-3L metaGEnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005577Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2Host-AssociatedOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005618Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2Host-AssociatedOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006237Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)Host-AssociatedOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009101Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaGHost-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009545Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaGHost-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010147Soil microbial communities from California, USA to study soil gas exchange rates - BB-CA-RED metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300013307Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaGHost-AssociatedOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300014969Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaGHost-AssociatedOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016404Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019888Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c2EnvironmentalOpen in IMG/M
3300019890Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c1EnvironmentalOpen in IMG/M
3300019996Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3a2EnvironmentalOpen in IMG/M
3300020140Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021411Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3c2EnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300025905Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025924Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025934Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025944Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026116Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027821Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031058Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_184 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032893Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_009791902088090014SoilMQPPTAGRMQGTQDCPPGLEGVRMNAARPRHLTLAAQPNGKLRDETMKKSLKSLAIAGIAVLASVASSTAQVQQSISFSLTIYDQSDTGVRALRVSNKDVIENLAGAXXXWLVMPSDPSPDGSGNIGAFLRVTDSHGNIIVETTSDSFNIYQTSFSQTSTHTYAWNQFSLDFGGLGSELYGTAIWTRSFRDPLGQGAFHCSVSGHFALGGVTDGSQPCSGSISGSAPKPAG
GPIPI_031574902088090014SoilMAFLGYAASGSAQVQQSISFSFTVYNQTDTGVRTVRLGTKDVIENLAGIKVPGGKLWLVMPTDPSVDGNGTLGAILRVTDARGNIIAQTTTDSFNLYQSSFSQTATHTYAWNGFSLAFGGLDAELYGTAIWSKNFRNEGGQGTV
INPhiseqgaiiFebDRAFT_10138480613300000364SoilMNAARPRHLTLAAQPNGKLRDETMKKSLKSLAIAGIAVLASVASSTAQVQQSISFSLTIYDQSDTGVRALRVSNKDVIENLAGAAVPGGKLWLVMPSDPSPDGSGNIGAFLRVTDSHGNIIVETTSDSFNIYQTSFSQTSTHTYAWNQFSLDFGGLGSELYGTAIWTRSFRDPLGQGAFHCSVSGHFALGGVTDGSQPCSGSISGSAPKPAG*
INPhiseqgaiiFebDRAFT_10150327133300000364SoilMAGLGGVGNGRAQVQQSVNLSLTVYNQTDAGVRALRVTTRDVIQNLAGTNVPGGRLWLVMPTDPGVDGNGTIGAFLRVTDWRGNVIQQTTTDSFNIYQNTAAQTTTRTYAWNGFSLSFGGLGAELFGTAIWNKGLWGPGGLGSFHCSVNGYCALGGVTDGQKPCIGSISGSAPSPAR*
JGI1027J12803_10039604213300000955SoilVYNQTDTGVRTVRLGTKDVIENLAGIKVPGGKLWLVMPTDPSVDGNGTLGAILRVTDARGNIIAQTTTDSFNLYQSSFSQTATHTYAWNGFSLAFGGLDAELYGTAIWSKNFRNAGGQGTVHCSVSGFLRLSGLTDGQQPCTGSIAAGPPRRAN*
JGI1027J12803_10064561243300000955SoilMAFLGFVANSTAQVQQSINFSLTIYDQSDTRVRTLRVSNKDIIENLAGTKVRGGKLWLVMPDDPGVDRNGTIGALLRVTDAHGNVVVDTTTDSHNIYQNTASETATRTYAWNGFSLAFGGLGAELFGTATWTKGRRSPGGLGSFHCSVSGYCALSGITEGQKPCIGSISGSTPRAAH*
soilH1_1025102123300003321Sugarcane Root And Bulk SoilMKMTSKAFAIAGLTVLGFVTNSTAQVQQTISFNLTVYNQTETGVRALRVSNKDIIQNLTGTNVPGNRLWLIQPSDPGIDGNGTIGAFLRVTDARGNILAETTTDSFNIYQNTASQTATRTYAWNGFSLSFGGLGAELFGTAIWSKSARGAGGLGAFHCSVNGYCALGGITDGQEPCVGSIAGGVPGAVH*
soilH1_1028650213300003321Sugarcane Root And Bulk SoilMKRLLKALAIAGVAVLGFVANSKAQVQQNISFSLTIFNQTDAGVRPLRITTREVIANLAGTNVPGGRLWLVMPNDPSPDGSGNIGAWLRVTDARGNIIVETTSDYFNIYQPTFSQNASQTYAWNQFSLAFGGIGAEVYGTATWTRGLRGPGGLGAFHCTVSGHCGLAGITNGDMPCTGSISGGPARPAS*
soilH1_1038272513300003321Sugarcane Root And Bulk SoilMKKLIKSLAIAGVAFLGLVASGIAQVQQSISFSLTVYDETDTGIRAVRLTTRDIIVNLAGTNVPGGKLWLVMPNDPTPGANGNIGAFLRVTDARGNVIVDTTSDLFNIYQTYSSETSTRIYAWNQFSLAFGGIGAELYGTTLWTKSPRGPGGQGAFHCTVSGHCGLG
Ga0062593_10000716133300004114SoilMAFLGCLANGIAQVQQSIGFTLTVYDQADSGVRSLRVSSKDVIEHLAGTKVPGAKLWLVMPDDPGVDRNGTIGAVLEVTDSRGNVVAKTTTRSFNIYQNTAAQTDSRTYAWNGFSLDFGGLEAELYGTAIWSKSSRGPGGLGTFHCSVNGFCALGGITAGQQPCIGSISGGAPKPAS*
Ga0062593_10049720513300004114SoilMAFLGCIANSTAQVQQSINFTLTIYDQSDTGIRTLRVSNKDIIQNLAGTNVPGGKLWLMMPPSPGVDGNGTIGAFLRVTDAHGNVIVDTTTDSFNIYQNTASQTSTRTYAWNGFSLSFGGLGAELFGTAIWNKGQWGPGGLGSFRCSVSGYCALGGITAGQEPCLGSISGGTPQAMH*
Ga0062593_10148718913300004114SoilMAILGCVANSTAQVQQSINFTLTIYDQSDTGVRTLRVSTKDVIQNLAGDNVPGGKLWLVMPNDPSVDGNGTIGAFLRVTDSHGNVIVDTTTDTFNIYQTSFSQTSTRTYAWNQFSLAFGGLGAELYGTATWSKSARGPGGQGSFHCSVSGHCGLGGITDGEQPCTGSIGGGAPRPSS*
Ga0062593_10204873513300004114SoilLTKALAIAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGGKLWLIMPNDPSPGANGNIGAFLRVKDSHGNVIAETDSSLFNIYQTTSSQTQTRTVAWNQFSLAFGGVGAELYGTATWTKSLRGPGGQGSFHCTVSGHCGIGGITDGERPCTGSISGGPPRPAN*
Ga0063455_10000957723300004153SoilMKNLSKVLATAGLAFLGYAATGSAQVQQSINFSLTVYNQTETGVRTLHVTTKDIIENLAGTNVPGGKLWLIMPNDPTPDGTGNIGAALRVTDSHGNIITETTSDSFNIYQTVSSQSGTRTYAFNQFSLSFGGVGAELYGTASWSRSSRSPGGQGSFHCTVGGHFGLGGVTDGEMPCSGSISAGAPTIAR*
Ga0063455_10137963113300004153SoilCIDSAFEGTVATPSRVKTERGNTAGNITRMSVMKRVMKTVVITLAGLMAALATSQAQVQQSISISLTLYNQTDTGIRTVRMNNRDVMQNLVGTNVPGGKLWLVMPSDPSPDGSGNIGAFLRVTDSHNNILAETDSSMFNIYQTFQSQTATRIVGFNQFSFDFGGFGAELYGTATWTKSAH
Ga0062590_10263789913300004157SoilMKMLTKAWAIAGVAVLGFAVNGKAQVQQSISFSLTMYSQTDTGVRPLRVSTKDVIANLAGTNVPGGKLWLVMPSNPSPDGNGNIGAFLRVTDSRGNIIADTTSDYFNIYQPVSSQTATHTYAWNQFSLSFGGLSAELYGTVGWSKSLKSQSGQG
Ga0063356_10030578323300004463Arabidopsis Thaliana RhizosphereMKMLLKAWALAGVVILGFATNSKAQVQQSISFSLTVYHQTDTGVRALRVSTKDVIANLAGTNVPGGKLWLVMPANPSQDENGNIGAFLRVTDSKGNIIVDTTSDYFNIYQPFSSQTSNHTYAWNQFSLAFGGLSAELYGTAGWSKTLKSQGGQGSFHCTVNGHCALAAISDDQGPCIGSISGGSPRPAD*
Ga0062595_10006829823300004479SoilMKKLLKTLAIAGLAILGCLANGTAQVRQSIGFTLTVYDQADSGVRALRVSSKDVIDHLAGTKVGGGKLWLVMPDNPGVDRNGTIGAVLQVTDSRGNVVAVTTTRTFNIYQNTAAQAASRTYAWNGFSIDFGGLEAELYGTVIWSKTSRGPGGLGTFHCSVNGFCALGGITAGQQPCIGSISGSKPKPAS*
Ga0062595_10017789513300004479SoilMAVLGCVANSTAQVQQSISFSLTVYDQSDTSVRTLRVSTKDVIENLAGSSVPGGKLWLVMPTDPGVDGNGTIGAFLRVTDSHGNVVVETTTDSFNIYQTAFSQTSTRTYAWNQFSLSFGGLGAELY
Ga0062595_10210330413300004479SoilRTMKKLTKALAIAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGGKLWLIMPNDPSPGANGNIGAFLRVKDSHGNVIAETDSSTFNIYQTTSSQTQTRTVAWNQFSLAFGGVGAELYGTATWTKSLRGPGGQGSFHCTVSGHCGIGGITDGERPCTGSIS
Ga0062592_10103512223300004480SoilMKKLLKALAIAGMAFLGWVASSTAQVQQSISFSLTVYDQSDTGVRALRLSNKDIIENLAGTNVAGGKLWLVMPTDPGVDGNGTIGAFLRVTDSRGNVVAETTTDSFNIYQNTASQTSTRTYAWNGFSLSFGGLGAELFGTAIWSKGVWGPGGLGSFR
Ga0062591_10020597713300004643SoilMAFLGCIANSTAQVQQSINFTLTIYDQSDTGIRTLRVSNKDIIQNLAGTNVPGGKLWLMMPPSPGVDGNGTIGAFLRVTDAHGNVIVDTTTDSFNIYQNTASQTSTRTYAWNGFSLSFGGLGAELFGTATWNKGQWGPGGLGSFRCSVSGYCALGGITAGQEPCLGSISGGTPQAMH*
Ga0062591_10094941713300004643SoilMAILGCVANSTAQVQQSINFTLTIYDQSDTGVRTLRVSTKDVIQNLAGDNVPGGKLWLVMPNDPSVDGNGTIGAFLRVTDSHGNVIVDTTTDTFNIYQTSFSQTSTRTYAWNQFSLAFGGLGAELYGTATWSKSARGPGGQGSFHCSVSGHCGLGGI
Ga0062591_10099955713300004643SoilISFSLTMYSQTDTGVRPLRVSTKDVIANLAGTNVPGGKLWLVMPSNPSPDGNGNIGAFLRVTDSRGNIIADTTSDYFNIYQPVSSQTATHTYAWNQFSLSFGGLSAELYGTVGWSKSLKSQSGQGSFHCTVNGHCALAAISDDQRPCIGSISGGSPRPAN*
Ga0058862_1097017013300004803Host-AssociatedMKKLSKVLAIAGMAVLGCVANSTAQVQQSISFSLTVYSQTDSGVRALRLGNKDIIQNLVGTNVPGGHLWLVMPSNPGVDGNGTIGAFLRVRDAHGNVLVETTTDSFNIYQNTASLTVNRTYAWNGFSLSFGGLGAELFGTAIWSKTFRNQGGLGAFHCSVNGYCALGGITEGQ
Ga0066688_1013807013300005178SoilAVLGFVANGKAQVQQSISFSLTVYNQTDTGVRAVRITTRDVIANLVGTNVPAGRLWLVMPTDPSPDENGNIGAFLRVTDLRGNIIVETTSDSFNIYQPSFSQTGTRIFAWNQFSLSFGGVGAELYGTAIWSKSPRGPGGQGAFHCSVSGPCGLGGITDGEMPCTGSISGGAPRPAQ*
Ga0070683_10049922933300005329Corn RhizosphereMKKLLKASVIAGVALLGFVANGTAQVQQSISLSLTVYNQTDTAVRAVRVSTRDVIANLAGTNVPGAKLWLVMPNDPSPGGNGNIGAFLRVTDSHGNVIVETTSDSFNIYQTVSSQTATRTYAWNQFSLAVGGLSAELYGTATWNKSLRGPGGLGSFHCSVSGHCAISGITNGDQPCIGSISGGAPKPSS*
Ga0070683_10128568713300005329Corn RhizosphereLATAGIAGMAVLGCVANSTAQVQQSISFSLTVYDQSDTSVRTLRVSTKDVIENLAGSSVPGGKLWLVMPNDPSVDGNGTIGAFLRVTDPQGNVVVETTTDSFNIYQTAFSQTSTRTYAWNQFSLSFGGLGAELYGTATWSKSARGPGGQGSFHCSVSGHCGLGGITDGEQPCSGSISGGAPKPAG*
Ga0070683_10174347613300005329Corn RhizosphereMKKLSKTLAIAGVAFLGFVANGKADVQQTISFSLTMYNQTDTGIRALRISTRDIIQNLAGTNVPGARLLLVMPDNPSPDGNGSIGAVLRVTDSRGNIIAETTSDSFNIYQTFSSQVGTRTVAWNQFSLSIGGVGAELYGTATWSKSSTGAGGQGSFRCTVSGHCGLGDATNGE
Ga0070690_10005756833300005330Switchgrass RhizosphereMKFFLKALVIVAFLGSVANSRAQVQQSISLSLTVYNQTDTSIRAVRVSTKDIIANLVGTNVSGGKLWLVMPSDPSPDGNGNIGASLRVTDSHGNVIVETTSDSFNIYQAFSSQTSTRTYAWNQFSLAFGGVGAELYGTVIWSKSLRVGGQGSFHCSVSGHCSLGGITDGDMPCIGSISGGAPKPSS*
Ga0066388_10444405813300005332Tropical Forest SoilMKKPLKALAITGVAFLGCVASATAQVQQSITFSLTVYDQSDTNVHTLRVSTKDVIENLAGTNMPGAKLWLVMPTDPSPDGNGTIGAFLRVTDSHGNIVVDTTTASFNIYQTSFSQTSTHTYAWNQFSLAFGGLGAELYGMATWSKNLRGPGGQGSFHCSVSGHCGIGGVTNSDAPCTGSISGGSPRPAG*
Ga0066388_10804115913300005332Tropical Forest SoilMSTMKRLLKAWAIAGMAVLSFVAQGKAQVQQSIGFSLTVFNQTDTGVRTLRVGTRDVIANLVGTNVPGGKLWLVMPTDPTPDGSGNIGAFLRVTDSKGNIIVETTSDYFNIYQPSFSQTATHTYAWNQFSLNFGGLSAELYGTATWSKGLHG
Ga0068868_10007124743300005338Miscanthus RhizosphereRLMRTMKKLTKALAIAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGGKLWLIMPNDPSPGANGNIGAFLRVKDSHGNVIAETDSSLFNIYQTTSSQTQTRTVAWNQFSLAFGGVGAELYGTATWTKSLRGPGGQGSFRCTVSGHCGIGGITDGERPCTGSISGGPPRPAN*
Ga0070689_10171369413300005340Switchgrass RhizosphereMKRLLKALAIAGVGFLGFVVNGKAQVQQSISFSLTLYNQTDTSVRAVRISTRDVIANLAGTNVPGGKLWLVMPNAPTPDENGNIGAFLRVTDSKGNIIVETTPDYFNIYQTFSSQNATRIYGWNQFSLSFGGLSAELYGTAAWSKSSRGPGGQGSFHCTVSGHCGLGGISD
Ga0073909_1005149513300005526Surface SoilMAFLGCIANSTAQVQQSINFTLTIYDQSDTGIRTLRVSNKDIIQNLAGTNVPGGKLWLMMPPSPGVDGNGTIGAFLRVTDAHGNVIVDTTTDSFNIYQNTASQTSTRTYAWNGFSLSFGGLGAELFGTAIWNKGQWGPGGLGSFRCSVSGYCALGGITAGQEPCLGSISGGTPQAMH
Ga0073909_1022189913300005526Surface SoilMNAARPRHLTLAAQPNGKLRDETMKKSLKSLAIAGIAVLASVASSTAQVQQSISFSLTIYDQSDTGVRALRVSNKDVIENLAGTSVPGGKLWLVMPSDPSPDGNGNIGAFLRVTDAQGNIIAETTSDSFNIYQTSFSQTSTRTYAWNQFSLDFGGLGSELYGTAIWTRSFRDPLGQGTFHCSVSGHFALGGVTDGSQPCSGSISGSAPKPAS*
Ga0073909_1062318813300005526Surface SoilVAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGAKLWLIMPNDPSPGANGNIGAFLRVKDSHGNVIAETDSSLFNIYQTTSSQTQTRTVAWNQFSLAFGGVGAELYGTATWTKSLRGPGGQGSFHCTVSGHCGIGGITDGERPCTGSISGGPPR
Ga0070684_10033790813300005535Corn RhizosphereMKKLLKASVIAGVALLGFVANGTAQVQQSISLSLTVYNQTDTAVRAVRVSTRDVIANLAGTNVPGAKLWLVMPNDPSPGGNGNIGAFLRVTDSHGNVIVETTSDSFNIYQTVSSQTATRTYAWNQFSLAVGGLSAELYGTATWNKSLRGPGGLGSFHC
Ga0066699_1050083813300005561SoilLHVWRRVTLDAQRRRQALRMSTMKKLLKALAIAGVAVLGFVANGKAQVQQSISFSLTVYNQTDTGVRAVRITTRDVIANLVGTNVPGGKLWLVMPADPSPDENGNIGAFLRVTDLRGNIIVETTSDSFNIYQPSFSQTGTRIFAWNQFSLSFGGVGAELYGTAIWSKSPRGPGGQGAFHCSVSGPCGLGGITDGEMPCTGSISGGAPRPAQ*
Ga0066708_1077947313300005576SoilVLHVWRWVTLDAQPRQALREKSIMKSLLKGLAIAGVAVLGFVANGKAQVQQSISFSLTLYNQTDTSVRAVRVTTRDVIVNLAGTNVRGGKLWLVMPTDPSPGGNGNLGAFLRVTDSRGNIIRDTTSASFNIYQTFSSQTDTRIFAWNQFSLSFGGLGAELYGTAIWSKSPRGPGGQGSFHCSVS
Ga0068857_10000851653300005577Corn RhizosphereMKKLSKVLAIAGMAVLGCVANSTAQVQQSISFSLTVYSQTDSGVRALRLGNKDIIQNLVGTNVPGGHLWLVMPSNPGVDGNGTIGAFLRVRDAHGNVLVETTTDSFNIYQNTASLTVNRTYAWNGFSLSFGGLGAELFGTAIWSKTFRNQGGLGAFHCSVNGYCALGGITEGQEPCVGSISGGAPKGG*
Ga0066706_1038991023300005598SoilMNTARLASETLQRTAAAARTARLSTMKKLLKALAIAGVAVLGFVANGKAQVQQSISFSLTLYNQTDTSVRAVRVSTRDVIVNLAGTNVRGGKLWLVMPTSPSPDTNGNIGAFLRVTDARGNIIVDSTSDSFNVYQTSSSQTGTRIYAWNQFSLSFGGLDAELYGTAIWSKSSRGPGGQGSFHCSVSGHSGVGGVTSADMPCIGSISGGAPKPAS*
Ga0068864_10219575913300005618Switchgrass RhizosphereIVAFLGSVANSRAQVQQSISLSLTVYNQTDTSIRAVRVSTKDIIANLVGTNVSGGKLWLVMPSDPSPDGNGNIGASLRVTDSHGNVIVETTSDSFNIYQAFSSQTSTRTYAWNQFSLAFGGVGAELYGTVIWSKSLRAGGQGSFHCSVSGHCSLGGITDGDMPCIGSISGGAPKPSS*
Ga0066903_10468469613300005764Tropical Forest SoilALKTMKMSLKVFAIAGIAVLASLASSSAQVQQSISFSLTIYDQSDTGVRALRVSNKDVIENLAGTNVPGGKLWLVMPTDPSPDGSGNIGAFLRVTDSHGNIIVETTSDLFNIYQNSFSQDSTHTFAWNQFSLDFGGLGSELYGTAIWTRSLRDPLGQGAFHCSVSGHLALGGVTDGSQPCSGSISGGAPKPAG*
Ga0066903_10828969113300005764Tropical Forest SoilAGIAFLGYAASGSAQVQQSITFSITVYDQTGSGVRTIRVGTKDLIENLAGNKVPGGKLWLVMPADPSVDGNGTIGAVLRVTDGRGNIIVDTTTDTFNIYQSSFSQTDTRTYAWNGFSIDFGGLDAELYGTAIWSKSFRHPQGQGSFHCSVSGHFTVGGITDGQQPCVGSISGSAPS
Ga0070715_1094956213300006163Corn, Switchgrass And Miscanthus RhizosphereMKKLLKPLVIAGMAFLGYVANSTAQVQQSISFSLTVYNQSDTGVRALRLSNKDIIQNLAGTNVAGGKLWLIMPTDPGVDGNGTIGAFLRVTDSRGNILAETTTDSFNIYQNTAAQTSTRTYAWNGFSLSFGGLGAELYGTAIWSKGVWGPGG
Ga0097621_10018863433300006237Miscanthus RhizosphereMKKLTKALAIAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGGKLWLIMPNDPSPGANGNIGAFLRVKDSHGNVIAETDSSLFNIYQTTSSQTQTRTVAWNQFSLAFGGVGAELYGTATWTKSLRGPGGQGSFHCTVSGHCGIGGITDGERPCTGSISGGPPRPAN*
Ga0066659_1078482613300006797SoilIKIKVCTFDSACFGRKAPYVPNPQPRAGCNALRETTMKNIFKALAIAGVAVLGFVANGTAQVQQSISFSLTLYDQTDRGIRPLRVTTRDVIANLAGTNVPGGRLWVVMPTDPSPDENGNIGAFLRVTDSRGNIIAETTSDTFNIYQPSFSQTATRTYGWNQFSLAFGGLGAELYGTATWTKSPRGPGGQGSFHCTVSGHCALGGVSNGEVPCTGSIIGGAPRPAS*
Ga0075425_10021670623300006854Populus RhizosphereMRTMKKLTKALAIAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGAKLWLIMPNDPSPGANGNIGAFLRVKDSHGNVIAETDSSLFNIYQTTSSQTQTRTVAWNQFSLAFGGVGAELYGTATWTKSLRGPGGQGSFHCTVSGHCGIGGITDGERPCTGSISAGPPRPAN*
Ga0079219_1133470113300006954Agricultural SoilTFSFTLYNQTDTGVRALHISNRDVIENLAGTNVPGGKLWLVMPNDPSPDGSGNIGAFLRLTDSKGNVIVETTSDTFNIYQTFSSQTGGRIYAWNQFSLAFGGLSAELYGLAIWSKSPRGPGGQGSFHCTVSGHSAIGGITNGEMPCIGSVSGGVPRPAS*
Ga0099794_1005567023300007265Vadose Zone SoilMRTMKKLLKALAIAGLAFLGCVANGTAQVKQSINFSLTVYDQSDTGVRTLRVSTKDVIENLAGINVPGGKLWLVMPTDPGVDGNGTIGAFLRVTDSHGNVVVDTTTDSFNIYQNTSSQAGTRTYAWNGFSLSFGGLGAELYGTAIWSKSFRSPGGLGTFHCSVSGYCALGGITDGQEPCIGSISGGAPSPAH*
Ga0066710_10294687713300009012Grasslands SoilLAIAGVAVLGFVANGKGQVQQSISFSLTMYNQTDTGVRALRVSTRDVIANLVGTNVPGGRLLLVMPTDPSPDGNGNIGAFLRVTDSRGNIIVETDSASFNIYQTFSSQTATGIYAWNQFSLSFGGLGAELFGTATWSKSSQGPGGQGSFHCSVSGHCALGGITDGEQPCTGSISGGAPKPAS
Ga0105245_1067938213300009098Miscanthus RhizosphereMKKGLKTLAIAGVAILGFVANGKAQVQQSINFSLTLYNQTDTGIRPLRISTRDVIANLVGTNVPGGRLWLVMPTDPSPDENGNIGAFLRVTDSRGNIMAETTSDYFNIYQTFFSQSGARLVAWNQFSLSFGGLSAELYGTAIWSKSLRGPGGQGSFHCSVSGHSGLGGVSDGDVPCTGVISGGAPRPAS*
Ga0105247_1110194623300009101Switchgrass RhizosphereMKKLLKPLAIAGIAGMAVLGCVANSTAQVQQSISFSLTVYDQSDTSVRTLRVSTKDVIENLAGSSVPGGKLWLVMPSDPSVDGNGTIGAFLRVTDSHGNVVVETTTDSFNIYQTAFSQTSTRTYAWNQFSLSFGGLGAELYG
Ga0066709_10040289213300009137Grasslands SoilAGVAVLGFVANGKAQVQQSVSFSLTVYNQTDTGVRAVRITTRDVIANLVGTNVPAGRLWLVMPTDPSPDENGNIGAFLRVTDSRGNIIAETTSDPFNIYQPSFSQTATRTYGWNQFSLAFGGLSAELYGMAIWSKSLRGPGGQGSFHCSVSGHCALGGITDGERPCIGSISGGAPSPAN*
Ga0099792_1039929413300009143Vadose Zone SoilMLDAQQQRQALRMSTMKMLLKASAIAGVAVLGFVANGKAQVQQSISFTLTLFDQTDRGIRPLRVTTRDVIANLAGTNVPGGRLWVVMPTDPSPDGSGNIGAFLRVTDSRGNIIAETTSDTFNIYQPSFSQTATRTYGWNQFSLAFGGLSAELYGMAIWSKSLRGPGGQGSFHCSVSGHCALSGITDGDRPCIGSISGGA
Ga0105237_1238355513300009545Corn RhizosphereLTVYDQADSGVRALRVSSKDVIDHLAGTKVGGGKLWLVMPDNPGVDRNGTIGAVLQVTDSRGNVVAVTTTRTFNIYQNTAAQAASRTYAWNGFSLDFGGLEAELYGTVIWSKTSRGPGGLGTFHCSVNGFCALGGTTAGQQPCIGSISGDAPKPAD*
Ga0105249_1299260213300009553Switchgrass RhizosphereVIVAFLGSVANSRAQVQQSISLSLTVYNQTDTSIRAVRVSTKDIIANLVGTNVSGGKLWLVMPSDPSPDGNGNIGASLRVTDSHGNVIVETTSDSFNIYQAFSSQTSTRTYAWNQFSLAFGGVGAELYGTVIWSKSLRAGGQGSFHCSVSGHCSLGGITDGDMPCIGSISGGAPKPSS*
Ga0105249_1303016713300009553Switchgrass RhizosphereTPTHSHDGKHLREINTMKRVLKTLAIAGVAVLGFVANSKAQVQQSINFSLTMYNQNDTGVRALRISTRDVIANLAGTNVPGGKLWLVMPSDPSPDVNGNIGAFLRVTDSKGNIIVETTSDYFNIYQTFSSQNTTRIYAWNQFSLSFGGLSAELYGTVAWSNRSRGGQGSFHCTVSGHCG
Ga0126374_1018750413300009792Tropical Forest SoilMKTMKSLLKPLAIAGIAVLACVANSTAQVQQSISFSLTIFDQSDTGVRTLRVSTKDVIENLAGTTVPGGKLWLVMPTDPSLDGNGTIGAFLRVTDSQGNIVADTTSDTFNIYQTSFSQTGTRTYAWNQFSLAFGGLGAELYGTATWNKNFKNPGGQGSFHCSISGHCALGGITAGEDPCTGSISGTAPRPAS*
Ga0126319_125876113300010147SoilIMKKLLNVLALAGMAVLGFVENGNAQVQQSISFSLKLFDQTDTGVRMLRLTTKDVIENLAGTNVTGGKLWVVMPNTPTPDGNGNIGAFLRVTDSHGNIIVESTSDTFNIYQTVSSQAGNRIYAWNQFSLAFGGIGAELYGTATWTKSARGPGGQGSFHVAVSGHCGVGGTSNGEVPCTGTIVGGAPKVAN*
Ga0099796_1036966513300010159Vadose Zone SoilMKKSLKVLAIAGATVLGCVANSTAQVQQSISFNLTVYNQTDTSVRPLRVSTKDVIENLAGAPVPGGKLWLVMPTDPGVDGNGTIGAVLRVTDSRGNIVQETTTDSFNIYQNTSSETAKSTYAWNGFSLSFGGRGAELYGTAIWSKSFRSPGGL
Ga0126372_1165464113300010360Tropical Forest SoilMKKLLKPLAVAGIAVLACAANSTAQVQQSISFSLTIFDQSDTGVRTLRVSTKDVIENLAGTNMPGAKLWLVMPTDPSPDGNGTIGAFLRVTDSHGNIVVDTTTASFNIYQTSFSQTSTHTYAWNQFSLAFGGLGAELYGMATWSKNLRGPGGQGSFHCSVSGHCALGGITTGEDPCTGSISGGAPKPAG*
Ga0126377_1149403023300010362Tropical Forest SoilAQVQQSINFSLTIYDQSDTGVRAVRVSTRDVIQNLAGTTVPGGKLWLVMPTDPGVDGNGTIGAFLRVTDSQGNIIAETTTDSFNIYQTSFSQSSTRTYAWNQFSVAFGGLGAELYGTATWSKSLRGPGGQGSFHCSVSGHCALGGITNGEQPCIGSISGGATKPAQ*
Ga0134124_1299888413300010397Terrestrial SoilMAFLGCVANSAAQVQQSISFSLTVYDQSDTGVRALRLSNKDIIQNLAGTNVPGGKLSLVMPTDPGVDGNGTIGAFLRVTDSRGNVVAETTTDSFNIYQNTASQTSTRTYAWNGFSLSFGGLGAELFGTAIWSKGVWGPGGLGSFRTSVSGYCALGGITDGQEPCVGSISGS
Ga0134124_1321381213300010397Terrestrial SoilTLDAQLSRTARPARLMRTMKKLTKALAIAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGGKLWLIMPNDPSPGANGNIGAFLRVKDSHGNVIAETDSSTFNIYQTTSSQTQTRTVAWNQFSLAFGGVGAELYGTATWTKSLR
Ga0134127_1081720523300010399Terrestrial SoilMKKLTKALAIAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGGKLWLIMPNDPSPGANGNIGAFLRVKDSHGNVIAETDSSTFNIYQTTSSQTQTRTVAWNQFSLAFGGVGAELYGTATWTKSLRGPGGQGSFRCTVSG
Ga0134121_1087712523300010401Terrestrial SoilMKKLTKALAIAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGGKLWLIMPNDPSPGANGNIGAFLRVKDSHGNVIAETDSSLFNIYQTTSSQTQTRTVAWNQFSLAFGGVGAELYGTATWTKSLRGPGGQGSFRCTVSGHCGIGGITDGERPCTGSISGGPPRPAN*
Ga0134121_1132091513300010401Terrestrial SoilMKMLLKALAIAGVTVLGFVANGKAQVQQSISFSLTLYDQTDRGIRTLRVTTRDLIANLVGTNVPGGRLLVVMPTDPSPDESGNIGAFLRVTDSRGNIIVETTSDTFNIYQPSFSQTATRTYGWNQFSLAFGGLSAELYGMAIWSKSPQGPGGQGTFHCSVSGHCALSGITDGDRPCIGSISGGAPRPAN*
Ga0134123_1272334713300010403Terrestrial SoilMSCVSRQVTFDAYRDDKHCENVNMKKLLKPLALAGVVFLGFVANCTAQVQQSISFTLTVYNQTDTSVRAVRVTTRDVIANLAGTNVPGGRLLLVMPTDPSPDENGNIGAFLRVTDSRGNIIAETTSDSFNIYQLSSSQTGTRTYAWNQFSLAVGGLSAELYGTAIWSKKF
Ga0105246_1107863823300011119Miscanthus RhizosphereMKKLTKALAIAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGAKLWLIMPNDPSPGANGNIGAFLRVTDSRGNIIVETTPDYFNIYQTFSSQNATRIYGWNQFSLSFGGLSAELYGTAAWSKSSRGPGGQGSFHCTVSGHCGLGGISDGEVPCTGSISGGAPRPAS*
Ga0105246_1188835213300011119Miscanthus RhizosphereKTMKFFLKALVIVAFLGSVANSRAQVQQSISLSLTVYNQTDTSIRAVRVSTKDIIANLVCTNVSGGKLWLVMPSDPSPDGNGNIGASLRVTDSHGNVIVETTSDSFNIYQAFSSQTSTRTYAWNQFSLAFGGVGAELYGTVIWSKSLRAGGQGSFHCSISGHCSLGGITDGDMPCIGSISGGAPKPSS*
Ga0150983_1116595613300011120Forest SoilMLHVWRRSTRSQDGNRCGMKTMKKSLKALAIAGIAVLGCVASSTAQVQQSISFSLTVYDQSDTDVHPLRVSNKDVIENLAGTTVPGGKLWLVMPTDPSPDGNGNIGAFLRVTDAHGNIIVETTTDSFNIYQTAFSQTSNLTYAWNQFSLAFGGLGAELYGTAI
Ga0150983_1318798513300011120Forest SoilLEGVRTANCEKKTMKKLSKVLAIAGMAFLGYAAIGTAQVQQSISFSFTVYNQTATGIRALRVSTKDVIENLAGAKVPGGKLWLVMPTDPSVDGNGTLGAVLRVTDAHGNTIAETTTDTFNLYQSSFSQTATRTYAWNGFSIDFGGLDAELYGTATWSKSSRSPGGQGSFHCSVSGHFTLGGVTDGEQPGAGSITGGAPRSSD*
Ga0137382_1103358913300012200Vadose Zone SoilMKKSLKVLAIAGATVLGCVANSTAQVQQSISFSLTVYNQTDTSVRPLRVSTKDVIENLAGAPVPGGKLWLVMPTDPGVDGNGTIGAFLRVTDSRGQIVQETTTDSFNIYQNTSSETAKSTYAWNGFSLSFGGLGAELFGTTIWSKG
Ga0137382_1124569713300012200Vadose Zone SoilMKTIKRLSKALAVTGMAILGFAANSTAQVQQTISFSFTVYDQTDTGVRVLRLGNKDVMESLAGTNVPGGHLWLVMPIKPGVDRNGAIGAFLRVRDAHGNVIAQTTTHSFNIYQNTASLTGNRTYAWNGFSLSFGGLGAELYGTAIWSKSFRS
Ga0137365_1092985513300012201Vadose Zone SoilEENSNESALMPQVWRRVTLDAQRRRQALRMSIMKRSLKALAIAGVAVLGFVANGKAQVQQSISFSLTLYDQTDRGIRPLLVTTRDAIGNLAGTHVPGGRLWVVMPTDPSPDGNGNIGAFLRVTDSRGNIIAETTSDPFNIYQPSFSQTATRTYGWNQFSLAFGGLSAELYGMAIWSKSLRGPGGQGSFHCSVSGHCALGGITDGERPCIG
Ga0137363_1009726323300012202Vadose Zone SoilMKKLLKALAIAGLAFLGCVANGTAQVKQSINFSLTVYDQSDTGVRTLRVSTKDVIENLAGTNVPGGKLWLVMPTDPGVDGNGTIGAFLRVTDSHGNVVVDTTTDSFNIYQNTSSQAGTRTYAWNGFSLSFGGLGAELYGTA
Ga0137362_1071553413300012205Vadose Zone SoilMSIMKRLLKGLAIAGIAFLGCLANGTAQVQQSINFGITVYDQSDTSVRTLRVSTKDVIENLAGSNVPGGKLWLVMPTDPGVDGNGTIGAFLRVTDSHGNVVVDTTTDSFNIYQTSFSQTSTRTYAWNQFSLSFGGLGAELYGTAT
Ga0137362_1172706913300012205Vadose Zone SoilRQALRMSTMKMLLKASAIAGVAVLGFVANGKAQVQQSISFSLTLFDQTDRGIRPLRVTTRDVIANLAGTNVPGGRLWVVMPTDPSPDGSGNIGAFLRVTDSRGNIIAETTSDTFNIYQPSFSQTATRTYGWNQFSLAFGGLSAELYGMAIWSKSLRGPGGQGSFHCSVSG
Ga0137376_1067577313300012208Vadose Zone SoilMKMLLKASAIAGVAVLGFVANGKAQVQQSISFSLTLFDQTDRGIRPLRVTTRDVIANLAGTNVPGGRLWVVMPTDPSPDGSGNIGAFLRVTDSRGNIIAETTSDTFNIYQPSFSQTATRTYGWNQFSLAFGGLSAELYGMAIWSKSLRGPGGQGSFHCSVSGHCALSGITDGDRPCIGSISGGAPRPAN*
Ga0137377_1160813913300012211Vadose Zone SoilMLHVWHRVTLDAQRRRQALRMSTMKKLLKALAIAGVAVLGFVANGKAQVQQSISFSLTLYDQTDTGTRALHVSTRDVIENLAGTNVPSGKLWLVMPTDPSPAGDGNIGAFLRVTDARGNIIVDTTSASFNIYQTSSSQTATRIYAWNQFSLSFGGLGAELYGTA
Ga0150985_10376271613300012212Avena Fatua RhizosphereVAFLGCVANCKAQVQQSITLSLTVYNQTDTSIRPLRVSNRDIILNLVGTNVPGGKLWLVMPSDPSPGGNGNIGAFLRVTDLHGNVIVETTSDSFNIYQLFTSQTATRTYAWDQFSLAFGGVSAEVYGTTLWSKSVHGPGGLGSFRCAVSGHCSLSGITNDDR
Ga0150985_10729182613300012212Avena Fatua RhizosphereMKKLSKTLAIAGVAFLGFVANGRADVQQTISFSLTMYNQTDTGIRTLRISTRDIIQNLAGTNVPGARLLLVMPDNPSPDGNGSIGAVLRVTDSRGNIIAETTSDSFNIYQTFSSQVGTRTVAWNQFSLSIGSVGAELYGTATWSKSSTAAGGQGSFRCTVSGHCGLGDATNGEVPCTGTISGGATKPTG*
Ga0150985_10812193513300012212Avena Fatua RhizosphereKEKIMKIGMRSWIAGLMMFLGTMGTGWSQVQQSLSISLTVYNETNDTIRAVRVTTRDIIRNFVGTNVPGGKLWLVMPSDPSPDGSGNIGAFLRVTDSHNNILAETDSSMFNIYQTFQSQTATRIVGFNQFSFDFGGFGAELYGTATWTKSAHGAGGQGSFHCSVSGRAGLGGVTNGEVPCTGSISGSSPRPAQ*
Ga0150985_10883708713300012212Avena Fatua RhizosphereMNNMKRLLNALTIAGVAVLGFVASGQAQVHQSISFNLTLRDQTDTGVRLLHISNRDVIENLVGTNVPGGKLWLIMPTDPSPGANGNIGAFLRVMDAKGNVLAETDSSTFNIYQTFSSQTPTGNRIYAFNQFSLAFGGLGAELYGTVTWTKNPLGPGGQGSFVCSVSGHCGLGGVSNGEVPCTGSISGGPPKPAQ*
Ga0150985_11579499513300012212Avena Fatua RhizosphereMSTMKKLSKALTIAGIAVLGYVANGTAQVQQSITFSLTVYNQTNNSVRAVHVTTRQVIENLAGTNVPVGKLWLVMPNDPSPGGNGNIGAFLRVTDAHGDVVAETDTSTFNIYQTASSQTANRTYAWNQFSLSFGGLGAELYGTAIWTKSPAGPGGQGSFHCSVSGHCALGGITDGEQPCTGSISGGAPRPAS*
Ga0150985_11765948913300012212Avena Fatua RhizosphereMKNLSKVLATAGLAFLGYAATGSAQVQQSINFSLTVYNQTETGVRTLHVTTKDIIENLAGTNVPGGKLWLIMPNDPTPDGTGNIGAALRVTDSHGNIITETTSDFFNIYQTVSSQSGTRTYAFNQFSLSFGGVGAELYGTATWSRSSRSPGGQGSFHCSVGGQFGLGGITDGEMPCSGTISASAPTIAR*
Ga0137386_1048237313300012351Vadose Zone SoilMKKLLKPLAIAGMAFLGCVANSTAQVQQSINFSLTIYDQSDTGVRSLRLSNKDVIQNLAGTNVPGGKLWLVMPADPGVDGNGTIGAFLRVTDSQGNVVVETTTDSFNIYQTAFSQTSTRTFAWNQFSLAFGGLGAELYGTATWSKSLRGPGGQGSFHCSVSGHCGLGGITDGEQPCAGSISGGAPKPSS*
Ga0137367_1035742613300012353Vadose Zone SoilIEASGNGKYCENKNMKNLSIALAIAGMTVLGSVVNGAAQVQQSINFSLTIYDQTDAGVRALRVGTKDVIENLAGTSVPGGHLWLVMPTDPGLDGNDTIGAFLRVTDAVGNILAETNPDTFNIYQSVYSQNATRTYAWNGFSLSFGGLGAELHGLATWNKGPRGPGGQGSFHCSVNGYCALGGITDGQRPCIGSISGGAPKPAS*
Ga0137385_1068866213300012359Vadose Zone SoilNGRADVQQTISFTLTMYNQTDTGVRALRISTRDIIENLAGTNVPGGRLLLVMPDNPSPDGNGSIGAVLRVTDSRGNIIAETTSDSFNIYQTFSSQAGTRTIAWNQFSLSIGSVGAELYGTATWSKSSTAAGGQGSFRCTVSGHCGLGDATNGEVPCTGTISGGATKPTG*
Ga0137360_1005155823300012361Vadose Zone SoilMDAARFASGDARRTTGMAITARMKTMKKSLKGLAIAGIAFLGCLANGTAQVQQSINFGITVYDQSDTSVRTLRVSTKDVIENLAGYNVPGGKLWLVMPTDPGVDGNGTIGAFLRVTDSHGNVVVDTTTDSFNIYQTSFSQTSTRTYAWNQFSLSFGGLGAELYGTATWSKSFRSPGGQGSFHCSVSGHCALGGITDGEQPCTGSISGGAPKPAN*
Ga0137360_1148830213300012361Vadose Zone SoilMKMLLKASAIAGVAVLGFVANGKAQVQQSISFSLTLFDQTDRGIRPLRVTTRDVIANLAGTNVPGGRLWVVMPADPSPDGSGNIGAFLRVTDSRGNIMAETTSDTFNIYQPSFSQTATRTYGWNQFSLAFGGLSAELYGMA
Ga0137360_1187315213300012361Vadose Zone SoilMKKSLKVLAIAKATVLGCVTNCTAPVQQSISFSLTVYNQTDTSVRPLRVSTKDVIENLAGAPVPGGKLWLVMFIDFGVDGNGTIGAVLRVTDSRGNIVQETTSDSFNIYQNTSSETAKSTYAWNGFSLSFGG
Ga0150984_10166463123300012469Avena Fatua RhizosphereMSSDECCTSGVKRRSTQNQDANDCENKNMKKLSKSLAIAGMAFLGCVANSTADVQQSISFTLTVYSQTDNGVRAVRVSNKDLIENLAGANVPGGKLWLVMPPDPGVDGNGTIGAFLRVTDARGNVVADTTTDSFNIYQNTSSQAGTRTYAWNGFSLSFGGLGAELFGTAIWSKTLWGPGGLGTFRCSVSGFCALTAITDGQKPCLGSISAGTPKPVN*
Ga0150984_10277217023300012469Avena Fatua RhizosphereMKKLFKAMAIAGTAFLGLVTSGTAQVQQSISFSLTIYNQSDTGVHALRVSTKDVIENLAGSPVPGGRLWLIMPNDPGVDGNGTIGAFLRVTDAHGNILAETDTGTFNIYQTAYSQTSTRTYAWNQFSIDFGGLGAELYGMATWSKNPRGPGGQGSFHSTVAGHCALSGSTDGEQPCSGSINGGAPAPVH*
Ga0150984_10342838613300012469Avena Fatua RhizosphereMSTMKNILKALTIAGVAVLGYVANSSAQVQQSISFSLTVYNQTNSSVRALHVTTRDVIENLAGTNVPGGKLWLIMPNDPTPGGNGNIGAFLRVTDAHGNVIAETDTSTFNIYQTASSQTANRTYAWNQFSLSFGGLGAELYGTATWTKNPAGPGGQGSFHCTVSGHCALGGITDGEQPCTGSIAGGAPKPAG*
Ga0150984_10632395913300012469Avena Fatua RhizospherePRRQVLRKMSTMKKLSKTLAIAGVAFLGFVANGRADVQQTISFSLTMYNQTDTGIRTLRISTRDIIQNLAGTNVPGARLLLVMPDNPSPDGNGSIGAVLRVTDSRGNIIAETTSDSFNIYQTFSSQVGTRTVAWNQFSLSIGSVGAELYGTATWSKSSTAAGGQGSFRCTVSGHCGLGDATNGEV
Ga0150984_11103363713300012469Avena Fatua RhizosphereMKKIIKALAIAGVAFLGGVANCKAQVQQSITLSLTVYNQTDTSIRPLRVSTRDIILNLVGTNVPGGKLWLVMPSDPSPGGNGNIGAFLRVTDLHGNVIVETTSDSFNIYQLFTSQTATRTYAWDQFSLAFGGVSAEVYGTTLWSKSVHGPGGLGSFRCAVSGHCSLSGITNGDRPCIGSLSGGAPKPSS*
Ga0150984_11550466113300012469Avena Fatua RhizosphereLSKVLATAGLAFLGYAATGSAQVQQSINFSLTVYNQTETGVRTLHVTTKDIIENLAGTNVPGGKLWLIMPNDPTPDGTGNIGAALRVTDSHGNIITETTSDSFNIYQTVSSQSGTRTYAFNQFSLSFGGVGAELYGTASWSRSSRSPGGQGSFHCTVGGHFGLGGVTDGEMPCSGSISA
Ga0150984_11722946223300012469Avena Fatua RhizosphereMNNMKRLLNALTIAGVAVLGFVASGKAQVQQSISFNLTLRDQTDTGVRLLHISNRDVIENLVGTNVPGGKLWLIMPTDPSPGANGNIGAFLRVMDAKGNVLAETDSSTFNIYQTFSSQTPTGNRIYAFNQFSLAFGGLGAELYGTVTWTKNPLGPGGQGSFVCSVSGHCGLGGVSNGEVPCTGSISGGPPKPAQ*
Ga0150984_11982543613300012469Avena Fatua RhizosphereMKTVVITLAGLMAALATSQAQVQQSISISLTLYNQTDTGIRTVRMNNRDVMQNLVGTNVPGGKLWLVMPSDPSPDGSGNIGAFLRVTDSHNNILAETDSSMFNIYQTFQSQTATRIVGFNQFSFDFGGFGAELYGTATWTKSAHGAGGQGSFHCSVSGRAGLGGVTNGEVPCTG
Ga0150984_12038771413300012469Avena Fatua RhizosphereLSKVLATAGLAFLGYAATGSAQVQQSINFSLTVYNQTETGVRTLHVTTKDIIENLAGTNVPGGKLWLIMPNDPTPDGTGNIGAALRVTDSHGNIITETTSDSFNIYQTVSSQSGTRTYAFNQFSLSFGGVGAELYGTATWSRSSRSPGGLGSFHCTVGGHFGLGGVTDGEMPCSGSISAGAPTIAR*
Ga0137398_1036790213300012683Vadose Zone SoilMKMLLKASAIAGVAVLGFVANGKAQVQQSISFSLTLYDQTDTGVRALRVSTRDVIENLVGTNVPSGKLWLVMPTDPNPDGNGNIGAFLRVTDSRGNIIVDTTSASFNIYQTSSSQTGTRTYAWNQFSLSFGGIGAELFGTATWSKGSRGPGGQGSFHCSVS
Ga0137398_1053101913300012683Vadose Zone SoilMKTIKRLSKALAVTGMAILGFAANSTAQVQQTISFSFTVYDQTDNGVRVLRLGNKDVMESLAGTNVPGGHLWLVMPIKPGVDRNGAIGAFLRVRDAHGNVIAQTTTHSFNIYQNTASLTGNRTYAWNGFSLSFGGLGAELYGTAIWSKSFRS
Ga0137397_1001623813300012685Vadose Zone SoilVTGMAILGFAANSTAQVQQTISFSFTVYDQTDTGVRVLRLGNKDVMESLAGTNVPGGHLWLVMPVTPRVDRNGTIGAFLRVRDAHGNVIAQTTTHSFNIYQNTASLTGNRTYAWNGFSLSFGGLGAELYGTAIWSKSFRSPGGLGTFHSSVSGYCALGGITEGQRPCSGSISGGAPGTAH
Ga0137397_1001623823300012685Vadose Zone SoilMKKSLKVLAIAGATVLGCVANCTAQVQQSISFSLTVYNQTDTSVRPLRVSTKDVIENLAGAPVPGGKLWLVMPTDPGVDGNGTIGAFLRVTDSRGQIVQETTTDSFNIYQNTSSETAKSTYAWNGFSLSFGGLGAELFGTTIWSKGPRNAGGLGTFHCSVSGYCALSGITDGQRPCSGSISGSTPKPAS*
Ga0137395_1034924123300012917Vadose Zone SoilMKTMKKSLKGLAIAGIAFLGCLANGTAQVQQSINFGITVYDQSDTSVRTLRVSTKDVIENLAGSNVPGGKLWLVMPIDPGVDGNGTIGAFLRVTDSHGNVVVDTTTDSFNIYQTSFSQTSTRTYAWNQFSLSFGGLGAELYGTATWSKSFRSPGGQGSFHCSVSGHCALGGITDGEQPCTGSISGGAPKPAN*
Ga0137394_1009413723300012922Vadose Zone SoilMKMSLKALAIAGVAVLGFVANAKGQVQQSISFSLTLYNQTDTGVRALRLSTRNVIENLVGTNVPGGRLWLVMPTEPTPDGSGNISAFLRVTDSRGNVITETTSDSFNIYQTFSSQTGTRVYAWNQFSLAFGGLSAELYGTVAWSKNPRGPGGQGSFHCSVSGHCGLGGVSNGDMPCTGSISGGAPRPAS*
Ga0137394_1012080423300012922Vadose Zone SoilMKDLLKPLAIAGVAFLGCVANGTAQVQQSITFSLTVYDQSDTVVRTLRVSTKDVIENLAGTNVPGGKLWLVMPTDPSPDGNGTIGAFLRVTDSHGNIIVDTTTDTFNIYQTAFSQTSARTYAWNQFSLAFGGLGAELYGTATWSKSSRSPGGQGSFHCSVSGHCALGGVTSGEDPCTGSISGGAPKPAS*
Ga0137394_1023970423300012922Vadose Zone SoilMKKSLKVLAVAGATVLGCVANCTAQVQQSISFSLTVYNQTDTSVRPLRVSTKDVIENLAGAPVPGGKLWLVMPTDPGVDGNGTIGAVLRVTDSRGNIVQETTTDSFNIYQNTSSETAKSTYAWNGFSLSFGGLGAELFGTTIWSKGPRSAGGLGAFHCSVSGYCALSGITDGQRPCSGSISGSTPKPAS*
Ga0137394_1042684223300012922Vadose Zone SoilMRASRIVEENSNEPARVLHVWRRVTLDAQQPRQALRMSTMKSLLKALAIAGVAVLGLVANGQAQVQQSISFSLTLRDQTDTGVRVLRVTTRDVIANLVGTNVPGGRLWLVMPDDPNPDASGNIHAFLRVTDSRGNIILDTTSDTFNIYQTFSSQAGTRIFAWNQFSLSFGGLGAELYGTVTWTRSPLGPGGQGSFHCTVSGHCGLGGISNGEVPCTGSIIGGAPRRAS*
Ga0137419_1057230513300012925Vadose Zone SoilMKMLLKASAIAGVAVLGFVANGKAQVQQSISFSLTLFDQTDRGIRPLRVTTRDVIANLAGTNVPGGRLWVVMPTDPSPDGSGNIGAFLRVTDSRGNIIAETTSDTFNIYQPSFSQTATRTYGWNQFSLAFGGLSAELYGMAPWSKSSRGPGGQGSFHCTVSGHCGLG
Ga0137404_1072799113300012929Vadose Zone SoilMKRLLKALAIAGVAVLGFVANGKAQVQQSISFSLTLYDQTDTGVRALRVTTRDVIENLAGTNVPGGRLWLVMPTDPSPAGNGNIGAFLRVTDSRGNIILDTTSDTFNIYQTFSSQAGTRIFAWNQFSLSFGGLGAELYGTVTWTRSPLGPGGQGSFHCTVSGHCGLGGISNG
Ga0137404_1095928823300012929Vadose Zone SoilMTLNGQQRWQALRMRNMKKLSNTLAIAGVAILGFAANSKAQVQQSISFSLTLYNQTDTGVRALRVSTKDVIDNLAGTNVPGGHLWLIMPTDPGVDGNGTLGAFLRVTDASGNVIVDSTTDSFNLYQAAYSQAAGRTYAWNGFSLDFG
Ga0137407_1148094123300012930Vadose Zone SoilSISFSLTLYDQTDTSVRAMRVSTKDVIENLAGTNVPGGKLWLVMPNDPSPDSNGNIAAFLRVTDSKGNIIVDTTTDSFNIYQMFSSQTDTRVYAWNQFSLSFGGIGAELFGTATWSKSPHGPGGQGSFHCSVSGHCALSGITDGDRPCIGSISGGAPRPAN*
Ga0137410_1063875123300012944Vadose Zone SoilKAQVQQSISFSLTLYNQTDTGVRALRLSTRNVIENLVGTNVPGGRLWLVMPTGPTPDGSGNISAFLRVTDSRGNVITETTSDSFNIYQTFSSQTGTRVYAWNQFSLAFGGLSAELYGTVAWSKNPRGPGGQGSFHCSVSGHCGLGGVSNGDMPCTGSISGGAPRPAS*
Ga0164301_1122889613300012960SoilMKNLSKVLATAGLAFLGYAATGSAQVQQSINFSLTVYNQTETGVRTLHVTTKDIIENLAGTNVPGGKLWLIMPNDPTPDGTGNIGAALRVTDSHGNIITETTSDSFNIYQTFSSQAGTRTVTWNQFSISFGGLGAELYGTATWSKSSVGVGG
Ga0164305_1173508913300012989SoilTAQVQQSINFTLTIYDQSDTGVRTLRVSTKDVIQNLAGDNVPGGKLWLVMPNDPSVDGNGTIGAFLRVTDSHGNVIVDTTTDTFNIYQTSFSQTSTRTYAWNQFSLAFGGLGAELYGTATWSKSARGPGGQGSFHCSVSGHCGLGGITDGEQPCTGSIGGGAPRPSS*
Ga0157378_1043001713300013297Miscanthus RhizosphereMKKLLKPLALAGVVFLGFVANCTAQVQQSISFTLTVYNQTDTSVRAVRVTTRDVIANLAGTNVPGGRLLLVMPTDPSPDENGNIGAFLRVTDSRGNIIAETTSDSFNIYQLSSSQTGTRTYAWNQFSLAVGGLSAELYGTAIWSKKFSGPGGLGSFHCSVNGHCSIGGITDTDRPCIGSIGGGAPKPSS*
Ga0157378_1164168113300013297Miscanthus RhizosphereMKKGLKALAIAGVAILGFVANGKAQVQQSINFSLTMYNQNDTGVRALRISTRDVIANLAGTNVPGGKLWLVMPSDPSPDVNGNIGAFLRVTDSKGNIIVETTSDYFNIYQTFSSQNTTRIYAWNQFSLSFGGLSAELYGTVAWSNRSRVGQGSFHCTVSGHCGLGGISNGDVPCTGSISGGAPKPAS*
Ga0157372_1151580123300013307Corn RhizosphereMKKLLKASVIAGVALLGFVANGTAQVQQSISLSLTVYNQTDTAVRAVRVSTRDVIANLAGTNVPGAKLWLVMPNDPSPGGNGNIGAFLRVTDSHGNVIVETTSDSFNIYQTVSSQTATRTYAWNQFSLAVGGLSAELYGTATWNKSLRGPGGLGSFHCS
Ga0157380_1115857413300014326Switchgrass RhizosphereVILGFATNSKAQVQQSISFSLTVYHQTDTGVRALRVSTKDVIANLAGTNVPGGKLWLVMPANPSQDENGNIGAFLRVTDSKGNIIVDTTSDYFNIYQPFSSQTSNHTYAWNQFSLAFGGLSAELYGTAGWSKTLKSQGGQGS
Ga0157376_1036139413300014969Miscanthus RhizosphereTLTVYDQADSGVRSLRVSSKDVIEHLAGTKVPGAKLWLVMPDDPGVDRNGTIGAVLEVTDSRGNVVAKTTTRSFNIYQNTAAQTDSRTYAWNGFSLDFGGLEAELYGTAIWSKSSRGPGGLGTFHCSVNGFCALGGITAGQQPCIGSISGGAPKPAS*
Ga0137418_1017614623300015241Vadose Zone SoilMKRLLKALAIAGVAVLGFVANGKAQVQQSISFSLTLFDQTDRGIRPLRVTTRDVIANLAGTNVPGGRLWVVMPTDPSPDGSGNIGAFLRVTDSRGNIIVDTTSASFNIYQTSSSQTGTRTYAWNQFSLSFGGIGAELFGTATWSKGSRGPGGQGSFHCSVSGHCALGGISNGEVPCTGSIIGGAPRPAS*
Ga0132258_1000816923300015371Arabidopsis RhizosphereMKKLSKTLAIAGMAFLGCLANGIAQVQQSIGFTLTVYDQADSGVRSLRVSSKDVIEHLAGTKVPGAKLWLVMPDDPGVDRNGTIGAVLEVTDSRGNVVAKTTTRSFNIYQNTAAQTDSRTYAWNGFSLDFGGLEAELYGTAIWSKSSRGPGGLGTFHCSVNGFCALGGITAGQQPCIGSISGGAPKPAS*
Ga0132258_10011056133300015371Arabidopsis RhizosphereMKTMKKLLKPLAIAGMAFLGCIANSTAQVQQSINFTLTIYDQSDTGIRTLRVSNKDIIQNLAGTNVPGGKLWLMMPPSPGVDGNGTIGAFLRVTDAHGNVIVDTTTDSFNIYQNTASQTSTRTYAWNGFSLSFGGLGAELFGTAIWNKGQWGPGGLGSFRCSVSGYCALGGITAGQEPCLGSISGGTPQAMH*
Ga0132258_1037804323300015371Arabidopsis RhizosphereMRTMKKLLKPLAIAGIAGMAILGCVANSTAQVQQSINFTLTIYDQSDTGVRTLRVSTKDVIQNLAGDNVPGGKLWLVMPNDPSVDGNGTIGAFLRVTDSHGNVIVDTTTDTFNIYQTSFSQTSTRTYAWNQFSLAFGGLGAELYGTATWSKSARGPGGQGSFHCSVSGHCGLGGITDGEQPCTGSIGGGAPRPSS*
Ga0132258_1114733513300015371Arabidopsis RhizosphereMKKLLKPLAIAGMAFLGCVANSIAQVEQSISFSLTVYDQSDTSVRTLRVSTKDVIENLAGSSVPGGKLWLVMPTDPGVDGNGTIGAFLRVTDSHGNVVVETTTDSFNIYQTAFSQTSTRTYAWNQFSLSFGGLGAELYGTATWSKSARGPGGQGSFHCSVSGHCGLGGITDGEQPCSGSISGGAPKPSS*
Ga0132258_1125820323300015371Arabidopsis RhizosphereMKRLLKALAIAGVGFLGFVVNGKAQVQQSISFSLTLYNQTDTSVRAVRISTRDVIANLAGTNVPGGKLWLLMPNAPTPAANGNIGAYLRVTDSKGNIIVETTPDYFNIYQTFSSQNATRIYGWNQFSLSFGGLSAELYGTAAWSKSSRGPGGQGSFHCTVSGHCGLGGISDGEVPCTGSISGGAPRPAS*
Ga0132258_1305876513300015371Arabidopsis RhizosphereMKNLLKSLAIAGLTIFGCLANGTAQVQQSIGFTLTVYDQADSGVRALRVSSKDVIEHLAGSKVPGGKLWLVMPDDPGVDRNGTIGAVLQVTDSRGNVIAVTTTRSFNIYQNTAAQTDSRTYAWNGFSLDFGGLEAELYGTVIWSKRSRGPGGLGTFHCSVNGFCALGGITAGQQPCIGSISGGAPKPAS*
Ga0132257_10069887523300015373Arabidopsis RhizosphereMKKLLKTLAIAGLTILGCLANGTAQVKQSIGFTLTVYDQADSGVRALRVSSKDVIEHLAGTKINGGKLWLVMPDDPGVDRNGTIGAVLEVTDSCGNVVAKTTTRSFNIYQNTAAQTDSRTYAWNGFSLDFGGLEAELYGTAIWSKSSRGPGGLGTFHCSVNGFCALGGITAGQQPCIGSISGGAPKPAS*
Ga0132255_10029212243300015374Arabidopsis RhizosphereVYDQADSGVRSLRVSSKDVIEHLAGTKVPGAKLWLVMPDDPGVDRNGTIGAVLEVTDSRGNVVAKTTTRSFNIYQNTAAQTDSRTYAWNGFSLDFGGLEAELYGTAIWSKSSRGPGGLGTFHCSVNGFCALGGITAGQQPCIGSISGGAPKPAS*
Ga0182037_1208685213300016404SoilMKKSLKALAIAGVAVLGCVASSTAQVQQSISFSLTVYDQSDTDVRTLRVSNKDVIQNLAGTNVPGAKLWLVMPTDPSPDGNGNIGAFLRVTDAQGNIVVETTTDTFNIYQTSFSQTSTRTYAWNQFSLDFGGLGSELYGTAIWSKNLRAPLGQGSFHCSVSGH
Ga0066667_1123588413300018433Grasslands SoilQVQQSISFSLTLYDQTDSGVRAVRVGTRDVIANLVGTNVPGGRLLLVMPPDPSPDGSGNIGAFLRVMDSSGNIIVETTSDSFNIYQPSFAQTGTRIFAWNQFSLSFGGVGAELYGTAIWSKSPRGPGGQGAFHCSVSGPCGLGGITDGEMPCTGSISGGAPRPAQ
Ga0066662_1070667313300018468Grasslands SoilMKNIFKALAIAGVAVLGFVANGTAQVQQSISFSLTVFDQTDTGVRPVRVTTRDVIENLAGTNVPGGKLWLVMPNDPSPSGNGNIGAFLRVTDSKGNVVAETDSSTFNIYQTTSSQTTTGARSVAWNQFSLAFGGIGAELYGTATWTKSPLGPGGQGSFHCNVSGHCGIGGVTDGERPCTGSISGGAPRPAG
Ga0193751_123409613300019888SoilMKRLLKALAIAGVAVLGFVANGKAQVQQSISFSLTLRDQTDTGVRVLHVTTRDVIENLVGTNVPGGKLWLVMPTDPSPDSNGNIGAFLRVTDSKGNIIVDTTTDSFNIYQMFSSQSDTHIYAWNQFSLSFGGIGAELFGTATWSKSPHGPGGQGSFHC
Ga0193728_106949823300019890SoilVVILGVLAEGKAQVEQAITFNLRLYDQTDTGVRTLKIVNKDIILNLVGTNVPGAKLFLVMPNEPTPSGDTGNIGAWLRVTDAKGNILAETDTTRFNIYQTVSTQNPTHIYAWNQFSLAFGGIGAEVFGTTTWTKNARTPGGQGSFHCTVSGTCGVGGTSNGTVPCTGSISGGEPKPTN
Ga0193693_1000552123300019996SoilMRRWRRVTLDAQRRQQALRMSIMKKALKALAIAGVAVLGFVANGKAQVQQSISVSLTVYNQSDTGVRPLRVSTRDVIENLAGTNVPSGKLWLIMPTDPRPDENGNIGAFLRVTDSRGNILVDTTSDSFNIYQTSSSQSGTRTYAWNQFSLSFGGLGAELYGTATWSKDLHGPGGQGSFHCSVNGHCGIGGITAGEMPCVGSISVGAPRPAN
Ga0179590_109753513300020140Vadose Zone SoilMKRLSKALAVTGMAILGFAANSTAQVQQTISFSFTVYDQTDTGVRVLRLGNKDVMESLAGTNVPGGHLWLVMPIKPGVDRNGTIGAFLRVRDAHGNVIAQTTTRSFNIYQNTASLTGNRTYAWNGFSLSFGGLGAELYGTAIWSKSFRSPGG
Ga0210406_1071982213300021168SoilMKKSLKALAIAGIAVLGCVASSTAQVQQSISFSLTVYDQSDTDVRPLRVSNKDVIENLAGTTVPGGKLWLVMPTDPSPDGNGNIGAFLRVTDAHGNIIVETTTDSFNIYQTAFSQTSNLTYAWNQFSLAFGGLGAELYGTAIWSKSFRGPGGQGSFHCSVSGHFALGGVTEGSQPCSGSISGSAPKPSD
Ga0210406_1129944913300021168SoilGVRTANCEKKTMKKLSKVLAIAGMAFLGYAAIGTAQVQQSISFSFTVYNQTATGIRALRVSTKDVIENLAGAKVPGGKLWLVMPTDPSVDGNGTLGAVLRVTDAHGNTIAETTTDTFNLYQSSFSQTATRTYAWNGFSIDFGGLDAELYGTATWSKRSRSPGGQGSFHCSVSGH
Ga0193709_10000011153300021411SoilMKNLLKALAVAGVAFLGFVANGKAQVQQSISFTLTLYNQSDTSVRPVRVSTKDVIENLAGTNVPGGKLLLVMPTDPSPDGNGTIGAFLRVTDSRGNIIAETTTDSFNVYQTSFSQTTTHTYAWNQFSLSFGGLGAELYGTATWSKSFRGPGGQGSFHCSVSGHCALGGITDGEMPCMGSVSGGAPRAD
Ga0179589_1003932223300024288Vadose Zone SoilMAILGFAANSTAQVQQTISFSFTVYDQTDTGVRVLRLGNKDVMESLAGTNVPGGHLWLVMPIKPGVDRNGTIGAFLRVRDAHGNVIAQTTTRSFNIYQNTASLTGNRTYAWNGFSLSFGGLGAELYGTAIWSKSFRSLGNL
Ga0207685_1073686313300025905Corn, Switchgrass And Miscanthus RhizosphereMKKLLKPLVIAGMAFLGYVANSTAQVQQSISFSLTVYNQSDTGVRALRLSNKDIIQNLAGTNVAGGKLWLIMPTDPGVDGNGTIGAFLRVTDSRGNILAETTTDSFNIYQNTAAQTSTRTYAWNGFSLSFGGLGAELYGTAIWSKGVWGPGGLGS
Ga0207663_1054420213300025916Corn, Switchgrass And Miscanthus RhizosphereMKKLLKPLVIAGMAFLGYVANSTAQVQQSISFSLTVYNQSDTGVRALRLSNKDIIQNLAGTNVAGGKLWLIMPTDPGVDGNGTIGAFLRVTDSRGNILAETTTDSFNIYQNTAAQTSTRTYAWNGFSLSFGGLGAELYGTAIWSKGVWGPGGLGSFRTSVSGYCALGGITDGQEPCVGSISGSAPKSAAAR
Ga0207694_1125988813300025924Corn RhizosphereMKKLLKASVIAGVALLGFVANGTAQVQQSISLSLTVYNQTDTAVRAVRVSTRDVIANLAGTNVPGAKLWLVMPNDPSPGGNGNIGAFLRVTDSHGNVIVETTSDSFNIYQTVSSQTATRTYAWNQFSLAVGGLSAELYGTATWNKSLRGPGGLGKLLF
Ga0207687_1054914313300025927Miscanthus RhizosphereMKKLLKALAIAGMAFLGFVANSTAQVQQSINFSLTLYNQTDTGIRPLRISTRDVIANLVGTNVPGGRLWLVMPTDPSPDENGNIGAFLRVTDSRGNIMAETTSDYFNIYQTFFSQSGARLVAWNQFSLSFGGLSAELYGTAIWSKSLRGPGGQGSFHCSVSGHSGLGGVSDGDVPCTGVISGGAPRPAS
Ga0207686_1086273513300025934Miscanthus RhizosphereQPVWVREMSRSKGQDFLLTQQVFEQRKLEWVRRQVTLDAQLSRTARPARLMRTMKKLTKALAIAGVVVLGYVANSTAQVQQSISFSLTVFDQTETGVRPLRITNRDIIENLVGTNVPGAKLWLIMPNDPSPGANGNIGAFLRVKDSHGNVIAETDSSLFNIYQTTSSQTQTRTVAWNQFSLAFGGVGAELYGTVLWSKSLRGPGGQGSFHCSVSGHCSLGGITDGDMPCIGSISGGAPKPSS
Ga0207661_1048046813300025944Corn RhizosphereMKKLLKASVIAGVALLGFVANGTAQVQQSISLSLTVYNQTDTAVRAVRVSTRDVIANLAGTNVPGAKLWLVMPNDPSPGGNGNIGAFLRVTDSHGNVIVETTSDSFNIYQTVSSQTATRTYAWNQFSLAVGGLSAELYGTATWNKSLRGPGGLGSFHCSVSGHCAISGITNGDQPCIGSISGGAPK
Ga0207712_1174326313300025961Switchgrass RhizosphereFLKALVIVAFLGSVANSRAQVQQSISLSLTVYNQTDTSIRAVRVSTKDIIANLVGTNVSGGKLWLVMPSDPSPDGNGNIGASLRVTDSHGNVIVETTSDSFNIYQAFSSQTSTRTYAWNQFSLAFGGVGAELYGTVIWSKSLRAGGQGSFHCSVSGHCSLGGITDGDMPCIGSISGGAPKPSS
Ga0207712_1205458213300025961Switchgrass RhizosphereTPTHSHDGKHLREINTMKRVLKTLAIAGVAVLGFVANSKAQVQQSINFSLTMYNQNDTGVRALRISTRDVIANLAGTNVPGGKLWLVMPSDPSPDVNGNIGAFLRVTDSKGNIIVETTSDYFNIYQTFSSQNTTRIYAWNQFSLSFGGLSAELYGTVAWSNRSRGGQGSF
Ga0207674_1002457923300026116Corn RhizosphereMAVLGCVANSTAQVQQSISFSLTVYSQTDSGVRALRLGNKDIIQNLVGTNVPGGHLWLVMPSNPGVDGNGTIGAFLRVRDAHGNVLVETTTDSFNIYQNTASLTVNRTYAWNGFSLSFGGLGAELFGTAIWSKTFRNQGGLGAFHCSVNGYCALGGITEGQEPCVGSISGGAPKGG
Ga0209588_107427013300027671Vadose Zone SoilQSSSAPVVKKMSLQEETTLFPQPWTTGRMQPKQDCRGNSNELDAARLASGDARRTAGTAITARMRTMKKLLKALAIAGLAFLGCVANGTAQVKQSINFSLTVYDQSDTGVRTLRVSTKDVIENLAGTNVPGGKLWLVMPTDPGVDGNGTIGAFLRVTDSHGNVVVDTTTDSFNIYQNTSSQAGTRTYAWNGFSLSFGGLGAELYGTAIWSKSFRSPGGLGTFHCSVSGYCALGGITDGQEPCIGSISGGAPSPAH
Ga0209811_1011732513300027821Surface SoilMAFLGCIANSTAQVQQSINFTLTIYDQSDTGIRTLRVSNKDIIQNLAGTNVPGGKLWLMMPPSPGVDGNGTIGAFLRVTDAHGNVIVDTTTDSFNIYQNTASQTSTRTYAWNGFSLSFGGLGAELFGTAIWNKGQWGPGGLGSFRCSVSGYCALGGITAGQEPCLGSISGGTPQAAASRIFRRVPPP
Ga0209488_1017630233300027903Vadose Zone SoilMKMLLKASAIAGVAVLGFVANGKAQVQQSISFTLTLFDQTDRGIRPLRVTTRDVIANLAGTNVPGGRLWVVMPTDPSPDGSGNIGAFLRVTDSRGNIIAETTSDSFNIYQTFSSQAGTRTIAWNQFSLSIGGVGAELYGTATWSKSSSGAGGQGSFRCTVSGHCGLGDATNGEVPCTGTISGGATKPTG
Ga0170834_10011010313300031057Forest SoilSVASSTAQVQQSISFSLSVYDQNDTDVRTLRVSNKDIIENLAGTNVPGGKLWLVMPTDPSPDGSGNIGAFLRVTDAHGNVIVESTTDSFNIYQTAFSQTSSLTYAWNQFSLAFGGLGAELYGTAIWSKSFRSPGGQGSFHCSVSGHFALGGVTVGSQPCSGSISGSAPKPSD
Ga0308189_1006122823300031058SoilMKSLLKTLAIAGVAVLGFVANGQAQVQQSISISLTLRDQTDTGVRTLHISNRDVIENLVGTNVPGGRLLLVMPNDPTPDGNGNIGAFLRVTDSKGNVLAETDSSSFNIYQTVSSQTSTRIYAWNQFSFDFGGLGAELYGTATWSRSPLGPGGQGSFHCTVSGHCGLGGISNGQVPCTGSIAGGAPKPAN
Ga0307469_1171207013300031720Hardwood Forest SoilRGELESAHELRLASVTFDAYRDGKHCKNVDMKKLLKPLALAGVIFLGFVANCTAQVQQSISFTLTVYNQTDSSVRPVRVSTRDVIANLAGTNVPGGRLLLVMPTDPSPDENGNIGAFLRVTDSRGNIIAETSSDSFNIYQLSSSQTATRTYAWNQFSLSFGGLSAELYGTAIWNKKFSGPGGLGSFHCSVNGHCAIGGITDT
Ga0335069_1004794643300032893SoilMKTVLKSLAIAGVAVLGFAANGKAQVQQSINFNLTVYNQSDTGVRAVRVGTRDVIENLTGTNVPNAKLWLIMPTDPSPDGNGNIGAFLRVTDSKGNVLVETTSDSFNIYQTSSSQSGTRTYAWNQFSLSFGGLGAELYGTATWSKSGHSPGGQGSFHCSVNGHCALGGITSGEMPCVGSISGGAPRPAN
Ga0310810_1004996743300033412SoilMKKLSKVLAIAGMAFLGYAAIGAAQVQQSISFSFTVYNQTAAGIRAVRVGTKDVIQNLAGSNVRGKLWLVMPTDPSPDGNGTLGAVLRVTDAHGNIIAETTTDSFNLYQSSFSQTSTRTYAWNGFSLAFGGIDAELFGTAIWSKSFRSPGGQGTFHCSVSGHFRLSGITDGEQPCSGSIAGGATKPTS
Ga0310810_1011409313300033412SoilMKKSLKALAIAGIAVLGCVASSTAQVQQSISFSLTVYDQSDTGVRALRVSNKDVIENLAGTTVPGGKLWLVMPTDPSPDGNGNIGAFLRVTDAQGNIIAETTTDTFNVYQNSFSQDSTRTYAWNQFSLDFGGLGSELYGTAIWSKSFRSPGGQGSFHCSVSGHLALGGVTDGSQPCSGSISGSAPKPAG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.