NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F074444


Overview

Basic Information
Family ID F074444
Family Type Metagenome
Number of Sequences 119
Average Sequence Length 129 residues
Representative Sequence MKKPLYAALDLHSRYSVLGSMDHEGRTHPRIRFPTEANILRAEVERLRRKRRPLYLTMEAGALTRWASAIVRPLAERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALV
Number of Associated Samples 108
Number of Associated Scaffolds 119

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 16.52 %
% of genes near scaffold ends (potentially truncated) 82.35 %
% of genes from short scaffolds (< 2000 bps) 93.28 %
Associated GOLD sequencing projects 99
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.81


Most Common Taxonomy
Group Bacteria (96.639 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (18.487 % of family members)
Environment Ontology (ENVO) Unclassified (26.891 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (55.462 % of family members)
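
The percentages in the taxonomy and ecosystem summaries can be converted back into member counts. The sketch below assumes that each percentage is taken over the 119 sequences reported under Basic Information; that denominator is an assumption and not something the page states. The dictionary labels are shortened names chosen here for illustration.

```python
FAMILY_SIZE = 119  # "Number of Sequences" from Basic Information

# Percentages copied from this page; labels are illustrative shorthand.
reported = {
    "Bacteria (taxonomy)": 96.639,
    "GOLD: Grasslands soil": 18.487,
    "ENVO: Unclassified": 26.891,
    "EMPO: Soil (non-saline)": 55.462,
}


def implied_count(percent, total=FAMILY_SIZE):
    """Return the whole-number count closest to percent% of total."""
    return round(total * percent / 100)


counts = {label: implied_count(p) for label, p in reported.items()}

for label, n in counts.items():
    # Recompute the percentage from the integer count as a consistency check.
    print(f"{label}: {n} of {FAMILY_SIZE} ({100 * n / FAMILY_SIZE:.3f} %)")
```

If the recomputed percentage matches the reported one to three decimals, the assumed denominator is at least consistent with that row.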




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 27.04%, β-sheet: 20.75%, Coil/Unstructured: 52.20%

Predicted 3D Structure

pTM-score: 0.81

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
c.55.3.13: Tex RuvX-like domain-like | d3bzca5 | 3bzc | 0.7257
c.55.3.16: RuvC-like domain from CRISPR-associated protein Cas9 | d4ogca1 | 4ogc | 0.70354
c.55.3.18: RuvC-like domain from CRISPR-associated protein Cas12a / Cpf1 | d5id6a1 | 5id6 | 0.70176
c.55.3.6: RuvC resolvase | d6lw3a_ | 6lw3 | 0.69439
c.55.3.16: RuvC-like domain from CRISPR-associated protein Cas9 | d4oo8a1 | 4oo8 | 0.69404
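
One way to read the structural matches is to check which hits clear a fold-level TM-score cutoff and whether they agree on a SCOPe superfamily. The sketch below uses 0.5 as the cutoff, a common rule of thumb for "same overall fold" that is assumed here and not stated on this page. The helper `superfamily` is a hypothetical name introduced for illustration.

```python
# Rows copied from the table above: (SCOPe family ID, domain ID, TM-score).
matches = [
    ("c.55.3.13", "d3bzca5", 0.7257),
    ("c.55.3.16", "d4ogca1", 0.70354),
    ("c.55.3.18", "d5id6a1", 0.70176),
    ("c.55.3.6", "d6lw3a_", 0.69439),
    ("c.55.3.16", "d4oo8a1", 0.69404),
]

SAME_FOLD_CUTOFF = 0.5  # rule-of-thumb value, assumed for this sketch


def superfamily(family_id):
    """Drop the last field of a class.fold.superfamily.family identifier."""
    return family_id.rsplit(".", 1)[0]


# Keep hits above the cutoff, then collect the superfamilies they fall into.
confident = [m for m in matches if m[2] > SAME_FOLD_CUTOFF]
superfamilies = {superfamily(m[0]) for m in confident}

print(len(confident), "hits above cutoff; superfamilies:", sorted(superfamilies))
```

All listed hits sharing one superfamily prefix is a stronger hint than any single match, though it remains a structural similarity and not a functional annotation.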



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 119 Family Scaffolds
PF01548 | DEDD_Tnp_IS110 | 21.85
PF02371 | Transposase_20 | 3.36
PF14534 | DUF4440 | 1.68
PF00571 | CBS | 1.68
PF08241 | Methyltransf_11 | 0.84
PF00501 | AMP-binding | 0.84
PF00011 | HSP20 | 0.84
PF00724 | Oxidored_FMN | 0.84
PF02641 | DUF190 | 0.84
PF08002 | DUF1697 | 0.84

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 119 Family Scaffolds
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 25.21
COG0042 | tRNA-dihydrouridine synthase | Translation, ribosomal structure and biogenesis [J] | 0.84
COG0071 | Small heat shock protein IbpA, HSP20 family | Posttranslational modification, protein turnover, chaperones [O] | 0.84
COG1902 | 2,4-dienoyl-CoA reductase or related NADH-dependent reductase, Old Yellow Enzyme (OYE) family | Energy production and conversion [C] | 0.84
COG1993 | PII-like signaling protein | Signal transduction mechanisms [T] | 0.84
COG3797 | Uncharacterized conserved protein, DUF1697 family | Function unknown [S] | 0.84



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 96.64 %
Unclassified | root | N/A | 3.36 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2170459004|F62QY1Z02H2EN2 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula | 501
2170459007|GJ61VE201AGXA2 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula | 505
2170459009|GA8DASG02GW74N | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula | 507
2170459019|G14TP7Y01C45N1 | All Organisms → cellular organisms → Bacteria | 743
2170459019|G14TP7Y01DPO2R | All Organisms → cellular organisms → Bacteria | 676
2228664021|ICCgaii200_c0527069 | All Organisms → cellular organisms → Bacteria | 514
3300000364|INPhiseqgaiiFebDRAFT_100629496 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidobacterium → Acidobacterium capsulatum | 516
3300000955|JGI1027J12803_101613654 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 585
3300002896|JGI24802J43972_1010186 | All Organisms → cellular organisms → Bacteria | 692
3300004114|Ga0062593_101849219 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 665
3300004479|Ga0062595_102415591 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 522
3300005166|Ga0066674_10550161 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter → Salinibacter ruber | 514
3300005172|Ga0066683_10530324 | All Organisms → cellular organisms → Bacteria | 718
3300005175|Ga0066673_10437147 | All Organisms → cellular organisms → Bacteria | 768
3300005175|Ga0066673_10504598 | All Organisms → cellular organisms → Bacteria | 712
3300005179|Ga0066684_10516051 | All Organisms → cellular organisms → Bacteria | 802
3300005180|Ga0066685_10896773 | All Organisms → cellular organisms → Bacteria | 593
3300005181|Ga0066678_10343034 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 984
3300005289|Ga0065704_10013801 | All Organisms → cellular organisms → Bacteria | 1755
3300005294|Ga0065705_10043564 | All Organisms → cellular organisms → Bacteria | 1801
3300005340|Ga0070689_102039031 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter → Salinibacter ruber | 525
3300005435|Ga0070714_102343299 | All Organisms → cellular organisms → Bacteria | 519
3300005440|Ga0070705_101636252 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter → Salinibacter ruber | 543
3300005445|Ga0070708_102212863 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter → Salinibacter ruber | 508
3300005467|Ga0070706_101701118 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter → Salinibacter ruber | 574
3300005468|Ga0070707_100290317 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter → Salinibacter ruber | 1589
3300005536|Ga0070697_101558890 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter → Salinibacter ruber | 591
3300005545|Ga0070695_101879451 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter → Salinibacter ruber | 503
3300005555|Ga0066692_10529655 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter → Salinibacter ruber | 747
3300005560|Ga0066670_10528225 | All Organisms → cellular organisms → Bacteria | 723
3300005575|Ga0066702_10823119 | All Organisms → cellular organisms → Bacteria | 553
3300005610|Ga0070763_10872478 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter → Salinibacter ruber | 534
3300006046|Ga0066652_101771486 | All Organisms → cellular organisms → Bacteria | 559
3300007255|Ga0099791_10504495 | All Organisms → cellular organisms → Bacteria | 588
3300009012|Ga0066710_103332710 | All Organisms → cellular organisms → Bacteria | 612
3300009090|Ga0099827_10876067 | All Organisms → cellular organisms → Bacteria | 778
3300009792|Ga0126374_11202547 | All Organisms → cellular organisms → Bacteria | 607
3300010322|Ga0134084_10206348 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 690
3300010361|Ga0126378_12574420 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 581
3300010398|Ga0126383_13270898 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 529
3300012198|Ga0137364_10095163 | All Organisms → cellular organisms → Bacteria | 2087
3300012198|Ga0137364_10380737 | All Organisms → cellular organisms → Bacteria | 1055
3300012206|Ga0137380_11661635 | All Organisms → cellular organisms → Bacteria | 523
3300012210|Ga0137378_10043068 | All Organisms → cellular organisms → Bacteria | 4047
3300012351|Ga0137386_11192099 | All Organisms → cellular organisms → Bacteria | 533
3300012354|Ga0137366_10350549 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1080
3300012356|Ga0137371_10654650 | All Organisms → cellular organisms → Bacteria | 805
3300012357|Ga0137384_11568766 | All Organisms → cellular organisms → Bacteria | 508
3300012362|Ga0137361_11466651 | All Organisms → cellular organisms → Bacteria | 605
3300012582|Ga0137358_10713626 | All Organisms → cellular organisms → Bacteria | 670
3300012914|Ga0157297_10320119 | All Organisms → cellular organisms → Bacteria | 592
3300012927|Ga0137416_11486106 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 615
3300012927|Ga0137416_11907004 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 544
3300012929|Ga0137404_12260494 | All Organisms → cellular organisms → Bacteria | 509
3300012930|Ga0137407_12269001 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 519
3300012960|Ga0164301_11139668 | All Organisms → cellular organisms → Bacteria | 623
3300012975|Ga0134110_10154931 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 948
3300012977|Ga0134087_10427501 | All Organisms → cellular organisms → Bacteria | 651
3300012984|Ga0164309_11228936 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 630
3300012988|Ga0164306_10945864 | All Organisms → cellular organisms → Bacteria | 706
3300014166|Ga0134079_10446679 | All Organisms → cellular organisms → Bacteria | 612
3300015356|Ga0134073_10052256 | All Organisms → cellular organisms → Bacteria | 1099
3300015358|Ga0134089_10356854 | All Organisms → cellular organisms → Bacteria | 617
3300015359|Ga0134085_10443902 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 587
3300015373|Ga0132257_100894460 | All Organisms → cellular organisms → Bacteria | 1113
3300016341|Ga0182035_10678457 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 896
3300016341|Ga0182035_11112683 | All Organisms → cellular organisms → Bacteria | 703
3300016341|Ga0182035_12069200 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 517
3300016357|Ga0182032_11401976 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 605
3300016371|Ga0182034_10715720 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 852
3300018073|Ga0184624_10048403 | All Organisms → cellular organisms → Bacteria | 1713
3300018081|Ga0184625_10071982 | All Organisms → cellular organisms → Bacteria | 1758
3300018431|Ga0066655_11223551 | All Organisms → cellular organisms → Bacteria | 534
3300019361|Ga0173482_10452587 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 611
3300019361|Ga0173482_10691444 | All Organisms → cellular organisms → Bacteria | 526
3300020010|Ga0193749_1016297 | All Organisms → cellular organisms → Bacteria | 1452
3300021078|Ga0210381_10383919 | All Organisms → cellular organisms → Bacteria | 517
3300025906|Ga0207699_10349723 | All Organisms → cellular organisms → Bacteria | 1043
3300025910|Ga0207684_11629368 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 522
3300025916|Ga0207663_11550011 | All Organisms → cellular organisms → Bacteria | 533
3300025928|Ga0207700_10923439 | All Organisms → cellular organisms → Bacteria | 781
3300025929|Ga0207664_10386093 | All Organisms → cellular organisms → Bacteria | 1244
3300026322|Ga0209687_1295961 | All Organisms → cellular organisms → Bacteria | 512
3300026325|Ga0209152_10482596 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 507
3300026326|Ga0209801_1127539 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1088
3300026326|Ga0209801_1288075 | All Organisms → cellular organisms → Bacteria | 591
3300026332|Ga0209803_1195747 | All Organisms → cellular organisms → Bacteria | 750
3300026334|Ga0209377_1193003 | All Organisms → cellular organisms → Bacteria | 690
3300026527|Ga0209059_1315280 | All Organisms → cellular organisms → Bacteria | 530
3300026537|Ga0209157_1303533 | All Organisms → cellular organisms → Bacteria | 584
3300026547|Ga0209156_10464604 | All Organisms → cellular organisms → Bacteria | 523
3300026550|Ga0209474_10035216 | All Organisms → cellular organisms → Bacteria | 3708
3300026557|Ga0179587_10457916 | All Organisms → cellular organisms → Bacteria | 835
3300026771|Ga0207552_102048 | All Organisms → cellular organisms → Bacteria | 727
3300026822|Ga0207498_108015 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Roseimicrobium → Roseimicrobium gellanilyticum | 506
3300027105|Ga0207944_1023982 | All Organisms → cellular organisms → Bacteria | 526
3300027882|Ga0209590_10891944 | All Organisms → cellular organisms → Bacteria | 560
3300027889|Ga0209380_10645556 | All Organisms → cellular organisms → Bacteria | 611
3300028889|Ga0247827_11346045 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 501
3300031057|Ga0170834_107267763 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Roseimicrobium → Roseimicrobium gellanilyticum | 586
3300031231|Ga0170824_105773348 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 611
3300031231|Ga0170824_122878192 | All Organisms → cellular organisms → Bacteria | 1233
3300031469|Ga0170819_15028915 | All Organisms → cellular organisms → Bacteria | 552
3300031474|Ga0170818_111405489 | All Organisms → cellular organisms → Bacteria | 501
3300031474|Ga0170818_114735588 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 944
3300031716|Ga0310813_10304619 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Roseimicrobium → Roseimicrobium gellanilyticum | 1345
3300031718|Ga0307474_11644843 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Roseimicrobium → Roseimicrobium gellanilyticum | 504
3300031719|Ga0306917_10557318 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Roseimicrobium → Roseimicrobium gellanilyticum | 901
3300031820|Ga0307473_10887320 | All Organisms → cellular organisms → Bacteria | 643
3300031823|Ga0307478_11065630 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 675
3300031833|Ga0310917_10073027 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 2157
3300031894|Ga0318522_10427227 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Roseimicrobium → Roseimicrobium gellanilyticum | 502
3300031945|Ga0310913_10735974 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Roseimicrobium → Roseimicrobium gellanilyticum | 697
3300031946|Ga0310910_11571895 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Roseimicrobium → Roseimicrobium gellanilyticum | 503
3300031996|Ga0308176_12366283 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Roseimicrobium → Roseimicrobium gellanilyticum | 567
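
Each scaffold identifier in this table appears to be a Taxon OID and a scaffold name joined by `|`, with the Taxon OID matching the key used in the Associated Samples table. Assuming that reading is right (it is inferred from the identifiers, not documented on this page), the sketch below splits identifiers and counts scaffolds per sample. `split_scaffold_id` is a hypothetical helper introduced for illustration.

```python
from collections import Counter

# A few scaffold identifiers copied from the table above.
scaffold_ids = [
    "2170459019|G14TP7Y01C45N1",
    "2170459019|G14TP7Y01DPO2R",
    "3300016341|Ga0182035_10678457",
    "3300016341|Ga0182035_11112683",
    "3300016341|Ga0182035_12069200",
    "3300031996|Ga0308176_12366283",
]


def split_scaffold_id(scaffold_id):
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, _, scaffold_name = scaffold_id.partition("|")
    if not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


# Count how many family scaffolds each sample (Taxon OID) contributes.
per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)

for taxon_oid, n in per_sample.most_common():
    print(taxon_oid, n)
```

Samples that contribute more than one scaffold would account for the family having more associated scaffolds (119) than associated samples (108).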




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 18.49%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 15.13%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 8.40%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 6.72%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 5.04%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 5.04%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.36%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.52%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.52%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 2.52%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.52%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.68%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 1.68%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.68%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 1.68%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 1.68%
Switchgrass, Maize And Mischanthus Litter | Engineered → Solid Waste → Grass → Composting → Unclassified → Switchgrass, Maize And Mischanthus Litter | 1.68%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.84%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.84%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.84%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.84%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.84%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.84%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.84%




Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2170459004Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect MP BIO 1O1 lysis 0-21cm (2)EnvironmentalOpen in IMG/M
2170459007Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect in plug lysis (for fosmid construction) 10-21cmEnvironmentalOpen in IMG/M
2170459009Grass soil microbial communities from Rothamsted Park, UK - July 2009 indirect DNA Tissue lysis 0-10cmEnvironmentalOpen in IMG/M
2170459019Litter degradation MG4EngineeredOpen in IMG/M
2228664021Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002896Soil microbial communities from Manhattan, Kansas, USA - Sample 300um NexteraEnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005289Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2Host-AssociatedOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005340Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaGEnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012914Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S028-104C-2EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012984Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MGEnvironmentalOpen in IMG/M
3300012988Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MGEnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300016341Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170EnvironmentalOpen in IMG/M
3300016357Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000EnvironmentalOpen in IMG/M
3300016371Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172EnvironmentalOpen in IMG/M
3300018073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_b1EnvironmentalOpen in IMG/M
3300018081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b1EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019361Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2)EnvironmentalOpen in IMG/M
3300020010Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1s2EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026527Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300026771Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A4-12 (SPAdes)EnvironmentalOpen in IMG/M
3300026822Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A1-12 (SPAdes)EnvironmentalOpen in IMG/M
3300027105Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF018 (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300028889Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day2EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031469Fir Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031474Fir Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031719Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031833Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178EnvironmentalOpen in IMG/M
3300031894Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f18EnvironmentalOpen in IMG/M
3300031945Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082EnvironmentalOpen in IMG/M
3300031946Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172EnvironmentalOpen in IMG/M
3300031996Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R2EnvironmentalOpen in IMG/M


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
E4B_06671960 | 2170459004 | Grass Soil | MKKKSPVYTALDLHSRYSVLGSMEHGGKTGARIRFATEAETLRAQVARLRQARRPXFLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTR
L02_01710900 | 2170459007 | Grass Soil | MKKKSPVYTALDLHSRYSVLGSMEHGGKTGARIRFATEAETLRAQVARLRQARRPLFLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWM
F47_13301530 | 2170459009 | Grass Soil | MEHGGKTGARIRFATEAETLRAQVARLRQARRPLFLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQD
4MG_00896580 | 2170459019 | Switchgrass, Maize And Mischanthus Litter | MKSNNPVYAALDLHSRHSVLGSMEHGGKMGARMRFATEAETLRAQVARLRQARRPLFLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVKGCVCFCAWQARKRCGWEPIGTPG
4MG_00291530 | 2170459019 | Switchgrass, Maize And Mischanthus Litter | MKTPLYAALDLHSRYSVLGGMDHEGRVQGRVRFATTAQLLQTHVGALRQKKRPLYLTMEAGAVSRWASAIVRPLVERLIICEPRHNRLINSNPSKCDEADVEG
ICCgaii200_05270691 | 2228664021 | Soil | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALV
INPhiseqgaiiFebDRAFT_1006294961 | 3300000364 | Soil | MKTPLYAALDLHSRYSVLGSMDHEGRVQGRVRFATTAQLLQTHVGALRQKKRPLYLTMEAGAFSRWASAIVRPLVERLIICEPRHNRLINSNPSKCDEADVEGMCLLLRVGKLKEVWMGTERRREIYRELVYELLNWRDAQRELKALIKA
JGI1027J12803_1016136541 | 3300000955 | Soil | NLMKKPIYAALDLHSRHSVLGSMDHDGQTQPRMRFPTEAKILRTEVERLRESRRPLYLTMEAGALTRWASAIVRPVVERLIICEPRHNRLINSNPQKSDEADVEGMCLLLRQRQAQRSVDGTRSHPRDLPRPGV*
JGI24802J43972_10101861 | 3300002896 | Soil | MKKKSPVYAALDLHSRHSVLGSMEHGGKMGARIHFATEAETLRAQVARLRQAKRPLFLTMEAGALTRWAXAIVGPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRALVYELLNWRDAQRQLKALI
Ga0062593_1018492192 | 3300004114 | Soil | MDHDGRTQAPTRFVTEAEILRSEVKALQERNRALHLTMEAEPLSRWASAIVRPLVERLVICEPRHNRLINSNPNKCDEADVEAMCLLLRL
Ga0062590_1009809031 | 3300004157 | Soil | MKRPIYAALDLHSRMSVLGSMDHGGKTGARMRFATEAETLRAQVARLRQAKGPLFLTMEAGALTRWASAVVRPLVERLIICEPRHNRLINSNPTKSDEADVE
Ga0062595_1024155911 | 3300004479 | Soil | MKKPIYAALDLHSGMSVLGSMDHDGRTQPRMRFPTEANILRAEVERLRRKRRPLYLTMEAGALTRWASAIVRPLAERLIICEPRHNRLINSNPTKSDEADVEGMCLL
Ga0062592_1025314532 | 3300004480 | Soil | MDHDGRTQAPTRFVTEAEILRSEVKALQERNRALHLTMEAEPLSRWASAIVRPLVERLVICEPRHNRLINSNPNKCDEADVEAMCLLLRLNKLKEVWMGTDRTREIYRSLV
Ga0066674_105501611 | 3300005166 | Soil | MKTKTPLYAALDLHSRYSVLGSMDHDGRTQPRMRFPTQAEVIRAQVRALKQKKKRPLYLTMEAGALTRWASTIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVW
Ga0066683_105303241 | 3300005172 | Soil | QRELLQINPMKTKTPLYAALDLHSRYSVLGSMDHDGRTQPRMRFPTQAEVIRAQVRALKQKKKRPLYLTMEAGALTRWASTIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGVDRAREIYRALVYELLNWRDAQRELKKPD*
Ga0066673_104371471 | 3300005175 | Soil | MKNKKPVYAALDLHSRYSVLGSMEHGGKTGERMRFPTEAEGLRAEVTRLRQRKRPLHLTMEAGALTRWASGIVRPLVERLLICEPRHNRLINSNPTKRDEADVEGMCLLLRLGKLKEVWMGTERTREIYRALVYELLNWRDAQRFMPYPSGEYAVVYASKRPFGETKRG*
Ga0066673_105045982 | 3300005175 | Soil | MKRTPLYAALDLHSRYSVLGSMDHEGKAQGRARFATTGQMLERHVQGLRGKNRPIYLTMEAGAMSRWASTIVRPLVERLLICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDR
Ga0066684_105160512 | 3300005179 | Soil | MKRTPLYAALDLHSRYSVLGSMDHEGKAQGRARFATTGQMLERHVQGLRGKNRPIYLTMEAGAMSRWASTIVRPLVERLLICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWM
Ga0066685_108967731 | 3300005180 | Soil | PVYAALDLHSRYSVLGSMEHGGKTGERMRFPTEAEGLRAEVTRLRQRKRPLHLTMEAGALTRWASGIVRPLVERLLICEPRHNRLINSNPTKRDEADVEGMCLLLRLGKLKEVWMGTERTREIYRALVYELLNWRDAQRFMPYPSGEYAVVYASKRPFVKRSAGNRLRRPKISSRSSRTLVRA*
Ga0066678_103430342 | 3300005181 | Soil | MKRPIYAALDLHSRRSVLGSMDHDGQTQPRMRFPTEANILRAQVERLRQRQRRPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLIHSNPQKNDEADVEGMCLLLRLGK
Ga0065704_100138011 | 3300005289 | Switchgrass Rhizosphere | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFATSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLRQAQRSVDGTGSHPRDLPGAGL*
Ga0065705_100435643 | 3300005294 | Switchgrass Rhizosphere | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKXRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLRQAQRSVDGTGSHPRDLPGAGL*
Ga0070689_1020390311 | 3300005340 | Switchgrass Rhizosphere | MKKPLYAALDLHSRYSVLGSMDHEGRTHPRIRFPTEANILRAEVERLRRKRRPLYLTMEAGALTRWASAIVRPLAERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALV
Ga0070714_1023432991 | 3300005435 | Agricultural Soil | MKKTPLYAALDLHSRYSVLGSMDHEGNSQPRMRFPTEANILRAEVEALRKRRRPVYLTMEAGAMSRWASTIVRPLVERLLICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKL
Ga0070705_1016362521 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTKTPLYAALDLHSRYSVLGSMDHNGKTQPRMRFPTQADVLQAQVRALKQKKRPLHLTMEAGALSRWASGIVRPLVERLIICEPRHNRLINSNPQKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRALVYELLN
Ga0070708_1022128631 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTKTPLYAALDLHSRYSVLGSMDHNGKTQPRMRFPTQADVLQAQVRALKQKKRPLHLTMEAGALSRWASGIVRPLVERLIICEPRHNRLINSNPQKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRALVYELLNWRDAQREL
Ga0070706_1017011181 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MKKPIYAALDLHSGNSVLGSMDHEGRTQPRMRFATEATILRAQVERLRRKRRPLYLTMEAGPLTRWASGIVRPLAERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRALVYELLNWRDAQ
Ga0070707_1002903171 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALVYELLNWRDAQRELKALIKAR*
Ga0070697_1015588901 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MKKKSPVYAALDLHSRHSVLGSMEHGGKTGARIRFATEAETLRAAVTRLRQARRPLFLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLK
Ga0070695_1018794511 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTKTPLYAALDLHSRYSVLGSMDHNGRTQPRKRFATQADILRAQVRLLKQKKRPLHLTLEAGALSRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDR
Ga0066692_105296551 | 3300005555 | Soil | MKRPLYAALDLHSAYSVLGSMDHSGKTQPRMRFATEAERLRAQVSALKQKRRPVHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMCLLLRLGKLKEVWMGQDRAREIYRALVYELLNWRDAQRELKSLIKARY
Ga0066670_105282251 | 3300005560 | Soil | MKKTPLYAALDLHSRYSVLGSMDHEGNSQPRMRFPTEANILRAEVEALRKRRRPVYLTMEAGAMTRWAGAIVRPLVERLIICEPRYNRLINSNPTKSDDADVDGMCLLLRLGKLKEVWMGVDRTREIYRALVYELLNWRPSPES*
Ga0066702_108231192 | 3300005575 | Soil | MKKTPLYAALDLHSRYSVLGSMDHEGNSQPRMRFPTEANILRAEVEALRKRRRPVYLTMEAGAMTRWAGAIVRPLVERLIICEPRYNRLINSNPTKSDEADVDGMCLLLR
Ga0070763_108724781 | 3300005610 | Soil | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALVYELLNWRDAQRELKALIKAR
Ga0066652_1017714861 | 3300006046 | Soil | VLSIKLMKRTPLYAALDLHSRYSVLGSMDHEGKAQGRARFATTGQMLERHVQGLRGKNRPIYLTMEAGAMSRWASTIVRPRVERLLICEPRHNRLINSNPTKSDEADVEGM
Ga0099791_105044951 | 3300007255 | Vadose Zone Soil | MKTKTPLYAALDLHSRYSVLGSMDHDGRTQPRMRFATQADILRAQVRALKQKRRPLHLTLEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALVYELLNWRDAQ
Ga0066710_1033327102 | 3300009012 | Grasslands Soil | MKKTPLYAALDLHSRYSVLGSMDHEGNSQPRMRFPTEANILRAEVEALRKRRRPVYLTMEAGAMTRWAGAIVRPLVERLIICEPRYNRLINSNPTKSDEADVDGMCLLLRLGKLKEVWMGVDRNREIYRALVIELL
Ga0099827_108760671 | 3300009090 | Vadose Zone Soil | MKKKPLYAALDLHGEHSVLGSMDHDGNNQPRVRFATGAESLRAQLNALRAASKRPVHLTMEAGPLSRWASAIARPLVDQLIICEPRHNRLINANPTKSDGADVEGMCLL
Ga0126374_112025471 | 3300009792 | Tropical Forest Soil | MGPLAESSLIGFGFGVAKLSRASCYKSNLMKKPIYAALDLHSRYSVLGSMDHEGQMQPPMRFPTEANMLRAEVERLRRRGRPLYLTMEAGPLSRWAGAIVRSVVERLVICEARHNRLINSNRYKGDEVDVEGMCLLLRLGKLKEVWMGSDRTREIYRALVY
Ga0134084_102063481 | 3300010322 | Grasslands Soil | MKKPIYAALDLHSRRSVLGSMDHDGQTQPRMRFPTEANILRAQVERLRQRRRPLYLTMEAGALTRWASAIVPPVVERLIICEPRHNRLINSNPQKNDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALV
Ga0126378_125744201 | 3300010361 | Tropical Forest Soil | MKRPIYAALDLHSRRSVLGSMDHDGQTQPRMRFATEANILRGEVERLRQKRRPLYLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPQKCDEADVDGMCLLLRLGKLKEVWMGQDRTREIY
Ga0126383_132708981 | 3300010398 | Tropical Forest Soil | MKKPVYAALDLHSGHSVLGSMEHDGQTQPRMRFPTEASILRAQVGRLGRKRRPLYLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIY
Ga0137364_100951631 | 3300012198 | Vadose Zone Soil | MKTPLYAALDLHSAYSVLGSMDHDGRTQPRMRFATQAEILRAQVKALKHKRRPLHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMCLLLRLGKLKEVWMGQ
Ga0137364_103807371 | 3300012198 | Vadose Zone Soil | MKKKSPVYAALDLHSRHSVLGSMDHGGKAGARMRFATEAETLRAQVARLRQAKRPLFLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRAPGL*
Ga0137380_116616351 | 3300012206 | Vadose Zone Soil | MKKKSPVYAALDLHSRHSVLGSMDHGGKTGARMRFATEAETLRAQVARLRQARRPLFLTMEAGALTRWASAVVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRAREIYRALVY
Ga0137378_100430681 | 3300012210 | Vadose Zone Soil | MKRKTPLYAALDLHSAYSVLGSMDHDGRTQPRMRFATQAEILRAQVKALKHKRRPLHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMCVLLRLGKLKEVWMGQDRAREIYRALV*
Ga0137386_111920991 | 3300012351 | Vadose Zone Soil | MKRKTPLYAALDLHSAYSVLGSMDHDGRTQPRMRFATQAEILRAQVKALKHKRRPLHLTMEAGALPRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMCLLLRLGKLKEVWMGVDRAR
Ga0137366_103505493 | 3300012354 | Vadose Zone Soil | MDHSGKTQPRMRFATEAEGLRAQVSALKQKRRPVHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLIHSNPTKCDEADVEGMCLLLR
Ga0137371_106546501 | 3300012356 | Vadose Zone Soil | MKRKTPLYAALDLHSAYSVLGSMDHSGKTQPRMRFATEAERLRAQVSALKQKRRPVHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMCLLLRLGKLKEVWMGQDRAREIYRALVYELLNWRDAQRELKSLIKA
Ga0137384_115687661 | 3300012357 | Vadose Zone Soil | MKTKTPLYAALDLHSRYSVLGSMDHDGRTQPRMRFPTQAEVLRAQVRALKQKKKRPLYLTMEAGALTRWASTIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMG
Ga0137361_114666511 | 3300012362 | Vadose Zone Soil | MKMKTPLYAALDLHSRYSVLGSMDHDGRTQPRMRFATQADILRAQVRALKQKRRPLHLTLEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDR
Ga0137358_107136261 | 3300012582 | Vadose Zone Soil | MKTKAPLYAALDLHSGYSVLGSMDHNGKTQPRTRFATQADLLRAQVKALKQKRRPLHLTLEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPQKSDEADVDGMCLLLRVGKLKDVWMGQDRTRAIYRALVYELLNWRDAQRELKALIKA
Ga0157297_103201191 | 3300012914 | Soil | MKSKSPIYAALDLNSRHSVLGSMEHGGKTGARMRFPTEANILRAQVERLRRRRRPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLL
Ga0137416_114861061 | 3300012927 | Vadose Zone Soil | MNTPLYAALDLHSRYSVLGSMDHEGRSESRKRFATEAKILRAEVEALKKKRRPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGMDRTRE
Ga0137416_119070041 | 3300012927 | Vadose Zone Soil | MKTKTPLYAALDLHSRYSVLGSMDHDGRTQPRMRFPTQAEVLRAQVRALKQKKKRPLYLTMEAGALTRWASTIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGMDRTRE
Ga0137404_122604941 | 3300012929 | Vadose Zone Soil | MKTKTPLYAALDLHSRHSVLGSMDHNGRTQPRMRFPTQAEVLRAQVRALKQKKRPLHLTMEAGALSRWASGIVRPLVERLIICEPRHNRLINSNPQKSDEADVEGMCLLLRLGKL
Ga0137407_122690011 | 3300012930 | Vadose Zone Soil | MKTKTPLYAALDLHSRYSVLGSMDHDGRTQPRMRFPTQAEVLRAQVRALKQKKKRPLYLTMEAGALTRWASTIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTD
Ga0164301_111396681 | 3300012960 | Soil | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKAVWMGVDRTREIYRALVYELLNWRDAQRELKALIK
Ga0134110_101549311 | 3300012975 | Grasslands Soil | MKKTPLYAALDLHSRYSVLGSMDHEGNSQPRMRFPTEANILRAEVEALRKRRRPVYLTMEAGAMTRWAGAIVRPLVERLIICEPRYNRLINSNPTKSDEADVDGMCLLLRLGKLKEVWMGVDRTREIYRALVYELLNWRDAQRELKSLI
Ga0134087_104275011 | 3300012977 | Grasslands Soil | MKKTPLYAALDLHSRYSVLGSMDHEGNSQPRMRFPTEANILRAEVEALRKRRRPVYLTMEAGAMTRWAGAIVRPLVERLIICEPRYNRLINSNPTKSDEADVDGMCLLLRLGKLKQVWMGVDRTREIYRALVYELLNWRDAQRE
Ga0164309_112289361 | 3300012984 | Soil | MKSKSPVYAALDLHSRHSVLGSMEHGGKTGTRMRFPTEANILRAQVERLRRRRRPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDGTR
Ga0164306_109458641 | 3300012988 | Soil | MKSKSPVYAALDLHSRHSVLGSMEHGGKTGTRMRFPTEANILRAQVERLRRRRRPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDGTREIYRSLVYELLNW
Ga0134079_104466791 | 3300014166 | Grasslands Soil | MKRPIYAALDLHSRRSVLGSMDHDGQTQPRMRFPTEANILRAQVERLRQRRRPLYLTMEAGALTRWASAIVPPVVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWM
Ga0134073_100522561 | 3300015356 | Grasslands Soil | MKRPLYAALDLHSAYSVLGSMDHSGKTQPRMRFATEAERLRAQVSALKQKRRPVHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMCLLLRLGKLKEVWMGQDRVREITVPWFTSSSTGVMR
Ga0134089_103568541 | 3300015358 | Grasslands Soil | MKIKTPLYAALDLHSRYSVLGSMDHDGRTQPRMRFPTQAEVLRAQVRALKQKKKRPLYLTMEAGALTRWASTIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGVSRTREIYRELVYEL
Ga0134085_104439021 | 3300015359 | Grasslands Soil | MKRPIYAALDLHSRHSVLGSMDHDGQTQPRMRFATEANILRAQVERLRRRQRPLYLTMEAGPLTRWASGIVRPLAQRLIICEPRHNRLINSNPQKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALVY
Ga0132257_1008944602 | 3300015373 | Arabidopsis Rhizosphere | MKTPLYAALDLHTVFSVLGSMDHGGKLHPRVRFATQRELLRKEVNRLRESKRPVHLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGIDRTREIYRALVYELLNWRDAKQ*
Ga0182035_106784572 | 3300016341 | Soil | MKKPIYAALDLHSRMSVLGSMDHDGRTQPRMRFATEASLLRVQVERLRRRRQPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTERTREIYRALVYELLNWRDAQRELKALI
Ga0182035_111126831 | 3300016341 | Soil | VKIKPMKTPLYAALNLHSAYSVLGSMDHQGRTQRRIRFATEAPMLRAQVEALRQKRPVQLTMEAGALTRLASGIVRPLVERLVICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLK
Ga0182035_120692001 | 3300016341 | Soil | MKKPLYAALDLHSRYSVLGSMDHEGRTQPRMRFPTEANILRAEVERLRRKRRPLYLTMEAGALTRWASAIVRPLAERLIICEPRHNRLINSNPTKNDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALVYELLNWRDAQRE
Ga0182032_114019761 | 3300016357 | Soil | MKKPIYAALDLHSRMSVLGSMDHDGRTQPRMRFATEASLLRVQVERLRRRRQPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRL
Ga0182032_118395691 | 3300016357 | Soil | MKKPLYAALDLHSRYSVLGSMDHEGRTQPRMRFPTEANILRAEVERLRRKRRPLYLTMEAGALTRWASAIVRPLAERLIICEPRHNRLINSNPTKSDEADV
Ga0182034_107157201 | 3300016371 | Soil | MKRPIYAALDLHSRYSVLGSMDHDGRTQPRMRFATEANILRAEVGRLRRKRRPLYLTMEAGALSRWASGLVRPLVDRLTICEPRHNRLINSNPTKSDEADVEGM
Ga0184624_100484033 | 3300018073 | Groundwater Sediment | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLRQAQRSVDGTGSHPRDLPGAGL
Ga0184625_100719821 | 3300018081 | Groundwater Sediment | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVCFPTSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLRQAQRSVDGTGSHPRDLPGAGL
Ga0066655_112235511 | 3300018431 | Grasslands Soil | MKRPIYAALDLHSRRSVLGSMDHDGQTQPRMRFPTEANILRAQVERLRQRRRPLYLTMEAGALTRWASAIVPPVVERLIICEPRHNRLINSNPTKSDEADVEGMCL
Ga0066669_122600251 | 3300018482 | Grasslands Soil | MKRKTPLYAALDLHSAYSVLGNMDHSGKTQPRMRFATQAEILRAQVKALKHKRRPLHLTMKAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEAD
Ga0173482_104525871 | 3300019361 | Soil | MKSKSPVYAALDLHSRHSVLGSMEHGGKTGARMRFPTEANILRAQVERLRRRRRPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRALVYELLNWRDAQ
Ga0173482_106914441 | 3300019361 | Soil | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLL
Ga0193749_10162971 | 3300020010 | Soil | MKTKTPLYTALDLHSRYSVLGSMDHEGKIQPRVRFATQANTLRVHVSALKKNRRPLHLTLEAGPLTRWASTIARPLMERLVICKPRHNRLVNANPIKSDEADVEGMCLLLRLGKLKEDGWEQVAPERSIAPWFTTS
Ga0210381_103839191 | 3300021078 | Groundwater Sediment | MMKKKSPLYAALDLHSRHSVLGSMEHGGKAGARMRFATEAETLRAAVTQLRRAQRPLFLTMEAGALTRWASAVVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRALVYELLNWRDAQRQLKALIKAR
Ga0207699_103497232 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | MKRPLYAALDLHSAYSVLGSMDHSGKTQPRMRFATEAERLWAQVSALKQKRRPVHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMCLLLRLGKLKEVWMGQDRAREIYRALVYELLNWRDAQRELKSL
Ga0207684_116293681 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MKKPIYAALDLHSGNSVLGSMDHEGRTQPRMRFATEATILRAQVERLRRKRRPLYLTMEAGPLTRWASGIVRPLAERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWM
Ga0207663_115500111 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MKKKSPVYAALDLHSRHSVLGSMEHGGKTGARIRFATEAETLRAAVTRLRQARRPLFLTMEAGALTRWASAIVRPLVERLIICEPQHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYR
Ga0207700_109234391 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKKRPIYLRMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRT
Ga0207664_103860933 | 3300025929 | Agricultural Soil | MKKTPLYAALDLHSRYSVLGSMDHEGNSQPRMRFPTEANILRAEVEALRKRRRPVYLTMEAGAMSRWASTIVRPLVERLLICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMG
Ga0209687_12959612 | 3300026322 | Soil | MKRPLYAALDLHSAYSVLGSMDHSGKTQPRMRFATEAERLRAQVSALKQKRRPVHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMCLLLRLGKLKEVWMGQDRVREI
Ga0209152_104825961 | 3300026325 | Soil | MNTPLYAALDLHSRYSVLGSMDHEGRSESRKRFATEAKILRAEVEALKKKRRPLYLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGMDRTREIYRELVYELLN
Ga0209801_11275392 | 3300026326 | Soil | MKKTPLYAALDLHSRYSVLGSMDHEGNSQPRMRFPTEANILRAEVEALRKRRRPVYLTMEAGAMTRWAGAIVRPLVERLIICEPRYNRLINSNPTKSDEADVDGM
Ga0209801_12880751 | 3300026326 | Soil | MKRPIYAALDLHSRRSVLGSMDHDGQTQPRMRFPTEANILRAQVERLRQRQRRPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLIHSNPQKNDEADVEGMC
Ga0209803_11957472 | 3300026332 | Soil | MKRPLYAALDLHSAYSVLGSMDHSGKTQPRMRFATEAERLRAQVSALKQKRRPVHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMCLLLRLGKLKEVWMGQ
Ga0209377_11930031 | 3300026334 | Soil | MKRPLYAALDLHSAYSVLGSMDHSGKTQPRMRFATEAERLRAQVSALKQKRRPVHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLIHSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRAREIYRALVYELLNWRDAQREL
Ga0209059_13152801 | 3300026527 | Soil | MKRKTPLYAALDLHSAYSVLGSMDHSGKTQPRMRFATEAERLRAQVSALKQKRRPVHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMC
Ga0209157_13035331 | 3300026537 | Soil | MKTPLYAALDLHSRYSVLGSMDYDGNSQPKERFPTSALLLRQYIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLIHSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALVYELLNWRDA
Ga0209156_104646041 | 3300026547 | Soil | MKTPLYAALDLHSRYSVLGSMDYDGNTQPNVRFATSALLLRQNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGVDRTR
Ga0209474_100352162 | 3300026550 | Soil | MKKTPLYAALDLHSRYSVLGSMDHEGNSQPRMRFPTEANILRAEVEALRKRRRPVYLTMEAGAMTRWAGAIVRPLVERLIICEPRYNRLINSNPTKSDEADVDGMCLLLRLGKLKEVWMGVDRTREIYRALVYELLNWRPSPES
Ga0179587_104579161 | 3300026557 | Vadose Zone Soil | MKRPLYAALDLHSAYSVLGSMEHSGKTQPRMRFATEAERLRAQVSALKQKRRPVHLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKCDEADVEGMCLILPLGKLKEVWMGQDRA
Ga0207552_1020481 | 3300026771 | Soil | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALVYELL
Ga0207498_1080151 | 3300026822 | Soil | MKKPIYAALDLHSRMSVLGSMDHDGQTQPRMRFPTEASILRGEVERLRRKRRPLYLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRALVYELLNWRD
Ga0207944_10239821 | 3300027105 | Forest Soil | MKTPLYAALDLPSRYSVLGSMDYDGNTQPKVRFATSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALVYEL
Ga0209590_108919441 | 3300027882 | Vadose Zone Soil | MKKKPLYAALDLHGEHSVLGSMDHDGNNQPRVRFATGAESLRAQLNALRAASKRPVHLTMEAGPLSRWASAIARPLVDQLIICEPRHNRLINANPTKSDGADVEGMCLLQRLGKLKEVWMGVDRTREIYRALVYELLNWRD
Ga0209380_106455561 | 3300027889 | Soil | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALVYEL
Ga0247827_113460451 | 3300028889 | Soil | MKKPTYAALDLHSRHSVLGSMDHDGRTQPRMRFATEANILRAQVERLRRRRRPLYLTMEAGPLSRWASAIVRPLVERLIICEPRHNRLINSNRYKSDEVDLEGMCLLLRLGKLKEVWMGQDRTREICRALVYELLN
Ga0170834_1072677631 | 3300031057 | Forest Soil | MKKKSPVYTALDLHSRHSVLGSMEHGGKTGARIRFATEAETLRAQVARLRQAKRPLFLTMEAGALTRWASGIVRLLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGQDRTREIYRALVYELLNWRDAQRELKGLIKA
Ga0170824_1057733481 | 3300031231 | Forest Soil | MKSPIYAALDLHSRMSVLGSMDHDGQTQPRMRFPTEASILRGEVERLRRKRRPLYLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLR
Ga0170824_1228781921 | 3300031231 | Forest Soil | MKKTPLYAALDLHSRYSVLGSMDHEGNSQPRTRFATEANILRAEVEALRKRRRPVYLTMEAGAMTRWACAIVRPLVERLIICEPRYNRLINSNPTKSDEADVDGMCLLLRLGK
Ga0170819_150289152 | 3300031469 | Forest Soil | FGFGVAKFSKASCYKSKLMKSPLYAALDLHSRYSVLGSMDHGGKTGARMRFATEAETLRAQVARLRQAKGPLFLTMEAGALTRWASAVVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDGTREI
Ga0170818_1114054891 | 3300031474 | Forest Soil | MKKKSPLYAALDLHSRMSVLGSMEHGGKTGARIRFATEAETLRAQVARLRKAKRPLFLTMEAGALTRWASAVVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRAL
Ga0170818_1147355881 | 3300031474 | Forest Soil | MKRPIYAALDLHSRMSVLGSMDHDGQTQPRMRFPTEASILRGEVERLRRKRRPLYLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLR
Ga0310813_103046192 | 3300031716 | Soil | MKKPIYAALDLHSRHSVLGSMDHDGQTQPRMRFPTEAKILRTEVERLRESRRPLYLTMEAGALTRWASAIVRPVVERLIICEPRHNRLINSNPQKSDEADVEGMCLLLRQRQAQRSVDGTRSHPRDLPRPGV
Ga0307474_116448431 | 3300031718 | Hardwood Forest Soil | MMKKKPLYAALDLHSEHSVLGSMDHDGNNQPVVRFDTRAERLREELNALRARAKLRPLFLTMEAGPLSRWASVIARPLVERLIICEPRHNRYINSNPNKCDDADVAALCLLLRLNKLKEVWMGQDRTREIYRALV
Ga0306917_105573181 | 3300031719 | Soil | MKKPIYAALDLHSRMSVLGSMDHDGRTQPRMRFATEASLLRVQVERLRRRRQPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLR
Ga0307473_108873201 | 3300031820 | Hardwood Forest Soil | MKKKSPVYAALDLHSRHSVLGSMEHGGKTGARIRFATEAETLRAAVTRLRQARRPLFLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRALVYELLNWRD
Ga0307478_110656301 | 3300031823 | Hardwood Forest Soil | MKSPLYAALDLHSRYSVLGSMDHDGRTQPRMRFPTQAEILRAQVRALKQKKRPLYLTMEAGALTRWASGIVWPLVERLIICEPRHNGLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGVDRTREIYRELVYELLNWRD
Ga0310917_100730272 | 3300031833 | Soil | MKSKSPVYAALDLHSRHSVLGSMEHGGKTGARMRFPTEANILRAQVERLRRRRRPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRA
Ga0318522_104272271 | 3300031894 | Soil | MKKPVYAALDLHSGHSVLGSMKHDGQTQPRMRFPTEASILRAQVERLGRKRRPLYLTMEAGALTRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRALEIL
Ga0310913_107359741 | 3300031945 | Soil | MKKPIYAALDLHSRMSVLGSMDHDGRTQPRMRFATEASLLRVQVERLRRRRQPLYLTMEAGALTRWASAIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWMGTDRTREIYRAL
Ga0310910_115718951 | 3300031946 | Soil | MKKPVYAALDLHSGHSVLGSMKHDGQTQPRMRFPTEASILRAQVERLGRKRRPLYLTMEAGALSRWASGIVRPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLNEVWMGTDRTREIYRALVYELL
Ga0308176_123662831 | 3300031996 | Soil | MKTPLYAALDLHSRYSVLGSMDYDGNTQPKVRFPTSALLLRKNIEALRQKKRPIYLTMEAGAFTRWASAIARPLVERLIICEPRHNRLINSNPTKSDEADVEGMCLLLRLGKLKEVWM




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"