NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F081791

Metagenome / Metatranscriptome Family F081791

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F081791
Family Type Metagenome / Metatranscriptome
Number of Sequences 114
Average Sequence Length 47 residues
Representative Sequence MAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTKREVVAVVDRP
Number of Associated Samples 91
Number of Associated Scaffolds 114

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 94.74 %
% of genes near scaffold ends (potentially truncated) 13.16 %
% of genes from short scaffolds (< 2000 bps) 12.28 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction Yes
3D model pTM-score0.32

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (83.333 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(29.825 % of family members)
Environment Ontology (ENVO) Unclassified
(38.596 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(48.246 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 16.00%    β-sheet: 28.00%    Coil/Unstructured: 56.00%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.32
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 114 Family Scaffolds
PF00903Glyoxalase 13.16
PF04008Adenosine_kin 5.26
PF12838Fer4_7 4.39
PF00069Pkinase 3.51
PF00067p450 2.63
PF00072Response_reg 1.75
PF07238PilZ 1.75
PF03743TrbI 0.88
PF00582Usp 0.88
PF13519VWA_2 0.88
PF13376OmdA 0.88
PF11154DUF2934 0.88
PF08308PEGA 0.88
PF00872Transposase_mut 0.88
PF00656Peptidase_C14 0.88
PF00355Rieske 0.88
PF01061ABC2_membrane 0.88
PF07676PD40 0.88
PF14534DUF4440 0.88
PF12681Glyoxalase_2 0.88
PF13520AA_permease_2 0.88
PF13502AsmA_2 0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 114 Family Scaffolds
COG0515Serine/threonine protein kinaseSignal transduction mechanisms [T] 14.04
COG1839Adenosine/AMP kinaseNucleotide transport and metabolism [F] 5.26
COG2124Cytochrome P450Defense mechanisms [V] 2.63
COG2948Type IV secretory pathway, VirB10 componentIntracellular trafficking, secretion, and vesicular transport [U] 0.88
COG3328Transposase (or an inactivated derivative)Mobilome: prophages, transposons [X] 0.88
COG4249Uncharacterized conserved protein, contains caspase domainGeneral function prediction only [R] 0.88


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A83.33 %
All OrganismsrootAll Organisms16.67 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005436|Ga0070713_100568490All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1075Open in IMG/M
3300005467|Ga0070706_100077016All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3086Open in IMG/M
3300005617|Ga0068859_102789350All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium536Open in IMG/M
3300006052|Ga0075029_100347832All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium955Open in IMG/M
3300012929|Ga0137404_11203935All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium697Open in IMG/M
3300019881|Ga0193707_1005724All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4337Open in IMG/M
3300020583|Ga0210401_10238297All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1678Open in IMG/M
3300026089|Ga0207648_10030212All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4801Open in IMG/M
3300026325|Ga0209152_10374355All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium551Open in IMG/M
3300026327|Ga0209266_1011954All Organisms → cellular organisms → Bacteria → Acidobacteria5013Open in IMG/M
3300026327|Ga0209266_1150695All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium941Open in IMG/M
3300026548|Ga0209161_10227782All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium999Open in IMG/M
3300027846|Ga0209180_10232692All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1061Open in IMG/M
3300027862|Ga0209701_10446670All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium712Open in IMG/M
3300027875|Ga0209283_10719864All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium622Open in IMG/M
3300027911|Ga0209698_11359231All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium518Open in IMG/M
3300027915|Ga0209069_10685292All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium600Open in IMG/M
3300031716|Ga0310813_10016190All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4978Open in IMG/M
3300032180|Ga0307471_103540076All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium553Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil29.82%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil11.40%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere10.53%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil8.77%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.26%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.39%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil4.39%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds3.51%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil3.51%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.63%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.75%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.75%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring0.88%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.88%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost0.88%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.88%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Soil0.88%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.88%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.88%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.88%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.88%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.88%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.88%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.88%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.88%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.88%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005617Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2Host-AssociatedOpen in IMG/M
3300005994Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-049EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006052Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013EnvironmentalOpen in IMG/M
3300006057Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009551Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaGHost-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012988Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MGEnvironmentalOpen in IMG/M
3300014056Permafrost microbial communities from Nunavut, Canada - A20_5cm_0MEnvironmentalOpen in IMG/M
3300014969Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaGHost-AssociatedOpen in IMG/M
3300015051Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300016371Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300019888Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c2EnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300025900Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025905Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025981Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026067Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026469Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-BEnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300027535Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027565Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027667Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027911Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300027915Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031122Oak Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25386J43895_1018578623300002912Grasslands SoilMAYYKVRIEVWCDWNPGESDLEEIVQNTSTGEAICTKQEVIAVVDRPQDIDDDE
Ga0066673_1000719463300005175SoilMAYYKVRIEVWCDWNPAESDVEEIGETINAGEAFCTMREVVAIVDRPQELTTKKP*
Ga0066673_1008974613300005175SoilMAYYKVRIEVWCDWNPGESDLEEIVQNTSTGEAICTKQEV
Ga0066671_1021157423300005184SoilGGPMAYYKVRIEVWCDWNPAESDVEEIGETINAGEAFCTMREVVAIVDRPQELTTKKP*
Ga0070713_10056849013300005436Corn, Switchgrass And Miscanthus RhizosphereMAYYKVRIEVWCDWNPAESDLDEIAQGMGVGEALCTKREIVAIADRTQDII*
Ga0070708_10000541743300005445Corn, Switchgrass And Miscanthus RhizosphereMAYYKVRIEVWCDWNPAEGDLEEIAQGMGVGEAICTMREVVAVVNRPRY*
Ga0070706_10007701663300005467Corn, Switchgrass And Miscanthus RhizosphereVRLAYYKVRIEVWCDWNPAESDLEDIAQSMGVGETICTMREVVAVVNRPQDIEDEDEQQLATGRRL*
Ga0070697_10120393413300005536Corn, Switchgrass And Miscanthus RhizosphereMAYYKVRIEVWCDWNPADSDLEDIAQSMGVGEAICTQREVIAVVDR
Ga0066670_1082430113300005560SoilMAYYKVRIEVWCDWNPAESDVEEIGETINAGEASCTMREVVAIVDRPQELTTKKP*
Ga0066708_1070442813300005576SoilMAYYKVRIEVWCDWNPAESDLDDIAQAIGVGEAPCTKREVVAVVDRPKDIED
Ga0066708_1083092823300005576SoilMAYYKVRIEVWCDWNPGESDLEEIVQNTSTGEAICTKQEVIAV
Ga0068859_10278935023300005617Switchgrass RhizosphereMAYYKVRIEVWCDWNPAESDLDDIAQSLGVGEAVCTKREVVAVVDRPGDIDDEEAM
Ga0066789_1005187313300005994SoilMAYYKVRIEVWCDWNPVESDLEEIAQGMGVGEAICTMREVVAVVNRPQEIED
Ga0070717_1011595513300006028Corn, Switchgrass And Miscanthus RhizosphereLAYYKVRIEVWCDWNPAESDLEDIAQSMGVGETICTMREVVAVVNRPQDIEDEDEQQLATGRRL*
Ga0075029_10034783223300006052WatershedsMAYYKVRIEVWCDWNPAESDLDDIVQSMGVGEAICTMRGGCSPARY*
Ga0075026_10079899123300006057WatershedsMAYYKVRIEVWCDWNPAESDLDDIVQSMGVAEAICTMREIVAVV
Ga0070715_1092296513300006163Corn, Switchgrass And Miscanthus RhizosphereMAYYKVRIEVWCDWNPADSDLEDIAQSIGVGEAICTQREVIAVVDR
Ga0066660_1068663413300006800SoilMAYYKVRIEVWCDWNPAESDLEEIVQNTSTGEAICTKQEVVAVVDCPQDIDDDEAM
Ga0073928_1027485113300006893Iron-Sulfur Acid SpringMAYFKVRIEIWCDWKPAESDLEEIAQSMGVGEAICTRRVVAVADRLQEDEEA
Ga0066710_10057116533300009012Grasslands SoilMAYYKVRIEVRCDWNPAESDVEEIGETINAGEAFCTMREVVAIVDRPQELTTKKP
Ga0066710_10274502913300009012Grasslands SoilMAYYKVRIEVWCDWNPSESDLEEIAQNTSTGEAICTKQEVVAVVDRPQDIDD
Ga0099829_1050769413300009038Vadose Zone SoilMNSPEGGAMAYYKVRIEVWCAWNPAESDLDDIVQSMGAGEAICTMREVIAVVNRPHDIED
Ga0099829_1051245533300009038Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLEEIVQNTSTGEAICTKQEVVAVVDRPQDID
Ga0099830_1109160013300009088Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICT
Ga0099828_1094881123300009089Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLDDIAQSMGVGEAICT
Ga0066709_10148051213300009137Grasslands SoilMAYYKVRIEVWCDWNPSESDLEEIAQNTSTGEAICTKQEVVAVVDRPQDIDDDEA
Ga0105238_1035605243300009551Corn RhizosphereMTYFKVRLEIWCDGNPAESDLEETARHILAGEAICTSREVV
Ga0126374_1080274613300009792Tropical Forest SoilMMAYYKVRIEVWCDWDPKESNLEEISENVRAGEAICINVV
Ga0134125_1189230123300010371Terrestrial SoilMAYYKVRIEVWCDWNPADSDLEDIAQSMGVGEAICTQREVIAVVDRPQNI
Ga0126383_1092861823300010398Tropical Forest SoilMAYYKVRIEVWCDWDPAASELEEIVENLRTGDAICTKRQVVD
Ga0137389_1027049813300012096Vadose Zone SoilAMAYYKVRIDGWCDWNPAESDLDETAQSMGVGEAICTRREVVAVVNRPQDIEDEEA*
Ga0137389_1042032033300012096Vadose Zone SoilMAYYKVRIEVWCDWNPSESDLEDIAQSMGVGEAMGTMREIVGVGNRPPYIEDEE
Ga0137389_1116778823300012096Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTMREVVAGVNRPQD
Ga0137389_1162896113300012096Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLEEIAQGMGVGEAICT
Ga0137388_1068936723300012189Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLEDIAQAMGVGEAICTKRDVIAVVDRAQDI
Ga0137388_1076476323300012189Vadose Zone SoilMAYYKVRIEVWGDWNPAESDLEDIAQSMGVGEAICTMREVVAVVNRP*
Ga0137388_1085850323300012189Vadose Zone SoilMNSPEGGAMAYYKVRIEVWCDWNPAESDLDDIVQSMGAGEAICTMR
Ga0137388_1191028713300012189Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLDEIAQSMGVGEAICTRREVVAVVNRPQDIEDEEA*
Ga0137383_1005368923300012199Vadose Zone SoilMAYYKVRIEVWCDWDPVASELEEIVENIRTGDAILYQAPSG*
Ga0137363_1007196033300012202Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTMREVVAVVNRP*
Ga0137362_1164634813300012205Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLDDIVQSMGVGEAICTMREVVAVVNR
Ga0137378_1045132223300012210Vadose Zone SoilMGYYKVRIEVWCDWNPAESDLEDITQSMGVGEALCTKREVVSSRSTDT*
Ga0137378_1048184633300012210Vadose Zone SoilMAYYKVRIEVWCDWDPAESDLEEIAENISAGEAICTKREILNVVNRPQD
Ga0137377_1049937333300012211Vadose Zone SoilMAYHKVRIEVWCDWNPAESDLEEIAENISAGDAICTRREVVAVVDRPQDI
Ga0137377_1080099813300012211Vadose Zone SoilKVRIEVWCDWNPAESDLEEIVQNTSTGEAICTKQEVVAVVDRPQELVTDRKNS*
Ga0150985_11223074313300012212Avena Fatua RhizosphereMAYYKVRIEVWCDWNPAESDLEEIAESMSAGGSICTSTS*
Ga0137386_1067151913300012351Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLEEIIQNTSTGEAICTKHEVVAIVDRPQDIDDDEA
Ga0137384_1143017413300012357Vadose Zone SoilMAYYKLRIEVWCDWNPGESDLEEIAQNISAQEAICTKREV
Ga0137385_1165468723300012359Vadose Zone SoilMAYHKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTMREIVAVVDRPQDIEDEKP*
Ga0137360_1114307313300012361Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLEDSAQSMGVGETICTMREVVAVVNRPQDIEDEE
Ga0137361_1022474113300012362Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLDDIAQSMGVGEAICTM
Ga0137397_1065926213300012685Vadose Zone SoilMAYYKVRIEVWCDWNPADSDLDDIAQAMGVGEAICTKREVVAVVGRPQ
Ga0137416_1050379143300012927Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLDDIAESMSVGEAICTKREVVSVV
Ga0137404_1120393523300012929Vadose Zone SoilMAYYKVRIEVWCDWNLAESDLEEIVQNTSTGEAICTQQEVVAVVDCP
Ga0137404_1155448923300012929Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTKREVVAVVDRP
Ga0164306_1079586623300012988SoilMAYYKVRIEVWCDWNPAESDLDDIAQSMGVGEAICTRREVV
Ga0120125_101734413300014056PermafrostMAYYKVRIEVWCDWNPAESDLEEIAESMSAGEAIC
Ga0157376_1232136613300014969Miscanthus RhizosphereMLGGGMAYYKVQIEVWCDWNPAESDLDDIAQAMGVGEAICTMREVVAVVD
Ga0137414_123168663300015051Vadose Zone SoilMPYYKVRIEVWCDWNPAESDLDDITQAMGVGEAICTMREVVAVVNRPQDIEDEEP*
Ga0137412_1071730423300015242Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLDDIAQAMGVGEAICTMRDIVAVVDR
Ga0132258_1079232413300015371Arabidopsis RhizosphereMAYYKVRIEVWCDWDPAESELDDIAQAMGVGEAICTMRDVITVVDRPQD
Ga0182034_10001790113300016371SoilMAYYKVRIEVWCDWDPKESNLEEISENVRAGEAICTNVEVVAEVDR
Ga0066669_1011264023300018482Grasslands SoilMAYYKVRIEVWCDWNPAESDVEEIGETINAGEAFCTMREVVAIVDRPQELTTKKP
Ga0193707_100572443300019881SoilMAYYKVRIEVWCDWNPAKSDLEDIAQSMGVGEAICTMREVVTVVNRRKILKTKRP
Ga0193751_122620513300019888SoilMAYYKIQIEVWCDWNPAESDLEEIAESVSVGEAICTRREVVNVVNRPQDTA
Ga0210401_1023829713300020583SoilMAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTMREV
Ga0210401_1137324513300020583SoilMAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTMREVITVVNRPQDI
Ga0210405_1011724913300021171SoilMAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTMREVVAEIG
Ga0210405_1057068923300021171SoilMAYYKVRIEVWCDWNPAESDLDDIAQSMGVGEAICTKR
Ga0210394_1185809713300021420SoilMPYYKVRIEVWCDWNPAESDLDEISQSMGVGEAICTMREVVA
Ga0210410_1072912223300021479SoilMAYYKVRIEVWCDWNPAESDLEDIAQSMGVAEAICTTREVVAVVNRPQGIEDE
Ga0210410_1157687713300021479SoilMAYYKVRIEVWCDWNPAESDLDDIAQAMGVGEAICTKREVV
Ga0210409_1142889323300021559SoilMAYYKVRIEVWCDWNPAESDLDDIAQAMGVGEAICTKREIVAIA
Ga0207710_1026437313300025900Switchgrass RhizosphereMAYYKVRIEVWCDWNPADSDLEDIAQSIGVGEAICTQREVIAVVDRPQDIEDEE
Ga0207685_1064659313300025905Corn, Switchgrass And Miscanthus RhizosphereMAYYKVRIEVWCDWNPADSDLEDIAQSIGVGEAICTQREVIAVVDRPQDIEDEEAMS
Ga0207684_1010565543300025910Corn, Switchgrass And Miscanthus RhizosphereLAYYKVRIEVWCDWNPAESDLEDIAQSMGVGETICTMREVVAVVNRPQDIEDEDEQQLATGRRL
Ga0207684_1025756323300025910Corn, Switchgrass And Miscanthus RhizosphereMAYYKVRIEVWCDWDPVASELEEIVENIRTGDAICTKRQVV
Ga0207646_1050013433300025922Corn, Switchgrass And Miscanthus RhizosphereMAYYKVRIEVWCDWDPVVSELEEIVENVRTGDAICTMRQV
Ga0207646_1063824033300025922Corn, Switchgrass And Miscanthus RhizosphereMAFYKVRIEVWCDWNPAESDLEEIIQNTSTGEAICTKQEVIAVVDRPQDIDDDEA
Ga0207700_1016345333300025928Corn, Switchgrass And Miscanthus RhizosphereMAYYKVRIEVWCDWNPAESDLDDIAQAMGVGEALCTMREVVAVVNRPQDIEDE
Ga0207640_1003778113300025981Corn RhizosphereMAYYKVRIEVWCDWNPAESDLDDIAQSLGVGEAVCTKRE
Ga0207678_1032481813300026067Corn RhizosphereMTYFKVRLEIWCDGNPAESDLEETARHILAGEAICTSREVVAVVDRPQDIE
Ga0207648_1003021213300026089Miscanthus RhizosphereMTYFKVRLEIWCDGNPAESDLEETARHILAGEAICTSREVVAVVDRPQDIENDAAM
Ga0209761_135019923300026313Grasslands SoilMAYYKVRIEVWCDWDPAESDPKEIVGHVALGEGAICT
Ga0209152_1037435523300026325SoilMAYYKVRIEVWCDWNPAESDLEEIVQNTSTGEAICTKQEV
Ga0209801_123653123300026326SoilMAYYKVRIEVWCDWNPAESDLEEIVQNTSTGEAICTKQEVVA
Ga0209266_1011954103300026327SoilMAYYKVRIEIWCDWNPAESDLEEIVHNTSTGEAICTKQEVV
Ga0209266_107035053300026327SoilMAYYKVRIEVWCDWNPGESDLEEIVQNTSTGEAICTKQEVIAVVDRP
Ga0209266_115069513300026327SoilMAYYKVRIEVWCDWNPAESDLEEIVQNTSTGEAICTKQEVVAVVDCPQDI
Ga0257169_107751713300026469SoilMAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTMREVVAVVNRPQDIE
Ga0209806_117919823300026529SoilMAYYKVRIEVWCDWNPAESDLEEIVQNTSTGEAICTKQEVVAVVDRPQDIDDD
Ga0209161_1022778223300026548SoilMAYYKIRIEVWCDWNPAESDLEEIVQNTSTGEAICTKQEVVAVVDCPQDI
Ga0209474_1063332013300026550SoilMAYYKVRIEVWCDWNPAESDVEEIGETINAGEAFCTM
Ga0209734_107622323300027535Forest SoilMAYYKVRIEVWCNWNPAESDLEEIAQSMGVGEAICTMREVVAVVNRPQ
Ga0209219_108939613300027565Forest SoilMPYYKVRIEVWCDWNPAESDLDEIAQSMGVGEAICT
Ga0209217_110436023300027651Forest SoilMAYRKVLLEVWCDWNPAESGLDEIAESMRVGEAMEAKQCR
Ga0209009_102657843300027667Forest SoilMAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTMREVIAVVDRPQD
Ga0209180_1023269233300027846Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLDDIVQSMGVGEAICTMREVVA
Ga0209701_1044667033300027862Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLDDIVQSMGVGEAICTM
Ga0209283_1052039913300027875Vadose Zone SoilMAYYKVRIEVWCDWNPAESDLDDIAQSMGVGEAICTK
Ga0209283_1071986413300027875Vadose Zone SoilMAYYKVRIEVWCDWDPAESDPKEIVGHVALGEGAI
Ga0209698_1135923123300027911WatershedsMAYYKVRIEVWCDWNPAESDLEDIAQSMGVGEAICTKREVVSVVNRPQDI
Ga0209069_1068529233300027915WatershedsMAYYKVRIEVWCDWNPAESDLDDIAQSMGVGEAICTMRN
Ga0209526_1015944233300028047Forest SoilMAYYKVRFEVWCDWNPAESDLDDIAQSMGAGEAICTKREVVAVVDRPQDIED
Ga0222749_1021740213300029636SoilMAYYKVRIEVWCDWNPAESDLDDIAQSTGVGEAICTLRELVGVVNRPQDIEDE
Ga0170822_1312072913300031122Forest SoilMAYYKVRIEVWCDWDPAESDLEEIASNISAGEAVCTKREI
Ga0310813_10016190113300031716SoilMAYYKVRIEVWCDWNPAESDLDDIAQSLGVGEAVC
Ga0307469_1192966913300031720Hardwood Forest SoilMAYYKVQIEIWCDWNPAESDLEDIAQSMGVGEAICTMREVVAVIDRPQDIE
Ga0307469_1204052113300031720Hardwood Forest SoilMAYYKVRIEVWCAWDPVASELEEIVENIRTGDAIC
Ga0307473_1079495923300031820Hardwood Forest SoilMAYYKVRIEVWCDWNPADSDLEDLAQSMGVGEAICTKREVIAVADRPQDIEDEEAM
Ga0307473_1113984923300031820Hardwood Forest SoilLLWPYYKVRIEVWCDWNPAESDLEEIAENIGARDAICTMQEGIAVV
Ga0306926_1128542823300031954SoilMAYYRVRIEVWCDWDPAASDLNEIVEHINSGDAICTKRSD
Ga0307471_10354007623300032180Hardwood Forest SoilMAYYKVRIEVWCDWDPAESDLEDITQSMAVGEAICTKREVVAVVDRPQDIEDEEA
Ga0306920_10064690113300032261SoilMAYYKVRIEVWCDWDPKESNLEEISENVRAGEAICTN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.