NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F068008



Overview

Basic Information
Family ID F068008
Family Type Metagenome / Metatranscriptome
Number of Sequences 125
Average Sequence Length 40 residues
Representative Sequence MSEKPSTTLPATVEKIIKPLSPRDPEKAQIAVEGADHLY
Number of Associated Samples 113
Number of Associated Scaffolds 125

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 6.40 %
% of genes from short scaffolds (< 2000 bps) 3.20 %
Associated GOLD sequencing projects 108
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.39


Most Common Taxonomy
Group Unclassified (96.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(15.200 % of family members)
Environment Ontology (ENVO) Unclassified
(28.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(47.200 % of family members)
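
The percentages in this overview appear to be fractions of the 125 family members (or family scaffolds), so they can be turned back into whole counts. The following is a minimal Python sketch under that assumption; the helper name `percent_to_count` is hypothetical and not part of NMPFamsDB.

```python
def percent_to_count(percent: float, total: int = 125) -> int:
    """Convert a reported percentage back to a whole number of members.

    Assumes the percentage was computed as count / total * 100.
    """
    return round(percent * total / 100)


# Values copied from the Overview section above.
reported = {
    "Unclassified taxonomy": 96.0,
    "Genes near scaffold ends": 6.4,
    "Genes from short scaffolds": 3.2,
    "Vadose Zone Soil (GOLD)": 15.2,
    "Soil, non-saline (EMPO)": 47.2,
}

counts = {label: percent_to_count(p) for label, p in reported.items()}
for label, n in counts.items():
    print(f"{label}: {n} of 125")
```

Rounding to the nearest integer absorbs small floating-point errors in the multiplication.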





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 19.40%, Coil/Unstructured: 80.60%

Predicted 3D Structure

pTM-score: 0.39

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 125 Family Scaffolds
PF12732 | YtxH | 4.80
PF13502 | AsmA_2 | 3.20
PF00072 | Response_reg | 2.40
PF04675 | DNA_ligase_A_N | 2.40
PF13545 | HTH_Crp_2 | 2.40
PF01554 | MatE | 2.40
PF02881 | SRP54_N | 2.40
PF01594 | AI-2E_transport | 1.60
PF02562 | PhoH | 0.80
PF04851 | ResIII | 0.80
PF00534 | Glycos_transf_1 | 0.80
PF04075 | F420H2_quin_red | 0.80
PF01042 | Ribonuc_L-PSP | 0.80
PF02348 | CTP_transf_3 | 0.80
PF04542 | Sigma70_r2 | 0.80
PF13620 | CarboxypepD_reg | 0.80
PF09844 | DUF2071 | 0.80
PF11154 | DUF2934 | 0.80
PF01740 | STAS | 0.80
PF12706 | Lactamase_B_2 | 0.80
PF00248 | Aldo_ket_red | 0.80
PF07136 | DUF1385 | 0.80
PF07043 | DUF1328 | 0.80
PF01695 | IstB_IS21 | 0.80
PF01261 | AP_endonuc_2 | 0.80
PF07676 | PD40 | 0.80
PF13424 | TPR_12 | 0.80
PF04679 | DNA_ligase_A_C | 0.80
PF16694 | Cytochrome_P460 | 0.80
PF13442 | Cytochrome_CBB3 | 0.80
PF12867 | DinB_2 | 0.80
PF07366 | SnoaL | 0.80
PF01464 | SLT | 0.80
PF00300 | His_Phos_1 | 0.80

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 125 Family Scaffolds
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 3.20
COG0628 | Predicted PurR-regulated permease PerM | General function prediction only [R] | 1.60
COG0251 | Enamine deaminase RidA/Endoribonuclease Rid7C, YjgF/YER057c/UK114 family | Defense mechanisms [V] | 0.80
COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 0.80
COG1083 | CMP-N-acetylneuraminic acid synthetase, NeuA/PseF family | Cell wall/membrane/envelope biogenesis [M] | 0.80
COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 0.80
COG1212 | CMP-2-keto-3-deoxyoctulosonic acid synthetase | Cell wall/membrane/envelope biogenesis [M] | 0.80
COG1484 | DNA replication protein DnaC | Replication, recombination and repair [L] | 0.80
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 0.80
COG1702 | Phosphate starvation-inducible protein PhoH, predicted ATPase | Signal transduction mechanisms [T] | 0.80
COG1861 | Spore coat polysaccharide biosynthesis protein SpsF, cytidylyltransferase family | Cell wall/membrane/envelope biogenesis [M] | 0.80
COG1875 | Predicted ribonuclease YlaK, contains NYN-type RNase and PhoH-family ATPase domains | General function prediction only [R] | 0.80
COG3872 | Uncharacterized conserved protein YqhQ, DUF1385 family | Function unknown [S] | 0.80
COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 0.80
COG5487 | Uncharacterized membrane protein YtjA, UPF0391 family | Function unknown [S] | 0.80



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 96.00 %
All Organisms | root | All Organisms | 4.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300005602|Ga0070762_10005076 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 6364
3300018086|Ga0187769_10281881 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1240
3300020199|Ga0179592_10309111 | Not Available | 701
3300021307|Ga0179585_1180239 | Not Available | 518
3300021401|Ga0210393_10116978 | All Organisms → cellular organisms → Bacteria | 2128
3300021406|Ga0210386_10051524 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3271
3300027117|Ga0209732_1001573 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3558
3300027376|Ga0209004_1033654 | Not Available | 844
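
Each scaffold label above joins a numeric prefix and a scaffold name with `|`. The numeric prefixes match Taxon OIDs in the Associated Samples table, so a label can be split to look up its sample. A minimal Python sketch, assuming that reading of the label; `ScaffoldRef` and `parse_scaffold` are hypothetical names used only for illustration.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # assumed to be the IMG/M Taxon OID of the sample
    scaffold_id: str  # scaffold name within that sample


def parse_scaffold(label: str) -> ScaffoldRef:
    """Split a label such as '3300005602|Ga0070762_10005076'."""
    taxon_oid, sep, scaffold_id = label.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold label: {label!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


# Labels copied from the Associated Scaffolds table above.
labels = [
    "3300005602|Ga0070762_10005076",
    "3300018086|Ga0187769_10281881",
    "3300021401|Ga0210393_10116978",
]
refs = [parse_scaffold(label) for label in labels]
for ref in refs:
    print(ref.taxon_oid, ref.scaffold_id)
```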




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 15.20%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 12.00%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 6.40%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 6.40%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 5.60%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 5.60%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.80%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.80%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 3.20%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.20%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 2.40%
Bog | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Bog | 2.40%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 1.60%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 1.60%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.60%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.60%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.60%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.60%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 1.60%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 1.60%
Roots | Host-Associated → Plants → Roots → Unclassified → Unclassified → Roots | 1.60%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 0.80%
Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 0.80%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.80%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.80%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.80%
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 0.80%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 0.80%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.80%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.80%
Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Soil | 0.80%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.80%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.80%
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 0.80%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.80%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 0.80%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.80%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 0.80%
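
Because every habitat is given as a full GOLD lineage, the distribution can be rolled up to a coarser level by summing percentages over a shared lineage prefix. A minimal Python sketch, using only four rows from the table above for brevity; the function name `summarize` and the choice of `depth` are assumptions made for illustration.

```python
from collections import defaultdict

SEP = " → "

# (GOLD lineage, % of family members), copied from four rows of the table above.
rows = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 15.20),
    ("Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa", 5.60),
    ("Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland", 2.40),
    ("Host-Associated → Plants → Roots → Unclassified → Unclassified → Roots", 1.60),
]


def summarize(rows, depth):
    """Sum percentages over the first `depth` levels of each lineage."""
    totals = defaultdict(float)
    for lineage, percent in rows:
        key = SEP.join(lineage.split(SEP)[:depth])
        totals[key] += percent
    return dict(totals)


by_top_two = summarize(rows, depth=2)
for key, percent in by_top_two.items():
    print(f"{key}: {percent:.2f}%")
```

Running the same function over all rows of the table would give the family-wide totals per ecosystem category.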




Associated Samples

Taxon OID | Sample Name | Habitat Type
2199352024 | Bare-fallow DEEP SOIL | Environmental
3300000789 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300004091 | Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004152 | Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006052 | Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013 | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006237 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2) | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009552 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_20_150 | Environmental
3300009624 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_6_10 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010860 | Boreal forest soil eukaryotic communities from Alaska, USA - C5-2 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300014199 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin17_30_metaG | Environmental
3300015195 | Arctic soil microbial communities from a glacier forefield, Storglaciären, Tarfala, Sweden (Sample st-6c, vegetation/snow interface) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental
3300018021 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_19_150 | Environmental
3300018086 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_10_MG | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300020018 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s2 | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021181 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O | Environmental
3300021307 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_06_16RNAfungal (Metagenome Metatranscriptome) | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300022557 | Paint Pots_combined assembly | Environmental
3300022733 | Soil microbial communities from Bohemian Forest, Czech Republic - CSU3 | Environmental
3300023030 | Soil microbial communities from Bohemian Forest, Czech Republic - CSU2 | Environmental
3300024123 | Spruce roots microbial communities from Bohemian Forest, Czech Republic - CRU5 | Host-Associated
3300024227 | Spruce rhizosphere microbial communities from Bohemian Forest, Czech Republic - CZU4 | Host-Associated
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300025320 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG (SPAdes) | Environmental
3300025414 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_6_10 (SPAdes) | Environmental
3300025921 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes) | Environmental
3300025929 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes) | Environmental
3300026078 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 (SPAdes) | Host-Associated
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026359 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-A | Environmental
3300027117 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_O1 (SPAdes) | Environmental
3300027376 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M1 (SPAdes) | Environmental
3300027575 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O1 (SPAdes) | Environmental
3300027635 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes) | Environmental
3300027674 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) | Environmental
3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental
3300027869 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes) | Environmental
3300027884 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes) | Environmental
3300027908 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes) | Environmental
3300027915 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028772 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Fen_E3_1 | Environmental
3300028780 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Palsa_E3_2 | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300029999 | I_Palsa_E3 coassembly | Environmental
3300030041 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Bog_N2_1 | Environmental
3300030043 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Palsa_E3_1 | Environmental
3300030045 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Fen_E1_3 | Environmental
3300030339 | III_Bog_N1 coassembly | Environmental
3300030399 | II_Palsa_E2 coassembly | Environmental
3300030688 | II_Bog_N2 coassembly | Environmental
3300030706 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_6_BS metaG (v2) | Environmental
3300030813 | Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSE1 (Metagenome Metatranscriptome) | Environmental
3300030831 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_141 (Metagenome Metatranscriptome) | Environmental
3300030991 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome) | Environmental
3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031236 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1 | Environmental
3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031525 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_3 | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031837 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Palsa_N3_1 | Environmental
3300031866 | Metatranscriptome of soil microbial communities from Bohemian Forest, Czech Republic - CSU5 (Metagenome Metatranscriptome) (v2) | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300033547 | Spruce roots microbial communities from Maridalen valley, Oslo, Norway - NRE1 | Host-Associated
3300033818 | Peat soil microbial communities from Stordalen Mire, Sweden - 713 S-3-M | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
deeps_01187300 | 2199352024 | Soil | MSDVEKLSTTLPGMVEKIIKPAYSGQTEKAQISVEGADHLYR
JGI1027J11758_125355021 | 3300000789 | Soil | MAEKPSVNLKGTVEKIIKPVVPTQPEKAQIAVDGA
JGIcombinedJ26739_1017565711 | 3300002245 | Forest Soil | VSETNESEKPKVTLPGTVEKIIPANSIAPEKAQIAVEGA
Ga0062387_1017568681 | 3300004091 | Bog Forest Soil | MSEKPSTTLPATVEKIIKPIAPGEPEKAQISIEGADHLYKEIR
Ga0062386_1002551102 | 3300004152 | Bog Forest Soil | VPDEEHKASTTLPGKVERVIKPHPQSGEPEKAQIAVEGADHLYREI
Ga0070707_1011640021 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MTENSSSSLRGTVEKIITSRVSTEPEKAQISIEGADHLYKELRI
Ga0070697_1013991792 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MPEKANVTLPGTVEKIINPPDPSEPEKAQINIQQGADPLYK
Ga0070762_100050767 | 3300005602 | Soil | MSEKASATLPAIVEKIIKPAYPSEPEKAQIAVEGADHLYRE
Ga0070762_100254913 | 3300005602 | Soil | MSEKASRTLPATVEKIIKPPVPDGTEKAQISVEGADHLYR
Ga0070762_107803383 | 3300005602 | Soil | MTDKPSVTLPGTVEKVIHSADPRIPEKAQIAVQGADDL
Ga0070717_100123333 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MTEKPSVTLPKTVEKIIKPSEPSEPEKAQIAIEGADDLYRELRCA*
Ga0070717_108496091 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MKQNSIVPDEKPSTTLPGIVEKVIKPRDPRDPEKAQIVVEGADHLY
Ga0070717_117760941 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MPDDEKPSTTLPGTVEKIIKPWVSGEPEKAQISVEGADHLY
Ga0075029_1010592301 | 3300006052 | Watersheds | MPEKPNIKLPATVEKIIKSPDPRMPEKAQISIERGADPLY
Ga0070712_10000176311 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MDEKPSTILPGTVEKIIKPPLPRMPEKAQITVEGGDHL*
Ga0070765_1011579231 | 3300006176 | Soil | MSEISSDQNEKPSTTLAGTVEKIIKPAHPSLPEKAQIAVEGGEDLYR
Ga0097621_1020094952 | 3300006237 | Miscanthus Rhizosphere | MDEKPSTILPGTVEKIIKPPLPRMPEKAQITVEGG
Ga0075436_1005705981 | 3300006914 | Populus Rhizosphere | MTGNPRTTLAGKVEKIIESRAPTEPEKAQIVIEGADQL
Ga0066710_1014669031 | 3300009012 | Grasslands Soil | MSENPSATLPGTVEKIIKSPHPGVPEKAQISVEAAD
Ga0099829_106654792 | 3300009038 | Vadose Zone Soil | MSEKTSTTLPGTVEKIIKPLSPDDTEKAQIAVEGA
Ga0099830_113261181 | 3300009088 | Vadose Zone Soil | MMEKPSVTLPGTVEKIIKPVHPSEPEKAQIAVEGA
Ga0099792_105191941 | 3300009143 | Vadose Zone Soil | MADRPSVTLPGTVEKVIESPHRGAPEKAEIAVEGADDLY
Ga0116138_11718622 | 3300009552 | Peatland | MTENPSVRLPGTVEKIIKPSGPSEAEKAQIAIEGADELY
Ga0116105_10070971 | 3300009624 | Peatland | MSANANTTLPATVEKIIKSPHPAIPEKAQIAIEGADH
Ga0126384_110262231 | 3300010046 | Tropical Forest Soil | MDEKASVVLPGTVEKIIKPVHPKEPERAEISIEGADHLYKEIRIE
Ga0126373_125544581 | 3300010048 | Tropical Forest Soil | MAEKPSTTRPGIVEKIIRPIVPDEPEKAQIAVEGADHLYR
Ga0099796_102852803 | 3300010159 | Vadose Zone Soil | MTEKIAEKPSVTLSGTVEKIIEPVHPSMPEKAQIAVEGADDLYQ
Ga0126370_120750562 | 3300010358 | Tropical Forest Soil | MPEKTENPSVTLPGKVEKIIKPLDRTDTEKAQINIEEGADPLYKEIRIENML
Ga0126372_129309502 | 3300010360 | Tropical Forest Soil | MPEKPSVTLPGTVDKIIHPPDPREPEKAQINIEDGADPLYKE
Ga0134125_102354734 | 3300010371 | Terrestrial Soil | MNEKPSAILPGTVEKIIKSPIPNESEKAQIAVEGADHLY
Ga0126351_12707881 | 3300010860 | Boreal Forest Soil | MSENPSTTLPGTVERIIKPLSADEPEKAQIAIEGADHLYREI
Ga0150983_160198731 | 3300011120 | Forest Soil | MPDEQNSDKPSTTLPGIVEKVIKSPDPTEPEKAQIAVERADPL
Ga0137393_101775411 | 3300011271 | Vadose Zone Soil | MKENSIVPDEKPSTTLPGIVEKVIKPRDPRDPEKAQIAVEGADHL
Ga0137393_104831671 | 3300011271 | Vadose Zone Soil | MSEKPSTTLPGTVEKIIKPLSPDDTEKAQIAVEGADHL
Ga0137389_118462912 | 3300012096 | Vadose Zone Soil | MTEKPSVTLPGTVEKIIKPIHPSEREKAQISVDGADE
Ga0150985_1199612053 | 3300012212 | Avena Fatua Rhizosphere | MSTPVATVTLPGVVENIIKSPHPSLPDKAQITVEGADELY
Ga0137360_107161452 | 3300012361 | Vadose Zone Soil | MADKTSVTLPGTVEKVIESPHRGMPEKAEIAVEGADDL
Ga0150984_1170822862 | 3300012469 | Avena Fatua Rhizosphere | MTEKPSTARPGIVERIIKSPDPREPEKAQISIEGAD
Ga0137358_107008462 | 3300012582 | Vadose Zone Soil | MSEKPSVTLPGTVEKIIPSPDPREPEKAHINIEEGATPLYKEIRIENTLTNEDG
Ga0137358_108749141 | 3300012582 | Vadose Zone Soil | MADKPSVTLPGTVEKVIESPHRGMPEKAEIAVEGADDLYREI
Ga0137397_108463861 | 3300012685 | Vadose Zone Soil | MTEKIVEKPSVTLPGTVEKIIEPIHPSMTEKAQIAVEGADDLYQE
Ga0137419_116958081 | 3300012925 | Vadose Zone Soil | MADKPSVTLPGTVEKVIESPHRGMPEKAEIAVEGADDLY
Ga0137416_109886011 | 3300012927 | Vadose Zone Soil | MTEKPSVSLPGTVDKIITPPDPRDPEKAQINIEEGADPLYKE
Ga0181535_102699402 | 3300014199 | Bog | MNPKPSATLPGTVEKIIKPPHPSEPEKAQIAVEGA
Ga0167658_10372683 | 3300015195 | Glacier Forefield Soil | MEESMPEKPSISLPGTVDKIIRPPDPREPEKAQINIEEGADPLYKEIR
Ga0137409_111849423 | 3300015245 | Vadose Zone Soil | MPEKPSVTLPATVEKIITTPDPIEPEKAQISLEDGDP
Ga0187821_105031051 | 3300017936 | Freshwater Sediment | MTENSSTSLCGTVEKIITSRVSTEPEKAQISIEGAD
Ga0187882_12346322 | 3300018021 | Peatland | MSEKPSTTLPATVEKIIKPLSPRDPEKAQIAVEGADHLY
Ga0187769_102818811 | 3300018086 | Tropical Peatland | MTEKPSVTLPGTVEKIIESPHPGEPEKAQISIEGADDLY
Ga0066662_109738071 | 3300018468 | Grasslands Soil | MPDTENPSTILPGTVEKIIKPWFPGDTEKAQISIQGADHMYREIRI
Ga0066669_121424791 | 3300018482 | Grasslands Soil | MSAKPSVTMPGTVEKIIPSHDPKEPDKAHISIEKGAIPLY
Ga0193721_11382742 | 3300020018 | Soil | MTEGSSDQSEKPSATLSGTVEKIIKSPDPNVPEKAQITVEGA
Ga0179592_103091113 | 3300020199 | Vadose Zone Soil | MAEKPSVTLPGTVEKIIKPSQPGQPEKAQIEIEGADDM
Ga0179592_104005871 | 3300020199 | Vadose Zone Soil | MFSRRGTLMTEKPSVTLPGTVEKIIKPIHPSEPEKAQISVEGADELY
Ga0210401_101230661 | 3300020583 | Soil | MSEKPSTTLPGTVEKIIKPLSPDDTEKAQIAVEGADH
Ga0210400_104067242 | 3300021170 | Soil | MTLRPSATLPGTVEKIIKPSDPSEPEKAQIAEDGADHL
Ga0210405_110782892 | 3300021171 | Soil | MSENSIVPDEKPGTTLSGIVEKVIKPRGPRDPEKAQIVAQGADHLYGEIRI
Ga0210388_100911591 | 3300021181 | Soil | MSEKASATLPAIVEKIIKPAYPSEPEKAQITVEGADHLYREI
Ga0210388_101439344 | 3300021181 | Soil | MTENPSVTLPGTVEKIIKPSAPSDAEKAQIAIEGADELYR
Ga0179585_11802391 | 3300021307 | Vadose Zone Soil | MTAKPSVTLPGTVEKIIQPSEPNQAEKAQIAIEGADDLYR
Ga0210393_101169781 | 3300021401 | Soil | MPEKPATVTLPGKVEAIIESFHPSEPEKAQIAVEGGDELY
Ga0210389_106733803 | 3300021404 | Soil | MTDKPRTADRPNVTLPGRVEKVIESPLPSEPEKAQISVEGA
Ga0210389_108049732 | 3300021404 | Soil | MTEKPSATLSGTVEMTIESIIPSEPEKAQITFGGADHPHKIRIENK
Ga0210387_116119141 | 3300021405 | Soil | MSEKPSTILPATVENIIKPLFPSEPEKAQITIEGADHLYRELR
Ga0210386_100515248 | 3300021406 | Soil | MTEKPSVTLPGTVEKIIKPAQPDQPEKAQIAIEGADDLY
Ga0210384_109654372 | 3300021432 | Soil | MSSEKAKDTMSSDEKPAITLPGTVEKIVPPVYPEPEKVQIHVEGADHLYK
Ga0210391_101146351 | 3300021433 | Soil | MASEQTHDKPAATLPGVVEKVIKPPSPAEPEKAQITVEGADH
Ga0210398_109510872 | 3300021477 | Soil | MSEKASATLPAIVEKIIKPAYPSEPEKAQIAVEGAD
Ga0212123_103877712 | 3300022557 | Iron-Sulfur Acid Spring | VTETNESEKPKVTLPGTVEKIIPANSIAPEKAQIAVEGADHLY
Ga0224562_10242941 | 3300022733 | Soil | MPEATESKKPAVTLPGTVEKIIPANTIAPERAQIAVEGADHL
Ga0224561_10181312 | 3300023030 | Soil | MSEKPSTTLPATVEKIIESPHPSVPEKVQLSVEGA
Ga0228600_10180681 | 3300024123 | Roots | VTEKHTVTLPATVEKIIKPSDPREPEKAQISIAGADDLYREIRI
Ga0228598_10073262 | 3300024227 | Rhizosphere | MSEKSSTTLPATVEKIIKSPHPSVPEKVQIAVEGADHLF
Ga0137417_149446812 | 3300024330 | Vadose Zone Soil | MLPGTVQKIIKPIDPHAPDTAEIAIEGAEDLYREIRVENT
Ga0209171_104861271 | 3300025320 | Iron-Sulfur Acid Spring | MSEKPSTTLPAVVEKIIKPIYPSEPEKAQIAVEGADHL
Ga0208935_10062722 | 3300025414 | Peatland | MSANANTTLPATVEKIIKSPHPAIPEKAQIAIEGADHL
Ga0207652_113827422 | 3300025921 | Corn Rhizosphere | VTLPGTVEKIIPSPHPSEPEKAQIGIDGADDLYREL
Ga0207664_100839011 | 3300025929 | Agricultural Soil | MTEKIDSKPSVTLPGTVEKIIKPLHPSMPEKAEIAVEGADELYQ
Ga0207664_115587921 | 3300025929 | Agricultural Soil | MPENPSTSIPAVVEKIIKVPGAPEKAQLIVEAGDDLYR
Ga0207702_100658891 | 3300026078 | Corn Rhizosphere | VSRTKKATVTLPGTVEKIIPPPSRFSDEPEKAEIAVE
Ga0209240_10445481 | 3300026304 | Grasslands Soil | MPEKPSVTLPGVVEQIIESPHPDMPEKAQIAVEGA
Ga0209471_12367971 | 3300026318 | Soil | MAEKPSVTLPGTVEKIIKPSEPGQLEKAQIEVEGADDMYREL
Ga0257163_10382001 | 3300026359 | Soil | MPEKTAEKPSVTLPGTIEKIIEPIHPSMPQKAEIAVHGA
Ga0209732_10015736 | 3300027117 | Forest Soil | MTEKPSVTLPGTVEKIIKPADPRDPEKAQINVHGAEPLYQEIRID
Ga0209004_10336541 | 3300027376 | Forest Soil | MTENASTTLVGTVEKIIKPRFPSEPERAQIVVEGADHLYK
Ga0209525_11114191 | 3300027575 | Forest Soil | MSEKASATLPAIVEKIIKPAYPSEPEKAQIAVEGADHLYR
Ga0209625_10425631 | 3300027635 | Forest Soil | MIEKSSTVLPGTVEKIIKSPYSAEPEKAQISVEGA
Ga0209118_11181481 | 3300027674 | Forest Soil | MTEKSSAILPGTVEKIISSLVPTQPEKAQIRVEVAD
Ga0209580_100274071 | 3300027842 | Surface Soil | MDEKTSTILPGTVEKIIKPPFPSMPEKAQITVEGGDH
Ga0209579_106950102 | 3300027869 | Surface Soil | MTEKPSVTLPGTVERVIRPIDPNQPDKAQITVQGAD
Ga0209275_102226991 | 3300027884 | Soil | MSEKASRTLPATVEKIIKPPVPDGTEKAQISVEGADHLYRELR
Ga0209275_108283871 | 3300027884 | Soil | MTDKPSVTLPGTVEKVIHSADPRIPEKSQIAVQGADDL
Ga0209006_104001912 | 3300027908 | Forest Soil | MSEKPNVTLPATVEKIIKSPDPRMPEKAQINIERGAEPLYQEIRI
Ga0209069_1069537413300027915WatershedsMTEKPSATLPGTVEKTIKSPFPSEPEKAQIAVEGAYHLY
Ga0137415_1102843833300028536Vadose Zone SoilMTEKPSVSLPGTVDKIITPPDPRDPEKAQINIEEGADPLYKEIRI
Ga0302209_1015709913300028772FenMSEKPNATLPGTVEKIIKSPDPEVPDKAQITVECAD
Ga0302225_1019832823300028780PalsaMSEKPSTTLPGTVEKIIKPISPDEPEKAQIAIHGADDLYREI
Ga0308309_1030577613300028906SoilMTEKPSVTLPGTVEKIIKPTQPDQPEKAQIAVEGADDLYR
Ga0308309_1034895123300028906SoilMSEKPSTTLPATVEKIIKPVFPSEPERAQIAIHGA
Ga0311339_1002697583300029999PalsaMGDKPSTTLPATVEKIIKSPFPSTPEKAQLAVEGAD
Ga0302274_1025824113300030041BogMSEKASATLPATVEKIIKSPAPSIPEKAQIAVEGADHL
Ga0302306_1034268413300030043PalsaMSEKPSTTLPATVDKIIRPPSPRDPEKAQITVEGADHLY
Ga0302282_111954823300030045FenMTEKPSATLPATVEKIIKPIAPGEPEKAQISIEGADYLYQEI
Ga0311360_1009299713300030339BogMPENPSVTLPAVVDKIIEPSNPSEPEKAQINIQEGAEPLYQEIRIENNLTDENGQ
Ga0311353_1130643723300030399PalsaMTEGPSATLPGTVEKIVKSPVPSEPDIAQIAVEGA
Ga0311345_1036964033300030688BogMTEKPSVTLPGVVEEVIPPAHPSQPEKAQIAVANADD
Ga0310039_1028913113300030706Peatlands SoilMTEKPSATLPGTVEKIIKSPHPSEPEKVQIAVEGADELYK
Ga0265750_102471023300030813SoilMSEKPSTTLPAIVEKVIKSPHPNEPEKAQITVEGAD
Ga0308152_10934313300030831SoilMNEKPSAILPGTVEKIIKSPIPNEPEKAQIAVEGADH
Ga0073994_1239997913300030991SoilMFNGKVQRKGSPMTEKPSVTLPGTVEKIIKPIHPSEPEKAQISVDGAD
Ga0170834_10563523613300031057Forest SoilMTEKPSVTLPGVVQKIIKPFDPKAPDRAQIAVEGADELYRE
Ga0302324_10171318113300031236PalsaMSREPEKESSEKPAITLLGTVEKIIPAIQPVEPEKAQISLEGADHLYREI
Ga0170820_1607323923300031446Forest SoilVTEKPSATMPGAVEKIIKSPWGETEKAQIAIETADHLYRESGLK
Ga0302326_1271118723300031525PalsaMSEKASATLPATVEKIIKSPAPSIPEKAQIAVEGADH
Ga0310686_10164675123300031708SoilMTIMSEKPSTTLAATVEKVIKPVSPGEPEKAQIAVEGADHLYREL
Ga0307474_1056005213300031718Hardwood Forest SoilMSEKPNITLPATVEKIIKSPDPKMPEKAQINIERGAEPLYQ
Ga0307469_1223580413300031720Hardwood Forest SoilMPENEKPSTTLPGTVEKIIKPLIPGEPEKAQISVEGADHL
Ga0307477_1023166323300031753Hardwood Forest SoilTISENSIVPDEKPSTTLPGIVEKLIKPRHPRDPEKAQIVVQEQTT
Ga0307475_1031012733300031754Hardwood Forest SoilMSEKPSVTLPGTVEKIIESPHRDVPEKAEIAVHGADD
Ga0302315_1028199133300031837PalsaVSDKPAVTLPGTVEKIIPPVAGEPEKAQIAVDGADDLYRE
Ga0316049_11332723300031866SoilVTEKHTVTLPATVEKIIKPSDPREPEKAQISIAGAD
Ga0307479_1211547813300031962Hardwood Forest SoilMAENPSATLPAIVEKIIKFPGAPEKAQVAVEGADHLYREI
Ga0307471_10365271423300032180Hardwood Forest SoilMSEKPSTTLPGTVERIIKPLSADDPEKAQIAIEGADDLY
Ga0316212_104932323300033547RootsMAEKPSTTLPATVEKIIKSRSPNEPEKAQIAVEGADPLYRE
Ga0334804_026801_1_1083300033818SoilMSEKSSTTLSATVEKVIKPLSPREPEKAQIAVEGAD




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"