NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F083036



Overview

Basic Information
Family ID F083036
Family Type Metagenome / Metatranscriptome
Number of Sequences 113
Average Sequence Length 47 residues
Representative Sequence LPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGIAYPFRRRR
Number of Associated Samples 94
Number of Associated Scaffolds 113

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 16.67 %
% of genes near scaffold ends (potentially truncated) 4.42 %
% of genes from short scaffolds (< 2000 bps) 5.31 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.45


Most Common Taxonomy
Group Unclassified (97.345 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (18.584 % of family members)
Environment Ontology (ENVO) Unclassified (27.434 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (53.982 % of family members)
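The percentages above can be read back as member counts. The sketch below assumes each percentage is a share of the 113 family sequences listed under Basic Information; the function name `members_from_percent` is illustrative and not part of NMPFamsDB.

```python
# Assumption: each percentage is a fraction of the 113 family sequences
# ("Number of Sequences" under Basic Information).
FAMILY_SIZE = 113


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' value into a whole-number member count."""
    return round(percent * family_size / 100)


# Percentages copied from the Most Common Taxonomy / Ecosystem summaries.
shares = {
    "Unclassified taxonomy": 97.345,
    "GOLD: Forest Soil -> Soil": 18.584,
    "ENVO: Unclassified": 27.434,
    "EMPO: Soil (non-saline)": 53.982,
}

counts = {label: members_from_percent(pct) for label, pct in shares.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE} members")
```

Under that assumption every listed percentage maps to a whole number of sequences, which is a quick consistency check on the summary values.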




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 54.05%, β-sheet: 0.00%, Coil/Unstructured: 45.95%

Predicted 3D Structure

pTM-score: 0.45

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 113 Family Scaffolds
PF05532 | CsbD | 1.77
PF00155 | Aminotran_1_2 | 1.77
PF00296 | Bac_luciferase | 1.77
PF13358 | DDE_3 | 1.77
PF01979 | Amidohydro_1 | 0.88
PF04075 | F420H2_quin_red | 0.88
PF00903 | Glyoxalase | 0.88
PF07681 | DoxX | 0.88
PF08837 | DUF1810 | 0.88
PF00535 | Glycos_transf_2 | 0.88
PF05406 | WGR | 0.88
PF13565 | HTH_32 | 0.88
PF13701 | DDE_Tnp_1_4 | 0.88
PF01751 | Toprim | 0.88
PF09929 | DUF2161 | 0.88
PF08309 | LVIVD | 0.88
PF13280 | WYL | 0.88
PF13518 | HTH_28 | 0.88
PF00848 | Ring_hydroxyl_A | 0.88
PF02586 | SRAP | 0.88
PF01717 | Meth_synt_2 | 0.88
PF11249 | DUF3047 | 0.88
PF01694 | Rhomboid | 0.88
PF00872 | Transposase_mut | 0.88
PF00595 | PDZ | 0.88
PF03372 | Exo_endo_phos | 0.88
PF00313 | CSD | 0.88
PF00239 | Resolvase | 0.88
PF01594 | AI-2E_transport | 0.88
PF09650 | PHA_gran_rgn | 0.88
PF11901 | DUF3421 | 0.88
PF13751 | DDE_Tnp_1_6 | 0.88
PF00753 | Lactamase_B | 0.88
PF07993 | NAD_binding_4 | 0.88
PF03171 | 2OG-FeII_Oxy | 0.88
PF08840 | BAAT_C | 0.88
PF01244 | Peptidase_M19 | 0.88
PF13538 | UvrD_C_2 | 0.88
PF01527 | HTH_Tnp_1 | 0.88
PF01476 | LysM | 0.88
PF01425 | Amidase | 0.88
PF14863 | Alkyl_sulf_dimr | 0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 113 Family Scaffolds
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 1.77
COG3237 | Uncharacterized conserved protein YjbJ, UPF0337 family | Function unknown [S] | 1.77
COG4638 | Phenylpropionate dioxygenase or related ring-hydroxylating dioxygenase, large terminal subunit | Inorganic ion transport and metabolism [P] | 1.77
COG0154 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 0.88
COG0620 | Methionine synthase II (cobalamin-independent) | Amino acid transport and metabolism [E] | 0.88
COG0628 | Predicted PurR-regulated permease PerM | General function prediction only [R] | 0.88
COG0705 | Membrane-associated serine protease, rhomboid family | Posttranslational modification, protein turnover, chaperones [O] | 0.88
COG1961 | Site-specific DNA recombinase SpoIVCA/DNA invertase PinE | Replication, recombination and repair [L] | 0.88
COG2135 | ssDNA abasic site-binding protein YedK/HMCES, SRAP family | Replication, recombination and repair [L] | 0.88
COG2259 | Uncharacterized membrane protein YphA, DoxX/SURF4 family | Function unknown [S] | 0.88
COG2355 | Zn-dependent dipeptidase, microsomal dipeptidase homolog | Posttranslational modification, protein turnover, chaperones [O] | 0.88
COG2452 | Predicted site-specific integrase-resolvase | Mobilome: prophages, transposons [X] | 0.88
COG3328 | Transposase (or an inactivated derivative) | Mobilome: prophages, transposons [X] | 0.88
COG3831 | WGR domain, predicted DNA-binding domain in MolR | Transcription [K] | 0.88
COG4270 | Uncharacterized membrane protein | Function unknown [S] | 0.88
COG5276 | Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain | Function unknown [S] | 0.88
COG5579 | Uncharacterized conserved protein, DUF1810 family | Function unknown [S] | 0.88



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 97.35 %
All Organisms | root | All Organisms | 2.65 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300005167|Ga0066672_10094605 | Not Available | 1812
3300009143|Ga0099792_10189641 | Not Available | 1162
3300012203|Ga0137399_10580370 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 942
3300020580|Ga0210403_10780858 | Not Available | 760
3300025916|Ga0207663_10609977 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 858
3300025941|Ga0207711_11891936 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 540
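Each scaffold identifier in this table has two parts joined by a pipe character. The sketch below splits one into its components, assuming the part before the pipe is the Taxon OID of the originating sample (the values match the Taxon OIDs listed under Associated Samples) and the part after it is the scaffold name. The helper name `parse_scaffold_id` is illustrative only.

```python
from typing import Tuple


def parse_scaffold_id(scaffold_id: str) -> Tuple[str, str]:
    """Split '<taxon_oid>|<scaffold_name>' into its two components.

    Assumption: the prefix before '|' is the sample Taxon OID.
    """
    taxon_oid, sep, scaffold_name = scaffold_id.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


# Identifiers copied from the Associated Scaffolds table.
for sid in ("3300005167|Ga0066672_10094605", "3300025941|Ga0207711_11891936"):
    oid, name = parse_scaffold_id(sid)
    print(oid, name)
```

Splitting the identifier this way makes it straightforward to join scaffold rows with the sample table on the Taxon OID.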




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 18.58%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 14.16%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 12.39%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 7.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 6.19%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 6.19%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.54%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 3.54%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.77%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.77%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.77%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 1.77%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.77%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.77%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.89%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.89%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.89%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.89%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.89%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.89%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.89%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.89%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.89%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.89%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 0.89%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.89%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.89%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.89%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.89%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.89%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300003505 | Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924) | Environmental
3300004631 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome) | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005367 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG | Host-Associated
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005437 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG | Environmental
3300005458 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG | Environmental
3300005543 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG | Host-Associated
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005618 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 | Host-Associated
3300005718 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005841 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 | Host-Associated
3300006031 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100 | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010113 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_24_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300010863 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (PacBio error correction) | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300013306 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG | Host-Associated
3300013308 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaG | Host-Associated
3300014497 | Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-129_1 metaG | Host-Associated
3300014968 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaG | Host-Associated
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015372 | Soil combined assembly | Host-Associated
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300016357 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 | Environmental
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300019888 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1c2 | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021181 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300021474 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300022529 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O (Metagenome Metatranscriptome) (v2) | Environmental
3300025898 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes) | Environmental
3300025912 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes) | Environmental
3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental
3300025941 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes) | Host-Associated
3300026095 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes) | Host-Associated
3300027706 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen15_06102014_R2 (SPAdes) | Environmental
3300027812 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300031545 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26 | Environmental
3300031547 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4 | Environmental
3300031679 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23 | Environmental
3300031751 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f24 | Environmental
3300031833 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178 | Environmental
3300031897 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f16 | Environmental
3300031910 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2) | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300031942 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176 | Environmental
3300031946 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172 | Environmental
3300032035 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF170 | Environmental
3300032059 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27 | Environmental
3300032076 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2) | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ51221_101402132 | 3300003505 | Forest Soil | LPFNWGSKAESALRLFGADRALLLIEGVAIAKVAMLGIAYFFRRRRQPML*
Ga0058899_101158021 | 3300004631 | Forest Soil | TQLLLPDWASKAESALRIFGADRALLLVEGIAMAKLIMLAIAYPFRRGR*
Ga0066672_100946053 | 3300005167 | Soil | LTTVADENLLLIKTVTRSLPFDWGSKIESALRIFGADRALLLIESIALAKLIMLGIAYPFRRRR*
Ga0066677_104358822 | 3300005171 | Soil | PPDWASKTESALRMFGADRALLLVEGVAVAKLIMLGIAQPFRRRRP*
Ga0066683_103030511 | 3300005172 | Soil | LPLDWSSNVESAVRIFGADRALLLIEGVALAKFIMLAVAHPSAAARNR
Ga0066684_106245931 | 3300005179 | Soil | KVESALRIFGADRALLLIEGVAVAKLILLSLAYPFRGPRP*
Ga0066388_1014642363 | 3300005332 | Tropical Forest Soil | AADANLQLIKIVTGKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGFAYPFRRRR*
Ga0070667_1010403201 | 3300005367 | Switchgrass Rhizosphere | ASKTESALRIFGADRALLLVEGIALAKLIMLGVAQPFRRRRP*
Ga0070713_1000961035 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | KAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR*
Ga0070710_104942861 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | LTQLLPPDWASKAESALRIFGADRALLLVEGIALAKLIMLAIAYPFRRGR*
Ga0070681_105841542 | 3300005458 | Corn Rhizosphere | WASRVESALRIFGADRALLLVEGIALAKIIMLSLAYPFRRRS*
Ga0070672_1016415752 | 3300005543 | Miscanthus Rhizosphere | LPRDWSSKVESALRIFGADRALLLVEAVAVAKLLMLGVAYPFRRR*
Ga0066700_106116081 | 3300005559 | Soil | SKIESALRIFGADRALLLIEGVAIAKVVMLGFAYPFRRRRR*
Ga0066670_108171821 | 3300005560 | Soil | IKTVSRFLPFDWGGKVESALRIFGADRALLLIESVALAKLMMLGVAYPFRRDRP*
Ga0066705_109697971 | 3300005569 | Soil | ADGNLQLIKIVTSKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGFAYPFRRRR
Ga0066708_108844632 | 3300005576 | Soil | FDWGSKIESALRIFGADRALLLIESVALAKLIMLGIAYPLRRRR*
Ga0066708_110173561 | 3300005576 | Soil | DENLLLIKTVTRSLPFDWGSKIESALRIFGADRALLLIESIALAKLIMLGIAYPFRRRR*
Ga0068864_1010734562 | 3300005618 | Switchgrass Rhizosphere | LDWASKSESALRMFGADRALLLIEGVALAKLLMLAVAYPFRRRR*
Ga0068866_105110731 | 3300005718 | Miscanthus Rhizosphere | RDWSSKVESALRIFGADRALLLVEAVAVAKLVMLGVAYPFRRR*
Ga0066903_1007023551 | 3300005764 | Tropical Forest Soil | SALRIFGADRALLLIEGVAIAKVVMLGFAYPFRRRR*
Ga0066903_1085250923 | 3300005764 | Tropical Forest Soil | TRVLPFDWGSKVESALRIFGADRALLLIEGVVIAKVIMLSIAYPFRRRRP*
Ga0068863_1012934231 | 3300005841 | Switchgrass Rhizosphere | PAHWASKTESALRIFGADRALLLVEGIAVAKLVLLAVAQPFRRRRP*
Ga0068863_1026348361 | 3300005841 | Switchgrass Rhizosphere | REITRLLPLDWASKSESALRMFGADRALLLIEGVALAKLLMLAVAYPFRRRR*
Ga0066651_100994801 | 3300006031 | Soil | LPLDWASKVESALRIFGADRALLLIEGVAVAKLIMLSLAFSFRRFGR*
Ga0070712_1012019591 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | IKTLTQLLPPDWASKAESALRIFGADRALLLVEGIAIAKLVMLAIAYPFRRGR*
Ga0070765_1022608902 | 3300006176 | Soil | DLLKTVTGTLPPDWGSKIESALRIFGADRALLLIESVVVAKAILLGIAYPFRRRQPMP*
Ga0075425_1027935142 | 3300006854 | Populus Rhizosphere | TRLLPRDWSSKVESALRIFGADRALLLVEAVAVAKLLMLGVAYPFRRR*
Ga0075434_1003517721 | 3300006871 | Populus Rhizosphere | IKTVTGLLPLDLASKAESALRIFGADRALLLIEGVAVAKLILLALAHPFRRFRP*
Ga0079219_105379631 | 3300006954 | Agricultural Soil | LPFDWGKKVESALRIFGADRALLLIEGVAFAKVIMLGFAYPFRRRR*
Ga0066709_1032419871 | 3300009137 | Grasslands Soil | AESALRIFGADRALLLVEAIALAKLIMLAVAYPFRRGR*
Ga0099792_100668911 | 3300009143 | Vadose Zone Soil | PDWASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR*
Ga0099792_101896412 | 3300009143 | Vadose Zone Soil | RIIKTLTQLLPPDWASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR*
Ga0126380_100472361 | 3300010043 | Tropical Forest Soil | NLQLIKIVTSKLPFDWGSKVESALRIFGADRALLLIEGVVIAKVIMLSIAYPFRHRRP*
Ga0126373_111720811 | 3300010048 | Tropical Forest Soil | SKVESALRIFGADRALLLIEGVAIAKVVMLSIAYPFRRRR*
Ga0127444_10641321 | 3300010113 | Grasslands Soil | SKVESALRIFGADRALLLMEGVAVAKLVMLSLAYPFRGFRR*
Ga0134063_103735301 | 3300010335 | Grasslands Soil | DWVSKVESALRIFGADRALLLMEGVAVAKLIMLSLAYPFRGFRL*
Ga0126378_122583772 | 3300010361 | Tropical Forest Soil | GSKVESALRIFGADRALLLIEGVAIAKVVMLGIAYPFRRRR*
Ga0126378_131900232 | 3300010361 | Tropical Forest Soil | DWGSKIESALRIFGADRALLLIEGVAIAKVVMLGLAYPFRRRR*
Ga0126379_101370873 | 3300010366 | Tropical Forest Soil | WASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR*
Ga0126379_109957982 | 3300010366 | Tropical Forest Soil | DANLQLIKIVTSKLPFDWGSKVESALRIFGADRALLLIEGVVIAKVIMLSIAYPFRHRRP
Ga0126381_1005818286 | 3300010376 | Tropical Forest Soil | TSKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGIAYPFRRGR*
Ga0126381_1031594401 | 3300010376 | Tropical Forest Soil | ADANLQLIKIVTSKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLSIAYPFRRRR
Ga0126381_1033055563 | 3300010376 | Tropical Forest Soil | ADANLQLIKIVTSKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLSTAYPFRRRR
Ga0126381_1037603021 | 3300010376 | Tropical Forest Soil | TGKLPFDWGSKVESALRIFGADRALLLIEGVAIAKIVMLGFAYPFRRRR*
Ga0126381_1039223751 | 3300010376 | Tropical Forest Soil | KIVTSKLPFDWGSKIESALRIFGADRALLLIEGVAIAKIVMLGIAYPFRRRR*
Ga0126381_1044326071 | 3300010376 | Tropical Forest Soil | KLPFDWGSKVESALRIFGADRALLLIEGVVIAKVIMLSIAYPFRHRRP*
Ga0126383_123631681 | 3300010398 | Tropical Forest Soil | LPFDWGSKVESALRIFGADRALLLIEGVVIAKVIMLSIAYPFRHRRP*
Ga0126383_129380551 | 3300010398 | Tropical Forest Soil | ADANLQLIKIVTSKLPFDWGSKIESALRIFGADRAVLLIEGVAIAKVVMLGIAYPFRPRR
Ga0134127_112729432 | 3300010399 | Terrestrial Soil | ENLRLIKEITQLLPPDWASKAESALRIFGADRALLLVEGIAIAKLLMLAIAYPFRRRIR*
Ga0124850_10644791 | 3300010863 | Tropical Forest Soil | PFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGFAYPFRRRR*
Ga0137388_108090371 | 3300012189 | Vadose Zone Soil | WASKTESALRMFGADRALLLIEGVAIAKLIMLGIAQPFRRRRP*
Ga0137399_103505262 | 3300012203 | Vadose Zone Soil | VIAAPASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR*
Ga0137399_105803703 | 3300012203 | Vadose Zone Soil | TSLLPLDWPSKVESVLRILGADRALLLIEGVAAAKIIMLSLAYPFRRPRP*
Ga0137370_101054153 | 3300012285 | Vadose Zone Soil | TKLLPLDWASKVESALRIFGADRALLLIEGVAVAKLIMLSLAFSFRRFGR*
Ga0137398_100682894 | 3300012683 | Vadose Zone Soil | VESALRIFGADRALLLVEGIVIAKVVMLGIAHPFRHRRP*
Ga0137395_109118252 | 3300012917 | Vadose Zone Soil | DWASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR*
Ga0137394_104806131 | 3300012922 | Vadose Zone Soil | TQLLPPDWASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR*
Ga0137419_116312192 | 3300012925 | Vadose Zone Soil | PPDWASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR*
Ga0137410_116470951 | 3300012944 | Vadose Zone Soil | WASKTESALRIFGADRALLLVEGVAVAKLIMLGIAQPFRRRRP*
Ga0163162_134095461 | 3300013306 | Switchgrass Rhizosphere | DWASKAESALRIFGADRALLLVESVALAKFIMLAVAYPFRRKR*
Ga0157375_106442891 | 3300013308 | Miscanthus Rhizosphere | RDWSSKVESALRIFGADRALLLVEAVAVAKLLMLGVAYPFRRR*
Ga0182008_105374071 | 3300014497 | Rhizosphere | DWASKSESALRMFGADRALLLIEGVALAKLLMLAVAYPFRRRR*
Ga0157379_121125911 | 3300014968 | Switchgrass Rhizosphere | NLRLIKAVTQQLPPDWASKTESALRIFGADRALLLIEGVAIAKLVMLGLAYPFRRQRG*
Ga0137420_10850851 | 3300015054 | Vadose Zone Soil | ESVLRILGADRALLLIEGVAAAKIIMLSLAYPFRRPRP*
Ga0137420_12010522 | 3300015054 | Vadose Zone Soil | VSCPPDWASKTESALRIFGADRALLLVEGVAVAKLIMLGIAQPFRRRRP*
Ga0132256_1039008921 | 3300015372 | Arabidopsis Rhizosphere | DWASKTESALRIFGADRALLLVEGIALAKLIMLGVAQPFRRRRP*
Ga0182041_109780081 | 3300016294 | Soil | SALRIFGADRALLLVEGVALAKLIMLGIAQPFRRGRPWRRN
Ga0182032_112461291 | 3300016357 | Soil | IKIITSKLPFDWGSKVESALRIFGADRALLLIEDVAIAKVVMLGIAYPFRRRR
Ga0190272_133236451 | 3300018429 | Soil | ESALRIYGADRALLLIEGVALAKLLMLAVAYPFRRRH
Ga0066655_112528541 | 3300018431 | Grasslands Soil | KTESALRMFGADRALLLVEGVAVTKLIMLGIAQPFRRRRP
Ga0066662_109119201 | 3300018468 | Grasslands Soil | DWASKAESAFRNFGADRALLLVEGIAIAKLIMLAVAYPFRRGR
Ga0193751_10336604 | 3300019888 | Soil | LPVDWGSKVESALRIFGADRALLLVEGIVIAKVVMLGIAHPFRHRRP
Ga0210403_107808581 | 3300020580 | Soil | NLRIIKTLTQLLPPDWASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR
Ga0210399_110408413 | 3300020581 | Soil | AESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR
Ga0210401_108369451 | 3300020583 | Soil | SALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR
Ga0210400_101711063 | 3300021170 | Soil | DWASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR
Ga0210388_114441062 | 3300021181 | Soil | LPFYWGSKVESALRIFGADRALLLIEGVAIAKIIMLGIALLFRRRRR
Ga0210393_103662073 | 3300021401 | Soil | ASKTESALRMFGADRALLLVEGVAVAKLIMLGIAQPFRRRRP
Ga0210384_109837612 | 3300021432 | Soil | LLPPDWASKAESALRIFGADRALLLVEGIAIAKIIMLAIAYPFRRGR
Ga0210391_107014302 | 3300021433 | Soil | WGSKIESALRIFGADRALLLIESVVVAKAILLGIAYPFRRRQPMP
Ga0210390_103710391 | 3300021474 | Soil | LPFNWGSKAESALRLFGADRALLLIEGVAIAKVAMLGIAYFFRRCRQPML
Ga0210410_108757242 | 3300021479 | Soil | LPFNWGSKAESALRLFGADRALLLIEGVAIAKVAMLGIAYFFRRRRQPML
Ga0126371_108289401 | 3300021560 | Tropical Forest Soil | IVTSKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLSIAYPFRRRR
Ga0126371_122562661 | 3300021560 | Tropical Forest Soil | KIVTSKLPFDWGSKVESALRIFGADRALLLIEGVAITKVVMLGIAYPFRRRG
Ga0242668_10771301 | 3300022529 | Soil | IKTLTQLLPPDWASKAESGLRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR
Ga0207692_102331422 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | ALRIFGADRALLLIEGVAVAKLILLALAHPFRRFRP
Ga0207692_106299342 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | ESACRIFGADRALLLVEGIAIAKLIMLAVAYPFRRGR
Ga0207692_106526341 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | SALRIFGADRALLLVEGIALAKLIMLAIAYPFRRGR
Ga0207707_114246481 | 3300025912 | Corn Rhizosphere | WASRVESALRIFGADRALLLVEGIALAKIIMLSLAYPFRRRS
Ga0207663_106099771 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | WASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR
Ga0207711_118919361 | 3300025941 | Switchgrass Rhizosphere | LIKEITRLLPRDWSSKVESALRIFGADRALLLVEAVAVAKLLMLGVAYPFRRR
Ga0207676_122078151 | 3300026095 | Switchgrass Rhizosphere | LDWASKSESALRMFGADRALLLIEGVALAKLLMLAVAYPFRRRR
Ga0209581_12259961 | 3300027706 | Surface Soil | PFDWGSKAESALRIFGADRALLLVEGIGVAKLIMLAMAHPFRRSR
Ga0209656_104569172 | 3300027812 | Bog Forest Soil | LPFDWGSKAESALRLFGADRALLLIEAIGIAKLVMLGIAHPFRRRP
Ga0209488_101122163 | 3300027903 | Vadose Zone Soil | IIKTLTQLLPPDWASKAESALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR
Ga0308309_102458811 | 3300028906 | Soil | AESAFRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR
Ga0318541_106155121 | 3300031545 | Soil | DWASKTESALRIFGADRALLLVEGVALAKLIMLGIAQPFRRRRS
Ga0310887_107336011 | 3300031547 | Soil | TRLLPPDWASKSESALRIFGADRALLLIEGVALAKLLMLAVAYPFRRRRLSDF
Ga0318561_102751783 | 3300031679 | Soil | AQSGGKTESALRIFGADRALLLVEGVALAKLIMLGIAQPFRRGRPWRRN
Ga0318561_105230811 | 3300031679 | Soil | QLIKFVTSKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGIAYPFRRRR
Ga0318494_102143842 | 3300031751 | Soil | FGADRALLLVEGVALAKLIMLGIAQPFRRRRPWRRN
Ga0310917_105308561 | 3300031833 | Soil | WGSKVESALRIFGADRALLLIEGVAIAKVVMLGIAYPFRRRR
Ga0318520_110220131 | 3300031897 | Soil | IKTITRVLPFDWGSKVESALRIFGADRALLLIEGVVIAKVIMLSIAYPFRRRQP
Ga0306923_104537403 | 3300031910 | Soil | ADANLQLIKIVTSKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGIAYPFRRRR
Ga0306921_122301281 | 3300031912 | Soil | ESALRIFGADRALLLVEGVALAKLIMLGIAQPFRRRRS
Ga0310916_110449842 | 3300031942 | Soil | AAADANLQLIKIVTSKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGIAYPFRRRR
Ga0310910_109186011 | 3300031946 | Soil | SALRIFGADRALLLVEGVALAKLIMLGIAQPFRRRRS
Ga0310911_104370952 | 3300032035 | Soil | KIVTGKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGFAYPFRRRR
Ga0318533_100543375 | 3300032059 | Soil | IKIVTSKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGIAYPFRRRR
Ga0306924_109229072 | 3300032076 | Soil | LQLIKIVTGKLPFEWGSKVESALRIFGADRALLLIEGVALAKFAMLGFAYPFRRRR
Ga0306924_119698262 | 3300032076 | Soil | LPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGIAYPFRRRR
Ga0307470_106447191 | 3300032174 | Hardwood Forest Soil | FDWPSKIESALRIFGADRALLLIEAVAVAKLIMLSFAYPFRRPRP
Ga0306920_1041368001 | 3300032261 | Soil | IKIITSKLPFDWGSKVESALRIFGADRALLLIEGVAIAKVVMLGIAYPFRRRR
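Some sequences in this table end with an asterisk. A minimal sketch for normalising them before length calculations follows; it assumes the trailing `*` marks a translated stop codon (a common convention in protein sequence output, not something this page states) and should not count as a residue. The helper names `clean_sequence` and `mean_length` are illustrative.

```python
from statistics import mean
from typing import Iterable


def clean_sequence(raw: str) -> str:
    """Strip surrounding whitespace and any trailing '*' from a protein sequence.

    Assumption: a trailing '*' is a stop-codon marker, not a residue.
    """
    return raw.strip().rstrip("*")


def mean_length(sequences: Iterable[str]) -> float:
    """Average residue count over the cleaned sequences."""
    return mean(len(clean_sequence(s)) for s in sequences)


# A few sequences copied from the Family Sequences table.
examples = [
    "KVESALRIFGADRALLLIEGVAVAKLILLSLAYPFRGPRP*",
    "SALRIFGADRALLLVEGIAIAKLIMLAIAYPFRRGR",
    "ESALRIYGADRALLLIEGVALAKLLMLAVAYPFRRRH",
]

for seq in examples:
    print(len(clean_sequence(seq)), clean_sequence(seq))
print("mean length:", mean_length(examples))
```

Applying `mean_length` to every sequence in the table gives a value that can be compared with the Average Sequence Length reported under Basic Information.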




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice