NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F101592



Overview

Basic Information
Family ID F101592
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 75 residues
Representative Sequence ESLYEAVVRGLNRLGSVGWESDTNETIKQVEVEIHQEPTKHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR
Number of Associated Samples 81
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 73.08 %
% of genes near scaffold ends (potentially truncated) 6.86 %
% of genes from short scaffolds (< 2000 bps) 13.73 %
Associated GOLD sequencing projects 72
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.49


Most Common Taxonomy
Group Unclassified (74.510 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (45.098 % of family members)
Environment Ontology (ENVO) Unclassified (48.039 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (47.059 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 34.95%, β-sheet: 13.59%, Coil/Unstructured: 51.46%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.49

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
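The cutoff in the note above can be expressed as a simple check. The following is a minimal sketch, not the database's own code: the function name `is_low_confidence` and the constant name are hypothetical, and only the 0.7 threshold and the 0.49 score come from this page.

```python
# Threshold taken from the note above: pTM < 0.7 is treated as low confidence.
PTM_CONFIDENCE_THRESHOLD = 0.7


def is_low_confidence(ptm_score: float,
                      threshold: float = PTM_CONFIDENCE_THRESHOLD) -> bool:
    """Return True when a predicted model falls below the pTM cutoff."""
    return ptm_score < threshold


# pTM-score reported for family F101592 on this page.
print(is_low_confidence(0.49))
```

With the page's score of 0.49 the check returns `True`, which matches the "Low Quality Model" label.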



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF01904 | DUF72 | 4.90
PF00589 | Phage_integrase | 2.94
PF01381 | HTH_3 | 2.94
PF00072 | Response_reg | 1.96
PF07366 | SnoaL | 1.96
PF13481 | AAA_25 | 0.98
PF00106 | adh_short | 0.98
PF00196 | GerE | 0.98
PF13432 | TPR_16 | 0.98
PF14373 | Imm_superinfect | 0.98
PF00963 | Cohesin | 0.98
PF13620 | CarboxypepD_reg | 0.98
PF02557 | VanY | 0.98
PF02371 | Transposase_20 | 0.98
PF14559 | TPR_19 | 0.98
PF08308 | PEGA | 0.98
PF16289 | PIN_12 | 0.98
PF00248 | Aldo_ket_red | 0.98
PF13274 | DUF4065 | 0.98
PF08534 | Redoxin | 0.98
PF04679 | DNA_ligase_A_C | 0.98
PF12161 | HsdM_N | 0.98
PF11185 | DUF2971 | 0.98
PF04313 | HSDR_N | 0.98
PF13365 | Trypsin_2 | 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG1801 | Sugar isomerase-related protein YecE, UPF0759/DUF72 family | General function prediction only [R] | 4.90
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 0.98
COG1876 | LD-carboxypeptidase LdcB, LAS superfamily | Cell wall/membrane/envelope biogenesis [M] | 0.98
COG2173 | D-alanyl-D-alanine dipeptidase | Cell wall/membrane/envelope biogenesis [M] | 0.98
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 0.98



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 74.51 %
All Organisms | root | All Organisms | 25.49 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300005445|Ga0070708_100959646 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 802
3300006034|Ga0066656_10269026 | All Organisms → Viruses → Predicted Viral | 1097
3300006176|Ga0070765_100025525 | All Organisms → cellular organisms → Bacteria | 4502
3300006893|Ga0073928_10117691 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2196
3300012096|Ga0137389_10135175 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2003
3300012582|Ga0137358_10033912 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3362
3300012685|Ga0137397_10220630 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1411
3300012923|Ga0137359_10230693 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1653
3300012924|Ga0137413_10282420 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1153
3300012925|Ga0137419_10759430 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 791
3300012929|Ga0137404_10002321 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 12566
3300020140|Ga0179590_1002401 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3219
3300020199|Ga0179592_10244325 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 807
3300020580|Ga0210403_10018384 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5582
3300020580|Ga0210403_11060783 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 631
3300020581|Ga0210399_10027877 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 4504
3300020581|Ga0210399_10276769 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1401
3300020583|Ga0210401_10020511 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 6348
3300021171|Ga0210405_10001941 | All Organisms → cellular organisms → Bacteria | 22248
3300021180|Ga0210396_10673566 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 895
3300021432|Ga0210384_10032226 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4886
3300021478|Ga0210402_11902319 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 521
3300026374|Ga0257146_1007571 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1769
3300026490|Ga0257153_1006764 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2253
3300026555|Ga0179593_1257473 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1616
3300026557|Ga0179587_11004633 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 549
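
The scaffold identifiers listed above consist of a numeric prefix and a scaffold name joined by `|`. The prefixes match values in the Taxon OID column of the Associated Samples table (for example 3300005445), so the sketch below assumes the prefix is the sample's Taxon OID. The names `ScaffoldRef` and `parse_scaffold_id` are hypothetical and not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # assumed: Taxon OID of the source sample
    scaffold_name: str  # scaffold name within that sample


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split '<taxon_oid>|<scaffold_name>' into its two parts."""
    taxon_oid, sep, scaffold_name = scaffold_id.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


ref = parse_scaffold_id("3300005445|Ga0070708_100959646")
print(ref.taxon_oid, ref.scaffold_name)
```

Splitting the identifier this way makes it possible to join scaffold rows to sample rows by Taxon OID, provided the assumption about the prefix holds.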




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 45.10%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 20.59%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 8.82%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 4.90%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.96%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.96%
Bog Forest Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil | 1.96%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.98%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.98%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.98%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.98%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 0.98%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.98%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.98%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.98%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 0.98%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.98%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.98%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2170459009 | Grass soil microbial communities from Rothamsted Park, UK - July 2009 indirect DNA Tissue lysis 0-10cm | Environmental
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300002886 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm | Environmental
3300002907 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm | Environmental
3300002917 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300005574 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006162 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006893 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG | Environmental
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006953 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHMB (Metagenome Metatranscriptome) | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300010339 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM3 | Environmental
3300010343 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300011106 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLMC (Metagenome Metatranscriptome) (version 2) | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300020140 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly) | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022532 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2) | Environmental
3300024179 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK36 | Environmental
3300024347 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (PacBio error correction) | Environmental
3300026285 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes) | Environmental
3300026374 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-A | Environmental
3300026490 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-A | Environmental
3300026530 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes) | Environmental
3300026555 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027034 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M2 (SPAdes) | Environmental
3300027737 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM3 (SPAdes) | Environmental
3300027787 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027911 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes) | Environmental
3300030878 | Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZE1 (Metagenome Metatranscriptome) | Environmental
3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
F47_06672210 | 2170459009 | Grass Soil | LNLLGAVGRESDGETIRQVEVEIYQEPTKHVVDVPKLLKWVKEDSKYPGQKMRKEKLRKLLGVR
INPhiseqgaiiFebDRAFT_1047292111 | 3300000364 | Soil | AVGWESDGETIRQVKVEIFQEPTKHTVDVPKLLKWIKEDAKYPEQEMRKEKLRKLLGTR*
JGI25612J43240_10395561 | 3300002886 | Grasslands Soil | VIVHLRAESLYEAVLRGLNRLADVGWESGTAETIKQVEVEIYQEPTRHVVDVPKLLKGINQNAMSPRQETLKEKLRKLLGKR*
JGI25613J43889_100495611 | 3300002907 | Grasslands Soil | VIVHLRAESLYEAVLRGLNRLADVGWESGTAETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMNPR
JGI25616J43925_100996463 | 3300002917 | Grasslands Soil | VIVHLRAESLYEAVLRGLNRLADVGWESGTAETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMSPRQETLKEKLRKLLGKR*
Ga0066674_100328454 | 3300005166 | Soil | LKDVGWESNTDETIKRVEVEIHQEPTRHIVDVPKLLKWVQEDAGYPAQQTRKEKLRKLLNAQ*
Ga0066690_104144881 | 3300005177 | Soil | NRLKDVGWESNTDETIKRVEVEIHQEPTRLIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKL*
Ga0066388_1007964811 | 3300005332 | Tropical Forest Soil | MADLIFYGLLFVENQLTAVGWESDDDETIAQVEVEIFQEPTRHIVDVRKLLKWLKDESARPGQETRKENLRKLLGTR*
Ga0070708_1009596461 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | TEHSVEVYEESLYEAVVRGLNRLGCVGWESDTNETIKQVEVEIHQEPTKHVVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0066698_108102971 | 3300005558 | Soil | RGLNRLGDVGWESDTNETIKRVEVEIHQEPTRHIVDVPKLLKWVQEDSMYPAQQTRKEKLRKLLNAE*
Ga0066703_100291725 | 3300005568 | Soil | GLNRLQDVGWESDTNETIKRVEVEIHQEPTRHIVDVPKLLKWIEEKSTYPAQETRKAKLRKLLGTQ*
Ga0066705_100163341 | 3300005569 | Soil | VLRGLNRLQDIGWESNTGETVSRVEVEIHQEPTRHIVDVPKLLKWIEEKSTYPAQETRKAKLRKLLGTQ*
Ga0066694_100689393 | 3300005574 | Soil | LKDVGWESNTDETIKRVEVEIHQEPTRHIVDVPKLLKWVKEDAGYPAQQTRKEKLRKLLNAQ*
Ga0066656_102690262 | 3300006034 | Soil | VITLLELVRHSAVVYAESSYEAVVRGLNRLGSVGWESDTGETIKQVEVEIHQEPTKHVVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0075030_1009064822 | 3300006162 | Watersheds | VVVYAESLYEAVVRGLNLLGNVGWESDTNETIKRVEVEIHQEPTKHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0070765_1000255256 | 3300006176 | Soil | MPKCVVRLRDTLDIQYSVVVYAESLYEAVVCGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVPKLLKWISQNAMSPRQQTLKEKLRKLLGPR*
Ga0079221_105390663 | 3300006804 | Agricultural Soil | VYAESLYEAYEAVLRGLNRLADVGWESAANETVKRVEVEIHQEPTRHIVELKWIEEKSMYPAQEARKAK*
Ga0073928_101176911 | 3300006893 | Iron-Sulfur Acid Spring | MKPSSQVEVEIHQEPTRHVVDVPKQLKWVRETAVCMSPAQQTKKEQLRKLLGMKKVERIG
Ga0075426_103884523 | 3300006903 | Populus Rhizosphere | RLQDVGWESNGNETIQRVEVEIHQEPTRHTVDVPKLLKWVQEDSMYPAQQIRKAKLRKLLST*
Ga0074063_112727312 | 3300006953 | Soil | RGLNLLNSVGCESDGETIQKVEVEVHQEPTRHVVDVPRLLKWVKQSETHPGEKMRKEKLRKLLGMRRD*
Ga0099791_106545092 | 3300007255 | Vadose Zone Soil | RGLNRLGDVGWESETNETIKQVEVEIHQEPTKHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0099830_100305786 | 3300009088 | Vadose Zone Soil | ESDTNETIKQVEVEIHQEPTRHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLSKR*
Ga0099830_114599542 | 3300009088 | Vadose Zone Soil | VGWESDTNETIKQVEVEIHQEPTKHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0074046_104271602 | 3300010339 | Bog Forest Soil | VVVYAESLYEAVVRGLNQRSEVGWESHADETIKQVEVEIYQEPTRHIVDVPKLLKWINQNAMSPRQETLKEKLRKLLGTR*
Ga0074044_104370451 | 3300010343 | Bog Forest Soil | SDRDETIKHVEVEIHHEPTRHVVDVPKLLKWVGDSAMSMSPAQDYRKEKLRKLLGMKKPERTKR*
Ga0126376_100252913 | 3300010359 | Tropical Forest Soil | LNQLTAVGWESDDDETIGQVEVEIHQEPTRHTVDVRKLLKWLKDESARPGQETRKEKLRKLLGAR*
Ga0151489_10432201 | 3300011106 | Soil | NLLNSVGCESDGETIQKVEVEVHQEPTRHVVDVPRLLKWVKQSETHPGEKMRKEKLRKLLGMRRD*
Ga0137391_105969163 | 3300011270 | Vadose Zone Soil | ESLYEAVVRGLNRLGSVGWESDTNETIKQVEVEIHQEPTKHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0137391_114133331 | 3300011270 | Vadose Zone Soil | ESLYEAVVRGLNRLGSVGWESDTNETIKQVEVEIHQEPTKHVVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0137393_106771822 | 3300011271 | Vadose Zone Soil | LRGLSRLEDVGWESNANETIQRVEVEIHQEPTRHIVDVPKLLKWVQEDSMYPAQQTRKEKLRKLLSAQ*
Ga0137393_115120351 | 3300011271 | Vadose Zone Soil | ESLYEAVVRGLNRLGDVGWESDTNETIKRVAVEIHQEPTRHIVDVPKLLKWIEQNSMYPGQETRKEKLRKLLGKR*
Ga0137389_101351752 | 3300012096 | Vadose Zone Soil | MEHSVVVHAESLYEAVLRGLNRLAEVGRESDTGETIRQVEVEIYQEPTRHIVNVPKLLDWVKQDTMRPGQQTQKEKLRKLLGTR*
Ga0137388_106814022 | 3300012189 | Vadose Zone Soil | VGWESDTNETIKRVEVEIHQEPTKHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0137363_100165074 | 3300012202 | Vadose Zone Soil | MEDVGRESNADETIQHVEVEIHQEPTKHVVDVPKLLKWVGEKSMYPAQETRKAKLRKLLSTR*
Ga0137363_114038132 | 3300012202 | Vadose Zone Soil | MGKTLQNLLYEAVVRGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMSPRQQTLKEKLRTLLGTR*
Ga0137399_102126741 | 3300012203 | Vadose Zone Soil | VVRGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVLKLLKLINQNAMSPRQETLKEKLRKLLAT
Ga0137399_115081521 | 3300012203 | Vadose Zone Soil | VVVYAESLYEAVVRGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMSPRQETLKEKL
Ga0137399_115896211 | 3300012203 | Vadose Zone Soil | VEVYAESLYEAVLRGLNRLKDVGWESNTDETIKRVEVEIHQEPTRHIVDVPKLLKWVQEDAGYPAQQTRKAKLRKLLTQ*
Ga0137362_101000044 | 3300012205 | Vadose Zone Soil | LNQLTAVGWESADNETIAQVEVEIFQEPTHHVVDVRKLMKWLKVESVRPGQETRKEKLRKLLGAR*
Ga0137362_114491131 | 3300012205 | Vadose Zone Soil | QDVGWESNGNETIQRVEVEIHQEPTRHIVDVPKLLKWIAEKSMYPAQETRKAKLRKLLSTP*
Ga0137360_106532222 | 3300012361 | Vadose Zone Soil | VSRGLSRLQDVGWESDTNETIKRVEVEIHQEPTKHIVDVPKLLKWIEQNSMYPGQETRKEKLRKLLGKR*
Ga0137361_108254641 | 3300012362 | Vadose Zone Soil | LTRAGIVRDVRLYEAVLCGLSRLQDVGWESDTNETIKRVEVEIHQELTRHIVDVPKLLKWVEQNSMYPGQQTRKEKLRKLLRKR*
Ga0137361_117173362 | 3300012362 | Vadose Zone Soil | AESLYEAVLRGLNRLVNVGWESDTNETIRQVEVEIYQEPTRHIVNVPKLLNWVKQDTMRPGEQTRKEKLRKLLGTR*
Ga0137390_113552762 | 3300012363 | Vadose Zone Soil | VVYAESLYEAVVRGLNLLGNVGWESDTNETIKRVEVEIHQEPTKHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0137358_100339121 | 3300012582 | Vadose Zone Soil | LYEAILRGLNQLTAVGWESADNETIAQVEVEIFQEPTHHVVDVRKLMKWLKVESVRPGQETRKEKLRKLLGAR*
Ga0137397_102206301 | 3300012685 | Vadose Zone Soil | VIVHLRAESLYEAVLRGLNRLADVGWESGTAETIKQVEVEIYQEPTRHVVDVPKLLKGINQNAMSPRQETLKEKL
Ga0137395_112844441 | 3300012917 | Vadose Zone Soil | NRLQDVGWESNSEETIKRVEVEIHQEPTRHIVDVPNLLKWVKEDAGYPAQQTRKEKLRKLLSAQ*
Ga0137396_102649922 | 3300012918 | Vadose Zone Soil | VEVYAESLYEAVLRGLNRLGDVGWESDTNETIKRVEVEIHQEPTRHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0137396_103675692 | 3300012918 | Vadose Zone Soil | VVVYAESLYEAVLRGLNQLSEVGWESHADETIKQVEVEIYQEPTKHVVDVPKLLKWINQNAMSPRQETLKEKLRLLLGTKPLERRRTK*
Ga0137396_112764081 | 3300012918 | Vadose Zone Soil | MVRSIAPLFYAESLYEAVVRGLNRLGSVGWESDTNETIKRVEVEIHQEPTRHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR*
Ga0137394_107441281 | 3300012922 | Vadose Zone Soil | ESDANETIKQVEVEIHQEPTRHIVNVPKLLKWVEEKSMYPGQETRKEKLRKLLGKR*
Ga0137359_101222641 | 3300012923 | Vadose Zone Soil | VIVHLRAESLYEAVLRGLNRLADVGWESGTAETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMSPRQETLKEKLRKLLGTR*
Ga0137359_102306931 | 3300012923 | Vadose Zone Soil | ESDTNETIKRVEVEIHQEPTKHIVDVPKLLKWIEQNSMYPGQETRKEKLRKLLGKR*
Ga0137359_104068062 | 3300012923 | Vadose Zone Soil | AVFRGLNRLGDVGWESDTNETIKQVEVEIHQEPTRHIVDVPRLLKWVEQNSLYPGQETRKEKLRKLLGKR*
Ga0137413_102824201 | 3300012924 | Vadose Zone Soil | VHEAVVAESLYEAVLRGLNQLMDVGWESISDETISMVEVEIHQEPTRHMVNVPKLLEWVKQDGMRPLDQTRKEKLRKSLGTR*
Ga0137413_104201531 | 3300012924 | Vadose Zone Soil | VIVHLRAESLYEAVLRGLNRLADVGWESGTAETIKQVEAEIYQEPTRHVVDVPKLLKWINQNAMSPRQETLKEKLRKLLGTR*
Ga0137413_110914012 | 3300012924 | Vadose Zone Soil | VRGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMSPRQQTLKEKLRTLLGTR*
Ga0137419_107594302 | 3300012925 | Vadose Zone Soil | VGKEVRSNLLPLLIQNPEFRMPKCVVRLRDTLDIQHSVVVYAESLYEAVVRGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMSPRQET
Ga0137404_1000232115 | 3300012929 | Vadose Zone Soil | MRRGLNQLTAVGWESDDNETIAQVEVEIHQEPTRHVVDVRKLLKWVKDETARPRQETRKEKLRKLLGTR*
Ga0137410_119601741 | 3300012944 | Vadose Zone Soil | KNFLTRAGIVRDVRLYEAVLCGLSRLQDVGWESDTNETIKRVEVEIHQELTRHIVDVPKLLKWVEQNSMYPGQQTRKEKLRKLLRKR*
Ga0137418_102316771 | 3300015241 | Vadose Zone Soil | RGLNQLGDVGWESDANETIKQVEVEIHQEPTKHIVDVPKLLKWVEQNSIYPGQETRKEKLRKLLGKR*
Ga0137412_102359182 | 3300015242 | Vadose Zone Soil | MGKTLQNLLYEAVVRGLNQLSEVGWESHADETIKQVEVENYQEPTRHVVDVPKLLKWINQNAMSPRQQTLKEKLRTLLGTR*
Ga0134072_101933981 | 3300015357 | Grasslands Soil | LNRLQDIGWESDTGETVSRVEVEIHQEPTRHVVDVPKLLKWIQEKSAYPAQETRKAKLRKLLTQ*
Ga0066667_122236031 | 3300018433 | Grasslands Soil | VEVYAESLYEAVLRGLNRLQDIGWESDTGETVSRVEVEIHQEPTRHIVDVPKLLKWIEEKSTYPAQETRKAKLRKLLGTQ
Ga0179590_10024011 | 3300020140 | Vadose Zone Soil | VIVHLRAESLYEAVLRGLNRLADVGWESGTAETIKQVEVEIYQEPTRHVVDVPKLLKGINQNAMSPRQETLKEKLRKLLGKR
Ga0179592_102443252 | 3300020199 | Vadose Zone Soil | MGKTLQNLLYEAVVRGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMSPRQQTLKEKLRTLLGTR
Ga0210403_100183845 | 3300020580 | Soil | MPKCVVRLRHTLDIQHSVVVYAESLFEAVVRGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVPKLLKWIDQNAMSPRQQTLKEKLRKLLGAR
Ga0210403_102374821 | 3300020580 | Soil | HSVVVYAESLYEAVVRGLNRLGAVGWESDSNETIKQVEVEIHQEPTRHIVDVPKLLKWVEKNSMYPGQETRKEKLRKLLWKR
Ga0210403_110607831 | 3300020580 | Soil | MPKCVVRLRDTLDTQHSVVVYAESLYEAVVRGLNQLSSHADETIKQVEVEIYQEPTRHVVDVPKLLNWINQNAMSPRQQTH
Ga0210399_100278773 | 3300020581 | Soil | VVVYAESLYEAVVRGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMSPRQQTLKQKLRTLLGSR
Ga0210399_102767692 | 3300020581 | Soil | MPKCVVRLRDTLDIQHSVVVYAESLYEAVVRGLNQLSDVGWESHADETIKQVEVEIYQEPTRHIVDVPKLLKWINQNGMSPRQQTQKKKVEEVARGAITEEGKY
Ga0210399_103242642 | 3300020581 | Soil | RGLNQLGNVGWESDTNETIKQVEVEIHQEPTRHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLRKR
Ga0210399_104061832 | 3300020581 | Soil | YEAVVRGLNRLGAVGWESDSNETIKQVEVEIHQEPTRHIVDVPRLFIYPGQETRKEKLRKLLGKR
Ga0210401_100205116 | 3300020583 | Soil | MPKCVVRLRDTLDIQYSVVVYAESLYEAVVCGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVPKLLKWISQNAMSPRQQTLKEKLRKLLGPR
Ga0210404_105854862 | 3300021088 | Soil | WESDTNETIKRVEVEIHQEPTRHIVDVPKLLKWVEEKSMYPAQQTRKEKLRKLLIAR
Ga0210405_100019415 | 3300021171 | Soil | MRNRFNEAVLRGLNRLVDVGWESDSDETIRQVEVEVHQEPTRHIVNVPKLLNWVKQDTMRPGEQTKKEKLRKLLGTR
Ga0210396_106735662 | 3300021180 | Soil | YEAVLRGLNQLADVGWESDTDETIRQVEVEIHQEPTRHIVDVPKLLNWVKQDTSRPGQQTQKEKLRKLLGTR
Ga0210393_102556831 | 3300021401 | Soil | EHSATVYAEFLYEAVIRGLKLLDDVGWESDRNETIKHVEVEIHHEPTRHVVDVPRLLKWVRETAVCMFPAQQTKKEKLRKLLGTK
Ga0210387_108208531 | 3300021405 | Soil | QHSVVVYAESLYEAVVRGLNQLSDVGWESHADETIKQVEVEIYQEPTRHIVDVPKLLKWINQNGMSPRQQTQKKKVEEVARGAITEEGKY
Ga0210384_100322262 | 3300021432 | Soil | MPKCVVRLRDTLDIQYSVVVYAESLYEAVVRGLNQLSGVGWESHADETIKQVEVEIYQEPTKHVVDVPKLLAWINQNAMSPRQQTLKEKLRTLLGPR
Ga0210398_103632411 | 3300021477 | Soil | QHSVVVYAESLYEAVVRGLNQLAEVGWEAHADETIKQVEVEVYQEPTRHVVDVPKLLKWINQNAMSPRQQTLKEKLRTLLGTR
Ga0210402_119023191 | 3300021478 | Soil | MPKCVVRLRDTLDIQHSVVVYAESLYEAVVRGLNQLSEVGWESHADETIKQVEVEIYQEPTRHIVDVPKLLKWINQNGMSPRQQTQKK
Ga0210409_111019761 | 3300021559 | Soil | SNVDETIKRVEVEIHQEPTRHVVDVPRLLKWVQDRAMSTSPAQDTRKEKLRKLLSTRREK
Ga0242655_102304141 | 3300022532 | Soil | EAVVRGLNQLAEVGWEAHADETIKQVEVEVYQEPTRHVVDVPKLLKWINQNAMSPRQQTLKEKLRTLLGTR
Ga0247695_10423552 | 3300024179 | Soil | VGWESHADETIKQVEVEIYQEPTRHIVDVPKLLKWINQNAMSPRQQTLKEKLRTLLGTR
Ga0179591_10357064 | 3300024347 | Vadose Zone Soil | VIVHLRAESLYEAVLRGLNRLADVGWESGTAETIKQVEAEIYQEPTRHVVDVPKLLKWINQNAMSPRQETLKEKLRKLLGKR
Ga0209438_10263591 | 3300026285 | Grasslands Soil | VIVHLRAESLYEAVLRGLNRLADVGWESGTAETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMSPRQETLKEKLRKLLGKR
Ga0257146_10075712 | 3300026374 | Soil | VHEAVVQAESLYEAVLRGLNQLMDVGWESISDETISMVEVEIHQELTRHMVNVPKLLEWVKQDGMRPLDQTRKEKLRKLLGTR
Ga0257153_10067642 | 3300026490 | Soil | MPKCIVRLRDTLDIQHSVVVYAESLYEAVVRGLNQLSEVGWETHADETIKQVEVEIYQEPTKHVVDVPKLLAWIRQNAMSPRQQTLKEKLRMLLKTR
Ga0209807_11190561 | 3300026530 | Soil | YEAVLRGLNRLQDIGWESNTGETVSRVEVEIHQEPTRHIVDVPKLLKWIEEKSTYPAQETRKAKLRKLLGTQ
Ga0179593_12279703 | 3300026555 | Vadose Zone Soil | MIPSTVSFVYAESLYEAVVRGLNLLGNVGWESDTNETIKRVEVEIHQEPTRHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR
Ga0179593_12574732 | 3300026555 | Vadose Zone Soil | MPKCVVRLRDTLDIEHRVVVYAESLYEAVVRGLNQLSEVGWESHADETIKQVEVEIYQEPTRHVVDVPKLLKWINQNAMSPRQQTVKEQFRTLLRTR
Ga0179587_110046331 | 3300026557 | Vadose Zone Soil | MPKCVVRLRDTLEIQHSVVVYADSLYEAVLRGLNQLSEVGWESHADETIKQVEVEIFQEPTRHVVDVPKLLEWISQNAMSPRQETLKEKLRLL
Ga0179587_111838901 | 3300026557 | Vadose Zone Soil | TRLKDVGWESSTDETIKRVEVEIHHEPTTHIVDVPKLLKWLEGTDISPAQKTRKEKLRKLLGKR
Ga0209730_10352971 | 3300027034 | Forest Soil | VRWKNRYPHVYAELLYEAVVRGLNRLQDVGWESNGNETIQRVEVEIHREPTRHIVDVPKLLKWVQEKSKYPVQEARKAKLRKLLSAQ
Ga0209038_101978811 | 3300027737 | Bog Forest Soil | YAETLYEAVIRGLKMLEHVGWESDRDETIKHVEVEIHHEPTRHVVDVPKLLKWVGDSAMSMSPAQDYRKEKLRKLLGIKKLKRTKR
Ga0209074_103798433 | 3300027787 | Agricultural Soil | VEVYAQSLYEAVLRGLSRLQDVGWESNGNETIQRVEVEIHQEHTRHTVDVPKLLKWVQEDSMYPAQQIRKAKLRKLLST
Ga0209701_105865932 | 3300027862 | Vadose Zone Soil | VGWESNANETIQRVEVEIHQEPTRHIVDVPKLLKWVQEDSMYPAQQTRKEKLRKLLSAQ
Ga0209698_101577541 | 3300027911 | Watersheds | VVVYAESLYEAVVRGLNLLGNVGWESDTNETIKRVEVEIHQEPTKHIVDVPKLLKWVEQNSMYPGQETRKEKLRKLLGKR
Ga0265770_10009431 | 3300030878 | Soil | ETLYEAVIRGLRMLEHVGWESDRDETIKQIEVEIHREPTRHVVDVPRLLKWVGDTALCMSPAQQTKKEKLRKLLGSQ
Ga0170834_1106005712 | 3300031057 | Forest Soil | VVYAESLYEAALLGLNRLADVGWESATDETIRQVEVEIHQEPTRHTVNVPNLLKWINQGSGSPAQQTRKEMLRKLLGTR
Ga0307476_103379661 | 3300031715 | Hardwood Forest Soil | ESLYEAVLRGLRRLQDVGWESNGNETIQRLEVEIHQEPTRHIVDVPKLLKWIKEGAASPAQQTRKEKLRNLLSTQ
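
Several of the sequences above end in `*`. The sketch below assumes this is a translation-stop marker that should be dropped before counting residues or exporting. The helper names `clean_sequence` and `to_fasta` are hypothetical, and the example entry is the Vadose Zone Soil sequence from sample 3300011270 that matches the representative sequence in the Overview.

```python
def clean_sequence(seq: str) -> str:
    """Strip whitespace and a trailing '*' (assumed stop marker)."""
    return seq.strip().rstrip("*")


def to_fasta(protein_id: str, seq: str, width: int = 60) -> str:
    """Format one sequence as a FASTA record wrapped at `width` residues."""
    residues = clean_sequence(seq)
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return f">{protein_id}\n" + "\n".join(lines) + "\n"


# Entry copied from the Family Sequences table (sample 3300011270).
example_id = "Ga0137391_105969163"
example_seq = ("ESLYEAVVRGLNRLGSVGWESDTNETIKQVEVEIHQEPTKHIVDVPKLLKWVEQNSMYPG"
               "QETRKEKLRKLLGKR*")

print(len(clean_sequence(example_seq)))
print(to_fasta(example_id, example_seq), end="")
```

After cleaning, this entry has 75 residues, the same as the average sequence length reported in the Overview.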




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice