NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F098250

Metagenome Family F098250

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F098250
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 64 residues
Representative Sequence MTKVLAVILAAAVLPAFAPAARAQNLDDTFRKVSPFVVVVRSKGRDVGASGVTRFNETGSGVL
Number of Associated Samples 91
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score0.30

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(17.308 % of family members)
Environment Ontology (ENVO) Unclassified
(37.500 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(46.154 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 26.37%    β-sheet: 15.38%    Coil/Unstructured: 58.24%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.30
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 104 Family Scaffolds
PF01551Peptidase_M23 7.69
PF00496SBP_bac_5 5.77
PF09350DJC28_CD 3.85
PF11769DUF3313 2.88
PF01642MM_CoA_mutase 1.92
PF04392ABC_sub_bind 1.92
PF02687FtsX 1.92
PF04545Sigma70_r4 1.92
PF00723Glyco_hydro_15 1.92
PF00923TAL_FSA 0.96
PF00994MoCF_biosynth 0.96
PF08264Anticodon_1 0.96
PF06965Na_H_antiport_1 0.96
PF02538Hydantoinase_B 0.96
PF09917DUF2147 0.96
PF00903Glyoxalase 0.96
PF01156IU_nuc_hydro 0.96
PF04461DUF520 0.96
PF02371Transposase_20 0.96
PF03886ABC_trans_aux 0.96
PF04972BON 0.96
PF08327AHSA1 0.96
PF01702TGT 0.96
PF00589Phage_integrase 0.96
PF00005ABC_tran 0.96
PF01653DNA_ligase_aden 0.96
PF00484Pro_CA 0.96
PF13441Gly-zipper_YMGG 0.96
PF02518HATPase_c 0.96
PF08386Abhydrolase_4 0.96
PF12697Abhydrolase_6 0.96
PF06863DUF1254 0.96
PF14707Sulfatase_C 0.96
PF12973Cupin_7 0.96
PF030614HBT 0.96
PF00561Abhydrolase_1 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 104 Family Scaffolds
COG0146N-methylhydantoinase B/oxoprolinase/acetone carboxylase, alpha subunitAmino acid transport and metabolism [E] 1.92
COG1884Methylmalonyl-CoA mutase, N-terminal domain/subunitLipid transport and metabolism [I] 1.92
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 1.92
COG3387Glucoamylase (glucan-1,4-alpha-glucosidase), GH15 familyCarbohydrate transport and metabolism [G] 1.92
COG0176Transaldolase/fructose-6-phosphate aldolaseCarbohydrate transport and metabolism [G] 0.96
COG0272NAD-dependent DNA ligaseReplication, recombination and repair [L] 0.96
COG0288Carbonic anhydraseInorganic ion transport and metabolism [P] 0.96
COG0343Queuine/archaeosine tRNA-ribosyltransferaseTranslation, ribosomal structure and biogenesis [J] 0.96
COG1549Archaeosine tRNA-ribosyltransferase, contains uracil-DNA-glycosylase and PUA domainsTranslation, ribosomal structure and biogenesis [J] 0.96
COG1666Cyclic di-GMP-binding protein YajQ, UPF0234 familySignal transduction mechanisms [T] 0.96
COG1957Inosine-uridine nucleoside N-ribohydrolaseNucleotide transport and metabolism [F] 0.96
COG3004Na+/H+ antiporter NhaAEnergy production and conversion [C] 0.96
COG3547TransposaseMobilome: prophages, transposons [X] 0.96
COG5361Uncharacterized conserved proteinMobilome: prophages, transposons [X] 0.96


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil17.31%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil11.54%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil7.69%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere7.69%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil5.77%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil4.81%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.81%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands3.85%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.85%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil3.85%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere3.85%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere2.88%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere2.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil1.92%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.92%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.92%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.92%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment0.96%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil0.96%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.96%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.96%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.96%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.96%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.96%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.96%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil0.96%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.96%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.96%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.96%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000033Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000709Amended soil microbial communities from Kansas Great Prairies, USA - Total DNA F1.4 TB amended with BrdU and acetate no abondanceEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300003347Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Rhizosphere soil Co-N PMHost-AssociatedOpen in IMG/M
3300003995Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2EnvironmentalOpen in IMG/M
3300004019Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005205Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D2EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005355Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaGHost-AssociatedOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005526Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005578Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2Host-AssociatedOpen in IMG/M
3300005842Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2Host-AssociatedOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009168Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015EnvironmentalOpen in IMG/M
3300009177Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaGHost-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300011421Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT769_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017792Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaGHost-AssociatedOpen in IMG/M
3300017959Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MGEnvironmentalOpen in IMG/M
3300018058Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300025580Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025972Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026095Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028597Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Glucose_Day14EnvironmentalOpen in IMG/M
3300031538Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1EnvironmentalOpen in IMG/M
3300031572Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f19EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031765Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f22EnvironmentalOpen in IMG/M
3300031768Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031846Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f19EnvironmentalOpen in IMG/M
3300031880Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f25EnvironmentalOpen in IMG/M
3300031893Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f28EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032782Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
ICChiseqgaiiDRAFT_233182623300000033SoilMKKVFAAMLVAAMLPAFAPSPAPAQSQSLDDIFNRVSPTVVVVRAKGRDVNASGITRFNETGSGVLISSDGRVMTAA
INPhiseqgaiiFebDRAFT_10043144323300000364SoilMKKVLAVILAAALVPTFTPAVQAQSQNQSQSLDDLFTRVSPSVVVVRAKGRDVNAGGVTRFNETGSGVLI
F14TC_10064419913300000559SoilMILAAXXXPAFAPAAHAQNLDDVFRKASPTVVVVRAKGRDVNASGVIRFTETGSGVLISSDGRVMTA
F14TC_10182633923300000559SoilMRKIRVVIFALAALPVFAPAAQAQSPSEAQSQSLDDLFTRVSPTVVVVRAKGRDVAAGGITHFKETGSGVLI
KanNP_Total_F14TBDRAFT_101935613300000709SoilMKKVLAVILAAALVPAFTPAVQAQSQNQSQSLDDLFARVSPSVVVVRAKGRDVNAGGVTRFNE
JGI1027J12803_10706378313300000955SoilVLPAVAPAAQAQNLEEIFAKVSPTVVIVRSKGRDVKAAGITSFTETGSGVLISDSG
F14TB_10130672223300001431SoilMKNVLAVILAGAVLPALAPAARAQNLDDTFRKVSPFVVVVRSKGRDVGASGVTRFNETGSGVL
JGI26128J50194_100918723300003347Arabidopsis Thaliana RhizosphereMQKSLAAVLAAAMVLSGVPAAPQAQRESLDDLFTRVSPTVVVVRAKGRDVTATGVTRFNETGSGVLISSDGRV
Ga0055438_1027471023300003995Natural And Restored WetlandsMKKLLAVMFAAAVLPAFAPAAQAQNLDDLFRKVNPSVVVIKAKGRDVSAGGVTRFNE
Ga0055439_1011619613300004019Natural And Restored WetlandsMKKLLAVMFAAAVLPAFAPAAQAQNLDDLFRKVNPSVVVIKAKGRDVSAGG
Ga0066683_1041020123300005172SoilMKKVLAVILAAGVLPAFAPAARAQNLDDTFRKVSPFVVVVRSKGRDVGASGVTRFNETGSGVLISS
Ga0068999_1007033223300005205Natural And Restored WetlandsMMAPMKRLIAAILAVGALPALVPAAQAQSLDDLFRKVNPSVVVIKAKGRDVSAGGVTRFTETGSGVLVS
Ga0066388_10814202423300005332Tropical Forest SoilVTLTAAVLTAVVPAARAQNLDEIFRKVNPSVIVVRSNGRDVGAGGITRFRETGSGVLISDRGRVM
Ga0070671_10044008413300005355Switchgrass RhizosphereMRQRAPRKTRLTLVLAVAVLPAFAPAAQAQNLDEVFRKASPTVVVVRSKGRDVNASGVVRFTETGSGVLISDDG
Ga0070710_1148180913300005437Corn, Switchgrass And Miscanthus RhizosphereVKNILSVMVVVAIVPAFAPAAQAQSPSLDDLFTRVSPTVVVVRSKGRDVSAGGVTRFN
Ga0070694_10104426513300005444Corn, Switchgrass And Miscanthus RhizosphereMRKVFPKKVLVVALAAAVLPVFAPTAQAQNQSLDDIFTRVSPTVVVVRSKGRDV
Ga0073909_1021082913300005526Surface SoilMRQRAPRKTRLTLVLAVAVLPAFAPAAQAQNLDEVFRKASPTVVVVRSKGRDVNASGVVRFTETGSGVLISDDGRVMT
Ga0070697_10164224323300005536Corn, Switchgrass And Miscanthus RhizosphereMKKVLAVILAAAMLPAFAPAALAQSQSLDDLFTRVSPSVVVVRSKGRDVSAGGVTR
Ga0070696_10150061813300005546Corn, Switchgrass And Miscanthus RhizosphereMKKSLVAVLAAALPALVLPALAPPAHAQNLDEVFRKVSPVVVVVRSKGRDVRVSGITHFKETGSG
Ga0068854_10133346113300005578Corn RhizosphereMKTFLALILAAAVLPAFAPAAVAQNLEEIFRKASPTVVVVRSKGRDVGAGGVTHFKETGSGVLISDSGRVM
Ga0068858_10120430613300005842Switchgrass RhizosphereMKTFLALILAAAVLPAFAPAAVAQNLEEIFRKASPTVVVVRSKGRDVGAGGVTH
Ga0075417_1008215823300006049Populus RhizosphereMKKALAVILAAAMSPVVASGAQAQNLDETFRKVSPFVVVVRSKGRDVGPSGIVRFNETGSGVLISRDGRVMT
Ga0075417_1028558023300006049Populus RhizosphereMKKVFVVILAAAALPAFAPAAHAQNLDDVFRKASPFVVVVRSKGRDVGASGIVRFNETGSGVL
Ga0075428_10069406223300006844Populus RhizosphereMKKVFVVILAAAALPAFAPAAHAQNLDDIFRKASPFVVVVRSKGRDVGASGIVRFNETGSGVLIS
Ga0075428_10081804123300006844Populus RhizosphereMRKLFAMTLAAAVLPAFAPAAQAQNLDEVFRKASPTVVVVRAKGRDVTASGITRFTETGSGVLISGDGRVM
Ga0075425_10214975523300006854Populus RhizosphereMMKILAVTLAVAVLPAFAPTARAQNLDDIFGKVNPSVLVVRSKGRDVGAGGVTRFKETG
Ga0075435_10117892733300007076Populus RhizosphereMNKVAAMMLAAVMLPAFAPTPAESQSLDDIFNRVSPTVVVVRSTGRDVGATGIIRFNETGSGVL
Ga0099794_1039467623300007265Vadose Zone SoilMQKLLAIILAVASLSVLAPAAQAQNLDDIFREVNPSVVVVRSKGRDVSAAGITRFNETGSGVLISDSG
Ga0111538_1273922413300009156Populus RhizosphereVSVPGVAPPVHLEVHLKKLFAIVFAVAVLPSFVPPACAQHLDDTFRKVSPFVVVVRSKGRDVGASGIVRFNETGSGVLISSDG
Ga0075423_1021636043300009162Populus RhizosphereVKKIVSVLVAVAIVPAFAPAAQAQSPSLDDLFTRVSPTVVVVRSKGRDVSAGGVTRFTETGS
Ga0105104_1083226523300009168Freshwater SedimentMKKTLTVMLAAAMLPALAPASAPAQSPSLDELFNRVSPTVVVVRAKGRDVKAAGVTRFNETGSGVLIS
Ga0105248_1327981313300009177Switchgrass RhizosphereMKKLVAVSVGAAVLAAFVPVAQAQSLEEVFRQVNPSVVVVRAKGRDVGAGGIFRFTETGSGVLISDVTDGSPADRA
Ga0105249_1107083223300009553Switchgrass RhizosphereMNKLLALILAAAVLPAFAPAARAQNLEEIFRKASPTVVVVRSKGRDVGAGGVTHFK
Ga0126374_1029323313300009792Tropical Forest SoilMRNVLAALILVAVWPGVTPPARAQNLDDIFRKVNASVIVVRSRGRDVGAGGVTRFRETGSGVL
Ga0126373_1094452613300010048Tropical Forest SoilMKILLAAILAVAALPVVSPTADAQNLEEIFYKVSPTVVVVRAKGRDVGVDGIKHFTETGSGVLISED
Ga0134088_1038911713300010304Grasslands SoilMKKILAVILVAAVLPALAPAAWAQNLEDIFRKVNPSVVVVRSKGRDVSAGGVTRFN
Ga0134071_1063723423300010336Grasslands SoilMKKVLAVILAAAVLPAFAPAARAQNLDDTFRTVSPFVVVVRSKGRDVGASGVTRFN
Ga0134062_1062948023300010337Grasslands SoilMKVLAVILAAAALPAFAPAAQAQNLDDIFRKVNPSVIVVRAKGRDVGAGGVTRFNETGSGVVISAS
Ga0126370_1137894523300010358Tropical Forest SoilMKKVFAVTLAAAVLTAVAPPARAQNLDEVFRKVNPSVIVVRSKGRDVGAGGITRFR
Ga0126377_1051866723300010362Tropical Forest SoilMKKVLTVILAAIVLPALVPAAGAQNLDEIFHKVNPTVIVVRSRGRDVGAGGVTRFKEIGSGFLIS
Ga0126381_10025874233300010376Tropical Forest SoilMKKAVVVVLAAALSAFAVPALAPPARAQNLDEVFRKVSPVVVVVRSKGRDVRASGITHFNETGSGVLISPDG
Ga0134122_1041764833300010400Terrestrial SoilMNKILAVVLAAAALSAFAPAARAQNLEEIFRKVNASVVVIRSKGRDVGAGGV
Ga0137462_118589513300011421SoilMKKLLGVALAAALLPAFSLEARAQNLEETFRRVSPTVVIVRAKGRDVGAGGVTSFKETGSGVLVSSDG
Ga0137389_1044267613300012096Vadose Zone SoilMNKILAVILAAAVLPVFAPAAQAQNRDDIFRKVNPSVVVVRAKGRDVSAGGVTRFNETGSGVLISGS
Ga0137363_1147229713300012202Vadose Zone SoilMKKVLAVILVAAVLPAFAPAARAQNLDEIFRKVNPSVIVVRSKGRDVGAGGVTRFNETGSGF
Ga0137399_1030959523300012203Vadose Zone SoilMRKMLAVMLAAAVLPALAPAAGAQNLDETFRKVSPLVVVVRSKGRDVGASGITRFNETGSGVL
Ga0137380_1046157113300012206Vadose Zone SoilMKKLLAVIVAVAVLPAFAPAAQAQKQLDDIFREVSPTVVVVRAKGRDVGAGGVTRFN
Ga0137378_1017645613300012210Vadose Zone SoilMKVLAVILAAAVLPAFAPAARAQNLDDIFRKVNPSVIVVRSKGRDVSAAGITRFNETGSGVLISSSGR
Ga0137371_1142073513300012356Vadose Zone SoilMKKLLAVIVAVAVLPAFAPAAQAQKQLDDIFREISPTVVVVRAKGRDVGTGGITRFNE
Ga0137361_1005061243300012362Vadose Zone SoilMNKILCMMVVVATLPAFAPAAQAQSPSLDDLFTRVSPTVVVVRSKGRDVSPGGVTRFNETGSGVL
Ga0137361_1081807013300012362Vadose Zone SoilVDELKIQPEGAMKKVLAVILAAAVLPAFAPAARAQSPSLDDLFTRVSRTVVVVRSKGRDVSAGGITRFNETGS
Ga0137390_1074683633300012363Vadose Zone SoilMKKPLAVILAVAVLPAFAPAAKAQNLDDIFREVNPSVVVVRAKGRDVGAAGITRFNETGSGVLIS
Ga0137395_1021878813300012917Vadose Zone SoilMLKKVLPVMAVVAAISPAFAPAPATAQSQSLDDLFNRVSPTVVVVRAKGRDVNAGGITRFNETGSGGLI
Ga0137396_1093325223300012918Vadose Zone SoilMKKVLAVILAAAVLLAFAPAARAQNLDDTFRKVSPFVVVVRSKGRDVGASGVTRFNETGSGVLISSDGRVM
Ga0137359_1002564933300012923Vadose Zone SoilMTNLFVLILLSAALLPAFVSAAQAQNLDEVFRKASPTVVVVRAKGRDVKAEGVTRFTETGSGVLILQ*
Ga0137419_1162568523300012925Vadose Zone SoilMTKVLAVILAAAVLPAFAPAARAQNLDDTFRKVSPFVVVVRSKGRDVGASGVTRFNETGSGVL
Ga0137419_1175856913300012925Vadose Zone SoilMTNLFVLILLSAALLPAFVSAAQAQTAGLDDLFTRVSPTVVVVRSKGRDVGASGVTRFNETGSGVL
Ga0126375_1075219913300012948Tropical Forest SoilMKKVLTVILAAIVLPALVPAAGAQNLDEIFRRVNPSVIVVRSKGRDVGAGGVTRFNEIGS
Ga0163162_1055910043300013306Switchgrass RhizosphereMNKILAVVLAAAALSAFAPAARAQNLEEIFRKVNASVVVIRSKGRDVGAGGVTRFKETGSGV
Ga0163162_1198836613300013306Switchgrass RhizosphereVESPVKTFLALIVAAGLLPAIVPGAAAQNLDEVFRKTSPTVVVVRAKGRDVNASGVTRFAETAPGS*
Ga0157375_1114981213300013308Miscanthus RhizosphereVLAVAVLPAFAPAAQAQNLDEVFRKASPTVVVVRSKGRDVNASGVVRFTETGSGVLISDDGRV
Ga0137405_115121013300015053Vadose Zone SoilMKKVLAVILAAAVLPAFAPAARAQNLDDTFRKVSPFVVVVRSKGRDVGASGVTRFNETGSGVLISSDGRVSRVMT
Ga0137405_115316733300015053Vadose Zone SoilLKVHPEDAMKKVLAVILAAAMLPAFAPAALAQSQSLDDLFTRVSPSVVVVRSKGRDVSAGGVTRLNETGSG
Ga0134085_1009702613300015359Grasslands SoilMKVLAVILAAAALPAFAPAAQAQNLDDIFRKVNPSVIVVRAKGRDVGAGGVTRFNETGS
Ga0132258_1165496743300015371Arabidopsis RhizosphereMLLVAVTLPALTPTLALAQSQSLDDLFNPVSPTVVVVRAKGRDVGAAGITRFN
Ga0132257_10157382823300015373Arabidopsis RhizosphereMLLVAVTLPALTPTLALAQSQSLDDLFNRVSPTVVVVRAKGRDVGAAGITRFNETGSGVLISTDGRVMTA
Ga0132255_10362307023300015374Arabidopsis RhizosphereMKKRFALALAVAVLTCFVPAARAQNLEEVFRKASPTVVVVRAKGRDVTAGGITRFTE
Ga0163161_1207742323300017792Switchgrass RhizosphereMKTFLALTLAAAVLPAFAPAAGAQSLEEIFRKVSPTVVVVRSKGRDVGAGGVTHFK
Ga0187779_1049321313300017959Tropical PeatlandMKKLIAAMLAATMWPAFASPAHAQNLDEIFGKVNASVIVVRSRGRDVTAGGVTRFKEIGSGF
Ga0187766_1013353533300018058Tropical PeatlandMKTNTILAAILAAAVVPVWAGAVSGQNLDDVFRMVNASVVVVRAHGRDVGAGGVIRFTETGSGVLVSA
Ga0184637_1006059613300018063Groundwater SedimentMKTVLAVILAAAVLPAFAPAAWAQNLDDIFRKVNPSVVVVRAKGRDVSAGGVTRFNETGSGV
Ga0137408_129067833300019789Vadose Zone SoilMKKVLAVILAAAVLPAFASAARAQNLDDTFRKVSPFVVVVRSKGRDVGASGVTRFNETGSGVLISSDGR
Ga0210399_1068536633300020581SoilVKKVLAVILAAAELPAFASAAEAQSLDEIFTRVSPSVVVVRSKGRDVSAEGVTHFNETG
Ga0210384_1049109013300021432SoilMKNLLVGILAAAMLPAFAPAAQAQNLDDVFRKVNASVLVIRAKGRDVTAEGIIRFNETGSGVLISDRG
Ga0210138_104512923300025580Natural And Restored WetlandsMKKLLAVMFAAAVLPAFAPAAQAQNLDDLFRKVNPSVVVIKAKGRDVSAGGVTRFNETGSGVLVS
Ga0207646_1014802853300025922Corn, Switchgrass And Miscanthus RhizosphereMKKIPSVMVIVATLPAFAPAAQAQSPSLDDLFTRVSPTVVVVRSKGRDVSTGGVTP
Ga0207668_1164565623300025972Switchgrass RhizosphereGARMKTLLALVLAVAVLPAFAPAAQAQNLDDVFRKASPAVVVVRAKGRDVNASGVYRFTETGSGVLISGDGRVMTTRGRVGLGAPPPT
Ga0207676_1164018213300026095Switchgrass RhizosphereMSVITCCERAVHGKVPMNKVLALILAAVALPAVAPAAWAQNLEEIFRKASPTVVVVRSKGRDISAGGVTHFNETGSGVL
Ga0209438_104532823300026285Grasslands SoilMKKVLAAILAAAVLPAFAPAARAQNLDDTFRKVSPFVVVVRSKGRDVGASGVTRFNETGS
Ga0209158_131071713300026333SoilMKKLLAVILAVAVLPAFAPAARAQNLDDTFRKVSPFVVVVRSKGRDVGASGVTR
Ga0268265_1034133113300028380Switchgrass RhizosphereMQKSLAAVLAAAMVLSGVPAAPQAQRESLDDLFTRVSPTVVVVRAKGRDVMATGVTRFNETGSGVL
Ga0268264_1244466423300028381Switchgrass RhizosphereMKRLLTVALAAAVLPAFALEARAQNLEETFRRVSPSVVVVRAKGRDVGAGGVT
Ga0247820_1007013013300028597SoilMKTLLALILAVAVLPAFAPAAQAQNLDDVIRKASPAVVVVRAKGRDVNASGVYRFTETGSGVLISG
Ga0310888_1039732913300031538SoilMKQHAPTNTRLTLVLAVALLPVFAPAAGAQNLDEVFRKASPTVVVVRSKGRDVKASGVVHFKETGSGVL
Ga0318515_1016290213300031572SoilMKKSLVVILAAALAAFALPAHAPPAGAQNLDEVFGKVSPVVVVVRSKGRDVRASGITHFNETGSGVLISPDGRH
Ga0307469_1136671133300031720Hardwood Forest SoilMKKILSVMVVVATLPAFASAAQAQSPNLDDLFTRVSPTVVVVRSKGRDVSAGGITRFNE
Ga0307468_10116134823300031740Hardwood Forest SoilMPRVPISRLLAVILAAALLPVFAPAAHAQNLEEIFRKVSPSVVVVRAKGRDVGAGGVTRF
Ga0307468_10129974823300031740Hardwood Forest SoilMKRLTLVLAVAVLPAFAPAAGAQNLDEVFRKASPTVVVVRSKGRDVKASGVVHFKETGSGVLISGDGRVM
Ga0307468_10137441213300031740Hardwood Forest SoilMRKVLAVILATTTLAVVVPAAQAQNLDDTFRKVSPFVVVVRSKGRDVGASGVTRFNETGSGVLISSDGRVMT
Ga0307468_10211124213300031740Hardwood Forest SoilMNNDLALILAVAVLPAFAPAARAQSLEEIFRKVSPTVVVVRSKGRDVGAGGFTRFNETGS
Ga0307468_10239219513300031740Hardwood Forest SoilMKKALTVMLAAAMLPALAPASAPAQSQSLDELFNRVVVVRAKGREVKAAGITCFNRDP
Ga0318554_1010994613300031765SoilMKKSLVVILAAALAAFALPAHAPPAGAQNLDEVFGKVSPVVVVVRSKGRDVRAS
Ga0318509_1046743323300031768SoilMKKLVAVIVGVAVLPTFAPVAQAQGLEEVFRQVNPSVVVVRAKGRDVGAGGII
Ga0307473_1034986113300031820Hardwood Forest SoilMKTRLALILAALVLPAFAPAAQAQNLDEVFRKASPTVVVVRAKGRDVNASGIIR
Ga0318512_1026250423300031846SoilMKKSLVVILAAALAAFALPAHAPPAGAQNLDEVFGKVSPVVVVVRSKGRDVRASG
Ga0318544_1013700713300031880SoilMKKALALLLAAVVPSFFSPITHAQSLDDIFRQVNPSVIVVRSKGRDVGAGGLTRFNEI
Ga0318536_1016406323300031893SoilMKKSLVVILAAALAAFALPAHAPPAGAQNLDEVFGKVSPVVVVVRSKGRDVRASGITHFNETGSGVLISPDGRVMR
Ga0307470_1044948013300032174Hardwood Forest SoilMLGKVLTVMVVAAILPAFVPAPARSQSQSLDDLFNRVSPTVVVVRAKGRDVGAA
Ga0307471_10023557713300032180Hardwood Forest SoilMKKILSVMVVVATLPAFASAAQAQSPNLDDLFTRVSPTVVVVRSKGRDVSAGGVTRFNETGSGVLVSS
Ga0307471_10055182543300032180Hardwood Forest SoilMKKLFAMIFAAAVLPAFAPAAHAQNLDEVFRKASPTVVVVRAKGRDVNASGFVRFTETGS
Ga0307472_10128216523300032205Hardwood Forest SoilMKQHAPTKTRLTLVLAVALLPAFAPTVQAQNLDEVFRKASPTVVVVRSKGRDVKASGIVRFTETGSGVLVSDDGRV
Ga0307472_10202729223300032205Hardwood Forest SoilMPMKVLAVILVAAVLPAFAPAARAQNLDEIFRKVNPSVIVVRAKGRDVSTSGVTRFNETGSGFLISGSGRVMTAAH
Ga0335082_1165554513300032782SoilVKKVCAAILAAAVLPVFAPAVGAQSLEDIFRQVSPCVIVVRSKGRDVKSGGVTRFNETGS
Ga0326726_1131668533300033433Peat SoilMKKVLAVILAVAMLPAFAPAARAQNLDDIFRKVKASVVVVRSRGRDVKAGGVTRCVEHGPGGGR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.