NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F104860

Metagenome Family F104860

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104860
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 73 residues
Representative Sequence VGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPPDGW
Number of Associated Samples 88
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 16.67 %
% of genes near scaffold ends (potentially truncated) 12.00 %
% of genes from short scaffolds (< 2000 bps) 9.00 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction Yes
3D model pTM-score0.21

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (88.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(19.000 % of family members)
Environment Ontology (ENVO) Unclassified
(36.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(47.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 25.00%    β-sheet: 2.00%    Coil/Unstructured: 73.00%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.21
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF13561adh_short_C2 37.00
PF00106adh_short 22.00
PF12242Eno-Rase_NADH_b 5.00
PF04455Saccharop_dh_N 3.00
PF00027cNMP_binding 2.00
PF00296Bac_luciferase 2.00
PF07676PD40 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG1915Uncharacterized conserved protein AF1278, contains saccharopine dehydrogenase N-terminal (SDHN) domainFunction unknown [S] 3.00
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 2.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A88.00 %
All OrganismsrootAll Organisms12.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002908|JGI25382J43887_10485905All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium523Open in IMG/M
3300005540|Ga0066697_10028190All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium3097Open in IMG/M
3300005568|Ga0066703_10015462All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium3817Open in IMG/M
3300006796|Ga0066665_10092471All Organisms → cellular organisms → Bacteria2200Open in IMG/M
3300010326|Ga0134065_10174473All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium764Open in IMG/M
3300012096|Ga0137389_10463315All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1086Open in IMG/M
3300012198|Ga0137364_11264794All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium551Open in IMG/M
3300012203|Ga0137399_11048733All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium686Open in IMG/M
3300015356|Ga0134073_10131117All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium774Open in IMG/M
3300025981|Ga0207640_11809987All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium552Open in IMG/M
3300026333|Ga0209158_1227381All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium642Open in IMG/M
3300031846|Ga0318512_10471318All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium635Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil19.00%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil13.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil12.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere8.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil6.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil5.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.00%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment2.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.00%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil2.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil2.00%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere2.00%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil2.00%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment1.00%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands1.00%
SedimentEnvironmental → Aquatic → Marine → Sediment → Unclassified → Sediment1.00%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil1.00%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands1.00%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil1.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.00%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks1.00%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere1.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.00%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere1.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.00%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere1.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2065487018Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005340Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaGEnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005875Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006853Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300014308Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D1EnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300022226Sediment microbial communities from San Francisco Bay, California, United States - SF_May12_sed_USGS_13EnvironmentalOpen in IMG/M
3300025930Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes)EnvironmentalOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025936Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025941Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025971Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025981Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026075Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026314Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300027706Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen15_06102014_R2 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028587Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3EnvironmentalOpen in IMG/M
3300031834Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G12_0EnvironmentalOpen in IMG/M
3300031846Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f19EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300033419Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day5_noCTEnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPINP_016980402065487018SoilLSGSDPSARLARVGGKGDRPTVENRWWVDQGKDAKGNPVPRPPAQFGHRFFRTGDGFEQDRARIRLTDSNLKWLEEANPAPGAWA
JGI25382J43887_1048590523300002908Grasslands SoilVGGKGDRPTVENRWWVDQGKDAKGRPLERPPAQFGGRFFRTGEGFEQDRAQRRLTDSNLKWLQEANPPADAWAVVRMLPGDHFPAARFEKI
Ga0063356_10497503313300004463Arabidopsis Thaliana RhizosphereVDQGKAPRDGAPVERPPPQFGKVFFRSGDGFEQDRARLRLTDSNLKWLAESNPPAD
Ga0066673_1015390723300005175SoilVGGKGDRPNVENRWWVDHGKDAHGQPIERPPAQFGGRFFRTGADFEQERARLRLTDSNLKWLEE
Ga0066688_1048056323300005178SoilMGGKGDRPTVENRWWVDQGKDAAGRPIERPPPQFGGRFFRTGDGFQQERAQRRLTDSNLKWLEEA
Ga0070689_10149096213300005340Switchgrass RhizosphereMGGKGDRPSLENRWWVDQGKDGQPIERPPATFGNRFFRTAAGFDQDRAQRRLTDLNLKWFTEA
Ga0070703_1022226213300005406Corn, Switchgrass And Miscanthus RhizosphereMGGKGDRPTVENRWWVDQGKDARGGPVERPPPSFGKVFFRSGDGFEQDRARLRLTDSNLKWLE
Ga0066689_1077878513300005447SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWL
Ga0070697_10018726623300005536Corn, Switchgrass And Miscanthus RhizosphereMGGKGDRPNVENRWWVDQGKDAHGQPVERPPAQFGGRFFRTGADFEQERARLRLTDSNLKWLEE
Ga0066697_1002819013300005540SoilMGGKGDRPTVENRWWVDQGKDAAGRPIERPPPQFGGRFFRTGDGFQQERAQRRLTDSNLKWLEEADPAVDAWAVVRM
Ga0066701_1037118713300005552SoilVGGKGDRPTVENRWWVDQGKDATGRPIERPPAQFGGRFFRTGEGFEQERAQRRLTDSNLKWLEEANPP
Ga0066695_1009169923300005553SoilVGGKGDRPNVENRWWVDHGKDAHGQPIERPPAQFGGRFFRTGADFEQERARLRLTDSNLKWLEEGAPSPGGWGLVHALADER
Ga0066704_1050691323300005557SoilVGGRGDRPTVENRWWVDQGKDAKGNPIPRPPAQFGHRFFRSGDGFEQDRARLRLTDSNMKWLEEAAPPAGGWAVVHVL
Ga0066703_1001546263300005568SoilMGGKGDRPTVENRWWVDQGKDAAGRPIERPPPQFGGRFFRTGDGFQQERAQRRLTDSNLKWLEEADPAVDGWAVVRMLPEEHFPASR
Ga0066903_10895769323300005764Tropical Forest SoilMGGKGDRPSLENRWWVDQGKGGRPIERPPARYGNTFFRTGTGFQQDRAQRRLTDSNLKWFDEAPGEPGAWT
Ga0075293_105787713300005875Rice Paddy SoilVGGKGDRPTVENRWWVDQGKDAKGQPIERPPAQFGGRFFATGAGFEQNRARIRLTDSNLKWLEEAAPAPDAWAVVRMLPGERVP
Ga0066665_1009247113300006796SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEANPPPDGWV
Ga0066659_1098542823300006797SoilVGGKGDRPTVENRWWVDQGKDAKGRPLERPPAQFGGRFFRTGEGFEQDRAQRRLTDSNLKWLQEANPPADAWAVVRMLPGD
Ga0075421_10248781523300006845Populus RhizosphereVGGKGDRPTVENRWWVDHGKDAKGNPVPRPPAQFGHRFFRTGDGFEQDRARIRLTDSNLKWLEEATPAPG
Ga0075431_10047369823300006847Populus RhizosphereVENRWWVDQGKEPRGGAVERPPPQFGKTFFRSGDGFEQDRARLRLTDSNLKWLEEASPPADGWAYVQVLAGERTPESRI
Ga0075420_10061092013300006853Populus RhizosphereMGGKGDRPSLENRWWVDQGKDGQPIERPPATFGNRFFRTAAGFEQDRAQRRLTDLNLKWFTEASPEPGGW
Ga0075425_10295260513300006854Populus RhizosphereVGGKGDRPTVENRWWVDQGKEGQGGVRPHATFGRRFFKTGDGFEQDRARLRLTDSNLKWLQEANPPPNGWIVVRPVSGERMP
Ga0075425_10317527013300006854Populus RhizosphereMGGKGDRPSLENRWWVDQGKDGQPIERPPATFGNRFFRTAAGFEQDRAQRRLTDLNLKWFTEAS
Ga0075434_10183032413300006871Populus RhizosphereVGGKGDRPTVENRWWVDQGKEARGGSPAERPPPAFGKIFFRSGDGFEQDRARLRLTDSNLKWLEEA
Ga0075424_10133604613300006904Populus RhizosphereMGGKGDRPSLENRWWVDQGKDGQPMDRPPAQFGNGFFRSAAGFHQERAQRRLTDLNLKWFDEAQAEPGT
Ga0066710_10012316863300009012Grasslands SoilVGGKGDRPTVENRWWVDQGKDAAGRPIERPPAQFGGRFFRTGEGFEQERAQRRLTDSNLKWLEEAN
Ga0066710_10037596213300009012Grasslands SoilVGGKGDRPTVENRWWVDQGKDATGRPLERPPAQFGGRFFRTGEGFEQDRAQRRLTDSNLKWLEEANPPADGWAV
Ga0066710_10294065513300009012Grasslands SoilMGGKGDRPSVENRWWVDHGKDARGNPLPRPPAAFGSRFFRTGEGFEQERARVRLTDSNLKWLEEA
Ga0066709_10360912323300009137Grasslands SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPATFGGRFFRTGEGFEQERARVRLTDSNLKWLEEANPPPDGWVVV
Ga0099792_1048435923300009143Vadose Zone SoilMGGKGDRASIENRWWVDQGKDGQPLERPPAQFGNMFFRTGLGFEQDRAQRRLTDLNLKWFDEAR
Ga0114129_1030477913300009147Populus RhizosphereVGGKGDRPTVENRWWVDQGKDAKGNPVPRPPAQFGHRFFRTGDGFDQDRARIRLTDSNVKWLEEATPAPGSWAVVR
Ga0126384_1041254113300010046Tropical Forest SoilMGGKGDRPTVENRWWVDQGKDARGAPVPRPPAQFGHRFFRTGDGFEQDRARMRLTDSNLKWLEEASPTPGEWAMVHMLE
Ga0134070_1048449613300010301Grasslands SoilVVGKGDRPTVENRWWVDQGKDATGRPIERPPAQFGGRFFRTGEGFEQERAQRRLTDSNLKWIEEANPPADGW
Ga0134109_1028136613300010320Grasslands SoilVGGKGDRPTVENRWWVDQGKDATGRPIERPPAQFGGRFFRTGEGFEQERAQRRLTDSNLKWLEEANPPADG
Ga0134086_1016684723300010323Grasslands SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNL
Ga0134065_1017447323300010326Grasslands SoilVGGKGDRPNVENRWWVDHGKDAHGQPIERPPAQFGGRFFRTGADFEQERARLRLTDSNLKWLEEGAPSPGGWGLIHALADERFPEKR
Ga0134071_1032500723300010336Grasslands SoilVGGKGDRPAVENRWWVDHGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPLDG
Ga0126377_1150686923300010362Tropical Forest SoilMGGKGDRPNVENRWWVDHGKDAHGQPVERPPAQFGGRFFRTGADFEQERARLRLTDSNLKWLEEGAPPPGSWGVIRALADE
Ga0136847_1250091423300010391Freshwater SedimentVGGKGDRPTIENRWWVDQGKDAKGLPLERPPAPFGGRFFRTGRGFEQEQARARLTDSNLKWLEE
Ga0136847_1314535343300010391Freshwater SedimentMGGKGDRPSLENRWWVDQTKDKDGRPIDRPPAQFGNGFFRTAAGFDQDRAQRRLTDLNLKWFDEVRPEPGRWAVLHLLE
Ga0134127_1085115013300010399Terrestrial SoilVGGKGDRPTVENRWWVDQGKAPRDGAPIERPPPQFGKVFFRSGDGFEQDRARLRLTDSNLKWLE
Ga0134122_1226975113300010400Terrestrial SoilMGGKGDRASLENRWWVDQGKDGQPLERPPALFGHRFFRTGAGFQQDRAQRRLTDLNLKW
Ga0137389_1046331523300012096Vadose Zone SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPPDGWVVVHVLAGERFPEGRIE
Ga0137364_1126479413300012198Vadose Zone SoilMGGKGDRPTVENRWWVDQGKDAAGRPIERPPPQFGGRFFRTGDGFQQERAQRRLTDSNLKWLEEADPAVDGWAVVRMLPEEHFPAGRFER
Ga0137365_1040149023300012201Vadose Zone SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEA
Ga0137399_1104873313300012203Vadose Zone SoilVGGKGDRPAVENRWWVDQGKDAKGNPVPRPPAQFGHRFFRTGDGFEQDRARLRLTDSNLKWLEESAPPGGGWGVVRVLTDERVPDARVEK
Ga0137371_1051406913300012356Vadose Zone SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEE
Ga0137419_1154223813300012925Vadose Zone SoilVGGKGDRPNVENRWWVDHGKDAHGQPIERPPAQFGGRFFRTGADFEQERARLRLTDSNLKWLEEGAASPGGWGLIHALTDERFP
Ga0134077_1032070513300012972Grasslands SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEANPPPDGWVV
Ga0134076_1008271913300012976Grasslands SoilVGGKGDRPNVENRWWVDHGKDAHGQPIERPPAQFGGRFFRTGADFEQERARLRLTDSNLKWLEEGAPSPGGWGLIHALADERFPEK
Ga0134076_1016326323300012976Grasslands SoilVGGKGDRPTVENRWWVDQGKDAKGRPLERPPAQFGGRFFRTGEGFEQDRAQRRLTDSNLKWL
Ga0157375_1055867323300013308Miscanthus RhizosphereMGGKGDRPSLENRWWVDQGKDGQPIERPPATFGNRFFRTAAGFQQDRAQRRLTDLNLKWFTEASPEPGGK
Ga0134079_1040710023300014166Grasslands SoilVGGKGDRPNVENRWWVDHGKDAHGQPIERPPAQFGGRFFRTGADFEQERARLRLTDSNLKWLEEGAPSPGGWGLVHALAD
Ga0075354_112614523300014308Natural And Restored WetlandsMGGKGDRPSLENRWWVDQSKDKDGRPIDRPPALYGNGFFRTAAGFDQDRAQRRLTDL
Ga0137405_108863743300015053Vadose Zone SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPLDGWVVVHVLAGERFP
Ga0137405_123648813300015053Vadose Zone SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPPD
Ga0137405_126465533300015053Vadose Zone SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPPDGWVVVHVLAGERFPEG
Ga0137418_1098229713300015241Vadose Zone SoilMGGKGDRPTIENRWWVDQGKDAAGRPIERPPAQFGGRFFRTGDGFQQERAQRRLTDSNLKWLEEADPAVDGWAVVRMLP
Ga0134073_1013111713300015356Grasslands SoilVGGKGDRPAVENRWWVDHGKDAKGNPLPRPPAAFGGRFFRTGEGFDQERARLRLTDSNLKWLEESGPPPDGWVVVRTLAGEHFPEGR
Ga0132256_10092191823300015372Arabidopsis RhizosphereVGGKGDRPTVENRWWVDHGKDAKGNPVPRPPAQFGHRFFRTGDGFEQDRPRIRLTASNLKWLEEATPAPGSWTIVHVL
Ga0134074_123087813300017657Grasslands SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPPDGWVVVH
Ga0134083_1009103213300017659Grasslands SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPATFGARFFRSGDGFEQERARVRLTDSNLKWLEE
Ga0187894_1043495723300019360Microbial Mat On RocksVENRWWVDQQRDAQGVPLARPPAQYGGRFFRTGAGFEQDRAQRRLTDLNLKWFDE
Ga0179592_1031986313300020199Vadose Zone SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGARFFRTGEGFEQERARVRLTDSNLKWL
Ga0224512_1018872513300022226SedimentMGGQGDRPSLENRWWVDQQKIESDGLERPEAKYGKRFFRTGDGYDQDTAQRRLTDLNLKWFDEVETSPGKWAVIYTLPKQRV
Ga0207701_1160145123300025930Corn, Switchgrass And Miscanthus RhizosphereMGGKGDRPSLENRWWVDQGKDGQPIERPPATFGNRFFRTAAGFDQDRAQRRLTDLNLKWFTEASPEPGG
Ga0207706_1115327623300025933Corn RhizosphereVGGQGDRPNLEGRWWIDQQKIDPKTGVPAERPPAKFGQRFFHTGDGYDQDRAQRRLTDLNLKWFE
Ga0207670_1055499513300025936Switchgrass RhizosphereMGGKGDRPSLENRWWVDQGKDGQPIERPPATFGNRFFRTAAGFDQDRAQRRLTDLNLKWFTEASPEP
Ga0207711_1152955613300025941Switchgrass RhizosphereMGGKGDRPSLENRWWVDQGKDGQPIERPPATFGNRFFRTAAGFQQDRAQRRLTDLNLKWFTEAS
Ga0210102_103062313300025971Natural And Restored WetlandsVGGKGDRPSLENRWWIDERRDKDGKPIDRPPPQFGQRFFHTGRGFEQDRAQRRLTDLNLKWFDEARAEPGTWA
Ga0207640_1180998713300025981Corn RhizosphereVGGKGDRPTVENRWWVDQGKAPRDGAPVERPPPQFGKVFFRSGDGFEQDRARLRLTDSNLKWLAENNPPADAWTYVQVLPGERTAES
Ga0207708_1138183823300026075Corn, Switchgrass And Miscanthus RhizosphereMGGKGDRASLENRWWVDQGKDGQPLERPPAQFGHRFFRTGAGFQQDRAQRRLTDLNLKWFDEARPEPGGWAVLH
Ga0207648_1210993913300026089Miscanthus RhizosphereVGGKGDRPTVENRWWVDHGTDAKGNPVPRPPAQFGHRFFRTGDGFEQDRARIRLTDSNLKWLEEATPAPGSWAIVYVLP
Ga0207675_10034036613300026118Switchgrass RhizosphereMGGKGDRPSLENRWWVDQGKDGQPIERPPATFGNRFFRTAAGFEQDRAQRRLTDLNLKWFTEASPEPGGWAVLHVLE
Ga0209237_108807913300026297Grasslands SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPPDGWVVVHVLA
Ga0209268_103565733300026314SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPPDGWGVV
Ga0209155_119912913300026316SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEANP
Ga0209472_103687243300026323SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPADGWVVVHVLAGERFPE
Ga0209472_104283943300026323SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEANPPPDGWVVVRVLA
Ga0209266_114752923300026327SoilMGGKGDRPTVENRWWVDQGKDAAGRPIERPPPQFGGRFFRTGDGFQQERAQRRLTDSNLKWLEEADPAV
Ga0209802_104646243300026328SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPATFGGRFFRTGEGFEQERARVRLTDSNLKWLEEANPPPDGWVVVRVLAAERFPEG
Ga0209473_104446343300026330SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPATFGGRFFRTGEGFEQERARVRLTDSNLKWLEEANPP
Ga0209158_122738113300026333SoilVGGKGDRPTVENRWWVDQGKDATGRPLERPPAQFGGRFFRTGEGFEQDRAQRRLTDSNLKWLEEANPPADGWAVVRTLPGDHFPAPRF
Ga0209804_116145733300026335SoilMGGKGDRPTVENRWWVDQGKDAAGRPIERPPAQFGGRFFRTGDGFQQERAQRRLTDSNLKWLEEADPAVDGWAVVRM
Ga0209474_1039013323300026550SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDAN
Ga0209581_107537113300027706Surface SoilMLGCVGGKGDRPTVENRWWVDQGKDAAGRPIVRPQARFGHRFFRTGEGFEQTRAQRRLTDLNLKWLDEAPGP
Ga0209488_1059142713300027903Vadose Zone SoilMGGKGDRASIENRWWVDQGKDGQPLERPPAQFGNMFFRTGLGFEQDRAQRRLTDLNLKWFDEARPEPGRWAVMHL
Ga0247828_1086847613300028587SoilVENRWWVDQGKAPRDGTPIERPPPQFGKIFFRSGDGFEQDRARLRLTDSNLKWLEEANPPADGWTYVQVLAGERSAESR
Ga0315290_1172739923300031834SedimentMGGKGDRASLENRWWVDQGKDGQPLERPPAQFGQMFFRTGVGFQQDRAQRRLTDLNLKWFDEAQGEPGSWA
Ga0318512_1047131823300031846SoilVGGKGDRPTVENRWWVDQQKDDSGRVIERPPAQFGGRFFHTGAGFEQERAQRRLTDLNLKWFQEAAGPPGAWTDIALLPKERVAPRRVERAKAALPIP
Ga0307471_10020062313300032180Hardwood Forest SoilMGGKGDRPNVENRWWVDHGKDAHGQPVERPPAQFGGRFFRTGADFEQERARLRLTDSNLKWLDEGAPPP
Ga0307471_10264190113300032180Hardwood Forest SoilVENRWWVDQGKDAKGNPVPRPPAQFGHRFFRTGEGFEQDRARLRLTDSNLKWLEEASPPAGGWAFV
Ga0307471_10409841523300032180Hardwood Forest SoilVGGKGDRPTVENRWWVDQGKDAKGDPIPRPPATFGGRFFRTGAGFEQDRASVRLTDLNLKWFEEANPPTDGWAN
Ga0307472_10224388413300032205Hardwood Forest SoilVGGKGDRPTVENRWWVDQGKDAAGRPIERPPAQFGGRFFRTGEGFQQERAQRRLTDSNLKWLEEADPAEDAWAVVRMLPGEHFPTG
Ga0307472_10276056713300032205Hardwood Forest SoilVGGKGDRPAVENRWWVDQGKDAKGNPLPRPPAAFGGRFFRTGEGFEQERARVRLTDSNLKWLEEASPPPDGW
Ga0306920_10399201723300032261SoilVGGKGDRPTVENRWWVDQQKDDSGRVIERPPAQFGGRFFHTGAGFEQARAQRRLTDLNLKWFQE
Ga0316601_10260290023300033419SoilVGGQGDRPNLEGRWWIDQQKTGVPYERPPAKFGQRFFQTGDGYDQDRAQRRLTDLNLKWFDEVDAAEGTWGVL
Ga0326726_1074283113300033433Peat SoilMGGKGDRPVIDRWWVDQGKDAKGNPIPRPPAQFGGRFFRTGEGFEQERARTRLTDSNLKWLEEGSPPADGWTLVHM
Ga0326726_1234976613300033433Peat SoilMVVAPATLGYARRVGGKGDRPNVENRWWAELGKDARGVPIERPPAQFGHRFFRTGDGFEQDRARLRLTDSNTKWLEEATPGPDGWGLVQVFP
Ga0316628_10255279613300033513SoilVGGQGDRPNLEGRWWIDQQKTGVPYERPPAKFGQRFFQTGDGYDQDRAQRRLTDLNLKWFDEVDAAEGTW


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.