NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F094988

Metagenome Family F094988

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F094988
Family Type Metagenome
Number of Sequences 105
Average Sequence Length 62 residues
Representative Sequence MSRKPQQPPQRTKWHVYKLAAKREWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRR
Number of Associated Samples 69
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 8.65 %
% of genes near scaffold ends (potentially truncated) 26.67 %
% of genes from short scaffolds (< 2000 bps) 78.10 %
Associated GOLD sequencing projects 63
AlphaFold2 3D model prediction Yes
3D model pTM-score0.60

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (83.810 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(45.714 % of family members)
Environment Ontology (ENVO) Unclassified
(38.095 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.381 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 17.44%    β-sheet: 25.58%    Coil/Unstructured: 56.98%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.60
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 105 Family Scaffolds
PF07813LTXXQ 3.81
PF02265S1-P1_nuclease 1.90
PF13474SnoaL_3 1.90
PF01323DSBA 1.90
PF00072Response_reg 0.95
PF12852Cupin_6 0.95
PF13430DUF4112 0.95
PF01381HTH_3 0.95
PF08448PAS_4 0.95
PF13546DDE_5 0.95
PF01436NHL 0.95
PF01068DNA_ligase_A_M 0.95
PF12680SnoaL_2 0.95
PF03466LysR_substrate 0.95
PF01710HTH_Tnp_IS630 0.95
PF07536HWE_HK 0.95
PF07969Amidohydro_3 0.95

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 105 Family Scaffolds
COG3678Periplasmic chaperone Spy, Spy/CpxP familyPosttranslational modification, protein turnover, chaperones [O] 15.24
COG1423ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) familyReplication, recombination and repair [L] 0.95
COG1793ATP-dependent DNA ligaseReplication, recombination and repair [L] 0.95
COG3415CRISPR-associated protein Csa3, CARF domainDefense mechanisms [V] 0.95
COG3920Two-component sensor histidine kinase, HisKA and HATPase domainsSignal transduction mechanisms [T] 0.95
COG4251Bacteriophytochrome (light-regulated signal transduction histidine kinase)Signal transduction mechanisms [T] 0.95


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A83.81 %
All OrganismsrootAll Organisms16.19 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002907|JGI25613J43889_10182775Not Available557Open in IMG/M
3300002914|JGI25617J43924_10106083Not Available1001Open in IMG/M
3300002917|JGI25616J43925_10059581All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1633Open in IMG/M
3300002917|JGI25616J43925_10194708Not Available787Open in IMG/M
3300005174|Ga0066680_10439393Not Available825Open in IMG/M
3300005181|Ga0066678_11008544Not Available539Open in IMG/M
3300005332|Ga0066388_103890327Not Available762Open in IMG/M
3300005332|Ga0066388_104229765Not Available732Open in IMG/M
3300005332|Ga0066388_104816793All Organisms → cellular organisms → Bacteria686Open in IMG/M
3300005471|Ga0070698_100316847Not Available1490Open in IMG/M
3300005552|Ga0066701_10971092Not Available502Open in IMG/M
3300005555|Ga0066692_10510161Not Available764Open in IMG/M
3300005764|Ga0066903_101068605Not Available1484Open in IMG/M
3300005764|Ga0066903_103240688Not Available880Open in IMG/M
3300005764|Ga0066903_107844567All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium tropiciagri549Open in IMG/M
3300007255|Ga0099791_10002281Not Available7732Open in IMG/M
3300007255|Ga0099791_10011752All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3674Open in IMG/M
3300007258|Ga0099793_10100988Not Available1335Open in IMG/M
3300007258|Ga0099793_10163825Not Available1057Open in IMG/M
3300007265|Ga0099794_10135004Not Available1247Open in IMG/M
3300007788|Ga0099795_10638878Not Available509Open in IMG/M
3300009143|Ga0099792_10039922All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium2232Open in IMG/M
3300009143|Ga0099792_10128524Not Available1370Open in IMG/M
3300009792|Ga0126374_10791278Not Available723Open in IMG/M
3300009792|Ga0126374_11403978Not Available569Open in IMG/M
3300010046|Ga0126384_12132352Not Available538Open in IMG/M
3300010047|Ga0126382_12513662Not Available502Open in IMG/M
3300010358|Ga0126370_10967201All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium774Open in IMG/M
3300010358|Ga0126370_12601277Not Available506Open in IMG/M
3300010359|Ga0126376_11179689Not Available779Open in IMG/M
3300010359|Ga0126376_13120713Not Available513Open in IMG/M
3300010360|Ga0126372_10332847Not Available1352Open in IMG/M
3300010360|Ga0126372_10384164Not Available1274Open in IMG/M
3300010360|Ga0126372_10606197Not Available1052Open in IMG/M
3300010360|Ga0126372_10676361Not Available1003Open in IMG/M
3300010360|Ga0126372_10681236Not Available1000Open in IMG/M
3300010362|Ga0126377_10746094Not Available1034Open in IMG/M
3300010362|Ga0126377_11690624Not Available708Open in IMG/M
3300011269|Ga0137392_10036135Not Available3627Open in IMG/M
3300011269|Ga0137392_10953096Not Available706Open in IMG/M
3300011271|Ga0137393_10954271Not Available731Open in IMG/M
3300012096|Ga0137389_10752223Not Available837Open in IMG/M
3300012199|Ga0137383_10127094Not Available1861Open in IMG/M
3300012202|Ga0137363_10145774Not Available1855Open in IMG/M
3300012202|Ga0137363_11081116All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei682Open in IMG/M
3300012202|Ga0137363_11124969Not Available668Open in IMG/M
3300012203|Ga0137399_10117968Not Available2080Open in IMG/M
3300012205|Ga0137362_10041663All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → unclassified Hyphomicrobiaceae → Hyphomicrobiaceae bacterium3704Open in IMG/M
3300012205|Ga0137362_10232464Not Available1591Open in IMG/M
3300012205|Ga0137362_10978222Not Available721Open in IMG/M
3300012208|Ga0137376_10812684Not Available804Open in IMG/M
3300012208|Ga0137376_10999185Not Available717Open in IMG/M
3300012361|Ga0137360_10027948All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → unclassified Hyphomicrobiaceae → Hyphomicrobiaceae bacterium3867Open in IMG/M
3300012361|Ga0137360_10099556Not Available2221Open in IMG/M
3300012362|Ga0137361_10535215Not Available1075Open in IMG/M
3300012685|Ga0137397_10096279Not Available2165Open in IMG/M
3300012917|Ga0137395_10283971Not Available1169Open in IMG/M
3300012918|Ga0137396_10029702Not Available3603Open in IMG/M
3300012918|Ga0137396_10386045Not Available1038Open in IMG/M
3300012918|Ga0137396_10576599Not Available833Open in IMG/M
3300012923|Ga0137359_10398067Not Available1221Open in IMG/M
3300012927|Ga0137416_11860838Not Available550Open in IMG/M
3300012930|Ga0137407_11877451Not Available571Open in IMG/M
3300012944|Ga0137410_10014911All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales5293Open in IMG/M
3300012971|Ga0126369_11312400Not Available814Open in IMG/M
3300015054|Ga0137420_1243298All Organisms → cellular organisms → Bacteria662Open in IMG/M
3300015241|Ga0137418_10071974Not Available3151Open in IMG/M
3300015241|Ga0137418_10139470All Organisms → cellular organisms → Bacteria2145Open in IMG/M
3300015242|Ga0137412_10249122Not Available1406Open in IMG/M
3300015245|Ga0137409_10011056All Organisms → cellular organisms → Bacteria9109Open in IMG/M
3300015245|Ga0137409_10021901All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria6284Open in IMG/M
3300016404|Ga0182037_11928361Not Available529Open in IMG/M
3300017970|Ga0187783_10191005Not Available1506Open in IMG/M
3300017970|Ga0187783_10284857Not Available1206Open in IMG/M
3300018468|Ga0066662_10791960Not Available918Open in IMG/M
3300018468|Ga0066662_11637283All Organisms → cellular organisms → Bacteria672Open in IMG/M
3300020199|Ga0179592_10037106Not Available2206Open in IMG/M
3300021086|Ga0179596_10004978Not Available3941Open in IMG/M
3300021086|Ga0179596_10212311Not Available945Open in IMG/M
3300021086|Ga0179596_10647875Not Available535Open in IMG/M
3300026304|Ga0209240_1020576Not Available2525Open in IMG/M
3300026340|Ga0257162_1022393Not Available763Open in IMG/M
3300026355|Ga0257149_1058916Not Available558Open in IMG/M
3300026356|Ga0257150_1004024Not Available1948Open in IMG/M
3300026358|Ga0257166_1010965Not Available1118Open in IMG/M
3300026361|Ga0257176_1032749Not Available787Open in IMG/M
3300026446|Ga0257178_1047998Not Available555Open in IMG/M
3300026481|Ga0257155_1022260Not Available914Open in IMG/M
3300026481|Ga0257155_1061541Not Available596Open in IMG/M
3300026482|Ga0257172_1001062Not Available3145Open in IMG/M
3300026482|Ga0257172_1083462Not Available587Open in IMG/M
3300026489|Ga0257160_1019661Not Available1057Open in IMG/M
3300026494|Ga0257159_1063157Not Available634Open in IMG/M
3300026496|Ga0257157_1066606Not Available615Open in IMG/M
3300026498|Ga0257156_1091873Not Available631Open in IMG/M
3300026507|Ga0257165_1087290Not Available576Open in IMG/M
3300026514|Ga0257168_1053784Not Available884Open in IMG/M
3300026551|Ga0209648_10007234All Organisms → cellular organisms → Bacteria → Proteobacteria9543Open in IMG/M
3300026551|Ga0209648_10110171All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Beijerinckiaceae → unclassified Beijerinckiaceae → Beijerinckiaceae bacterium2241Open in IMG/M
3300027512|Ga0209179_1046688Not Available921Open in IMG/M
3300027512|Ga0209179_1118209Not Available592Open in IMG/M
3300027671|Ga0209588_1172503Not Available680Open in IMG/M
3300027903|Ga0209488_10080710Not Available2420Open in IMG/M
3300032180|Ga0307471_100930354Not Available1037Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil45.71%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil15.24%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil15.24%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil8.57%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil6.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil3.81%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.95%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.95%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.95%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300002917Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cmEnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300016404Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082EnvironmentalOpen in IMG/M
3300017970Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MGEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026340Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-AEnvironmentalOpen in IMG/M
3300026355Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-AEnvironmentalOpen in IMG/M
3300026356Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-AEnvironmentalOpen in IMG/M
3300026358Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-BEnvironmentalOpen in IMG/M
3300026361Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-BEnvironmentalOpen in IMG/M
3300026446Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-BEnvironmentalOpen in IMG/M
3300026481Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-AEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026489Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-AEnvironmentalOpen in IMG/M
3300026494Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-AEnvironmentalOpen in IMG/M
3300026496Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-AEnvironmentalOpen in IMG/M
3300026498Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-AEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027512Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25613J43889_1018277513300002907Grasslands SoilMPRKPQQPSQRTKWHVYLQARAARREWVAEIEAVDEREAIEIAAKELKQPAKKLMAVQRR
JGI25617J43924_1010608333300002914Grasslands SoilATYLAANTGGMPRKPQQPSQRTKWHVYLQARAARREWVGEIEAVDEREAIVIAAKEFKQPAKKLIATQRR*
JGI25616J43925_1005958113300002917Grasslands SoilMPRKPQQXSQRTKWHVYLQARAARREWVGEIEAVDEREAIVIAAKEFKQPAKKLIATQRR
JGI25616J43925_1019470823300002917Grasslands SoilMSRKPQQRPLRTTWHVYLQTRTARPEWVGEIKAVDERKAIEIAAKEFKQPASKLIAVPRR
Ga0066680_1043939323300005174SoilMPRKPQQPPQRTKWYVYYRLAARREWVGEIEAVDEREAIEIAAKEFKQPARKLMAERRR*
Ga0066678_1100854413300005181SoilMSRRQQQPPQPAKAKWHVYKLAAKRQWVGEIEAVDEREAIEVAAKEFKQPETKLMAVRRR
Ga0066388_10389032713300005332Tropical Forest SoilLRPILPGTSRKPQQPPQPKIKWHVYRLAAKRQWVGEIEAVDEREAIEKAAKEFKQPAKKLMAVQRP*
Ga0066388_10422976523300005332Tropical Forest SoilMARKPQQPPQQTKWHVYRKIAARREWVGEIKAVDEHEAIKIAAKEFKQPAT
Ga0066388_10455905223300005332Tropical Forest SoilRTKWHVYLQARAAKLEWVGEIEAADEREAIEIAAKEFKQPAKKLIAVQLR*
Ga0066388_10481679313300005332Tropical Forest SoilMSSKPQPSPQRTKWHVYLQARAAKREWVGEIEAVDEREAIEIAAKEFKQPARKLMAVQRK
Ga0070698_10031684743300005471Corn, Switchgrass And Miscanthus RhizosphereMPRKPQQPSQQTKWHVYLQARAARREWVGEIEAVDERPAIEIAAKEFKQPAKKLIATQRR
Ga0066701_1097109223300005552SoilLLVLTLRPILPAMSRKPQPPQPTKTKWHVYKLAAKREWVGEIEAVDEREVIEIAAKEFKQPAKKLMAMQR
Ga0066692_1051016113300005555SoilRKPQQPPQRTKWHIYYGLAARREWVGEIEAVDERGAIEIAAKEFKQPARKLMAERQR*
Ga0066903_10106860523300005764Tropical Forest SoilMFRKPQQRTRWHVYLQARAAKREWVGEIEAVDEREAIEIAAKEFKRPAKKLIAVQLR*
Ga0066903_10324068823300005764Tropical Forest SoilMPRKPQQPQQRTKWHVYLQARAAKLEWVGEIEAADEREAIEIAAKEFKQPAKK
Ga0066903_10784456723300005764Tropical Forest SoilANTCSMPRKPQQPSQQTKWHVYRKIAARREWVGEIKAGDEREAIKIAAKEFKQPENKLVLLSHKATSGGF*
Ga0099791_1000228183300007255Vadose Zone SoilMPRKPRQPSQRTKWHVYYKLAAKLEWVGEIEAVDEHKAIEIAAKEFKEPAKKLISVQRR*
Ga0099791_1001175233300007255Vadose Zone SoilMSRKPQPPQPTKTKWHVYKLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRQ*
Ga0099793_1010098823300007258Vadose Zone SoilMSRKPQQPPQPTKTRWHVYRLAAKRQWVGEIEAVDEREAIEIAAKELKQPAKKLMAVPRR
Ga0099793_1016382523300007258Vadose Zone SoilLPARGLVAATYLAANTGGIPRKPQQPSQRTKWHVYLQARAARREWVAEIEAVDEREAIEIAAKELKQPAKKLMAVQRR*
Ga0099794_1013500433300007265Vadose Zone SoilMPRKPRQPPQRTKWHVYYKLAAKLEWVGEIEAVDEHKAIEIAAKEFKEPAKKLISVQRR*
Ga0099795_1063887813300007788Vadose Zone SoilMSRKPQQPPQPTKTRWHVYRLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKK
Ga0099792_1003992233300009143Vadose Zone SoilMSRKPQPPQPTKTKWHVYKLAAKRQWVGEIEAVDELEAIEIAAKEFKQPAKKLMAVQRR*
Ga0099792_1012852413300009143Vadose Zone SoilMSRKPQQPPQRTKWHVYKLAAKREWVGEIEAVDEREAIEIAAKEFRQPAKKLMAVPRR*
Ga0126374_1079127813300009792Tropical Forest SoilMSRKPQPPPQRTKWHVYLQGRTARREWVGEIEAFDQREAIEIAAKEFKQPAKKLIAVQL
Ga0126374_1140397813300009792Tropical Forest SoilMSRKPQQPPQRTKWHVYKLAAKRQWVGEIEAVDEREASEIAAKEFKQPAKKL
Ga0126384_1213235223300010046Tropical Forest SoilMSRKPQQPLQRTKWHVYLQARAAKREWVGEIEAVDEREAIEIAAKEFKQPVRKLMAVQRK
Ga0126382_1251366213300010047Tropical Forest SoilMPRKRQPPPRPTKTKWHVYKLAAKREWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRR
Ga0126370_1096720123300010358Tropical Forest SoilMSRKPQQPQQRTRWHVYLQARAAKREWVGEIEAFDEREAIEIAAKEFKQPAKKLIAVQL*
Ga0126370_1260127713300010358Tropical Forest SoilMSRKPQQPLQRTKWHVYLQARAAKREWVGEIEAVDEREAIEIAAKEFKQPARKLMAVQRK
Ga0126376_1117968913300010359Tropical Forest SoilLPCSQYCRMSRKPQQPQQRTRWHVYLQARAAKREWVGEIEAVDEREAIEIAAKEFRQPAKKLIAVQRR*
Ga0126376_1312071313300010359Tropical Forest SoilMSRKPQPPPQRTKWHVYLQGRTARREWVGEIEAFDQREAIEIAAKEFKQPAKKLIAVQL*
Ga0126372_1033284713300010360Tropical Forest SoilMPRKPQQPSQQTKWHVYRKIAARREWVGEIKAGDEREAIKIAAKEFKQPANKLVAVPRQGR*
Ga0126372_1038416423300010360Tropical Forest SoilMSRKPQQPLQRTKWHVYLQARAAKREWVGEIEAVDEREAIEIAAREFKQPARKLIAVQLR
Ga0126372_1060619713300010360Tropical Forest SoilMARKPQQPPQRTKWHVYYKLAARREWVGEIEAVDEREAIEMAAKEFRQPAKKLMAVQRR*
Ga0126372_1067636123300010360Tropical Forest SoilMPRKPQQPQQRTKWHVYLQARAAKLEWVGEIEAVDEREAIEIAAKEFKQPAKKLIAVQLRRR*
Ga0126372_1068123623300010360Tropical Forest SoilMSRKPQQSPQPKIKWHVYKLAAKRQWVGEIEAVDEREAIEKAAKEFKQPAKKLMAVQRP*
Ga0126377_1074609423300010362Tropical Forest SoilMSREPQQPPKRTKWHVYLQARAAKLEWVGEIEAVDERDAIEIAAKEFKVDAWRLIAVQRR
Ga0126377_1169062423300010362Tropical Forest SoilMSRKPQQPPQRTKWHVYKLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVRQR*
Ga0137392_1003613523300011269Vadose Zone SoilLPARGLVAATYLAANTGGIPRKPQQPSQRTKWHVYLQARAARREWVAEIEAVDEREAIEIAAKEFKQPAKKLIATQRR*
Ga0137392_1095309633300011269Vadose Zone SoilMSRNPQPPQPTKTRWHVYLHARAAGREWVGEIEAINEREAIQIAAKEFKQPANKLIAVARR*
Ga0137393_1095427133300011271Vadose Zone SoilMSRKPQQPPQRTKWHVYKLAAKREWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRR*
Ga0137389_1075222313300012096Vadose Zone SoilMPRKPQQPPQPTKTKWHVYKLAAKRQWVGEIEAVDEREAIEIAAKEFKQHASKLVAVRK*
Ga0137383_1012709433300012199Vadose Zone SoilMTRKPQQPPQPTKTKWHVYKLAVKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRR
Ga0137363_1014577443300012202Vadose Zone SoilMSRKPQQPPQRTKWHVYKLAAKREWVGEIEAVDEREAIEIAAKEFKQPVKKLMVVPR*
Ga0137363_1108111613300012202Vadose Zone SoilRYLPCGQYCRMPRKQKQPPRRTKWHVYYKLPPRREWVGEIKAVDEREAIEIAAKEFKQPAKKLMAVPRQ*
Ga0137363_1112496933300012202Vadose Zone SoilSRKPQQPPPPTKTKWHVFKLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRP*
Ga0137399_1011796833300012203Vadose Zone SoilMSRKPQQPPQPTKTRWHVYRLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRQ
Ga0137362_1004166383300012205Vadose Zone SoilMSRKPQQPPPPTKTKWHVFKLAAKRQWVGEIEAVDEREAIEVAAKEFKQPETKLMAVRRR
Ga0137362_1023246423300012205Vadose Zone SoilMSRKLQPPQPTKTRCHVYKLAAKRQWGGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRP*
Ga0137362_1097822213300012205Vadose Zone SoilMPRKQKQPPRRTKWHVYYKLPPRREWVGEIKAVDEREAIEIAAKEFKQPAKKLMAVPRQ*
Ga0137376_1081268423300012208Vadose Zone SoilMPRKPQQPPQRTKWHVYYRLAARREWVGEIEAVDEREEMEIAAKEFKQPAKKLMAVQRR*
Ga0137376_1099918533300012208Vadose Zone SoilGPILTGMSRRQQQPPQPAKAKWHVYKLAAKRQWVGEIEAVDEREAIEVAAKEFKQPETKLMAVRRR*
Ga0137360_1002794863300012361Vadose Zone SoilMSRKPQQPPPPTKTKWHVFKLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRRDQRV*
Ga0137360_1009955623300012361Vadose Zone SoilMSRKLQPPQPTKTKWHVYKLAAKRQWGGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRP*
Ga0137361_1053521523300012362Vadose Zone SoilMPRKPQQPSQRTKWHVYLQARAARREWVGEIEAVDERQAIEIAAKEFKQPAKKLIATQRR
Ga0137397_1009627953300012685Vadose Zone SoilMPRKPQQPPQPTETKWHVYKLAAKRQWVGEIEAVEERDAIKIAAKEFKQPAKKLMAVQRQ
Ga0137395_1028397113300012917Vadose Zone SoilQQPPQRTKWHVYKLAAKREWVGEIEAVDEREAIEIAAKEFKQPVKKLMVVPR*
Ga0137396_1002970233300012918Vadose Zone SoilMPRKPRQPPQRTKWHVYYKLAAKLEWVGEIEAVDEHKAIEIEAKEFKEPAKKLISVQRR*
Ga0137396_1038604513300012918Vadose Zone SoilLPGMSRKPQQPSQPTKTRWHVYKLAKRRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRR*
Ga0137396_1057659933300012918Vadose Zone SoilMPRKPQQPSQQTKWHVYLQARAARREWVGEIEAVDERQAIEIAAQEFKQPAKKLIATQRR
Ga0137359_1039806743300012923Vadose Zone SoilVAATYLAANTGGMPRKPQQPSQQTKWHVYLQARAARREWVGEIEAVDERPAIEIAAKEFKQPAKKLIATQRR*
Ga0137416_1186083813300012927Vadose Zone SoilRKPQQPSQQTKWHVYLQARAARREWVGEIEAVDERQAIEIAAKEFKQPAKKLIATQRR*
Ga0137407_1187745113300012930Vadose Zone SoilMPRKPQQPPQRTKWHVYYRLAARREWVGEIEAVDEREAIEIAAKEFKQP
Ga0137410_1001491153300012944Vadose Zone SoilMSRKPQPPQPTKTTKTRWHVYKLAAKRQWVGEIEAVEEREAIKIAAKEFKQSAKKLMAVPRR*
Ga0126369_1131240013300012971Tropical Forest SoilCGQYCRMSRKPQQPQPRTKWHVYLQARAAKREWVGEIEAVDEREAIEIAAKEFKQPAKKLIAVQLR*
Ga0137420_124329823300015054Vadose Zone SoilVLQLLTFQRILPGMSRKPQPPQPTKTRWHVYKLAAKRQWVGEIEAVEERDAIKIAAKEFKQPAKKLMAVPRR*
Ga0137418_1007197473300015241Vadose Zone SoilLPARGLVAATYLAANTGGPRKPQQPSQRTKWHVYLQARAARREWVGEIEAADEREAIEKAAKELKQPAKKLMAVQRR*
Ga0137418_1013947023300015241Vadose Zone SoilMSRKPQPPQPTKTRWHVYKLAAKRQWVGEIEAVEERDAIKIAAKEFKQPAKKLMAVPRR*
Ga0137412_1024912223300015242Vadose Zone SoilMSRKPQQPPQPTKTRWHVYRLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVPRR
Ga0137409_10011056193300015245Vadose Zone SoilMSRKPQPPQPTKTRWHVYKLAAKRQWVGEIEAVEEREAIKIAAKEFKQSAKKLMAVPRR*
Ga0137409_1002190193300015245Vadose Zone SoilMPRKPQQPPQPTKTKWHVYKLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLIAVQRR
Ga0182037_1192836113300016404SoilMRRKPQQPQQRTKWHVYLQARTAKLEWVGEIEAVDEREAIEKAAKEFRLDPAKLMAEQR
Ga0187783_1019100533300017970Tropical PeatlandSCLPSRGTCGDRTGIEPTYLAANTGGMPRKPQQPPQPTKTRWHVYKLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKTLMAVQRR
Ga0187783_1028485733300017970Tropical PeatlandMSRKPQQPPKRTKWHVYYKFAARREWVGEIEAADEREAIQIAAKEFKQPANKLVAVQRR
Ga0066662_1079196013300018468Grasslands SoilMSRKPQPPQPTKTKWHVYKLAAKREWVGEIEAVDEREVIEIAAKEFKQPAKKLMAMQRR
Ga0066662_1163728323300018468Grasslands SoilMSRKPRPPPQPTRTKWHVYKLAAKREWVGEIEAVDERDAIEIAAKEFRLPARKLMAVPRR
Ga0179592_1003710623300020199Vadose Zone SoilMPRKPRQPPQRTKWHVYYKLAAKLEWVGEIEAVDEHKAIEIAAKEFKEPAKKLISVQRR
Ga0179596_1000497853300021086Vadose Zone SoilLPARGLVAATYLAANTGGIPRKPQQPSQRTKWHVYLQARAARREWVAEIEAVDEREAIEIAAKELKQPAKKLMAVQRR
Ga0179596_1021231113300021086Vadose Zone SoilLPARGLVAATYLAANTGGMPRKPQQPSQRAKWHVYLQARAARREWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVPRR
Ga0179596_1064787513300021086Vadose Zone SoilMSRKPQPPQPTKTKWHVYKLAAKRQWVGEIEVVDERQAIEIAAKEFKQP
Ga0209240_102057663300026304Grasslands SoilMPRKPQQLSQRTKWHVYLQARAARREWVGEIEAVDEREAIVIAAKEFKQPAKKLIAMQRR
Ga0257162_102239333300026340SoilRAGRNQPDGLPARGLVAATYLAANTGGMPRKPQQPSQRTKWHVYLQARAARRKWVAEIEAVDEREAIEIAAKELKQPAKKLMAVQRR
Ga0257149_105891623300026355SoilMPRKPQQPSQQTKWHVYLQARAARREWVGEIEAVDERQAIEIAAKEFKQPAKKLIATQRR
Ga0257150_100402473300026356SoilMSRKPQPPQPTKTKWHVYKLAAKRQWVGEIEAVDELEAIEIAAKEFKQPAKKLMAVQRR
Ga0257166_101096523300026358SoilLPARGLVAATYLAANTGGIPRKPQQPSQRTKWHVYLQARAARREWVAEIEAVDEREAIEIAAKEFKQPAKKLMAVQRR
Ga0257176_103274913300026361SoilMPRKPQRPSQQTKWHVYLQARAARREWVGEIEAVDERQAIEIAAKEFKQPAKKLIATQRR
Ga0257178_104799813300026446SoilARGLVAATYLAANTGGIPRKPQQPSQRTKWHVYLQARAARREWVAEIEAVDEREAIEIAAKELKQPAKKLMAVQRR
Ga0257155_102226023300026481SoilLPARGLVAATYLAANTGGMPRKPQQPSQQTKWHVYLQARAARREWVGEIEAVDERQAIEIAAKEFKQPAKKLIATQRR
Ga0257155_106154113300026481SoilMSRKPQPPQPTKTRWHVYLHARAAGREWVGEIEAINEREAIQIAAKEFKQPANKLIAVQR
Ga0257172_100106283300026482SoilVLNVLTLRPILSAMSRKPQPPQPTKTKWHVYKLAAKRQWVGEIEAVDELEAIEIAAKEFKQPAKKLMAVQRR
Ga0257172_108346223300026482SoilAANTGGMPRKPQQPSQRTKWHVYLQARAARRKWVAEIEAVDEREAIEIAAKELKQPAKKLMAVQRR
Ga0257160_101966113300026489SoilMPRKPQQPSQQTKWHVYLQARAARREWVGEIEAVDERQAIEIAAKEFKQPAKKLITTQRR
Ga0257159_106315723300026494SoilQPDGLPARGLVAATYLAANTGGMPRKPQQPSQRTKWHVYLQARAARREWVAEIEAVDEREAIEIAAKEFKQPAKKLMAVPRR
Ga0257157_106660633300026496SoilRNPQPPQPTKTRWHVYLHARAAGREWVGEIEAINEREAIQIAAKEFKQPANKLIAVARR
Ga0257156_109187323300026498SoilMSRKPQQPSQPTKTRWHVYKLAARRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRR
Ga0257165_108729013300026507SoilMSRKPQQPSQPTKTRWHVYRLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVPRR
Ga0257168_105378423300026514SoilLPARGLVAATYLAANTGGMPRKPQQPSQRTKWHVYLQARAARRKWVAEIEAVDEREAIEIAAKELKQPAKKLMAVQRR
Ga0209648_10007234123300026551Grasslands SoilMPRKPQQPSQRTKWHVYLQARAARREWVGEIEAVDEREAIVIAAKEFKQPAKKLIATQRR
Ga0209648_1011017143300026551Grasslands SoilMSRKPQPPQPTKTKWHVYKLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRQ
Ga0209179_104668823300027512Vadose Zone SoilMSRKPQQPSQPTKTRWHVYKLAARRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVPRR
Ga0209179_111820913300027512Vadose Zone SoilTLRPILPGMSRKPQQPPQPTKTRWHVYRLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLMAVQRR
Ga0209588_117250323300027671Vadose Zone SoilMPRKPQQPPQPTETKWHVYKLAAKRQWVGEIEAVDEREAIEIAAKEFKQPAKKLIAVQRR
Ga0209488_1008071033300027903Vadose Zone SoilMSRKPQQPPQRTKWHVYKLAAKREWVGEIEAVDEREAIEIAAKEFKQPVKKLMVVPR
Ga0307471_10093035433300032180Hardwood Forest SoilMSRKPQPPQPTKTKWHVFLQDRATKQWVGEIKAVDEREAIEIAAKEFKQPAKKLMAVPRR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.