NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome Family F072003



Overview

Basic Information
Family ID F072003
Family Type Metagenome
Number of Sequences 121
Average Sequence Length 59 residues
Representative Sequence MPRKPEPPPLSTWDVFRVAHKAIWLGTVEATDERDAIEKVAKERNIPAARLIATQR
Number of Associated Samples 62
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 34.17 %
% of genes near scaffold ends (potentially truncated) 25.62 %
% of genes from short scaffolds (< 2000 bps) 90.91 %
Associated GOLD sequencing projects 57
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.46


Most Common Taxonomy
Group Unclassified (84.298 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (49.587 % of family members)
Environment Ontology (ENVO) Unclassified (42.975 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (49.587 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 48.81%, β-sheet: 0.00%, Coil/Unstructured: 51.19%

Predicted 3D Structure

pTM-score: 0.46

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
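The cutoff quoted in this note can be written as a simple check. The sketch below is only an illustration: the function name is hypothetical, and the single 0.7 threshold is taken from the note itself, not from any published NMPFamsDB code.

```python
# Hypothetical helper; the 0.7 cutoff is the one quoted in the note above.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm < cutoff


# pTM-score reported for family F072003
print(is_low_confidence(0.46))
```

With the reported pTM-score of 0.46 the check is true, which matches the "Low Quality Model" label on this page.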



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 121 Family Scaffolds
PF03625 | DUF302 | 1.65
PF02265 | S1-P1_nuclease | 0.83
PF01925 | TauE | 0.83
PF03950 | tRNA-synt_1c_C | 0.83
PF01594 | AI-2E_transport | 0.83
PF03459 | TOBE | 0.83
PF02861 | Clp_N | 0.83
PF13546 | DDE_5 | 0.83
PF14136 | DUF4303 | 0.83
PF01904 | DUF72 | 0.83
PF04973 | NMN_transporter | 0.83
PF00708 | Acylphosphatase | 0.83
PF12680 | SnoaL_2 | 0.83
PF14022 | DUF4238 | 0.83
PF07969 | Amidohydro_3 | 0.83
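
The frequency column can be read as a scaffold count divided by the family's 121 scaffolds. That reading is an assumption based on the column header, and the helper names below are hypothetical. A minimal sketch of the conversion in both directions:

```python
TOTAL_SCAFFOLDS = 121  # "Number of Associated Scaffolds" for this family


def frequency_from_count(count: int, total: int = TOTAL_SCAFFOLDS) -> float:
    """Percentage of family scaffolds, rounded to two decimals as in the table."""
    return round(100 * count / total, 2)


def count_from_frequency(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Nearest whole number of scaffolds behind a reported percentage."""
    return round(percent / 100 * total)


print(frequency_from_count(1), frequency_from_count(2))
print(count_from_frequency(0.83), count_from_frequency(1.65))
```

Under this assumption, a frequency of 0.83 corresponds to a domain seen next to the family on a single scaffold, and 1.65 to two scaffolds.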

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 121 Family Scaffolds
COG3439 | Uncharacterized conserved protein, DUF302 family | Function unknown [S] | 1.65
COG0008 | Glutamyl- or glutaminyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.83
COG0542 | ATP-dependent Clp protease, ATP-binding subunit ClpA | Posttranslational modification, protein turnover, chaperones [O] | 0.83
COG0628 | Predicted PurR-regulated permease PerM | General function prediction only [R] | 0.83
COG0730 | Sulfite exporter TauE/SafE/YfcA and related permeases, UPF0721 family | Inorganic ion transport and metabolism [P] | 0.83
COG1801 | Sugar isomerase-related protein YecE, UPF0759/DUF72 family | General function prediction only [R] | 0.83
COG3201 | Nicotinamide riboside transporter PnuC | Coenzyme transport and metabolism [H] | 0.83



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 84.30 %
All Organisms | root | All Organisms | 15.70 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002917|JGI25616J43925_10060241 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1622
3300002917|JGI25616J43925_10343600 | All Organisms → cellular organisms → Bacteria | 553
3300005332|Ga0066388_101090333 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1350
3300005332|Ga0066388_101941102 | Not Available | 1053
3300005332|Ga0066388_102130657 | Not Available | 1010
3300005764|Ga0066903_106101154 | Not Available | 631
3300005764|Ga0066903_106185596 | Not Available | 626
3300005764|Ga0066903_106444189 | Not Available | 612
3300007258|Ga0099793_10157722 | Not Available | 1077
3300007258|Ga0099793_10367433 | Not Available | 705
3300007258|Ga0099793_10481932 | Not Available | 615
3300007258|Ga0099793_10567284 | Not Available | 567
3300007788|Ga0099795_10030146 | Not Available | 1860
3300007788|Ga0099795_10070470 | Not Available | 1318
3300009038|Ga0099829_10330922 | All Organisms → cellular organisms → Bacteria | 1252
3300009088|Ga0099830_10737911 | Not Available | 811
3300009088|Ga0099830_11551102 | Not Available | 552
3300009090|Ga0099827_10493528 | Not Available | 1052
3300009143|Ga0099792_10398179 | Not Available | 842
3300009143|Ga0099792_10557283 | Not Available | 725
3300009143|Ga0099792_10839895 | Not Available | 604
3300010043|Ga0126380_10503520 | Not Available | 930
3300010043|Ga0126380_11024741 | Not Available | 697
3300010043|Ga0126380_11620572 | Not Available | 578
3300010046|Ga0126384_10984579 | All Organisms → cellular organisms → Bacteria | 767
3300010046|Ga0126384_11698608 | Not Available | 597
3300010046|Ga0126384_12166057 | Not Available | 534
3300010159|Ga0099796_10112230 | Not Available | 1039
3300010159|Ga0099796_10375332 | Not Available | 618
3300010358|Ga0126370_10590189 | Not Available | 957
3300010360|Ga0126372_10019272 | All Organisms → cellular organisms → Bacteria | 3980
3300010360|Ga0126372_10470305 | Not Available | 1171
3300010360|Ga0126372_10916782 | Not Available | 880
3300010360|Ga0126372_11085658 | Not Available | 818
3300010360|Ga0126372_11964813 | Not Available | 631
3300010360|Ga0126372_12332320 | Not Available | 585
3300010362|Ga0126377_12147179 | Not Available | 635
3300010362|Ga0126377_12635274 | Not Available | 578
3300010366|Ga0126379_12608112 | Not Available | 603
3300011269|Ga0137392_10010358 | Not Available | 6157
3300012199|Ga0137383_11068316 | Not Available | 586
3300012202|Ga0137363_10015546 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Rhizobium/Agrobacterium group → Rhizobium → Rhizobium leucaenae | 4986
3300012202|Ga0137363_10984605 | Not Available | 716
3300012205|Ga0137362_11588650 | Not Available | 541
3300012211|Ga0137377_10038301 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4384
3300012361|Ga0137360_10397377 | Not Available | 1161
3300012361|Ga0137360_10982095 | Not Available | 728
3300012361|Ga0137360_11305289 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 627
3300012362|Ga0137361_11604182 | Not Available | 571
3300012683|Ga0137398_10062318 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2249
3300012683|Ga0137398_10127220 | Not Available | 1634
3300012683|Ga0137398_10910390 | Not Available | 613
3300012917|Ga0137395_10139141 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1653
3300012917|Ga0137395_10526679 | Not Available | 852
3300012917|Ga0137395_10528204 | Not Available | 851
3300012918|Ga0137396_10925103 | Not Available | 637
3300012918|Ga0137396_11044535 | Not Available | 588
3300012918|Ga0137396_11064282 | Not Available | 581
3300012923|Ga0137359_11593643 | Not Available | 541
3300012923|Ga0137359_11779933 | Not Available | 503
3300012925|Ga0137419_10189776 | Not Available | 1514
3300012927|Ga0137416_10282401 | All Organisms → cellular organisms → Bacteria | 1369
3300012927|Ga0137416_10993336 | Not Available | 749
3300012927|Ga0137416_11581663 | Not Available | 596
3300012927|Ga0137416_12050192 | Not Available | 525
3300012944|Ga0137410_10037777 | All Organisms → cellular organisms → Bacteria | 3401
3300012944|Ga0137410_10132118 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1884
3300012944|Ga0137410_10287569 | Not Available | 1299
3300012944|Ga0137410_10372554 | Not Available | 1146
3300012944|Ga0137410_10789670 | Not Available | 796
3300012971|Ga0126369_10932933 | Not Available | 954
3300012971|Ga0126369_12978979 | Not Available | 554
3300012971|Ga0126369_13432494 | Not Available | 519
3300015241|Ga0137418_10153584 | Not Available | 2026
3300015241|Ga0137418_10172455 | All Organisms → cellular organisms → Bacteria | 1889
3300017970|Ga0187783_10199903 | Not Available | 1468
3300017970|Ga0187783_10311656 | All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 1147
3300017975|Ga0187782_10185583 | Not Available | 1554
3300020582|Ga0210395_10577408 | Not Available | 844
3300021086|Ga0179596_10208043 | Not Available | 954
3300021086|Ga0179596_10335102 | Not Available | 758
3300021086|Ga0179596_10531485 | Not Available | 597
3300021086|Ga0179596_10711563 | Not Available | 508
3300021178|Ga0210408_11153920 | Not Available | 594
3300021476|Ga0187846_10151585 | Not Available | 982
3300021478|Ga0210402_10390126 | Not Available | 1293
3300026304|Ga0209240_1020576 | Not Available | 2525
3300026320|Ga0209131_1091964 | Not Available | 1615
3300026320|Ga0209131_1386389 | Not Available | 522
3300026340|Ga0257162_1010078 | Not Available | 1097
3300026355|Ga0257149_1037721 | Not Available | 686
3300026356|Ga0257150_1009175 | Not Available | 1334
3300026356|Ga0257150_1040991 | Not Available | 677
3300026356|Ga0257150_1066551 | Not Available | 543
3300026358|Ga0257166_1028856 | Not Available | 755
3300026359|Ga0257163_1004190 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium symbiodeficiens | 2011
3300026359|Ga0257163_1009823 | Not Available | 1433
3300026361|Ga0257176_1020420 | Not Available | 953
3300026374|Ga0257146_1020195 | Not Available | 1085
3300026376|Ga0257167_1023831 | Not Available | 889
3300026480|Ga0257177_1086800 | Not Available | 511
3300026481|Ga0257155_1023961 | Not Available | 887
3300026481|Ga0257155_1029299 | Not Available | 816
3300026482|Ga0257172_1016284 | All Organisms → cellular organisms → Bacteria | 1278
3300026494|Ga0257159_1024451 | Not Available | 995
3300026507|Ga0257165_1107257 | Not Available | 520
3300026508|Ga0257161_1013327 | Not Available | 1515
3300026508|Ga0257161_1092739 | Not Available | 627
3300026551|Ga0209648_10040868 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 4011
3300026551|Ga0209648_10341378 | Not Available | 1038
3300026551|Ga0209648_10394844 | Not Available | 913
3300027862|Ga0209701_10490307 | Not Available | 669
3300027882|Ga0209590_10320566 | Not Available | 997
3300027903|Ga0209488_10465250 | Not Available | 930
3300027903|Ga0209488_10619564 | Not Available | 783
3300028536|Ga0137415_10270562 | Not Available | 1505
3300028536|Ga0137415_10530300 | Not Available | 986
3300028536|Ga0137415_11363345 | Not Available | 529
3300032076|Ga0306924_11440733 | Not Available | 733
3300032261|Ga0306920_100525067 | Not Available | 1756
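
The Quality Assessment block reports the share of genes that come from short scaffolds (< 2000 bps). A rough way to derive a comparable figure from scaffold lengths is sketched below. The function name is hypothetical, the sample uses only six lengths copied from the table above, and the database's own per-gene calculation may differ.

```python
def short_scaffold_share(lengths, threshold=2000):
    """Percentage of scaffold lengths strictly below `threshold` bp."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    short = sum(1 for n in lengths if n < threshold)
    return round(100 * short / len(lengths), 2)


# Six lengths taken from the scaffold table, for illustration only.
sample_lengths = [1622, 553, 1350, 1053, 3980, 6157]
print(short_scaffold_share(sample_lengths))
```

Running the same function over the full length column would give a figure to compare against the percentage listed under Quality Assessment.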




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 49.59%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 18.18%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 15.70%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.61%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.96%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 2.48%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.65%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.83%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002917 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300017970 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MG | Environmental
3300017975 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP15_20_MG | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021476 | Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2) | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental
3300026340 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-A | Environmental
3300026355 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-A | Environmental
3300026356 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-A | Environmental
3300026358 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-B | Environmental
3300026359 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-A | Environmental
3300026361 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-B | Environmental
3300026374 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-A | Environmental
3300026376 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-B | Environmental
3300026480 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-B | Environmental
3300026481 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-A | Environmental
3300026482 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-B | Environmental
3300026494 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-A | Environmental
3300026507 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-B | Environmental
3300026508 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-A | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300032076 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2) | Environmental
3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental





Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25616J43925_100602412 | 3300002917 | Grasslands Soil | MQSPNPQASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR*
JGI25616J43925_103436001 | 3300002917 | Grasslands Soil | PPPLSTWDVFRVAHKAIWLGTVEAADERDAIERVAKERNVPVAWLIATQRR*
Ga0066388_1010903332 | 3300005332 | Tropical Forest Soil | MAMQSPGPQTSPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPANRLIATQRR
Ga0066388_1019411021 | 3300005332 | Tropical Forest Soil | MVKKPKPPLLSTFDVFRVAHKAIWLGTVEASDERDAIQRVARERDIPAALLIARRR*
Ga0066388_1021306571 | 3300005332 | Tropical Forest Soil | MQSPNSQASPLSSWDVFRQAHKAIWLGTVEATDERDAIEKVAKERNIPAKRLIARRR*
Ga0066903_1061011541 | 3300005764 | Tropical Forest Soil | MQSPNPQASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKQRNLPANRLMARRHR*
Ga0066903_1061855962 | 3300005764 | Tropical Forest Soil | MQSPNPQASPLSSWDVYRQAHKAIWLGTVEAVDERDAIEKVVKERNIPANRLRATRRR*
Ga0066903_1064441892 | 3300005764 | Tropical Forest Soil | PNPQASPLSPWDVYRQAHKAIWLGTVEAIDEREAIEKVAKERNIPANRLIAARHRQGTGGGYTD*
Ga0099793_101577221 | 3300007258 | Vadose Zone Soil | MAMQSPNPQASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR
Ga0099793_103674332 | 3300007258 | Vadose Zone Soil | MARKPEPLSLSSCDVFKIASKAIWLATIEATDERDAIERVAQERNIPANRLIATQRR*
Ga0099793_104819321 | 3300007258 | Vadose Zone Soil | MLHSRLANTCRMPRKPEPPPPSTWDVFKIASKAVWLATVEATNERDAIERVAKERNVSADRLIATRR*
Ga0099793_105672842 | 3300007258 | Vadose Zone Soil | MPRKPEPPPLSSWDVYKIASKAVWLATIEVTDERDAIERVAKERNVPAARLIATQRR*
Ga0099795_100301463 | 3300007788 | Vadose Zone Soil | MLRKPEPPPPSTWDVFKIASKAVWLATVEATNERDAIERVAKERNVSADRLIATRR*
Ga0099795_100704702 | 3300007788 | Vadose Zone Soil | MPRKPEPSPLSTWDVFRVAHKAIWLGTVEAADERDAIERVAKERNVPVAWLIATQRR*
Ga0099829_103309221 | 3300009038 | Vadose Zone Soil | PTYLAANTRRMARKPEPPLSTWDVFKITKKAVWLATIEATDERDTIEWVAKERDIPAARLIATRRR*
Ga0099830_107379112 | 3300009088 | Vadose Zone Soil | MARKPEPPLSTWDVFKITKKAVWLATIEATDERDTIEWVAKERDIPAARLIATRRR*
Ga0099830_115511022 | 3300009088 | Vadose Zone Soil | MTRKPEAPPLSTFDVLKIASKAVWLGTVEAIDERGAIEKVAKERNVPAARLIATQRR*
Ga0099827_104935281 | 3300009090 | Vadose Zone Soil | MAMQSPNPQASPLLSWEVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR
Ga0099792_103981792 | 3300009143 | Vadose Zone Soil | MPRKPEPPALSTFDVFKLARKAMWLATIEATDEHDAIKKVAEERNIPAYRLIATRR*
Ga0099792_105572832 | 3300009143 | Vadose Zone Soil | MARKPEPPLSTWDVFKITKKAVWLATIEATDERDTIEWVAKERDIPAARLI
Ga0099792_108398951 | 3300009143 | Vadose Zone Soil | MVKKPDPPPLSTFDVFKLAKKAVWLATVEATDERDAIERVAKERNVQPPG*
Ga0099792_108505763 | 3300009143 | Vadose Zone Soil | YRFATDVEPTYLAAHTPGMVKKTDPPALSTWDVFKLASKAVWLATVEATDERDAIERVAKERNVPASKLMATRRR*
Ga0126380_105035202 | 3300010043 | Tropical Forest Soil | MAMQSPGPEAPPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR
Ga0126380_110247411 | 3300010043 | Tropical Forest Soil | MAKQSPDPQALPPPSWDVYRQAHKAIWLGIVVATDERDAIEKVAKERNIPANRLIATRRR
Ga0126380_116205722 | 3300010043 | Tropical Forest Soil | MAKQSPDPQASLPSSWDVYRQAHKAIWLGIVVATDERDAIEKVAKERNIPASRLIATRRRS*
Ga0126384_109845791 | 3300010046 | Tropical Forest Soil | GCASNMATNLQASPLSSWNVFRQAHKAIWLGTVEATDERDAIEKVAKERNVPANGLIARRR*
Ga0126384_116986081 | 3300010046 | Tropical Forest Soil | MGKQSPDPEAFQLSSWDVYRQAHKAIWLGTVEATSERDAIEKVAKERNIPVNRLIATQRR
Ga0126384_121660571 | 3300010046 | Tropical Forest Soil | MAKKPEPPPLSSWNVYRAAHRAIWLATVEAANEHDAIERVASERNVPANRLTA
Ga0099796_101122302 | 3300010159 | Vadose Zone Soil | MAMQSPNPQASPLLSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR
Ga0099796_103753321 | 3300010159 | Vadose Zone Soil | MPRKPEPPPLSTFDVFKLARKAVWLATIEATDERDAIKKVAKERNIPAYRLIATRR*
Ga0126370_105901891 | 3300010358 | Tropical Forest Soil | LKQQSPDPLASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPANRLIATRHR
Ga0126372_100192722 | 3300010360 | Tropical Forest Soil | MVKKPEPLLQSTFDVFRVAHKAIWLGTVEASDERDAIQRVARERDIPAAWLIARRR*
Ga0126372_104703053 | 3300010360 | Tropical Forest Soil | MVKKPEPPLLSTFDGFGSRTAIWLGTVEASDERDAIQSGAREIQRVARERGIPVAWLIARRR*
Ga0126372_109167822 | 3300010360 | Tropical Forest Soil | MQSPNPQASPLSSWDVYRQAHKAIWLGTVEASDERDAIEKVAKERNIPANILIATQRR*
Ga0126372_110856582 | 3300010360 | Tropical Forest Soil | MAMQSPNPQASPLSPWDVYRQAHKAIWLGTVEAIDEREAIEKVAKERNIPANRLIAARHRQGTGGGYSD*
Ga0126372_119648131 | 3300010360 | Tropical Forest Soil | LKQQSPDPLASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERDIPANRLIATRRH
Ga0126372_123323201 | 3300010360 | Tropical Forest Soil | PNPQMSPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERSIPANRLIATRHRQRASQA*
Ga0126377_121471792 | 3300010362 | Tropical Forest Soil | MVKKPEPLLLSTFDVFRLAHKAIWLGTVQASDERDAIQRVDRERDIPAAWLITRRR*
Ga0126377_126352742 | 3300010362 | Tropical Forest Soil | MAMQSPGPQAPPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERDIPANRLIATRRR
Ga0126379_126081122 | 3300010366 | Tropical Forest Soil | MAMQSPNPQAPSLSSWDVYRQAHKAIWLGTVEASDERNAIEKVAKERNIPAKLLIARQR*
Ga0137392_100103581 | 3300011269 | Vadose Zone Soil | MVKKLKPPLLSTFDVFRVAHKAIWLGTVEASDERDAIQRVARERDIPAAWLIARRR*
Ga0137383_110683161 | 3300012199 | Vadose Zone Soil | MAKQPEPPAPLSSFDVYKAASKALWLAAIDATDEGDAIERVTKERNVPAAKLIATPRR*
Ga0137363_100155464 | 3300012202 | Vadose Zone Soil | MPRKPEPPPPSTWDVFKIASKAVWLATVEATNERDAIERVAKERNVSADRLIATRR*
Ga0137363_109846051 | 3300012202 | Vadose Zone Soil | MAMQSPNPQASPLSSWDVYRQAHKAIGLGTVEATDERDAIEKVAKERNIPAKRLIATRHR
Ga0137362_115886501 | 3300012205 | Vadose Zone Soil | MAKQSPNPEASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPANRLRATRHR
Ga0137377_100383015 | 3300012211 | Vadose Zone Soil | MPRKPEPPPLSTWDVFRVAHRAIWLGTVEATDERDAIEKVAKERDIPAAMLIATRRR*
Ga0137360_103973771 | 3300012361 | Vadose Zone Soil | PHKPEPPPLSSWDVFKLAKKAVWLATIEATDEHDAIKKVAEERNIPAYRLIATRR*
Ga0137360_109820951 | 3300012361 | Vadose Zone Soil | MPRKPEPPPLSTWDVFRVAHKAIWLGTVEATDERDAIEKVAKERNIPAARLIATQR*
Ga0137360_113052891 | 3300012361 | Vadose Zone Soil | MARKPEPPLLSTFDVFRVAHKAIWLGTVEASDERDAIQRVARERDVPAAWLMATRRR*
Ga0137361_116041821 | 3300012362 | Vadose Zone Soil | QIQSPRPILGGMVKKPDPPPLSTFDVFKLAKKAVWLATVEATDERDAIERVAKERNVQPPG*
Ga0137398_100623181 | 3300012683 | Vadose Zone Soil | EWRVPRRATQGFGGMAMQSPNPQASPLLSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR*
Ga0137398_101272202 | 3300012683 | Vadose Zone Soil | MTRKLEPPPLSTWDVFRVAHKAIWLGTVEATDERDAIERVAKERDIPAARLIATQRR*
Ga0137398_109103901 | 3300012683 | Vadose Zone Soil | MARKPEPLSLSSCDVFKIASKAIWLATIEATDERDAIERVAQERDIPANRLIATQRR*
Ga0137395_101391411 | 3300012917 | Vadose Zone Soil | MAMQSPNPQASPLLSWDVYRQADKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR
Ga0137395_105266792 | 3300012917 | Vadose Zone Soil | NPQAPPLSSWDVYRHAHKAIWLGTVEATDERDAIEKVAKERNIPANWLIATRRR*
Ga0137395_105282042 | 3300012917 | Vadose Zone Soil | MPRKPEPPPLSTWDVFRVAHKAIWLGTVEAADERDAIERVAKERNVPVAWLIATQRR*
Ga0137396_109251032 | 3300012918 | Vadose Zone Soil | MANQREDCIDGMAKLAEPPPPSSFDVFKIAKKAVWLATVEATNEHDAIERAAKERNVPAARLIAT
Ga0137396_110445351 | 3300012918 | Vadose Zone Soil | MAMQSPNPQASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVTKERNIPANRLIATRHR
Ga0137396_110642821 | 3300012918 | Vadose Zone Soil | HWPILAGMPRKPEPPPLSSFDVFKLASKAVWLATIEAADEHDAIERVAKERNVPAARLIATQRR*
Ga0137359_115936431 | 3300012923 | Vadose Zone Soil | MPRKPEPPPLTSWEVFKIASKAVWLATVEAADEHDAIKMVAKERNISANRLIATRRQ*
Ga0137359_117799332 | 3300012923 | Vadose Zone Soil | MAKKPDPPPLSSFDVFKIASKAVWLATIEATNEHDAIERVAKERNVPAARLIATQRR*
Ga0137419_101897764 | 3300012925 | Vadose Zone Soil | MLRKPEPPPPSTWDVFKIASKAVWLATVEATNERDLIERVAKERNV*
Ga0137416_102824012 | 3300012927 | Vadose Zone Soil | RATQGFGGMAMQSPNPQASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR*
Ga0137416_109933363 | 3300012927 | Vadose Zone Soil | MPRKPEPPPPSTWDVFKIASKAVWLATVEATDERDAIERVAKERDIPAARLIATQRR*
Ga0137416_115816632 | 3300012927 | Vadose Zone Soil | MAKKPEPPALSSWDVYKLASKAVWLATIEATDEHDAIEKVAKERNVPAARLLATRRR*
Ga0137416_120501921 | 3300012927 | Vadose Zone Soil | LPSMARKPEPPLLSTFDVFRVAHKAIWLGTVEASDERDAIQRVARERDVPAAWLMATRRR
Ga0137410_100377771 | 3300012944 | Vadose Zone Soil | VKKPKPPPLSTFDVFKLASKAVRLATVEATNELDAIKKVSKKRRIRAYRLIATRR*
Ga0137410_101321181 | 3300012944 | Vadose Zone Soil | MVKKTDPPPLSTWDVFKLASKAVWLATVEATDERDAIERVAKERNVPASKLIAVLRR*
Ga0137410_102875691 | 3300012944 | Vadose Zone Soil | MARKPEPPLLSTFDVFRVAHKAIWLGTVEASDERDAIQRVARERDIPAAWLIARRR*
Ga0137410_103725543 | 3300012944 | Vadose Zone Soil | MAKKPETPALSSWDVFKLASKAVWLATIEATDERDAIERVAKERDIPAARLIATQRR*
Ga0137410_107896703 | 3300012944 | Vadose Zone Soil | MERKPDPPPLSTFDVFKIAAKAVWLATVEATDEHDAIERVASERNLPANKL
Ga0126369_109329331 | 3300012971 | Tropical Forest Soil | MVKKPEPPLLSTFDVFRVAHKAIWLGTVEASDERDAIQRVARERDIPAACLIARRR*
Ga0126369_129789791 | 3300012971 | Tropical Forest Soil | MTMQSPNLQASPQSSWDVYRQAHKAIWLGTVEATDEREAIEKVAKERSIPANRLIATRHR
Ga0126369_134324941 | 3300012971 | Tropical Forest Soil | MVMQSFNPQASPLSSWDVYRQAHKEIWLGTVEASDERNAIEKVAKERNIPAKLLIARQR*
Ga0137418_101535843 | 3300015241 | Vadose Zone Soil | MERKPDPPSLSSFDVFKLAKKAVWVATVEATDEHDAINKVAQESNIPAYRLIATRR*
Ga0137418_101724554 | 3300015241 | Vadose Zone Soil | MVKKPAPPLLSTFDVFRVAHKAIWLGAVEASDERDAIQRVARERDVPAAWLMATRRR*
Ga0187783_101999032 | 3300017970 | Tropical Peatland | MANESPNPQASPLSPWDVYRQAHKAIWLGTVEATNERDAIEKVANERNIPANRLIATRHR
Ga0187783_103116562 | 3300017970 | Tropical Peatland | MGKKPETPPLLSWDVYRLAHRAIWLTIVEATDERDAIEKDERNIPANRLIATPRR
Ga0187782_101855834 | 3300017975 | Tropical Peatland | MGKKPESLPLLSWDVYRLAHRAIWLGIVEAMDERDAIEKVANERNIPAKMLIAMRHR
Ga0210395_105774081 | 3300020582 | Soil | MAKQSPDPQASPPSFWDVYRQAHKAIWLGIVVATDERDAIEKVAKERNIPANRLIATRRR
Ga0179596_102080432 | 3300021086 | Vadose Zone Soil | MPRKPEPPPLSTWDVFRVAHKAIWLGTVEAADERDAIERVAKERNVPVAWLIATQRR
Ga0179596_103351022 | 3300021086 | Vadose Zone Soil | MTRKLEPPPLSTWDVFRVAHKAIWLGTVEATDERDAIEKVAKERNIPAARLIAT
Ga0179596_105314852 | 3300021086 | Vadose Zone Soil | MANQREDCIDGMAKLAEPPPPSSFDVFKIAKKAVWLATVEATNEHDAIERAAKERNVPAARLIATRRR
Ga0179596_107115632 | 3300021086 | Vadose Zone Soil | MPHKPEPPPLSSWDVFKLAKKAVWLATIEATDEHDAIKKVAEERNIPAYRLIATRR
Ga0210408_111539201 | 3300021178 | Soil | AGCRRTTQGSGGMAKQSPDPQASPPSFWDVYRQAHKAIWLGIVVATDERDAIEKVAKERNIPANRLIATRRRS
Ga0187846_101515852 | 3300021476 | Biofilm | MATNSQASPLSSWDVYRQAHKAVWLGTVEATDERDAIEKVAKERNIPANRLIATQRR
Ga0210402_103901264 | 3300021478 | Soil | MPFKSKPPPLLTFDVFKLASKTLWLAAVEATNELDAIKMVSKKRRIRAYRLIATRRQ
Ga0209240_10205763 | 3300026304 | Grasslands Soil | MQSPNPQASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR
Ga0209131_10919643 | 3300026320 | Grasslands Soil | MPRKPEPPPPSTWDVFKIASKAVWLATVEATNERDAIERVAKERNVSADRLIATRR
Ga0209131_13863891 | 3300026320 | Grasslands Soil | MAKQSPNPEASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPANRLIAMRHR
Ga0257162_10100784 | 3300026340 | Soil | SHRRANLEPMYLAADTAGMPRKPELPPLCSWDVYKIASKAVWLATIEATDERDAIERVAKERNVPAARLIATQRR
Ga0257149_10377212 | 3300026355 | Soil | MYLAADTAGMPRKPELPPLCSWDVYKIASKAVWLATIEATDERDAIERVAKERNVPAARLIATQRR
Ga0257150_10091752 | 3300026356 | Soil | MPRKPEPPPLSSFDVFKLASKAVWLATVEATDELDAIKKVSKKRRIRAYRLIATRRQ
Ga0257150_10409911 | 3300026356 | Soil | PRKPEPPPLSTWDVFRVAHKAIWPGTVEATDEHDAIERVAKERDIPAARLMATQRR
Ga0257150_10665512 | 3300026356 | Soil | MTRKPEPPPLSTSDVFKIASKAVWLATVEATNERDSIERVAKERNVSADRLIATRR
Ga0257166_10288561 | 3300026358 | Soil | MERRSPNPQASPPSSWDGYRQAHKAIWLGTVEATDERDAIEKVANERNIPANRLIATRHR
Ga0257163_10041905 | 3300026359 | Soil | QGYLAADTAGMPRKPELPPLCSWDVYKIASKAVWLATIEATDERDAIERVAKERNVPAARLIATQRR
Ga0257163_10098232 | 3300026359 | Soil | MAKQPPDPPPLSTFDVFRVAHKAIWLGTVEATDERDAIEKVANERNIPANRLIATQRR
Ga0257176_10204201 | 3300026361 | Soil | MQSPNPQASPLLSWDVYRQADKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR
Ga0257146_10201954 | 3300026374 | Soil | MPRKPEPPPPSTWDVFKIASKAVWLATVEATNERDAIERVAKERNVPAARLIATQRR
Ga0257167_10238313 | 3300026376 | Soil | MPRKPEPPPPSTWDVFKIASKAVWLATVEATNERDLIERVAKERNV
Ga0257177_10868001 | 3300026480 | Soil | MYLAADTAGMPRKPELPPLCSWDVYKIASKAVWLATIEATDERDAIERVAKERNVPAARLIATQ
Ga0257155_10239613 | 3300026481 | Soil | MTRKLEPPPLSTWDVFRVAHKAIWLGTVEAADERDAIERVAKERNVPVAWLIATQRR
Ga0257155_10292991 | 3300026481 | Soil | RRATQGFGGMAMQSPNPQASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVANERNIPANRLIATRHR
Ga0257172_10162842 | 3300026482 | Soil | EGCDEWRVPRRATQGFGGMAMQSPNPQASPLLSWDVYRQADKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR
Ga0257159_10244511 | 3300026494 | Soil | MTRKPEAPPLSTFDVLKIASKAVWLGTVEAIDERGAIEKVAKERNVPAARLIATQRR
Ga0257165_11072571 | 3300026507 | Soil | AGMPRKPEPPPPSTWDVFKIASKAVWLATVEVTNERDAIERVAKERNVSADRLIATRR
Ga0257161_10133274 | 3300026508 | Soil | DTCRMPRKPEPPPLSTWDVFRVAHKAIWLGTVEAADERDAIERVAKERNVPVAWLIATQR
Ga0257161_10927391 | 3300026508 | Soil | PQASPLSSWDVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPANRLIATRHR
Ga0209648_100408684 | 3300026551 | Grasslands Soil | MPRKPEPPPLSSCDVFKIASKAIWLATIEATDERDAIERVAQERNIPANRLIATQRR
Ga0209648_103413782 | 3300026551 | Grasslands Soil | MARKPEPPPLSTWDVYKIASKAVWLATIEATDEHDAIERVAKARNVPAARLIATQRR
Ga0209648_103948443 | 3300026551 | Grasslands Soil | MPHKPEPPPLSTWDVYKIASKAVWLATIEAIDERDAIERVAKERNVPAARLIATQRR
Ga0209701_104903072 | 3300027862 | Vadose Zone Soil | MARKPEPPLSTWDVFKITKKAVWLATIEATDERDTIEWVAKERDIPAARLIATRRR
Ga0209590_103205663 | 3300027882 | Vadose Zone Soil | MQSPNPQASPLLSWEVYRQAHKAIWLGTVEATDERDAIEKVAKERNIPAKRLIATRHR
Ga0209488_104652501 | 3300027903 | Vadose Zone Soil | MVKKPDPPPLSTFDVFKLAKKAVWLATVEATDERDAIERVAKERNVQPPG
Ga0209488_106195641 | 3300027903 | Vadose Zone Soil | MARKPQPPPPSTCDVFKIASKALWLATIEATDERDAIERVAKERNVPAARLIATQRR
Ga0137415_102705623 | 3300028536 | Vadose Zone Soil | MPRKPEPPPPSTWDVFKIASKAVWLATVEATDERDAIERVAKERDIPAARLIATQRR
Ga0137415_105303002 | 3300028536 | Vadose Zone Soil | LNAQSRWPILPGMERKPDPPPLSTFDVFKIAAKAVWLATVEATDEHDAIEKVAKERNVPAARLLATRRR
Ga0137415_113633451 | 3300028536 | Vadose Zone Soil | MPRKPEPPPLSTWDVFRVAHKAIWLGTVEATDERDAIEKVAKERNIPAARLIATQR
Ga0306924_114407333 | 3300032076 | Soil | MAKKPTTPPLYSFDVYKAASKAVWLAAIEATDERDAIERVANERNLPANKLIASRRR
Ga0306920_1005250672 | 3300032261 | Soil | MTRKPEPPPLSTWDVFRVAHKAIWLGTVEATDERDAIERVAKQRDIPAARLIATQRR
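
Rows like these can be turned into FASTA records for use in other tools. The sketch below is only an illustration and rests on two assumptions: the protein ID is the scaffold name plus a gene index, split off ahead of the 10-digit taxon ID, and a trailing `*` marks a stop codon to be dropped. The function names and header layout are hypothetical.

```python
def clean_sequence(seq: str) -> str:
    """Drop a trailing '*' (assumed here to mark a stop codon)."""
    return seq.rstrip("*")


def to_fasta(records, width=60):
    """Format (protein_id, taxon_id, habitat, sequence) tuples as FASTA text."""
    lines = []
    for protein_id, taxon_id, habitat, seq in records:
        seq = clean_sequence(seq)
        lines.append(f">{protein_id} taxon={taxon_id} habitat={habitat}")
        # Wrap the residues at a fixed line width.
        lines.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(lines) + "\n"


# One family member whose residues match the representative sequence.
records = [
    ("Ga0137360_109820951", "3300012361", "Vadose Zone Soil",
     "MPRKPEPPPLSTWDVFRVAHKAIWLGTVEATDERDAIEKVAKERNIPAARLIATQR*"),
]
print(to_fasta(records), end="")
```

Stripping the marker before measuring length matters for comparisons such as the family's average sequence length, since the `*` is not a residue.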




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice