NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F077904

Metagenome Family F077904

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F077904
Family Type Metagenome
Number of Sequences 117
Average Sequence Length 53 residues
Representative Sequence MSGRIRMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV
Number of Associated Samples 38
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 78.95 %
% of genes near scaffold ends (potentially truncated) 26.50 %
% of genes from short scaffolds (< 2000 bps) 85.47 %
Associated GOLD sequencing projects 29
AlphaFold2 3D model prediction Yes
3D model pTM-score0.45

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (35.043 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine
(88.034 % of family members)
Environment Ontology (ENVO) Unclassified
(82.906 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline)
(99.145 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 54.22%    β-sheet: 0.00%    Coil/Unstructured: 45.78%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.45
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 116 Family Scaffolds
PF07508Recombinase 19.83
PF00239Resolvase 13.79
PF00589Phage_integrase 5.17
PF14311DUF4379 4.31
PF01435Peptidase_M48 4.31
PF03703bPH_2 3.45
PF07661MORN_2 1.72
PF01380SIS 1.72
PF10544T5orf172 0.86
PF02493MORN 0.86
PF01757Acyl_transf_3 0.86
PF02796HTH_7 0.86
PF02086MethyltransfD12 0.86
PF10754DUF2569 0.86
PF01396zf-C4_Topoisom 0.86
PF08378NERD 0.86
PF13156Mrr_cat_2 0.86

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 116 Family Scaffolds
COG1961Site-specific DNA recombinase SpoIVCA/DNA invertase PinEReplication, recombination and repair [L] 33.62
COG2452Predicted site-specific integrase-resolvaseMobilome: prophages, transposons [X] 13.79
COG3402Uncharacterized membrane protein YdbS, contains bPH2 (bacterial pleckstrin homology) domainFunction unknown [S] 3.45
COG3428Uncharacterized membrane protein YdbT, contains bPH2 (bacterial pleckstrin homology) domainFunction unknown [S] 3.45
COG2849Antitoxin component YwqK of the YwqJK toxin-antitoxin moduleDefense mechanisms [V] 2.59
COG3392Adenine-specific DNA methylaseReplication, recombination and repair [L] 0.86
COG4642Uncharacterized conserved proteinFunction unknown [S] 0.86
COG0338DNA-adenine methylaseReplication, recombination and repair [L] 0.86


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms64.96 %
UnclassifiedrootN/A35.04 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001589|JGI24005J15628_10015669All Organisms → Viruses → Predicted Viral3374Open in IMG/M
3300001589|JGI24005J15628_10017413All Organisms → Viruses → Predicted Viral3171Open in IMG/M
3300001938|GOS2221_1019320All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → alpha proteobacterium SCGC AAA160-J141691Open in IMG/M
3300005239|Ga0073579_1015115All Organisms → Viruses → Predicted Viral1406Open in IMG/M
3300008964|Ga0102889_1138661Not Available714Open in IMG/M
3300009172|Ga0114995_10074614All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1913Open in IMG/M
3300009172|Ga0114995_10080398All Organisms → Viruses → Predicted Viral1835Open in IMG/M
3300009172|Ga0114995_10119730Not Available1475Open in IMG/M
3300009172|Ga0114995_10130066All Organisms → Viruses → Predicted Viral1408Open in IMG/M
3300009172|Ga0114995_10137187All Organisms → Viruses → Predicted Viral1367Open in IMG/M
3300009172|Ga0114995_10160777All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon1253Open in IMG/M
3300009172|Ga0114995_10268897All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium940Open in IMG/M
3300009172|Ga0114995_10319763Not Available853Open in IMG/M
3300009172|Ga0114995_10355954All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria804Open in IMG/M
3300009172|Ga0114995_10357625Not Available802Open in IMG/M
3300009172|Ga0114995_10357625Not Available802Open in IMG/M
3300009172|Ga0114995_10548242All Organisms → cellular organisms → Bacteria632Open in IMG/M
3300009173|Ga0114996_10590366All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria826Open in IMG/M
3300009173|Ga0114996_11086466Not Available564Open in IMG/M
3300009193|Ga0115551_1322300Not Available672Open in IMG/M
3300009409|Ga0114993_10396061All Organisms → cellular organisms → Bacteria → Proteobacteria1038Open in IMG/M
3300009409|Ga0114993_10599287All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria810Open in IMG/M
3300009409|Ga0114993_10643116All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → unclassified Pelagibacterales → Pelagibacterales bacterium776Open in IMG/M
3300009420|Ga0114994_10141660All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1630Open in IMG/M
3300009420|Ga0114994_10164065All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1503Open in IMG/M
3300009420|Ga0114994_10399423All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon910Open in IMG/M
3300009420|Ga0114994_10629523Not Available702Open in IMG/M
3300009422|Ga0114998_10066459All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1840Open in IMG/M
3300009422|Ga0114998_10250998All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium TMED199834Open in IMG/M
3300009422|Ga0114998_10260399Not Available816Open in IMG/M
3300009422|Ga0114998_10321780Not Available724Open in IMG/M
3300009422|Ga0114998_10494771All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria573Open in IMG/M
3300009422|Ga0114998_10563333Not Available536Open in IMG/M
3300009422|Ga0114998_10593107Not Available522Open in IMG/M
3300009425|Ga0114997_10115529All Organisms → Viruses → Predicted Viral1622Open in IMG/M
3300009425|Ga0114997_10546463Not Available614Open in IMG/M
3300009425|Ga0114997_10607281Not Available576Open in IMG/M
3300009432|Ga0115005_10717314Not Available803Open in IMG/M
3300009432|Ga0115005_11107285All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria643Open in IMG/M
3300009495|Ga0115571_1042220All Organisms → Viruses → Predicted Viral2155Open in IMG/M
3300009497|Ga0115569_10156974All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1084Open in IMG/M
3300009498|Ga0115568_10445381All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria557Open in IMG/M
3300009507|Ga0115572_10691707Not Available556Open in IMG/M
3300009512|Ga0115003_10025547All Organisms → Viruses → Predicted Viral3888Open in IMG/M
3300009512|Ga0115003_10527120All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria691Open in IMG/M
3300009526|Ga0115004_10114682All Organisms → Viruses → Predicted Viral1646Open in IMG/M
3300009526|Ga0115004_10273099Not Available1004Open in IMG/M
3300009526|Ga0115004_10304397Not Available945Open in IMG/M
3300009526|Ga0115004_10359856All Organisms → cellular organisms → Bacteria → Proteobacteria862Open in IMG/M
3300009526|Ga0115004_10497021Not Available722Open in IMG/M
3300009526|Ga0115004_10587479Not Available660Open in IMG/M
3300009526|Ga0115004_10670119Not Available615Open in IMG/M
3300009526|Ga0115004_10753859Not Available579Open in IMG/M
3300009526|Ga0115004_10906688All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria526Open in IMG/M
3300009705|Ga0115000_10357055All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria937Open in IMG/M
3300009705|Ga0115000_10701671Not Available625Open in IMG/M
3300009705|Ga0115000_10729447Not Available611Open in IMG/M
3300009705|Ga0115000_10971775All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → unclassified Pelagibacterales → Pelagibacterales bacterium518Open in IMG/M
3300009785|Ga0115001_10029239All Organisms → Viruses → Predicted Viral3643Open in IMG/M
3300009785|Ga0115001_10091685All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1991Open in IMG/M
3300009785|Ga0115001_10140181All Organisms → Viruses → Predicted Viral1578Open in IMG/M
3300009785|Ga0115001_10320453All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon980Open in IMG/M
3300009785|Ga0115001_10335303All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon954Open in IMG/M
3300009785|Ga0115001_10365208Not Available907Open in IMG/M
3300009785|Ga0115001_10497924Not Available752Open in IMG/M
3300009785|Ga0115001_10644240Not Available645Open in IMG/M
3300009785|Ga0115001_10869310Not Available542Open in IMG/M
3300009786|Ga0114999_10630809Not Available810Open in IMG/M
3300010883|Ga0133547_10455140All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → Pelagibacteraceae2599Open in IMG/M
3300010883|Ga0133547_10679725All Organisms → Viruses → Predicted Viral2043Open in IMG/M
3300010883|Ga0133547_10682761All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon2037Open in IMG/M
3300010883|Ga0133547_10831290All Organisms → cellular organisms → Bacteria1809Open in IMG/M
3300010883|Ga0133547_10832139All Organisms → Viruses → Predicted Viral1808Open in IMG/M
3300010883|Ga0133547_11330339All Organisms → Viruses → Predicted Viral1361Open in IMG/M
3300010883|Ga0133547_11426574All Organisms → Viruses → Predicted Viral1305Open in IMG/M
3300010883|Ga0133547_11518441All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1256Open in IMG/M
3300010883|Ga0133547_11549916All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → Pelagibacteraceae → Candidatus Pelagibacter → Candidatus Pelagibacter ubique1241Open in IMG/M
3300010883|Ga0133547_11628135All Organisms → Viruses → Predicted Viral1204Open in IMG/M
3300010883|Ga0133547_11656763All Organisms → Viruses → Predicted Viral1191Open in IMG/M
3300010883|Ga0133547_11924776Not Available1087Open in IMG/M
3300010883|Ga0133547_12069011All Organisms → Viruses → Predicted Viral1039Open in IMG/M
3300010883|Ga0133547_12192651Not Available1003Open in IMG/M
3300010883|Ga0133547_12196614All Organisms → Viruses → Predicted Viral1002Open in IMG/M
3300011253|Ga0151671_1167350All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium TMED199667Open in IMG/M
3300020396|Ga0211687_10313612All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria616Open in IMG/M
(restricted) 3300022920|Ga0233426_10307887All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria612Open in IMG/M
3300025137|Ga0209336_10131076All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales679Open in IMG/M
3300025138|Ga0209634_1008774All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria6222Open in IMG/M
3300025138|Ga0209634_1015327All Organisms → Viruses → Predicted Viral4436Open in IMG/M
3300025138|Ga0209634_1148016Not Available962Open in IMG/M
3300025138|Ga0209634_1203651Not Available754Open in IMG/M
3300025168|Ga0209337_1070271All Organisms → Viruses → Predicted Viral1734Open in IMG/M
3300025849|Ga0209603_1102420All Organisms → Viruses → Predicted Viral1274Open in IMG/M
3300025890|Ga0209631_10059113All Organisms → cellular organisms → Bacteria2426Open in IMG/M
3300025892|Ga0209630_10167452All Organisms → Viruses → Predicted Viral1099Open in IMG/M
3300027687|Ga0209710_1032585All Organisms → Viruses → Predicted Viral2546Open in IMG/M
3300027687|Ga0209710_1109194All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1075Open in IMG/M
3300027687|Ga0209710_1200133Not Available681Open in IMG/M
3300027752|Ga0209192_10126233All Organisms → Viruses → Predicted Viral1031Open in IMG/M
3300027752|Ga0209192_10332424Not Available539Open in IMG/M
3300027779|Ga0209709_10226777All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → unclassified Pelagibacterales → Pelagibacterales bacterium847Open in IMG/M
3300027779|Ga0209709_10235624All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium TMED199823Open in IMG/M
3300027779|Ga0209709_10236432All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria821Open in IMG/M
3300027780|Ga0209502_10050238All Organisms → Viruses → Predicted Viral2298Open in IMG/M
3300027780|Ga0209502_10067399All Organisms → Viruses → Predicted Viral1906Open in IMG/M
3300027780|Ga0209502_10104423All Organisms → Viruses → Predicted Viral1432Open in IMG/M
3300027780|Ga0209502_10178984Not Available994Open in IMG/M
3300027788|Ga0209711_10086633All Organisms → Viruses → Predicted Viral1615Open in IMG/M
3300027791|Ga0209830_10054132All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2116Open in IMG/M
3300027791|Ga0209830_10174294All Organisms → Viruses → Predicted Viral1017Open in IMG/M
3300027791|Ga0209830_10217550Not Available882Open in IMG/M
3300027791|Ga0209830_10292635All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium TMED199725Open in IMG/M
3300027801|Ga0209091_10389584All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria634Open in IMG/M
3300027801|Ga0209091_10444950Not Available576Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
MarineEnvironmental → Aquatic → Marine → Oceanic → Unclassified → Marine88.03%
Pelagic MarineEnvironmental → Aquatic → Marine → Pelagic → Unclassified → Pelagic Marine5.13%
Pelagic MarineEnvironmental → Aquatic → Marine → Neritic Zone → Unclassified → Pelagic Marine1.71%
MarineEnvironmental → Aquatic → Marine → Oceanic → Unclassified → Marine0.85%
SeawaterEnvironmental → Aquatic → Marine → Inlet → Unclassified → Seawater0.85%
MarineEnvironmental → Aquatic → Marine → Coastal → Unclassified → Marine0.85%
MarineEnvironmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine0.85%
EstuarineEnvironmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine0.85%
MarineEnvironmental → Aquatic → Marine → Unclassified → Unclassified → Marine0.85%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001589Marine viral communities from the Pacific Ocean - LP-40EnvironmentalOpen in IMG/M
3300001938Marine microbial communities from Bedford Basin, Nova Scotia, Canada - GS005EnvironmentalOpen in IMG/M
3300005239Environmental Genome Shotgun Sequencing: Ocean Microbial Populations from the Gulf of MaineEnvironmentalOpen in IMG/M
3300008964Estuarine microbial communities from the Columbia River estuary - metaG 1551A-02EnvironmentalOpen in IMG/M
3300009172Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_154EnvironmentalOpen in IMG/M
3300009173Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_134EnvironmentalOpen in IMG/M
3300009193Pelagic marine microbial communities from North Sea - COGITO_mtgs_110321EnvironmentalOpen in IMG/M
3300009409Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_150EnvironmentalOpen in IMG/M
3300009420Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_152EnvironmentalOpen in IMG/M
3300009422Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_138EnvironmentalOpen in IMG/M
3300009425Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_136EnvironmentalOpen in IMG/M
3300009432Marine eukaryotic phytoplankton communities from Arctic Ocean - Arctic Ocean - Greenland ARC118M MetagenomeEnvironmentalOpen in IMG/M
3300009495Pelagic marine microbial communities from North Sea - COGITO_mtgs_120531EnvironmentalOpen in IMG/M
3300009497Pelagic marine microbial communities from North Sea - COGITO_mtgs_120503EnvironmentalOpen in IMG/M
3300009498Pelagic marine microbial communities from North Sea - COGITO_mtgs_120426EnvironmentalOpen in IMG/M
3300009507Pelagic marine microbial communities from North Sea - COGITO_mtgs_120607EnvironmentalOpen in IMG/M
3300009512Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB11_88EnvironmentalOpen in IMG/M
3300009526Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB11_90EnvironmentalOpen in IMG/M
3300009705Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_128EnvironmentalOpen in IMG/M
3300009785Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_130EnvironmentalOpen in IMG/M
3300009786Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_126EnvironmentalOpen in IMG/M
3300010883western Arctic Ocean co-assemblyEnvironmentalOpen in IMG/M
3300011253Seawater microbial communities from Japan Sea near Toyama Prefecture, Japan - 2014_2, permeateEnvironmentalOpen in IMG/M
3300020396Marine microbial communities from Tara Oceans - TARA_B100000767 (ERX555915-ERR599122)EnvironmentalOpen in IMG/M
3300022920 (restricted)Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_118_April2016_10_MGEnvironmentalOpen in IMG/M
3300025137Marine viral communities from the Pacific Ocean - LP-32 (SPAdes)EnvironmentalOpen in IMG/M
3300025138Marine viral communities from the Pacific Ocean - LP-40 (SPAdes)EnvironmentalOpen in IMG/M
3300025168Marine viral communities from the Pacific Ocean - LP-53 (SPAdes)EnvironmentalOpen in IMG/M
3300025849Pelagic marine microbial communities from North Sea - COGITO_mtgs_120607 (SPAdes)EnvironmentalOpen in IMG/M
3300025890Pelagic Microbial community sample from North Sea - COGITO 998_met_08 (SPAdes)EnvironmentalOpen in IMG/M
3300025892Pelagic Microbial community sample from North Sea - COGITO 998_met_01 (SPAdes)EnvironmentalOpen in IMG/M
3300027687Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_138 (SPAdes)EnvironmentalOpen in IMG/M
3300027752Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_154 (SPAdes)EnvironmentalOpen in IMG/M
3300027779Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_136 (SPAdes)EnvironmentalOpen in IMG/M
3300027780Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB11_90 (SPAdes)EnvironmentalOpen in IMG/M
3300027788Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB11_88 (SPAdes)EnvironmentalOpen in IMG/M
3300027791Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_130 (SPAdes)EnvironmentalOpen in IMG/M
3300027801Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_128 (SPAdes)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI24005J15628_1001566923300001589MarineMSGRVRMVKWTREIMKNHKPRTPKQILKEKIRIDKIKKELYESDFHEITWDKKCV*
JGI24005J15628_1001741313300001589MarineVSGRIRMVKWTRELMKYYKPPTPKQLXRIKIREQKIKKEIMEDDFHEITWDKKCV*
GOS2221_101932063300001938MarineMRMVKWVKLYQNPNYKTPTPKQLERLNIREQKIMKELYESDS
Ga0073579_101511523300005239MarineMRLVKWVKLYQNPNYKTPTPKQLERLKIRKQKLMKELYDSDFHKITWDKKCV*
Ga0102889_113866123300008964EstuarineLSGRMRLTKWDRLYQKTNYKITTPKQLERLKIREQKIMKELYESDFHEIIWDKKCV*
Ga0114995_1007461423300009172MarineMISGRMKLTKWVRLYQNRNYKPSTPKQLEKLKIREQKLMKELYESDFHEITWDKKCV*
Ga0114995_1008039813300009172MarineVSGRVRMVKWTRELLKNDKPPTPKQLEKLKIREQKIMKELYESDFHEITWHKKCV*
Ga0114995_1011973013300009172MarineVSGRIRMVKWTRELLKNYKPPTPKQLEKLKIREQKIMKELYESDFHEITWDKKCV*
Ga0114995_1013006613300009172MarineMVKWVKLYQNPNYKTPTPKQLERLNIREQKIMKELYESDFHEITWDKKCV*
Ga0114995_1013718733300009172MarineMSGRVRMVKWTKELMKNYKPSTPKQLIRFKIREQRIMKEMMEDDFHEITWDKKCV*
Ga0114995_1016077723300009172MarineMSGRIRMVKWTREIMKNYKSRTTKQILKEKLRVQKIMKKIYESDFHEITWDRKYS*
Ga0114995_1026889713300009172MarineSGRIRMVKWTRELMKNYKPPTPKQLERLKTREQKIMKELYKSDFHEITWDKKCV*
Ga0114995_1031976333300009172MarineMSGRIRMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYESDFYEITWDKKCV*
Ga0114995_1035595413300009172MarineVSGRLRMVKWTRELMKNYKPLTAKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0114995_1035762523300009172MarineMSGKIRMVKWTRELMKNYKPPTPKKLERLKIREQKLMKELYDSDFHEITWDKKCV*
Ga0114995_1035762533300009172MarineMVKWTRELMKNYKPPTPKQLKRIKIREQKIKKEIMEDDFHEITWDKKCV*
Ga0114995_1054824223300009172MarineMVKWVRLYQNLNYKPPTPKQLERLKIREQKLMKELYESDFHEITWDKKCV*
Ga0114995_1067740113300009172MarineKEIMKNYKSRTPEQILKEKIRTDKIKKKIYESDFHEITWDKKYI*
Ga0114996_1059036613300009173MarineVSGRIRMVKWTRELMKNYNPPTPKQLERLKIRDQKIMKEIYESDFHEITWDKKCV*
Ga0114996_1108646633300009173MarineSGRIRMVKWTRELMKNYNPPTPKQLEKHKIREQKIMKELYESDFHEITWDS*
Ga0115551_132230013300009193Pelagic MarineSRYSKNCIKRGCIVSGRVRMVKWTRELMKNYKPPTPKQLKRIKIREQKIMKELYESDFHEITWDKKCV*
Ga0114993_1039606123300009409MarineMSGRIRMVKWTRELMKNYKPTTPKQLERLKIREQKLMKELYQSDFHEITWDKKCV*
Ga0114993_1059928733300009409MarineMSGRMRMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0114993_1064311613300009409MarineMSGRIRMVKWTRELMKNYKPPTPKQLKRIKIREQKIIKELYESDFH
Ga0114994_1014166033300009420MarineMVKWTRELMKNYKPPTPKQLEKLKIREQKLMKELYESDFHEIIWDNKCV*
Ga0114994_1016406523300009420MarineLSGRMRMVKWVKLYQNPNYKTPTPKQLERLNIREQKIMKELYESDFHEITWDKKCV*
Ga0114994_1039942323300009420MarineMSGRIRMVKWTRELMKNYNPPTPKQLEKHKIREQKIMKELYESDFHEITWDS*
Ga0114994_1062952313300009420MarineMVKWTREIMKNHKPRTLKKILKEKIRKDKIKKEIMELDFHEITWDKIY*
Ga0114998_1006645933300009422MarineVSGRIRMVKWTRELLKNYKPPTPKQLEKLKIREQKLMKELYESDFHKITWDKKYV*
Ga0114998_1025099833300009422MarineMSGRVRMVKWTRKLMKNYKPPNPKQLERLKIREQKLMKQLYKSDFHEITWDKKCV*
Ga0114998_1026039923300009422MarineVSGRIRIVKWTRELLKNYKPLTPKQLERLKIREQKIMKELYETYFHEITWDKKCV*
Ga0114998_1032178013300009422MarineMVKWTGELMKNYKPSTPKQLDRLKIREQKIMKELYESDFHEITWDKKCV*
Ga0114998_1049477133300009422MarineMSGRVRMVKWTKELMKNYKPSTPKQLIRLKIREQRIMKEMMEYDFHEITWDKKCV*
Ga0114998_1056333323300009422MarineMSGRIRMVKWTRELLKNFKPPTPKQLEKLKIREQKIMKELYESDFHEITWDKKCV*
Ga0114998_1059310713300009422MarineLSGRMRMVKWVKLYQNPNYKTPTPKQLERLKIRKKKLMKELYQSDFHEITWDKKCV*
Ga0114997_1011552923300009425MarineMSGRIRMVKWSRELMKNHKSRTPKQLEKLKIREQKIMKELYESDFHEITWDKKCV*
Ga0114997_1043040033300009425MarineMRMVKWTREIMKNHKPRTSKQILKEKIRTDKIKKEIMESDFHEITWDKIY*
Ga0114997_1054646323300009425MarineMSGRIRMVKWTRELMKNYKPPTPKQLEKLMIREQKLMKELYESDFHEITWDKKCV*
Ga0114997_1060728123300009425MarineLVSGRIRMVKWTRELMKNYKPTTPKQLERLKIREQKIMKELYDSDFHEITWDKKCV*
Ga0115005_1071731423300009432MarineMSGRIRMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0115005_1110728513300009432MarineMRMVKWVKLYQNPNYKTPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0115571_104222033300009495Pelagic MarineMSGRVRMVKWTRELMKNYNPPTPKQLEKHKIRELKLMKELYESDFHEITSDKKCV*
Ga0115569_1015697413300009497Pelagic MarineLSGRMRLTKLVRLYQNPNYNPPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0115568_1044538123300009498Pelagic MarineMSGRVRMVKWTRELMKNYKPPTPKKLKRLKIREQKIMKELYESDFHEITWDKKCV*
Ga0115572_1069170713300009507Pelagic MarineVSGRVRMVKWTRELMKNYTPPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0115003_1002554743300009512MarineMVKWTRELMKNYKTPTPKQLERLKIREQKLMKELYESDFHEITWDKKCV*
Ga0115003_1052712023300009512MarineVSGRIRMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYESNFHEITWDKKCV*
Ga0115003_1073740823300009512MarineMSGRVRMVKWTREIMKNHKPRTPKQILKEKIRMDKIKKEIIESDFHVIT*
Ga0115004_1011468213300009526MarineLSGRMRMVKWVKLYQNPNYNPPTPKQLERLKIREQKLMKELYESDFHEITWDKKCV*
Ga0115004_1027309913300009526MarineRMVKWTRELLKNYKPPTPKQLEKLKIREQKLMKELYESDFHKITWDKKYV*
Ga0115004_1030439733300009526MarineRMVKWTRELLKNYKPPTPKQLEKLKIREQKLMKELYESDFHEITWDKKYV*
Ga0115004_1035985623300009526MarineVSGRIRMVKWTRELMKYYKPPTPKQLKRIKIREQKIKKEIMEDDFHEITWDKKCV*
Ga0115004_1049702123300009526MarineTKWVRLYQNPNYKTPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0115004_1058747923300009526MarineMSGRVRMVKWTRELLKNYNSPTPKQLERLKIREQKLMKQLYKSDFHEITWDKKCV*
Ga0115004_1067011923300009526MarineMSGRMRLVKWIRLYQNPNHNPPTPKQLEKLKIREQKIMKELYESDFHEITWDKKCV*
Ga0115004_1075385913300009526MarineMVKWTRELMKNYKPPTPKQLESLKIIEQKIMKELYESDFHEITWD
Ga0115004_1090668823300009526MarineVSGRIRMVKWTRELLKNYKPPTPKQLEKLKIREQKLMKELYESDFH
Ga0115000_1035705513300009705MarineVSGRIRMVKWTRELMKNYKPPTPKQLERLKIIEQKIKKELYESDFHEITWDKKCV*
Ga0115000_1070167113300009705MarineMSGRVRMVKWTRELMKNYKPPTPKQLERHKIREQKIMKEIYESDFHEITWDKKCV*
Ga0115000_1072944713300009705MarineMSGRVRMVKWTREIMKNHKPRTPKQILKEKIRKDKIKKEIMELDFHEITWDKIY*
Ga0115000_1097177523300009705MarineVSGRIRMVKWTRELMKNYKPPTPKQLEKLKIREQKLMKELYESDFHEIIWDNKCV*
Ga0115001_1002923953300009785MarineMRMVKWVRLYQNPNYKTPTPKQLERLKIREQKIMKELYELDFHEITWDKKCV*
Ga0115001_1009168533300009785MarineVSGRIRMVKWTRELLKNYKPPTPKQLEKLKIREQKLMKELYESDFHEITWDKKYV*
Ga0115001_1014018113300009785MarineRMVKWTRELMKNYNPPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0115001_1032045323300009785MarineMVKWTRELIKNYKPPTPKQLERLKIREQKIMKELYGSDFHEINWDKICV*
Ga0115001_1033530323300009785MarineVSGRIRMVKWTRELMKYYKPPTPKQLKRIKIREQKIKKELMEDDFHEI
Ga0115001_1036520833300009785MarineCIMSGRIRMVKWTRELMKNYNPPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0115001_1049792413300009785MarineMVKWVKLYQNPNYKTPTPKQLERLNIREQKIMKELYDSDFHEITWDKKCV*
Ga0115001_1064424023300009785MarineMVKWTRELMKNYKPPTPKQLERLKIREQKLMKELYESDFHEITWDKKCV*
Ga0115001_1086931013300009785MarineMVKWVKLYQNPNYKTPTPKQLERLKIRKKKLMKELYQSDFHEITWDKKCV*
Ga0114999_1063080933300009786MarineMEKIISGRVRMVKWTRELMKNYKPPTPKQLERLKLREQKTMKELYESDFHEITWDKKCV*
Ga0133547_1045514023300010883MarineVSGRIRMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYDSDFYEITWDKKCV*
Ga0133547_1067972523300010883MarineMSGRIRMVKWTRELMKNYKTPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0133547_1068276133300010883MarineMSGRVRMVKWTREIMKNYKSITPEQILKEKIRTDKIKKKIYESDFHEITWDKKYI*
Ga0133547_1083129033300010883MarineMSGRIRMVKWSRELMKNHKSRTPKQLERHKIREQKIMKELYESDFHEITWDKKCV*
Ga0133547_1083213923300010883MarineMSGRIRMVKWTRELLKNFKPPTPKQLEKLKIREQKIMKELYESDFYEITWDKKCV*
Ga0133547_1133033933300010883MarineMISGIMRMVKWTREIMKNYKPPTPKQLEKRKIREQKIIKELYVSDFHEITWDKKCV*
Ga0133547_1142657423300010883MarineMSGRIRMVKWTRELMKNYKPPTPKQLEKLKIREQKLMKELYESDFHEIIWDNKCV*
Ga0133547_1151844113300010883MarineMVKWIKLYQNPNYKTPTPKQLERLKIREQKLMKELYESDFHEITWDKKCV*
Ga0133547_1154991623300010883MarineMSGRVRMVKWTRELMKNYNPPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0133547_1162813513300010883MarineNSKDSIKRGCIVSGRIRMVKWTRELMKNYKPPTPKQLERLKIREQKLMKKLYESDFHEITWDKKCV*
Ga0133547_1165676323300010883MarineMRMVKWVRLYQNPNYKTPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV*
Ga0133547_1192477633300010883MarineWTRELIKNYKPPTPKQLERLKIREQKIMKELYGSDFHEINWDKICV*
Ga0133547_1206901113300010883MarineSGRMRMAKWVKLYQNPNYKTPTPKQLERLKIRKKKLMKELYESDFHEITWDKKCV*
Ga0133547_1219265133300010883MarineVSGRMRMVKWVRLYQNPNYKTPTPKQLEKLKIREQKIMKELYESDFHQITWDKKCV*
Ga0133547_1219661423300010883MarineMSGRIRMVKWTRELMKNYKPTTPKQLERLKIREQKIMKELYDSDFHEITWDKKCV*
Ga0151671_116735023300011253MarineMSGRIRMIKWTRELMKNYKLPTPKQLERLKIRKKKLMKELYDS
Ga0211687_1031361213300020396MarineMSGRIRMIKWTGELMKNYKPPTPKQLEKLKIREQKIMKEIYVSDFHEIT
(restricted) Ga0233426_1030788713300022920SeawaterMSGRVRMVKWTREIMKNHKPRTPKQILKEKIRIDKIKKELYESDFHEITWNKKCV
Ga0209336_1013107613300025137MarineMSGRMRMVKWIRLYQNPKYKPPTPKQLDRLKIREQKLMKELYESDFHEITWDKKCV
Ga0209634_1008774123300025138MarineVSGRIRMVKWTRELMKYYKPPTPKQLKRIKIREQKIKKEIMEDDFHEITWDKKCV
Ga0209634_101532743300025138MarineMSGRVRMVKWTREIMKNHKPRTPKQILKEKIRIDKIKKELYESDFHEITWDKKCV
Ga0209634_114801613300025138MarineGCIVSGRIRMVKWTRELMKYYKPPTPKQLERLKTREQKIMKELYESDFHEITWDKKCV
Ga0209634_120365123300025138MarineYNQESNGVCMSGRIRLAKWTREIMKNHQQRTPKQILKENIRTDKIKKELYESDFHEITWDKKCV
Ga0209337_107027123300025168MarineVSGRIRMVKWTRELMKYYKPPTPKQLERLKTREQKIMKELYESDFHEITWDKKCV
Ga0209603_110242023300025849Pelagic MarineMSGRVRMVKWTRELMKNYNPPTPKQLEKHKIRELKLMKELYESDFHEITWDKKCV
Ga0209631_1005911343300025890Pelagic MarineMSRRVRMIKWRRELMKNYKPPTPKQLERLKIREQKLMKELYESDFHEITWDKKCV
Ga0209630_1016745233300025892Pelagic MarineGRVRMVKWTRELMKNYKPPTPKQLERLKIREQKLMKELYESDFHEITWDKKCV
Ga0209710_103258543300027687MarineMRMVKWVKLYQNPNYKTPTPKQLERLKIRKKKLMKELYQSDFHEITWDKKCV
Ga0209710_110919433300027687MarineVSGRIRMVKWTRELLKNYKPPTPKQLEKLKIREQKLMKELYESDFHKITWDKKYV
Ga0209710_120013313300027687MarineMSGRIRMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYGSDFHEINWDKICV
Ga0209192_1012623333300027752MarineVSGRLRMVKWTRELMKNYKPLTAKQLERLKIREQKIMKELYESDFHEITWDKKCV
Ga0209192_1033242413300027752MarineMSGRIRMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYESDFYEIT
Ga0209709_1022677723300027779MarineMSGRIRMVKWTRELMKNYNPPTPKQLEKHKIREQKIMKELYESDFHEITWDS
Ga0209709_1023562423300027779MarineMSGRIRMVKWSRELMKNHKSRTPKQLEKLKIREQKIMKELYESDFHEITWDKKCV
Ga0209709_1023643233300027779MarineVSGRIRMVKWTRELMKNYNPPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV
Ga0209502_1005023813300027780MarineLVSGRIRMVKWTRELMKNHKPPTPKQLERLKIREQKLMKELYESDFHEINWDKKCV
Ga0209502_1006739923300027780MarineMRMVKWVKLYQNPNYKTPTPKQLERLNIREQKIMKELYESDFHEITWDKKCV
Ga0209502_1010442323300027780MarineMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYESDFHEITWDNKCV
Ga0209502_1017898433300027780MarineVSGRIRMVKWTRELMKNYKPQTPKQLERLKIREQKTMKELYESDFHEITWDKKCV
Ga0209711_1008663323300027788MarineVSGRIRMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYESNFHEITWDKKCV
Ga0209830_1005413223300027791MarineVSGRIRMVKWTRELLKNYKPPTPKQLEKLKIREQKIMKELYESDFHEITWDKKCV
Ga0209830_1017429423300027791MarineVSGRIRMVKWTRELMKYYKPPTPKQLEKLKIREQKLMKKLY
Ga0209830_1021755023300027791MarineMISGRMKLTKWVRLYQNRNYKPSTPKQLEKLKIREQKLMKELYESDFHEITWDKKCV
Ga0209830_1029263533300027791MarineMSGRIRMVKWTRELMKNYKTPTPKQLERLKIREQKIMKELYE
Ga0209091_1038958423300027801MarineVSGRIRMVKWTRELMKNYKPPTPKQLEKLKIREQKLMKELYESDF
Ga0209091_1044495013300027801MarineMVKWTRELMKNYKPPTPKQLERLKIREQKIMKELYESDFHEITWDKKCV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.