NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F072853



Overview

Basic Information
Family ID F072853
Family Type Metagenome
Number of Sequences 121
Average Sequence Length 52 residues
Representative Sequence MAASVSNVYDVLDTVKNEVDVLCAQENVSPLLVWLLLRQQADYQLLLLNNPID
Number of Associated Samples 60
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 7.44 %
% of genes near scaffold ends (potentially truncated) 23.14 %
% of genes from short scaffolds (< 2000 bps) 80.17 %
Associated GOLD sequencing projects 53
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.62


Most Common Taxonomy
Group Unclassified (74.380 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine
(40.496 % of family members)
Environment Ontology (ENVO) Unclassified
(85.124 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline)
(81.818 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 51.85%, β-sheet: 0.00%, Coil/Unstructured: 48.15%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.62

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
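
The rule in the note above can be written as a simple threshold check. The sketch below is illustrative only: the function and constant names are hypothetical, and the 0.7 cutoff is taken from the note, not from any NMPFamsDB code.

```python
# Cutoff quoted in the note above: pTM < 0.7 is treated as low confidence.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


if __name__ == "__main__":
    # pTM-score reported for family F072853 on this page.
    print(is_low_confidence(0.62))
```

With the reported pTM-score of 0.62, the check classifies this family's model as low confidence, matching the note.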



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 121 Family Scaffolds
PF00504 | Chloroa_b-bind | 10.74
PF00124 | Photo_RC | 5.79
PF13392 | HNH_3 | 1.65
PF05315 | ICEA | 1.65
PF02867 | Ribonuc_red_lgC | 0.83
PF01176 | eIF-1a | 0.83
PF01503 | PRA-PH | 0.83
PF07681 | DoxX | 0.83
PF02511 | Thy1 | 0.83
PF03330 | DPBB_1 | 0.83
PF07460 | NUMOD3 | 0.83
PF07460NUMOD3 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 121 Family Scaffolds
COG0209 | Ribonucleotide reductase alpha subunit | Nucleotide transport and metabolism [F] | 0.83
COG0361 | Translation initiation factor IF-1 | Translation, ribosomal structure and biogenesis [J] | 0.83
COG1351 | Thymidylate synthase ThyX, FAD-dependent family | Nucleotide transport and metabolism [F] | 0.83
COG2259 | Uncharacterized membrane protein YphA, DoxX/SURF4 family | Function unknown [S] | 0.83
COG4270 | Uncharacterized membrane protein | Function unknown [S] | 0.83



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 74.38 %
All Organisms | root | All Organisms | 25.62 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2189573028|GS313G0146KB_1112079340936 | Not Available | 796
3300001450|JGI24006J15134_10019890 | Not Available | 3105
3300001450|JGI24006J15134_10027028 | Not Available | 2563
3300001450|JGI24006J15134_10040466 | All Organisms → Viruses → Predicted Viral | 1975
3300001450|JGI24006J15134_10061029 | Not Available | 1490
3300001450|JGI24006J15134_10149401 | Not Available | 768
3300001450|JGI24006J15134_10165708 | Not Available | 708
3300001450|JGI24006J15134_10219995 | Not Available | 565
3300001460|JGI24003J15210_10039649 | Not Available | 1646
3300001947|GOS2218_1044121 | All Organisms → cellular organisms → Bacteria | 3436
3300004448|Ga0065861_1028828 | All Organisms → Viruses → Predicted Viral | 2393
3300004448|Ga0065861_1102431 | Not Available | 545
3300004457|Ga0066224_1085813 | All Organisms → Viruses → Predicted Viral | 1768
3300004460|Ga0066222_1159609 | Not Available | 650
3300004460|Ga0066222_1180803 | Not Available | 1134
3300004460|Ga0066222_1181965 | Not Available | 577
3300004461|Ga0066223_1190782 | Not Available | 548
3300005239|Ga0073579_1165475 | Not Available | 968
3300005239|Ga0073579_1174464 | All Organisms → cellular organisms → Bacteria | 14016
3300005239|Ga0073579_1190873 | Not Available | 86332
3300005239|Ga0073579_1505496 | Not Available | 847
3300005239|Ga0073579_1523851 | Not Available | 874
3300005837|Ga0078893_10404687 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria | 832
3300006752|Ga0098048_1023983 | All Organisms → cellular organisms → Bacteria | 2024
3300006752|Ga0098048_1199860 | Not Available | 590
3300006789|Ga0098054_1082853 | Not Available | 1209
3300006793|Ga0098055_1027696 | Not Available | 2368
3300006793|Ga0098055_1044033 | Not Available | 1817
3300006793|Ga0098055_1051310 | Not Available | 1664
3300006793|Ga0098055_1148211 | Not Available | 904
3300006793|Ga0098055_1278370 | Not Available | 627
3300006793|Ga0098055_1300957 | Not Available | 599
3300006919|Ga0070746_10300762 | Not Available | 737
3300006921|Ga0098060_1003690 | Not Available | 5556
3300006921|Ga0098060_1053045 | Not Available | 1196
3300006921|Ga0098060_1100197 | Not Available | 821
3300006924|Ga0098051_1023090 | All Organisms → Viruses → Predicted Viral | 1790
3300006925|Ga0098050_1030720 | Not Available | 1459
3300006925|Ga0098050_1046881 | Not Available | 1145
3300006925|Ga0098050_1114471 | Not Available | 686
3300006990|Ga0098046_1120394 | Not Available | 574
3300007992|Ga0105748_10445233 | Not Available | 562
3300009593|Ga0115011_11528235 | Not Available | 591
3300010150|Ga0098056_1268446 | Not Available | 564
3300010430|Ga0118733_107874646 | Not Available | 552
3300017708|Ga0181369_1059752 | Not Available | 839
3300017724|Ga0181388_1098472 | Not Available | 696
3300017725|Ga0181398_1000040 | Not Available | 34051
3300017726|Ga0181381_1003999 | Not Available | 3742
3300017727|Ga0181401_1042571 | Not Available | 1266
3300017727|Ga0181401_1042572 | All Organisms → Viruses → Predicted Viral | 1266
3300017727|Ga0181401_1047499 | All Organisms → Viruses → Predicted Viral | 1185
3300017727|Ga0181401_1063345 | Not Available | 988
3300017727|Ga0181401_1098551 | Not Available | 745
3300017727|Ga0181401_1164121 | Not Available | 536
3300017728|Ga0181419_1005943 | Not Available | 3761
3300017740|Ga0181418_1018413 | Not Available | 1836
3300017740|Ga0181418_1056952 | Not Available | 967
3300017744|Ga0181397_1013839 | All Organisms → Viruses → Predicted Viral | 2440
3300017750|Ga0181405_1079719 | Not Available | 837
3300017753|Ga0181407_1118631 | Not Available | 660
3300017755|Ga0181411_1028944 | All Organisms → Viruses → Predicted Viral | 1768
3300017756|Ga0181382_1007524 | All Organisms → Viruses → Predicted Viral | 3884
3300017757|Ga0181420_1009700 | Not Available | 3298
3300017757|Ga0181420_1061951 | Not Available | 1187
3300017757|Ga0181420_1156087 | Not Available | 678
3300017757|Ga0181420_1192371 | Not Available | 595
3300017762|Ga0181422_1084303 | Not Available | 1001
3300017762|Ga0181422_1160131 | Not Available | 688
3300017764|Ga0181385_1162092 | Not Available | 677
3300017769|Ga0187221_1015174 | All Organisms → Viruses → Predicted Viral | 2791
3300017770|Ga0187217_1253480 | Not Available | 574
3300017772|Ga0181430_1176493 | Not Available | 615
3300017776|Ga0181394_1082071 | Not Available | 1044
3300017783|Ga0181379_1064594 | Not Available | 1377
3300017786|Ga0181424_10067422 | Not Available | 1546
(restricted) 3300024255|Ga0233438_10016773 | All Organisms → Viruses → Predicted Viral | 4683
(restricted) 3300024255|Ga0233438_10038525 | All Organisms → cellular organisms → Bacteria | 2566
(restricted) 3300024255|Ga0233438_10055242 | Not Available | 1993
(restricted) 3300024255|Ga0233438_10066518 | All Organisms → Viruses → Predicted Viral | 1756
(restricted) 3300024255|Ga0233438_10091119 | All Organisms → Viruses → Predicted Viral | 1414
(restricted) 3300024255|Ga0233438_10097808 | Not Available | 1347
(restricted) 3300024255|Ga0233438_10112544 | All Organisms → Viruses → Predicted Viral | 1221
(restricted) 3300024255|Ga0233438_10123189 | All Organisms → Viruses → Predicted Viral | 1147
(restricted) 3300024255|Ga0233438_10129977 | All Organisms → Viruses → Predicted Viral | 1106
(restricted) 3300024255|Ga0233438_10171237 | Not Available | 914
(restricted) 3300024255|Ga0233438_10265250 | Not Available | 672
(restricted) 3300024255|Ga0233438_10337241 | Not Available | 567
(restricted) 3300024518|Ga0255048_10461612 | Not Available | 615
(restricted) 3300024518|Ga0255048_10632826 | Not Available | 517
(restricted) 3300024520|Ga0255047_10504115 | Not Available | 609
3300025071|Ga0207896_1037915 | Not Available | 809
3300025071|Ga0207896_1069254 | Not Available | 552
3300025084|Ga0208298_1061547 | Not Available | 718
3300025085|Ga0208792_1018613 | Not Available | 1467
3300025085|Ga0208792_1076779 | Not Available | 599
3300025099|Ga0208669_1129535 | Not Available | 506
3300025108|Ga0208793_1015308 | All Organisms → Viruses → Predicted Viral | 2843
3300025108|Ga0208793_1025979 | All Organisms → Viruses → Predicted Viral | 2001
3300025108|Ga0208793_1041153 | Not Available | 1472
3300025108|Ga0208793_1066975 | All Organisms → Viruses → Predicted Viral | 1065
3300025108|Ga0208793_1123586 | Not Available | 705
3300025108|Ga0208793_1142430 | Not Available | 639
3300025141|Ga0209756_1078436 | All Organisms → Viruses → Predicted Viral | 1491
3300025168|Ga0209337_1025161 | Not Available | 3387
3300025168|Ga0209337_1049447 | All Organisms → Viruses → Predicted Viral | 2190
3300025168|Ga0209337_1059105 | All Organisms → Viruses → Predicted Viral | 1946
3300025168|Ga0209337_1060279 | Not Available | 1919
3300025168|Ga0209337_1095823 | All Organisms → Viruses → Predicted Viral | 1396
3300025168|Ga0209337_1186269 | Not Available | 859
3300025168|Ga0209337_1315076 | Not Available | 557
3300025668|Ga0209251_1148749 | Not Available | 613
3300025695|Ga0209653_1102404 | Not Available | 925
3300028194|Ga0257106_1180953 | Not Available | 731
3300028197|Ga0257110_1002589 | Not Available | 8415
3300031519|Ga0307488_10093250 | All Organisms → Viruses → Predicted Viral | 2213
3300031519|Ga0307488_10626789 | Not Available | 620
3300031621|Ga0302114_10087776 | All Organisms → Viruses → Predicted Viral | 1451
3300032073|Ga0315315_10442844 | All Organisms → Viruses → Predicted Viral | 1206
3300032073|Ga0315315_10467933 | Not Available | 1169
3300032073|Ga0315315_10876828 | Not Available | 812

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 40.50%
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 24.79%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 12.40%
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 5.79%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 5.79%
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 2.48%
Seawater | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater | 2.48%
Sackhole Brine | Environmental → Aquatic → Marine → Coastal → Unclassified → Sackhole Brine | 1.65%
Marine Sediment | Environmental → Aquatic → Marine → Coastal → Sediment → Marine Sediment | 0.83%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 0.83%
Marine Surface Water | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine Surface Water | 0.83%
Estuary Water | Environmental → Aquatic → Marine → Coastal → Unclassified → Estuary Water | 0.83%
Marine Estuarine | Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Marine Estuarine | 0.83%
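
Several rows share a habitat label but differ in their GOLD ecosystem path, so it can be useful to sum the distribution per label. The following is a minimal sketch using the (habitat, percentage) pairs copied from the habitat table; the variable and function names are illustrative and not part of NMPFamsDB.

```python
from collections import defaultdict

# (habitat label, % of family members), one pair per table row.
rows = [
    ("Marine", 40.50), ("Seawater", 24.79), ("Seawater", 12.40),
    ("Marine", 5.79), ("Marine", 5.79), ("Marine", 2.48),
    ("Seawater", 2.48), ("Sackhole Brine", 1.65),
    ("Marine Sediment", 0.83), ("Aqueous", 0.83),
    ("Marine Surface Water", 0.83), ("Estuary Water", 0.83),
    ("Marine Estuarine", 0.83),
]


def total_by_habitat(pairs):
    """Sum percentages per habitat label, rounded to two decimals."""
    totals = defaultdict(float)
    for habitat, percent in pairs:
        totals[habitat] += percent
    return {habitat: round(value, 2) for habitat, value in totals.items()}


if __name__ == "__main__":
    # Print habitats from most to least common.
    for habitat, value in sorted(total_by_habitat(rows).items(),
                                 key=lambda item: -item[1]):
        print(f"{habitat}: {value:.2f}%")
```

Grouped this way, the "Marine" and "Seawater" labels together account for most family members; the individual rows sum to slightly more than 100 % because each value is rounded.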




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
2189573028 | Estuarine microbial communities from Columbia River, sample from South Channel ETM site, CMGS313-FOS-0p8-ETM-15m | Environmental
3300001450 | Marine viral communities from the Pacific Ocean - LP-53 | Environmental
3300001460 | Marine viral communities from the Pacific Ocean - LP-28 | Environmental
3300001947 | Marine microbial communities from the Gulf of Maine, Canada - GS002 | Environmental
3300004448 | Marine viral communities from Newfoundland, Canada BC-1 | Environmental
3300004457 | Marine viral communities from Newfoundland, Canada MC-1 | Environmental
3300004460 | Marine viral communities from Newfoundland, Canada BC-1 | Environmental
3300004461 | Marine viral communities from Newfoundland, Canada BC-2 | Environmental
3300005239 | Environmental Genome Shotgun Sequencing: Ocean Microbial Populations from the Gulf of Maine | Environmental
3300005837 | Exploring phylogenetic diversity in Port Hacking ocean in Sydney, Australia - Port Hacking PH4 TJ4-TJ18 | Environmental
3300006752 | Marine viral communities from the Subarctic Pacific Ocean - 13_ETSP_OMZ_AT15268 metaG | Environmental
3300006789 | Marine viral communities from the Subarctic Pacific Ocean - 16_ETSP_OMZ_AT15313 metaG | Environmental
3300006793 | Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaG | Environmental
3300006919 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_21 | Environmental
3300006921 | Marine viral communities from the Subarctic Pacific Ocean - 21_ETSP_OMZ_AT15319 metaG | Environmental
3300006924 | Marine viral communities from the Subarctic Pacific Ocean - 14B_ETSP_OMZ_AT15311_CsCl metaG | Environmental
3300006925 | Marine viral communities from the Subarctic Pacific Ocean - 14_ETSP_OMZ_AT15311 metaG | Environmental
3300006990 | Marine viral communities from the Subarctic Pacific Ocean - 11B_ETSP_OMZ_AT15265_CsCl metaG | Environmental
3300007992 | Coastal water column microbial communities from Columbia River Estuary, Oregon, USA - CMOP_DNA_1461AB_0.2um | Environmental
3300009593 | Marine eukaryotic phytoplankton communities from Atlantic Ocean - Tropical Atlantic ANT8 Metagenome | Environmental
3300010150 | Marine viral communities from the Subarctic Pacific Ocean - 17B_ETSP_OMZ_AT15314_CsCl metaG | Environmental
3300010430 | Marine sediment microbial communities from Gulf of Thailand under amendment with organic carbon and nitrate - JGI co-assembly of 8 samples | Environmental
3300017708 | Marine viral communities from the Subarctic Pacific Ocean - Lowphox_04 viral metaG | Environmental
3300017724 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 11 SPOT_SRF_2010-05-17 | Environmental
3300017725 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 21 SPOT_SRF_2011-04-29 | Environmental
3300017726 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 4 SPOT_SRF_2009-09-24 | Environmental
3300017727 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 24 SPOT_SRF_2011-07-20 | Environmental
3300017728 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 42 SPOT_SRF_2013-04-24 | Environmental
3300017740 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 41 SPOT_SRF_2013-03-13 | Environmental
3300017744 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 20 SPOT_SRF_2011-02-23 | Environmental
3300017750 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 28 SPOT_SRF_2011-11-29 | Environmental
3300017753 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 30 SPOT_SRF_2012-01-26 | Environmental
3300017755 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 34 SPOT_SRF_2012-07-09 | Environmental
3300017756 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 5 SPOT_SRF_2009-10-22 | Environmental
3300017757 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 43 SPOT_SRF_2013-05-22 | Environmental
3300017762 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 45 SPOT_SRF_2013-07-18 | Environmental
3300017764 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 8 SPOT_SRF_2010-02-11 | Environmental
3300017769 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 5 SPOT_SRF_2009-10-22 (version 2) | Environmental
3300017770 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 15 SPOT_SRF_2010-09-15 (version 2) | Environmental
3300017772 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 53 SPOT_SRF_2014-04-10 | Environmental
3300017776 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 17 SPOT_SRF_2010-11-23 | Environmental
3300017783 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 2 SPOT_SRF_2009-07-10 | Environmental
3300017786 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 47 SPOT_SRF_2013-09-18 | Environmental
3300024255 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_123_September2016_10_MG | Environmental
3300024518 (restricted) | Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_2 | Environmental
3300024520 (restricted) | Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_1 | Environmental
3300025071 | Marine viral communities from the Pacific Ocean - LP-36 (SPAdes) | Environmental
3300025084 | Marine viral communities from the Subarctic Pacific Ocean - 14B_ETSP_OMZ_AT15311_CsCl metaG (SPAdes) | Environmental
3300025085 | Marine viral communities from the Subarctic Pacific Ocean - 14_ETSP_OMZ_AT15311 metaG (SPAdes) | Environmental
3300025099 | Marine viral communities from the Subarctic Pacific Ocean - 21_ETSP_OMZ_AT15319 metaG (SPAdes) | Environmental
3300025108 | Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaG (SPAdes) | Environmental
3300025141 | Marine viral communities from the Pacific Ocean - ETNP_6_85 (SPAdes) | Environmental
3300025168 | Marine viral communities from the Pacific Ocean - LP-53 (SPAdes) | Environmental
3300025668 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI073_LV_100m_DNA (SPAdes) | Environmental
3300025695 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - ESP_116LU_22_DNA (SPAdes) | Environmental
3300028194 | Marine microbial communities from Northeast Subartic Pacific Ocean, Canada - LP_J_2011_P26_10m | Environmental
3300028197 | Marine microbial communities from Northeast Subartic Pacific Ocean, Canada - LP_J_2015_P26_10m | Environmental
3300031519 | Sea-ice brine microbial communities from Beaufort Sea near Barrow, Alaska, United States - SB 0.2 | Environmental
3300031621 | Marine microbial communities from Western Arctic Ocean, Canada - AG5_Surface | Environmental
3300032073 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 40m 3416 | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
GS313G0146KB_00627430 | 2189573028 | Marine Estuarine | MAASVSNVYDVLDCVKNEVDVLCAKEDVSPLLVWLLLRQQAD
JGI24006J15134_100198905 | 3300001450 | Marine | MAASVSNIYAVLEGITQQVEELSTQDEVSPLLVWLLLRQQADYQLLLLNNPTV*
JGI24006J15134_100270282 | 3300001450 | Marine | MAAHVSNVYDVLECVKQEVEELCSQEAVSPLLVWLLLRQQADYQLLLLKDSNN*
JGI24006J15134_100404667 | 3300001450 | Marine | MAASVSNIYAVLEGITQQVEELSTQDEVSPLLVWLLLRQQAEYELLLLQHPTD*
JGI24006J15134_100610292 | 3300001450 | Marine | MAAPVSNIYAVLEGITDQVEELAKQDKVSPLLIWLLLRQQAEYQLLLLHNPID*
JGI24006J15134_101494013 | 3300001450 | Marine | MAAPVSNIYAVLEGITDQVEELAEQDKVSPLLIWLLLRQQAEYQLLLLQNPID*
JGI24006J15134_101657081 | 3300001450 | Marine | LMAASVSNIYAVLEGITQQVEELSTQDEVSPLLVWLLLRQQAEYELLLLQHPQT*
JGI24006J15134_102199952 | 3300001450 | Marine | MTATVTSVYDVLDTIKNEVDILCAKENVSPLLVWLVLRQQADYQLLLLSNPID*
JGI24003J15210_100396491 | 3300001460 | Marine | MAASVSNVYDVLDSVKNEVDVLCAQEDVSPLLIWLLLRQQADYQLLLL
GOS2218_10441212 | 3300001947 | Marine | MAASVSNVYDVLECVKQEVEELCKQQEVSPLLVWLLLRQQADYQLLLLVNINDK*
Ga0065861_10288283 | 3300004448 | Marine | MAASVSNVYDVLDNVRNEVHTICIEENVSPLLVWLLLRQQADYQLLLLKNPVN*
Ga0065861_11024312 | 3300004448 | Marine | MAASVTNVYDVLDGVKKEVEDISEKENVSPLLVWLLLRQQADYQLLLLKNPID*
Ga0066224_10858135 | 3300004457 | Marine | MAAPVSNIYAVLEGITEQVEELSKQDKVSPLLVWLLLRQQAEYELLLLQHSTD*
Ga0066222_11596092 | 3300004460 | Marine | MAASVSNVYDVLDCVKQEVEQLCEQQQVSPLLVWLLLRQQADYQLLLLKNPID*
Ga0066222_11808033 | 3300004460 | Marine | CCLMAASVTNVYDVLDGVKNEVDCISAKENVSPMLIWLLLRQQADYELLLITNPTE*
Ga0066222_11819653 | 3300004460 | Marine | MAASVTNVYDVLDGVKNEVDCISAKENVSPMLIWLLLRQQ
Ga0066223_11907822 | 3300004461 | Marine | MAASVSNVYDVLDCVKQEVEELCRQQEVSPLLVWLLLRQQADYQLLLLVNINDK*
Ga0073579_11654754 | 3300005239 | Marine | MAASVSNVYGVLDCVKQEVEELCKQQEVSPLLVWLLLRQQADYQLLLLTNSTNLTN*
Ga0073579_117446412 | 3300005239 | Marine | MAASVSNVYDVLECVKQEVEELCKQQEVSPLLVWLLLRQQADYQLLLLVNINDKLIYAP*
Ga0073579_119087390 | 3300005239 | Marine | MAASVGNVYDVLECVKQEVKELCKQEAVSQQLVWLLLRQQADYQLLLLKDSNN*
Ga0073579_15054962 | 3300005239 | Marine | MAASVSNVYGVLDCVKQEVEELCKREAVSPLLVWLLLRQQADYQLLLLNNPID*
Ga0073579_15238511 | 3300005239 | Marine | MAVSVSSVYDVLDTIKNEVDCLCAKENISPMLVWLLLRQQADYQLLLQQESTD*
Ga0078893_104046873 | 3300005837 | Marine Surface Water | MAASVGNVYDVLDCVKNKVEALCEQEDVSPQLVWLLLRQQADYQLLLLNNPSE*
Ga0098048_10239831 | 3300006752 | Marine | IAGLVTSDAAMAASVSNVYDVLDTIKNEVDCLCAQENISPLLVWLVLRQQADYQLLLLSKPTD*
Ga0098048_11998603 | 3300006752 | Marine | MAASVSNVYDVLDCVKNEVDCICAKENVSPLLIWLLLRQQADYQLLLLQ
Ga0098054_10828531 | 3300006789 | Marine | MAASVTNVYDVLDTIKNEVDCLCAKEDVSPLLVWLVLRQQADYQLLLLSKPTD*
Ga0098055_10276965 | 3300006793 | Marine | MAVPVSNVYDVLDCVKKEVEELCKREAVSPLLVWLLLRQQADYQLLLLQNPTD*
Ga0098055_10440334 | 3300006793 | Marine | MAVPVSNVYDVLDCVKKEVEELCKRETVSPLLVWLLLRQQADYQLLLLQNPID*
Ga0098055_10513102 | 3300006793 | Marine | MAASVTNVYDVLDTIKNEVDCLCAKEDVSPLLVWLVLRQQADYQLLLLSNPVD*
Ga0098055_11482112 | 3300006793 | Marine | MAASVSNVYDVLESVKQEVEELCKQEAVSPLLVWLLLRQQADYQLLLLTNPID*
Ga0098055_12783702 | 3300006793 | Marine | MAASVSNIYDVLDTVKNEVDCVCAQENVSELLVWLLLRQQADYQLLLLQDPIQ*
Ga0098055_13009573 | 3300006793 | Marine | MAASVSNVYDVLDCVKQEVEELCKQEAVSPLLVWLLLRQQA
Ga0070746_103007622 | 3300006919 | Aqueous | MAASVSNVYDVLECVKQEVEELCKQEGISPLLVWLLLRQQADYQLLLLSNPED*
Ga0098060_10036904 | 3300006921 | Marine | MVAKVSCVYDVLECVKDEVEELCRQEEVSPLLVWLLLRQQADYQLLLLNNPPE*
Ga0098060_10530452 | 3300006921 | Marine | MAASVSNVYDVLECVKKEVDVLCAHENVSPLLVWLLLRQQADYQLLLLQDPIK*
Ga0098060_11001972 | 3300006921 | Marine | MAAPVSSVYDVLDCVRNKVEELCVEEELSPLLVWLVLRQQADYQLLLLQDTNE*
Ga0098051_10230906 | 3300006924 | Marine | MASTVGNVYDVLDTIKNEVDCLCAKEDVSPLLVWLVLRQQADYQLLLLSNPVD*
Ga0098050_10307203 | 3300006925 | Marine | MAASVSNIYDVLDVVKNEVDCVCDKENVSPLLIWLLLRQQADYQLLLLQHPID*
Ga0098050_10468812 | 3300006925 | Marine | MAASVTNVYDVLDTIKNEVDCLCAQENVSPLLVWLVLRQQADYQLLLLSNPTD*
Ga0098050_11144713 | 3300006925 | Marine | MAASVSNVYDVLDCVKQEVDELCKQEEVSPLLVWLLLRQQADYQLLLLSNPSD*
Ga0098046_11203943 | 3300006990 | Marine | MAVPVSNVYDVLDCVKKEVEELCKREAVSPLLVWLLLRQQADY
Ga0105748_104452332 | 3300007992 | Estuary Water | MAASVSNVYDVLDTIKNEVDCLCAKENVSPLLVWMVLRQQADYQLLLLSNP
Ga0115011_115282352 | 3300009593 | Marine | MAASVTNVYDVLDTVKNEVDCLCAKENVSPMLVWLLLRQQADYELLLQSNPVD*
Ga0098056_12684461 | 3300010150 | Marine | MAASVSNVYDVLESVKQEVEELCKQEAVSPLLVWLLLRQQAD
Ga0118733_1078746462 | 3300010430 | Marine Sediment | MAASVSNVYDVLDRVKDEVDELCKQENVSPLLVWLLLRQQADYQLLLLTNSTN*
Ga0181369_10597521 | 3300017708 | Marine | MAASVSNVYDVLECVKQEVEELCRQESVSPLLVWLLLRQQ
Ga0181388_10984722 | 3300017724 | Seawater | MAASVMSVYDVLDTIKNEVDILCAKEDVSPLLVWLVLRQQADYQLLLLSNPTD
Ga0181398_100004032 | 3300017725 | Seawater | MAASVSNVYDVLECVKNKVEELCVAEEVSPLLVWLLLRQQADYQLLLLQDSNK
Ga0181381_10039992 | 3300017726 | Seawater | MAVSVSNVYDILECVKAEVKELCKQQQVSPALVWLLLRQQADYQLLLLKEKNQ
Ga0181401_10425712 | 3300017727 | Seawater | MAASVTNVYDVLDTIKNEVDCLCAKENVSPLLVWMVLRQQADYQLLLLSNPTD
Ga0181401_10425721 | 3300017727 | Seawater | TNVYDVLDTIKNEVDVLCAKENVSPLLVWLLLRQQADYQLLLLDNPID
Ga0181401_10474992 | 3300017727 | Seawater | MAASVSNVYDVLDCVKQEVDTLCAQENVSPLLVWLLLRQQADYQLLLLQDPIQ
Ga0181401_10633452 | 3300017727 | Seawater | MTASVTNVYDVLDTIKNEVDILCAKENVSPLLVWLVLRQQADYQLLLLNNPVD
Ga0181401_10985512 | 3300017727 | Seawater | MAASVVNVYDVLDTIKNEVDILCAKEDVSPLLVWLVLRQQADYQLLLLSNPTD
Ga0181401_11641211 | 3300017727 | Seawater | MAASVSNVYDVLDCVKNEVDVLCAKEDVSPLLVWLLLRQQADYQ
Ga0181419_10059431 | 3300017728 | Seawater | MAVSVSNVYDVLECVKQEVEELCKQESVSPLLVWLLLRQQADYQLLLLQESNQ
Ga0181418_10184133 | 3300017740 | Seawater | MTATVTNVYDVLDTIKNEVDCLCAQENVSPLLVWMVLRQQADYQLLLLSNPTD
Ga0181418_10569522 | 3300017740 | Seawater | MAASVSNVYDVLDCVKNEVDVLCAKEDVSPLLVWLLLRQQADYQLLLLQDPIK
Ga0181397_10138395 | 3300017744 | Seawater | MAASVSNVYDVLDTVKNEVDVLCAKENVSPLLIWLLLRQQADYQLLLLSDPID
Ga0181405_10797193 | 3300017750 | Seawater | MAASASNVYDVLDCVKQEVDTLCAQENVSPLLVWLLLRQQ
Ga0181407_11186312 | 3300017753 | Seawater | SVSNVYDVLDCVKNEVDVLCAKEDVSPLLVWLLLRQQADYQLLLLNNPID
Ga0181411_10289442 | 3300017755 | Seawater | MAASVSNVYDVLNCVRNKVEELCVEEDVSELLIWLLLRQQADYQLLLLSDPVD
Ga0181382_100752411 | 3300017756 | Seawater | MAASVSNVYDVLECVKQEVEELCKQESVSPLLVWLLLRQQADYQLLLLQESNQ
Ga0181420_10097007 | 3300017757 | Seawater | MAASVSNVYDVLDCVKQEVDTLCAQENVSPLLIWLLLRQQADYQLLLLQDPIQ
Ga0181420_10619513 | 3300017757 | Seawater | MAASVSNVYDVLECVKQEVEELCLQESVSPQLVWLLLRQQADYQLLLLQDSNN
Ga0181420_11560871 | 3300017757 | Seawater | SNVYDVLECVKQEVEELCLQESVSPQLVWLLLRQQADYQLLLLQDSNE
Ga0181420_11923711 | 3300017757 | Seawater | MAASVSNVYDVLDTVKNEVDVLCAQENVSPLLVWLLLRQQADYQLLLLNNPID
Ga0181422_10843032 | 3300017762 | Seawater | MAASVTNVYDVLDTVKNEVDILCAQENVSPMLVWLLLRQQADYQLLLIQDPID
Ga0181422_11601312 | 3300017762 | Seawater | MAASVSNIYDVLDTVKNEVDCLCAKENVSPLLIWLLLRQQADYQLLLQNDSID
Ga0181385_11620922 | 3300017764 | Seawater | MVASVTNVYHVLDTIKNEVDVLCAKENVSPLLVWLLLRQQADYQLLLLQNPID
Ga0187221_10151749 | 3300017769 | Seawater | MAASVSNVYDVLECVKQEVEELCKQESVSPLLVWLLLRQQADYQLSLLQESNQ
Ga0187217_12534801 | 3300017770 | Seawater | MAASVSNVYDVLECVKQEVEELCRKENVSPLLVWLLLRQQADYQLLLL
Ga0181430_11764933 | 3300017772 | Seawater | MAASVSNVYDVLDTVKNEVDVLCAQENVSPLLIWLLLRQQADYQLLLL
Ga0181394_10820712 | 3300017776 | Seawater | MAGSVTNVYDVLDTIKNEVDCLCAKENVSPLLVWMVLRQQADYQLLLLSNPTD
Ga0181379_10645942 | 3300017783 | Seawater | MTASVTNVYDVLDTIKNEVDILCAKENVSPLLIWLLLRQQADYQLLLLDNPID
Ga0181424_100674223 | 3300017786 | Seawater | MAASVSNVYDVLDTVKNEVDVLCAKENVSPLLVWLLLRQQADYQLLLLQDPTV
(restricted) Ga0233438_100167733 | 3300024255 | Seawater | MAASVTSVYDVLDTIKNEVDILCDKENVSPLLVWLVLRQQADYQLLLLGNPQD
(restricted) Ga0233438_100385258 | 3300024255 | Seawater | MVASVTNVYDVLDTVKNEVDVLCAKEDVSPLLVWLLLRQQADYQLLLISNPTD
(restricted) Ga0233438_100552425 | 3300024255 | Seawater | MVASVSNVYDVLDTIKNEVDILCAKENVSPLLVWLVLRQQADYQLLLLSNPLD
(restricted) Ga0233438_100665184 | 3300024255 | Seawater | MAASVSNVYDVLDTVKNEVDILCAQENVSPLLVWLLLRQQADYQLLLLQDPIK
(restricted) Ga0233438_100911193 | 3300024255 | Seawater | MVATVTNVYDVLDIIKNEVDVLCANENVSPLLVWLVLRQQADYQLLLLSNPED
(restricted) Ga0233438_100978083 | 3300024255 | Seawater | MTATVTNVYDVLDTIKNEVDCLCAKENVSPLLVWMVLRQQADYQLLLLSNPTD
(restricted) Ga0233438_101125443 | 3300024255 | Seawater | MAASVSNVYDVLECVKQEVEELCRQQKVSPLLVWLLLRQQADYQLLLLANINDK
(restricted) Ga0233438_101231892 | 3300024255 | Seawater | MAASVSNVYDVLDTVKNEVDILCAQENVSPLLVWLLLRQQADYQLLLLQDPII
(restricted) Ga0233438_101299772 | 3300024255 | Seawater | MAASVTNVYDVLDTVKNEVDILSAQENVSPMLVWLLLRQQADYQLLLLQDPID
(restricted) Ga0233438_101712372 | 3300024255 | Seawater | MAASVTSVYDVLDTIKNEVDILCDKENISPLLVWLVLRQQADYQLLLLGNPQD
(restricted) Ga0233438_102652503 | 3300024255 | Seawater | MAASVSNVYDVLECVKQEVEELCKQESVSPLLVWLLLRQQADYQLLLLQDPLK
(restricted) Ga0233438_103372412 | 3300024255 | Seawater | MAASVTKVYDVLDTVKNEVDILCAQENVSPLLVWLLLRQQADYQLLLLSDPID
(restricted) Ga0255048_104616123 | 3300024518 | Seawater | MTASVMSVYDVLDTIKNEVDVLCAKENVSPLLVWLVLRQQADYQLLLLSNPTD
(restricted) Ga0255048_106328261 | 3300024518 | Seawater | MAASVTSVYDVLDTIKNEVDILCDKENISPLLVWLVLRQQA
(restricted) Ga0255047_105041151 | 3300024520 | Seawater | MAASVSNVYDVLDCVKNEVDVLCAKEDVTPLLVWLLLRQQA
Ga0207896_10379152 | 3300025071 | Marine | ASVSNIYAVLEGITQQVEELSTQDEVSPLLVWLLLRQQAEYELLLLQHPQT
Ga0207896_10692541 | 3300025071 | Marine | MAASVSNIYAVLEGITQQVEELSTQDEVSPLLVWLLLRQQAEYELLLLQHPTD
Ga0208298_10615472 | 3300025084 | Marine | MAVPVSNVYDVLDCVKKEVEELCKRETVSPLLVWLLLRQQADYQLLLLQNPID
Ga0208792_10186132 | 3300025085 | Marine | MAASVTNVYDVLDTIKNEVDCLCAKEDVSPLLVWLVLRQQADYQLLLLSKPTD
Ga0208792_10767791 | 3300025085 | Marine | MAASVSNVYDVLDCVKQEVDELCKQEEVSPLLVWL
Ga0208669_11295352 | 3300025099 | Marine | MAAPVSSVYDVLDCVRNKVEELCVEEELSPLLVWLVLRQQADYQLLLLQDTNE
Ga0208793_10153081 | 3300025108 | Marine | MAASVSTVYDVLDRVKNEVHGICIDENVSPLLVWLLLRQQA
Ga0208793_10259793 | 3300025108 | Marine | MAASVSNVYDVLESVKQEVEELCKQEAVSPLLVWLLLRQQADYQLLLLTNPID
Ga0208793_10411532 | 3300025108 | Marine | MAASVSNIYDVLDTVKNEVDCVCAQENVSELLVWLLLRQQADYQLLLLQDPIQ
Ga0208793_10669752 | 3300025108 | Marine | MAASVSNVYDVLDCVKQEVDELCKQEEVSPLLVWLLLRQQADYQLLLLSNPSD
Ga0208793_11235862 | 3300025108 | Marine | MAASVSNVYDVLDCVKQEVEELCKQEAVSPLLVWLLLRQQADYQLLLLQDPIK
Ga0208793_11424301 | 3300025108 | Marine | MAASVTNVYDVLDTIKNEVDCLCAKEDVSPLLVWLVLRQQADYQLLLL
Ga0209756_10784361 | 3300025141 | Marine | MAASVSNVYDVLECVKQEVEELCRHEEVSPLLVWLLLRQQADYQLLLLQD
Ga0209337_10251615 | 3300025168 | Marine | MAASVSNIYAVLEGITQQVEELSTQDEVSPLLVWLLLRQQADYQLLLLNNPTV
Ga0209337_10494472 | 3300025168 | Marine | MAAPVSNIYAVLEGITDQVEELAKQDKVSPLLIWLLLRQQAEYQLLLLHNPID
Ga0209337_10591055 | 3300025168 | Marine | MAAPVSNIYAVLEGITDQVEELAEQDKVSPLLIWLLLRQQAEYQLLLLRNN
Ga0209337_10602794 | 3300025168 | Marine | MAAHVSNVYDVLECVKQEVEELCSQEAVSPLLVWLLLRQQADYQLLLLKDSNN
Ga0209337_10958234 | 3300025168 | Marine | MTATVTSVYDVLDTIKNEVDILCAKENVSPLLVWLVLRQQADYQLLLLSNPID
Ga0209337_11862693 | 3300025168 | Marine | MAASVSNIYAVLEGITDQVEELAEQDKVSPLLIWLLLRQQADYQLLLLQNSQT
Ga0209337_13150763 | 3300025168 | Marine | MAASVSNVYDVLDTVKNEVDVLCAQENVSPLLIWLLLRQQADYQLLLLQDPID
Ga0209251_11487491 | 3300025668 | Marine | MAASVSNVYDVLDCVKNEVDVLCAKEDVSPLLVWLLLRQQADYQLLLLNNPTV
Ga0209653_11024043 | 3300025695 | Marine | MAASVSNVYDVLDCVKNEVDVLCAKEDVSPLLVWLLLRQQADYQLLLLNN
Ga0257106_11809531 | 3300028194 | Marine | MAAPVSNIYAVLEGITDQVEELAEQDKVSPLLIWLLLRQQAEYQLLLLRND
Ga0257110_10025895 | 3300028197 | Marine | MAVSVGNVYDVLECVKQEVEELCKQEAVSQQLVWLLLRQQADYQLLLLKDSNN
Ga0307488_100932502 | 3300031519 | Sackhole Brine | MAASVSNVYDVLERVKNEVHSLCIDENVSPLLVWLVLRQQADYQLLLLTNSTNLTN
Ga0307488_106267891 | 3300031519 | Sackhole Brine | MAAPVSNIYAVLEGITDQVEELAEQDKVSPLLIWLLLRQQAEYQLLLLQNPID
Ga0302114_100877765 | 3300031621 | Marine | MAASVTKVYDVLDTVKNEVDILCAQENVSPLLVWLLLRQQADYQLLLLQDPIT
Ga0315315_104428444 | 3300032073 | Seawater | MAASVSNVYDVLECVKQEVEELCLQESVSPQLVWLLLRQQADYQLLLLQDSNE
Ga0315315_104679332 | 3300032073 | Seawater | MAASVSNVYDVLDTVKNEVDVLCAQENVSPLLVWLLLRQQADYQLLLLQDPTV
Ga0315315_108768282 | 3300032073 | Seawater | MAASVSNVYDVLECVKQEVEELCRKENVSPLLVWLLLRQQADYQLLLLNTSTDSCQDGNYLIGKYC
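
Some family sequences end with `*` and others stop well short of the representative sequence. The following is a minimal sketch for measuring residue lengths, assuming that a trailing `*` marks a stop codon and should not be counted (an assumption, not something stated on this page); the helper name is hypothetical, and the three sequences are copied from this record.

```python
def residue_length(sequence: str) -> int:
    """Length of a protein sequence, ignoring any trailing '*' characters."""
    return len(sequence.rstrip("*"))


# Representative sequence from the Basic Information block.
REPRESENTATIVE = "MAASVSNVYDVLDTVKNEVDVLCAQENVSPLLVWLLLRQQADYQLLLLNNPID"

# Two family members from the sequence table: one ending in '*', one truncated.
examples = [
    REPRESENTATIVE,
    "MAASVSNIYAVLEGITQQVEELSTQDEVSPLLVWLLLRQQADYQLLLLNNPTV*",
    "MAASVSNVYDVLDCVKNEVDVLCAKEDVSPLLVWLLLRQQAD",
]

if __name__ == "__main__":
    lengths = [residue_length(seq) for seq in examples]
    print(lengths)
    print(sum(lengths) / len(lengths))  # mean length of this small subset only
```

The mean printed here covers only the three example sequences; reproducing the family-wide average of 52 residues would require all 121 sequences.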




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice