NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F092097


Overview

Basic Information
Family ID F092097
Family Type Metagenome / Metatranscriptome
Number of Sequences 107
Average Sequence Length 44 residues
Representative Sequence MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQ
Number of Associated Samples 59
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Viruses
% of genes with valid RBS motifs 78.50 %
% of genes near scaffold ends (potentially truncated) 39.25 %
% of genes from short scaffolds (< 2000 bps) 87.85 %
Associated GOLD sequencing projects 39
AlphaFold2 3D model prediction No


Most Common Taxonomy
Group Predicted Viral (35.514 % of family members)
NCBI Taxonomy ID 10239 (predicted)
Taxonomy All Organisms → Viruses → Predicted Viral

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous (79.439 % of family members)
Environment Ontology (ENVO) Unclassified (81.308 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline) (85.981 % of family members)
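
The percentages above can be related back to whole sequence counts. The sketch below is only an illustration and rests on one assumption: that each percentage is a fraction of the 107 sequences listed under "Number of Sequences". The function names and dictionary labels are hypothetical and are not part of NMPFamsDB.

```python
# Assumption: every percentage is taken over the 107 family sequences.
FAMILY_SIZE = 107

# Percentages copied from the taxonomy and ecosystem fields of this record.
reported_percentages = {
    "Taxonomy: Predicted Viral": 35.514,
    "GOLD: Aqueous": 79.439,
    "ENVO: Unclassified": 81.308,
    "EMPO: Water (saline)": 85.981,
}


def percent_to_count(percent: float, total: int) -> int:
    """Return the whole number of members closest to `percent` of `total`."""
    return round(percent * total / 100)


def count_to_percent(count: int, total: int, digits: int = 3) -> float:
    """Return `count` as a percentage of `total`, rounded to `digits` places."""
    return round(100 * count / total, digits)


counts = {
    label: percent_to_count(percent, FAMILY_SIZE)
    for label, percent in reported_percentages.items()
}

for label, count in counts.items():
    # Round-trip check: the count should reproduce the reported percentage.
    matches = count_to_percent(count, FAMILY_SIZE) == reported_percentages[label]
    print(f"{label}: {count} of {FAMILY_SIZE} (round-trip matches: {matches})")
```

Under that assumption the four values correspond to 38, 85, 87 and 92 sequences respectively, and each count reproduces the reported percentage to three decimal places.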




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.

Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 88.64%, β-sheet: 0.00%, Coil/Unstructured: 11.36%
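
As a rough cross-check, the secondary structure distribution can be converted into residue counts. This sketch assumes the distribution was computed over the 44-residue representative sequence given in the Overview; the page does not state this explicitly, and the variable names are illustrative only.

```python
# Representative sequence copied from the Overview section of this record.
REPRESENTATIVE = "MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQ"

# Reported secondary structure distribution, in percent.
distribution = {
    "alpha-helix": 88.64,
    "beta-sheet": 0.00,
    "coil/unstructured": 11.36,
}

# Assumption: the percentages refer to the representative sequence.
length = len(REPRESENTATIVE)

# Convert each percentage into the nearest whole number of residues.
residues = {
    state: round(percent * length / 100)
    for state, percent in distribution.items()
}

for state, count in residues.items():
    print(f"{state}: {count} of {length} residues")
```

With these inputs the distribution corresponds to 39 helical residues, 0 strand residues and 5 coil residues, which sum to the full length of 44.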

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

NCBI taxonomy chart (top level): legend entries All Organisms and Unclassified; slice labels 82.2% and 17.8%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types chart: legend entries Sediment, Freshwater (listed three times), Marine Sediment, Worm Burrow, Aqueous, Seawater, Freshwater To Marine Saline Gradient, Salt Marsh, Estuarine Water, Marine, Pond Water, Marine Methane Seep Sediment; slice labels 3.7%, 79.4%, 3.7%, 2.8%.



Associated Samples


Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
DelMOSpr2010_100698761 | 3300000116 | Marine | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA*
B570J40625_1001510887 | 3300002835 | Freshwater | MKYNHQQLKDLEDQFLKDFHRGLLTRTQLLNIIHRLDRISHHL*
Ga0075474_100540921 | 3300006025 | Aqueous | MMYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEVSTTK*
Ga0075474_101407411 | 3300006025 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIASTK*
Ga0075478_100197656 | 3300006026 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIASTR*
Ga0075478_102026252 | 3300006026 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEIASTK*
Ga0075478_102601364 | 3300006026 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHE
Ga0075514_17647261 | 3300006403 | Aqueous | MYTTQELKDLESTAIRDYQRGLLTKTQLLNIIHRLD
Ga0075461_101993841 | 3300006637 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIAST
Ga0070749_101899073 | 3300006802 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEL*
Ga0070749_105269552 | 3300006802 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDR
Ga0070754_101930465 | 3300006810 | Aqueous | MYTTQELKELEATAIRDHQRGLLTKTQLLNIIHRLDRISHHEIASTK*
Ga0070754_102438481 | 3300006810 | Aqueous | LEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA*
Ga0070754_103199603 | 3300006810 | Aqueous | MTYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRI
Ga0070754_103260602 | 3300006810 | Aqueous | CLMYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGT*
Ga0070754_104283763 | 3300006810 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRL
Ga0070754_104314791 | 3300006810 | Aqueous | YTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEL*
Ga0075476_102262673 | 3300006867 | Aqueous | MYTTQELKELEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGT*
Ga0075477_103781572 | 3300006869 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGT*
Ga0075479_103182341 | 3300006870 | Aqueous | MYTTQELKELEATAIRDHQRGLLTRTQLLNIIHRLDRISHHEIASTK*
Ga0075475_102582062 | 3300006874 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEVSTPK*
Ga0070750_102230671 | 3300006916 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLD
Ga0070750_103129221 | 3300006916 | Aqueous | MYTTQELKDLEATAIRDHQRGLLTKTQLLNIIHRLDRISHHELQDS*
Ga0070750_103846562 | 3300006916 | Aqueous | MYTTQKLKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA*
Ga0070750_104341123 | 3300006916 | Aqueous | QELKDLEATAIRDHQRGLLTKTQLLNIIHRLDRISHHEL*
Ga0070746_102063825 | 3300006919 | Aqueous | MMYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEVSTPR*
Ga0070746_102932731 | 3300006919 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEVST
Ga0075463_100771014 | 3300007236 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRIS
Ga0070745_11144653 | 3300007344 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHERTSNH*
Ga0070745_11536235 | 3300007344 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHE
Ga0070745_12465073 | 3300007344 | Aqueous | MYTTQKLKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHEL
Ga0070753_12722481 | 3300007346 | Aqueous | MYTAQEVKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQG
Ga0099849_11112544 | 3300007539 | Aqueous | MYTTQELKNLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA*
Ga0099849_11746823 | 3300007539 | Aqueous | MMYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDR
Ga0099849_12422002 | 3300007539 | Aqueous | MMYTAQQLKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA*
Ga0070751_11272851 | 3300007640 | Aqueous | MYTAQEVKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0075480_103695331 | 3300008012 | Aqueous | KELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIASTK*
Ga0102963_13115562 | 3300009001 | Pond Water | MNEQTLKDLEDQALRDHKRGLLTRTQLLNIIHRLD
Ga0129345_12998163 | 3300010297 | Freshwater To Marine Saline Gradient | MYTTQELKDLEVTAIRDYQRGLLTKIQLLNIIHRLDRISHHELQGA*
Ga0136549_100909441 | 3300010389 | Marine Methane Seep Sediment | MSYTHKQLRELEAQAIEDHRRGLLTRAQLLRIIHRLDRISWITV*
Ga0181590_103101583 | 3300017967 | Salt Marsh | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHERTSNH
Ga0181585_109300012 | 3300017969 | Salt Marsh | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIASTK
Ga0181591_102165876 | 3300018424 | Salt Marsh | MMYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIA
Ga0193974_10204632 | 3300019726 | Sediment | MMYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGS
Ga0194023_10571612 | 3300019756 | Freshwater | MTYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEISTPK
Ga0194024_10464482 | 3300019765 | Freshwater | MYTTQELKELEATAIRDHQRGLLTKTQLLNIIHRLDRISHHEIASTK
Ga0194024_10511861 | 3300019765 | Freshwater | MYTAQELKNLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0194022_10062807 | 3300019937 | Freshwater | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEVSTTK
Ga0208050_10010918 | 3300020498 | Freshwater | MKYNHQQLKDLEDQFLKDFHRGLLTRTQLLNIIHRLDRISHHL
Ga0213859_104484002 | 3300021364 | Seawater | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQDS
Ga0222718_100962784 | 3300021958 | Estuarine Water | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEVSTPK
Ga0222718_101910952 | 3300021958 | Estuarine Water | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEISTPK
Ga0222718_103827721 | 3300021958 | Estuarine Water | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEITTTK
Ga0196883_10004675 | 3300022050 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIASTR
Ga0196883_10015329 | 3300022050 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHH
Ga0196883_10046451 | 3300022050 | Aqueous | MYTTQELKELEATAIRDYQRGLLTKTQLLNIIHRLDRISHHEIATTK
Ga0196883_10059545 | 3300022050 | Aqueous | MMYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEVSTPK
Ga0196883_10117511 | 3300022050 | Aqueous | LMYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0196883_10163262 | 3300022050 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGS
Ga0196883_10183891 | 3300022050 | Aqueous | MMYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEVSTTK
Ga0212025_10202034 | 3300022057 | Aqueous | MMYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIASTK
Ga0212024_10769251 | 3300022065 | Aqueous | MYTTQELKDLEVTAIRDYQRGLLTKTQLLNIIHRLDR
Ga0196895_10077231 | 3300022067 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIASTKW
Ga0212021_11363873 | 3300022068 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEVSTP
Ga0196907_1086612 | 3300022149 | Aqueous | MMYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0196897_10318942 | 3300022158 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEVSTTK
Ga0212027_10220111 | 3300022168 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRI
Ga0196891_10906633 | 3300022183 | Aqueous | YTTQELKDLEATAIRDYQRGLLTRTQLLNIIHRLDRISHHELQGA
Ga0196899_10547614 | 3300022187 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHE
Ga0196899_10795502 | 3300022187 | Aqueous | MYTSQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0196899_10818574 | 3300022187 | Aqueous | GCLMYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGT
Ga0196899_11366842 | 3300022187 | Aqueous | MMYTAQEVKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0196899_11559533 | 3300022187 | Aqueous | MMYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRI
Ga0196899_11752761 | 3300022187 | Aqueous | MYTTQELKELEATAIRDHQRGLLTRTQLLNIIHRLDRISHHEIASTK
Ga0196899_11951493 | 3300022187 | Aqueous | GGCMMYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEVSTPK
Ga0196905_11579452 | 3300022198 | Aqueous | MNEQTLKDLEDQALRDHKRGLLTRTQLLNIIHRLDRLSHH
Ga0255768_104871191 | 3300023180 | Salt Marsh | MNEQTLKDLEDQALRDHKRGLLTRTQLLNIIHRLDRLS
Ga0208149_10161659 | 3300025610 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0208428_10138891 | 3300025653 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQG
Ga0208428_11504171 | 3300025653 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEVSTPK
Ga0208898_10139776 | 3300025671 | Aqueous | MYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIASTRXATRFARP
Ga0208898_10146907 | 3300025671 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEIASTK
Ga0208898_10649095 | 3300025671 | Aqueous | MYTTQELKDLEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEIASTR
Ga0208898_10740874 | 3300025671 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRI
Ga0208898_10775171 | 3300025671 | Aqueous | MYTTQELKDLESTAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0208162_101351210 | 3300025674 | Aqueous | MNEQTLKELEDQALRDHKRGLLTRTQLLNIIHRLDRISHHE
Ga0208162_10639912 | 3300025674 | Aqueous | MYTTQELKNLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0208899_10356067 | 3300025759 | Aqueous | MYTTQELKDLEATAIRDHQRGLLTKTQLLNIIHRLDRISHHELQDS
Ga0208899_10905602 | 3300025759 | Aqueous | MYTAQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0208899_11420145 | 3300025759 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISH
Ga0208767_11152603 | 3300025769 | Aqueous | MYTTQELKDLEATAIRDYRRGLLTKTQLLNIIHRLDRISHHEL
Ga0208767_11446611 | 3300025769 | Aqueous | MMYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEVSTPR
Ga0208767_11626491 | 3300025769 | Aqueous | MYTTQELKDLEVTAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGS
Ga0208767_11755794 | 3300025769 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQ
Ga0208917_10171007 | 3300025840 | Aqueous | MMYTTQELKDLEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEVSTTK
Ga0208645_10552816 | 3300025853 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGT
Ga0208645_11322171 | 3300025853 | Aqueous | MYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISH
Ga0208645_11621673 | 3300025853 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEL
Ga0208645_12092493 | 3300025853 | Aqueous | MTYTTQELKELEATAIRDRQRGLLTRTQLLNIIHRLDRISHHEVSTPK
Ga0208644_10909455 | 3300025889 | Aqueous | MYTTQELKDLEVTAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA
Ga0208644_11038632 | 3300025889 | Aqueous | MTYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEIASTR
Ga0209536_1018567521 | 3300027917 | Marine Sediment | MYTTQELKDLEVTAIRDYQRGLLTKTQLLNIIHRLDRISHHELQ
Ga0316201_112885611 | 3300032136 | Worm Burrow | MMYTTQELKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHH
Ga0335036_0073845_1_117 | 3300034106 | Freshwater | QQLKDLEDQFLKDFHRGLLTRTQLLNIIHRLDRISHHL
Ga0348335_109046_68_208 | 3300034374 | Aqueous | MYTTQELKDLEVTAIRDYQRGLLTKTQLLNIIHRLDRISHHELQDS
Ga0348335_145272_49_192 | 3300034374 | Aqueous | MYTTQELKELEATAIRDRQRGLLTQTQLLNIIHRLDRISHHEVSTPR
Ga0348337_122277_630_770 | 3300034418 | Aqueous | MYTTQKLKDLEATAIRDYQRGLLTKTQLLNIIHRLDRISHHELQGA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice