NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F094518


Overview

Basic Information
Family ID: F094518
Family Type: Metagenome
Number of Sequences: 106
Average Sequence Length: 40 residues
Representative Sequence: MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSLWFY
Number of Associated Samples: 8
Number of Associated Scaffolds: 106

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 0.00 %
% of genes near scaffold ends (potentially truncated): 0.00 %
% of genes from short scaffolds (< 2000 bps): 0.94 %
Associated GOLD sequencing projects: 6
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.32


Most Common Taxonomy
Group: Unclassified (100.000 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Host-Associated → Arthropoda → Digestive System → Gut → Unclassified → Termite Gut (100.000 % of family members)
Environment Ontology (ENVO): Unclassified (100.000 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal proximal gut (100.000 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 36.36%, β-sheet: 0.00%, Coil/Unstructured: 63.64%

Predicted 3D Structure

pTM-score: 0.32

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name             % Frequency in 106 Family Scaffolds
PF00245  Alk_phosphatase  0.94
PF00255  GSHPx            0.94
PF10545  MADF_DNA_bdg     0.94

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name                                                              Functional Category                         % Frequency in 106 Family Scaffolds
COG0386  Thioredoxin/glutathione peroxidase BtuE, reduces lipid peroxides  Defense mechanisms [V]                      0.94
COG1785  Alkaline phosphatase                                              Inorganic ion transport and metabolism [P]  0.94
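
Each 0.94 value in the neighborhood tables above is consistent with a single scaffold out of the 106 family scaffolds. The page does not state how the figure is derived, so the following is only a minimal sketch, assuming it is a plain count-over-total percentage rounded to two decimals; `percent_frequency` is a hypothetical helper name, not part of NMPFamsDB.

```python
def percent_frequency(hits: int, total: int, ndigits: int = 2) -> float:
    """Share of `total` scaffolds that carry a feature, as a rounded percentage.

    Assumption: the page reports hits / total * 100 rounded to two decimals.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100.0 * hits / total, ndigits)


if __name__ == "__main__":
    # One neighboring hit among the 106 family scaffolds reported on this page.
    print(percent_frequency(1, 106))  # 0.94
```

Under the same assumption, two scaffolds sharing a neighbor would show as 1.89.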



Phylogeny

NCBI Taxonomy

Name          Rank  Taxonomy  Distribution
Unclassified  root  N/A       100.00 %


Associated Scaffolds


Scaffold                            Taxonomy       Length
3300001542|JGI20167J15610_10041664  Not Available  612




Environmental Properties

Associated Habitat Types

Habitat      Taxonomy                                                                            Distribution
Termite Gut  Host-Associated → Arthropoda → Digestive System → Gut → Unclassified → Termite Gut  100.00%




Associated Samples

Taxon OID   Sample Name                                                                                                          Habitat Type
3300001542  Nasutitermes corniger crop gut microbial community from laboratory colony in Florida, USA - Nc150C                   Host-Associated
3300002238  Nasutitermes corniger P1 segment gut microbial community from laboratory colony in Florida, USA - Nc150 P1           Host-Associated
3300002308  Nasutitermes corniger P4 segment gut microbial community from laboratory colony in Florida, USA - Nc150 P4           Host-Associated
3300027539  Nasutitermes corniger midgut segment microbial community from laboratory colony in Florida, USA - Nc150M (SPAdes)    Host-Associated
3300027670  Nasutitermes corniger crop gut microbial community from laboratory colony in Florida, USA - Nc150C (SPAdes)          Host-Associated
3300027966  Nasutitermes corniger P5 segment gut microbial community from laboratory colony in Florida, USA - Nc150 P5 (SPAdes)  Host-Associated
3300028325  Nasutitermes corniger P1 segment gut microbial community from laboratory colony in Florida, USA - Nc150 P1 (SPAdes)  Host-Associated
3300028327  Nasutitermes corniger P3 segment gut microbial community from laboratory colony in Florida, USA - Nc150 P3 (SPAdes)  Host-Associated





Family Sequences

Protein ID                Sample Taxon ID  Habitat      Sequence
JGI20167J15610_100416642  3300001542       Termite Gut  MQQFYNFITRRSVSLNMFRAHPRPSSGAYNWINNLWFYLFSQEIL*
JGI20167J15610_100463872  3300001542       Termite Gut  MQQFHKFIT*RFVSLDMFRAPPRPSSGAYNCLNSLW
JGI20167J15610_100843532  3300001542       Termite Gut  MQQFYKFITGRFVSLNMFREPPRPSSGAYNCINSLWFYLGAW
JGI20167J15610_100936891  3300001542       Termite Gut  MHLFYKFITRRFVSLNMFRAPPRPSSGACNCINSLWFYRWSVV
JGI20169J29049_106115441  3300002238       Termite Gut  MQHIHKFIT*CFVSLNMFRAPPRPSSGAYNCINSLW
JGI20169J29049_106864841  3300002238       Termite Gut  MQQFHKFIA*RFVSLNMFRDPPRPTSGAYNCINSLWFYLGA
JGI20169J29049_107098461  3300002238       Termite Gut  MQQFHKFLLDVYVSFNMFRVPPHPSSGAYNCINSLWFFLP
JGI20169J29049_107312721  3300002238       Termite Gut  MQQFYKFIT*RFVSLNMFRAPTRPSSGAYNCINSLWF
JGI20169J29049_108340611  3300002238       Termite Gut  MQQFYRFIT*RFVSLNMFREPPRPSSGAYNCINSLWFY
JGI20169J29049_108493341  3300002238       Termite Gut  MQQFYKFIT*RFVSLNMFRAPPRPSSGVYNCINSLWFYRWS
JGI20169J29049_108620791  3300002238       Termite Gut  MQHFYKFIT*RFYVWLNMFRTLPRPSSGAYNCISNLWFYRWS
JGI20169J29049_109089542  3300002238       Termite Gut  MQQFYKFIT*RFVSLNMFRAPPHPSSGAYNCINSL
JGI20169J29049_109277921  3300002238       Termite Gut  MQHFYKFIT*RFVSLNMFRAPSRPSSGAYNCINSL
JGI20169J29049_109285582  3300002238       Termite Gut  MQQFYNVIT*RFVSLNMFRAPPRPSSGAYNCINSLWFY
JGI20169J29049_109332704  3300002238       Termite Gut  MQQFYKFIT*RFVSLNMFRAPPRPSSGAYNCINSLWFYLG
JGI20169J29049_109377881  3300002238       Termite Gut  MQQFHKFLLDVYVSLNMFRAPPCPSSGAYNCINSLWFYL
JGI20169J29049_109745291  3300002238       Termite Gut  MQQFYKFITRRFVSLNMFRAPPRPSSEAYNCINSLWFYLLGAWW
JGI20169J29049_110505151  3300002238       Termite Gut  MQQFYKFIT*RFVSLNMSRAPPRSSSGAYNCINSL*FYRWSVVVAALLTALL
JGI20169J29049_110956712  3300002238       Termite Gut  MQQFYKFIT*YFVSLNMFRASSRPSSAAYNFINNLWFYLG
JGI20169J29049_111517621  3300002238       Termite Gut  MQQFYKFIIGRFVSLNMFRAPPRPSSGAYNCINSLWFYLGA
JGI20169J29049_111843531  3300002238       Termite Gut  YKLTSHMQQFYKFITRRFVSLNMFRAPRRPSSGAYNCINSLWFYLAALLVVVWSDHD*
JGI20169J29049_113243823  3300002238       Termite Gut  MQQFYKFITRRFVSLNMFRAPPRPSSGAYNCINSLWFYLG
JGI20171J29575_115599572  3300002308       Termite Gut  MQQFYKFIT*LFVSFNMFRAPPRPSSGAYNCINSLWFTVG
JGI20171J29575_116052592  3300002308       Termite Gut  MQQFYKFIT*DFVSLNMFRAPPCLSSGAYNYINSLWFYRWSV
JGI20171J29575_117743671  3300002308       Termite Gut  MQQFHKFIT*RFVSLNMFGAPPRPSSGAYNCINSLWFYRWSVVVVALLVVV*PD
JGI20171J29575_118459662  3300002308       Termite Gut  MQQFYKFIT*RFVSLNMFRAPPRPSSGAYNCIDSLWVYLG
JGI20171J29575_119845691  3300002308       Termite Gut  MQQFYKFIT*RFVLLNMFRAPPRPSSGAYNCINSLWFYLGAWW
JGI20171J29575_120141191  3300002308       Termite Gut  MQEFYKFIT*HFVSPNMFRAPPRPSSGAYNCINSLWFYRWS
JGI20171J29575_120694231  3300002308       Termite Gut  MKQFHKFPFDVYVSLNMFRAPPRPSSGAYNCINILWFYRWSVEAAVL
JGI20171J29575_121917951  3300002308       Termite Gut  MQQFYKFIT*CFVSLNMFRAPPRPSSGAYNCINSLWFYLGAW
JGI20171J29575_122241564  3300002308       Termite Gut  MHLFYKFITRHFVPLNMFRAPPRPSSGACNCINSLWFYRWSVVVVALLV
JGI20171J29575_124558431  3300002308       Termite Gut  MQQFYDLLLDVYVSLNMFQASPRPSSGAYNCINSLWFYLGTWW
JGI20171J29575_125251641  3300002308       Termite Gut  MQQFYKLIT*RFVSLNIFRAPPRPSAGVYNCINRVWFYRWSVVVAVL
JGI20171J29575_125783594  3300002308       Termite Gut  MQQFYNYIT*RFVSLNMFRAPPRPSSGAYNYIDSLWFYLGAWW*
Ga0209424_10054491        3300027539       Termite Gut  MQQFYKYITXRFVSLDMFRAAPRPSSGAYNCINSL
Ga0209424_10094012        3300027539       Termite Gut  MKQFYKFITXRFVSVDMFRAPPRPSSGAYNCINSLWFYLG
Ga0209424_10282412        3300027539       Termite Gut  MQQFYKFIIXRFVSLNMFRAPPRPSSGAYNCIKSLWFY
Ga0209424_11022961        3300027539       Termite Gut  MQKFYKFISXRFVSLNMFRAPPRPSSGAYNYINSL
Ga0209424_11268372        3300027539       Termite Gut  MQLFYKFITXRFVSLNMFWAPPRPSSGAYNCINGLWFYRWSMVV
Ga0209424_11331251        3300027539       Termite Gut  MEQFYKFINXRFVSLNMFRAALRPSSGAYNCTNSLWFYLGAW
Ga0209424_11547401        3300027539       Termite Gut  MQHFYKFITXRFVPLNMFRALPRPLSGAYNCINNLWFYLGAWW
Ga0209424_11717851        3300027539       Termite Gut  MQQFHKFIAXRFMSLNMFRAPPRPSSGAYNCINSLWFYRWNVV
Ga0209424_11833531        3300027539       Termite Gut  MQQFHKFITXRFVSLNMFRAPPRPSTGVYICINSRW
Ga0209424_11838181        3300027539       Termite Gut  MQQFYMFITXHFVSLNMFRVPPRLSSGAYNCISSL
Ga0209424_12214271        3300027539       Termite Gut  MQQFYKFITGRFVSLNMFREPPRPSSGAYNCINSLWF
Ga0209424_12417311        3300027539       Termite Gut  MQQFYKFITXRFVSLNMFRVPPRPSSGAYNCINSLWF
Ga0209424_12502941        3300027539       Termite Gut  MQQFYKFITXCFVLLNMFQLPPRPSSGAYNCINNLWFYIGVWW
Ga0209423_100345601       3300027670       Termite Gut  MQQFYKFITXRFVSLNMFRAAPRPSSGAYNCINSIXFYL
Ga0209423_100914481       3300027670       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSLR
Ga0209423_100988031       3300027670       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSL
Ga0209423_101438761       3300027670       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRQSSGAYNCINSLWFYLGAWW
Ga0209423_101973621       3300027670       Termite Gut  MQQFYKFITXYFVSLNMFRAPPRPSSGAYNCINSLWFYRW
Ga0209423_102058571       3300027670       Termite Gut  MQQFHKFITXSFVSLNMFRAPPRPSSGAYNCINSLWF
Ga0209423_102262711       3300027670       Termite Gut  MHQFYKFITXRFVSLNMSRAPPRPSSGAYNCISSLWFYL
Ga0209423_102284041       3300027670       Termite Gut  MQQFYEFITXRFVSLNMFRAPPRPSSGAYNCINSLWFY
Ga0209423_102328631       3300027670       Termite Gut  MQQFYKFITXGFVSLNMFRAPPRPSSGAYNCINSLWFYLEA
Ga0209423_103220341       3300027670       Termite Gut  MQKFYKFISXRFVSLNMFRAPPRPSSGAYNYINSLWF
Ga0209423_103465101       3300027670       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSLWF
Ga0209423_103886341       3300027670       Termite Gut  MQQFYKFITXRFVSLYTFRAPTRPSSGDYNCINSLWF
Ga0209423_103988181       3300027670       Termite Gut  MQQIYKFITXSFVSLNMFRAPPRPSSGAYNCINSLWF
Ga0209423_104506061       3300027670       Termite Gut  MQLFYNFITXRFVSLNMFRAPPRPSSGAYNCINSL
Ga0209423_104712111       3300027670       Termite Gut  MQQFYKFIITSRFVSLNMFRAPSRPSSGAYNGINSLWFYLGA
Ga0209423_105068691       3300027670       Termite Gut  MQQFYKFITXRFVSLNMFQAPPRPSSGAYSCINSLWFYLG
Ga0209423_106020061       3300027670       Termite Gut  MHLFYKFITRRFVSLNMFRAPPRPSSGACNCINSLWFYRWSVVV
Ga0209738_101264521       3300027966       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRSSSGAYNCINSLW
Ga0209738_101332511       3300027966       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRQSSGAYNCINSLWFYL
Ga0209738_102010961       3300027966       Termite Gut  MQQFHKFITXRFVSLNMFRAPLRPSSGAYNCINGLWFYL
Ga0209738_102338301       3300027966       Termite Gut  MQQFYKFITXRFVSLNMFLAPPRPSSGAYSCINSL
Ga0209738_103060061       3300027966       Termite Gut  MQQFYKFITXRFVSLNMFRAPSRPSSGAYNCINSLWFY
Ga0209738_103333851       3300027966       Termite Gut  MQQFYKFIITSRFVSLNMFRAPSRPSSGAYNGINSLWFYLGAW
Ga0209738_104056901       3300027966       Termite Gut  MQQIYKFITXSFVSLNMFRAPPRPSSGAYNCINSLWFYRW
Ga0209738_104825001       3300027966       Termite Gut  MQQIRKFIAXRFVSLNMFREPPRSSSGVYNCINSLWFYIG
Ga0209738_105129001       3300027966       Termite Gut  MQLFYKFITXRFVALNMFRAPPRLSSGAYNCINSLVVLP
Ga0209738_105223341       3300027966       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSLWFYLG
Ga0209738_105280041       3300027966       Termite Gut  MQQSYKFITXRFVSLNMFRAPLLPSSGAYNCINSLSFY
Ga0209738_105844261       3300027966       Termite Gut  MQLFYKFITXRFVSLDMFRAPPRPSSGAYNCINSL
Ga0209738_106162841       3300027966       Termite Gut  MQKFYKFISXRFVSLNMFRAPPRPSSGAYNYINSLWFYLGA
Ga0268261_100492534       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINNLWFYLEAWR
Ga0268261_100765135       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSLWFYLGA
Ga0268261_100862601       3300028325       Termite Gut  MQQSYKFITXRFVSLNMFRAPPRPSSGAYNCINSL
Ga0268261_101411111       3300028325       Termite Gut  MQQFHKFITXRFVSLNMFRAPPRPSSGAYNCINSLWFY
Ga0268261_101886481       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPCPSSGAYNCINNLWFYRWSVVV
Ga0268261_101954681       3300028325       Termite Gut  MQQFYKFITXHFVSLNMFRAPPRPSSGAYNCINSLWFYL
Ga0268261_102024741       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRQSSGAYNCINSLWF
Ga0268261_102284101       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPHPSSGDYNCINSLWFYRWSV
Ga0268261_102673012       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSLWFYRWSVVVA
Ga0268261_103281141       3300028325       Termite Gut  MQHFYKFITXRFVSLNMFRAPTRPSSGAYNCINSLWFYRWCVVVAALLVVVW
Ga0268261_103452661       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSLWFYCWSV
Ga0268261_103468241       3300028325       Termite Gut  MQQIHKFITXRFVSLNMFRAPPRPSSGAYNCINSLWFYLGAW
Ga0268261_103707431       3300028325       Termite Gut  MHQFYKFITXRFVSLNMFRAPPRPSSGAYSCINSLWFYRWSVVVAALL
Ga0268261_103728112       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFQAPPHPLSGAYNYINSLWFYRWSVVVS
Ga0268261_104080182       3300028325       Termite Gut  MHLFYKFITRHFVPLNMFRAPPRPSSGACNCINSLWFYRWSVVVVALLVVVWP
Ga0268261_104180041       3300028325       Termite Gut  MQQFYEFITXRFVSLNMFRAPPRPSSGAYNCINSLWFYRW
Ga0268261_104319161       3300028325       Termite Gut  MQQFYKFTTXRFISFNMFRAPPRPSSGAYSCINSLWFYRWSVVVAALL
Ga0268261_104827531       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCIDSLLFYLGAWW
Ga0268261_104888091       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYSCINSLW
Ga0268261_104980081       3300028325       Termite Gut  MQLFYKFITXRFVSLNMFRAPPRPSSGDYNCINSLWFYLGA
Ga0268261_105009461       3300028325       Termite Gut  MQQFYKFITXYFVSLNMFRASSRPSSAAYNFINNLWFYLGV
Ga0268261_105709781       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSLWFY
Ga0268261_105710641       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGTYNCINSLW
Ga0268261_106156861       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSLW
Ga0268261_106171961       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCFNSLWLY
Ga0268261_107559181       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSLWFYRWSVVVAG
Ga0268261_107625521       3300028325       Termite Gut  MQQFYKFITXDFVSLNMFRAPPCLSSGAYNYINSLWFYRWSVVVAALLVVV
Ga0268261_107803811       3300028325       Termite Gut  MQQFYKFITXRFVSLNMFRVPPRPSSXAYNCINTL
Ga0268262_104331301       3300028327       Termite Gut  MQQFHKFITRRFVSLNMFRPPPRPSSGAYNCINSLWFYRWNVV
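
The sequences above vary in length, and some contain `*` or `X` symbols. The page does not say how the "Average Sequence Length" statistic treats those symbols, so the following is only a sketch that compares both conventions on three sequences copied from the table; `residue_count` and `mean_length` are hypothetical helper names, not NMPFamsDB code.

```python
from statistics import mean

# Three sequences copied from the family table above.
SEQUENCES = [
    "MQQFHKFIT*RFVSLDMFRAPPRPSSGAYNCLNSLW",
    "MQQFYKFITXRFVSLNMFRAPPRPSSGAYNCINSL",
    "MQQFHKFITRRFVSLNMFRPPPRPSSGAYNCINSLWFYRWNVV",
]


def residue_count(seq: str, skip: str = "") -> int:
    """Count characters in `seq`, ignoring any symbols listed in `skip`."""
    return sum(1 for ch in seq if ch not in skip)


def mean_length(seqs: list[str], skip: str = "") -> float:
    """Mean residue count over `seqs` (illustrative helper only)."""
    return mean(residue_count(s, skip) for s in seqs)


if __name__ == "__main__":
    # Convention 1: every symbol counts toward the length.
    print("all symbols counted:", mean_length(SEQUENCES))
    # Convention 2 (assumption): '*' and 'X' are not counted as residues.
    print("'*' and 'X' skipped:", mean_length(SEQUENCES, skip="*X"))
```

Running the same helpers over all 106 rows would show which convention, if either, matches the 40 residues reported in the overview.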




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice