NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F099967


Overview

Basic Information
Family ID F099967
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 52 residues
Representative Sequence MLNFLDIPIPEIQVWETLQEDPRRLAIAVLARLIVQATLKNPGWEEEDHDR
Number of Associated Samples 77
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 40.78 %
% of genes near scaffold ends (potentially truncated) 24.27 %
% of genes from short scaffolds (< 2000 bps) 77.67 %
Associated GOLD sequencing projects 68
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.42


Most Common Taxonomy
Group Bacteria (63.107 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil (30.097 % of family members)
Environment Ontology (ENVO) Unclassified (63.107 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (75.728 % of family members)
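
The percentages reported in the Overview can be read back as whole numbers of family members. The sketch below assumes, as a working hypothesis not stated on the page for every field, that each percentage is taken over the 103 sequences of the family; the helper name `implied_count` and the dictionary labels are illustrative only.

```python
# Assumption: every percentage below uses the 103 family sequences as denominator.
TOTAL_SEQUENCES = 103

# Percentages copied from the Overview section of this entry.
REPORTED = {
    "valid RBS motifs": 40.78,
    "near scaffold ends": 24.27,
    "short scaffolds (< 2000 bps)": 77.67,
    "Bacteria": 63.107,
    "GOLD: Tropical Forest Soil": 30.097,
    "EMPO: Soil (non-saline)": 75.728,
}


def implied_count(percent: float, total: int = TOTAL_SEQUENCES) -> int:
    """Return the nearest whole number of members matching a percentage."""
    return round(percent / 100 * total)


if __name__ == "__main__":
    for label, percent in REPORTED.items():
        count = implied_count(percent)
        recomputed = 100 * count / TOTAL_SEQUENCES
        print(f"{label}: {percent} % ~ {count}/{TOTAL_SEQUENCES} "
              f"(recomputed {recomputed:.3f} %)")
```

Under that assumption the reported values correspond to whole counts, for example 65 of 103 members assigned to Bacteria.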




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.

Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 40.51%, β-sheet: 0.00%, Coil/Unstructured: 59.49%
Feature Viewer tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score

Predicted 3D Structure

pTM-score: 0.42

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization (chart labels: All Organisms, Unclassified; values: 63.1%, 36.9%)

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization (habitat chart)
Labels, in chart order: Vadose Zone Soil, Tropical Forest Soil, Soil, Forest Soil, Soil, Hardwood Forest Soil, Tropical Peatland, Tropical Forest Soil, Forest Soil, Soil, Termite Gut
Values, in chart order: 30.1%, 17.5%, 6.8%, 4.9%, 4.9%, 7.8%, 12.6%, 6.8%, 4.9%



Associated Samples


Geographical Distribution

Family Sequences

Protein ID	Sample Taxon ID	Habitat	Sequence
AF_2010_repII_A1DRAFT_101060951	3300000597	Forest Soil	LDIPIPEIQVWETLQEEARSWTIAVLARLIVQTTFKNPELEEEES*
AP72_2010_repI_A001DRAFT_10399511	3300000893	Forest Soil	VQRMLNFLDIPIPEIQVWETLQDEARGLAIAVLARLIMQATLENSGWEEGDHDR*
JGI12636J13339_10258011	3300001154	Forest Soil	VQRMLNFLDIPILEIQAWETLEEKQRMLAIEVLARLIVQATLKNRGLEEADHVR*
JGI12630J15595_100950462	3300001545	Forest Soil	VQRMLNFLDIPIPEIQVWETLQEEPRRLAVAVLARLIVQATLKNPGWEVEDHDR*
JGI12635J15846_100462652	3300001593	Forest Soil	MLNFLDIPILEIQAWETLEEKQRMLAIEVLARLIVQATLKNRGLEEADHVR*
JGIcombinedJ26739_1001456093	3300002245	Forest Soil	VQRMLNFLDIPIPEIQVWETLQEDPRRLAIAVLARLIVQATLKNPGWEEEDHDR*
JGIcombinedJ26739_1001876032	3300002245	Forest Soil	LQRMLNFLNIPISEIQVWETLEEEQRILAIAVLARLIAQASVEQQRSEEDHDR*
Ga0066396_100165222	3300004267	Tropical Forest Soil	MLNFLDIPIPEIQVWETMEEEQRILALEVLARLIVQATCRTRRPEEDHDR*
Ga0066398_100241752	3300004268	Tropical Forest Soil	MLSFLDIPIPEIQVWETLEEEQKILAIAVLARLIAQATVEPPRSEENHDR*
Ga0066395_100090394	3300004633	Tropical Forest Soil	TPILQHMLNFLDIPIPEIQVWETMEEEQRILALEVLARLIVQATCRTRRPEEDHDR*
Ga0066388_1036481332	3300005332	Tropical Forest Soil	MLNFLDIPIPEIQVWETLQEQTRSWAIAVLARLIVQATFKTPELEEEKS*
Ga0066388_1048535561	3300005332	Tropical Forest Soil	IQVWETLQEEPRRLAVAVLARLIVQATLQNPGWEVEDHDR*
Ga0070761_110997061	3300005591	Soil	RMLNFLDIPIPEIQVWETLEEEQRMLAIAVLARLIAQATVDQQRTEEDHDR*
Ga0070762_101991552	3300005602	Soil	MLNFLDITIPEIHVWEALPEEATKLAIAVLARLIIQATLNHPGSEEENRDR*
Ga0070763_100298752	3300005610	Soil	MLSFLDIPIPEIQVWETLQENLKRLAIAVLARLIVQATLKNPGWEEEDHDR*
Ga0066905_1000483512	3300005713	Tropical Forest Soil	MLNFLDIPIPEIQVWETLQDEARGLAIAVLARLIMQATLENSGWEAGDHDR*
Ga0070766_100693633	3300005921	Soil	VQRMLSFLDIPIPEIQVWETLQENLKRLAIAVLARLIVQATLKNPGWEEEDHDR*
Ga0123357_109269922	3300009784	Termite Gut	MLNFLDIPIPETQIWETLEEEQRLLAIAVLARLIVQAMVDQPRAEVDNDR*
Ga0126374_100628891	3300009792	Tropical Forest Soil	RRTPIVQRMLSFLDIPIPEIQVWETLQEEPRRLAVAVLARLIVQATLQNPGWEVEDHDR*
Ga0123355_102440964	3300009826	Termite Gut	LQRILNFLDIPIPEIQVWETLQEEARSWAITVLARLIVQTTFKNPELEGEES*
Ga0123355_106881952	3300009826	Termite Gut	MLNFLDIPIPEIQVWETLQEEARSWAIAVLARLIAQATSKNPELEEEES*
Ga0126384_100340252	3300010046	Tropical Forest Soil	MLNFLDIPIPEIQVWETLQDEARGLAIAVLARLIMQATLENSGWEEGDHDR*
Ga0126384_101195012	3300010046	Tropical Forest Soil	MLNFLEIPIPEIQVWETLQPEARSWAIAVLARLIVQTTFKNPELEEEES*
Ga0126384_104580433	3300010046	Tropical Forest Soil	PIPEIQVWETLEEEQKILAIAVLARLIAQATVEPPRSEENHDR*
Ga0126373_105209732	3300010048	Tropical Forest Soil	MLNFLDIPIPEIQVWETLQKDPRRLAIAVLARLIIQATLNNPGWEEGDHDR*
Ga0126373_105590422	3300010048	Tropical Forest Soil	MLNFLDIPIPEIQIWETLQEEPRKLVIAVLARLIIQATLNHPGSGEENRDR*
Ga0126373_106138172	3300010048	Tropical Forest Soil	MLSFLDIPIPEIQVWETLQEEPRRLAVAVLARLIVQATLKNPGWEVEDHDR*
Ga0126373_111385951	3300010048	Tropical Forest Soil	RTRRTLILQRMLNFLDIPIPEIQVWGTLQEEARSWATAVLARLIAQTTFKNPELEEEES*
Ga0126373_113314502	3300010048	Tropical Forest Soil	MLNFLDIPIPEVQVWETLEEEARSWAIAVLARLIVQTTFKNPELEEEES*
Ga0126373_116340162	3300010048	Tropical Forest Soil	MLNFLDISIPEIQIWETLEEEQRMLAIAVLARRIAQATVNQQRAEEDHDR*
Ga0131853_109850902	3300010162	Termite Gut	MLNFLDIPIPEIQVWETLQEQARSWAITVLARLIVQTTFKNPELEGEES*
Ga0126370_118408942	3300010358	Tropical Forest Soil	MLNFLDISIPEIQIWETLEEEQRMLAIAVLARLIAQATVNQQRAEEDHDR*
Ga0126376_108247912	3300010359	Tropical Forest Soil	ALPSPAATINLSTRRTPIVQRMLSFLDIPIPEIQVWETLQDEARGLAIAVLARLIMQATLENSGWEAGDHDR*
Ga0126376_115082063	3300010359	Tropical Forest Soil	MLSFLDIPIPEIQVWETLQEEPRRLAVAVLARLIVQATLQNPGWEVEDHDR*
Ga0126376_124574651	3300010359	Tropical Forest Soil	MLNFLDIPIPEVQVWETLEEEARSWAIAVLARLIVQTTFKNPELEEEES
Ga0126372_102008682	3300010360	Tropical Forest Soil	MLNFLDIPIPEIQVWETLQEEPRRLAVAVLARLIVQATLQNPGWEVEDHDR*
Ga0126378_104201662	3300010361	Tropical Forest Soil	MLNFLEIPIPEIQVWETLQQEARSWAIAVLARLIVQTTFKNPELEEEES*
Ga0126378_115700572	3300010361	Tropical Forest Soil	MLNFLDIPIPEIQVWETLQEEARSWAIAVLARLIVQTTFKNPELEEEES*
Ga0126377_105172601	3300010362	Tropical Forest Soil	VQHMLNFLDIPFPEIHVWETLQDEARGLAIAVLARLIMQATLENSGWEAGDHDR*
Ga0126379_102504402	3300010366	Tropical Forest Soil	MLNFLEIPIPEIQVWETLQQEARGWAIAVLARLIVQTTFKNPELEEEES*
Ga0126379_106637792	3300010366	Tropical Forest Soil	MLNFLDISIPEIQVWETLAEEQRMLAIAVLARVIAQAPVDQQRSEEDNDR*
Ga0136643_107008772	3300010369	Termite Gut	MLNFLDIPIPEIQVWETLQEEARSWAITVLARLIVQTTFKNPELEGEES*
Ga0126381_1004005983	3300010376	Tropical Forest Soil	MLNFLDIAIPEIQVWETLQEEARSWAIAGLARLIVQTTFKNPKLEEEES*
Ga0126381_1007950951	3300010376	Tropical Forest Soil	PPSPGAMRNLSTRRTPSLQRMLNFLDIPIPEIQVWETLPEEPRKLVIAVLARLIIQATLNHPGSGEENRDR*
Ga0126381_1031227741	3300010376	Tropical Forest Soil	MLSFLDIPIPEIQVWETLQEEPRRLAVAVLARLIAQVTLKNPGWEVEDHDR*
Ga0126381_1033505731	3300010376	Tropical Forest Soil	MLSFLDIPIPEIQIWETLQEEPRKLVIAVLARLIIQATLNHPESGEENRDR*
Ga0126383_110070432	3300010398	Tropical Forest Soil	MLNFLDIPILEIQIWETLREEPRKLVIAVLARLIIQATRNHPGSGEENHDR*
Ga0137399_112521182	3300012203	Vadose Zone Soil	RTPIVQRMLNFLDIPIPKIQVWETLQEDPRRLAIVVLARLIVQATLKNPGWEEEDHDR*
Ga0137362_115133421	3300012205	Vadose Zone Soil	MLNFLDIPIPKIQVWETLQEDPRRLAIVVLARLIVQATLKNPGWEEEDHDR*
Ga0126375_100267211	3300012948	Tropical Forest Soil	FLDIPIPEIQVWETLQYEARGLAIAVLARLIMQATLENSGWEAGDHDR*
Ga0182039_100965721	3300016422	Soil	MRNLSTRRTPSLQRMLNFLDIPIPEIQVWETLQEEPRKLVIAVLARLILQATLNHPGSGEENRDR
Ga0187781_100824372	3300017972	Tropical Peatland	VQRMLNFLDISIPEIQIWETLEEEQKMLAIAVLARLIAQATVNQQRAEEDHDR
Ga0187782_100621362	3300017975	Tropical Peatland	MLNFLDISIPEIQIWETLEEEQKMLAIAVLARLIAQATVNQQRAEEDHDR
Ga0210407_100450862	3300020579	Soil	MLNFLDITIPEIHVWEALPEEATKLAIAVLARLIIQATLNHPGSEEENRDR
Ga0210395_102072742	3300020582	Soil	MLSFLDIPIPEIQVWETLQENLKRSAIAVLARLIVQATLKNPGWEEEDHDR
Ga0210406_108119272	3300021168	Soil	LQRMLNFLDIPIPEIQVWETLQEEARCWAIAVLARLIVQTTFKNPELEEEES
Ga0210397_105057332	3300021403	Soil	VQRMLNFLDIPIPKTPIWETLEEEQRALAIAALARLIGQATVDQQRLGEDHDR
Ga0210383_103063182	3300021407	Soil	VQRMLNFLDTPIPEMQAWETLEEKQRMLAIAVLARLMAQATFKDRGLEEKDHDR
Ga0210394_106034211	3300021420	Soil	RRTPIVQRILNFLDMPIPEIQVWETLQEEPRRLAIVVLARLIVQATLKNPGWEGEDHDR
Ga0210384_103354102	3300021432	Soil	APPSPAATRNLSKRRTPIVQRMLSFLDIPIPEIQVWETLQENLKRSAIAVLARLIVQATLKNPGWEEEDHDR
Ga0210384_105584922	3300021432	Soil	STRRTPIVQRMLSFLDTPIPEMQAWETLEEKQRILAIAVLARLMVQATFKDRGLEEKDHD
Ga0210384_108702271	3300021432	Soil	VQRILSFLDTPIPEMQAWETLEEKQRILAIAVLARLMVQATFKDRG
Ga0126371_101073322	3300021560	Tropical Forest Soil	MLNFLDIPIPEIQVWETMEEEQRILALEVLARLIVQATCRTRRPEEDHDR
Ga0126371_102648432	3300021560	Tropical Forest Soil	LQRMLNFLDIPIPEIQVWETLQEQTRSWAIAVLARLIVQATFKTPELEEEKS
Ga0126371_103280592	3300021560	Tropical Forest Soil	MLSFLDIPIPEIQVWETLQEEPRRLAVAVLARLIVQATLKNPGWEVEDHDR
Ga0126371_105160692	3300021560	Tropical Forest Soil	LQRMLNFLDIPIPEIQVWGTLQEEARSWATAVLARLIAQTTFKNPELEEEES
Ga0126371_133443582	3300021560	Tropical Forest Soil	MRRTLILQRMLNFLEIPIPEIQVWETLQQEARSWAIAVLARLIVQTTFKNPELEEEES
Ga0257168_11558321	3300026514	Soil	MLNFLDMPIPEIQVWETLQSEPRRLAIVVLARLIVQATLKNPGSEGEDRDR
Ga0208732_10108722	3300026984	Forest Soil	VQRMLSFLDIPIPEIQVWETLQENLKRSAIAVLARLIVQATLKNPGWEEEDHDR
Ga0208604_10108392	3300027090	Forest Soil	QRMLSFLDIPIPEIQVWETLQENLKRSAIAVLARLIVQATLKNPGWEEEDHDR
Ga0209527_10424072	3300027583	Forest Soil	MLNFLDIPIPEIQVWETLQEDPRRLAIAVLARLIVQATLKNPGWEEEDHDR
Ga0209528_10429822	3300027610	Forest Soil	MLNFLNIPISEIQVWETLEEEQRILAIAVLARLIAQASVEQQRSEEDHDR
Ga0209528_10616632	3300027610	Forest Soil	VQRMLNFLDIPIPEIQVWETLQEDPRRLAIAVLARLIVQATLKNPGWEEEDHDR
Ga0209330_11604022	3300027619	Forest Soil	LQRMLNFLNIPISEIQVWETLEEEQRILAVAVLARLIAQASVEQQRSEEDHDR
Ga0209799_10106982	3300027654	Tropical Forest Soil	VQRMLNFLDIPIPEIQVWETLQDEARGLAIAVLARLIMQATLENSGWEAGDHDR
Ga0209799_10916962	3300027654	Tropical Forest Soil	LQRMLNFLEIPIPEIQVWETLQQEARGWAIAVLALLIVQTTFKNPELEEEES
Ga0209118_10300982	3300027674	Forest Soil	VQRMLNFLDIPILEIQAWETLEEKQRMLAIEVLARLIVQATLKNRGLEEADHVR
Ga0209011_10114311	3300027678	Forest Soil	MLNFLDIPILEIQAWETLEEKQRMLAIEVLARLIVQATLKNRGLEEADHVR
Ga0209274_104927042	3300027853	Soil	MLNFLDIPIPEIQVWETLEEEQRMLAIAVLARLIAQATVDQQRTEEDHDR
Ga0209275_106335212	3300027884	Soil	VQRILNFLDMPIPEIQVWETLQEEPRRLAIVVLARLIVQATLKNPGWEGEDHDR
Ga0209380_100289792	3300027889	Soil	VQRMLSFLDIPIPEIQVWETLQENLKRLAIAVLARLIVQATLKNPGWEEEDHDR
Ga0170834_1081609052	3300031057	Forest Soil	VQRMLSFLDIPIPEIQVWETLQENLRRLSIAVLARLIVQATLKNPRWKEEDHDR
Ga0170824_1042755662	3300031231	Forest Soil	MLSFLDIPIPEIQVWETLQENLRRLAITVLARLIVQATLKHPGWEEQDHDR
Ga0170824_1133204923	3300031231	Forest Soil	VQRMLNFLDIPIPEMQLWETLAEEPRILAIQVLARLIFQVTVHPQRPEEDHDR
Ga0170820_163372832	3300031446	Forest Soil	VQRMLSFLDIPIPEIQVWETLQENLRRLAITVLARLIVQATLKNPGWEEQDHDR
Ga0170818_1050998573	3300031474	Forest Soil	MLSFLDIPIPEIQVWETLQENLRRLAITVLARLIVQATLKNPGWEEQDHDR
Ga0318516_100598612	3300031543	Soil	MLNFLDIPIPEIQVWETLHEEPRSRAIAVLARLIVQASFENPELEEEDHDR
Ga0318538_100945712	3300031546	Soil	MLNFLDIPIPEIQVWETLQEEPRKLVIAVLARLIIQATLNHPGSGEENRDR
Ga0318528_107525581	3300031561	Soil	LQRMLNFLDIPIPEIQVWETLQEEPRKLVIAVLARLIIQATLNHPGSGEENR
Ga0318574_103740193	3300031680	Soil	MLNFLDIPIPEIQVWETLHEEPRSRAIAVLARLIVQASFENPELEEEDHD
Ga0318496_102620442	3300031713	Soil	LQRMLNFLDIPIPEIQVWETLHEEPRSRAIAVLARLIVQASFENPELEEEDHDR
Ga0307476_105532552	3300031715	Hardwood Forest Soil	LQRMLNFLEMPIPEMQVWETLEEEQTISAITVLARLIAQATIERAEEEHDR
Ga0307474_101510871	3300031718	Hardwood Forest Soil	MLNFLNIPISEIQVWETLEEEQRILAVAVLARLIAQASEQQRSEEDHDR
Ga0318501_103743462	3300031736	Soil	RRTPSLQRMLNFLDIPIPEIQVWETLQEEPRKLVIAVLARLIIQATLNHPGSGEENRDR
Ga0307477_100953122	3300031753	Hardwood Forest Soil	VLQRMLNFLDIPIPEIQVWETLQEEQRILTITVLARLIAQATVDQPPRLEEDHDR
Ga0307475_100420232	3300031754	Hardwood Forest Soil	MLNFLNIAIPEIRVWETLQEEATKLAIAVLARLVIQATLNHPGSEEENRDR
Ga0318503_101073041	3300031794	Soil	DIPIPEIQVWETLHEEPRSRAIAVLARLIVQASFENPELEEEDHDR
Ga0306919_101429662	3300031879	Soil	MLNFLDIPIPEIQVWETLQEEPRKLVIAVLARLILQATLNHPGSGEENRDR
Ga0306926_113332712	3300031954	Soil	LQRMLNFLDIPIPEVQVWETLEEEARSWAIAVLARLIVQTTFKNPELEEEES
Ga0307479_100727462	3300031962	Hardwood Forest Soil	MLNFLEMPIPEMQVWETLEEEQTISAITVLARLIAQATIERAEEEHDR
Ga0318575_100187281	3300032055	Soil	PEIQVWETLHEEPRSRAIAVLARLIVQASFENPELEEEDHDR
Ga0306924_119893811	3300032076	Soil	LQRMLNFLDIPIPEIQVWETLQEEPRKLVIAVLARLIIQATLNH
Ga0306920_1000809932	3300032261	Soil	MRNLSTRRTPSLQRMLNFLDIPIPEIQVWETLQEEPRKLVIAVLARLIIQATLNHPGSGEENRDR
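
Each row of the table above carries four fields: Protein ID, Sample Taxon ID, Habitat and Sequence. The following is a minimal sketch for splitting a row into those fields and writing FASTA records, whether or not the fields are separated by whitespace. It rests on assumptions drawn from the rows shown here rather than from any NMPFamsDB specification: taxon IDs are 10-digit numbers starting with 3300, habitat names end in a lowercase letter, sequences are uppercase, and a trailing `*` marks a translated stop codon. The names `parse_row` and `to_fasta` are hypothetical.

```python
import re

# Assumed row layout: protein ID, 10-digit taxon ID (3300xxxxxx), habitat,
# uppercase sequence, optional trailing "*". Separators are optional so the
# pattern accepts both delimited rows and rows with fused fields.
ROW = re.compile(
    r"^(?P<protein_id>\S+?)\s*"
    r"(?P<taxon_id>3300\d{6})\s*"
    r"(?P<habitat>[A-Z][A-Za-z ]*[a-z])\s*"
    r"(?P<sequence>[A-Z]+)(?P<stop>\*?)$"
)


def parse_row(line: str) -> dict:
    """Split one table row into its four fields plus a stop-marker flag."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    fields = match.groupdict()
    # Keep the sequence without "*", but remember whether it was present.
    fields["has_stop"] = fields.pop("stop") == "*"
    return fields


def to_fasta(rows: list) -> str:
    """Render parsed rows as FASTA text, one record per row."""
    records = []
    for row in rows:
        header = f">{row['protein_id']} {row['taxon_id']} {row['habitat']}"
        records.append(f"{header}\n{row['sequence']}")
    return "\n".join(records) + "\n"


if __name__ == "__main__":
    example = ("Ga0123357_1092699223300009784Termite Gut"
               "MLNFLDIPIPETQIWETLEEEQRLLAIAVLARLIVQAMVDQPRAEVDNDR*")
    print(to_fasta([parse_row(example)]))
```

A row that does not fit these assumptions raises `ValueError` instead of being split silently, which seems the safer default for identifiers that end in digits.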




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice