NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F071704

Metagenome Family F071704

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F071704
Family Type Metagenome
Number of Sequences 122
Average Sequence Length 43 residues
Representative Sequence MDRKLRNRTLLFLALAAIVAFVLVRLSGRQPVAKISATTPVRQ
Number of Associated Samples 111
Number of Associated Scaffolds 122

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 77.97 %
% of genes near scaffold ends (potentially truncated) 95.90 %
% of genes from short scaffolds (< 2000 bps) 90.98 %
Associated GOLD sequencing projects 107
AlphaFold2 3D model prediction Yes
3D model pTM-score0.50

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (62.295 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(13.934 % of family members)
Environment Ontology (ENVO) Unclassified
(29.508 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(51.639 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 40.85%    β-sheet: 0.00%    Coil/Unstructured: 59.15%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540MDRKLRNRTLLFLALAAIVAFVLVRLSGRQPVAKISATTPVRQSequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.50
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
62.3%37.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Bog Forest Soil
Peatland
Freshwater Wetlands
Freshwater Sediment
Watersheds
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Glacier Forefield Soil
Grasslands Soil
Surface Soil
Peatlands Soil
Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Soil
Tropical Peatland
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Palsa
Peat Soil
Switchgrass Rhizosphere
Populus Rhizosphere
3.3%4.1%13.1%8.2%13.9%6.6%4.9%5.7%4.9%3.3%3.3%3.3%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
deeps_000634902199352024SoilMDQRKVRNRILIFLVLAGVVAYVLVRLSGREPVAKIAAVSPVRDNLVSSIS
JGIcombinedJ26739_10026727513300002245Forest SoilMDQRKLRNRILLFLVVAAIVAYVLVRLSGREPVVKIAAVTPVRD
JGIcombinedJ26739_10044073413300002245Forest SoilMDKRFRNRTILLLIGAAVLAFVLVRLSGRQPVAKISAMIPGRQNIISSVTSNG
JGI26337J50220_103687813300003370Bog Forest SoilMDRRLRNRIVIFLVLAGVAAFVLVKLSGRQPVAKISAVAPIREDLASSI
Ga0062385_1037455313300004080Bog Forest SoilMEKKLRNRVLLFLALAAIAAFVLVKISGRQPVAKIAATRPLRENIISSVSS
Ga0062386_10057970313300004152Bog Forest SoilMDKKLRNRTLLFLLLVGIVVFVLIHWSGRQPVAKISAVRPVRQNVVASITSN
Ga0066388_10606297323300005332Tropical Forest SoilMDRRLRNRILLFLLAAGILAYVLVLLSGRQPVTKLSATFPTREN
Ga0066388_10777876223300005332Tropical Forest SoilMERKLRNRILIFLALAAVVAFVLVKLSGRQPVAKIMAVKPVRENLVASVSSN
Ga0070708_10097445213300005445Corn, Switchgrass And Miscanthus RhizosphereMDRRLRNRILLFLLAAGILAYVLVLLSGRQPVAKLSATYP
Ga0066695_1008786443300005553SoilMDRRLRNRILLFLVLAAAAAYGLYRLSGRQPAAKIAA
Ga0070717_1082791623300006028Corn, Switchgrass And Miscanthus RhizosphereMDRRLRNRILLILLAAGILAYVLLLLSGRQPVAKLSA
Ga0075028_10043268123300006050WatershedsMDRKLRNRILIFLAIAGVVAYGLVRLSGRQPVAKISVIQPIRETLVS
Ga0075029_10028706113300006052WatershedsMDKKFRNRTLLLLLLAGILAFFLIRASGKQPVAKISAMQPVREDIVSSIS
Ga0070715_1023989223300006163Corn, Switchgrass And Miscanthus RhizosphereMERKLRNRILIFLAAAGIVAYLLVRLSGRQPVARIMAVKPVRENLVASVSTNG
Ga0070765_10110312213300006176SoilMDRRLRNRIVILLLLAGAAAPFLVWLSGRKPAAKI
Ga0070765_10132693223300006176SoilMDRKLRNRILIFLVLAGIVAFALIRFSGRQPVAKIFAV
Ga0070765_10224597313300006176SoilMDRRLRNRILIFLALAGASALVLIKVSGRQPLAKISAVTPV
Ga0075434_10041610013300006871Populus RhizosphereVDRRLRNRILIFLLLAGIAAIVLISLSGRQPVAKISAVMPV
Ga0075426_1146142313300006903Populus RhizosphereMERKLRNRILIFLVAAGIVAFLLVRLSGRQPIAKIMAVKPV
Ga0075435_10082388613300007076Populus RhizosphereMERKLRNRVLIFLLVAGIVAFVLVRISGRQPVAKISAMTPKRQN
Ga0099792_1036306413300009143Vadose Zone SoilMDRRLRNRIVIFLVLAAIVAFVLVRLSGRQPVAKISAVTP
Ga0116218_138921023300009522Peatlands SoilMDRKLRNRTLLLLGLAAVVAFVLIRVSGHQPVAKISATSPVRQNITLLIS
Ga0116221_127573033300009523Peatlands SoilMDKKLRNRVLVFLLLAGIVAFLLVRVSGRQPVAKIS
Ga0126380_1039379023300010043Tropical Forest SoilMDRRLRNRILVFLLAAGILAYVLVLMSGRQPVAKLSATYATRE
Ga0126380_1079067113300010043Tropical Forest SoilMERKLRNRILIFLALAAVVAFVLVRLSGRQPVARIMVVKPVRENLVASV
Ga0126373_1286215113300010048Tropical Forest SoilMDRRLRNRILLFLLAAGISAYVLVLLSGRQPVAKLSATVPTRENIVSSVS
Ga0134080_1022279313300010333Grasslands SoilMERKLRNRILIFLAAAGIVAYLLVRLSGRQPVAKVMAVRP
Ga0126376_1089011213300010359Tropical Forest SoilMDRKLRNRILLFLVLAAIAATALISLSGRQPVAKISAVLPVRENI
Ga0126372_1015933913300010360Tropical Forest SoilMDRKLRNRTLLFLALAAIVAFVLVRLSGRQPVAKISATTPVRQNIVSSIISN
Ga0126372_1207735613300010360Tropical Forest SoilMERKLRNRILIFLALAAVVAFVLVKLSGRQPVAKIMAVKPVREN
Ga0126378_1298174623300010361Tropical Forest SoilMDRKLRNRTLLLLVLAGIIAFVLVRISGRQPVAKIAATRPVRQ
Ga0126378_1332355513300010361Tropical Forest SoilMDRRLRNRILIFLFAAGVLAYVLVLLSGRQPVAKLSATYPTREN
Ga0126377_1243968713300010362Tropical Forest SoilMDRRLRNRILFYLLGAGVLAYVLFLLSGRQPVAKLSAV
Ga0126379_1064146433300010366Tropical Forest SoilMDKKLRNRTLLFLLAAGIVAFVLIKISERQPVAKIAAVRPFRQ
Ga0134121_1292930513300010401Terrestrial SoilMDRRLRNKIVLFLLAAALVAYGLVWLSGRQPVAKIAAVTPM
Ga0137393_1161834123300011271Vadose Zone SoilMDRRLRNRIVIFLILAAVVAIVLVRLSGRQPVAKISAVTP
Ga0137363_1127410713300012202Vadose Zone SoilMDRRFRNRILIFLLLATVAAFALVELSGRKPVAKISAVTPTREN
Ga0137362_1045488713300012205Vadose Zone SoilMDRRLRNRIILFLALAAVAAVILVRLSGRQPVAKISAVTPV
Ga0137380_1077920033300012206Vadose Zone SoilMDRRLRNRILLFLVLAAAAAYGLYRLSGRQPAAKIA
Ga0137390_1010868613300012363Vadose Zone SoilMDRRLRNRIVILLVLAGIVAYALVRLTGRQPVAKISAVTPVREN
Ga0137395_1073043523300012917Vadose Zone SoilMDRRLRNRILIFLALAAIVAFALVRLSGRQPVAKISAVTPIRENVISSISS
Ga0137394_1092077213300012922Vadose Zone SoilMDRRLRNRILLILLAAGILAYMLLLLSGRQPVAKLSAVTPTRENIVS
Ga0137419_1052135113300012925Vadose Zone SoilMDRRLRNRILLILLAAGILAYLLLLLSGRQPVAKLSAVKPT
Ga0137407_1106904023300012930Vadose Zone SoilMIGFQEFMDKKLRNRTLLFLLLAGIVAFVLIKVSGRQPVAKIS
Ga0153915_1029558313300012931Freshwater WetlandsMERKLRNRILLFLLLAGVIAYLLVRISGRQPVAKI
Ga0153915_1223821423300012931Freshwater WetlandsMERKLRNRILLFLLLAGVIAYLLVRISGRQPVAKIAAVLPARENLVASISSN
Ga0137410_1030713613300012944Vadose Zone SoilMDRRLRNRVLIFLALAAIVAFALVRLSGRQPVAKISAVTPIRENVISSI
Ga0137405_120485113300015053Vadose Zone SoilMERKLRNRILIFLAAAGIVIFLAAAGIVAYLLVRLSGRQP
Ga0167661_104665513300015167Glacier Forefield SoilMDQRKVRNRILIFLALAAIVAYALVRLSGREPVVKIAAVMPVL
Ga0167638_103154223300015197Glacier Forefield SoilMDQRKLRNRILLFLLVAGIVAYALVRLSGREPVVKIAAVTPV
Ga0137403_1015546743300015264Vadose Zone SoilLIFLLLAGIAAIVLISLSGRQPVAKISAVMPVRENIVASL
Ga0182036_1070849023300016270SoilMDRRLRNRILIFLFAAGVLAYLLVLLSGRQPVAKLSA
Ga0182033_1117711613300016319SoilMDRKLRNRTLLFLALAAVVAFVLVRLSGRQPVAKISATTPVRQNIVSSVS
Ga0182040_1079147813300016387SoilMERKLRNRVLIFLLVAGIVAFVLVRMSGRQPVAKISAMTPKRQNIVSSISSN
Ga0182037_1166429823300016404SoilMDRKLRNRTLLFLALAAVVAFALVRLSGRQPVAKISATTPVRQNIVSSV
Ga0182039_1004023743300016422SoilMDRKLRNRALLFLALAGIVAFALVRLSGRQPVAKISAT
Ga0182039_1185391413300016422SoilMDRRLRNRILVFLLAAGILAYVLILLSGRQPVAKLSATY
Ga0187802_1045503123300017822Freshwater SedimentMDRKLRNRTLLLLGLAAVVAFVLVRVSGRQPIAKISAARPVRQNIVSSTSSN
Ga0187825_1042019223300017930Freshwater SedimentMDRRLTRRTLLLLALAAVVAFVLIRVSGRQPVAKISATSPV
Ga0187801_1017196433300017933Freshwater SedimentMDERLQLIMDRKLRNRTLLLLGLAAVVAFVLIRVSGRQPVAKISATSPVRQNIMSSIG
Ga0187803_1015646913300017934Freshwater SedimentMDRRLTRRTLLLLALAAVVAFVLIRVSGRQPVAKISATSPVRQ
Ga0187785_1073299223300017947Tropical PeatlandMDKTLRNRTLLFLLLAGIVAFVLVRVSGRQPVAKISAVRPVRQNIVSSI
Ga0187817_1019695813300017955Freshwater SedimentMDKTLRNRLLVFLLVAGIVAYYLVWVSGRQPVAKISATTPV
Ga0187779_1097129923300017959Tropical PeatlandMAAQLQDFMERKLRNRVLIFLLVAGIVAFVLVRISGRQPVAKISAMTPMRQNIV
Ga0187780_1073406913300017973Tropical PeatlandMDKKLRNRLVVFLLLAGIVAFVLVRISGRQPVAKVSATTP
Ga0187777_1052754523300017974Tropical PeatlandMDKKLRNRTLFLLVLAGIVAFVLVRISGRQPVAKIAATRPVRQNIAS
Ga0187784_1073565613300018062Tropical PeatlandMDKKLRNRTISLLVLAALLAFVLIHVSGRQPVAKISASRP
Ga0187772_1014135913300018085Tropical PeatlandMDRKLRNWTLLLLGLAAVVAFVLVRVSGRQPVAKI
Ga0187771_1005295443300018088Tropical PeatlandMDKRWRNRTLLLLGLAALAVIVLMRVSGRQPVAKISVTAPVRQDIVSSIIS
Ga0179594_1013339023300020170Vadose Zone SoilMDQRKVRNRILICLVLAGVVAYVLVRLSGREPVAKIAAVSPVRDNV
Ga0210403_1105952713300020580SoilMDRRLRNRILIFLVLAAVVAFALVRLSGRQPVAKISAVSPIRENVISSISS
Ga0210399_1107935613300020581SoilMDRKLRNRILIFLVLAGIVAFTLIRFSGRQPVAKILAVTPIRANVVSSISSN
Ga0210395_1113618623300020582SoilMLRNRVLILLGLAAIIAFVLVKVSGRQPVAKISATTPV
Ga0210406_1113404813300021168SoilMDRRLRNRIVIFLVLAAIVAFVLVRLTGRQPVAKIAAVTP
Ga0210408_1025792033300021178SoilMDKKLRNRTLLFLLLAGIVAFALIKYSGHQPVAKISA
Ga0210408_1104436913300021178SoilMDRRLRNRIVIFLVLAAVVAFALVRLSGRQPVAKISAVTPVRENLISS
Ga0210383_1051604323300021407SoilMDKKLRNRTLLFLLLVVIVVIVFIKVSGRQPAAKISAVKPFRENI
Ga0210398_1002407113300021477SoilMDQRKLRNRILLFLVVAGIVAYVLVRLSGREPVAKIAAVTPVRDNLVSS
Ga0210402_1051978323300021478SoilMLRNRVLILLGLAAIIAFVLVRVSGRQPVAKISATT
Ga0224564_111612913300024271SoilMDRRLRNRIVILLLLAGAAAPLLVWLSGRKPAAKISAMTP
Ga0208321_100908813300025409PeatlandMDKKLRNRVFAFLLLAGIVAFLLVRISGRPPVAKISATTPVRRNIVASIS
Ga0208036_101540413300025419PeatlandMDKKLRNRVFAFLLLAGIVAFLLVRISGRPPVAKIS
Ga0207693_1099157123300025915Corn, Switchgrass And Miscanthus RhizosphereMDRRLRNRILLILLAAGILAYVLLLLSGRQPVAKL
Ga0207641_1059406613300026088Switchgrass RhizosphereMDRRLRNRILLFLLAAGILAYVLVLLSGRKPVAKLSATYPTRENIV
Ga0209470_138060513300026324SoilMDRRLRNRLLFFLLAAAILAYLLFLLSGRQPVTKLSAVVPSHE
Ga0179593_114593913300026555Vadose Zone SoilMDKKLRNRTLLLLLLAGIVAFVLIKVSGRQPVAKISAVKPFRHNI
Ga0179587_1050380213300026557Vadose Zone SoilMDRRLRNRILIFLALAAIAAFILVRLSGRQPVAKISAVLPIREN
Ga0207781_101594313300026890Tropical Forest SoilMDRKLRNRTLLFLALAGIVAFVLVRLSGRQPVAKISATTPVRQNIVSS
Ga0207817_100612923300026979Tropical Forest SoilMDRKLRNRTLLFLALAGIVAFVLVRLSGRQPVAKISATTPVRQNI
Ga0207815_100598613300027014Tropical Forest SoilMDRKLRNRTLLFLALAAIVAFVLVRLSGRQPVAKISAT
Ga0207819_104107513300027024Tropical Forest SoilMDRKLRNRTILFLLLAGIVAFILVRASGRQPVAKISAMRPARENIVSSV
Ga0209529_102636813300027334Forest SoilMDRRLRNRILIFLLAATVLALILIKVSGRQPVAKISAVTPIRQNIIASIS
Ga0209004_101334913300027376Forest SoilMDKKLRNRTLLFLLLAGIVAFVLIKVSGRQPAAKISA
Ga0209656_1016763613300027812Bog Forest SoilMEKKLRNRVLIFLLVAGIVAYVLVRVSGRQPVAKIT
Ga0209166_1033520133300027857Surface SoilMERKLRNRILIFLVAAGIVAYLLVRLSGRQPVARIMAVKPVRENLVASVST
Ga0209583_1015879613300027910WatershedsMDKKLRNRTLLFLLLAGFVAFVLIKVSGRQPVAKISAVKPFRQN
Ga0265352_101240023300028021SoilMERRLRNRIVISLLLAVVAAIFLVWLSGRKPAAKISAV
Ga0268264_1042859313300028381Switchgrass RhizosphereMDRRLRNRILVFLLAAGILAYVLVLLSGRQPVAKL
Ga0308309_1040991223300028906SoilMDRRLRNRIVIFLLLAGVLALVLIKVSGRQPVAKISAVTPVRENI
Ga0311368_1032856123300029882PalsaMERVLRNRILIFLVLAAVGAVVLIRLSGRQPVAKITAVRPVRQNIVAFI
Ga0302178_1048158313300030013PalsaMERVLRNRILIFLVLAAVGAVVLIRLSGRQPVAKITAVRPVR
Ga0302309_1028907423300030687PalsaMDHRLRNRILIFLALAAVAAYALVRLSGRQPVAKISAVTPIREN
Ga0170818_10924181533300031474Forest SoilMDRRLRNRILLILLAAGILAYMLLLLSGRQPVAKLSAVKP
Ga0318573_1038171413300031564SoilMDRKLRNRTLVLLVVAGVIAFILVKLSGKQPVAKIAATVPVR
Ga0318496_1050520423300031713SoilMDRKLRNRTLLFLALAAVVAFVLVRLSGRQPVAKISATTPVRQNIVSSVSS
Ga0307476_1066316813300031715Hardwood Forest SoilMDRKLRNRTLLFLALAAIVAFVLVRLSGRQPVAKISATTPVRQ
Ga0307474_1163147913300031718Hardwood Forest SoilMDRRLRNRILIFLVLAGATALVLIRVSGRQPVAKISAVTPVRENIIASI
Ga0307469_1013907813300031720Hardwood Forest SoilMDRRLRNRIVIFLVLAGIVAFALVRLSGRQPVAKISAVSPMRE
Ga0307478_1055466813300031823Hardwood Forest SoilMDRRLRNRIVIFLALAAVAVFVLVRLTGRQPVAKIAAV
Ga0310917_1087666013300031833SoilMDRKLRNRTLLFLALAAVVAFALVRLSGRQPVAKI
Ga0306925_1147756013300031890SoilMDRKLRNRTLLFLALAAVVAFALVRLSGRQPVAKIS
Ga0310910_1011780533300031946SoilMDRKLRNRTLVLLALAAVTAFILIRVSGRQPVAKIAATR
Ga0310910_1071596013300031946SoilMDKKLRNRTLLFLLLAGIAVFAFIRLSNRQPVAKIAAVLPVRQNIV
Ga0318530_1023211013300031959SoilMDRKLRNRTLLFLALAAVVAFALVRLSGRQPVAKISATTPVRQNIVSSVSSN
Ga0307479_1035103113300031962Hardwood Forest SoilMDRRLRNRIVIFLVLAAIVAFVLVRLTGRQPVAKIAA
Ga0318549_1038225413300032041SoilMDRRLRNRILIFLFASGVLAYVLVLLSGRQPVAKL
Ga0307471_10206796423300032180Hardwood Forest SoilMDRRLRNRIVIFLVLAAVAVIVLVRLTGRQPVAKI
Ga0306920_10123677623300032261SoilMDRRLRNRILIFLFAAGVLAYLLVLLSGRQPVAKLSATYPPAKISSPP
Ga0335079_1006498853300032783SoilMDRKLRNRIVFFLLGAGVVAFVLVKLSGRQPVAKISAVTPMREN
Ga0335081_1142006623300032892SoilMDRKLRNRALILLGFAAIIAFALVRLSGRQPVAKIAATR
Ga0310914_1189548713300033289SoilMDRKLRNRTLVLLALAAVTAFILIRVSGRQPVAKIA
Ga0326728_1027000333300033402Peat SoilMDKKLRNRVLVFLLLAGIVAYLLVRISGRQPVAKISA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.