NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F104637

Metagenome / Metatranscriptome Family F104637

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104637
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 40 residues
Representative Sequence MILSNRFWRSVTWLVRLDMAIGIAIAVGLLLWLMWH
Number of Associated Samples 83
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 64.00 %
% of genes near scaffold ends (potentially truncated) 21.00 %
% of genes from short scaffolds (< 2000 bps) 75.00 %
Associated GOLD sequencing projects 77
AlphaFold2 3D model prediction Yes
3D model pTM-score0.49

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (73.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(14.000 % of family members)
Environment Ontology (ENVO) Unclassified
(37.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(58.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 53.12%    β-sheet: 0.00%    Coil/Unstructured: 46.88%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035MILSNRFWRSVTWLVRLDMAIGIAIAVGLLLWLMWHCytopl.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.49
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
76.0%24.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Soil
Soil
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Surface Soil
Peatlands Soil
Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Soil
Soil
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Agricultural Soil
Arabidopsis Rhizosphere
Avena Fatua Rhizosphere
Tabebuia Heterophylla Rhizosphere
Switchgrass Rhizosphere
Switchgrass Rhizosphere
Tabebuia Heterophylla Rhizosphere
Populus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Boreal Forest Soil
Tropical Rainforest Soil
3.0%5.0%14.0%6.0%14.0%13.0%8.0%4.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_035168202088090014SoilMSDGLKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWX
GPIPI_010226602088090014SoilMSDGLKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMW
KansclcFeb2_012668302124908045SoilMTLSDRFWRSVTLLVRLDMAAGIAITVGLMFWLMWH
INPgaii200_099358612228664022SoilMSRSDRFWRGVTLLVRLDMAAGIAIAAGLLLVLVLVWH
AF_2010_repII_A100DRAFT_104295123300000655Forest SoilMSXSDRFWRSVTLLVRLDMAACIAIAAGLLLVLVWH*
AP72_2010_repI_A100DRAFT_100103873300000837Forest SoilMTLSDRFWRSVTLLVRLDMAAGIAITAGLVFWLMWH*
JGI12627J18819_1022835613300001867Forest SoilKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWH*
Ga0062595_10080784513300004479SoilVESSALRMSDGLKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWH*
Ga0066673_1053307123300005175SoilMSEGPRTMILSNRFWRSVTWWVRLDMAICIAIAVGLLLWLMWH*
Ga0066388_10173948413300005332Tropical Forest SoilMSEGQRTMILSNRFWRSVTWLVRLDMAIGIAIAVGLLLWLMWH*
Ga0066388_10291488723300005332Tropical Forest SoilMSHGQMTMILSNRFWRSITWLVRLDIAASIVIAVGLLLWLMWH*
Ga0066388_10309198723300005332Tropical Forest SoilLEGEERGMTRQEQERATMLSHRFWRTLTCLVRLDMAAGIILAVGLLLWLKWH*
Ga0066388_10565837723300005332Tropical Forest SoilMSHGLMTMVLSNRFWRSMTWLVRLDIAASIVIAVGLLLWLMWH*
Ga0066388_10711945023300005332Tropical Forest SoilMSAGLRTMILSNRLWRSVTWLVRLDMAMCIAVAVGLLLWLMWH*
Ga0070668_10133679723300005347Switchgrass RhizosphereMSDGLKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWP*
Ga0070674_10152383523300005356Miscanthus RhizosphereMSDGPKTMILSNRFWRSVTWLVRLDMAVGIVIAAALLLWLMWH*
Ga0008090_1008972553300005363Tropical Rainforest SoilMSEGLMTMIPSNRFWRSVTWLVRLDMAICIAIAVGLPLWLMWH*
Ga0070709_1030471023300005434Corn, Switchgrass And Miscanthus RhizosphereMSDGLKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWH*
Ga0070710_1005796223300005437Corn, Switchgrass And Miscanthus RhizosphereMSDGPKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWH*
Ga0070711_10162977113300005439Corn, Switchgrass And Miscanthus RhizosphereVESSALRMSDGPKTMILSNRFWRSVTWLVRLDMAVGIVIAAALLLWLMWH*
Ga0070733_1003973823300005541Surface SoilVATIISRRFWRTITLSVRLDIAAGIATTAGLLLWLTWH*
Ga0070695_10047897023300005545Corn, Switchgrass And Miscanthus RhizosphereMRVGEAITMTRMAGFWRALGLLVRLDMAAGIAITAALLLWLTWH*
Ga0066699_1120643723300005561SoilMILSNRFWRSVTWWVRLDMAICIAIAVGLLLWLMWH*
Ga0066654_1071900423300005587SoilMILSNRFWRSVTWLVRLDIAVCIAIAVGLPLWLVWH*
Ga0066905_10066548823300005713Tropical Forest SoilMILSDRFWRSVTVMVRLDMAAGIAVAAGLLLWLMWH*
Ga0066905_10070734523300005713Tropical Forest SoilMTPSDRFWRSVTLLVRLDMAAGIAITAGLVFWLMWH*
Ga0066905_10094147723300005713Tropical Forest SoilMMTVSDRFWRSVTLLVRLDMAAGIAITAGLMFWLMWH*
Ga0066903_10012893823300005764Tropical Forest SoilMTPSNRFWRGITWLVRLDIAVCIAIAAGLLLWLKWH*
Ga0066903_10621270723300005764Tropical Forest SoilMESGALRMSAGPRTMILSNRFWRSVTWLVRLDMAIGIAIAVGLLLWLMWH*
Ga0066903_10683826223300005764Tropical Forest SoilMSEGLRTMILSNPFWRSVTWLVRLDMANGHVHRLAVGLLLWLMWH*
Ga0066903_10770255223300005764Tropical Forest SoilMTLSDRFWRSVILLVRLDMAAGIAITAGPVFWLMWH*
Ga0070766_1077644623300005921SoilVATTISRRFWRTITLSVRLDIAAGIATTAGLLLWLTWH*
Ga0081455_1006396723300005937Tabebuia Heterophylla RhizosphereMTRQNVNELMMLSHRFWHRLTWLVRLDMAAGIALAVGLLLWLKWH*
Ga0081538_1002662823300005981Tabebuia Heterophylla RhizosphereMTRQNVNELMMLSHRFWHRLAWLVRLDMAVGIALAVGLLLWLKWH*
Ga0081540_100766513300005983Tabebuia Heterophylla RhizosphereMESGALRVSDGLRTMILSNPFWRSATWLVRLDIAASIVIAAGL
Ga0070715_1000174633300006163Corn, Switchgrass And Miscanthus RhizosphereMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWH*
Ga0075433_1008045843300006852Populus RhizosphereMSRSDRFWRGVTLLVLLDMAAGIAIAAGLLLVLVWH*
Ga0075425_10106266323300006854Populus RhizosphereMSEGLRAMILSNRFWRNVTWLVRLDMAICIAIAAGLLLWLTWH*
Ga0075434_10009561023300006871Populus RhizosphereMSEGLRAMILSNRFWRNVTWLVRLDMAICIVIPAGLLLWLMWH*
Ga0074063_1013870623300006953SoilMIRSDRFWRGVTLLVRLDMAAGIAIAAGLLLVLVWH*
Ga0079219_1011094023300006954Agricultural SoilMNQGLRAMILSNRFWRNVTWLVRLDMAICIVIPAGLLLWLMWH*
Ga0114129_1025971023300009147Populus RhizosphereMILSNRFWRSVTWLVRLDMAVGIVIAAALLLWLMWH*
Ga0116224_1032273423300009683Peatlands SoilMIVSGRFWRGLTLLARADMAASIAIAAGLLLWLTWHS*
Ga0126374_1001427733300009792Tropical Forest SoilMSRSDRFWRGVTLLVRLDMAAGIAIAAGLLLVLVWH*
Ga0126374_1075861323300009792Tropical Forest SoilMEKGALRMSHGLMTMVLSNRFWRSMTWLVRLDIAASIVIAVGLLLWLMWH*
Ga0126384_1136624423300010046Tropical Forest SoilMVVSKRFWRSITLVVRLDIAACIAIAAGLLLWLTWD*
Ga0126384_1140359323300010046Tropical Forest SoilMTISPRFWRVATLLVRLDIATGILLATGLWLWLTWH*
Ga0134082_1054739913300010303Grasslands SoilMESGALRMNDGPRTTILSNRFWRSVTWWVRLDMAICIAIAVGLLLWLMWH*
Ga0126376_1322112223300010359Tropical Forest SoilMTVSSRFWRAVTLLVRLDMAAGITIAAALFLFLTWR*
Ga0126377_1248523123300010362Tropical Forest SoilMTISPRFWRAVTLLVRLDVAAGIAIAAALFLFVTWH*
Ga0126379_1147401423300010366Tropical Forest SoilMTLSDRFWRSVTVLVRLDMAAGIAIAAGLLLWLMWH*
Ga0126381_10016241843300010376Tropical Forest SoilMESGALRMSEGLMTMIPSNRFWRSVTWLVRLDMAICIAIAVGLPLWLMWH*
Ga0126381_10024225113300010376Tropical Forest SoilMAISPRFWRAATLLVRLDMAAGITLAAALLLFLTWH*
Ga0126383_1306178913300010398Tropical Forest SoilMSGSDRFWRSVTLLVRLDMAACIAIAAGLLLVLVWH*
Ga0126350_1090436323300010880Boreal Forest SoilMRVTPRFWRSVTLLVRLDMAACITLAAALFLFSTWH*
Ga0137365_1005532423300012201Vadose Zone SoilMTLSDRLWRSVTVLVRLDMAAGIAIAAGLVLWLMWH*
Ga0137363_1121114323300012202Vadose Zone SoilMTLSDRFWRSVTVVVRLDMAAGIAIAAGLLLWLMWH*
Ga0137374_1004704923300012204Vadose Zone SoilMTLSDWLWRSVTVLVRLDMAAGIAIAAGLVLWLMWH*
Ga0150985_11817507913300012212Avena Fatua RhizosphereASWTPRLWRGLTLLVRLDIAASVTMTAALLLWLTWH*
Ga0137372_1016099223300012350Vadose Zone SoilVRTMTLSDRLWRSVTVLVRLDMAAGIAIAAGLVLWLMWH*
Ga0137360_1015074533300012361Vadose Zone SoilMTLSDRLWRSVTVFVRLDMAAGIAIAAGLVLWLMWH*
Ga0164303_1009838133300012957SoilMSDGPKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMW
Ga0126369_1080205623300012971Tropical Forest SoilMESGALRMSAGLRTMILSNRLWRSVTWLVRLDMAMCIAVAVGLLLWLMWH*
Ga0126369_1104413523300012971Tropical Forest SoilMVVSKRFWRSITLLVRLDIAACIAIAAGLLLWLTWD*
Ga0163162_1326015523300013306Switchgrass RhizosphereMESGALRVSDGLRTLILSNRFWRNVTWLVQLDMAVCIAITVGLLLWLMWH*
Ga0132258_1065616033300015371Arabidopsis RhizosphereMSEGLRAMILSNRFWRNVTWLVPLDMAICIVIAAGLLLWLMWH*
Ga0066667_1067419033300018433Grasslands SoilMILSNRFWRSVTWWVRLDMAICIAIAVGLLLWLMWH
Ga0210406_1104626723300021168SoilMESGALRVSDGPMTMILSNRFWRSVTWLVRLDIAASIVIAVGLLLWLMWH
Ga0210384_1043750813300021432SoilMESGALRVSDGPMTMILSNRFWRSVTWLVRLDIAASIVIAVGLLL
Ga0210402_1000864053300021478SoilMTMILSNRFWRSVTWLVRLDIAASIVIAVGLLLWLMWH
Ga0126371_1013784843300021560Tropical Forest SoilMILSNRFWRSVTWLVRLDMAIGIAIAVGLLLWLMWH
Ga0126371_1117613233300021560Tropical Forest SoilMTLSDRFWRSVTVLVRLDMAAGIAIAAGLLLWLMWH
Ga0207692_1008066813300025898Corn, Switchgrass And Miscanthus RhizosphereLKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWH
Ga0207699_1004574043300025906Corn, Switchgrass And Miscanthus RhizosphereMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWH
Ga0207664_1016534823300025929Agricultural SoilRMSDGPKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWH
Ga0207665_1011602733300025939Corn, Switchgrass And Miscanthus RhizosphereMSDALKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWH
Ga0207668_1116482823300025972Switchgrass RhizosphereMSDGLKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWP
Ga0207675_10219242723300026118Switchgrass RhizosphereMSDGLKTMILSNRFWRSVTWLVRLDMAVGIVIAAALLLWLMWH
Ga0209329_115463823300027605Forest SoilMIRSDWFWRGVTLLVRLDMAAGIAIAAGLLLVLVWH
Ga0209466_104747723300027646Tropical Forest SoilMTLSDRFWRSVTLLVRLDMAAGIAITAGLVFWLMWH
Ga0208696_104737123300027696Peatlands SoilMIVSGRFWRGLTLLARADMAASIAIAAGLLLWLTWHS
Ga0209167_1080758023300027867Surface SoilVATIISRGFWRTITLSVRLDIAAGIATTAGLLLWLTWH
Ga0307299_1024995613300028793SoilCKVVESSALRMSDGLKTMILSNRFWRSVTWSVRLDMAVGIAIAVALLLWLMWH
Ga0307277_10000073223300028881SoilMSDGAKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLMWH
Ga0318516_1014085323300031543SoilMTLSHRFWRSITVLVRLDMAAGIAIAAGLVFWLMWH
Ga0318516_1017153823300031543SoilMSRSDRFWRSVTLLVRLDMAAGIAIAAGLLLVLVWH
Ga0318534_1010072423300031544SoilMSRSDRFWRGVTLLVRLDMAAGIAIAAGLLLVLVWH
Ga0318528_1045238913300031561SoilRNGARRMSRSDRFWRSVTLLVRLDMAAGIAIAAGLLLVLVWH
Ga0318493_1029239633300031723SoilMSRSDRFWRSVTLLVRLDMAACIAIAAGLLLVLVW
Ga0318537_1017013023300031763SoilRDGARTMTLSHRFWRSITVLVRLDMAAGIAIAAGLVFWLMWH
Ga0318521_1082791413300031770SoilTLSHRFWRSITVLVRLDMAAGIAIAAGLVFWLMWH
Ga0318557_1031318123300031795SoilARTMTLSHRFWRSITVLVRLDMAAGIAIAAGLVFWLMWH
Ga0307473_1049581823300031820Hardwood Forest SoilMSDGLKTMILSNRFWRSVTWLVRLDMAVGIAIAVALLLWLM
Ga0318567_1018037413300031821SoilRMSRSDRFWRGVTLLVRLDMAAGIAIAAGLLLVLVWH
Ga0307478_1036249423300031823Hardwood Forest SoilMTVSRHFWRGLTFLARADMAASVAVAAAPLLWLTWHS
Ga0318520_1027756213300031897SoilSRSDRFWRGVTLLVRLDMAAGIAIAAGLLLVLVWH
Ga0306921_1015206613300031912SoilTMTLSHRFWRSITVLVRLDMAAGIAIAAGLVFWLMWH
Ga0306921_1208527523300031912SoilMTLSDRFWRSVILLVRLDMAAGIAITAGPVFWLMWH
Ga0318545_1033716913300032042SoilRTMTLSHRFWRSITVLVRLDMAAGIAIAAGLVFWLMWH
Ga0335080_1023245823300032828SoilMRVTSRFWRSVTLLVRLDIAASIALAAALFLFSTWH


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.