NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F100807



Overview

Basic Information
Family ID F100807
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 41 residues
Representative Sequence RDKLRYEVVVNFVDKDRIRGYLSAPKDKVLSAERPQFRPE
Number of Associated Samples 86
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 3.96 %
% of genes near scaffold ends (potentially truncated) 93.14 %
% of genes from short scaffolds (< 2000 bps) 85.29 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.19


Most Common Taxonomy
Group Bacteria (96.078 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (19.608 % of family members)
Environment Ontology (ENVO) Unclassified (42.157 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (63.725 % of family members)
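The percentages in this overview can be converted back into approximate member counts. The sketch below assumes each percentage is a share of the family's 102 sequences, which the page does not state explicitly; the name `members_from_percent` is illustrative only.

```python
FAMILY_SIZE = 102  # "Number of Sequences" from Basic Information


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Approximate member count, assuming `percent` is a share of `family_size`."""
    return round(percent / 100 * family_size)


# Percentages quoted in the Most Common Taxonomy / Ecosystem blocks
shares = {
    "Bacteria": 96.078,
    "GOLD: Grasslands / Soil": 19.608,
    "ENVO: Unclassified": 42.157,
    "EMPO: Soil (non-saline)": 63.725,
}

counts = {label: members_from_percent(pct) for label, pct in shares.items()}
for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} members")
```

Under that assumption the four shares correspond to 98, 20, 43 and 65 members respectively.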




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 22.06%, Coil/Unstructured: 77.94%
[Feature Viewer: tracks for Sequence, α-helices, β-strands, Coil and SS Conf. score, with axis ticks at positions 5 to 40 of RDKLRYEVVVNFVDKDRIRGYLSAPKDKVLSAERPQFRPE]

Predicted 3D Structure

pTM-score: 0.19

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
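
The note above amounts to a simple threshold test on the pTM-score. A minimal sketch follows, assuming the comparison is strict (`pTM < 0.7`) as written in the note; how the site treats a score of exactly 0.7 is not stated, and the function name is hypothetical.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the Low Quality Model note


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < threshold


family_ptm = 0.19  # pTM-score reported for family F100807
low = is_low_confidence(family_ptm)
print(f"pTM = {family_ptm}: low-confidence model = {low}")
```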



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Chart: NCBI taxonomy distribution of family members. Labels: All Organisms, Unclassified. Values: 96.1%, 3.9%]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Chart: distribution of family members by habitat type. Habitat labels: Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Grasslands Soil; Surface Soil; Grass Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Bio-Ooze; Arabidopsis Rhizosphere; Miscanthus Rhizosphere; Corn Rhizosphere; Switchgrass Rhizosphere; Populus Rhizosphere. Slice values: 2.9%, 3.9%, 3.9%, 2.9%, 6.9%, 16.7%, 19.6%, 5.9%, 4.9%, 10.8%]



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
N57_05106800 | 2170459013 | Grass Soil | FLVGRDRLRYEFVVNYVDKDRIRGYLSTPKDKVLAAEAGRRF
F14TC_1018205591 | 3300000559 | Soil | EFVVYSVDKDRVRGYISTPKDKLVAAENPLQRRLQ*
JGI1027J12803_1093380041 | 3300000955 | Soil | PVTFLVGKDRLRYEFVVNSVDKDRIRGYVSTPKDKVLAVEGPSTVQHPD*
Ga0066674_101346102 | 3300005166 | Soil | MFLVGPDRLRYELVVNYVDKDRIRAYLSTPKAPKDKTLAAEGPSFRRLQ*
Ga0066683_101071063 | 3300005172 | Soil | PVQFMVGHDQLRYEMVINAVNKDRIQGYLSTPKDKVLSAERTTQVKR*
Ga0066676_105427961 | 3300005186 | Soil | EQLRYEVVVNSVEKDRIRGYVSAPKDKVLSAEVPRFRLQ*
Ga0065715_108748351 | 3300005293 | Miscanthus Rhizosphere | LVGREKLRYEIVVNVVDKDRIRGYLSAPKDKVLSAERPAFK*
Ga0066388_1044997862 | 3300005332 | Tropical Forest Soil | LVGKDRLRYELVVYSVDKDRIRGYLSTPKDKTLSAEGPSFRH*
Ga0066388_1077770002 | 3300005332 | Tropical Forest Soil | ANEPVTFLVGHDRVRYEFVVFSVDKDRIRGYISTPKDKLVAAENPLQRRLQ*
Ga0066388_1077849961 | 3300005332 | Tropical Forest Soil | HDRVRYEFVVFTVDKDRIHGYISTPKDKLVAAESPLQRRLQ*
Ga0070675_1021144611 | 3300005354 | Miscanthus Rhizosphere | LVGQSHVRYELVVNAVEKDRIRGYVSTPKDVVLSAEGPNARQ*
Ga0070709_107852013 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | FLVGREQLRYEVVVNSVEKDRIRGYMSAPKDKSLSAEIPQFRQQ*
Ga0066686_110070101 | 3300005446 | Soil | LVGRDKLRYEIVVNYVDKDRIRGYLSAPKDKVLAAERPAFRPE*
Ga0066689_101297031 | 3300005447 | Soil | GHDQLRYEMVINAVNKDRIQGYLSTPKDKVLSAEGPTQLKR*
Ga0070678_1007618931 | 3300005456 | Miscanthus Rhizosphere | LLIGQNHVRYELVVNSVEKDRIRGYISAPKDVVLSAEGPTSR*
Ga0070741_112122652 | 3300005529 | Surface Soil | VGRDQLRYEVVVNSVEKDRIRGYVSAPKDKTLAAEIPRFRE*
Ga0066692_101388213 | 3300005555 | Soil | VGHDQLRYEMVINAVNKDRIQGYLSTPKDKVLSAERTTQVKR*
Ga0066692_109075741 | 3300005555 | Soil | YEFVVNYVDKDRIRGYLSTPKDKTLAAEGPSRQRF*
Ga0066703_101968193 | 3300005568 | Soil | FVVNYVDKDRIRGYLSTPKDKTLSAEGPNFRRLQ*
Ga0066706_104765662 | 3300005598 | Soil | MFLVGPDRLRYELVVNYVDKDRIRGYLSTPKAPKDKTLAAEGPSFRRLQ*
Ga0066903_1015045182 | 3300005764 | Tropical Forest Soil | YEFVVNSVDKDRIRGYLSTPKDRLTVAETPASLRVR*
Ga0068858_1024335151 | 3300005842 | Switchgrass Rhizosphere | QFLVGRDKLRYEIVVNIVDKDRIRGYLSAPKDKSLAAERPAAR*
Ga0066651_101743312 | 3300006031 | Soil | NYVDKDRIRGYLSTPKAPKDKTLAAEGPSFRRLQ*
Ga0066651_101856471 | 3300006031 | Soil | ELVVNFVDKDRVRGYLSTPKDKTFSAEGPSTRRLQ*
Ga0066665_101026043 | 3300006796 | Soil | QLRYEVVVNSVEKDRIRGYVSAPKDKVLSAEVPRFRLQ*
Ga0075428_1005180311 | 3300006844 | Populus Rhizosphere | VRYEFVVYSVDKDRIRGYISTPKDKLVAAENPVQRRLQ*
Ga0075421_1003955933 | 3300006845 | Populus Rhizosphere | RYEFVVYSVDKDRIRGYISTPKDKLVAAENPLQRRLQ*
Ga0075421_1027328431 | 3300006845 | Populus Rhizosphere | EPVTFLVGRDRLRYEIVVNYVDKDRIRGYMSTPKDKMLSAEGPTLRRER*
Ga0075425_1010508691 | 3300006854 | Populus Rhizosphere | FLVGRDRARYEFVVNFVDKDRIRGYISTPKDKLVASETPTLRVR*
Ga0075429_1005381973 | 3300006880 | Populus Rhizosphere | LRYEVVVNSVEKDRIRGYVSAPKDKVLSAEGPKFRAQ*
Ga0075429_1014981992 | 3300006880 | Populus Rhizosphere | EFVVYSVDKDRIRGYISTPKDKLVAAENPLQRRLQ*
Ga0075435_1014001092 | 3300007076 | Populus Rhizosphere | ELVVNYVDKDRIRGYLSTPKDKVLSAEAPALRRAQ*
Ga0066710_1002521833 | 3300009012 | Grasslands Soil | YEFVVNYVDKDRIRGYMSTPKDRLVAAETPTLRVQ
Ga0066710_1002746561 | 3300009012 | Grasslands Soil | LRYEVVVNAVDKDRIRGYVSTPKDKVLAAEAPQFRLQ
Ga0066710_1048227841 | 3300009012 | Grasslands Soil | EPVQFMVGHDQLRYEMVINAVNKDRIQGYLSTPKDKVLSAEGPTQLKR
Ga0114129_122344041 | 3300009147 | Populus Rhizosphere | VGRDRLRYEIVVNYVDKDRIRGYMSTPKDKLLSAEGPTLRRER*
Ga0114129_124899222 | 3300009147 | Populus Rhizosphere | QFLVGRDELRYEVVVNSVEKDRIRGYVSAPKDKVLSAEGPKFRAQ*
Ga0126373_113025081 | 3300010048 | Tropical Forest Soil | GSDHLRYEFVVNSVDRDHIKGYLSTPKDKILSAEAPRAKLQ*
Ga0127445_10917132 | 3300010085 | Grasslands Soil | LRYELVVNSVDKDRIRGYLSVPKDKVLASEVPVLRQ*
Ga0127463_10662751 | 3300010098 | Grasslands Soil | GHDQLRYEMVINAVNKDRIQGYLSTPKDKVLSAEGPTKQTKR*
Ga0127453_10364591 | 3300010102 | Grasslands Soil | RDQLRYELVVNSVDKDRIRGYLSVPKDKVLASEVPILRQ*
Ga0127458_10452942 | 3300010112 | Grasslands Soil | VGHDQLRYEMVINAVNKDRIQGYLSTPKDKVLSAEGPTKQTKR*
Ga0127482_10846282 | 3300010126 | Grasslands Soil | LVGHDQLRYEMVINAVNKDRIQGYLSTPKDKVLSAEGPTKQTKR*
Ga0127455_10211581 | 3300010132 | Grasslands Soil | DQLRYELVVNSVDKDRIRGYLSVPKDKVLASEVPVLRQ*
Ga0134088_100240501 | 3300010304 | Grasslands Soil | EPVQFMVGHDQLRYEMVINAVNKDRIQGYLSTPKDKVLSAEGPTKQTKR*
Ga0134111_101714751 | 3300010329 | Grasslands Soil | YEFVVNYIDKDRIRGYVSTPKDKVLSAEGPSFRRPQ*
Ga0134111_105228902 | 3300010329 | Grasslands Soil | YEMVINAVNKDRIQGYLSTPKDKVLSAERTSQLKR*
Ga0134071_102954551 | 3300010336 | Grasslands Soil | KLRYEIVVNFVDKDRVRGYLSAPKDKALAAERPQFRP*
Ga0126379_113221681 | 3300010366 | Tropical Forest Soil | QNRVRYELVVNAVEKDRIRGYISTPKDAALNSEGPAAR*
Ga0126379_136315801 | 3300010366 | Tropical Forest Soil | DRLRYEIVVNYVDKDRIRGYMSTPKDKVLSAEAPRR*
Ga0126379_136563341 | 3300010366 | Tropical Forest Soil | EFVVNSVDKDRIRGYLSTPKDRLTVAETPASLRVR*
Ga0134124_116876721 | 3300010397 | Terrestrial Soil | KLRYELVVNFVDKDRIRGYLSAPKDKVLAAERPQFRPD*
Ga0126383_110642893 | 3300010398 | Tropical Forest Soil | DKLRYEVVVNYVDKDRVRGYLSAPKDKALAAERPQIRPE*
Ga0134127_109136751 | 3300010399 | Terrestrial Soil | EVVVNFVDKDRIRGYLSTPKDKALSAERPQFRPE*
Ga0134121_105902501 | 3300010401 | Terrestrial Soil | RDKLRYEVVVNFVDKDRIRGYLSAPKDKVLSAERPQFRPE*
Ga0137422_10528724 | 3300011416 | Soil | GRDRLRYEFVVNYVDKDRIRGYLSTPKDKVLAAEAPPVRRLQ*
Ga0137431_10147253 | 3300012038 | Soil | VGRDELRYEIVVNAVDKDRIRGYVSAPKDKVLSAEAPRLRTQ*
Ga0137327_10468293 | 3300012173 | Soil | RDKLRYEVVVNFVDKDRVRGYVSAPKDKVLSAERPQFRPE*
Ga0137380_101174311 | 3300012206 | Vadose Zone Soil | QFLVGRDKLRYEIVVNYVDKDRIRGYLSTPKDKVLAAERPTFRPE*
Ga0137381_117221792 | 3300012207 | Vadose Zone Soil | QFLVGRDQLRYEVVVNSVEKDHIRGYVSAPKDKVLSAEVPRFRLQ*
Ga0137387_100408205 | 3300012349 | Vadose Zone Soil | EPVQFMVGHDQLRYEMVINAVNKDRIQGYLSTPKDKVLSAEGPTQLKR*
Ga0134042_11835491 | 3300012373 | Grasslands Soil | DQLRYELVVNSVDKDRIRGYVSVPKDKVLASEVPILRQ*
Ga0134044_10889052 | 3300012395 | Grasslands Soil | DQLRYELVVNSVDKDRIRGYLSVPKDKILASEVPILRQ*
Ga0134059_11179251 | 3300012402 | Grasslands Soil | DQLRYELVVNSVDKDRIRGYLSVPKDKVLASEVPILRQ*
Ga0134060_12860311 | 3300012410 | Grasslands Soil | QFMVGHDQLRYEMVINAVNKDRIQGYLSTPKDKVLSAEGPTQLKR*
Ga0157310_100067513 | 3300012916 | Soil | GREKLRYEIVVNVVDKDRIRGYLSSPKDKALSAERPAFK*
Ga0126375_102908092 | 3300012948 | Tropical Forest Soil | FLVGKDELRYEIVVNSVDKDRIRGYVSAPKDKVLSAESPKFRTPSQ*
Ga0126375_113743902 | 3300012948 | Tropical Forest Soil | EVVVNYVDKDRIRGYLSAPKDKVLAAERPQFRPE*
Ga0157374_100826521 | 3300013296 | Miscanthus Rhizosphere | VNQPIAFLVGREKLRYEIVVNVVDKDRIRGYLSAPKDKVLSAERPAFK*
Ga0163162_123482142 | 3300013306 | Switchgrass Rhizosphere | HLRYELVVNSVEKDRIRGYVSAPKDSSLNSEGPTVN*
Ga0134081_103015711 | 3300014150 | Grasslands Soil | QFLVGRDQLRYEVVVNSVDKDRIRGYLSAPKDKVMSAEIPRFRLQ*
Ga0163163_106154283 | 3300014325 | Switchgrass Rhizosphere | TANEPVTFLVGRDRVRYEFVVNYVEKNRIRGYLSAPKDKLTAAETPSLRVR*
Ga0157380_114822762 | 3300014326 | Switchgrass Rhizosphere | LVGRDKLRYEVVVNFVDKDRVRGYLSAPKDKVLSAERPQFRPE*
Ga0132258_110459065 | 3300015371 | Arabidopsis Rhizosphere | TFLVGRDRVRYELVVNYVDKDRVRGYLSTPKDKLIAGDTPTLRRLQ*
Ga0132258_137440901 | 3300015371 | Arabidopsis Rhizosphere | TFLVGRDRLRYEIVVNYVDKDRIRGYMSTPKDKVLSAEAPRR*
Ga0182038_113493281 | 3300016445 | Soil | EPVTFMVGSDKLRYEFVVNYVDRDRIRGYMSTPKDKVLAAETPVRQRLQ
Ga0134069_11588951 | 3300017654 | Grasslands Soil | GRDRLRYEFVVNYVDKDRIRGYVSTPKDKILSAEGPSFRRSQ
Ga0134112_100404052 | 3300017656 | Grasslands Soil | RDQLRYELVVNSVDKDRIRGYLSVPKDKVLASEVPILRQ
Ga0066655_109090583 | 3300018431 | Grasslands Soil | FVVNFVDKDRIRGYMSTPKDKVLAAEGPSTVQHPD
Ga0066667_102463803 | 3300018433 | Grasslands Soil | RYEFVVNYVDKDRIRGYLSTPKDKTLSAEGPNFRRLQ
Ga0066669_113326432 | 3300018482 | Grasslands Soil | DKLRYEIVVNFVDKDRVRGYLSAPKDKALAAERPQFRP
Ga0173482_105678161 | 3300019361 | Soil | RDELRYEVVVNSVDKDRIRGYISTPKDKVLSAEAPRLKGQ
Ga0187892_103474041 | 3300019458 | Bio-Ooze | DKLRYEVVVNFVDKDRVRGYVSAPKDKVLSAERPAFRPQ
Ga0179594_102656331 | 3300020170 | Vadose Zone Soil | FLVGRDKLRYEIVVNYVDKDRIRGYLSAPKDKVLAAERPAFRPE
Ga0210387_118576881 | 3300021405 | Soil | DRVLYEFVVNSVDKDRIRGYLSTPKDKVLAAETPASAQRLQ
Ga0207700_110049251 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MVGSDRVRYEFVVNSVDKDRIRGYLSTPKDKVLAAEKAPAAQRLQ
Ga0207701_102153241 | 3300025930 | Corn, Switchgrass And Miscanthus Rhizosphere | GRDRLRYELVVNYVDKDRIRGYMSTPKDKMLSAEGPTLRRER
Ga0207711_105505521 | 3300025941 | Switchgrass Rhizosphere | QPIAFLVGREKLRYEIVVNVVDKDRIRGYLSAPKDKALSAERPAFK
Ga0207648_119379311 | 3300026089 | Miscanthus Rhizosphere | PLLVGKNHLRYELVVNSVEKDRIRGYVSAPKDSSLNSEGPTVN
Ga0207698_110911061 | 3300026142 | Corn Rhizosphere | TVNQPIAFLVGREKLRYEIVVNVVDKDRIRGYLSAPKDKVLSAERPAFK
Ga0209375_10444563 | 3300026329 | Soil | MFLVGPDRLRYELVVNYVDKDRIRAYLSTPKAPKDKTLAAEGPSFRRLQ
Ga0209159_12121202 | 3300026343 | Soil | DQLRYELVVNSVDKDRIRGYLSVPKDKILASEVPILRQ
Ga0209378_11768341 | 3300026528 | Soil | RYELVVNSVDKDRIRGYVSVPKDKVLASEVPILRQ
Ga0209378_12885571 | 3300026528 | Soil | FLVGRDRLRYEFVVNYIDKDRIRGYVSTPKDKVLSAEGPSFRRPQ
Ga0209058_10602521 | 3300026536 | Soil | GRDQLRYELVVNSVDKDRIRGYLSVPKDKVLASEVPILRQ
Ga0209376_13797992 | 3300026540 | Soil | RYEMVINAVNKDRIQGYLSTPKDKVLSAERTTQVKR
Ga0209161_105572602 | 3300026548 | Soil | MFLVGPDRLRYELVVNYVDKDRIRGYLSTPKAPKDKTLAAEGPSFRRLQ
Ga0209799_10506392 | 3300027654 | Tropical Forest Soil | LRYEVVVNFVDKDRIRGYLSAPKDKALAAERPQFRPE
Ga0209814_100395271 | 3300027873 | Populus Rhizosphere | VVNFVDKDHVQGYLSTPKDKALAAERPQFRPESKP
Ga0209382_117538061 | 3300027909 | Populus Rhizosphere | GRNELRYEVVVNSVEKDRIRGYVSAPKDKVLSAEGPKFRAQ
Ga0315912_100220491 | 3300032157 | Soil | VPLLVGQNHVRYELVVNAVEKDRIRGYVSTPKDVVLSAEGPAPKQ
Ga0315912_100908455 | 3300032157 | Soil | YELVVNYVDKNRIRGYVSTPGGKAIAAEERALRPQ
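
Member sequences from the table above can be compared against the family's representative sequence. The sketch below is illustrative only: it uses three rows copied from the table, and it assumes that a trailing `*` marks a stop codon and is not a residue, which is a common convention that the page itself does not confirm. The helper name `clean_sequence` is hypothetical.

```python
def clean_sequence(seq: str) -> str:
    """Drop a trailing '*' (assumed here to be a stop-codon marker)."""
    return seq.rstrip("*")


# Representative sequence from Basic Information
representative = "RDKLRYEVVVNFVDKDRIRGYLSAPKDKVLSAERPQFRPE"

# Three member sequences copied from the Family Sequences table
members = {
    "Ga0134121_105902501": "RDKLRYEVVVNFVDKDRIRGYLSAPKDKVLSAERPQFRPE*",
    "Ga0066676_105427961": "EQLRYEVVVNSVEKDRIRGYVSAPKDKVLSAEVPRFRLQ*",
    "Ga0066710_1002746561": "LRYEVVVNAVDKDRIRGYVSTPKDKVLAAEAPQFRLQ",
}

# Residue count per member, ignoring the assumed stop marker
lengths = {pid: len(clean_sequence(seq)) for pid, seq in members.items()}

# Members whose residues are identical to the representative sequence
matches = [pid for pid, seq in members.items()
           if clean_sequence(seq) == representative]

print(lengths)
print("Identical to representative:", matches)
```

Stripping the marker before measuring keeps lengths comparable between rows that carry a `*` and rows that do not.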




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice