NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F097874

Metagenome / Metatranscriptome Family F097874

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F097874
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 38 residues
Representative Sequence VRRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFKN
Number of Associated Samples 90
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 4.81 %
% of genes near scaffold ends (potentially truncated) 95.19 %
% of genes from short scaffolds (< 2000 bps) 86.54 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction Yes
3D model pTM-score0.51

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (71.154 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(13.462 % of family members)
Environment Ontology (ENVO) Unclassified
(25.962 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.962 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 46.15%    β-sheet: 0.00%    Coil/Unstructured: 53.85%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035VRRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFKNCytopl.Extracel.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.51
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
71.2%28.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Bog
Peatland
Freshwater Sediment
Watersheds
Soil
Vadose Zone Soil
Tropical Forest Soil
Surface Soil
Peatlands Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Soil
Tropical Peatland
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Agricultural Soil
Palsa
Bog
Peat Soil
Corn Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
4.8%2.9%6.7%4.8%7.7%4.8%5.8%13.5%4.8%6.7%3.8%5.8%3.8%2.9%2.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12270J11330_1029029323300000567Peatlands SoilGPFVKRLGPGAMGAINRIFGFLILAIAVQLVWNGAADFRV*
JGIcombinedJ26739_10149126923300002245Forest SoilLVRRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFGN*
Ga0070661_10022571713300005344Corn RhizospherePLVQWLGPNAAGSINRIFGFLILAVAVQLIWDGISEFRT*
Ga0070703_1040900013300005406Corn, Switchgrass And Miscanthus RhizosphereLGGPLVARLGPGAVAGITRIFGFLIFAVAVQLVWDGAAKFGK*
Ga0070734_1006797733300005533Surface SoilLGPSAVGAINKIFGFLILAIAVQLVWDGVADFNS*
Ga0070734_1017147513300005533Surface SoilGGPLVERLGPNATGSINRIFGFLILAVAVQLIWDGISDFRT*
Ga0070762_1033492923300005602SoilLGPSAVGAINKIFGFLILAIAVQLVWNGMADFRN*
Ga0070764_1068133213300005712SoilVKRLGPGTVGAINRIFGFLILAIAVQLLWDGFAQFGR*
Ga0070766_1057659813300005921SoilGPFVKRLGPGALGAINRIFGFLILAIAVQLVWNGAADFRG*
Ga0075017_10084708323300006059WatershedsLVARLGPGAVAGITRIFGFLIFAVAVQLVWDGAADFGH*
Ga0075017_10108987813300006059WatershedsVKRLGPGAVGAINRIFGFLILAIAVQLVWNGAADFRG*
Ga0075019_1008088013300006086WatershedsLGPSAVGAINKIFGFLILAIAVQLVWNGIADFKN*
Ga0075015_10037201823300006102WatershedsLGGPLVERLGPGAVAGITRIFGFLIFAVAVQLVWDGVAEFGR*
Ga0070716_10150032913300006173Corn, Switchgrass And Miscanthus RhizospherePLVARLGPGAVAGITRIFGFLIFAVAVQLVWDGAAKFGK*
Ga0070712_10104284323300006175Corn, Switchgrass And Miscanthus RhizosphereGPFVNLLGPSAMGAINRIFGFLILSIAVQLVWNGAADFRH*
Ga0070765_10006233413300006176SoilPLVRRLGPGAVGAINKIFGFLILAIAVQLMWDGAADFGH*
Ga0105245_1098563423300009098Miscanthus RhizosphereRLGGPLVAKLGPGMVAGITRIFGFLIFAVAVQLVWDGIAEFGK*
Ga0116108_102063833300009519PeatlandGGPFVKRLGPGVMGAINRIFGFLILAIAVQLVWNGAADFRG*
Ga0116214_136996523300009520Peatlands SoilGGPPVRRLGPSAVGAINKIFGFLILAIAVQLVWNGVADFRN*
Ga0116219_1008906263300009824Peatlands SoilLVRRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFNS*
Ga0126373_1045462613300010048Tropical Forest SoilVRKLGPTALGAINKICGFLILAIAVQLVCDGIADFKS*
Ga0126379_1147624523300010366Tropical Forest SoilLGPGATGAINRIFGFLILAIAVQLIWDGMVDFKA*
Ga0105239_1217278823300010375Corn RhizosphereVAKLGPGAVAGITRIFGFLIFAVAVQLVWDGVADFR*
Ga0126381_10236325113300010376Tropical Forest SoilRLGPSAVGAISRIFGFLILAIAVQLVWDGVAEFKMGQ*
Ga0136449_10015628713300010379Peatlands SoilLGPGAVGAINKIFGFLILAIAVQLVWNGVADFKN*
Ga0136449_10172831623300010379Peatlands SoilGQGAVGSINRIFGFLILAIAVQLVTTGLTDLHVIS*
Ga0136449_10209095523300010379Peatlands SoilLGPNAVGAINKIFGFLILAIAVQLVWDGVADFNS*
Ga0150983_1305393323300011120Forest SoilRLGPSAVGAISRIFGFLILAIAVQLVWDGVAEFK*
Ga0137391_1154130423300011270Vadose Zone SoilYLFLRLGGPLVAKLGPGAVAGITRIFGFLIFAVAVQLVWDGVADFRQ*
Ga0137362_1033635123300012205Vadose Zone SoilVARLGPGAVAGITRIFGFLIFAVAVQLVWDGAAEFGK*
Ga0137362_1100981923300012205Vadose Zone SoilYLFLRLGGPLVAKLGPGAVAGITRIFGFLIFAVAVQLVWDGVADFRS*
Ga0137395_1099797023300012917Vadose Zone SoilLGPGAVAGITRIFGFLIFAVAVQLVWDGVAEFGK*
Ga0137416_1116056723300012927Vadose Zone SoilLVGRLGPSAAGAISRIFGFLILAIAVQLIWDGVADFRG*
Ga0126375_1086782013300012948Tropical Forest SoilKLGPSAVGAINKIFGFLILAIAVQLVWDGVADFKS*
Ga0126369_1176417123300012971Tropical Forest SoilLGPSAVGAINKIFGFLILAIAVQLVSDGVADFLT*
Ga0164305_1047706913300012989SoilPLVAKLGPGAVAGITRIFGFLIFAVAVQLVWDGVADFGK*
Ga0157378_1009918733300013297Miscanthus RhizosphereGPLVAKLGPGMVAGITRIFGFLIFAVAVQLVWDGVAEFGK*
Ga0181530_1006430613300014159BogGPFVKRLGPGVMGAINRIFGFLILAIAVQLVWNGAADFRG*
Ga0181523_1002399913300014165BogLVRRLGPSAVGAINKIFGFLILAIAVQLVWNGAADFRN*
Ga0181523_1036135423300014165BogVRRLGPSAVGAINKIFGFLILAIAVQLVWNGAADFRN*
Ga0181531_1005152933300014169BogVRRLGPTAVGAINKVFGFLILAIAVQLVWDGVADFNS*
Ga0181531_1058956323300014169BogKRLGPGAVGAISRIFGFLILSIAVQLVWDGVADFKG*
Ga0137405_120681113300015053Vadose Zone SoilAQPQRSVYLFLRLGGPLVAKLGPGAVAGITRIFGFLIFAVAVQLVWDGVADFRS*
Ga0182033_1134616913300016319SoilVRKLGPSAVGAINKIFGFLILAIAVQLVWNGIAEFKN
Ga0182037_1217355313300016404SoilKLGPSAVGAINKIFGFLILAIAVQLVSDGIADFRN
Ga0187824_1006734823300017927Freshwater SedimentLRLGGPLVAKLGPGAVAGITRIFGFLIFAVAVQLVWDGAADFGR
Ga0187801_1007796213300017933Freshwater SedimentLVRRLGPGAVGAINKIFGFLILAIAVQLVWNGIADFNS
Ga0187778_1018120013300017961Tropical PeatlandRLGPASVGAINKIFGFLILAIAVQLVWNGVADFNS
Ga0187776_1099338423300017966Tropical PeatlandVKRLGPGASGAINRIFGFLLLAIAVQLVWDGVADFKA
Ga0187781_1090660923300017972Tropical PeatlandPFVKRLGPGAVGAINRIFGFLILAIAVQLLWNGAADFRR
Ga0187781_1128040113300017972Tropical PeatlandRLGPSAVGAINKIFGFLILAIAVQLVWNGVADFRN
Ga0187782_1130139923300017975Tropical PeatlandVRRLGPSAVGAINKIFGFLILAIAVQLVWDGIADFNG
Ga0187805_1001290663300018007Freshwater SedimentRLGPSAVGAINKIFGFLILAIAVQLVWNGMADFRN
Ga0187858_1026375613300018057PeatlandRLGPGAVGAINKIFGFLILAIAVQLVWDGVADFNR
Ga0187772_1048980013300018085Tropical PeatlandKRLGPGAVGAINRIFGFLILAIAVQLVWDGVADFRG
Ga0187770_1154221923300018090Tropical PeatlandVNRLGPGTVGAINRIFGFLILAIAVQLVWNGVADFRS
Ga0210407_1018843343300020579SoilVERLGPCATGAINRIFGFLILAIAVQLLWDGVADFK
Ga0210403_1010360213300020580SoilLVRRLGPSAVGAINKIFGFLILAIAVQLVWNGVADFRN
Ga0210401_1022971923300020583SoilVGRLGPSAVGAISRIFGFLILAIAVQLIWDGVADFKG
Ga0210404_1055221813300021088SoilYLFLRLGGPLVARLGPGAVAGITRIFGFLIFAVAVQLVWDGAAKFGK
Ga0210406_10000871303300021168SoilLVERLGPCATGAINRIFGFLILAIAVQLLWDGVADFK
Ga0210405_1066768113300021171SoilFLRLGGPLVARMGPGAVAGITRIFGFLIFAVAVQLVWDGVADFRQ
Ga0210393_1063302023300021401SoilVLVKRLGPGAAVAIDRIFGFLLLAIAVQLVWNGGP
Ga0210393_1100931323300021401SoilLVRRLGPSAVGAINKIFGFLILAIAVQLVWNGMADFRN
Ga0210385_1044010513300021402SoilVRRLGPSAVGAINKIFGFLILAIAVQLVWNGMADFHN
Ga0210386_1077388123300021406SoilGPWVRRLGPNAVGAINKIFGFLILAIAVQLVWDGVADFHS
Ga0210402_1043090623300021478SoilRLGPGAAGGINRIFGFLLLAIAVQLVWDGVADFPH
Ga0210409_1112246813300021559SoilVRRLGPSAVGAINKIFGFLILAIAVQLVWNGVADFKS
Ga0242662_1027702213300022533SoilRRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFK
Ga0242661_100693553300022717SoilLVERLGPCTTGAITRIFGFLILAIAVQLVWDGVAYFK
Ga0224564_105281933300024271SoilVERLGTCTTGAINRIFGFLILAIAVQLVWDGVAYFK
Ga0207653_1028714123300025885Corn, Switchgrass And Miscanthus RhizosphereLRLGGPLVARLGPGAVAGITRIFGFLIFAVAVQLVWDGAAKFGK
Ga0207654_1131719733300025911Corn RhizosphereGGPLVARLGPSAAGAINRIFGFLILSVAVQLLWDGLAEFQFGR
Ga0207694_1029776423300025924Corn RhizosphereLFLRLGGPLVAKLGPGMVAGITRIFGFLIFAVAVQLVWDGVAEFGK
Ga0207664_1052386523300025929Agricultural SoilVDRLGPGTVGAINRIFGFLILAIAVQLVWDGVAEFK
Ga0207670_1139986213300025936Switchgrass RhizosphereRLGGPLVAKLGPGAVAGITRIFGFLIFAVAVQLVWDGVADFGK
Ga0207640_1023436723300025981Corn RhizosphereARLGPNTVGAINRIFGFLILAIAVQLVWDGIADFRG
Ga0179587_1016403313300026557Vadose Zone SoilLVRRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFKG
Ga0179587_1078886913300026557Vadose Zone SoilPLVGRLGPSAAGAISRIFGFLILAIAVQLIWDGVADFRG
Ga0207777_104601723300027330Tropical Forest SoilLVRRLGPSAVGAINKIFGFLILAIAVQLVWNGVADFKN
Ga0209008_115215823300027545Forest SoilRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFNS
Ga0209624_1000616643300027895Forest SoilLWLGEPQARRLGPHAVGVINKIFGFLILAIAVQLVWNGIADFKN
Ga0209067_1009582513300027898WatershedsRLGPGAVGAINRIFGFLILAIAVQLVWNGVADFKG
Ga0209067_1027903313300027898WatershedsPGAVGAINRIFGFLILAIAVQLIWNGAMDFRGGMPS
Ga0209698_1116963823300027911WatershedsVKRLGPGAVGAINRIFGFLILAIAVQLVWNGAAAFHT
Ga0302147_1031168013300028566BogRLGPGAVGAINRIFGFLILAIAVQLVWNGMADFHR
Ga0308309_1065591413300028906SoilRRLGPGAVGAINKIFGFLILAIAVQLMWDGAADFGH
Ga0308309_1145379713300028906SoilRRLGPSAVGAINKIFGFLILAIAVQLVWNGMADFHN
Ga0302188_1040759013300029986BogLVRRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFKS
Ga0302271_1037701723300029998BogRLGPGAIGAINRIFGFLILAIAVQLVWDGVVDFRA
Ga0311353_1068273723300030399PalsaVRRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFKN
Ga0265765_107092813300030879SoilRWLGPNAVGAINKIFGFLILAIAVQLVWDGVADFGS
Ga0302180_1014641733300031028PalsaRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFKG
Ga0170834_11025616323300031057Forest SoilFWMGGPLVGRLGPSAVGAINKIFCFLILAIAVQLVWDGVADFKT
Ga0170824_12705634923300031231Forest SoilFWMGGPLVGRLGPSAVGAINKIFCFLILAIAVQLVWDGVADFRS
Ga0310686_10217258513300031708SoilLVRRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFRT
Ga0310686_11111935243300031708SoilVRRLGPNAVGAINKIFGFLILAIAVQLVWDGVADFHS
Ga0307475_1007731033300031754Hardwood Forest SoilVKRLGPGATGAINRIFGFLILALAVQLVWDGAAQFSR
Ga0307478_1086664223300031823Hardwood Forest SoilMVRRLGPSAVGAINKIFGFLILAIAVQLVWDGVADFGN
Ga0307478_1180753713300031823Hardwood Forest SoilRLGPNAVGAINKIFGFLILAIAVQLVWDGVADFKS
Ga0307471_10246769223300032180Hardwood Forest SoilQLGPSAVGAISRIFGFLILAFAVQLIWDGVADFRV
Ga0307472_10071563313300032205Hardwood Forest SoilFVNLLGPSAMGAINRIFGFLILSIAVQLVWNGAADFQH
Ga0335071_1133596833300032897SoilGPTATGAINRIFGFLILSVAVQLIWDGLAEFQFVK
Ga0326724_0006167_14533_146403300034091Peat SoilRLGPGAVGAIARIFGFLILAIAVQLVWDGVADFRG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.