NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000461

7000000461: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 604812005



Overview

Basic Information
IMG/M Taxon OID7000000461 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053012 | Ga0031346
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 604812005
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size48470086
Sequencing Scaffolds16
Novel Protein Genes22
Associated Families21

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes3
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales2
All Organisms → Viruses → Predicted Viral1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4163
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → Porphyromonas bobii1
All Organisms → cellular organisms → Bacteria1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip21

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F018385Metagenome235Y
F043235Metagenome156N
F066860Metagenome126N
F067846Metagenome125Y
F068942Metagenome124N
F071328Metagenome122N
F080166Metagenome115N
F081455Metagenome114N
F081456Metagenome114N
F081510Metagenome114N
F089057Metagenome109N
F090517Metagenome108N
F092229Metagenome107N
F092232Metagenome107N
F095633Metagenome105N
F099452Metagenome103N
F099453Metagenome103N
F103432Metagenome101N
F103435Metagenome101N
F105379Metagenome100N
F105380Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C2865620All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes555Open in IMG/M
C2887862All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria856Open in IMG/M
C2889186All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae895Open in IMG/M
C2896750All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1302Open in IMG/M
SRS045127_LANL_scaffold_10093All Organisms → Viruses → Predicted Viral3612Open in IMG/M
SRS045127_LANL_scaffold_12994All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41688994Open in IMG/M
SRS045127_LANL_scaffold_14630All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae8202Open in IMG/M
SRS045127_LANL_scaffold_19110All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41640238Open in IMG/M
SRS045127_LANL_scaffold_23478All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes1635Open in IMG/M
SRS045127_LANL_scaffold_29811All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes750Open in IMG/M
SRS045127_LANL_scaffold_31187All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales17849Open in IMG/M
SRS045127_LANL_scaffold_32184All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria694Open in IMG/M
SRS045127_LANL_scaffold_3695All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41675325Open in IMG/M
SRS045127_LANL_scaffold_37032All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → Porphyromonas bobii664Open in IMG/M
SRS045127_LANL_scaffold_38229All Organisms → cellular organisms → Bacteria15892Open in IMG/M
SRS045127_LANL_scaffold_4892All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip221246Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C2865620C2865620__gene_84915F081456FEEPPIYYILISLIFLIVFGAVAFATWLVWLTPISFMAKLIMTAIGFLLCAMTVILYTISAE
C2887862C2887862__gene_93053F066860MTTKKQKLQKQQAIDTWIVIALWVSAIWFSLARGFITGIGGWVLALLGPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFETSKTIAIISGFVVVLSLFVAVTFGIAEDKE
C2889186C2889186__gene_93633F067846MSIIADWERQEFNKWDKQCSKEDDYNRAIEMEIEAIKEDIANGDDDAICAFSEKMFDDDEFLKAVALGTDYEEMRIKILTAMAEDRLEQLEEDYRKGYILND
C2896750C2896750__gene_96533F043235MDEVVPSDEGHLLIDLCDDDPRSLCGGLGIVTRYPEGAIALFIGLAHRDQCDIDRIDTIPKEVWEFMEVTREIVDSLIQVSGAAILVKEVKDGMYMPYHLWAEVPRLGKVQHVEGFHVREALAIIVEGFGEAAGGCHSMAKDQEVPALYCRSHGFEGGRSMASNVLLPGSAH
SRS045127_LANL_scaffold_10093SRS045127_LANL_scaffold_10093__gene_10298F067846MESLQAQWERKTFNDWDKQCSKEDDYNRAIEMEIESIKEDIANNDSDAICAFSEKMFDDDEFLKAVALGTDYEEMRIKILKSLAEERIEQRRKDYEKGFILND
SRS045127_LANL_scaffold_12994SRS045127_LANL_scaffold_12994__gene_13531F103435MKFVFCTEPIYQYYRACVYDADKDKLDKQLLVEYGDYKDIWDLKQQQDALPENIFVAELTSRDYPRNPWNYVSQLINKLTYQYLIDSPDFENIFSEILFNQSEREFYEFYKAIDRFYNGSEIFITVGNDDYSDMVTQMVCSVLRRTYGIRPQIIYDIDDVHSIRDDIDFSPEGAQIAYLQRHTYLALEGKSTVEPLRVWYPFDMNSYTNALE
SRS045127_LANL_scaffold_12994SRS045127_LANL_scaffold_12994__gene_13540F089057MTNIIPIIAKKYNRKGDTSGSLKSLISDLNCIDNVDDSLLFLSSIPRETKYTLDEVFDLITSDDIYIKIFGNVLTFLNMDLDYHRLLLNAIKSESYKIISIINESIPTPDLFLAKNNYECLSVALDKPFVIFDKILGMVVSQLLHTASSKEERIFGIFMTICIINREINKLASLCTGYLAITRDEVLVKDLMNESAMVAFQYMSTEDINNVVSDINSRTVLSRYLSNM
SRS045127_LANL_scaffold_12994SRS045127_LANL_scaffold_12994__gene_13577F099452MGKTYEELLQETLSKIYELKDLDNRDRGKALTIFIGERLNRELLLSSRHIFTLYKDIINLDDVSLLTDLRKTDWYEKWFTDDSNNANLIDLSRFNFKTLARFEKEEYLRNVEGYDFEAVTQVDGYSLFDTLIEDKDVELFKLAAENILISHGFFDNTDYNFYDIPDEYMGDKEVCAYMCLLNIENMDFVDKKTLDTTVLYNIVKDRICGSIYFTLFDSLNKDTRTNAR
SRS045127_LANL_scaffold_14630SRS045127_LANL_scaffold_14630__gene_15510F071328MKWEFILEGEYIVEFVKLCKHLVLERTIDPNKHQAAAIYLRYSSQLLLKKRRAIRRLGVEKEYVSAILRQYGIHYREYGDNEHRVFFLDTGINLYFSKHDQSSIYIIQRIEFSNEEYRSFILKVLPVKKSEW
SRS045127_LANL_scaffold_19110SRS045127_LANL_scaffold_19110__gene_20860F105379MVIHFPLNQSDIESLLSISKLLKCDKILYDRNYINPIIGVGPEKSYFQTTSYMVDLSPHINNLLVNISDLKNLSKITQLEPSGDNPEIAIHKPVVSVFNWDTEYVKACMNSLREYQIDDNIITRTDEFHNTDCYNELMAGSASTGAFRINVGGYMIDIPKSAIPTLKSDHVVATVYNAPNKDFNVLRFKITKRNGIVVNQSMLFLPY
SRS045127_LANL_scaffold_19110SRS045127_LANL_scaffold_19110__gene_20880F092232MNSQSKFIAEYNDRNRPKFNDRFFCKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYAEVQKLLIGEETPSISIKDSDLKLLKVTYYVGCTKDEETFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTATSAKTQSITLKTNSNAVKMLRNFVDLNTTKEESIRLAMFSVYLFDHKVTLFEYYLARFGWYDTLEKFNFQDIIRITDYDIDDPEYYTFAIANSHMKSPFYISAVKSFVDNDRILQSFIASFQRAIMLFATKKTTLDQIYTTQFWIQKLGFNFVSSETSTFTKGNAIIESLENSYDIPTKKRLRLPDEIKADIYSVLKWMACEFSSIRLKNNLDASTKRIRWSEYIAAMYIMLINIKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNHGVYEWNSFTNEEEPNVWDENFSKMLNVYREEKGYTSAIMLAEDAGLELTDTRDPDAVAFDAQLLGQTIAMVARTREFETQLRPALINMEDSCSIYFEEA
SRS045127_LANL_scaffold_23478SRS045127_LANL_scaffold_23478__gene_26455F081510MKLPKINTKAIKAGAKTTYNTAKILGKKYAPIALVTTGLVGYGIAVYQGIKSGKKLEATKAKYEAKDAAGEEYTRMDVVKDVTKDVAVPVAIAVASTAAIGLGFAIQTNRLKAVSAALTAVTEEHARYRLQCKEVLDEETFKKVDTPMNQVTVEEDGKEAQSFVPKEGLMYGNWFKYSANYASDSPEYNEQWIRESIRVLEEKIARKGLLNFSDMLDQLGFDVPKAALPFGWTDTDGFYIEYDIMEVWNAEEQMHEPQIYVRWKCPRNLYATTNFRDLIPGRKELA
SRS045127_LANL_scaffold_29811SRS045127_LANL_scaffold_29811__gene_34772F018385MMAEYENQWGPYKEHSIENDRDPVLDDPIIYGVNVKHFTVTVYSQDGRVNKYWNARILKDDLGYCRIACPRDSKILCFNWVHWTAYMFTHDGLNELVFMPGSSRKTISRLYYEE
SRS045127_LANL_scaffold_31187SRS045127_LANL_scaffold_31187__gene_36588F103432MKLIHSLFSLSLLLALSGLFCTTACQDDAEPTQRAGLISTDSLIHAAKVYDGKAFEHVVSTTATGLRVSEPRRIVPMLPRQLHVEMEGKTIFRRHNLPSVSAYSFQVLAVGDTIYRQKESDAQFNADLDALFHQSIGIAPRLFGVKELSVLGIDSRGKTRDLGNYSYPLLRGVRIYMAFRSREGVFHEHYEAASVDTFSVKDDWLLNTKAEPSLYVPSFRLLVWDQPAEGCTKLRFTLTLVDGRSLVTEVPLY
SRS045127_LANL_scaffold_32184SRS045127_LANL_scaffold_32184__gene_37924F095633MGNYENSTEVGRGEGLTEGELRTMGTLAMEATEELKKTTISKEAVLLGSVPFGSWDEFAKAVQEMAAHKMTAYSYEPIPVKINTKRLIAIAFLDDRGEMSVEEHSVPEEVFIDLSRTRCVVDADRSHKSYKFTCPVLKRYPDGELYPIREAYVISAIDVNGSQEVDF
SRS045127_LANL_scaffold_3695SRS045127_LANL_scaffold_3695__gene_3404F092229MNKEYRFDHIPEVVLRNVKFIRENNIDIGTGDDVLDCMMEINPVLRQRIYDDYDLAKDVAERRFRSTIEELDLATVLQKVTTRPYIAILNNIFFRYFNSKLIDDMFKLGESTKVLDLAIEYECEYYTINSAKTNIRRYMQQAYFDKYAADSNIISSHRVLNDPQVNAVKSAEFTYDLFTAARSEKFNPEMVRDIFLKYGLKTNSSRNLYTRMNNNLSLYYYMEDYLTEYMLKGSFTYGSQVYSTIKEFKCLPLMNVLTQLTRHNPSGYVLDSNLELVKG
SRS045127_LANL_scaffold_3695SRS045127_LANL_scaffold_3695__gene_3426F099453MLRRKDMNRFDIIELAQQTITFVHSAFNGKVNALDPYTRLNFVSGYLDKKTNIARTTPYGCIYVSLEAFADTVEAYRFIDTDQIRNLALEIIIHELTHVDQLIDYRYIKFNNGYREEIERQCVKQSCQWILDNIQFIRSLGLVVIPEVYEERLVGLSDVTYAFKNPAVIAMSKLEHMIGKKFKEFNSNDIEIHYVDRLKNYYKIPVCVNRMYQNSQNLNDLGERLLNDKQYMIEYMEYGNSKLVIKITQGA
SRS045127_LANL_scaffold_3695SRS045127_LANL_scaffold_3695__gene_3428F080166MEVTFNSILKRLGNDVKENFHTEYIVNSGSLTTNVKYRYRMKLSPKGEQTCVYIDWDNYDDLFNVLEESIKICDPENPRTPFKRTYSDKGDLLDIRCNSLQVKYQHLNDRFGNTIDLIPFVLVDEQSGLLTEAIRFRFNNELVYDVPISRLKGFRRFLMTYNPLLHAGAMARYMAMTPLLGSNRQNMMRS
SRS045127_LANL_scaffold_3695SRS045127_LANL_scaffold_3695__gene_3429F081455MDLLKHLNETKLNLKFQQEREKRNAIEDSKALIRHLDPDYAEYELESLNPVEEIDAQNVEIIDAVDAIQQPLENKDAGIAVNFSQMINQPAIQEEVAKVVTSVPEQGEPKIKVVFPQNEWILGNYVDYDSFNKIKESNADIIIRSVRVLNFKMSDPNAVAAFNNFIMKFNPECDPNKRLRYELIRHQGREKDIVVRLSTVVNNVKYYADIYADLNKIDLDHHLISSAKKK
SRS045127_LANL_scaffold_37032SRS045127_LANL_scaffold_37032__gene_45094F090517TLEYKSAKYGPYFVYLSFVSDSIVEVSTFATTDLTVAYDSKDLCSYTMDDRILTLRARDEPYLNVSIDRNSWYLIRKEPSLLVFQANAGNPAAEVTLTLRKKL
SRS045127_LANL_scaffold_38229SRS045127_LANL_scaffold_38229__gene_47220F068942MIRKILSLPTLALCFTLGSTFFAGCNEDYIKNTDIKVRWSNVKNPKYGDAINITLKAEGETFTTVGDYSWISFRSDVSTLDTFTRHDFSESDKDTAYYKDIVIYLTRNKREETTTLKLVAPPNRTQQPKQFEFSVSTIPPSLYIFKVRQPALPAKAQ
SRS045127_LANL_scaffold_4892SRS045127_LANL_scaffold_4892__gene_4675F105380MPILELDASQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSREESYFPNGKRLKLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRNKVPSHNIKKCAVFLGTITKEQENVNTYEEFMEWTKNTKWFK

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.