NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 7000000063

7000000063: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 763577454



Overview

Basic Information
IMG/M Taxon OID: 7000000063
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052535 | Ga0031245
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 763577454
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 108359052
Sequencing Scaffolds: 13
Novel Protein Genes: 13
Associated Families: 12

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 6
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides → Bacteroides thetaiotaomicron | 1
Not Available | 3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1
All Organisms → Viruses → Predicted Viral | 2

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F033081 | Metagenome | 178 | Y
F046432 | Metagenome | 151 | Y
F051211 | Metagenome | 144 | N
F066860 | Metagenome | 126 | N
F077404 | Metagenome | 117 | N
F081455 | Metagenome | 114 | N
F089057 | Metagenome | 109 | N
F092232 | Metagenome | 107 | N
F095633 | Metagenome | 105 | N
F099452 | Metagenome | 103 | N
F103436 | Metagenome | 101 | Y
F105378 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
C3389430 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 586
C3394786 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides → Bacteroides thetaiotaomicron | 611
C3407449 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 680
C3409461 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 693
C3411058 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 703
C3418291 | Not Available | 756
C3419891 | Not Available | 770
SRS015057_WUGC_scaffold_18883 | Not Available | 4931
SRS015057_WUGC_scaffold_24384 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 5149
SRS015057_WUGC_scaffold_29464 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 2060
SRS015057_WUGC_scaffold_69869 | All Organisms → Viruses → Predicted Viral | 2261
SRS015057_WUGC_scaffold_70514 | All Organisms → Viruses → Predicted Viral | 1731
SRS015057_WUGC_scaffold_75129 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 840

Sequences

Scaffold ID | Protein ID | Family | Sequence
C3389430 | C3389430__gene_161105 | F033081 | VRKRTLTSRPTFVWSRLVTEIENAGYFSRWKFSILAVGLIIMTIATIKMLLFVPGLNQSAVSLLTRGLETFLPTRWATATAWTVGMAGVFLMGDLTNYTPSQKILHKIKATRYEVYNIILFLALLEEQAFRSGSEKWNWRERVRASVCFGLLHIANIWYSFAAGIALSATGFGFLLVYLWYYRKYRIQIIATAAA
C3394786 | C3394786__gene_162937 | F077404 | GKSVLHRNAIYIPPALEQYADTALLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTHNDRIELSNFIQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRKLILTDGHSTLYLYFLMIDS
C3407449 | C3407449__gene_167434 | F095633 | MKNYENSTEVGRREGLTEGELRTMGALAVEATEEFRKTIVRKEAVLLGSVPFGSWDEFAKAVQEMTAYSYEPIPVKINTKRLIATAFLDDGGEMSLEEHSVPEEVFIDLSRTRCVVDADRSHKSYEFTCPVLKKFPDGELYPIREAYVISAIDVNGSQEVDFKII
C3409461 | C3409461__gene_168124 | F095633 | MGNYENSTEAWRREGLTEDELRTMGTLAMEATEKLKKTIIRKETVLLGSVPFGSWDEFAKAVQEMAAHSYEPIPVEINTKRLIAKAFLDDRGEMSVEENFVPEDVFIDLSRTRCDAEEDRNRKSYEFTCPALERYPDGELCPTRKAYVISAIDVNGSQEVDFNIIYGGLN
C3411058 | C3411058__gene_168678 | F046432 | MTESKLSNIISKYQLPMDDYSVEVDGAFGRGEFFWVIKNQSTNQKYLLVNTYSHHGVESEIECYREGGFDNLEAIPRRIETLENASDADNEIFKYLFGLYSIFEMKS
C3418291 | C3418291__gene_171221 | F081455 | METTNTENKNLFQQLARLGFNVNESLLELEKEYAPVEEIGMQNVDIIDAAEEAIQQPLANTDSSIAVNFSQMINKPEVEEVKTEVASVPDNGETKVNVFFPKNEHILSNYVDYDSFNKIKESNTETIVRAVRLLNYKMSDQNAAMKFGQFVSEFNSECDPNKRLRYELIRHQGREKDLVVRLSTVVNGTTKYYADIYPDLNKIDIDHHLISSARK
C3419891 | C3419891__gene_171774 | F089057 | MTNIIPLIAKKYNRKGDTSGTLKSLVDDLVFIEDANDSLLFITNIPRETKYSIEEVFNIITSNDKYSEVLSNVLSSLNIDLDYHKLLLNAIDSESYKIISLISDNIPTPDLFLSKNNYGCLTAALGKSYTIFDKVLGMVISQLLHTSSKEDKILSLFMTICIVNKDIDKLASLCTGYLAITKDEVLVKDLMNESATMAFQY
SRS015057_WUGC_scaffold_18883 | SRS015057_WUGC_scaffold_18883__gene_21865 | F105378 | MNVKVYVDKIKKWVQISSDEVLDINKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIITDSTHQFVTDEEKSKWNNKLNAPVPIQDHLENNQIGYDSANSKFYIGLNNQNVLLGGSSCFDNIVVVNGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTADAISIYNTGSFTGSFQCLIVYPLGSVNE
SRS015057_WUGC_scaffold_24384 | SRS015057_WUGC_scaffold_24384__gene_28126 | F051211 | MRVKKAIKVFEKIRDLPYGTSGSDEVWSCYQKCVLLKQELQHIGITSQLLIGVFDWQDLQIPEHILKIRRQQYERHVILRVFIDEFAYDVDPSIDIGLTPTLPMACWDGKSSTTTMVPLRCLRVYRPHSLHERILSQLRRKIFRSNPESFYTAIDIWLATIRVS
SRS015057_WUGC_scaffold_29464 | SRS015057_WUGC_scaffold_29464__gene_34753 | F103436 | MITTSKDGWCGKSDAEVLNSLRAWVLKCDLKYSKREALKKIDSAFALWGGRQYVAAVDLLDENEVYFSKEDWPYYALGIEILKARKYTYFY
SRS015057_WUGC_scaffold_69869 | SRS015057_WUGC_scaffold_69869__gene_91075 | F099452 | MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSVNVFTLYKDIINLDDVSLLADLRHTEWYKDWFTSDKRNSDLIDLSRFNFRVLERFEKEEYLKDAEHYDFEGVSEVDSYDLFDTLREDKDIELFKLAAENILINHGFFNNTDYNLYEIPDEYMSNQEVCLYMCLLNTDNLDFMDKKTFDSTLLYNIVKDRICGSVYFTIFDSLNEDTRTRAR
SRS015057_WUGC_scaffold_70514 | SRS015057_WUGC_scaffold_70514__gene_92308 | F092232 | MNTQAKFIADYNDKNRPKFNDIFFTKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYTEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEETFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAAAAKTQSITLKTNSNAVKMLRNFVDLNTTKEETLRMAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFEDVIKLSDHDIDDPEYYTFAIANSHMKSPFYISAVKSFVDNDRILQSFIASFTKSIASYATKKTTMDQLYTTEFWVCKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDDNFSKMLNIYREEKGYTSAIMLADDAGLELTDNRDPEAVAFDAQLLGQTIAKVA
SRS015057_WUGC_scaffold_75129 | SRS015057_WUGC_scaffold_75129__gene_101946 | F066860 | MTTKKQKLQKQQAIDTWIVIALWVSTIWFSLARGFITGIGGWVLALLGPWALIVSCICLAIISRQMKKRHASKNHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFETSKTIAIISGFVVVLSLFVAVTFGIAEDRE
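
In the rows above, each protein ID appears to be the scaffold ID followed by `__gene_` and a number. The sketch below relies on that observed pattern (an assumption drawn from this page, not a documented NMPFamsDB rule) to split a protein ID and write one record in FASTA format. The names `ProteinRecord`, `split_protein_id`, and `to_fasta`, and the header layout, are hypothetical and chosen only for illustration.

```python
from typing import NamedTuple


class ProteinRecord(NamedTuple):
    """One row of the Sequences table (hypothetical container)."""
    protein_id: str
    family: str
    sequence: str


def split_protein_id(protein_id: str) -> tuple[str, int]:
    """Split '<scaffold>__gene_<n>' into (scaffold, n).

    Assumes the naming pattern seen on this page; raises ValueError otherwise.
    """
    scaffold_id, sep, gene_number = protein_id.rpartition("__gene_")
    if not sep or not scaffold_id or not gene_number.isdigit():
        raise ValueError(f"unexpected protein ID format: {protein_id!r}")
    return scaffold_id, int(gene_number)


def to_fasta(record: ProteinRecord, width: int = 60) -> str:
    """Render a record as FASTA text, wrapping the sequence at `width` residues."""
    scaffold_id, _ = split_protein_id(record.protein_id)
    # The header fields after the ID are an illustrative choice, not a standard.
    header = f">{record.protein_id} family={record.family} scaffold={scaffold_id}"
    chunks = [record.sequence[i:i + width]
              for i in range(0, len(record.sequence), width)]
    return "\n".join([header, *chunks])


# Example row copied from the table above (family F103436).
example = ProteinRecord(
    protein_id="SRS015057_WUGC_scaffold_29464__gene_34753",
    family="F103436",
    sequence=("MITTSKDGWCGKSDAEVLNSLRAWVLKCDLKYSKREALKKIDSAFALWGGRQYVAAVDLL"
              "DENEVYFSKEDWPYYALGIEILKARKYTYFY"),
)
fasta_text = to_fasta(example)
```

Printing `fasta_text` gives a header line followed by the sequence wrapped at 60 residues per line; the wrap width is a common convention rather than a requirement.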

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice