NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000394

7000000394: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765620695



Overview

Basic Information
IMG/M Taxon OID7000000394 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052939 | Ga0031287
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 1, subject 765620695
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size109490424
Sequencing Scaffolds19
Novel Protein Genes20
Associated Families20

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria2
Not Available4
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria → unclassified Neisseria → Neisseria sp. oral taxon 0141
All Organisms → cellular organisms → Bacteria2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium4
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales3
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip21
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032313Metagenome180N
F033081Metagenome178Y
F040149Metagenome162N
F043990Metagenome155N
F046432Metagenome151Y
F051212Metagenome144N
F054111Metagenome140N
F057446Metagenome136N
F061925Metagenome131N
F061926Metagenome131N
F061927Metagenome131N
F063777Metagenome129N
F064819Metagenome128N
F067846Metagenome125Y
F073671Metagenome120N
F085820Metagenome111N
F097490Metagenome104N
F105376Metagenome100N
F105378Metagenome100N
F105380Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C2669739All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria543Open in IMG/M
C2670557Not Available547Open in IMG/M
C2670699All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria → unclassified Neisseria → Neisseria sp. oral taxon 014547Open in IMG/M
C2696269Not Available674Open in IMG/M
C2698229All Organisms → cellular organisms → Bacteria687Open in IMG/M
C2717490All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium862Open in IMG/M
C2718680All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae877Open in IMG/M
C2723400All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium942Open in IMG/M
C2724346All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales956Open in IMG/M
C2730724All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium1069Open in IMG/M
C2733376All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium1126Open in IMG/M
C2736858All Organisms → cellular organisms → Bacteria1212Open in IMG/M
C2740016All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales1306Open in IMG/M
SRS019607_WUGC_scaffold_13832All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales1465Open in IMG/M
SRS019607_WUGC_scaffold_16586Not Available4304Open in IMG/M
SRS019607_WUGC_scaffold_17433Not Available1161Open in IMG/M
SRS019607_WUGC_scaffold_27724All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip230522Open in IMG/M
SRS019607_WUGC_scaffold_30703All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1540Open in IMG/M
SRS019607_WUGC_scaffold_57356All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae11791Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C2669739C2669739__gene_134528F046432MWEMTENELSGIISKYQMPEGRYSVEQEGSFGESEFFWVIKNQLTNQKYLLMNTYSHHGVEDEVEYYREEGFDNLEAIPRRIETLENASDADDEISKYLFGMYSIF
C2670557C2670557__gene_134856F105376MGKIMNEKPEVSAKEFGALQAKVEYIKDGVDKHTVMLERIENIARANVTQAQLAQHEKESEEKYVKRTEIEGVMNFWSLVTSNVAKLFAIALVGLAIYATNNLIQQNKAVTELQEEVQQSQVRRK
C2670699C2670699__gene_134916F061927MKNPIFILSVILILGACSSESAKKAYNDSFRKTFIEEGVKSCIENSGLKESEAREYCECAMNKINENLSNDEIIDISMDNPPKDLDERIDKAISSCVENK
C2696269C2696269__gene_145223F085820FFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVVAAREEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSKKGKK
C2698229C2698229__gene_146047F061925MSWEYSINLDSEESVSSVVADLKICKLFSSSTTDYIDWKNSEPIDDIPYDARFYTDKKKTIYIAINSFSKNIFSALKSILAKHDYHITDCDTDEEVTLEHIFRSVI
C2717490C2717490__gene_154404F061926MIKTAKHIKTFLASVLLLIFVMNVSGLFVRLHHQETHQKTEKIAECSDKVCYHKAHLQTKSDCDCGFLCTLNYFYILPEKPQTEIHVNEYFSYFSLYKIFVSERIILLWQSRAPPVLS
C2718680C2718680__gene_154975F040149VADEGAEELRWEVLIEEQGIPVLFVEVVAWYDGRVSSSEILRSVGIALEREPRLAPVWSHDSEDAIHDFIYDISVPKGHTLTAVRERETVVAQLLNIHRYVYYP
C2723400C2723400__gene_157128F064819MILKDNQGKDRIKFSDIVRLNKEEPVFVKMKMRISTETMVSEENLDIERSDLEDLTRNLKDLSECKIRKFFFQNIDETIEIVFSINDIGTIAVEGKMYDESYMNSINFSFQTDLNGIAIFSKEISQELEKYK
C2724346C2724346__gene_157569F057446NSISFGKTTITSYPEYFEITDNKKTSKLLYLSASLVFIAIYLFDLYQNDFDFGKVAHFKTISAVLWLVIFALQFWLINTESKIEKSKIKEVVVRKNRWTSIVIHYGDKKRKIDGFSQDEAEQIIKFLMNNR
C2730724C2730724__gene_160570F043990MIKKLGIIFTFGVIILGIVVYANHKIERSAIEREFGVNMSNMNIDEKYRKEEWAPNGDGEKTIILTYDQLDSSFMKLNKLPIKEDLPPNGIPKQFLNIANGYYKYVVDENDDRSFDILIVDTTRKEICIYYQIL
C2733376C2733376__gene_161831F097490LLCIISCSERKEIEVYNMKIDENKKEVLVEIRNNTENNYYLLSPIVSIMTKHLQYIDGEMIEGQIHHKKLDSIVCSVCIWDDICKEEYYAMRDIVLLPKKSVKKIKYKYDNEEYIEIETVHIGFPYNGYYNEIGKKMQFMLKKKLDSSNIIKGYEFYNKDIETMTIKM
C2736858C2736858__gene_163632F054111MNKSLESITHEEFLKLMEHLKNLQEFTFLEYIIAPEADIFYFNFMEKTVKIKWDLDYGLFLETEELSTADRDLFLNILDKEILFLI
C2740016C2740016__gene_165219F063777LKVLNIKISALPPENNSFWMKLLFYLCIPFLLIFGILLLIGWGIYSGISSIISAVKEDFFGIKDKTNIKTSKNILFENEQFKLKKEDYLPDENSQEYKIFDDFCAKSNEYLDDGYIFYKLTDEKSATDLNGAIISEFQEDIGNYILLQNLILENNQLKNQLISFNKNTGKITVLADIKDFFWLDFDSETKTINGYNNKEQIEIAISE
SRS019607_WUGC_scaffold_13832SRS019607_WUGC_scaffold_13832__gene_16777F051212MKKFFFIFVLYWLYSCNGTEKAMATSPDTQKTSISEKQNAEKIERIIYSETGGDTDGKNVHLVITKDSIIYRLTEGVTDERTIADLSLNNNNKDWEAFIDKIDLEDFEKGKPSEELIMDLPTIKIIIKTDKKEYSKTDIQANKTWDYITKQIIDIKYSQLYNHLNLEK
SRS019607_WUGC_scaffold_16586SRS019607_WUGC_scaffold_16586__gene_20107F105378MNVKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIITDSTHQFVTDEEKNKWNNKLNAPVPMQDHLANNQIGYDSVNSKFYIGLNNQNVLLGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTADAISIYNTGSFTGSFQCLIVYPLGSVNE
SRS019607_WUGC_scaffold_17433SRS019607_WUGC_scaffold_17433__gene_21085F032313MACDNNTPQEKPHEQEKHEVPVPVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMLTSKISKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWITTTEFTCRNGVIVLHRQGIDVNHVDTVNYVYDEVGNEIVLEGTGIRWFVLRLNKNAVEFLQRGRTMWGPYDWYYGRNSGRSEVTLEAK
SRS019607_WUGC_scaffold_27724SRS019607_WUGC_scaffold_27724__gene_33560F105380MSILQLDASQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSYAEDYFPNGDRLTLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRSKVPSHNIKKCAVFLGTITKEQENVNTYEEFMEWTKNTKWFK
SRS019607_WUGC_scaffold_30703SRS019607_WUGC_scaffold_30703__gene_37450F033081MAWLFRRAMPQDTRPTFVWSRLVTEIENAGYFSRRKSSILAVGLIIMTIAMIKMLLFVPGLNQSVVSLLTRGLETFLPTGWATATAWIVGATGVFLMGNFTSYTKKQKSLQSLKATGCGMYNTLLLFALLEEQAFRSGSERWNWRERVRASVCFGLLHVTNIWYSFAAGIALSVTGFGFLMVYLWYYRKYRSQIIATAAAATVHALYNVIALSLIAVVLAIDIAKLL
SRS019607_WUGC_scaffold_57356SRS019607_WUGC_scaffold_57356__gene_80981F073671MNKEQAEHELAELHEKERSLEKALELVREKIRELVNYTDKNKVYK
SRS019607_WUGC_scaffold_57356SRS019607_WUGC_scaffold_57356__gene_80987F067846MSIIAEWERKTFNDYDKQCSKEDDYNRAVEMEIEAIKEDIANGDDDAICAFSEKMLDYDFLKAVILGTDYEEMRIKILTAMAEDRLEQLEKDYRNGYILND

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.