NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300006498

3300006498: Human supragingival plaque microbial communities from NIH, USA - visit 1, subject 159591683



Overview

Basic Information
IMG/M Taxon OID: 3300006498
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052511 | Ga0100374
Sample Name: Human supragingival plaque microbial communities from NIH, USA - visit 1, subject 159591683
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 151863366
Sequencing Scaffolds: 16
Novel Protein Genes: 27
Associated Families: 27

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Epsilonproteobacteria → Campylobacterales → Campylobacteraceae → Campylobacter | 1
All Organisms → cellular organisms → Bacteria | 3
Not Available | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cardiobacteriales → Cardiobacteriaceae → Cardiobacterium → Cardiobacterium valvarum → Cardiobacterium valvarum F0432 | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces → Actinomyces timonensis | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Supragingival Plaque → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F018385 | Metagenome | 235 | Y
F022002 | Metagenome | 216 | Y
F027205 | Metagenome | 195 | N
F030786 | Metagenome | 184 | N
F036281 | Metagenome | 170 | N
F040685 | Metagenome | 161 | N
F043991 | Metagenome | 155 | N
F047127 | Metagenome | 150 | N
F049707 | Metagenome | 146 | N
F051211 | Metagenome | 144 | N
F051212 | Metagenome | 144 | N
F051214 | Metagenome | 144 | N
F061926 | Metagenome | 131 | N
F061927 | Metagenome | 131 | N
F063778 | Metagenome | 129 | Y
F071327 | Metagenome | 122 | N
F074985 | Metagenome | 119 | N
F078842 | Metagenome | 116 | N
F081454 | Metagenome | 114 | N
F081456 | Metagenome | 114 | N
F084362 | Metagenome | 112 | N
F085821 | Metagenome | 111 | N
F095630 | Metagenome | 105 | N
F095632 | Metagenome | 105 | N
F097526 | Metagenome | 104 | Y
F101358 | Metagenome | 102 | Y
F103430 | Metagenome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0100374_100013 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 223347
Ga0100374_100019 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Epsilonproteobacteria → Campylobacterales → Campylobacteraceae → Campylobacter | 208104
Ga0100374_100399 | All Organisms → cellular organisms → Bacteria | 43772
Ga0100374_100408 | All Organisms → cellular organisms → Bacteria | 42730
Ga0100374_100443 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 40315
Ga0100374_100623 | Not Available | 31051
Ga0100374_101183 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 17801
Ga0100374_102127 | All Organisms → cellular organisms → Bacteria | 10682
Ga0100374_102419 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cardiobacteriales → Cardiobacteriaceae → Cardiobacterium → Cardiobacterium valvarum → Cardiobacterium valvarum F0432 | 9479
Ga0100374_103258 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 7183
Ga0100374_103573 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 6590
Ga0100374_103866 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces → Actinomyces timonensis | 6131
Ga0100374_105412 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 4457
Ga0100374_106444 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 3754
Ga0100374_111251 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 2120
Ga0100374_122400 | Not Available | 998
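
The scaffold listing above can be cross-checked against the Dataset Phylogeny counts with a short script. The following is a minimal Python sketch, not part of NMPFamsDB: the names `SCAFFOLDS`, `tally_by_taxon` and `total_length` are hypothetical, the rows are transcribed by hand from the table, and only the most specific (last) element of each lineage is kept as the grouping key, which is an assumption that happens to reproduce the groups for this sample.

```python
from collections import Counter

# (scaffold ID, last element of the lineage, length as listed in the table).
# Transcribed by hand from the Associated Scaffolds table; illustrative only.
SCAFFOLDS = [
    ("Ga0100374_100013", "Candidatus Saccharibacteria", 223347),
    ("Ga0100374_100019", "Campylobacter", 208104),
    ("Ga0100374_100399", "Bacteria", 43772),
    ("Ga0100374_100408", "Bacteria", 42730),
    ("Ga0100374_100443", "Candidatus Saccharibacteria", 40315),
    ("Ga0100374_100623", "Not Available", 31051),
    ("Ga0100374_101183", "Bacteroidetes", 17801),
    ("Ga0100374_102127", "Bacteria", 10682),
    ("Ga0100374_102419", "Cardiobacterium valvarum F0432", 9479),
    ("Ga0100374_103258", "Candidatus Saccharimonas aalborgensis", 7183),
    ("Ga0100374_103573", "Weeksellaceae", 6590),
    ("Ga0100374_103866", "Actinomyces timonensis", 6131),
    ("Ga0100374_105412", "Candidatus Nanosynbacter lyticus", 4457),
    ("Ga0100374_106444", "Weeksellaceae", 3754),
    ("Ga0100374_111251", "Weeksellaceae", 2120),
    ("Ga0100374_122400", "Not Available", 998),
]


def tally_by_taxon(rows):
    """Count scaffolds per taxon label."""
    return Counter(taxon for _scaffold, taxon, _length in rows)


def total_length(rows):
    """Sum the listed scaffold lengths."""
    return sum(length for _scaffold, _taxon, length in rows)


if __name__ == "__main__":
    # Print one line per taxon, most frequent first, then the length total.
    for taxon, n in tally_by_taxon(SCAFFOLDS).most_common():
        print(f"{n}\t{taxon}")
    print("total listed length:", total_length(SCAFFOLDS))
```

The listed scaffolds cover only those carrying novel protein genes, so their combined length is expected to be a small fraction of the Total Genome Size reported under Dataset Contents.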

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0100374_100013 | Ga0100374_100013202 | F095630 | MRYSPLYKTSKDNGLLAHVYEHLLAQYVLKYLQDRGFFISSDIILTAKTYGDTCYMDVELYNPAAPNAYNEALQAFDKHTIPEKAVRRVVSECGIEMNRAVLELKQDELMSNLSRMQSSDWRQQGEMTYRKSYDKSSVNTLFCVPYLKYGKKSKKLFPEYVLEYSIDEEYINSPIDQALAAIVMQAVALNFLVAVRENYTVYDRGDQWSEASLSVGYRMFLGLAKEDKQITSQLKHEFAAYIQYLLKSPFCSNLQKALLRCSCNLEQVLLGRSTLNNILGGCVIGGRGWLEMADDARIKQMIAAIQLDVYDI*
Ga0100374_100019 | Ga0100374_10001971 | F085821 | MNDVFERLAPLAQTKKQQTAVAKFIEKGFKFTRQPDAQKFTFLVYDFFVAGNMPAVQLCIDYLVAQGYPTAENEKKFLNWVFMEPVYYLKYFISDASTQKKLHHNLLNLWYESTRRQILEQDNGKHDEAYIDEWVKANVNKTLSTIRSGETIASCKEDVAKHSRTLNDEYEQNIGLLDEYCRALLYTAETTQSQQELVAQTQNHIALLKDLYKKVNKIK*
Ga0100374_100399 | Ga0100374_10039914 | F074985 | MTRSVISEEDIVELTDGGWYKTPRIIKGKDFLAHIHDTYASGNAMYVEFKASEGEVRILEYRRLYDVDTESAVLFTINTYPQESILLKNIEEYEFIQYRPQQAWKAIHMGSTKRFSVSDFDELYIDQTFRKMTPVAFTHNNQSWTVMGLELASAETGWFIYLKRRDSDFMTRLNFNRDQKFLYNPISGSWSLDDPTQEIKDLEEIKQALRADAILDVTVSGVPMKLIRVQEIAKGVLFFVFQDEEKNKRYYYNRPAIKLRIVTDSETGEQKYLLDHIKAMHID*
Ga0100374_100399 | Ga0100374_10039915 | F036281 | MITLIKVDEGPVDIYELRMQYLAKLKETDGVMLPTFIYRNKDLFVTEFKPTCDDQWIMYMTNAEGLITKMRIKNGDLMSNGSVLFLAEERKTYNAKEYYDYWTAREGRPAPFFYESRQYHVKSFMRVPGSTDLWITAERETGHWYTFRMSDAQKSKFTRNTMTNEKGHQTYDWVLENVEWADDTIRYF*
Ga0100374_100399 | Ga0100374_10039916 | F027205 | VASRLIVSADDILKAVKESEEFERKALSEARKRDRDEGKEPRETLYPNPDLKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFIVDVSQVRNRELADEIEKDLFAFMDYILDEYDIPRRIKRSAK*
Ga0100374_100399 | Ga0100374_10039919 | F043991 | VSKKNPSVIDYFDLNGDLNEEAYEFEEVKLEDYIDKRSNVKPSWVGKYSHQMHFDLPDDTEVSFYKGLNIVYADINFAGGIRTILFKCRQKKNLTRFISRVLEIAQGDPSNVHPDFRA*
Ga0100374_100399 | Ga0100374_10039920 | F051214 | MPGKIVAHDTHLQIDTEFIELKDCFEAFRRGVEYRDKNDVDDILVICNAPDIIEYQLKNGDSFIVTYDPIHRIIVMRVFLHDEDITIKPIYIYNNREYQIACEFLRQVMHDKIDLKDEWIA*
Ga0100374_100399 | Ga0100374_10039922 | F018385 | MAEYENQWGPYKEHSIEKDRDPALDDPIIYGVNVKHFTVTVYSQDGRVNKYWNARILKDDLGCCRIACPRDGKILCFNWVHWTAYMFTHDGLNELVFMPGSSRKTISRLYYEEMK*
Ga0100374_100399 | Ga0100374_10039933 | F081456 | MFEEPPIYYILISLIFLIVFGAIAFATWLVWLTPIAFMAKLVMTAIGFLLCAITVILYTISAD*
Ga0100374_100408 | Ga0100374_10040825 | F095632 | MSFKETTGYKVVSLVASTSASITAGAVVGALCPPAGVVLTAIYGLGSSVLGTYVGDKAGRQYAETLAETIDSLQTPQNN*
Ga0100374_100408 | Ga0100374_10040844 | F049707 | VSEYRSPHNDGHDPYILIWEYGNDIRRAEFTERWAEYDETGWTVWYFRLVDGGVMTFSSREWDQKDDVNHLTTIWMRPSLYDIERKTS*
Ga0100374_100443 | Ga0100374_1004434 | F051211 | MKAKKAIRIFKKIRDLPYGTSGSDEVWSCYQKCVLLKQELQHIGITSQLLIGVFDWQDLPIPERILNLRRQRHERHVILRIFIDGSTYNIDPSIDIGLAPTLPIAHWDGTSSTATMASLKHLRVYRPHSLHERILSRLRSKLFRGNPKEFYIAIDKWLADTRAHHSS*
Ga0100374_100623 | Ga0100374_10062344 | F084362 | MSLMNCTFTVRWSDEKNKPHAKTYATEADAKRAKKWLLEHGVRSVDIAVKINNKPAGSLKDDKQSETEAEQKGFWWEK*
Ga0100374_100623 | Ga0100374_1006236 | F103430 | MIEQITIKAFIGSDNKTKKLEVDKIISIVNTNHEAFTLDYPVVGYWRGEAEETAVLYLSDDRQRVMNTLSELKEVLDQEAIAYQIENDLQLI*
Ga0100374_101183 | Ga0100374_10118317 | F081454 | MKAFKLVLLLFITSASLVFGQEKRYFFKHEFQPNSKYLIKYKTDMDGGYKFVGSKEVIDKIGMDGVKMTINSDIESAISTQKKQGNNVPFILEYTKYFYKAEINGETVNRKIPLQGVKLIGDIVNGKKMEVKNVEGNIDENTKKILIESIKQFSAIDTDFPKEGLKIGDSFDIVVPYKQSTQMGDIEMIMNIKYKFLKVEKEEAYFDMLIDFVMGDKNVKNMDLSASGDGKGFLLFDMKNNYFTSQNIDMTINLKLKTELLTLENTSKAKSVITQQKIK*
Ga0100374_102127 | Ga0100374_1021277 | F030786 | MKRTRIHKVVFQMLVVMVVTGSLQMLLKNGSATKGGNTMGTKKISLADITDGDDSKEGAIKVKYFDDDGADKIFENNNNRILNTINSQHISFNSQSAEYSKPQLFLLYRSLKVDC*
Ga0100374_102419 | Ga0100374_1024193 | F097526 | MTENSTMATTIYTQDDYCRLCERLNDDTVAALLHAHIDAFAADGNRQLLKRLTEAMRIAAEFEQAGRNGHPDPAELERRERWDSHCKRLQQHARMAGDQITNADNAAVSRLTKQCEKNGSRDTAIPSPHDDGYGFTAGLRDFPLNASQTALLWRMAVLTIAEMTDTTPELTAHYLNGTGGEHLGRALAGKTVYPVTVVSNLAWLLHEQHKSGQLQRDLLSAAKAYENRNTDNRLEDAMTKP*
Ga0100374_102795 | Ga0100374_1027951 | F061927 | MKNPIFILSAMLILGACSSDSTQKATEKIKNAYSEGLRKTFIKEGIKSCIENSGLKESEAREYCECAMNKLNESLSN
Ga0100374_103258 | Ga0100374_1032584 | F101358 | LTQPITTDGMPQERATHPDTYIIPKHENVPQYLWNVLRSSGQLDEGWIDREIINKDDVTLVRMTKLVNRSNIYQLEKCVPLAVLQEQNIDFTNAYLQKAYGVMVKDGRLQPAADTTHATHQNDSTDEQETEVLLAQRGDTYDHLGGDSARQLVGSVAAKASGEVDNSPLVQRALECIR*
Ga0100374_103573 | Ga0100374_1035733 | F040685 | MKKLLFKLFFALTLTSISLHGQEKIQQVEVHIFGGMALYSSHYTINFLYKEFEAKQVMGEPAELPKKILLLNPPDKWRVFTKKINLDRFKKLRDGPSEQAFDGQDEVIIIKTDKKTYRKMNASGNDHDREVWYDLLQIIAKEFGKKGIYE*
Ga0100374_103573 | Ga0100374_1035734 | F051212 | MKKFFFIFVLCWLHSCNGTEKAMATSPDTQKTSISEKQNAEKIERIIYSETGGDTGGKNVHLVITKDSIIYRLTEGVTDKKTIANLSLNNNNKDWEAFIDKIDLEDFEKGKPSKELIMDLPTTKIIIKTDKKEYSKTNIQNNTTWDYITKQIIDIKYSKLYNHLNLKK*
Ga0100374_103573 | Ga0100374_1035736 | F047127 | MKKTFAFILLSIISLAKAQLTDIRYIPVISTDTISTKANLYPHVLSKNKFNPLVFENGFKVGERRQAVRLWDKDIFYLEFTDRKMNKRVFRQIPELKKNGKLFEVMLQGDVSWYRRYFSYRADTWDANYEHEDYFVKGDEIVNIPVKGRYKKKLKALLSDKPEIAKEVDQMVGDRDIREILEKYNRK*
Ga0100374_103866 | Ga0100374_1038666 | F063778 | MPETISEGAQQKLLQQLQNALGLVADADTSAHDVAAITHSAADGHQLTEVMLQQMTAIDAYLKNCQTSINDAIGNIEAIPLDPPPED*
Ga0100374_105412 | Ga0100374_1054125 | F078842 | MIISSIYKTVDNDGLIAHIYEHLLAQYVLKRLQDNELFVLSDIILSAKTYGDTCFMDAESYSPEAKKTYDEAVREFDKLVIPEDDILRAAGECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKARNESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLKKDDKIINQLGCDFLEYIKNLSSSVFCDNLQKALVRCSDSHKQVILNRSTLNAILGGCVIGGKGWLEMADSARIRQMVNSIELDIYEVDS*
Ga0100374_106444 | Ga0100374_1064443 | F071327 | MEDLFNSVYSTHKGISFSTVVVFGAFIFLLLQVHLSYKGTISEVLRKTSFFSMILLYIQGILGVFLGIYSPEFSEASEFSSYFKLFEYGIIILTCAGMITYVYIFLKNNQTLTLKIVVIALATALLFEYAYPWRLIFG*
Ga0100374_111251 | Ga0100374_1112513 | F061926 | MMIKTAKHIKTFLASVLLLIFVMNVSGLFVRLHHQEIHQKTEKIAECSDKVCYHKVHLQTKSDCDCGFLCTLNYFYILPEKPQTEFHVNEYFSYFSSYKIFVSERIILLWQSRAPPVLS*
Ga0100374_122400 | Ga0100374_1224001 | F022002 | MSSRLALTLGLLLALLLSLPSYAQDGQSKEPIDRTISGFTLGVTTPAEARAIIQRQGGEIEETQAWSDEVVYAITGLKYARRPTLSVRLYFYKGHLRSISFVFGDLKIFEQIESGLENKYGTMAEGKATSKMRVKGIADAFTSLEVVVHSFEDDGHVGFAYAYISYTDLELDRAYSAENENEI*
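
For downstream tools, rows of the sequence listing above are often easier to handle as FASTA records. Below is a minimal Python sketch under stated assumptions: the helper names `gene_index` and `to_fasta` are hypothetical, the header layout is an arbitrary choice, the trailing `*` is treated as a stop-codon marker and stripped, and each Protein ID is assumed to be the Scaffold ID followed by a numeric gene index (a pattern inferred from the rows shown here, not documented by the database). Two short rows from the listing serve as sample input.

```python
# Two rows copied from the Sequences listing: (scaffold ID, protein ID, family, sequence).
ROWS = [
    ("Ga0100374_100399", "Ga0100374_10039933", "F081456",
     "MFEEPPIYYILISLIFLIVFGAIAFATWLVWLTPIAFMAKLVMTAIGFLLCAITVILYTISAD*"),
    ("Ga0100374_100408", "Ga0100374_10040825", "F095632",
     "MSFKETTGYKVVSLVASTSASITAGAVVGALCPPAGVVLTAIYGLGSSVLGTYVGDKAGRQYAETLAETIDSLQTPQNN*"),
]


def gene_index(scaffold_id, protein_id):
    """Return the numeric suffix of protein_id, assuming it extends scaffold_id."""
    if not protein_id.startswith(scaffold_id):
        raise ValueError(f"{protein_id} does not start with {scaffold_id}")
    suffix = protein_id[len(scaffold_id):]
    if not suffix.isdigit():
        raise ValueError(f"unexpected gene suffix: {suffix!r}")
    return int(suffix)


def to_fasta(rows, width=60):
    """Format rows as FASTA text, wrapping sequences at `width` residues."""
    records = []
    for scaffold_id, protein_id, family, sequence in rows:
        residues = sequence.rstrip("*")  # drop the trailing stop marker, if any
        header = (f">{protein_id} scaffold={scaffold_id} "
                  f"gene={gene_index(scaffold_id, protein_id)} family={family}")
        wrapped = [residues[i:i + width] for i in range(0, len(residues), width)]
        records.append("\n".join([header, *wrapped]))
    return "\n".join(records) + "\n"


if __name__ == "__main__":
    print(to_fasta(ROWS), end="")
```

Rows without a trailing `*` pass through unchanged, since `rstrip` only removes the marker when it is present.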

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice