NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300006524

3300006524: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 158479027



Overview

Basic Information
IMG/M Taxon OID: 3300006524
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052545 | Ga0101033
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 158479027
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 154100708
Sequencing Scaffolds: 25
Novel Protein Genes: 28
Associated Families: 24

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 5
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 4
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 2
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1
Not Available | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1
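
The per-lineage counts above can be cross-checked against the "Sequencing Scaffolds" total and rolled up to any taxon that appears in a lineage. The following is a minimal Python sketch of that arithmetic, not part of NMPFamsDB itself; the names `PHYLOGENY` and `count_by_taxon` are hypothetical, and the lineage strings are copied from the table.

```python
SEP = " → "
BACT = "All Organisms → cellular organisms → Bacteria"
FCB = BACT + " → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes"
CPR = BACT + " → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria"
FIRM = BACT + " → Terrabacteria group → Firmicutes"

# (lineage, number of scaffolds) pairs copied from the Dataset Phylogeny table.
# Shared prefixes are factored into the constants above only to keep lines short.
PHYLOGENY = [
    (BACT, 5),
    (FCB + " → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas"
           " → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279", 2),
    (FCB + " → Flavobacteriia → Flavobacteriales", 2),
    (FCB, 2),
    (FCB + " → Flavobacteriia → Flavobacteriales → Weeksellaceae", 4),
    (CPR + " → Candidatus Saccharimonia → Candidatus Nanosynbacterales"
           " → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter"
           " → Candidatus Nanosynbacter lyticus", 1),
    (FCB + " → Bacteroidia → Bacteroidales → Prevotellaceae", 2),
    ("All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota"
     " → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2", 1),
    (FCB + " → Flavobacteriia → Flavobacteriales → Flavobacteriaceae"
           " → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium", 2),
    (FIRM + " → Clostridia → Eubacteriales → unclassified Eubacteriales"
            " → Clostridiales bacterium", 1),
    (FIRM, 1),
    ("Not Available", 1),
    (CPR, 1),
]


def count_by_taxon(rows, taxon):
    """Sum scaffold counts over every lineage that passes through `taxon`."""
    return sum(n for lineage, n in rows if taxon in lineage.split(SEP))


total = sum(n for _, n in PHYLOGENY)
print(total)                                       # 25, matching Sequencing Scaffolds
print(count_by_taxon(PHYLOGENY, "Bacteroidetes"))  # 14
```

Splitting on the arrow separator before matching keeps "Bacteroidetes" from also matching the "Bacteroidetes/Chlorobi group" level.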

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F030786 | Metagenome | 184 | N
F032313 | Metagenome | 180 | N
F033081 | Metagenome | 178 | Y
F041827 | Metagenome | 159 | Y
F046431 | Metagenome | 151 | Y
F046432 | Metagenome | 151 | Y
F047127 | Metagenome | 150 | N
F051212 | Metagenome | 144 | N
F051213 | Metagenome | 144 | N
F055792 | Metagenome | 138 | N
F058221 | Metagenome | 135 | N
F061925 | Metagenome | 131 | N
F061927 | Metagenome | 131 | N
F063777 | Metagenome | 129 | N
F064818 | Metagenome | 128 | N
F068942 | Metagenome | 124 | N
F071327 | Metagenome | 122 | N
F071328 | Metagenome | 122 | N
F077404 | Metagenome | 117 | N
F078842 | Metagenome | 116 | N
F089055 | Metagenome | 109 | Y
F090516 | Metagenome | 108 | N
F099454 | Metagenome | 103 | N
F105380 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0101033_100002 | All Organisms → cellular organisms → Bacteria | 179210
Ga0101033_100003 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 178667
Ga0101033_100353 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 29662
Ga0101033_100548 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 23686
Ga0101033_101324 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 14683
Ga0101033_101395 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 14255
Ga0101033_101599 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 13154
Ga0101033_105139 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 6063
Ga0101033_106127 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 5271
Ga0101033_107112 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 4653
Ga0101033_109021 | All Organisms → cellular organisms → Bacteria | 3768
Ga0101033_109442 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 3612
Ga0101033_114295 | All Organisms → cellular organisms → Bacteria | 2337
Ga0101033_114327 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 2331
Ga0101033_115204 | All Organisms → cellular organisms → Bacteria | 2190
Ga0101033_115589 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 2132
Ga0101033_115821 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 2098
Ga0101033_118184 | All Organisms → cellular organisms → Bacteria | 1797
Ga0101033_120926 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 1535
Ga0101033_121619 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 1475
Ga0101033_122972 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 1369
Ga0101033_125768 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1192
Ga0101033_134679 | Not Available | 823
Ga0101033_136161 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 781
Ga0101033_136370 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 774

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0101033_100002 | Ga0101033_100002181 | F046432 | MQMTENELSEVISKFQMPEGRYSIEQEGSFGRGEFFWIIKNQSTNQKYLLMNTYSHHGVESELECYREEGFDNLEAIPRKIETLEIPSDAEDEISKYLFGFYSIFEIKS*
Ga0101033_100003 | Ga0101033_10000317 | F099454 | MAFTFVFTSCDRLTDEPTLEDRGYKYFDSTAQRKSFRVVTASGKPYNHKIDWHIIGIRDSKSDTYLTKKVDTLSNGDLKISYDWISFTIREKKSVIDVEVQKNETGEDRSVKFVAQDNHKGLASPSMKVIQQAK*
Ga0101033_100003 | Ga0101033_10000318 | F051213 | MKASKLLWAVVMALTFVFTSCDPFSQNEPTIEGDRYKYFDSSAQRQSFRVVNGSGKPYNHKVDWHIIGIQEENSDTYLTKKVDTLSNGDFIISYDWVTFTVKENKSVIDVEVQKNETGKDRSVFFATSNSYKQAYLPNMIVTQRAK*
Ga0101033_100003 | Ga0101033_10000319 | F051213 | MKASKLLWAVVVALTFVFTSCDWVGDEPTIEGKLDKFFDSQAQRKSFRILTGSGKPYNHKVDWHIIGITDPYSDTYLTKKVDTLSNGDLKISYDWVSFTVRENKSVIDVEVQKNETGKVRAVYLNTNTSGRHITLPDMRVTQRAK*
Ga0101033_100353 | Ga0101033_10035314 | F055792 | VEIAGEALDSTSAVAHRILLLTTQLGESLLASLRTEDGVIAEAMVTGALERDLAIDCALEEVRPVFVDESDDGTEAGTTWSRHPLETLQKEGYILFEGSMLPCKACRVDPRSSVKSLDLEPRIIGEAIEPVALPDVTRLDESIALQGIGSLRDLLMTPDVSETDYLQTSREEGTDLLQLMSIIARKYQLFHTFVS*
Ga0101033_100548 | Ga0101033_10054810 | F071327 | MEDLFNSVYSTHKGISFSTVVVFGAFIFLILQVHLSYKGRISDVLRKTSFFSMILLYIQGILGVFLGIYSPEFSEASGFSSYFKLFEYGIIILTCAGMITYVYMFLKSNQILTLKVLIIALAAALLFEYAYPWRIIFG*
Ga0101033_101324 | Ga0101033_10132419 | F071328 | MQGLICHVNEGCKMSRKHWTFINLIRYIEEYERNPLLIERMKWKFIPEGECIVEFVELCKHLVLERTIDSKESLTTAIYLRYSSQLLLKKKRAIRRLGIEKENISAILRQCGRHYKEYGDDEHRVFFLDTDVNIYFCKHYQLPIYILQRIEFSNKEYRPFILKVLPWKKDEW*
Ga0101033_101395 | Ga0101033_1013951 | F051212 | MKKFFFIFVLYWLHSCNGTEKAMPTSPDTQKTSISEKQNTEKIERVIYESGGDKSGKNVHLVITKDSIIYRLTEGVTDEKTIANLSLNNNNKDWEAFIDKIDLEDFEKGKPSEELIMDLPTTKIIIKTDKKEYSKTDIQANKTWDYITKQIIDIKYSQLNNHLNLEK*
Ga0101033_101395 | Ga0101033_1013953 | F047127 | MKKTFAFILLSIISLAKAQQTDIRYIPVISTDTISTKANLYPHVLSKNKFNPLVFENGFKVGERRQAVRLWDKDIFYLEFTDRKMNKRVFRQIPELKKNGKLFEVMLQGDVSWYRRYFSYRADTWDANYEHEDYFVKGDEIVNIPVKGRYKKKLKALLSDRPEIAKEVDQMVGDRDIREILEKYNSK*
Ga0101033_101599 | Ga0101033_1015998 | F064818 | MGKRKKYPIPNNINGNTPKLYFINHKFIVVLLPYFTYMDKDEFLKKLIAFIADNSSELHPKIFKNKIRFGINKNSYTEMRFDYSQNYKGFYLQLASYNKEVGDFFEQEMGNSFLKMLEDESKEFRNLFFVQNSFQISHYYYGFPIMTNDNTGHLYPEMGTTIFNDILRNLQANHFKFIQAAEVLSPDLLHYIKKFPSCFFNTALVALLIIEKNLLSLNDERVQGLFEYDNMVTKNECKLFSPFDLIFGKKDYQQTAKQRILQRR*
Ga0101033_105139 | Ga0101033_1051399 | F030786 | MKRTKIHNVVFQMLVVMIVTGSLQLLLKNGSAAKGGNAMGAKKISLADITGGDDSETGVLKVKYFDDEGVDKIFENNNNRILNTINSQHISYNSQSAEYSKPQLFLLYQSLKVDC*
Ga0101033_106127 | Ga0101033_1061273 | F061925 | MSWEYSINLDSEEAVSSVVTDLKICELFSSSTTDYIDWKNPKSIDSTPYDARFYTDKKKTIYIAINSFSKNIFSALKSILAKQNYHLTDCDTDEEVTLEHIFRSVI*
Ga0101033_107112 | Ga0101033_1071123 | F089055 | MNSTPECVTKTPEIEAREKLAAIFSDAKRYDGNSGVKPELGKAAIDGKDIKNIIKVNSADNEAVDLCNQALGSYGKSLDRINNSPLETVREIGDLLQSFREDKTKESCR*
Ga0101033_109021 | Ga0101033_1090213 | F078842 | MIISSIYKTADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFVDAELYSSEAKKTYDEALREFDKLVIPEYDILRAAGECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKAHDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHMQTPVDQALAAIVIQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLEKDDKIINQLSCDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSVLNAILGGCVIGGKGWLEMASSAQIRRMINSIELDVYEVDL*
Ga0101033_109442 | Ga0101033_1094425 | F077404 | MAQQIIMTHKLAAAALSLKEPTQIGNTQNPFAMNTLKTIFTCFFTLCFMMVANSYAQKTDSINTEAGKSVLHRNAIYIPPALEQYADTALLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTHNDRIELSNFVQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRKLILTDGHSTLYLYFLMIDSVAPNFDTEVLPYIKGVFRFNRFW*
Ga0101033_114295 | Ga0101033_1142955 | F046431 | ITIVSILLIIMLTYTQILVFAAESELTLTPKPETNNIHLKWTGPQNSSYRVYQKKPGSSTFETIGLTDFSNTDEEVKVLNVYPVSIAEYNTPDVNVTYLDGSSETIPKSALLKVWMEGRNSYRRV*
Ga0101033_114327 | Ga0101033_1143272 | F063777 | MKLLLYLCVPFLFIFGILLLIGWGIYSGISSIISAVKEDFFGIKDKTNIKTSKNILFENEQFKLKKEDYLPDENSQEYKIFDDFCAKSNEYLDDGYIFYRLTDEKSATELNGAIISEFQEDVGNYILLQNLILEDNQLKNQLISFNKNTGKITVLADIKDFFWLDFDSETKTINGYNNKEQIEITISE*
Ga0101033_115204 | Ga0101033_1152042 | F033081 | MCPPDLMVVHRPRKGVMAWLFNRVMPTDSRPVFVWPKLVAAIEDTGNFGKRWLTAIATGLIIVTIATIKALLMIPGLDSSVVGLLTSIFETFLPARWATGAAWIVGTTGVFLMGNFTSYTKKQKSLQSLKATGCGMYNTLLLFALLEEQAFRSGSERWNWRERVRASVCFGLLHVTNIWYSFAAGIALSVTGFGFLMVYLWYYRKYRSQIIATAAAATVHALYNAIAISLIAVVVAVYLAIDIAKLL*
Ga0101033_115589 | Ga0101033_1155891 | F105380 | MSILELDATQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSYAEDYFPNGDRLTLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRSKVPSHNIKKCAVFLGTITKEQENVNTYEEFMEWTKNTKWFK*
Ga0101033_115821 | Ga0101033_1158212 | F041827 | MKHFLSALALGCLLLSCNRDLENDETPAPAPQKEKLVLLKQLSEGSTVTFQYKNGNEIESVNIDGVGKSDIDYEYDTYGRIVKERRFHRRYDRGETNITYQYDNQGRLVSSHAISTKFYPGTGLTPRCSVEKKHTYTYQGNKVIVKIEMGADTCSAIPETGKEKTITLLVENGRVTKTFDEQGNIEQTIEYYNTKNALRNIKGFPALVVEFYIRPLTYELPYYNHGIERIEDLRYIDNIKTRDFHDGDYWEYRYRYDNGSTDNDYPNGVGIHARSHNDPTYDEYLYEISANRSYIKEE*
Ga0101033_118184 | Ga0101033_1181841 | F090516 | PLSPLFGKNKMTTFERYFIENDEVISLDKSSEFTDVIVGIGFLPQDMSENTDFKKKTLEKYGFSSSTALADDFRKRVLNIDEPIPENFEKDGIGYVYTVISGYDTFYNRMYMFGIHCFNGDFNVTYFDLENDAETGDYYKEHELYSQAKGYRWLDPESDYYEDVLAWEALNKLATDIYFHLEDKLDVKIDIKPIPEEEKVVPTQEHLAKFLAFCGVEQDVIDENKERLLKALEEYTPDEYEGVSEVMAEMMEYSHKIQRAEPVIEIIREYGVCRFSDWKFYAEELEEYILDLADFSDWKWEYPADTYSADLFPYMRKQLSLYHLWLCHLDEGADAYLFLLFSEKDMPEIMKLARILDLPLKAYFK*
Ga0101033_120926 | Ga0101033_1209263 | F058221 | MGKIKNFQDLKNQKEELRAEIKEIESVLSFENPRKSFGVITNGVTEKYLGGMMDSSLAQNAFFLADKFLFPSLKVGSAKLLSNVLLKRTKPSMKKTLIGLGVVVLAPVVIMRIKKRLDDFQQRETAKSLSKLI*
Ga0101033_121619 | Ga0101033_1216194 | F046431 | MKILKKITIVSILLIIMLTYTQILVFAVESELTLIPNPETNNIHLKWTGPQNSSYKVYQKKPGSSNFETIGLTDFNNVDEEVKVLNIYPTIEGLPMVNVTYLDGQTETIPKSGLLKVWMEGRNSK*
Ga0101033_122972 | Ga0101033_1229721 | F061927 | LNYIIMKNPIFILSAMFILGACSSESAKKAYNDSFRKTFIEEGVKSCIENSGLKESEAREYCECAMNKINENLSNDEIIDISMDNPPKDLDERIDKAISSCVENKP*
Ga0101033_125768 | Ga0101033_1257683 | F046431 | KKITIICILIAIILTYMQTLVMAAESKLTLTPKLETNNIHLKWTGPQNSSYRVYQKKPGATQFETIGLTDFSNNAIDEEVKVLNIYPQESNTDGRLWPTLNSSQVANIAKVLPKVQVTYLDGQTETIQKTALLKVWMEGRNSK*
Ga0101033_134679 | Ga0101033_1346792 | F032313 | MYRLLILLFAITLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLTRVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPKFVGGSITKEFTCRNGVILRHMQGIDINLVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKYAVEFLQQGHNIWGPFDWYYGRNSGRSEVMLEAK*
Ga0101033_136161 | Ga0101033_1361611 | F089055 | MNSTPKCVTETLEIKAREKLAVIFSDAEQRDNSKVNPELGKTAIDIENTSRMNSADDGAVYLCNQALGSYGKSLDYINNSPLEAVQAIGNSLQLFREDKTKESCR*
Ga0101033_136370 | Ga0101033_1363702 | F068942 | MIRKILSLPTLALCFTLCTALFAGCGENYDGSVTEVHWSNVKNPEYGNAINIMLKAEGETFTTVGNHSWISFSNDASTLDTFTRHRFPEVDKDTAYYKDIVIYMTRNERERTTTLKLVAPPNRTQQPKQFKFS
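
Rows of the Sequences table can be turned into FASTA records for downstream tools. The sketch below is an illustration rather than an NMPFamsDB feature. It assumes that scaffold and protein IDs follow the `Ga<digits>_<digits>` pattern seen above, that family accessions are `F` plus six digits, and that a trailing `*` marks the end of a complete sequence. The function name `row_to_fasta` and the header layout are hypothetical choices.

```python
import re

# Scaffold ID, protein ID, family accession (assumed: 'F' + six digits), then
# the amino-acid sequence with an optional trailing '*'. Non-word characters
# between fields (spaces, '|') are tolerated, so both run-together rows and
# delimiter-separated rows are accepted.
ROW = re.compile(r"^(Ga\d+_\d+)\W*(Ga\d+_\d+)\W*(F\d{6})\W*([A-Z]+\*?)$")


def row_to_fasta(row: str) -> str:
    """Convert one table row into a two-line FASTA record."""
    m = ROW.match(row.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {row[:40]!r}")
    scaffold, protein, family, seq = m.groups()
    # Drop the trailing '*' so the record holds residues only.
    return f">{protein} scaffold={scaffold} family={family}\n{seq.rstrip('*')}\n"


# First row of the table, with fields separated by ' | '.
example = (
    "Ga0101033_100002 | Ga0101033_100002181 | F046432 | "
    "MQMTENELSEVISKFQMPEGRYSIEQEGSFGRGEFFWIIKNQSTNQKYLLMNTYSHHGVESELECYREEG"
    "FDNLEAIPRKIETLEIPSDAEDEISKYLFGFYSIFEIKS*"
)
record = row_to_fasta(example)
print(record, end="")
```

Sequences without a trailing `*`, such as the last row of the table, pass through unchanged; whether such entries are partial genes is not stated on this page.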




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice