Basic Information | |
---|---|
IMG/M Taxon OID | 3300008514 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052933 | Ga0111023 |
Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 160765029 reassembly |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 109748450 |
Sequencing Scaffolds | 22 |
Novel Protein Genes | 26 |
Associated Families | 25 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria | 2 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 1 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 4 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 4 |
Not Available | 5 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → unclassified Prevotella → Prevotella sp. | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Pseudoprevotella → Pseudoprevotella muciniphila | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Maryland: Natonal Institute of Health | |||||||
Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F032313 | Metagenome | 180 | N |
F033081 | Metagenome | 178 | Y |
F047508 | Metagenome | 149 | N |
F054109 | Metagenome | 140 | N |
F054110 | Metagenome | 140 | N |
F055792 | Metagenome | 138 | N |
F067846 | Metagenome | 125 | Y |
F068942 | Metagenome | 124 | N |
F072446 | Metagenome | 121 | N |
F078842 | Metagenome | 116 | N |
F080164 | Metagenome | 115 | N |
F080166 | Metagenome | 115 | N |
F081455 | Metagenome | 114 | N |
F085820 | Metagenome | 111 | N |
F089057 | Metagenome | 109 | N |
F095633 | Metagenome | 105 | N |
F098763 | Metagenome | 103 | N |
F099452 | Metagenome | 103 | N |
F099453 | Metagenome | 103 | N |
F099454 | Metagenome | 103 | N |
F103432 | Metagenome | 101 | N |
F103433 | Metagenome | 101 | N |
F103435 | Metagenome | 101 | N |
F105379 | Metagenome | 100 | N |
F105380 | Metagenome | 100 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0111023_100069 | All Organisms → cellular organisms → Bacteria | 86750 | Open in IMG/M |
Ga0111023_100116 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 67239 | Open in IMG/M |
Ga0111023_100157 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella | 53807 | Open in IMG/M |
Ga0111023_100195 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 46982 | Open in IMG/M |
Ga0111023_100197 | All Organisms → cellular organisms → Bacteria | 46857 | Open in IMG/M |
Ga0111023_100251 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 41184 | Open in IMG/M |
Ga0111023_100267 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 39314 | Open in IMG/M |
Ga0111023_100312 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 34006 | Open in IMG/M |
Ga0111023_100464 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 25611 | Open in IMG/M |
Ga0111023_100632 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 20388 | Open in IMG/M |
Ga0111023_100831 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 16100 | Open in IMG/M |
Ga0111023_101696 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 9206 | Open in IMG/M |
Ga0111023_101701 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 9188 | Open in IMG/M |
Ga0111023_101716 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 9138 | Open in IMG/M |
Ga0111023_102911 | Not Available | 5736 | Open in IMG/M |
Ga0111023_106787 | Not Available | 2531 | Open in IMG/M |
Ga0111023_107210 | Not Available | 2380 | Open in IMG/M |
Ga0111023_108266 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → unclassified Prevotella → Prevotella sp. | 2092 | Open in IMG/M |
Ga0111023_109332 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Pseudoprevotella → Pseudoprevotella muciniphila | 1856 | Open in IMG/M |
Ga0111023_115264 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1133 | Open in IMG/M |
Ga0111023_118780 | Not Available | 926 | Open in IMG/M |
Ga0111023_119960 | Not Available | 879 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0111023_100069 | Ga0111023_10006957 | F033081 | MYTDITVVHRPKKGVMAWLFRRAMPQDTRPTFVWSRLVTEIENAGYFSRLKFSILAVGLIIMTIATIKMLLFVPGLNQSAVSLLTRGLETFLPTRWATATAWTVGMAGVFLMGNFTSYTKKQKSLQSLKATGCGMYNTLLLFALLEEQAFRSGSERWNWHERVRASVCFGLLHITNIWYSFAAGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAVVLAIDIAKLL* |
Ga0111023_100116 | Ga0111023_10011678 | F105380 | MSILQLDASQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSREESYFPNGKRLKLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRSKVLSHNIKKCAVFLGKITKEQENVNTYEEFMEWTKNTKWFK* |
Ga0111023_100157 | Ga0111023_10015734 | F054110 | VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKIYIVSRMYTEEEYKLTFPHKYKKGKTFKEKQLYKKESEYSSTKQHEVLLFLVRTYKGGE* |
Ga0111023_100195 | Ga0111023_10019518 | F067846 | MSIIAEWERQEFNKWDKQCSKEDDYNRAVEMEIEAIKEDIANNDSDAICAFSEKMFDDDEFLKAVALGTDYEEMRIKILTAMAEDRLEQLEKDYRNGYILND* |
Ga0111023_100197 | Ga0111023_10019717 | F103433 | MKEWSKNKPGVVFFFVVWFVLSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFVVMLAFSPWIAGYARNIYWIEPLLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVSVFAATVVAFFGAYWVNFMSLTDYYGSSDKAANAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSAIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE* |
Ga0111023_100251 | Ga0111023_10025125 | F099452 | MDKTYEELLQETLSKIYELKDLDNRDRGKALTIFIGERLNRELLLSSRHIFTLYSDIINLDDVSLLTDLRKTDWYKDWFTDDRNNANLINLSRFNFKTLARFEKEEYLRNVEQYDFEGVVQVDGYSLFDTLIEDKDVELFKLAAENILISHGFFDNTDYNFYDIPDEYMGDKEVCAYMCLLNIENMDFVDKKTLDTTVLYNIVKDRICGSIYFTLFDSLNKDTRTNAR* |
Ga0111023_100267 | Ga0111023_10026721 | F095633 | MRNYENSTEVGCREGLTEGELRTMGTLAMEATEELKKTTISKEAVLLGGVPFGSWDEFAKAVQEMAAHIYEPIPVKINTKRLIATAFLDDGGEMSVEEHSVPEEVFIDLSRTRCVVDEDRSHKSYEFTCPVLKKFPDGELYPIREAYVISAIDVNGSQEVDFKII* |
Ga0111023_100312 | Ga0111023_10031236 | F099453 | MLRRKDMNRFDIIELAQQTITFVHSAFNGKVNALDPYTRLNFVAGYLDKKTNIARTTPYGCIYVSLEAFADTVEAYKFIDTDQIRNLALEIIIHELTHVDQLIDYRYIKFNNGYREEIERQCVKQSCQWILDNIQFIRSLGLVVIPEVYEERLIGLSDVTYSFKNPAVIAMSKLEHMIGKKFKEFNSNDIEIVYVDRLKNYYKIPVCVNRMYQNSQNLNDLGERLLNDKQYTIEYMEYGNSKLVIKITQGV* |
Ga0111023_100312 | Ga0111023_10031239 | F080166 | VYSLANFENYNKVVEVIFELNYKLTLKMEVTFNSILKRLGNDVRENFHTEYIVNSGSLTTNVKYRYRMKLSPKGEQTCVYIDWDNYDDLFNVLEESIKICDPENPRTPFKRTYSDKGDLLDIRCNSLQVKYQHLNDRFGNTIDLIPFVLVDEQSGLLTEAIRFRFNNELVYDVPISRLKGFRRFLMTYNPLLHAGAMARYMAMTPLLGSNRQNMMRS* |
Ga0111023_100312 | Ga0111023_10031240 | F081455 | TKLNLKFQQEREKRNAIEDSKALIRHLDPDYAEYELESLNPVEEIDAQNVEIIDAVDAIQQPLENKDAGIAVNFSQMINQPAIQEEVAKVVTSVPEQGEPKIKVVFPQNEHILGSYVDYDSFNKIKESNADTIIRSVRVLNFKMSDPNAVAAFNNFIMKFNPECDPNKRLKYELIRHQGREKDIVVRLSTVVNDVKYYADIYADLNKIDLDHHLISSAKKR* |
Ga0111023_100464 | Ga0111023_10046415 | F089057 | MTNIIPIIAKKYNRKGDTSGSLKSLVSDLNCIDNVDDSLLFLSSIPRETKYTLDEVFDLITSDDIYIKIFGNVLTFLNMDLDYHRLLLNAIKSESYKIISIINESIPTPDLFLAKNNYECLSVALDKPFVIFDKILGMVVSQLLHTASSKEERIFGIFMTICIINREINKLASLCTGYLAITRDEVLVKDLMNESAMVAFQYMSTEDINNVVSDINSRSVLSRYLSNM* |
Ga0111023_100464 | Ga0111023_10046424 | F103435 | MKFVFCTEPIYQYYRAHLYDADKDKLDKQLLVEYGDYKDIWDLKQQQDALPENIFVAELTSRDYPRNPWNYVSQLINKLTYQYLIDSPEFENIFSEILFNQSEREFYEFYKAIDRFYNGSEIFITVGNDDYSDMVTQMVCSVLRRTYGIRPQIIYDIDDVHSIRDNIDFSPEGAQIAYLQRHTYLALEAKSTVEPLRIWYPFDMNSYTNALE* |
Ga0111023_100632 | Ga0111023_1006329 | F099454 | MKASKLLWAVVMALTFVLTSCDRVTDEPTIEGKMNKFFDSQAQRKSFRVLTASGKPYNHKIDWHIIGILDPKSETYLTKKVDTLSNGDLKISYDWVAFIVRENKSVIDVEVQNNETGQDRSVDFVAQDNHKGLASPSMTVIQRAK* |
Ga0111023_100831 | Ga0111023_10083110 | F098763 | MDGRTSLRVLSPALYLATKLFEAVVWTTISYDPSDEVEGKGGMNAIPTAVDE* |
Ga0111023_101696 | Ga0111023_1016963 | F047508 | MLGHRLVEGRIKYPYLRSIWEYLRHSFDTEDVGWVVKRSKLCALMEHIYYLWGDTYALSKALCTVYEAVTDGVDLIEGLYEVLFFENVEDNLYAACVVRNVKVALDLLSFGVTEGDEGVVDPYALFVPRGQDLVVGELDEGELQGGATTVEDQDFHKVLYYMVRCELILSSP* |
Ga0111023_101701 | Ga0111023_1017014 | F055792 | VEIASEALDSTSAVAHRILLLTTQLGESLLASLGTEDGVIAEAMVTGALERDLAIDCALEEVRPVFVDESDDGTEAGTTWSRYPLETLQKEGYILFEGSMLPCEARRVDPRCSVKSLDLEPRIIGEAIESVALPDVTRLDESITLQGIGSLRDLLMTPDVSETDHLQTSREEGTDLLQLMSIIARKYQLFHTFVS* |
Ga0111023_101716 | Ga0111023_1017164 | F105379 | MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYINPIIGVGPEKSYFQTTSYMVDLSPHINNLLVNISDLKNLGKITQLEPSGDNPEIAIHKPIVSVFNWDAEYVKACMNSLREYQIDDNIITRTEEFHNTDCYNELMAGSASTGAFRINVGGYMIDIPKSAIPTLKSDHVVATVYNAPNKDFNVLRFKITKRNGIVVNQSMLFLPY* |
Ga0111023_102911 | Ga0111023_1029112 | F103432 | MKLIHSLFSLPLLLVLGGLFCTTACQDDAEPTQRAGLISTDSLIHAAKVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVTMDSTTMFRRHNLPSVSAYSLQVVAVGDTIYRQKESDAQFNADLDALFHESIGIAPRLFGVKELSVVGIDRKGKPRDLGNYSCPLLQGKRRNVNYRTREGIFHEHYEAASVDTFSVKSNWLLKTKAEPSLYAPSFRLLVWQQPAAGCTKLRFTLTLVDGRSLVAEVPLR* |
Ga0111023_102911 | Ga0111023_1029113 | F080164 | MVGLTLCAAPQVTLRERANAFPLITEKDASEIDAPYAWRLPVVPLSLDNREIRNFAKYPLLPSLSGGILTVRVLVVGDTVAVHQDLMDDFAKHCRATLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDKQVTLHLPGFEKVEKPLLYKGQAGQLVLCEYYESHRGDLFLDVANAHPEIFGELCPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPDRS* |
Ga0111023_106787 | Ga0111023_1067873 | F054109 | MLGRPKSKKGVKVHTAFKIYPDDKARVQAMADKLDMSLSSYINRAVLEKVERDEKSEN* |
Ga0111023_107210 | Ga0111023_1072102 | F072446 | MKKLLFLLSGLCLYCLAACDNDHEPTKPIRPFHGDTLAQIAWNFRFIVEHHYHSIPGIVPERTTYRVPVIPRSVEDKTKKEYNDMELGKEAHLVFRATVHGDTINRQKRELEAISRQLNALTLTSIGTSPVLCGVKSIEAVGVAERGNTYDLSREMKLRIRDYFGRVKYRSSGIVTLNCENTESMTAKYVVPLARIREDELAEHIQPELKFYLPVKRCMDFSSIRFDITLFNGEVLSFQHKLPSKSVLQELPNNSVLENYKQYGFEREATYFTTLWPLPDYKYNEREW* |
Ga0111023_108266 | Ga0111023_1082662 | F085820 | MRSTFYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVANECHFHRTWLTQGVKAIRVVRTHADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSEKGKK* |
Ga0111023_109332 | Ga0111023_1093324 | F068942 | MIRKILSLPTLALCFTLCTALFAGCGENYDGSVTEVHWSNVKNPEYGNAINITLKAEGETFTTVGNHSWISFSGSVSTLDTFTHHRFSEVDKDTAYYKDIVIYMTRNERERTTTLKLVAPPNRTQQPKQFDFSIEVTPPAMYIFKVRQPALPAKAQ* |
Ga0111023_115264 | Ga0111023_1152642 | F078842 | MIISSIYKTADNDGLIAHIYEHLLAQYVLKCLQDNEFFVLSDIILSAKTYGDTCFMDAELYSPEAKKTYDEVLREFDKLVIPEDDILRAASECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKAHDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLKNDDKIINQLSCDFLEYIKILSSPVFCDNLRKALVRCSDNHKQVILNRSTLNA |
Ga0111023_118780 | Ga0111023_1187801 | F032313 | MYRFLILIFALTLMACDNNTPQEKPHEQEKYEVPVPKPQFDEVGERIWYGKTPAMRLDSTDYGAGLTSVFGMLTSKISKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVILLHRQGINVNHVDTVNYVYDEVGNEIVLEGTGMRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEEK* |
Ga0111023_119960 | Ga0111023_1199602 | F068942 | MIRKILSLPTLALCFTLGSAFFAGCDEGYIKNTETKVRWSNVKNPKYGDYINITLKAEGETFTTVGDHSQIFFGYDASTLDTFTRHRFSEVDKDTAYYKDIVIYLTRNESKGTATLKLVAPPNRTQQPKQFDFSIEVTPPGMYIFKVRQPALPAKAR* |
⦗Top⦘ |