Basic Information | |
---|---|
IMG/M Taxon OID | 3300008626 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052986 | Ga0111364 |
Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 764487809 reassembly |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 103880820 |
Sequencing Scaffolds | 8 |
Novel Protein Genes | 9 |
Associated Families | 9 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 1 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 2 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes | 1 |
All Organisms → Viruses → Predicted Viral | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Maryland: National Institute of Health | |||||||
Coordinates | Lat. (°) | 39.0042816 | Long. (°) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F041827 | Metagenome | 159 | Y |
F046432 | Metagenome | 151 | Y |
F067846 | Metagenome | 125 | Y |
F073671 | Metagenome | 120 | N |
F080166 | Metagenome | 115 | N |
F089057 | Metagenome | 109 | N |
F092230 | Metagenome | 107 | N |
F092232 | Metagenome | 107 | N |
F095629 | Metagenome | 105 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0111364_100202 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 43137 | Open in IMG/M |
Ga0111364_100546 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 22641 | Open in IMG/M |
Ga0111364_101371 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 12206 | Open in IMG/M |
Ga0111364_101680 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 10394 | Open in IMG/M |
Ga0111364_104956 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 4024 | Open in IMG/M |
Ga0111364_105191 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes | 3822 | Open in IMG/M |
Ga0111364_106497 | All Organisms → Viruses → Predicted Viral | 2961 | Open in IMG/M |
Ga0111364_111000 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1563 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0111364_100202 | Ga0111364_10020250 | F067846 | MESIQAQWERKTFNDYDRQCSKEDDYNRAVEMEIESIKEDISNCDSDTLCAFSEKMFDDDEFLKAVALGTDYEEMRIKILTAMAEDRLEQLEEDYRKGYILND* |
Ga0111364_100202 | Ga0111364_10020255 | F073671 | MNKETEHELAELHEKERGLEKALELVREKIRELVNYTDKNKV* |
Ga0111364_100546 | Ga0111364_1005469 | F080166 | VNILANFENYTKVVEQIFELNYQLTLKMEVTFNNIIKRINTEIKENFHTEYVVGANKLTTNLRYRYRMRLSPRGETVGVIIDWDNYDDLCNIIDEAIDICDPDNKTSPFKRMYSTAGDLLDIKCDSLKVRYLHLEDRFGDRLDLMPFVLIDDHNGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGSMARYMAITPLLGNNRQNMMRS* |
Ga0111364_101371 | Ga0111364_10137115 | F092232 | MNTQAKFIADYNDKNRPKFNDKFFTKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYTEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAAAAKTQSITLKTNSNAVKMLRNFVDLNTTKEKTLRMAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFEDIIKISDHDIDDPEYYTFAIANAHMKNPFYISAVKSFVDNDRILQSFIASFAKAISLYATKKTTLDQIYTTEFWVCKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDDNFSKMLNIYREEKGYTSAIMLADDAGLELTDTRDPEAVAFDANLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV* |
Ga0111364_101680 | Ga0111364_1016805 | F092230 | MRQAHKRMVDKLKTHLLKVFLPLFIVCIILVAFFRQIGCGSDGEYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEAQNTEEKEARDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIAMTIGYYSYLKPETVYEFYCKLRRKEKYPSDVNLVKRIGFLVIILPPCMLFLLIV* |
Ga0111364_104956 | Ga0111364_1049564 | F041827 | MKHFLSALALGCLLLSCNRDLENNENNETPAPPKEERLVLASLYEFGSNVRFQYKNGNEINRMTIDGPHREASMDFEYDTYGRIVKERRFDHKSDYGETNITYQYDSQGRLVSSHAISTEYEEYPGSGLPPKTNPLCSVERKHTYTYQGNKVTVKIEMGADTCSAIPETGKEKTITLFVENGRVVKSFDENNQIIETIEYHNTKNALRNIKGFPALVAEFYVREYTYELKWYNVIESAYDFRFIDNVKTRNYPSGNYTLYKYKDDKGNSYDGDYPTDIFIFDKSHNDPTLSDHLWYLNASRYYIKEE* |
Ga0111364_105191 | Ga0111364_1051915 | F089057 | MTNIIPLIAKKYNRKGDTSGTLKSLVDDLVFIEDVDDSLLFITNIPRETKYSIEEVFNIISSNDKYSEVLGNVLSSLNIDLDYHKLLLNAIDSESYKIISLISDNIPTPDLFLSKNNYGCLTTALGKSYTIFDKVLGMVISQLLHTSSKEDKILSLFMTICIINKDIDKLASLCTGYLAIIKDEVL |
Ga0111364_106497 | Ga0111364_1064973 | F095629 | MEAKTMRELIICACLLGCFGVAQAAAPVDQPKEVKVVHNDDNVALHKKVYKLEQRIERLEKLLAEKEGK* |
Ga0111364_111000 | Ga0111364_1110002 | F046432 | MWEMTENELSGIISKYQMTEGRYSLEEEGSFGESEFFWVFKNQLTNQKYLLVNTYSHHGVEAELEYYREEGFDNLEAIPRRIETLEIASYADDEISKYLFGMYSLFEIKS* |