Basic Information | |
---|---|
IMG/M Taxon OID | 3300007979 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0053064 | Ga0114002 |
Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765094712 reassembly |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 113498527 |
Sequencing Scaffolds | 10 |
Novel Protein Genes | 10 |
Associated Families | 9 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria | 3 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1 |
Not Available | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, USA, HMP Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, USA, HMP Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | |
---|---|
Location | USA: Maryland: National Institute of Health |
Latitude (°) | 39.0042816 |
Longitude (°) | -77.1012173 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F046431 | Metagenome | 151 | Y |
F066860 | Metagenome | 126 | N |
F072446 | Metagenome | 121 | N |
F077405 | Metagenome | 117 | N |
F078842 | Metagenome | 116 | N |
F092230 | Metagenome | 107 | N |
F097527 | Metagenome | 104 | N |
F103432 | Metagenome | 101 | N |
F103433 | Metagenome | 101 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0114002_100118 | All Organisms → cellular organisms → Bacteria | 72981 | Open in IMG/M |
Ga0114002_101057 | All Organisms → cellular organisms → Bacteria | 15794 | Open in IMG/M |
Ga0114002_101234 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 13869 | Open in IMG/M |
Ga0114002_104812 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 4193 | Open in IMG/M |
Ga0114002_107496 | All Organisms → cellular organisms → Bacteria | 2619 | Open in IMG/M |
Ga0114002_107865 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2467 | Open in IMG/M |
Ga0114002_108724 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 2197 | Open in IMG/M |
Ga0114002_109769 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 1944 | Open in IMG/M |
Ga0114002_110559 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1784 | Open in IMG/M |
Ga0114002_115735 | Not Available | 1122 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0114002_100118 | Ga0114002_10011841 | F046431 | MKRISKIGITIILILSIIISCGSVIISRAVESELTLTPKPETNNIHLKWTGPQNSSYKVYQKKPGSSTFETIGLTDFSNNATDEEVKVLNVYPHSKNIGKLWEGSSAFQTLPMVTVTYLDGHTETIEKSALLKAWMEGRNSK* |
Ga0114002_101057 | Ga0114002_1010573 | F103433 | MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWSGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARAIAGLSPDDASKIPAYIKRASIFLAVLTAFLLALVVQKIRALFGGITASVFVVMLSFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELIHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFVSLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPESYNFINKIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFVQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYILIGLWADYVVKRTVKYE* |
Ga0114002_101234 | Ga0114002_1012348 | F092230 | MVDKLKAHFLKVLLPLFIVCVIFVAFFRQIACGSDGDYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYNNMAYGEGNVEVSDSTSTGEVQNTEEKEARDKVDDAINSMRHLFASAIMVNLRVVELYKILTVCMILIAIAITIGYYSYLKPETVYGFYCKLRRKEKYPSDENLVKRIGFLVIILPPCMLFLLIV* |
Ga0114002_104812 | Ga0114002_1048122 | F066860 | MTTKKQKLQKQQAIDTWIVIALWVSAIWFSFARGFITGIGGWVLALLAPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFETSKTIAIISGFVVVLSLFVAVAFGIAEDRE* |
Ga0114002_107496 | Ga0114002_1074961 | F078842 | MIISSIYKIADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFMDAELYSPEVKKTYDEALRKFDKLIIPEDDILRAASECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKTHDESSVNTLFRTSYIKYSKEPDDLFRECVLEYSIDESHIQTPIDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEVSISVGYRMFLGLLKKDDKIINQLSCDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSTLNAILGGCIIGGKGWLEMADSARIRQMINSIELDVYEVNL* |
Ga0114002_107865 | Ga0114002_1078651 | F078842 | MIISSIYKIADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFIDAELYSPEAKKTYDEALREFDKLVIPEDDILRAAGECGIEMNRNIVEVDRSELSKKLREVQLSPWRKQIDMAYRKAHDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLKKDDKIINQLRYDFLEYIKSLSSSVFCDNLRKALVRCSDNHKQVILNRSTLNAILGGCIIGGKGWLEMADSARIRQMINSIELDIYEVNS* |
Ga0114002_108724 | Ga0114002_1087243 | F072446 | MKKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGIVPEGTTYRVPVIPRSVEDRTEKEYNDMELGKEAHLVFRATVHGDTINRHKKELKALSLQLNRLTETSIGTSPVLCGVKSIEAVGIAENGNTYDLRGEMKLRIRDYSYRLKYPSGIITLDCENTESLTAKYVVPLGRIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGEVLSFQHKLPSKSVLQELPNKSVLEKHIKFVANKERTQRFYDLEIENFNEETI |
Ga0114002_109769 | Ga0114002_1097694 | F077405 | PWQGRALPTELFPRLLVAKQRGVFYGFISLCQIKFVKKVFDWLKIVQKQKARLK* |
Ga0114002_110559 | Ga0114002_1105594 | F097527 | MIYFKMEKIGNSTHNKEKKTRSENLVFITIPAAGVEPARPCGHWILSP |
Ga0114002_115735 | Ga0114002_1157351 | F103432 | MKLINSLFSLSLLFALSGLFCTTACQDDAEPTQRAGFISTDSLIHAAEVYDGKAFEHVVSTTAAGLRVSEPRRVVPMLPRQLHVTMEGKTVFRRHSLPSVSAYSLRVVAVGDTIYRQKESDAEFNADLDALFRQSIGIAPRLFGVRELSVVGIDRKGKPRDLGNYSYPLLQGKRRNVNFRTKEGVFHEYFEAASVDTFSVKSNWLLKTKAEPSLYAPSFRLLVWEQPAEGCTKLRFTLTLVDG |