Basic Information | |
---|---|
IMG/M Taxon OID | 3300004108 |
GOLD Reference (Study) | Gs0111384 |
GOLD Reference (Sequencing Project) | Gp0097056 |
GOLD Reference (Analysis Project) | Ga0065181 |
Sample Name | Groundwater microbial communities from aquifer in Utah, USA - Crystal Geyser 4/9/14 0.8 um filter (version 2) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | Y |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 480564897 |
Sequencing Scaffolds | 25 |
Novel Protein Genes | 31 |
Associated Families | 28 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 3 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Microgenomates group → Candidatus Woesebacteria | 1 |
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea → Candidatus Nitrosotalea okcheonensis | 1 |
Not Available | 15 |
All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → Acinetobacter baylyi | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium CG_4_9_14_3_um_filter_65_9 | 1 |
All Organisms → cellular organisms → Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → Archaeoglobaceae → Archaeoglobus → Archaeoglobus fulgidus | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Type | Environmental |
Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | freshwater biome → aquifer → groundwater |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
Location Information | |
---|---|
Location | USA: Utah |
Latitude (°) | 38.9383 |
Longitude (°) | -110.1342 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000320 | Metagenome / Metatranscriptome | 1306 | Y |
F000449 | Metagenome / Metatranscriptome | 1126 | Y |
F001258 | Metagenome | 735 | Y |
F002298 | Metagenome / Metatranscriptome | 573 | Y |
F002777 | Metagenome | 530 | Y |
F010074 | Metagenome / Metatranscriptome | 308 | Y |
F016244 | Metagenome | 248 | Y |
F017787 | Metagenome | 238 | Y |
F023250 | Metagenome / Metatranscriptome | 211 | N |
F035627 | Metagenome | 171 | Y |
F038538 | Metagenome / Metatranscriptome | 165 | Y |
F042954 | Metagenome / Metatranscriptome | 157 | Y |
F048766 | Metagenome | 147 | Y |
F049412 | Metagenome / Metatranscriptome | 146 | Y |
F050899 | Metagenome | 144 | Y |
F058694 | Metagenome / Metatranscriptome | 134 | Y |
F060597 | Metagenome / Metatranscriptome | 132 | Y |
F069002 | Metagenome / Metatranscriptome | 124 | Y |
F069627 | Metagenome / Metatranscriptome | 123 | Y |
F075677 | Metagenome | 118 | Y |
F076872 | Metagenome / Metatranscriptome | 117 | Y |
F080880 | Metagenome / Metatranscriptome | 114 | Y |
F080881 | Metagenome | 114 | N |
F082132 | Metagenome / Metatranscriptome | 113 | N |
F089865 | Metagenome | 108 | Y |
F094906 | Metagenome | 105 | N |
F100366 | Metagenome | 102 | Y |
F102531 | Metagenome | 101 | N |
Scaffold | Taxonomy | Length |
---|---|---|
Ga0065181_1049905 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 1245 |
Ga0065181_1078010 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Microgenomates group → Candidatus Woesebacteria | 957 |
Ga0065181_1085858 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotalea → Candidatus Nitrosotalea okcheonensis | 902 |
Ga0065181_1090346 | Not Available | 875 |
Ga0065181_1110661 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 773 |
Ga0065181_1115913 | Not Available | 752 |
Ga0065181_1118429 | Not Available | 742 |
Ga0065181_1126861 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 711 |
Ga0065181_1131496 | Not Available | 695 |
Ga0065181_1131844 | Not Available | 694 |
Ga0065181_1142548 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 661 |
Ga0065181_1142809 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → Acinetobacter baylyi | 660 |
Ga0065181_1144644 | Not Available | 655 |
Ga0065181_1145775 | Not Available | 652 |
Ga0065181_1150190 | Not Available | 640 |
Ga0065181_1150563 | Not Available | 639 |
Ga0065181_1156781 | Not Available | 623 |
Ga0065181_1160076 | Not Available | 615 |
Ga0065181_1168687 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium CG_4_9_14_3_um_filter_65_9 | 595 |
Ga0065181_1169198 | Not Available | 594 |
Ga0065181_1174068 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → Archaeoglobaceae → Archaeoglobus → Archaeoglobus fulgidus | 583 |
Ga0065181_1200710 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 534 |
Ga0065181_1203119 | Not Available | 530 |
Ga0065181_1212501 | Not Available | 515 |
Ga0065181_1213835 | Not Available | 513 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0065181_1049905 | Ga0065181_10499052 | F076872 | GDQRKVFEKPHSLRRIFWGITVLAGNDVWRQTRISFDDPSFYSYYALDGQHKHFEMKGDGISQGDIWVYNASGISLLYTSIEILV* |
Ga0065181_1078010 | Ga0065181_10780101 | F102531 | FPASIIACQTQGADSPACQMGAAGITSGLFADPIGTLQGLQASAQQVQSVAQQGLTELSFASSILNLPNADNSAGWTLGTYGRQTNVIQPQKTVAGINQEIDKLVANGVDKLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK* |
Ga0065181_1085858 | Ga0065181_10858581 | F023250 | MRLKVLQIGSLVPLGWGFTETEAKKPAGFCTIKQATEFAMQNLGLKQGEFKVLKGRGVDL |
Ga0065181_1090346 | Ga0065181_10903462 | F089865 | VEIAGFISIGLLVVVQIGYFAYSFGKLDGTVKSIDKRLNDLTHRYDVMEERIGKLEGRK* |
Ga0065181_1110661 | Ga0065181_11106611 | F080880 | TTIKKKTNEIIEELAKTYGTKNRVLEQAVETLLRVEKVGSCEDCVIKAKMNEQTKLREALDLTSLGRKTLDGLLEVAVGDKTIQDFIKEQKAESKNIIEILRGTIEWKTPSNFKEFTIILEEIRNLTQMFDIASHSEIENTVILRPKAFKRLPEVVAFQTAVMLEGVGAPFEIRMMGEDIAVKMIRQEIYPLRKKEFGESLDQQIEKRLATSRPGLFKNSLMLVGPGFMNWAEKHLEEPVTDLGSIIEDVRIALGVD |
Ga0065181_1114413 | Ga0065181_11144132 | F035627 | MKNIKASVGVFPAQEQKPVEGGGVEEKEYTWKFNETLFYLRENPDCQMAVGGEWHGTKAGGVVGSVDTYFRHILTSRRCETVWDAVDGYAGFDDRWADDFEKIATHIEDLPNIEEIIRTERSMMMSEK* |
Ga0065181_1115913 | Ga0065181_11159131 | F050899 | GEHLKNNSLKEKNGGEKMLEKLRGKMKKGLMLFAIVAVILMGTAVLPGCNKEQPVEADELVVPESMSSYEVFNTTANAGPQIKTWPFWQGGWSAEYMDPYGMPQGASEEQFIVAFGAMLAEGFYTKRDGKQAGAGLFKSVLWITILRYENPESAKRSFINISETQELQDSTYGGIALKNGTHTLTWWEEESEDWDESTMPCYLIQSGPFVIYLFGRDDVAKDILDRIIVSFGVKDSTSISTLAANIASEK |
Ga0065181_1118429 | Ga0065181_11184291 | F075677 | MDLNEIITNITMTKACSIKVDKESKVSKIINLKVKFDGASLSSVFDKALAGAVIQWQNGVGRKRFDTYKPNQVVEIQFSAPARRSAIDPET |
Ga0065181_1122609 | Ga0065181_11226091 | F060597 | MVQTVKSQKVYTGWKVILLEKPHSSRRIFFSIKTLADQTTWCRSLISFDDPSFASHYIFDGPVQQLEAKGEGIFQGDIWAHNVSPVDLIFIITEILI* |
Ga0065181_1126861 | Ga0065181_11268612 | F001258 | MKSIEASVGVIPTQEQKEKVVEEGKKYIRKENEVIFCLDEAPEYQISVVARWGENARGEIVEQPDIYFRHISTGRLCQSVYACLVGVEHDQYAEEKWDTLYYRIRTMLLRDLRRIVCYEKKYDVHPI* |
Ga0065181_1131496 | Ga0065181_11314961 | F069002 | MNYAELSMAISDWLNKDSLDKVLPTIIRFGQRDLEDDLRIRPMEYHPVTANISAATASLALPSDFLELFYLVLIKDDVRYVVDGRESSRALYTERPSATETGTPCKVARVADDLVFDVLTDSAYTRDWFYYRRLPVLVATAPNNTNWWSEYAEEALLMSCLNKASGYVTGISAEDKAKWKEGALFTRENLKFNDAREATGGHVMRSSNWK* |
Ga0065181_1131844 | Ga0065181_11318441 | F002298 | EDKIKKGILSMTLKKIDNMMEKSDDIDRMNLEFYAMIGKTYGPQTVFEQHRAAIKIDTFRRDIGKRWAENMRKREEIYEILDMLYELAKIETENSGMENLGQISWAEIRQKIQERMKKILFR* |
Ga0065181_1142548 | Ga0065181_11425482 | F058694 | MVQTVRSEIIHAGCKKIIFEKPLVLHRIFFSINVLAPLDTWFESRVSFDDPMFFSYYTLTGHTKYFEAKGEGIFQGDVWLFNTSTGDVLYTMTEILA* |
Ga0065181_1142809 | Ga0065181_11428091 | F042954 | MEGVRTRRGLYGFEFRDIDQRRTSEDMPRKRFEIKALWQRSHEIINLASRGYKQSDIAEILNISEACVSTTLNSELGQKKLADIRLVRDEDAKKTSEKIRILTAKAIEKYHEIFDNEDGQATLKDQKDVADTVLLELSGLRAPTKIQSSSINMTLTSEEIEAFKSRGLKAAKE |
Ga0065181_1144644 | Ga0065181_11446441 | F016244 | FKNRIKINKMVLPILPGIIVLAGARILVSYGTHLLRFIVANPKILMGTATVVTVADALKEHEKNEEIRNSILQDIYTQNPELAGKIVSAGGFSFHPVENVFQMVIPWAIIGLIFYALFKKI* |
Ga0065181_1145775 | Ga0065181_11457751 | F080881 | MSKNSKSSVALSELAVLPQFVAPVVEIAPEVLSMIVMDIDPALIEAAEIKEARLDAVKAKKDDAKRLLAQAHEIAKTLNPLCDEQDNVEKERKLLKDVLKDALLATRVALANHPEILENKAQIVKAAPNLMAEFNEKFNAAVIKQTQ |
Ga0065181_1150190 | Ga0065181_11501901 | F089865 | MEITGFLSLGMLIIIQIGYFAYNYGQLNGKVANIDKRLNDLSHSVATIEERIGKLEGRK* |
Ga0065181_1150563 | Ga0065181_11505631 | F017787 | EEKNKEYKEMLDRVRTFILFKDMKKEMEKHKKEYNNMEKIIREGVHTETKEKYTLGVRAKFEVNFQYFEISKHPKFDDECVYLSRKFFFLKNNITDKERKEIEKFENDILNRFYYTQ* |
Ga0065181_1150563 | Ga0065181_11505632 | F049412 | MKTMKMMKTISGKVSSVFPELNCFEINSDIFFHDVKASIVKKLRLGKKITVQCRIKKIKSGDWTFTDFIFLKIINKKPINNKQNNNGKEKRKCRKC |
Ga0065181_1156781 | Ga0065181_11567811 | F000320 | MKNTKTPQNKEYKSEALCREIMYKINQIGQKIYKLKKDVNDVCGIILRKPGGTFFFEGRKAELWEEASDDFFKQWKEIGAEKYEVECLVNLMYDLVLQEEVKRKGKIILNAKRRTMSMAK |
Ga0065181_1160076 | Ga0065181_11600762 | F002777 | MKNIKKNTGANYHISMHTEIRSGKIIQKINEIGRKTYALKKKEGAFYAEALKKARRSTEFPGTDKLREMAYDRFEQEWKEIEDGKDEVNWMIHRLYISIEDEGEEK* |
Ga0065181_1168687 | Ga0065181_11686871 | F069627 | TASEIPIEVDLTDEDAIPIYMRISDKVLHLRRLGMPFTSIAEHLGINPWMAKKAARWGNIRKA* |
Ga0065181_1169198 | Ga0065181_11691981 | F000320 | QATMKNIKMPISKDYKSEALGREILRKINGIGEKIYKLKKDVNDICGLVLSRPGATMFVEGHKAELWNAASDDFSQRWKEIAADKYEVECMVNLMYDLALQEEIKRKGKIIFSTKRRAITTAK* |
Ga0065181_1174068 | Ga0065181_11740681 | F042954 | KGMTMEDVQTRNGLYRFDFREVDQRRVAEGEEKKTYNIKSLWQRSHEIINLAARGYKGTDIAEILGITPACVSLTLNSDLGQKKLSEIRLVRDEDAKKTSEKIRVLTAKAIQTYHEIFDNESGEATLKDRKDVADTVLLELSGLRAPTKIHTSSVSTILTAEEIESFKSRGLRAARETGFIDITEKSDEKCSSD |
Ga0065181_1200710 | Ga0065181_12007102 | F048766 | KNKTAKQRNEEIEEDKKIIDIMCKLGGGIKKNNQKIKKDI* |
Ga0065181_1203119 | Ga0065181_12031191 | F082132 | MVQTVCTISVLAGKKKIILEKPDKLRRIFFEVRTIADQAPVCTTWLSFDDPLFHTYFTFSGPMKYFVAEGPDINQGDIWIYNASGGDLNYTATEILH* |
Ga0065181_1212501 | Ga0065181_12125011 | F010074 | CGTKFNEKIYFRLKRMRCPSCMYGMNGSTHTEKFENDRGIKEIEILHDKEINDIEIVRTDIQKRHKIIPFQTSKADGLNAISAVNRIAEKKKEKEVVTIQEKILANFSDTCINFIREYFFLYNHTIFSPIKYMNIGLTQIRKDPRFIANASLKNTILSEVKTACDELLTAC |
Ga0065181_1213835 | Ga0065181_12138351 | F000449 | MKNKKAGGEDTSPPDLDQEYELRTSAPQEEMERYYARKKASLIRKLRQINKKIKKNIYNPYAGDDEKHKNAPD* |
Ga0065181_1213835 | Ga0065181_12138352 | F038538 | MQATMKNTKTPLTKDYKSETLGREILKKINLIGQKTYKLKKDVNGICGLVLSKPGATMFVEGHKAELWKEASDDFSQQWKEIAAEKYEVECMVHLMYD |
Ga0065181_1215875 | Ga0065181_12158751 | F100366 | MRKLDLKNYTISLPDEKGILRFTPYQFQNTLMDMLPHPRLGLNGPELLKAMEVVEEIEKAKTEVLLSEEHYQLILDTCKKFRGFTKYDAQFLKRIFNCPVVPDEDKGVDLSDNGKEKK* |
Ga0065181_1221330 | Ga0065181_12213301 | F094906 | PNTTTYFVIKAYTATENVQSAQYSVAVSNGDSKVYDVYMNIASGQIRLRFKYFVTSPPDNWEARCKKNGGSWNSWIVDPPSGQQYFTDYVGSGWQTGDQLDIDFYNPSNRSETNIQDTILYGNTCF* |