Basic Information | |
---|---|
IMG/M Taxon OID | 3300004210 |
GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0111384 / Gp0110935 / Ga0066639 |
Sample Name | Groundwater microbial communities from aquifer - Crystal Geyser CG10_big_fil_rev_8/21/14_0.10 |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | Y |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 1491146785 |
Sequencing Scaffolds | 33 |
Novel Protein Genes | 38 |
Associated Families | 30 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1 |
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus → Candidatus Nitrosopumilus koreensis | 1 |
Not Available | 18 |
All Organisms → cellular organisms → Bacteria | 1 |
All Organisms → cellular organisms → Archaea | 4 |
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Nomurabacteria | 1 |
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 5 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Type | Environmental |
Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | freshwater biome → aquifer → groundwater |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
Location Information | |
---|---|
Location | USA: Utah: Grand County |
Latitude (°) | 38.9383 |
Longitude (°) | -110.1342 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000320 | Metagenome / Metatranscriptome | 1306 | Y |
F000449 | Metagenome / Metatranscriptome | 1126 | Y |
F003421 | Metagenome / Metatranscriptome | 487 | Y |
F004880 | Metagenome / Metatranscriptome | 420 | Y |
F007959 | Metagenome / Metatranscriptome | 341 | Y |
F014811 | Metagenome | 260 | Y |
F016244 | Metagenome | 248 | Y |
F017787 | Metagenome | 238 | Y |
F030345 | Metagenome | 185 | Y |
F037574 | Metagenome / Metatranscriptome | 167 | N |
F040819 | Metagenome | 161 | Y |
F042051 | Metagenome / Metatranscriptome | 159 | Y |
F044906 | Metagenome / Metatranscriptome | 153 | N |
F048766 | Metagenome | 147 | Y |
F050899 | Metagenome | 144 | Y |
F065251 | Metagenome / Metatranscriptome | 128 | Y |
F068436 | Metagenome / Metatranscriptome | 124 | Y |
F071959 | Metagenome / Metatranscriptome | 121 | Y |
F075520 | Metagenome | 118 | Y |
F080249 | Metagenome / Metatranscriptome | 115 | Y |
F083694 | Metagenome | 112 | Y |
F085143 | Metagenome | 111 | Y |
F087386 | Metagenome / Metatranscriptome | 110 | Y |
F087940 | Metagenome / Metatranscriptome | 110 | Y |
F089865 | Metagenome | 108 | Y |
F091390 | Metagenome | 107 | Y |
F094578 | Metagenome / Metatranscriptome | 106 | Y |
F094906 | Metagenome | 105 | N |
F097603 | Metagenome / Metatranscriptome | 104 | Y |
F099485 | Metagenome / Metatranscriptome | 103 | Y |
Scaffold | Taxonomy | Length |
---|---|---|
Ga0066639_10001261 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 22534 |
Ga0066639_10015783 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 5893 |
Ga0066639_10030887 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus → Candidatus Nitrosopumilus koreensis | 3992 |
Ga0066639_10031560 | Not Available | 3942 |
Ga0066639_10117459 | Not Available | 1792 |
Ga0066639_10121453 | All Organisms → cellular organisms → Bacteria | 1756 |
Ga0066639_10149030 | Not Available | 1545 |
Ga0066639_10205900 | All Organisms → cellular organisms → Archaea | 1258 |
Ga0066639_10241226 | All Organisms → cellular organisms → Archaea | 1134 |
Ga0066639_10264761 | Not Available | 1065 |
Ga0066639_10316084 | Not Available | 944 |
Ga0066639_10326456 | All Organisms → cellular organisms → Archaea | 923 |
Ga0066639_10328346 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Pacearchaeota → Candidatus Pacearchaeota archaeon | 919 |
Ga0066639_10339217 | Not Available | 899 |
Ga0066639_10347205 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Nomurabacteria | 884 |
Ga0066639_10351263 | All Organisms → cellular organisms → Archaea | 877 |
Ga0066639_10365011 | Not Available | 854 |
Ga0066639_10398782 | Not Available | 802 |
Ga0066639_10444974 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 741 |
Ga0066639_10488997 | Not Available | 691 |
Ga0066639_10513897 | Not Available | 667 |
Ga0066639_10520457 | Not Available | 660 |
Ga0066639_10548205 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 635 |
Ga0066639_10560809 | Not Available | 624 |
Ga0066639_10569388 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 617 |
Ga0066639_10581512 | Not Available | 607 |
Ga0066639_10591209 | Not Available | 600 |
Ga0066639_10592553 | Not Available | 599 |
Ga0066639_10604056 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 590 |
Ga0066639_10616738 | Not Available | 581 |
Ga0066639_10622128 | Not Available | 577 |
Ga0066639_10696278 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 529 |
Ga0066639_10709822 | Not Available | 520 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0066639_10001261 | Ga0066639_1000126121 | F042051 | MTTTFGEVSWNDDVFSGSEKKNSKDLFLRLDEGSNEMRLITQPFQYLVHKYKKEGDPGFGQKVNCSAVHGSCPLCAAGDKAKPRWLLGVISRKTGTYKILDVSFAVFSQVRKYARNTARWGDPTKYDIDIVVDKNGGATGYYAVQPIPKEPLSAADQQIKDSVDFDDLKRRVTPPTPDMVQKRIDKINGVTGEAAEAAPTPSGKAAKAATKAAPAPVNMSEEEDESFPAYDGDQAK* |
Ga0066639_10015783 | Ga0066639_100157833 | F040819 | MTKAQEIYEKVEALVATGVPKADAFRQVAEEFGQPFNSMRGAYYAHSRTITGGSSRPRRRQTTTADAVESAAQLLRRALESIDDEVLAAKARAEEAKAEYEALRDSVKERKAAIEAKIDALTS* |
Ga0066639_10030887 | Ga0066639_100308871 | F099485 | MNQKIIHNWQHSPSETQVASNPVKAVSKSLQQTKSLVFTNEDLMG |
Ga0066639_10031560 | Ga0066639_100315602 | F091390 | MAIKNTESTNTTNTTTIRIEKSIKEELENLDFVRKNTFNEILSTLIEFYNKNKKGAKNEK |
Ga0066639_10117459 | Ga0066639_101174591 | F083694 | VNRLEKLKNRQLARFLNHLKKTGQLTPGLESDVKRAYSFAFEDVEALILGLDKEKEDDNFKKA* |
Ga0066639_10121453 | Ga0066639_101214534 | F014811 | MWIYDYKTKKEYFASSPRSKFYCYQSIQIEPIPGKPCTILVKKGAEEGRKPAQTFEQLGMFTNQSIPYHASFKK* |
Ga0066639_10149030 | Ga0066639_101490303 | F085143 | MARIDDGFATLIEFAEDSDVQMWEKEVTPPGVSGGGENDTSTMRNTTWRTKSPKGLMSLSEASLVVAYDPAVYNEIIIMLNVNQQITITFADSSTLVFWGWIDEFTPGAAAEGSQPTATVKIIPSNQNGSGVETAPQYSVAP* |
Ga0066639_10205900 | Ga0066639_102059001 | F065251 | MMYLRFFKKGRKKYYYIAKAVREGDRVIQKSILYIGTADTLYKKLIQLKKKSK* |
Ga0066639_10241226 | Ga0066639_102412263 | F094578 | MNNQKERLKWPLQRLKGMYFAMHCWKCKKFELPMEEYKQRCENNIQKIIDSLEIKEMKFKEFNDIIKSTTFCKFDKRGFSNNCPICKKLGISRRGL* |
Ga0066639_10264761 | Ga0066639_102647612 | F087940 | MINKNKRQLEWGISFWIGIGLMSIINVIKITYTQHTISYNWIMFGISIIAIIICV |
Ga0066639_10316084 | Ga0066639_103160841 | F091390 | MNTKSTKSTTIRINEPTKEKLETLDFVRKHTFDDILTELMDFYEKNKGKRTK* |
Ga0066639_10326456 | Ga0066639_103264561 | F065251 | MYLRHFIKGRKKYYYIAKSIRIKNRIIQKSILYVGTTDGLYEKLIKLKKN* |
Ga0066639_10328346 | Ga0066639_103283462 | F094578 | MEQKERLKWPLKRLKGMYFAMHCWKCKIWENSIEGYKKRCEDNIQKIIDNIKVKEMKFKDFNKIIKNSTFCQFEKRDFSDNCPICKKWGISRR* |
Ga0066639_10339217 | Ga0066639_103392171 | F065251 | MYLRSFKKGKKKYYYIAKAVRIGKRVIQKSILYLGTADNIYKKLHTK |
Ga0066639_10347205 | Ga0066639_103472052 | F080249 | MADNKNKVKEDRNLISFKENYEVYYAVNQLKKQFPDETKSNIKEALFDAAKQVSPSEGREKIMRLTRKELNS* |
Ga0066639_10351263 | Ga0066639_103512631 | F065251 | MYLRHFTKGKKKYYYIAKAVRKSTSVIQKSILYIGTADTLYEKLISLKKK* |
Ga0066639_10365011 | Ga0066639_103650112 | F065251 | MYLRSFKKGKKRYYYIAQAVRKGKRVIQKSVLYLGNADNIYKKLHTK* |
Ga0066639_10398782 | Ga0066639_103987821 | F071959 | MASRNYLVMSNDMTLSDKKEYRLNALSAGLERCGLRGIGDIKADIPGLAGIPDANKVARVKLIHNYLITGQWPRSIDQRELTTGTDLVVAPAVDSWLTAPMAAVGNIVSCFQGVAAPQLVQGKLMVCYAVSVESSAVPMPVSRLIFRRGAAGNVQAQFDMEPMGIRWEVDAFFSEVVVIDPQDVFAIQVRCRNATAVAEIVHIHNFLFESAGLVVA* |
Ga0066639_10444974 | Ga0066639_104449741 | F000449 | MKNKKAGEGISPSPDLDQEYELRTSAPKEEMERYYARKKASLLRKIRKINKKIKKNIYNPFINDDE |
Ga0066639_10447732 | Ga0066639_104477322 | F094906 | SYPLVDAGDYYYINLQFKTNKSVTKIEVNWTRAGGYNFSKFKNTSGTIHSLPIKYLRPNTTTYFVIKAYTATENVQSAQYAVAVPNASQKVYEVEITFSTVVRLYWSYFVDSPPNNFSAQYNRNGTDWYPWTVYRTGQQYSTNDGSGWQAGEQLEIKIFNPADKSETNIQDSILYGNTDF |
Ga0066639_10448576 | Ga0066639_104485761 | F097603 | KVPILTSKFGAYMPIPFDFKESPFIAVPYLKLPNEFEGYGLPMLLENPQIMLNMIKNQRLDAVTLNIHKMWIVNPLANINKAELVTRPFGIIYSTDPNGVREVQFSDVKQSAYREEEMTKSDMRYASGVDDFSMGVGGPASSATEVRHLRESTLERVRLFVNHLGEGYAKLMRYWISMYRQFMSEPLKIRITGENGEVQFPMVEKDDLVGEYDFKATVIPSIAGKNDVDKKQNMDLFQLLSQMPF |
Ga0066639_10488997 | Ga0066639_104889972 | F075520 | MKTLHYILNTKTDESFDTIEMKFYPNNWEPEMEEDKDYLQSLIDEDPEKFKNCIIETRTFE* |
Ga0066639_10513897 | Ga0066639_105138971 | F030345 | MELKNEKIYIQTIEKRERVNDICWYAITGNQGKRFSCFEAEVAKKLQINRVNLCKVRYFGKYSNIMSVDGYEDNPEVANSNSEIAKQREVESLRILKCVALKSASLCFEGQNANSDEVITKANSFVKWLFSTEDKGAI* |
Ga0066639_10520457 | Ga0066639_105204572 | F094578 | MEKEKPKWPLQRLKGMYFAMHCWKCKKFELPMEEYKARCEDNIQRIIDNLNIKEMTFRGFNNIIKNSTFCEFEKRECFRKNN* |
Ga0066639_10548205 | Ga0066639_105482051 | F048766 | MKNNKTAKQRNEETEEDKKIIDIVCKLGGGFKEKNNQKNKKTNKI* |
Ga0066639_10560809 | Ga0066639_105608091 | F068436 | MEKEKKSAENAQGKGIVNEIYAEIEKKNREYEKRMNLIEAIF |
Ga0066639_10560809 | Ga0066639_105608092 | F017787 | KEEYANMEKIIREGEYSETKEKYTLGVRARFEVNFQYFEISKHPKFDDECVYLSRKFFFLKNNISDDEKEKIEKFENNILNKFYYGK* |
Ga0066639_10569388 | Ga0066639_105693881 | F037574 | LLSCNKMTGIFADEVSEWTTITISKDTAKKLKEFFGGEETYNYVITFLLDYYDGKTYRF* |
Ga0066639_10569388 | Ga0066639_105693882 | F007959 | MERHTDFKKFKQEYKKDFDIPVLSTKEKDIFMYLFLLLRKKMSRGIFPELYNSEIMFSKSDLKTLVLKNIVIFQNYKRGWIVSINSHYITKNTECSFCGAKFNEMVYFRRNSIRCPGCGIRMHGLVTAKRVNDYSVAIANIEKVEAIPKVTTS |
Ga0066639_10581512 | Ga0066639_105815121 | F091390 | TIRISKKTKERIEKLDFVRKDTFNEILNRLIDLYEKKKK* |
Ga0066639_10591209 | Ga0066639_105912091 | F050899 | SYEVFNTTANAGPQIKTWPFWQGGWSAEYMDPYGMPQGASEEQFIVAFGAMLAEGFYTKRDGKQAGAGLFKSVLWITILRYENPESAKRSFINISETQELQDSTYGGIALKNGTHTLTWWEEESEDWDESTMPCYLIQSGPFVIYLFGRDDVAKDILDRIIVSFGVKDSTSISTLAANIASEKTDSLSLYYGGGNSSNT |
Ga0066639_10592553 | Ga0066639_105925531 | F016244 | KMVLPILPGIIVLAGARILVSYGTHLLRFIIANPKILLSTATVVTVADALKEHEKNEQIRNSILQDIYTQNPELAQKIVSAGGFSFHPVENMFQIAISSAITGLIIYAIIQKI* |
Ga0066639_10604056 | Ga0066639_106040561 | F003421 | MENKKVGEEDIPSPDLDQEYELRTSAPQEEMERYYARKKAFLLRKIRKINKKIKKNIY |
Ga0066639_10616738 | Ga0066639_106167382 | F004880 | SMDYWEEINNGNKYLCDKCGIVALGAFEYDNYIKFQYHYCELCWNYIHLKKGSCSCGNTMTNRNEYPTMKVLCSCGEPVELKIDC* |
Ga0066639_10622128 | Ga0066639_106221281 | F089865 | VEIVGFVSIGLLVVIQIGYFAYTFGKLNGKVASIDKRLNDLAHRYDRMEERIGKREGRK* |
Ga0066639_10696278 | Ga0066639_106962782 | F044906 | MKMNCDNKKEHMTIEIKRNIANRLKKMSSVGVTYDDIITDLIGYYRATK* |
Ga0066639_10709822 | Ga0066639_107098221 | F000320 | KKIKIKTNMMKNTKTPPSKGYKSEVLGREILRKINLIGQKTYKLKKDVNDICGLILSEPGATMFVDGHKAELWTAASDDFSQRWKEIAADKYEVECMVHLMYDLVLQEEIKCKGKIVFSTKRRAMSMVK* |
Ga0066639_10722528 | Ga0066639_107225282 | F087386 | MTKEELFEKYHINESHNVWDNGIDNWMSVEVYRIMHDGNLPPEGDQSTSYVCEFLDKVKEHGAFFSELRKRTPDDFGSLFLTSKRMVYTLADEILKELNNE* |