Basic Information | |
---|---|
IMG/M Taxon OID | 3300025007 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0111384 | Gp0110940 | Ga0210038 |
Sample Name | Groundwater microbial communities from aquifer - Crystal Geyser CG15_big_fil_post_rev_8/21/14_0.20 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | Y |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 356611405 |
Sequencing Scaffolds | 27 |
Novel Protein Genes | 40 |
Associated Families | 34 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria | 1 |
All Organisms → Viruses → Predicted Viral | 2 |
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium RIFCSPHIGHO2_12_FULL_63_22 | 1 |
All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → unclassified Nitrosomonadales → Gallionellales bacterium RBG_16_57_15 | 1 |
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus → Candidatus Nitrosopumilus koreensis | 1 |
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 5 |
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1 |
All Organisms → Viruses → environmental samples → uncultured archaeal virus | 1 |
Not Available | 12 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Type | Environmental |
Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | freshwater biome → aquifer → groundwater |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Utah: Grand County | |||||||
Coordinates | Lat. (o) | 38.9383 | Long. (o) | -110.1342 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000449 | Metagenome / Metatranscriptome | 1126 | Y |
F001911 | Metagenome / Metatranscriptome | 618 | Y |
F002651 | Metagenome | 539 | Y |
F003014 | Metagenome | 513 | Y |
F006336 | Metagenome / Metatranscriptome | 375 | Y |
F007959 | Metagenome / Metatranscriptome | 341 | Y |
F012423 | Metagenome / Metatranscriptome | 280 | Y |
F016112 | Metagenome / Metatranscriptome | 249 | N |
F016358 | Metagenome | 247 | Y |
F016496 | Metagenome / Metatranscriptome | 246 | N |
F017787 | Metagenome | 238 | Y |
F019951 | Metagenome / Metatranscriptome | 226 | N |
F021923 | Metagenome / Metatranscriptome | 216 | N |
F026802 | Metagenome / Metatranscriptome | 196 | N |
F039475 | Metagenome / Metatranscriptome | 163 | N |
F050143 | Metagenome / Metatranscriptome | 145 | N |
F054576 | Metagenome / Metatranscriptome | 139 | N |
F055778 | Metagenome / Metatranscriptome | 138 | Y |
F060596 | Metagenome / Metatranscriptome | 132 | N |
F062438 | Metagenome / Metatranscriptome | 130 | N |
F068436 | Metagenome / Metatranscriptome | 124 | Y |
F068439 | Metagenome / Metatranscriptome | 124 | N |
F073076 | Metagenome / Metatranscriptome | 120 | N |
F075683 | Metagenome | 118 | N |
F076873 | Metagenome / Metatranscriptome | 117 | Y |
F079604 | Metagenome / Metatranscriptome | 115 | N |
F083827 | Metagenome / Metatranscriptome | 112 | Y |
F093254 | Metagenome | 106 | Y |
F093503 | Metagenome | 106 | Y |
F094905 | Metagenome | 105 | N |
F094906 | Metagenome | 105 | N |
F096615 | Metagenome / Metatranscriptome | 104 | N |
F102531 | Metagenome | 101 | N |
F104464 | Metagenome | 100 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0210038_1002233 | All Organisms → cellular organisms → Bacteria | 8544 | Open in IMG/M |
Ga0210038_1010846 | All Organisms → Viruses → Predicted Viral | 3006 | Open in IMG/M |
Ga0210038_1012478 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2754 | Open in IMG/M |
Ga0210038_1026728 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium RIFCSPHIGHO2_12_FULL_63_22 | 1667 | Open in IMG/M |
Ga0210038_1037063 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1338 | Open in IMG/M |
Ga0210038_1038383 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → unclassified Nitrosomonadales → Gallionellales bacterium RBG_16_57_15 | 1307 | Open in IMG/M |
Ga0210038_1040001 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus → Candidatus Nitrosopumilus koreensis | 1271 | Open in IMG/M |
Ga0210038_1045756 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 1161 | Open in IMG/M |
Ga0210038_1046856 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1142 | Open in IMG/M |
Ga0210038_1056621 | All Organisms → Viruses → Predicted Viral | 1004 | Open in IMG/M |
Ga0210038_1057638 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 992 | Open in IMG/M |
Ga0210038_1066553 | All Organisms → Viruses → environmental samples → uncultured archaeal virus | 900 | Open in IMG/M |
Ga0210038_1078705 | Not Available | 804 | Open in IMG/M |
Ga0210038_1080830 | Not Available | 790 | Open in IMG/M |
Ga0210038_1088275 | Not Available | 745 | Open in IMG/M |
Ga0210038_1088984 | Not Available | 741 | Open in IMG/M |
Ga0210038_1089016 | Not Available | 741 | Open in IMG/M |
Ga0210038_1098489 | Not Available | 693 | Open in IMG/M |
Ga0210038_1102214 | Not Available | 677 | Open in IMG/M |
Ga0210038_1104539 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 667 | Open in IMG/M |
Ga0210038_1105343 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 664 | Open in IMG/M |
Ga0210038_1115049 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 628 | Open in IMG/M |
Ga0210038_1127564 | Not Available | 589 | Open in IMG/M |
Ga0210038_1130149 | Not Available | 582 | Open in IMG/M |
Ga0210038_1153880 | Not Available | 525 | Open in IMG/M |
Ga0210038_1158716 | Not Available | 515 | Open in IMG/M |
Ga0210038_1164810 | Not Available | 503 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0210038_1002233 | Ga0210038_10022334 | F093503 | MKKYKVKIKHTDKEEIIKADSELEARVKFCEQNNLNYRHLAGKLEITLNNKPLQNNL |
Ga0210038_1010846 | Ga0210038_10108465 | F075683 | MTHIYTANMQPATGWIDQCGEVEIRGGFGVGEMIRCVCCGKKRPAEDCVVQCYYDGMSVWCAEGKGCKSHAEIEQKRLIAHENRSRGQRARRAKERLLLAAVVPLD |
Ga0210038_1012478 | Ga0210038_10124783 | F096615 | MAGCFGNNPEDRARERELNNYLDSADRLDNNDERIKELARDKFNALPSFYSPKGEKYRYTFMDDAMGSLKQEELIAIARLLRDGEHLQAGKLLEASLMRVLVAEAEEEIEDEEYD |
Ga0210038_1026728 | Ga0210038_10267282 | F054576 | MEFQLGLPELIAFALANIGGGWALLRISFSQFEMRLDDRFKLLDKAVNDVKRIELEIVRADTRNAQTYVTQANHDKVLERIFNVLSSMEQKLDGKANAADCEAKLLRRMERK |
Ga0210038_1033468 | Ga0210038_10334685 | F094905 | MTTIYGYIAAAVLIISLGAGWAHEHDKRIVFKAQVEQAGKDAAKHTAEIDAKHREEMQNAEQNTIIATNSIADWYRAH |
Ga0210038_1037063 | Ga0210038_10370632 | F021923 | IKMNLTDFFNNYEFIKKLQNHLTVNKENLLNDFPNNNIEKSFNQLLLSPSNAYLAKEIIPQTFADMTWKFKLLEGQQTLTSPSPREFTLTYESQTSRSIFYYETPISGSCYVECNFVDVEPGTTNCIFLGNANNITSKTYLAIQTFPLSSFQGNKIGDGWVAIMEFNNSNIHNPNVLSWSYDIEPTGTFRLSKNTSTLKLHHNRTYSLGAKTTFIGEDLHAGFFSYAFASSKIKKTTFKNLSCGNY |
Ga0210038_1038383 | Ga0210038_10383833 | F054576 | MEFQLGLPELIAFVLANIGGGWALLRLSFAQFEQRIKDQFELLDKAVNDVKRIELEIVRADTRNAQTYVAKTSHDKVLERIFNVLSSMEQKLDGKANAADCEAKHLRHMERK |
Ga0210038_1040001 | Ga0210038_10400012 | F055778 | MLSSVDLPEPELPTIKTNSPCLIENETLSRALTLLSPCP |
Ga0210038_1045756 | Ga0210038_10457561 | F007959 | METVINFEKFKQEYKKDFDIPVLSAKEKDIYIYLFLSLRRKMSKGMFPELYNSEIMFSKSDLKALVLKNIVIFQNHKKGWIISMNPKYITKTTECSFCGAKFNEIVYFRRNSITCPGCGFRMHGLVTAKRVNDYSVAIANIEKVEAIPKVTTSVVKVPIEHVITDVPGNLKI |
Ga0210038_1045756 | Ga0210038_10457564 | F060596 | MENKEASVGAIPIPDENVAKKKMAKNRTMVLLDRKVVDRLCCCKRKIGNTYNSVISNLLDKYENGQ |
Ga0210038_1046856 | Ga0210038_10468562 | F083827 | MTGCQGSVGAVENGSVVFQGAVGAFVSVRSRGRFHRRWRTRIAKVTDDIGRFDDESR |
Ga0210038_1056621 | Ga0210038_10566212 | F054576 | MEFQLGLPEIIAFVLANIGGGWALLRISFAQFELRLDDRFELLDNAMADVKRIELEIVRADTRNAQTYVTQTSHDKVLERIFNVLSSMEQKLDGKADAADCDMKIMRIMERK |
Ga0210038_1057638 | Ga0210038_10576381 | F001911 | MKRPIEFEKFKQEYKKDFDIPVLSTKEKDIFMYLFLLLRRKMSKGIFPELYNSEIMFSKSDLKTLVLKNIVIFQNYKKGWIISMNPNCITKNAECSFCGAKFNEMVYFRQNSISCPGCGFRMHGLATAKRVNDYSVAITNIEKVEAIPKVTTSVVKVPIEHVITDVPGNLKVTDIVLAPTAMERTIQIANQEVIPFQTGKADKLNALSTLKSIENKKKEANALVVQDNILANFSPISINFLKEYFFIQHTFFSARKYINLKLTEISKDPRFIANASLKNNTLTEVKNACERLCESEEIRGITLTEE |
Ga0210038_1066553 | Ga0210038_10665532 | F026802 | MYYYVPFSRNLTQGNTEEIAAADPGHLVGVFLRFKSGVEGFHPLRGQISISNIGTTKEYLLKNGIYAIFQRGTDGFVLNDWYQPAFIPLDHDIRANHQVLLETFTWGANFISVRGHFVYS |
Ga0210038_1066553 | Ga0210038_10665533 | F019951 | ILFKAFQINVSANNVKCEANNKLHVQKGQTIILHAIQIAPGIDTADENVFIMLYRDQEQLAGEGIYHAIIDDYTHTIEFDADAVEGQTFVTKAWGPNARTIHGVYMYEITS |
Ga0210038_1078705 | Ga0210038_10787052 | F054576 | MEFQLGLIELIAFAVANIGGWWALLRISFTQFELRIKDQFKLLDKAVNDVKHIELEVVRADTRNAQTYVTQTNQDKALERIFNVLSSMEQKLDGKANAADCEAKLLRHMERK |
Ga0210038_1080830 | Ga0210038_10808301 | F102531 | AAMGITAGYFSDVIGSEQALQASAQQVQSVAQQGLTELSFASSILNLPNADNSAGWTLGTYGGQTNVIQPQKTVAGINQEIDKLVANGVDKLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK |
Ga0210038_1083882 | Ga0210038_10838821 | F094905 | MTTLYGYIAAIVLIVFLGAGWAHEHDKRIVFEAQVEQAGKDAAKHTAEIDAKHLEEMKNAEQNTIIATNSIADWYRAHPAVRVRYANADCSAV |
Ga0210038_1085889 | Ga0210038_10858891 | F073076 | MVVLPMQSGVRTIEKIPKISLVSTIDQQEGFFYDVWKNPEKYGYTKLKMTWQECDGYTKEDMRKKKIEIGTRAFASQYECEAQSTTSSFFTHSIIEGSIRNCDYKHGITIGGIDLAKKKDYASISVVEKRDNNFNLIVNFQTQLNYTDLARISKQYEKEYLTTSFLVDTTTGEEFVDFASKEPYLVSLKPFAFTLNSKKQILDYLRIVMEQKRLVIPERFQELISDMRRYQYSDHLPDSISSLALSLWNEKAL |
Ga0210038_1088275 | Ga0210038_10882752 | F068439 | MKSENESVDTKSTRDKKKICEACQKRETEENDSIIKNVEEIEIYDIKKTGKNYSYRIFFCTIENECFFEYYSYNDIKRFNKERTVFLSLLTDKNKKKVYDFERKMHEKYPEYLYKMKNKKVTYPEYKNIKEYRISENYTVLFFTIGQHAIFQYFEDDGLIVYNEVDRSCVLHWLSGEETERLFNFEDEIKEEYAKYIF |
Ga0210038_1088984 | Ga0210038_10889842 | F093254 | MSDTTTSYRKMKAELRERTEGDFTLLSQDEADALSLARWREYSDWAKQEPIVESAVPEILYDAIHGDGTAASCSYETLRRAALAYLAAGTRNDWSCP |
Ga0210038_1089016 | Ga0210038_10890161 | F039475 | RMEKLNVLIYTKDEDYNPLGSVQLELFQLDDAGNETYIGKSISGDDGRLEVSKTALGGEGARIRVRNAKRLSGEDIVQTSDGIYSTFIVSSDETVQEVEIVFQIKKKHNISINVTWK |
Ga0210038_1089016 | Ga0210038_10890162 | F012423 | MVVTKYKGDNCVIAIGISNDADSTAQNYGLSLCLKDEDGDEYTLTMTGLVGTLLAGGFKKITDTDKIVKNKFYFEKGTTYNYNITATFPEMKEGKVYGQLGIYDGTDEDSAWKAGTTYDH |
Ga0210038_1091879 | Ga0210038_10918792 | F079604 | MTKARRGITLIKNRKKFKLDTFLQDIRKSPVAFTEICQYQGRQVQKFKQTKRILRLISENNKSIAVLPRGASKSFSLAIIALWYFYTCENFRVAIFSRSHRQSKAVLEICSDIIDSSPLLKTSRQSFQIDQKQRLKSHINSEIIAHPFDASTVLGEHPDIVL |
Ga0210038_1095751 | Ga0210038_10957511 | F094906 | LSFNTNKSVTKIEVNWFRADGWNFSKYKNTSGTSHILPIKYLKPNTFTYFVIKAYTATENVQSAQYSVAVSNGDSKVYDVYINVSGSSVRLSWRYFVANPQSNFESWYRKNGGSWYSRSITRIGQKYYWTNYADIGWQAGDQMDIQIFNPADRSETNIEDTILYGNIYF |
Ga0210038_1098489 | Ga0210038_10984891 | F017787 | EKIIREGEYSLTKKKYTLGVRAKFDVKFEYFEVTKYPNFKDECVYLSRKFFFLKNIITDDEKKEIEKFENEILNKFYYTK |
Ga0210038_1098489 | Ga0210038_10984892 | F068436 | MENKKKNAENAWEKGIVNEIYEEIEKKNREYEKRMNLIEAILN |
Ga0210038_1100788 | Ga0210038_11007882 | F079604 | MTARRGITFSKNRRKFKLDTFLQDIRKSPIAFMEICRYQGKQIQKFKQTKRILRLISENNKSIAVLPRGASKSFSLAIIAIWYFYTHENFRVAIFSRSHRQSKAVLEICSDIIDSSPLLKTSRQSFQIDQKQRLKSHINSEIIAHPFDASTVLGEHPDIV |
Ga0210038_1102214 | Ga0210038_11022142 | F050143 | MTETDVKDAMVRGISNAGAAHTFGVNYPRQDPIQSAISAIQKGIWEKNFRDSIGKWEKKLALVTIEEWKAATIAAASMYAEKASTIGAEEWGKYYDKAKSVIESAATEYVKSDKKKENMIKFWTDMQKLKNL |
Ga0210038_1104539 | Ga0210038_11045391 | F000449 | MKNKKEGGEDTPSPDLDQEFELRTSAPQEEMERYYARKKASLERKLRRLNKKIKKNIYNP |
Ga0210038_1105343 | Ga0210038_11053432 | F076873 | MEKWKKFTVENKKDFILPRNLSPREKDIFMYIFRLLREKVNRGRYPELYNDEIIYSKSDVKNLILKEIVIFKQYKKGWIITIHPTHITKNVECPWCGARFNEKIYFRQKRMRCPSCMWGMNGSTVAEKFEEINDVSNSNITENSTKTEKVTTDIKLAPTSIERTMQIARQKIIPFQTSKADVLNAISALN |
Ga0210038_1115049 | Ga0210038_11150492 | F002651 | MKNIEASVGAVPTQKKKEKVEGEGKKYIRKENEVIFCLEE |
Ga0210038_1127564 | Ga0210038_11275641 | F006336 | DISVNFIREYFFMYNHTIFSTMKYMNIGLREIRKDPRFIANPSLKNTVLSEVKTACDELLRACRPLQQPNEYVTNEYVTNEYVTALKLL |
Ga0210038_1130149 | Ga0210038_11301491 | F003014 | MKPPHNGGRGTTKNLYQREKEGTMPRRGSKQPYKPKVEYKYTIKDIAELAGMTRNALSVAKVHEKIEPGDFKNVVSFLTRRIIDNRLTGDLFTPTGGKAKRMKGSKSRAHVSGKKPKR |
Ga0210038_1146178 | Ga0210038_11461781 | F062438 | SEIIAHPFDASTVLGEHPDIILADECAFFGDDSFFRMVVLPMQSGVRTIEKIPKISLVSTIDQDEGFFYDVWKNPDKYGYTKLKMTWQECDGYTKEDMKNKKIEIGTRAFASQYECEAQSTTSSFFTHSIIEGCIRGCDYLNGITIGGIDLAKKKDYASISVVEKRDNNFNLIVNFQTQL |
Ga0210038_1146386 | Ga0210038_11463861 | F104464 | MAKIILLIEGEVKDGEELKALNKAFNILTTLFQVEIRNLAPREQKKXTNTEKATSSS |
Ga0210038_1153880 | Ga0210038_11538801 | F016112 | MWGSIEKVRNICGITKEQINDIVIHDMLEQTDRKVRDRVYIFRRLTMQIDKIGTNKIRFDVNKIGDGNMDTSIDYTDISIYRRNNGVYELFPIQKLDILSNEITFQTDIGENYDMIVEYFEDNFHFSIDTLSDASSLLAAAMCMRSLPLTSQTNNDAK |
Ga0210038_1158716 | Ga0210038_11587161 | F096615 | MAGCFGNNPEDRARERELNNYLDAAARLDNDDERIKELARDKFNALPNFYSPKDEKYRYTNMDDAMGSLKQEELIAIARLLRDGEHLQAGKLLEASLMRVLVAEAEEEIEDEE |
Ga0210038_1159734 | Ga0210038_11597341 | F016358 | FINMIQRTDSKNNIHANKSWAKQSPAMKYAALKTIISKSDIFVLPE |
Ga0210038_1164810 | Ga0210038_11648102 | F016496 | MEDKKKEDKYFFKKGNKYQQKGSAGLRYGGTTPVAKDKEKNLERLRCGWKPKTIVCKNITISDDEEAFANEFSAKVSEENDTYVDTVLIELAIAQILQVHRVYVYAKEKKISRDASRMIGTVLSTLREMNATKNARKEDNI |
⦗Top⦘ |