NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300025007

3300025007: Groundwater microbial communities from aquifer - Crystal Geyser CG15_big_fil_post_rev_8/21/14_0.20 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300025007
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0111384 | Gp0110940 | Ga0210038
Sample Name: Groundwater microbial communities from aquifer - Crystal Geyser CG15_big_fil_post_rev_8/21/14_0.20 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: Y
Use Policy: Open

Dataset Contents
Total Genome Size: 356611405
Sequencing Scaffolds: 27
Novel Protein Genes: 40
Associated Families: 34

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 1
All Organisms → Viruses → Predicted Viral | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium RIFCSPHIGHO2_12_FULL_63_22 | 1
All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → unclassified Nitrosomonadales → Gallionellales bacterium RBG_16_57_15 | 1
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus → Candidatus Nitrosopumilus koreensis | 1
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 5
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1
All Organisms → Viruses → environmental samples → uncultured archaeal virus | 1
Not Available | 12

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets
Type: Environmental
Taxonomy: Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater biome, aquifer, groundwater
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: USA: Utah: Grand County
Coordinates: Lat. (°) 38.9383 | Long. (°) -110.1342 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000449 | Metagenome / Metatranscriptome | 1126 | Y
F001911 | Metagenome / Metatranscriptome | 618 | Y
F002651 | Metagenome | 539 | Y
F003014 | Metagenome | 513 | Y
F006336 | Metagenome / Metatranscriptome | 375 | Y
F007959 | Metagenome / Metatranscriptome | 341 | Y
F012423 | Metagenome / Metatranscriptome | 280 | Y
F016112 | Metagenome / Metatranscriptome | 249 | N
F016358 | Metagenome | 247 | Y
F016496 | Metagenome / Metatranscriptome | 246 | N
F017787 | Metagenome | 238 | Y
F019951 | Metagenome / Metatranscriptome | 226 | N
F021923 | Metagenome / Metatranscriptome | 216 | N
F026802 | Metagenome / Metatranscriptome | 196 | N
F039475 | Metagenome / Metatranscriptome | 163 | N
F050143 | Metagenome / Metatranscriptome | 145 | N
F054576 | Metagenome / Metatranscriptome | 139 | N
F055778 | Metagenome / Metatranscriptome | 138 | Y
F060596 | Metagenome / Metatranscriptome | 132 | N
F062438 | Metagenome / Metatranscriptome | 130 | N
F068436 | Metagenome / Metatranscriptome | 124 | Y
F068439 | Metagenome / Metatranscriptome | 124 | N
F073076 | Metagenome / Metatranscriptome | 120 | N
F075683 | Metagenome | 118 | N
F076873 | Metagenome / Metatranscriptome | 117 | Y
F079604 | Metagenome / Metatranscriptome | 115 | N
F083827 | Metagenome / Metatranscriptome | 112 | Y
F093254 | Metagenome | 106 | Y
F093503 | Metagenome | 106 | Y
F094905 | Metagenome | 105 | N
F094906 | Metagenome | 105 | N
F096615 | Metagenome / Metatranscriptome | 104 | N
F102531 | Metagenome | 101 | N
F104464 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0210038_1002233 | All Organisms → cellular organisms → Bacteria | 8544
Ga0210038_1010846 | All Organisms → Viruses → Predicted Viral | 3006
Ga0210038_1012478 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2754
Ga0210038_1026728 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium RIFCSPHIGHO2_12_FULL_63_22 | 1667
Ga0210038_1037063 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1338
Ga0210038_1038383 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → unclassified Nitrosomonadales → Gallionellales bacterium RBG_16_57_15 | 1307
Ga0210038_1040001 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus → Candidatus Nitrosopumilus koreensis | 1271
Ga0210038_1045756 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 1161
Ga0210038_1046856 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1142
Ga0210038_1056621 | All Organisms → Viruses → Predicted Viral | 1004
Ga0210038_1057638 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 992
Ga0210038_1066553 | All Organisms → Viruses → environmental samples → uncultured archaeal virus | 900
Ga0210038_1078705 | Not Available | 804
Ga0210038_1080830 | Not Available | 790
Ga0210038_1088275 | Not Available | 745
Ga0210038_1088984 | Not Available | 741
Ga0210038_1089016 | Not Available | 741
Ga0210038_1098489 | Not Available | 693
Ga0210038_1102214 | Not Available | 677
Ga0210038_1104539 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 667
Ga0210038_1105343 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 664
Ga0210038_1115049 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 628
Ga0210038_1127564 | Not Available | 589
Ga0210038_1130149 | Not Available | 582
Ga0210038_1153880 | Not Available | 525
Ga0210038_1158716 | Not Available | 515
Ga0210038_1164810 | Not Available | 503

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0210038_1002233 | Ga0210038_10022334 | F093503 | MKKYKVKIKHTDKEEIIKADSELEARVKFCEQNNLNYRHLAGKLEITLNNKPLQNNL
Ga0210038_1010846 | Ga0210038_10108465 | F075683 | MTHIYTANMQPATGWIDQCGEVEIRGGFGVGEMIRCVCCGKKRPAEDCVVQCYYDGMSVWCAEGKGCKSHAEIEQKRLIAHENRSRGQRARRAKERLLLAAVVPLD
Ga0210038_1012478 | Ga0210038_10124783 | F096615 | MAGCFGNNPEDRARERELNNYLDSADRLDNNDERIKELARDKFNALPSFYSPKGEKYRYTFMDDAMGSLKQEELIAIARLLRDGEHLQAGKLLEASLMRVLVAEAEEEIEDEEYD
Ga0210038_1026728 | Ga0210038_10267282 | F054576 | MEFQLGLPELIAFALANIGGGWALLRISFSQFEMRLDDRFKLLDKAVNDVKRIELEIVRADTRNAQTYVTQANHDKVLERIFNVLSSMEQKLDGKANAADCEAKLLRRMERK
Ga0210038_1033468 | Ga0210038_10334685 | F094905 | MTTIYGYIAAAVLIISLGAGWAHEHDKRIVFKAQVEQAGKDAAKHTAEIDAKHREEMQNAEQNTIIATNSIADWYRAH
Ga0210038_1037063 | Ga0210038_10370632 | F021923 | IKMNLTDFFNNYEFIKKLQNHLTVNKENLLNDFPNNNIEKSFNQLLLSPSNAYLAKEIIPQTFADMTWKFKLLEGQQTLTSPSPREFTLTYESQTSRSIFYYETPISGSCYVECNFVDVEPGTTNCIFLGNANNITSKTYLAIQTFPLSSFQGNKIGDGWVAIMEFNNSNIHNPNVLSWSYDIEPTGTFRLSKNTSTLKLHHNRTYSLGAKTTFIGEDLHAGFFSYAFASSKIKKTTFKNLSCGNY
Ga0210038_1038383 | Ga0210038_10383833 | F054576 | MEFQLGLPELIAFVLANIGGGWALLRLSFAQFEQRIKDQFELLDKAVNDVKRIELEIVRADTRNAQTYVAKTSHDKVLERIFNVLSSMEQKLDGKANAADCEAKHLRHMERK
Ga0210038_1040001 | Ga0210038_10400012 | F055778 | MLSSVDLPEPELPTIKTNSPCLIENETLSRALTLLSPCP
Ga0210038_1045756 | Ga0210038_10457561 | F007959 | METVINFEKFKQEYKKDFDIPVLSAKEKDIYIYLFLSLRRKMSKGMFPELYNSEIMFSKSDLKALVLKNIVIFQNHKKGWIISMNPKYITKTTECSFCGAKFNEIVYFRRNSITCPGCGFRMHGLVTAKRVNDYSVAIANIEKVEAIPKVTTSVVKVPIEHVITDVPGNLKI
Ga0210038_1045756 | Ga0210038_10457564 | F060596 | MENKEASVGAIPIPDENVAKKKMAKNRTMVLLDRKVVDRLCCCKRKIGNTYNSVISNLLDKYENGQ
Ga0210038_1046856 | Ga0210038_10468562 | F083827 | MTGCQGSVGAVENGSVVFQGAVGAFVSVRSRGRFHRRWRTRIAKVTDDIGRFDDESR
Ga0210038_1056621 | Ga0210038_10566212 | F054576 | MEFQLGLPEIIAFVLANIGGGWALLRISFAQFELRLDDRFELLDNAMADVKRIELEIVRADTRNAQTYVTQTSHDKVLERIFNVLSSMEQKLDGKADAADCDMKIMRIMERK
Ga0210038_1057638 | Ga0210038_10576381 | F001911 | MKRPIEFEKFKQEYKKDFDIPVLSTKEKDIFMYLFLLLRRKMSKGIFPELYNSEIMFSKSDLKTLVLKNIVIFQNYKKGWIISMNPNCITKNAECSFCGAKFNEMVYFRQNSISCPGCGFRMHGLATAKRVNDYSVAITNIEKVEAIPKVTTSVVKVPIEHVITDVPGNLKVTDIVLAPTAMERTIQIANQEVIPFQTGKADKLNALSTLKSIENKKKEANALVVQDNILANFSPISINFLKEYFFIQHTFFSARKYINLKLTEISKDPRFIANASLKNNTLTEVKNACERLCESEEIRGITLTEE
Ga0210038_1066553 | Ga0210038_10665532 | F026802 | MYYYVPFSRNLTQGNTEEIAAADPGHLVGVFLRFKSGVEGFHPLRGQISISNIGTTKEYLLKNGIYAIFQRGTDGFVLNDWYQPAFIPLDHDIRANHQVLLETFTWGANFISVRGHFVYS
Ga0210038_1066553 | Ga0210038_10665533 | F019951 | ILFKAFQINVSANNVKCEANNKLHVQKGQTIILHAIQIAPGIDTADENVFIMLYRDQEQLAGEGIYHAIIDDYTHTIEFDADAVEGQTFVTKAWGPNARTIHGVYMYEITS
Ga0210038_1078705 | Ga0210038_10787052 | F054576 | MEFQLGLIELIAFAVANIGGWWALLRISFTQFELRIKDQFKLLDKAVNDVKHIELEVVRADTRNAQTYVTQTNQDKALERIFNVLSSMEQKLDGKANAADCEAKLLRHMERK
Ga0210038_1080830 | Ga0210038_10808301 | F102531 | AAMGITAGYFSDVIGSEQALQASAQQVQSVAQQGLTELSFASSILNLPNADNSAGWTLGTYGGQTNVIQPQKTVAGINQEIDKLVANGVDKLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK
Ga0210038_1083882 | Ga0210038_10838821 | F094905 | MTTLYGYIAAIVLIVFLGAGWAHEHDKRIVFEAQVEQAGKDAAKHTAEIDAKHLEEMKNAEQNTIIATNSIADWYRAHPAVRVRYANADCSAV
Ga0210038_1085889 | Ga0210038_10858891 | F073076 | MVVLPMQSGVRTIEKIPKISLVSTIDQQEGFFYDVWKNPEKYGYTKLKMTWQECDGYTKEDMRKKKIEIGTRAFASQYECEAQSTTSSFFTHSIIEGSIRNCDYKHGITIGGIDLAKKKDYASISVVEKRDNNFNLIVNFQTQLNYTDLARISKQYEKEYLTTSFLVDTTTGEEFVDFASKEPYLVSLKPFAFTLNSKKQILDYLRIVMEQKRLVIPERFQELISDMRRYQYSDHLPDSISSLALSLWNEKAL
Ga0210038_1088275 | Ga0210038_10882752 | F068439 | MKSENESVDTKSTRDKKKICEACQKRETEENDSIIKNVEEIEIYDIKKTGKNYSYRIFFCTIENECFFEYYSYNDIKRFNKERTVFLSLLTDKNKKKVYDFERKMHEKYPEYLYKMKNKKVTYPEYKNIKEYRISENYTVLFFTIGQHAIFQYFEDDGLIVYNEVDRSCVLHWLSGEETERLFNFEDEIKEEYAKYIF
Ga0210038_1088984 | Ga0210038_10889842 | F093254 | MSDTTTSYRKMKAELRERTEGDFTLLSQDEADALSLARWREYSDWAKQEPIVESAVPEILYDAIHGDGTAASCSYETLRRAALAYLAAGTRNDWSCP
Ga0210038_1089016 | Ga0210038_10890161 | F039475 | RMEKLNVLIYTKDEDYNPLGSVQLELFQLDDAGNETYIGKSISGDDGRLEVSKTALGGEGARIRVRNAKRLSGEDIVQTSDGIYSTFIVSSDETVQEVEIVFQIKKKHNISINVTWK
Ga0210038_1089016 | Ga0210038_10890162 | F012423 | MVVTKYKGDNCVIAIGISNDADSTAQNYGLSLCLKDEDGDEYTLTMTGLVGTLLAGGFKKITDTDKIVKNKFYFEKGTTYNYNITATFPEMKEGKVYGQLGIYDGTDEDSAWKAGTTYDH
Ga0210038_1091879 | Ga0210038_10918792 | F079604 | MTKARRGITLIKNRKKFKLDTFLQDIRKSPVAFTEICQYQGRQVQKFKQTKRILRLISENNKSIAVLPRGASKSFSLAIIALWYFYTCENFRVAIFSRSHRQSKAVLEICSDIIDSSPLLKTSRQSFQIDQKQRLKSHINSEIIAHPFDASTVLGEHPDIVL
Ga0210038_1095751 | Ga0210038_10957511 | F094906 | LSFNTNKSVTKIEVNWFRADGWNFSKYKNTSGTSHILPIKYLKPNTFTYFVIKAYTATENVQSAQYSVAVSNGDSKVYDVYINVSGSSVRLSWRYFVANPQSNFESWYRKNGGSWYSRSITRIGQKYYWTNYADIGWQAGDQMDIQIFNPADRSETNIEDTILYGNIYF
Ga0210038_1098489 | Ga0210038_10984891 | F017787 | EKIIREGEYSLTKKKYTLGVRAKFDVKFEYFEVTKYPNFKDECVYLSRKFFFLKNIITDDEKKEIEKFENEILNKFYYTK
Ga0210038_1098489 | Ga0210038_10984892 | F068436 | MENKKKNAENAWEKGIVNEIYEEIEKKNREYEKRMNLIEAILN
Ga0210038_1100788 | Ga0210038_11007882 | F079604 | MTARRGITFSKNRRKFKLDTFLQDIRKSPIAFMEICRYQGKQIQKFKQTKRILRLISENNKSIAVLPRGASKSFSLAIIAIWYFYTHENFRVAIFSRSHRQSKAVLEICSDIIDSSPLLKTSRQSFQIDQKQRLKSHINSEIIAHPFDASTVLGEHPDIV
Ga0210038_1102214 | Ga0210038_11022142 | F050143 | MTETDVKDAMVRGISNAGAAHTFGVNYPRQDPIQSAISAIQKGIWEKNFRDSIGKWEKKLALVTIEEWKAATIAAASMYAEKASTIGAEEWGKYYDKAKSVIESAATEYVKSDKKKENMIKFWTDMQKLKNL
Ga0210038_1104539 | Ga0210038_11045391 | F000449 | MKNKKEGGEDTPSPDLDQEFELRTSAPQEEMERYYARKKASLERKLRRLNKKIKKNIYNP
Ga0210038_1105343 | Ga0210038_11053432 | F076873 | MEKWKKFTVENKKDFILPRNLSPREKDIFMYIFRLLREKVNRGRYPELYNDEIIYSKSDVKNLILKEIVIFKQYKKGWIITIHPTHITKNVECPWCGARFNEKIYFRQKRMRCPSCMWGMNGSTVAEKFEEINDVSNSNITENSTKTEKVTTDIKLAPTSIERTMQIARQKIIPFQTSKADVLNAISALN
Ga0210038_1115049 | Ga0210038_11150492 | F002651 | MKNIEASVGAVPTQKKKEKVEGEGKKYIRKENEVIFCLEE
Ga0210038_1127564 | Ga0210038_11275641 | F006336 | DISVNFIREYFFMYNHTIFSTMKYMNIGLREIRKDPRFIANPSLKNTVLSEVKTACDELLRACRPLQQPNEYVTNEYVTNEYVTALKLL
Ga0210038_1130149 | Ga0210038_11301491 | F003014 | MKPPHNGGRGTTKNLYQREKEGTMPRRGSKQPYKPKVEYKYTIKDIAELAGMTRNALSVAKVHEKIEPGDFKNVVSFLTRRIIDNRLTGDLFTPTGGKAKRMKGSKSRAHVSGKKPKR
Ga0210038_1146178 | Ga0210038_11461781 | F062438 | SEIIAHPFDASTVLGEHPDIILADECAFFGDDSFFRMVVLPMQSGVRTIEKIPKISLVSTIDQDEGFFYDVWKNPDKYGYTKLKMTWQECDGYTKEDMKNKKIEIGTRAFASQYECEAQSTTSSFFTHSIIEGCIRGCDYLNGITIGGIDLAKKKDYASISVVEKRDNNFNLIVNFQTQL
Ga0210038_1146386 | Ga0210038_11463861 | F104464 | MAKIILLIEGEVKDGEELKALNKAFNILTTLFQVEIRNLAPREQKKXTNTEKATSSS
Ga0210038_1153880 | Ga0210038_11538801 | F016112 | MWGSIEKVRNICGITKEQINDIVIHDMLEQTDRKVRDRVYIFRRLTMQIDKIGTNKIRFDVNKIGDGNMDTSIDYTDISIYRRNNGVYELFPIQKLDILSNEITFQTDIGENYDMIVEYFEDNFHFSIDTLSDASSLLAAAMCMRSLPLTSQTNNDAK
Ga0210038_1158716 | Ga0210038_11587161 | F096615 | MAGCFGNNPEDRARERELNNYLDAAARLDNDDERIKELARDKFNALPNFYSPKDEKYRYTNMDDAMGSLKQEELIAIARLLRDGEHLQAGKLLEASLMRVLVAEAEEEIEDEE
Ga0210038_1159734 | Ga0210038_11597341 | F016358 | FINMIQRTDSKNNIHANKSWAKQSPAMKYAALKTIISKSDIFVLPE
Ga0210038_1164810 | Ga0210038_11648102 | F016496 | MEDKKKEDKYFFKKGNKYQQKGSAGLRYGGTTPVAKDKEKNLERLRCGWKPKTIVCKNITISDDEEAFANEFSAKVSEENDTYVDTVLIELAIAQILQVHRVYVYAKEKKISRDASRMIGTVLSTLREMNATKNARKEDNI

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice