NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300004185

3300004185: Groundwater microbial communities from aquifer - Crystal Geyser CG13_big_fil_rev_8/21/14_2.50



Overview

Basic Information
IMG/M Taxon OID3300004185 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0111384 | Gp0110938 | Ga0066642
Sample NameGroundwater microbial communities from aquifer - Crystal Geyser CG13_big_fil_rev_8/21/14_2.50
Sequencing StatusPermanent Draft
Sequencing CenterDOE Joint Genome Institute (JGI)
Published?Y
Use PolicyOpen

Dataset Contents
Total Genome Size601162568
Sequencing Scaffolds18
Novel Protein Genes22
Associated Families21

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Archaea1
Not Available11
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense2
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Altiarchaeota → Candidatus Altiarchaeales → Candidatus Altiarchaeum → unclassified Candidatus Altiarchaeum → Candidatus Altiarchaeum sp. CG2_30_32_30531
All Organisms → cellular organisms → Eukaryota1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes communis1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameDevelopment Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets
TypeEnvironmental
TaxonomyEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets

Alternative Ecosystem Assignments
Environment Ontology (ENVO)freshwater biomeaquifergroundwater
Earth Microbiome Project Ontology (EMPO)Free-living → Non-saline → Subsurface (non-saline)

Location Information
LocationUSA: Utah: Grand County
CoordinatesLat. (o)38.9383Long. (o)-110.1342Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F000212Metagenome / Metatranscriptome1580Y
F000320Metagenome / Metatranscriptome1306Y
F000449Metagenome / Metatranscriptome1126Y
F001003Metagenome / Metatranscriptome808Y
F002777Metagenome530Y
F003622Metagenome / Metatranscriptome476Y
F016112Metagenome / Metatranscriptome249N
F016496Metagenome / Metatranscriptome246N
F020302Metagenome224Y
F026801Metagenome196Y
F028419Metagenome / Metatranscriptome191Y
F039475Metagenome / Metatranscriptome163N
F048100Metagenome148Y
F050470Metagenome / Metatranscriptome145Y
F055778Metagenome / Metatranscriptome138Y
F057106Metagenome / Metatranscriptome136Y
F057887Metagenome / Metatranscriptome135N
F065251Metagenome / Metatranscriptome128Y
F078218Metagenome116N
F085132Metagenome111Y
F085143Metagenome111Y

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0066642_10068339All Organisms → cellular organisms → Archaea1166Open in IMG/M
Ga0066642_10090271Not Available1004Open in IMG/M
Ga0066642_10101719Not Available939Open in IMG/M
Ga0066642_10132215Not Available811Open in IMG/M
Ga0066642_10134734Not Available803Open in IMG/M
Ga0066642_10143640All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense774Open in IMG/M
Ga0066642_10161719All Organisms → cellular organisms → Bacteria723Open in IMG/M
Ga0066642_10175907Not Available689Open in IMG/M
Ga0066642_10201521Not Available636Open in IMG/M
Ga0066642_10237976All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Altiarchaeota → Candidatus Altiarchaeales → Candidatus Altiarchaeum → unclassified Candidatus Altiarchaeum → Candidatus Altiarchaeum sp. CG2_30_32_3053576Open in IMG/M
Ga0066642_10242715Not Available569Open in IMG/M
Ga0066642_10257537Not Available549Open in IMG/M
Ga0066642_10263640Not Available541Open in IMG/M
Ga0066642_10271285All Organisms → cellular organisms → Eukaryota531Open in IMG/M
Ga0066642_10274939All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes communis527Open in IMG/M
Ga0066642_10277352Not Available524Open in IMG/M
Ga0066642_10281650All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense519Open in IMG/M
Ga0066642_10293731Not Available505Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0066642_10068339Ga0066642_100683391F065251MYLRAFKKGGKKYYYIAKAVRKGLRVIQKSILYIGTADTLYEK
Ga0066642_10090271Ga0066642_100902712F020302MTNVRSFNKKPRGEPLLTHQVHCNVTPKDYTKAKTLKLSWNQLLNIGINSTDVLRNLRTKDIEIYNLKQRIEKLTARLQFYNNKKNALLQENCKLRCVILDKDVYYAKEEEEFSTFLFDYKFNHHKIYCEPEASGRDEESQPDK*
Ga0066642_10098029Ga0066642_100980291F000212LRTETKCHPQCSWKCDDPHCPAICDPVCEPPKCHTSCAEPKNAICDVKCEKPQCEIKCPDKGCEMFDCPKCVTVCKAPHCVTHCQAPKPECEAVCEEPKCDWKCHKPHCPKPKCELVCENPNCAPKVECCACAAGGARVSSPMPFFKEAEKDPNCCGCHKPLVI*
Ga0066642_10101719Ga0066642_101017191F016496MEDKIKGRRHLFEKGNKYQLMGSAGLRYGGTTSIAKNKEKNLERLRCGWKPKTLICKDITISDDEEAFAVEFSKKVSEENDTYVDTVLVELAIAQILQVHRVYVYAKDKKISRDASRMIGTVLSTLREMNATKSARKVDNIKVTVNSDIMTLI
Ga0066642_10101719Ga0066642_101017192F016112INDIVIHDILEQTDRKVRDMVYIYRRLTMQMNKIGTNKIRFDCNKIGDGNMDTSIDSTDISIYHKNEGVYELFAIEKLDILSNEITFQTDIGENYEMLVEYFEDNFHFLIDSLSDASSLLATAMCVRSLPLTPQKDINAKEYERRAKEIIARSSTSFI*
Ga0066642_10128075Ga0066642_101280751F048100PQTIFEQHRTTIKIDTFRRDIGKRWAENMRKREEIYEIIDMLYELVKIETENSDAERSKQMSWAEIRQKIQERMKKTLFR*
Ga0066642_10132215Ga0066642_101322151F055778MLSSVDLPEPELPTIKTNSPCLIENETLSRALTLLSPCP*
Ga0066642_10134734Ga0066642_101347342F078218MKNTEAVVGVLLTQSDKIKKNILDRALKKIEDMQEIANEIDGMNKELYIFIRKTRCPETIAKQERNTKKINVFLTSIRKKLDKISEGRGEIDEIIDTLYELSKIEAENSDVKEQGQIFWADIRREIQKKIKKPKIK*
Ga0066642_10143640Ga0066642_101436401F001003MEKYKKFMQEYKKDFLLPHNLSPKEKDIFIYIFRLLRKKINIGIYPEVRNDEIIYSKSDMKSLISKGIILFQPYKRGWIITIHPTHITKNVECSWCGTKFNEKIYFRLKRMR
Ga0066642_10161719Ga0066642_101617192F050470ARGARNLTGKIEKIEDRDALAQVRRQARRIVFKAFLAAIPLTLIVFVLP*
Ga0066642_10175907Ga0066642_101759071F085132VKNYKSILIVGAIAVGLYFFLHSKKILPMTLAQKAAWHEKVSRRMAQMSTAEKIRALKAARKEKAAAEGT
Ga0066642_10201521Ga0066642_102015212F085143MARIDDGFATLIEFAEDSDVQMWEKEVTPPGVSGGGENDTSTMRNTTWRTKAPKGLMSLSEASLVVAYDPAVYNEIIIMLNVNQQITITFADGSTLVFWGWIDEFTPGAAAEGSQPTATVKIIPSNQNGSGVETAPQYSVAP*
Ga0066642_10237976Ga0066642_102379761F002777MENIKKNTGTNPNISMHTEIISGRIIRKINEIGHKTHALKKKQNAFYAEALEKSRRSTEFPGKDKLREMTDDKFEHEWKEIEEGKDEVDWLIHRLYLSIEGEEEKNK*
Ga0066642_10242715Ga0066642_102427152F057106MVQTIKTIYLTPPQKTKVAEMPHTLHRIFFSIRVVVGVNSWYPSKISFDDPSFASFYLLAGTERYFEMWGEDIFQGDIWVKNEAT
Ga0066642_10257537Ga0066642_102575371F039475IYTKDEDYNPLGSIQLEFFQVDNDGNETYIGKSISGDDGRIEVSKTTLGGEGARIVVRNAKRLSGEDIVQTSDGKYSTFIVSNDETIQKVEIVFQIKKKHNISINVTWK*
Ga0066642_10263640Ga0066642_102636401F057887ISELEVTKDEYEGEEFWKNLLAATIAPNLSFSDIIYDDLKVYRDEEGNEYVFDDLGNLYLVSRPWNPTEPIIFLSIDTEKEYEYNTNTHSFVDKPIVYEAGLGVFKLFSKILKKLKITGSTARVASVLARIKNIRTGKKILYFNRKRGTWTAARDVSLMSLVYFGLSQGGKVATQTGSIA
Ga0066642_10271285Ga0066642_102712851F028419PIITKLSYLYQNNYMSDKFSLASDPTGKIIITGGYNNMFHIIDCDQKLNTQIVIDENNEKIMNTNVIRKINSKGSCFYKKDDPSLTNINFDKRILHQAYSPVENFANLILLNCIYSYTGALAKKSK*
Ga0066642_10274939Ga0066642_102749391F003622MYDIMKNYQDELEIIANDLLEQAVGDGFKPNYTNRDFMNAVIIFQTALMDKMYDNQDYDKMDFDDRLNMATSCGQELRKLIHTYTGLDTHKVEEFL*
Ga0066642_10277352Ga0066642_102773521F026801KTYKLKKDVNDICGLILSKPGATMFVEGHKAELWKEASDDFSHRWKEIAAEKYEVECMVHLMYDLVLREEEKLKGKIVFNAKRRAMSMVK*
Ga0066642_10281650Ga0066642_102816502F001003MEKFKKFKIENKDFVLPQNLSPKEKDIFIYIFRLLRQKINKGIYPELYNNEIIYFKSDMKNLISKGIILFVRYKKGWVITLNPIHITKNVECPWCGAKFNEKIYFR
Ga0066642_10293731Ga0066642_102937311F000320MSRSKGGKVEVLCGEILRKIKDIGQKTYKLKKDVNDVCGIILRKPGGTFFFEGRKAELWEEASDDFFKQWKEIGAEKYEVECLVNLMYDLVLQEEVKR
Ga0066642_10293731Ga0066642_102937312F000449MENKKAGEEDTSSPDIDQEFELHTSAPQEEMERYYARKKASLERKIRKINKKIKKNIYHPYISDDEKYKNVPQ*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.