NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300013871

3300013871: Clean room microbial communities from NASA Spacecraft Assembly Facility at Jet Propulsion Laboratory, Pasadena, California, USA - InSight In4-11 gowning area SPAdes reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300013871
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095512 | Gp0111883 | Ga0181466
Sample Name: Clean room microbial communities from NASA Spacecraft Assembly Facility at Jet Propulsion Laboratory, Pasadena, California, USA - InSight In4-11 gowning area SPAdes reassembly
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 87905619
Sequencing Scaffolds: 22
Novel Protein Genes: 27
Associated Families: 18

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 11
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Ascomycota → saccharomyceta → Pezizomycotina → leotiomyceta → dothideomyceta → Dothideomycetes → Pleosporomycetidae → Pleosporales | 1
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → Nitrososphaeraceae | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Ascomycota | 1
All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → rosids → fabids → Fagales → Fagaceae → Castanea → Castanea mollissima | 1
All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Acrogymnospermae → Pinopsida → Pinidae → Conifers I → Pinales → Pinaceae → Picea | 1
All Organisms → cellular organisms → Archaea | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Fungi incertae sedis → Mucoromycota → Glomeromycotina → Glomeromycetes → Diversisporales → Diversisporaceae → Diversispora → Diversispora epigaea | 1
All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → asterids → lamiids → Solanales → Solanaceae | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Clean Room Microbial Communities From Nasa Spacecraft Assembly Facility At Jet Propulsion Laboratory, Pasadena, California, Usa
Type: Engineered
Taxonomy: Engineered → Built Environment → Unclassified → Unclassified → Unclassified → Clean Room → Clean Room Microbial Communities From Nasa Spacecraft Assembly Facility At Jet Propulsion Laboratory, Pasadena, California, Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Surface (non-saline)

Location Information
Location: USA: Jet Propulsion Laboratory, Pasadena, California
Coordinates: Lat. (°) 34.1 | Long. (°) -118.1 | Alt. (m) N/A | Depth (m) 0


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F002654 | Metagenome / Metatranscriptome | 539 | Y
F002959 | Metagenome / Metatranscriptome | 517 | Y
F017428 | Metagenome | 240 | Y
F023843 | Metagenome / Metatranscriptome | 208 | Y
F028713 | Metagenome / Metatranscriptome | 190 | Y
F032009 | Metagenome / Metatranscriptome | 181 | Y
F033665 | Metagenome / Metatranscriptome | 176 | Y
F044253 | Metagenome | 154 | Y
F045048 | Metagenome / Metatranscriptome | 153 | Y
F049431 | Metagenome / Metatranscriptome | 146 | N
F054337 | Metagenome / Metatranscriptome | 140 | N
F057039 | Metagenome / Metatranscriptome | 136 | Y
F065766 | Metagenome / Metatranscriptome | 127 | Y
F072954 | Metagenome / Metatranscriptome | 120 | Y
F077757 | Metagenome / Metatranscriptome | 117 | N
F088701 | Metagenome / Metatranscriptome | 109 | N
F096629 | Metagenome / Metatranscriptome | 104 | Y
F103436 | Metagenome | 101 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0181466_1000188 | Not Available | 21824
Ga0181466_1000229 | Not Available | 20393
Ga0181466_1000315 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 18046
Ga0181466_1000359 | Not Available | 17120
Ga0181466_1001102 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Ascomycota → saccharomyceta → Pezizomycotina → leotiomyceta → dothideomyceta → Dothideomycetes → Pleosporomycetidae → Pleosporales | 9169
Ga0181466_1001206 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → Nitrososphaeraceae | 8573
Ga0181466_1002471 | Not Available | 4828
Ga0181466_1002667 | Not Available | 4536
Ga0181466_1002958 | Not Available | 4125
Ga0181466_1003107 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae | 3926
Ga0181466_1003557 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Ascomycota | 3450
Ga0181466_1003878 | Not Available | 3170
Ga0181466_1005086 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → rosids → fabids → Fagales → Fagaceae → Castanea → Castanea mollissima | 2491
Ga0181466_1007415 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Acrogymnospermae → Pinopsida → Pinidae → Conifers I → Pinales → Pinaceae → Picea | 1790
Ga0181466_1008685 | All Organisms → cellular organisms → Archaea | 1556
Ga0181466_1009584 | Not Available | 1435
Ga0181466_1010266 | Not Available | 1353
Ga0181466_1014359 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Fungi incertae sedis → Mucoromycota → Glomeromycotina → Glomeromycetes → Diversisporales → Diversisporaceae → Diversispora → Diversispora epigaea | 1029
Ga0181466_1018436 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → asterids → lamiids → Solanales → Solanaceae | 843
Ga0181466_1025949 | Not Available | 645
Ga0181466_1026582 | Not Available | 633
Ga0181466_1027998 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 609

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0181466_1000188 | Ga0181466_100018819 | F044253 | MLLEYIPTKEQDADILKKALLRCKFEFHRDMNWVANNPFHVEREC*
Ga0181466_1000229 | Ga0181466_100022928 | F065766 | MADEKNDKGAGDPIKILLEEALERQQNAMMDSFAQIL*
Ga0181466_1000315 | Ga0181466_10003155 | F032009 | MKLLLCTLILLAGYTSFSQTYISLAPSFTNTPGTIPEKSNIALEIGQQWDVFSMGIDVGKTTLARMPHGDTSFYAELRPNLNVFQQGKFTNTFTAGIGCIFNAKEYLLTELTNGIEYAYSTKLHFNVFFGQYFYSGRASASDATFFGVSAAFYFSETTVGSLIKGAVK*
Ga0181466_1000359 | Ga0181466_10003591 | F002654 | KVVCPKGNSPEQVFKVPKLLLSEIKEVFFKYNQEIGLEAAIF*
Ga0181466_1000581 | Ga0181466_10005818 | F072954 | LIKRTKAEDFFMGDKVLRWDSRREDKGKHGKFDFLWRGPYVIQVVQGNNTYFLKRLDGTEAEDGPVNG*
Ga0181466_1000832 | Ga0181466_100083216 | F072954 | MFDKRTQAEDFFIGDKVLRWDSRREDKGKHGKFDFLWKGPFIIQVLQGNNTYFLKSLNGTEAEDGPVNGRMLKHYFDLL*
Ga0181466_1001102 | Ga0181466_100110215 | F088701 | VAVVLKRPEKIPVPLFKNRKNKLSYAIKAEISKIVIMALEHNIDLSRLSVIELDKSKMINGLINT*
Ga0181466_1001206 | Ga0181466_100120610 | F077757 | MNTNYYFFIIPLISGVSILSFFSLINIYPQNSEISVIVQLTNGDKFIGQTWDINAQLKDKSSNSEISKDQTTFKVNQNQTIQMILPLSNSTQDLNRYSIFVDASSTIDDIDVFGTIDSLTNDNNQITLNLTKAR*
Ga0181466_1002046 | Ga0181466_10020461 | F096629 | METPSKQPKISSMSHEEEEEMQHDISKLREKMQQLSLSQKVTEAKM
Ga0181466_1002471 | Ga0181466_10024715 | F033665 | MKIGGEAKAHEISKNQKWFGQILTSLGDIKIINIFNGVPKSKLRKNKILMGINIKLGSKVPKAIDQAKT*
Ga0181466_1002667 | Ga0181466_10026672 | F044253 | MLLEYIPLEEQDANILTKALSRCKFEFHKDKIGVTNNPFLVEREC*
Ga0181466_1002958 | Ga0181466_10029583 | F033665 | VLIWGIIGCALATKFGGEVGAHEISNTYKGCGKILNRSGDIKIINIFNGVPKSKFMENKILMCINMNLC*
Ga0181466_1003107 | Ga0181466_10031071 | F103436 | MIVASKKSWCDKSESEIFDNLREWVITCDLKYSKDDALDKIARASALWGGADYVTAVHLLDENEPYFKKSDWPYYSIGIEILKARKYEFFEE*
Ga0181466_1003557 | Ga0181466_10035574 | F045048 | VFNKGNSKGLIDSIPKGGHLAPNSTVGDKALWKNDQNMAKKNKATPILIPLCTAKV*
Ga0181466_1003878 | Ga0181466_10038788 | F023843 | MVRSGKIEIEKFNGKSFELWKLKMEDLVVDKDQWIGVDPSIEPMTMSNEDWEKLDRQAKTTI*
Ga0181466_1005086 | Ga0181466_10050862 | F049431 | HMTTGRINQVTILSPSAEAHERTPRRRPGCTGRKGRRRSEPQPQASQVCETPEPQATDSIAPTEFPKLRSATGGIRLLHRSLHRYIRPSGGEDSRQVNAQARVLDRAIPEDLVKRLAKPVIHRPQMLPANRETNRTSVPSTSTLRLPEGRQHRASRKESPSPK*
Ga0181466_1007415 | Ga0181466_10074154 | F017428 | MANQQKIISMGRLQGITVDIEGASTLADFEVIEFIDDSNPYPVLLGIDWATNMNRVINLKKHKMTFEKKSLRVIIPLDPVEGSRYSELVHNYESEDDLDCIYKITA*
Ga0181466_1008685 | Ga0181466_10086851 | F054337 | MYSVIGSVIFLNLGLFQEGLAQNLINSNLPTPNDTEIQDINNQSTDKDNGLANVPLVINITKNSNATETVIDFIINQVNSSVVDISVLEDLAKDRIPLLIDTLKNTNASSIAAEYVLNTTKNEILTNTTSS*
Ga0181466_1009584 | Ga0181466_10095841 | F033665 | MKFGGEVGAHEISNTQKGFGQILNRSGDIKIINIFNGVPKSKFRENKILMCINMNLC*
Ga0181466_1010266 | Ga0181466_10102661 | F057039 | MDVDGIVVVVEVNGKDNMEFGMEDIEENIDDIMDGSHKFLSNLVSVIGFGVEMDNVAKERVCSIYFEVDVHKLEIVNYDIVSYQAK*
Ga0181466_1014359 | Ga0181466_10143591 | F045048 | TVTPDDNNITVFSKGNSKGLIASIPIGGHIAPNSTVGDKALWKNAQKIAKKNKASDTINKATPIFIPLCTAKV*
Ga0181466_1018436 | Ga0181466_10184361 | F044253 | MLLEYIPTEEQDAGILTKSLSRSKFEFHRDKTGVADNPFLVEREC*
Ga0181466_1022405 | Ga0181466_10224052 | F096629 | MKTPSKEPESSSMTHEEEEEMLHDISKLQEQVKQKSHLQKLR*
Ga0181466_1025949 | Ga0181466_10259491 | F028713 | KVLLVKVDTLKNTADALTKSVSSEKFSWCRETMGVSGLKK*
Ga0181466_1026389 | Ga0181466_10263891 | F096629 | MSTPGKKPESSSMTYEEEEEMQHDIAKLRDQVQQVSLAQKGTEAK
Ga0181466_1026582 | Ga0181466_10265821 | F044253 | MYVWTEEHYADILTKALSKCEFEYYRDRIGVIDSPFLVEREC*
Ga0181466_1027998 | Ga0181466_10279982 | F002959 | MRDSIPKYVTQARAAFLLGMPVAEIHRISLEAGLGHLERAGNFEELYLTYDELKKVCALASEPAMATH*
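
In the sequence records above, each protein ID appears to be its scaffold ID followed by a gene number, and most sequences end in `*`. The following is a minimal Python sketch, not part of NMPFamsDB, for splitting protein IDs and tallying genes per family from a few of these records. The names `ROWS`, `split_protein_id` and `summarize` are hypothetical. Two assumptions are made: scaffold IDs in this sample are a fixed 17 characters, and a trailing `*` marks a terminated sequence (the page itself does not say what `*` means).

```python
from collections import Counter

# A few (protein_id, family, sequence) records copied from the Sequences table.
ROWS = [
    ("Ga0181466_100018819", "F044253", "MLLEYIPTKEQDADILKKALLRCKFEFHRDMNWVANNPFHVEREC*"),
    ("Ga0181466_10026672", "F044253", "MLLEYIPLEEQDANILTKALSRCKFEFHKDKIGVTNNPFLVEREC*"),
    ("Ga0181466_10020461", "F096629", "METPSKQPKISSMSHEEEEEMQHDISKLREKMQQLSLSQKVTEAKM"),
    ("Ga0181466_10224052", "F096629", "MKTPSKEPESSSMTHEEEEEMLHDISKLQEQVKQKSHLQKLR*"),
]

# Assumption: every scaffold ID in this sample is "Ga0181466_" plus seven digits,
# so whatever follows the first 17 characters of a protein ID is the gene number.
SCAFFOLD_ID_LENGTH = len("Ga0181466_1000188")


def split_protein_id(protein_id):
    """Return (scaffold_id, gene_number) for a protein ID."""
    return protein_id[:SCAFFOLD_ID_LENGTH], int(protein_id[SCAFFOLD_ID_LENGTH:])


def summarize(rows):
    """Count genes per family and list protein IDs whose sequence lacks a trailing '*'."""
    per_family = Counter(family for _, family, _ in rows)
    # Assumption: '*' marks a terminated sequence; records without it may be partial.
    no_terminator = [pid for pid, _, seq in rows if not seq.endswith("*")]
    return per_family, no_terminator


if __name__ == "__main__":
    per_family, no_terminator = summarize(ROWS)
    for pid, family, seq in ROWS:
        scaffold, gene = split_protein_id(pid)
        print(scaffold, gene, family, len(seq.rstrip("*")))
    print(dict(per_family))
    print(no_terminator)
```

The fixed-width split is the simplest option for a single sample; for IDs from other datasets, matching each protein ID against the list of known scaffold IDs would be safer.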




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice