NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300006711

3300006711: Metatranscriptome of deep ocean microbial communities from Pacific Ocean - MP2255 (Metagenome Metatranscriptome)



Overview

Basic Information
IMG/M Taxon OID: 3300006711
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0053074 | Gp0092407 | Ga0031673
Sample Name: Metatranscriptome of deep ocean microbial communities from Pacific Ocean - MP2255 (Metagenome Metatranscriptome)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 87314603
Sequencing Scaffolds: 22
Novel Protein Genes: 24
Associated Families: 23

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales | 1
Not Available | 9
All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium TMED161 | 1
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta | 1
All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Halobacteria → Natrialbales → Natrialbaceae → Natronococcus → Natronococcus amylolyticus | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Ascomycota → saccharomyceta → Saccharomycotina → Saccharomycetes → Saccharomycetales → Debaryomycetaceae → Meyerozyma → Meyerozyma guilliermondii | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Acartiidae → Acartia → Acartia pacifica | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 2
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 2
All Organisms → Viruses → Predicted Viral | 1
All Organisms → cellular organisms → Eukaryota → Cryptophyceae → Pyrenomonadales → Geminigeraceae → Guillardia → Guillardia theta → Guillardia theta CCMP2712 | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Deep Ocean Microbial Communities From The Global Malaspina Expedition
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean → Deep Ocean Microbial Communities From The Global Malaspina Expedition

Alternative Ecosystem Assignments
Environment Ontology (ENVO): marine biome, marine water body, sea water
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: West of El Salvador, Pacific Ocean
Coordinates: Lat. (°) 10.09 | Long. (°) -99.25 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000021 | Metagenome / Metatranscriptome | 6082 | Y
F000051 | Metagenome / Metatranscriptome | 3266 | Y
F000685 | Metatranscriptome | 937 | Y
F001293 | Metagenome / Metatranscriptome | 729 | Y
F003590 | Metagenome / Metatranscriptome | 478 | Y
F009881 | Metatranscriptome | 311 | N
F010768 | Metagenome / Metatranscriptome | 299 | Y
F012568 | Metatranscriptome | 279 | Y
F019484 | Metagenome / Metatranscriptome | 229 | Y
F021541 | Metagenome / Metatranscriptome | 218 | N
F025846 | Metagenome / Metatranscriptome | 200 | Y
F036025 | Metagenome / Metatranscriptome | 171 | N
F052865 | Metagenome / Metatranscriptome | 142 | Y
F060053 | Metagenome / Metatranscriptome | 133 | N
F061214 | Metagenome / Metatranscriptome | 132 | Y
F063612 | Metatranscriptome | 129 | Y
F063613 | Metatranscriptome | 129 | N
F071317 | Metagenome / Metatranscriptome | 122 | N
F078755 | Metatranscriptome | 116 | Y
F082867 | Metagenome / Metatranscriptome | 113 | Y
F090441 | Metatranscriptome | 108 | N
F097512 | Metagenome / Metatranscriptome | 104 | Y
F099485 | Metagenome / Metatranscriptome | 103 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0031673_1007544 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales | 839
Ga0031673_1008932 | Not Available | 909
Ga0031673_1019030 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium TMED161 | 638
Ga0031673_1021329 | All Organisms → cellular organisms → Bacteria | 1037
Ga0031673_1038725 | Not Available | 938
Ga0031673_1045674 | All Organisms → cellular organisms → Eukaryota → Opisthokonta | 687
Ga0031673_1181115 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Halobacteria → Natrialbales → Natrialbaceae → Natronococcus → Natronococcus amylolyticus | 540
Ga0031673_1196013 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Ascomycota → saccharomyceta → Saccharomycotina → Saccharomycetes → Saccharomycetales → Debaryomycetaceae → Meyerozyma → Meyerozyma guilliermondii | 612
Ga0031673_1196111 | Not Available | 704
Ga0031673_1196993 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Acartiidae → Acartia → Acartia pacifica | 895
Ga0031673_1197655 | Not Available | 898
Ga0031673_1204530 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 518
Ga0031673_1206074 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 785
Ga0031673_1216713 | Not Available | 577
Ga0031673_1230136 | Not Available | 516
Ga0031673_1249622 | Not Available | 784
Ga0031673_1250988 | Not Available | 892
Ga0031673_1252656 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 515
Ga0031673_1253337 | Not Available | 1775
Ga0031673_1256628 | All Organisms → Viruses → Predicted Viral | 1540
Ga0031673_1257104 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 543
Ga0031673_1257391 | All Organisms → cellular organisms → Eukaryota → Cryptophyceae → Pyrenomonadales → Geminigeraceae → Guillardia → Guillardia theta → Guillardia theta CCMP2712 | 562

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0031673_1007544 | Ga0031673_10075442 | F099485 | IIHNWQHSPPKTQVARTPVKAVSKPLRQSQTITNEDLMGLSWIMFEESD*
Ga0031673_1008932 | Ga0031673_10089321 | F090441 | WELATLSFLDSLALGALITRRVIVGCQFPDAASCRKNDCDTFRSRSGT*
Ga0031673_1019030 | Ga0031673_10190303 | F025846 | AVLMVNLTVVPIVVHMVSASMAAGHAMVLLTAMTDQMKPIVAILHHVKTKVYGIVAMANVFQHHMYVMAQVNSVTQAGLQTVLMVQMKA*
Ga0031673_1021329 | Ga0031673_10213293 | F071317 | LITLFAGVTAAPLIHLDECNMPCCAGLATSCCDMDQEVACPTISDCGSSIFVLIVSGPFHKPELKSSDSISQRFITDLDIFKIETNYVTSFGKYDPGPITFLNLPLLI*
Ga0031673_1038725 | Ga0031673_10387252 | F063612 | ALMSVAGGSPADKVTSVAEKSGLKIPGRETSSKEGVNLLVL*
Ga0031673_1045674 | Ga0031673_10456741 | F061214 | LVNGFAKLTNISLALKETPINGGLTYEGPKVAEYPGR*
Ga0031673_1181115 | Ga0031673_11811151 | F003590 | VYISAVQGVAGMLEEPMRWEIKEGWVTDVTGGGEVGEELNRMFEEVPGSNKFVEIMFGYHPKASIQHGIEDPMHWELISKMPWAGLGTDRQNPNFRHMDGSIMNGQLYIDGQLLVDKYGMLDRSLLYHPDVVEVAREFGDPTEILAPVSHAAHGSNTQW*
Ga0031673_1196013 | Ga0031673_11960131 | F063613 | LTMNLITSLLKKRGGERGSGLLVGLALKGSGGLLALVVGGSTNLSLLLQSLDGILILPSDLVGQTTEASEVAARSESQHTEGLRANHTVDLVVRRGDTLEDLEVSQSGGTTGGLVGDHSTDGSPENLRGSTEVERTTSGVDVASLSKEVLVLELVSEVRTRDVDVLATNNNDLLSGQELLGNSGGKTAQKVTLSVNHDSLLKHA
Ga0031673_1196111 | Ga0031673_11961111 | F012568 | QATEGLLQPTEEDFKEFLMDFEDFKGDVASKMGNLTCVLKKMNALDDNLQINMKEYTGDFWNKIDLSQTMAGEDPEWRNMLVTSYKDCYDIARSWPQSTLNRNPLMKVFGRHMVFFKCARKEEAKACGMACANDMLTTLYGSNDAFDWTQYGMPQNKYERAAWTMKIMYGTASHEEHFVHDFFFSDPMM*
Ga0031673_1196993 | Ga0031673_11969931 | F009881 | MLILIHILFSLQLVGESELGGLLLQFGKLVLVLGNLLKGWLDELALHVTDRDGELVDLEITEDDLTLEEEHLSLQLVPLVEVSLADLLEIIHGGIVNVSLSSASLGNDTESLLGLALLLLLELLSGLLAKKSSELLLTLWGHKSLLLGHDEFCY*
Ga0031673_1197655 | Ga0031673_11976552 | F052865 | MDLRGLGIVVLGLGLPSIATWMANSGVGSRVMSYVDMVPGMTTKYGKAVIAIGLTAGVSYLAANFGVISAAEALTANMIAMGLVSVGLLKTMSLPVGQGLVDNLPAANFGGLAGSASYGYIGNYHEGVGAEMLPAPQSQQLFGVRANIF*
Ga0031673_1202042 | Ga0031673_12020421 | F000021 | MDDAAQVFLKSKILDNIAKYFSLIVFTAYMRQAAELARNSLSDEEKSKNALSGGKTAIPGDQLKITKTFVQYMEEHSKFRQMVEEGKGRLQWERDIPAAALANLENLATTDFKANLGKIIHDIYQTAHTMFSDMPQGDHKKRAKYRFASKTLMRILPADLKGEVEALIEKQSITLDLYDILGQCTWGQGLKA*
Ga0031673_1204530 | Ga0031673_12045301 | F078755 | MQGLVILAVTLSLASASAPAPGYGYAPEFYCRDTNTSIYADVCVPAFADKVTPIELAVKEVVDSDYCFDRVLTVCEETSTVVDREICTYVYEKEDVTAPCTATQVTYEDKSETMKVTACGAAGYGYGHQGYGQGEHQVCREEYQTQAYKVPLVTYPLEVSCSLSYPAPKEVC
Ga0031673_1206074 | Ga0031673_12060741 | F000685 | LDRMRGIVPDDILDALKKKKLGLPGIDFEVQEDRNAMQMGEFEVIKELLAAYPAAKIAKAQVDKLIDLAAPPPRGTGVENIRECIIESKMTFDVSADDWQAYLKAKIMNNIERYFYLIVFAMFIREVGPKGFPQSFLQYMDAHPDLRTMIDEGRCKLEWERKIPDEKLEELKTLLAAENFKDNMPKIIKRIYELSWDMFGDLPRGHHKNNSMHKLASKTMIEILPANLAEYIEKKCGSLAGTPDFFDVIGQVSWYETA*
Ga0031673_1216713 | Ga0031673_12167132 | F097512 | SQTDFSPILKDARAEGFEGLSNVARNILKGKYYTPSDEEVRGF*
Ga0031673_1230136 | Ga0031673_12301361 | F036025 | KIADQRSTEALSVIELEKLFLKCLDTGSVHNSTTHEVVFIIILTKGMTMNIDKLAPIAAVIVAIIGVLVVGDVTYSFQTGYDLTGNSQWLILALMLIGLVHGLMSPVTDHASQAMIIVAALAFPRLSDTLNSIPVIGMYLDNFIDCFAVAIAGYAIAALCNEVKSRP*
Ga0031673_1249622 | Ga0031673_12496221 | F082867 | MPRKRSSRRVHHINDINHNDTNITVQYRVLKPLTFTNNNLILSVNPTLTELSSTLASIYRQYRVTELSFTFQCSDIAGPFALAMQYVPQTGGVPITPPTTLAEFEGPAVGYCETGRGREYTYRVPSDILNAMGLNYYSTRVNISPSQDPDILTQGLMVFLTSTPATPIIAYMHVKYEFQTLEDPSFLARLVNGDENEAKDTMIVPHPKKGAGEKLWN*
Ga0031673_1250988 | Ga0031673_12509881 | F090441 | WELATLSFLDSLALGALITRRAIIGCQFPDAVSCRKNDCNMFRSHSGT*
Ga0031673_1251217 | Ga0031673_12512171 | F000051 | EDRVARTMDLQEDIAAKLEILKTCVATEADLLPQGDKVPADAHVFKDELNRIIKYVEELQANTKIECDKYSNDVKYWAEYRTGIKEFSPWLASAEKAATEGLSKPSDLDEVKALNDKVNSFDKTCVNYLKVLQAAEGASLKMTTHTEADNEVKALKERFDKVKAVSETWVKKCEVLVKEWVLLDNTVTELNSWVAKDKSAE
Ga0031673_1252656 | Ga0031673_12526561 | F001293 | MMRNSLICKLINVQASMKENNEKKIK*EDEISKLIKAPPKDTAALDKRSVLAKVNSQGSGT*
Ga0031673_1253337 | Ga0031673_12533371 | F019484 | QNIPKSYFLTILGGSLKKIAKKITTNVPINKKVYTNASII*
Ga0031673_1256628 | Ga0031673_12566283 | F060053 | MAFGLIPAGTITRETIISTRRIPIAGTTTITKGNVCELSGGYLANSPTGVNADVNHFVALETIDNSGGSAGDLSAPVAVSGHYVTVVADGAIRPGARVQVSASTAGQVITAAGDIDQQLGWYTGKEGGTIAKAATTPFQETFTDDSDFPPVACADGDIIEIYLGL*
Ga0031673_1257104 | Ga0031673_12571041 | F010768 | MKEVLNGGYNVHPVPPPNSEIKERIRRKYERKRTKIERLFTLGYTTSGDP*KIGAK*
Ga0031673_1257391 | Ga0031673_12573912 | F021541 | YGKTAAGLYARAAGQNAKARQLYNASADTYYKANDAAAYDDWVRARNHWDYFYNRS*

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice