NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300018500

3300018500: Metatranscriptome of marine eukaryotic protist communities collected during Tara Oceans survey from station TARA_051 - TARA_N000000216 (ERX1782154-ERR1712171)



Overview

Basic Information
IMG/M Taxon OID3300018500 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0117946 | Gp0216946 | Ga0193080
Sample NameMetatranscriptome of marine eukaryotic protist communities collected during Tara Oceans survey from station TARA_051 - TARA_N000000216 (ERX1782154-ERR1712171)
Sequencing StatusPermanent Draft
Sequencing CenterCanada's Michael Smith Genome Sciences Centre
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size6423211
Sequencing Scaffolds8
Novel Protein Genes23
Associated Families13

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
Not Available3
All Organisms → cellular organisms → Eukaryota5

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameMarine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey
TypeEnvironmental
TaxonomyEnvironmental → Aquatic → Marine → Unclassified → Unclassified → Marine → Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey

Alternative Ecosystem Assignments
Environment Ontology (ENVO)marine biomedeep chlorophyll maximum layersea water
Earth Microbiome Project Ontology (EMPO)Free-living → Saline → Water (saline)

Location Information
LocationIndian Ocean: TARA_051
CoordinatesLat. (o)-21.4689Long. (o)54.3066Alt. (m)N/ADepth (m)80
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F000048Metagenome / Metatranscriptome3365Y
F000075Metagenome / Metatranscriptome2622Y
F000108Metatranscriptome2213Y
F002456Metagenome / Metatranscriptome557Y
F008015Metagenome / Metatranscriptome340Y
F009522Metagenome / Metatranscriptome316Y
F021993Metagenome / Metatranscriptome216N
F023609Metatranscriptome209Y
F026786Metatranscriptome196N
F031887Metatranscriptome181Y
F035637Metatranscriptome171N
F039152Metatranscriptome164N
F089917Metatranscriptome108N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0193080_100529Not Available662Open in IMG/M
Ga0193080_100570All Organisms → cellular organisms → Eukaryota651Open in IMG/M
Ga0193080_100683All Organisms → cellular organisms → Eukaryota623Open in IMG/M
Ga0193080_101204Not Available547Open in IMG/M
Ga0193080_101209Not Available546Open in IMG/M
Ga0193080_101225All Organisms → cellular organisms → Eukaryota545Open in IMG/M
Ga0193080_101305All Organisms → cellular organisms → Eukaryota536Open in IMG/M
Ga0193080_101399All Organisms → cellular organisms → Eukaryota528Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0193080_100529Ga0193080_1005291F035637MGDPPFADWGRMATTLKVLVLALAFAAIATAAGVTVGIDNADLKEAMAEAEDTDSALEWTRKHLRNWGGDYKKILKHMPPLALKDAGINLAGARDLMGHTVSDSRMASNKELQILAKAENRLAAAKQDVHEAQTASSAVRRAQLEDLAEELTHEAVRRLEDTKEVE
Ga0193080_100570Ga0193080_1005701F002456HFPHTQGSKSAIYQFQIDNNNAKAKQTSAVQFTAEISPDMDIENAYFYNERSQSNPGAKAQKGKCKIGQRSPSSQKDIQKRFPGARITFPKLIRCNMHHINDFATVYVHAKVGPKVKEIPAEDMIKASYSQNNELKESVELPLIDPSHATEMFEDA
Ga0193080_100602Ga0193080_1006021F000075MGIIKKHKLTLQMKFAVALVATVAANQYDFMSEDELLSQLASNLSSAQMSEARGDGDAAVAKTAAIKNIQKALTARILKRLDDGQPLVEVARKMKAIEGMQPQINDMERRLGIMQSVEPVLENAIKTLQKVVDVRGMGKK
Ga0193080_100683Ga0193080_1006831F089917AEFLQDMDTEPAAPEQGKPFAQSYSFSSFTSNTNGEVVTHRSENFQSSAGEKLSRSRRAIGQGESAKAIEETMKGEEQTRTLHNMEEQDLPAFNQQHAARLHWQPFHALEPRAEQPKHALHAALEQDIDRVAGSQ
Ga0193080_100802Ga0193080_1008021F031887KTEAEKDAARQKMFDSMDLKSTGVITFDEWLKFCQEHIAAKTATMDPHPIIDHGSVDQYKTFIKAAMASPTCPEHVELYWYMLEIFTDHDSDKDGIIKMAEFPAMMNEFLTVPKKHNLPVPSEGQYEALFKKYDPRNDGRLTVDEWMSLASQEVYKKF
Ga0193080_100840Ga0193080_1008401F031887EKDAARQQMFDSMDLKSTGVITFDEWLKFCQEHIAAKTATMDPHPIIDHGSVDEYKKFIKAAMASPTCPEHVELYWYILEIFTDHDSDKDGIIKMAEFPAMMNEFLTVPKKHNLPAPSEGQYETLFKKYDPRNDGRLTVDEWMSLASQEVYKKF
Ga0193080_100970Ga0193080_1009701F000048MGTHAEADAEVAGLKTRFDAVKAVSDTWVKKCDVLVKEWILLDNTVTELNSWVAKDKSAEGENQFSLEKMESTLGELKNIFKQKEKLVEGL
Ga0193080_100980Ga0193080_1009801F000075MGIIKKHKLTLQMKFAVALVATVAANQYDFMSEDELLSQLASNLSSAQMSEARGDGDAAVAKTAAIKNIQKALTARILKRLDDGQPLVEVARKMKAIEGMQPQINDMERRLGIMQSVEPVLENAIKTLQKVVDVRGMGKKXAPDLNXTYLLI
Ga0193080_101169Ga0193080_1011691F031887EKDAARQQMFDSMDLKSTGVITFDEWLKFCQEHIAAKTATMDPHPIIDHGSVDEYKKFIKAAMASPTCPEHVELYWYILEIFTDHDSDKDGIIKMAEFPAMMNEFLTVPKKHNLPVPSEGQYEALFKKYDPRNDGRLTVDEWMSLVSKEVYKKF
Ga0193080_101204Ga0193080_1012041F021993YDPTTQNTNPQGQGISNGNGEEITHNAQQLKLGVNVWKGRFDKVWTNTRVNYANGIDMTVDSVYVTGPYSGKAIVRFNRTTGEYMHHFEDEYLNYPVDIKEFRDYIYVCSEDQVRKYNRMNGEFIKTHSQTDGLLGSFLLFHISWAQNLGE
Ga0193080_101209Ga0193080_1012091F039152TWGILFFSSRKKMIAEATGTRRSNWKLYVVAITCVCFVAGMLVNSVSESAPADNKLDSVVTQALKMFHGSKANMPNDDDYLAGEADFQKRFKTLKGKADLYLQNETLTAWKGMNAKNVLVAGAIEKAFPQKDIKVAIAKALAGSGSGSAS
Ga0193080_101225Ga0193080_1012251F023609TINPVHSINDLLSGTRFQEDKVAQGHLKSLGNAFHDISKTEEGKVLLDAMEGLASRIQTEGSFKDGVKVCFRLAKKITKNPWVQFGMFIGLFGLLVALVGQFGTISVLGAGFTQFMRSIGMFIVSSALMPLTHGVKQSVDKFMKGGEDFVEKKN
Ga0193080_101269Ga0193080_1012691F031887WLKFCQEHIAAKTATMDPHPIIDHGSVEEYKEFIKVAICPTPCPQQVELYWYMLEIFTDHDSDMDGIIKMVEFPAMMNELLNVPKKHNLPVPSEGEYETLFKKYDPRNDGRLTVDEWMSLATQEVYKKF
Ga0193080_101272Ga0193080_1012721F009522WGIQTRSFSLICSPEMAKAMRAAMRAAMRAMRAKKSIIANGKLAKLVVFRGTKTKTVGGLKKTDLVKSKTGKIVSRKQSAAGKKAYARIRGWTAAVIKARKFLNVKGFVAVKRGTPLYNKAKEFYKTK
Ga0193080_101305Ga0193080_1013051F026786GGGIYDWSGTQASHVYAESGVAMGKPEIDALVKGWWDGASDGWSSDKEYNELAKDYKPTLGGDEYVCLGRQAFGDGDCMHIFRCKSKKKGDKTMACFAHSDYTLALALTNEDKGKAPGPAQVAVQSLMEAYKEAGY
Ga0193080_101313Ga0193080_1013131F009522TSLRIMAKAMRAAAMKAMKRVSKFAKGKLAKLVVFRGNKEATLSGLTKSSLMRNKNGKIVSKKQSANGKKAYARIKGWTMAVAKARKALGVKGFVAVKKGTPLYKKAKEFYGQ
Ga0193080_101376Ga0193080_1013761F031887ITFDEWLKFCQEHIAAKTATMDPHPIIDHGSVDDYKKFIKAAMASPTCPEHVELYWYMLEIFTDHDSDKDGILKMAEFPAMMNEFLTVPKKHNLTVPAEGQYETLFKKYDPRNDGRLTVDEWMKLASEEVYKKY
Ga0193080_101399Ga0193080_1013991F008015MTTVKDSDGKAIEAILTEVFTLMDANGDNCIDEDEGVQIGMAMGESKEQASKSWAAMCKDMDDDGNTTIELPEWLDFYKKSLKDAVLDDVLQMLNQMKDTIKAHKDKEAAEKGA
Ga0193080_101496Ga0193080_1014961F000075MKFAVALVATVAANQYDYMSEDELLSQLASNLSSAQMSEARGDGDAAVAKTAAIKNIQKSLTARILKRLDDGQPLVEVARKMKAIEGMQPQINDMERRLGIMQSVEPVLENAIKTLQKVVDVRGMGKKXDAKQXSLSSLIXFYNXQLMSAFSLWASSLRSM
Ga0193080_101566Ga0193080_1015661F031887KFCQEHIAAKTATMDPHPIIDHGSVDDYKKFIKAAMASPTCPEHVELYWYMLEIFTDHDSDKDGIIKMAEFPAMMNEFLTVPKKHNLPVPSEGQYEALFKKYDPRNDGRLTVDEWMSLASQEVYKKF
Ga0193080_101677Ga0193080_1016771F031887FCQEHIAAKTATMDPHPIIDHGSVDDYKKFIKAAMASPTCPEHVELYWYMLEIFTDHDNDKDGIIKMAEFPAMMNEFLTVPKKHNLTVPAEGQYETLFKKYDPRNDGRLTVDEWMKLASEEVYKKY
Ga0193080_101678Ga0193080_1016781F009522TWGSRSDLVRLSEVFQTSLRAMAKAMKAMKAMKRVSKFAKGRFAKSVVFRGTKASTVGGLTKANLMRNKNGKIVSKKASANGKKAYARIKGWTAAVTKARKALNVKGFVAVKKGTALYKKAKEFYGH
Ga0193080_101744Ga0193080_1017441F000108HGKSTKNDGYGPKGVVCSRKPTQNCYNIPRKIQVEVCKTDVHRYCEKFSNIFPFPVEEQNCHFEPKKICELEMKTRPKKAKKYSYTKDCKEQPREICDQCEKKSLRPVCEMQQRLECVYTPKEECRDEDKQYCHKVEKVELVEECDMKFDTSYL

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.