Basic Information | |
---|---|
IMG/M Taxon OID | 3300009333 |
GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0117984 / Gp0126422 / Ga0103833 |
Sample Name | Microbial communities of water from the North Atlantic ocean - ACM52 |
Sequencing Status | Permanent Draft |
Sequencing Center | University of Georgia |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 32113248 |
Sequencing Scaffolds | 20 |
Novel Protein Genes | 26 |
Associated Families | 24 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Eukaryota → Sar | 1 |
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Gonyaulacales → Lingulodiniaceae → Lingulodinium → Lingulodinium polyedra | 1 |
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Bacillariophyceae → Bacillariophycidae → Bacillariales → Bacillariaceae → Fragilariopsis → Fragilariopsis cylindrus → Fragilariopsis cylindrus CCMP1102 | 1 |
Not Available | 5 |
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus | 1 |
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 1 |
All Organisms → cellular organisms → Eukaryota | 4 |
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 2 |
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Choreotrichia → Choreotrichida → Strombidinopsidae → Strombidinopsis → Strombidinopsis acuminata | 1 |
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Suessiales → Symbiodiniaceae → Symbiodinium | 1 |
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Siphonostomatoida → Caligidae → Lepeophtheirus → Lepeophtheirus salmonis | 1 |
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean |
Type | Environmental |
Taxonomy | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → River Water → Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | marine biome → marine water body → surface water |
Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
Location Information | |
---|---|
Location | North Pacific Ocean |
Latitude (°) | N/A |
Longitude (°) | N/A |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000073 | Metagenome / Metatranscriptome | 2639 | Y |
F000075 | Metagenome / Metatranscriptome | 2622 | Y |
F000237 | Metagenome / Metatranscriptome | 1498 | Y |
F000981 | Metatranscriptome | 814 | Y |
F001028 | Metagenome / Metatranscriptome | 801 | Y |
F001439 | Metagenome / Metatranscriptome | 694 | Y |
F005505 | Metagenome / Metatranscriptome | 398 | Y |
F006501 | Metagenome / Metatranscriptome | 371 | N |
F009464 | Metatranscriptome | 317 | Y |
F011139 | Metagenome / Metatranscriptome | 294 | Y |
F013645 | Metagenome / Metatranscriptome | 269 | Y |
F017830 | Metagenome / Metatranscriptome | 238 | Y |
F023858 | Metatranscriptome | 208 | Y |
F025911 | Metagenome / Metatranscriptome | 199 | Y |
F038531 | Metatranscriptome | 165 | N |
F039152 | Metatranscriptome | 164 | N |
F041786 | Metagenome / Metatranscriptome | 159 | Y |
F043763 | Metatranscriptome | 155 | N |
F046166 | Metatranscriptome | 151 | Y |
F061512 | Metatranscriptome | 131 | N |
F071939 | Metatranscriptome | 121 | N |
F078190 | Metatranscriptome | 116 | N |
F079563 | Metatranscriptome | 115 | N |
F100323 | Metatranscriptome | 102 | Y |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0103833_1001971 | All Organisms → cellular organisms → Eukaryota → Sar | 715 | Open in IMG/M |
Ga0103833_1002194 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Gonyaulacales → Lingulodiniaceae → Lingulodinium → Lingulodinium polyedra | 688 | Open in IMG/M |
Ga0103833_1002356 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Bacillariophyceae → Bacillariophycidae → Bacillariales → Bacillariaceae → Fragilariopsis → Fragilariopsis cylindrus → Fragilariopsis cylindrus CCMP1102 | 668 | Open in IMG/M |
Ga0103833_1002444 | Not Available | 659 | Open in IMG/M |
Ga0103833_1002642 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus | 640 | Open in IMG/M |
Ga0103833_1003114 | Not Available | 603 | Open in IMG/M |
Ga0103833_1003117 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 603 | Open in IMG/M |
Ga0103833_1003194 | Not Available | 597 | Open in IMG/M |
Ga0103833_1003529 | Not Available | 577 | Open in IMG/M |
Ga0103833_1003619 | All Organisms → cellular organisms → Eukaryota | 572 | Open in IMG/M |
Ga0103833_1003665 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 570 | Open in IMG/M |
Ga0103833_1003727 | All Organisms → cellular organisms → Eukaryota | 567 | Open in IMG/M |
Ga0103833_1003752 | All Organisms → cellular organisms → Eukaryota | 566 | Open in IMG/M |
Ga0103833_1003791 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Choreotrichia → Choreotrichida → Strombidinopsidae → Strombidinopsis → Strombidinopsis acuminata | 564 | Open in IMG/M |
Ga0103833_1004051 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Suessiales → Symbiodiniaceae → Symbiodinium | 550 | Open in IMG/M |
Ga0103833_1004071 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 549 | Open in IMG/M |
Ga0103833_1004320 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Siphonostomatoida → Caligidae → Lepeophtheirus → Lepeophtheirus salmonis | 537 | Open in IMG/M |
Ga0103833_1004718 | All Organisms → cellular organisms → Eukaryota | 521 | Open in IMG/M |
Ga0103833_1005040 | Not Available | 508 | Open in IMG/M |
Ga0103833_1005146 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 504 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0103833_1001623 | Ga0103833_10016231 | F000237 | MVLHYFVP*YYLYLIQMHVAFCHES*DSDSGEATYEDKTGTYVSWFYDAMLKEFQDG*Y*VMYIFTYFSLHHFNGGTVNYFFFERWNISEVDEIRFYGVAPH*YFRPFMGLLVVTPTHYEGLM*MGL*LGLLAFLPIMYT*YNTYNNYLPIIPMQFSLLQTGAFILFMLSMYTMNSMLPCGRYYYDPEGGYVGNP*VKFSYQYAYVYMF*FIHHLDVIEYYSVNFSRAFIQRSFTLSQQGYRRDKTNLQH*N* |
Ga0103833_1001700 | Ga0103833_10017001 | F000073 | LTGGKVSTPTESLKLKDTFVNWMESKKELRNICDEGKGKLQWEREIPQEALDNLQNLANSDFKKNLGKIIHDIYLTGHNLFKDMPQGDHKKRAKYRFASKTLMRILPNELKAEVEGLIAGKTMTLDLYEILGQCTWGQGKSL* |
Ga0103833_1001971 | Ga0103833_10019711 | F046166 | EHTCPSGHVLQRHKAPNSDYSCDVCGKGVAEGETLWGCRLCDYDRCQQCANKGTVEFTDVDGDEVVLKDDNAFGIECYINGVLVNGEPMMKLRVSDRTIALEGDSADKWAAATVPIGQEYILKQALNLFAEVQSRKGV* |
Ga0103833_1002194 | Ga0103833_10021941 | F100323 | MSPAVEEPGLSLFCFAVYTKNTGSPKTSQELELFRMQRENSWSLFSCAEWAVYSDVVEDLGGGVKTIEVRDVKGDFNILKRKETGCWVNTGMFVQVWSAIRDAGHATNHNWVIKVDADAVFFPSKLVRALSDYTVPQEGVYMENCKYVDWGYFGNLEVFSKQAFITLVDNLETCYTSIPWKDGVLGGKYGPMGEDLFAQKCMDMLGVGRQENWMLT |
Ga0103833_1002356 | Ga0103833_10023561 | F025911 | NVVFGAAGHNGRMIVGGDLTVDGEVTELHNYEYDPASHPLPLGDDLSKICEMQPPPPCNETAFKIMTSEEVCPSKPEGVVKLVKSTADLPEDEPMIYNIIMEPPKDGAHTVKFQVDNPFTNYTDIYVKHSKKAGKYGMDPVCESMPFTAGCELEAPLIEVSCHEYDGVDPFALVSIYFASNEDAYVTDHAVLGTEIDKCCHPPSEYLEDGYGIIKYTFEIQC |
Ga0103833_1002444 | Ga0103833_10024441 | F078190 | AAKRKAGRKAMFDAIDGADGFKPRGKIAMGQFLGWSTTHIFSKVASIKGETGRVAFRHVEDFTKEEYVAYVEEAVNKPDSIASTTLYNYLLTLFVEADVDCKGKISYEQFDGLVEIAAASPRHFGLAPAGRDAAARRMMFDAMDYNKSGYVTFRKFLRFVREHAREKVADYKASN* |
Ga0103833_1002642 | Ga0103833_10026422 | F041786 | SLMTFGDKFSASEVDNAFGEFLIEDGMIDAVHLKGLMVSKKEEEAE* |
Ga0103833_1002714 | Ga0103833_10027141 | F000075 | SSAIAAANRYDSMNEDDLLVNLESTLSSALSSEARGDSDAAVAKTAAIKNIQKALTARILKRLDDGQPLVEVARKMKAIEGMQPQINDMERRLGIMQSVEPVLENAIKTLQKVVDVRGMGKK* |
Ga0103833_1003114 | Ga0103833_10031141 | F006501 | GLTKDANEKCQFLDFLTEPAALPKESADKKTKLAYGETMMGYWCNKDEQFKACSAATDALAPVVKECNKKQTQFESDFCAMAIVYHAQCQDLNDVCYTETRAAYDSSVASTSKLLGKWKIEYQALKKINCFLDVWMENGDANTVSSEKLAACKATEADASIMNIDFGTPVKEFVCADAGFGTLPDYPGTPEFVTKEYGAWP |
Ga0103833_1003117 | Ga0103833_10031172 | F013645 | MNPAFLLGTAFNIAYWHRKYHSGTICKGVSNPQTSNALSGCEKEAYEKNLKEKEVKNRKD |
Ga0103833_1003194 | Ga0103833_10031941 | F043763 | DFTDEAPEELIVSGREGYNETMNGRYQRGDRLHEGRVFYSHTERKFVIRWCPAKRSWFFDWRGLNTDTTASAALAQDIEHPHLATQAWRVFDGKKWISDAKLALCATIEKKQSEGEHVDFSEMNGYEEGTSALTGGVSV* |
Ga0103833_1003529 | Ga0103833_10035291 | F001439 | EAGAQTADGAQQGKDMVCVCKRIKKPANVKCNRTVEIEEPPQDPKTLKELVQCTPGGYNLTQAKRLLLRKEAKQRQEMELAQEKFEVDQVKLMTTCLSGHCNPASDLNSPTFFTPPKPVEPMECYDRCHDNQCKPIWKNPAQGWEDWYACVQQCVSGCYIIQ* |
Ga0103833_1003619 | Ga0103833_10036191 | F061512 | IGSMIGNLTCVMTKLDMLDSALQVNLKLWTTDVWQQLYLKKTLAGEDPEWRQKMIGGYTDCYQVASNWPQQSLDRNPITKVFGRHMIFFKCAKKNEMTNCALGQMKRWVEQWYGASENNTAYGLPEDEYEAAGLGLMVLDNAASEEEKFVSDFFWGVADM* |
Ga0103833_1003665 | Ga0103833_10036651 | F011139 | MGLFLGLLAFLPVVYNLYNTFSKYVATIAMQNSVLQTTAFIIFMLSLYCANSMLPCGRYYYEPEGGYVGNP* |
Ga0103833_1003727 | Ga0103833_10037271 | F038531 | QKIMKIASVFVCITIALVNGKPQGPFLGLGPGPFHGPGPLPYALGPAVCPEEVCTACEEEGATDLKPILEINCQEGLEDCHASAKDCSANAVDCNEPLKDCTCHIDPQDEGCTKECLIGPGPCLRENAITVGKCVAEHREEIAKCVLEQENTVGKCLNEKRISLAKAEECLACAPGCKKEEK* |
Ga0103833_1003752 | Ga0103833_10037521 | F023858 | KLWTCPNGRNSCVEYYLKLNGPNCCKCDQPSLQPPKMWDIPKSNFFTKVGFVGYEDTTELDDAPIKGAEHWATSSVLPKVLTVTYDYFLHREADDVITHRIDFNTSTGSEGSILYGGFAVAHDIDAHRAKFAQPEVCKGNIVPCCDTDEVMSKWCKHDYAVQQAEKSAVTV* |
Ga0103833_1003791 | Ga0103833_10037911 | F009464 | GYFGNLEVFSKQAFATLVNNLDTCYSSLPWKVGVHGGKYGPMGEDLFAQKCMDLMGVAKQENFGLTTDGACEADRPEGQKKNKKFVPTCAGVSTPSIHPFKKPEAYRECWAQAASVQP* |
Ga0103833_1004051 | Ga0103833_10040511 | F001028 | DGEGKMEISDAAKSGDGTLQNVGIKVGGKFKSPVNNQCFDTQAALDLHLKYLHDPKKQGSNLEE* |
Ga0103833_1004071 | Ga0103833_10040711 | F005505 | *Y*VMYIFIYFALHHFNGATVNYFFFER*NIAEMDEIRYYGVAPH*YFRPYMGILVISPTHYEGLM*MGLFLGSLACLPLIYNVYNTFNKYVSTIPMQNSILQTTTFTLFMMSLYCANSMLPCGRYYYEPEGGYVGNP*VKFSYQYMYLYLC*LLHHLDLIDHYIFQFSQTFLRKINPNLLK |
Ga0103833_1004257 | Ga0103833_10042572 | F071939 | TSNWEPHLIHYEPKKVLIKEWVTSDSFADDISSVFYEVDRDGNHMLEWNNGEIRNFINKVYQMKGLATPCESTMYDMYRIFDEDNNGGLDAVEAQHLAQAHVMSLVTALHL* |
Ga0103833_1004320 | Ga0103833_10043201 | F079563 | MKIYSNLRQMGGEELGHSEGTDFVIAEDLGHLLVGDEELLVFGILEVVLFQVSPKLFDAFSTASLFFANNVGEVSAKLHGFGESGSFGHFWMFFGG |
Ga0103833_1004689 | Ga0103833_10046891 | F000073 | KEYMDENSWMREHIETGKGNLQWERDIPPAALSNLENLAAKDFKGNLGQIIHDIYQTAHQMFSDMPQGDHKKRAKYRFASKTLMRILPDANKKEVEGLIEKGNITLDLYDILGQCTWLQPAA* |
Ga0103833_1004690 | Ga0103833_10046902 | F000075 | MEAYEAMNEDQLLVSLQSKLNSALSSESRGDGDAAVAKTAAIKNIQKALTARILKRLDDGQPLVEVARKMKAIEGMQPQINDMERRLGIMQSVEPVLENAIKTLQQVVDAR |
Ga0103833_1004718 | Ga0103833_10047181 | F000981 | KTGRGALTLDQFVEWANAHVVSAIPKIPTGDVGLYHVEDYSEEQYIGFVEKAVNNPGSYEHASFYNFILNCFVEADEQCQGRITYDQFDKLLTRAATVPRHFGLAPPESSTEARKKMFDELELKRGGKGTGYVTARTFWEWTVVHVSAMIDLQKAGKGWRENH* |
Ga0103833_1005040 | Ga0103833_10050401 | F039152 | GPRRSNWKLYVVAITCVCFLAGMLVNTVPESAPADNQLDSVVTQALKMFHGSKANMPNDDDYLAGEKDFQKRFKTLKGKADLYLQNETLTAWKTMNKKNVVVAGAIEKAFPQKDLKAAIANAMKGSGSGSGSR* |
Ga0103833_1005146 | Ga0103833_10051461 | F017830 | LWFNHKNSMAGTTILLQLKKGDEVCVYAYTGTWLADFPMNHYTHWVGLLLKPSQEEAEQLKKEAYESC* |