Basic Information | |
---|---|
Taxon OID | 3300027759 |
Scaffold ID | Ga0209296_1000046 |
Source Dataset Name | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130206_EF_MetaG (SPAdes) |
Source Dataset Category | Metagenome |
Source Dataset Use Policy | Open |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Sequencing Status | Permanent Draft |
Scaffold Components | |
---|---|
Scaffold Length (bps) | 109757 |
Total Scaffold Genes | 166 |
Total Scaffold Genes with Ribosome Binding Sites (RBS) | 123 (74.10%) |
Novel Protein Genes | 13 |
Novel Protein Genes with Ribosome Binding Sites (RBS) | 11 (84.62%) |
Associated Families | 13 |
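
The RBS percentages in the table above appear to be the count of genes with a ribosome binding site divided by the corresponding gene total, rounded to two decimals. A minimal sketch that rechecks that arithmetic (the function name `rbs_percentage` is hypothetical and not part of IMG/M):

```python
def rbs_percentage(genes_with_rbs: int, total_genes: int) -> float:
    """Return the share of genes that have an RBS, as a percentage."""
    if total_genes <= 0:
        raise ValueError("total_genes must be positive")
    return genes_with_rbs / total_genes * 100


# Values copied from the Scaffold Components table.
all_genes = rbs_percentage(123, 166)
novel_genes = rbs_percentage(11, 13)

print(f"All scaffold genes with RBS: {all_genes:.2f}%")
print(f"Novel protein genes with RBS: {novel_genes:.2f}%")
```

Under this assumption the results agree with the reported 74.10% and 84.62%.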
Taxonomy | |
---|---|
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Siphoviridae | (Source: IMG-VR) |
Source Dataset Ecosystem |
---|
Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake → Freshwater Microbial Communities From Northern Lakes Of Canada To Study Carbon Cycling |
Source Dataset Sampling Location | ||||||||
---|---|---|---|---|---|---|---|---|
Location Name | Lake Simoncouche, Canada | |||||||
Coordinates | Lat. (°) | 48.2311 | Long. (°) | -71.2508 | Alt. (m) | | Depth (m) | 1 |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000808 | Metagenome / Metatranscriptome | 882 | Y |
F004092 | Metagenome / Metatranscriptome | 453 | Y |
F004132 | Metagenome / Metatranscriptome | 451 | Y |
F004405 | Metagenome / Metatranscriptome | 439 | Y |
F004475 | Metagenome / Metatranscriptome | 436 | Y |
F007112 | Metagenome | 357 | Y |
F011077 | Metagenome / Metatranscriptome | 295 | Y |
F012455 | Metagenome / Metatranscriptome | 280 | Y |
F018928 | Metagenome / Metatranscriptome | 232 | Y |
F034566 | Metagenome | 174 | Y |
F038549 | Metagenome / Metatranscriptome | 165 | N |
F041188 | Metagenome / Metatranscriptome | 160 | N |
F047494 | Metagenome | 149 | Y |
Protein ID | Family | RBS | Sequence |
---|---|---|---|
Ga0209296_100004610 | F004405 | AGGAG | MEDCTTTDFLYPMKADLYYPVINQTQYGQASRTWFYDRTIICNATSIGGAGTEQIKPEAFLQHENKLIARVKADPRMSSTETENAINNILITNIRNANDQLIYRETGGSRSGRGTIYEVATVDPFTGPFGSIEYFKILLRRTENQTITD |
Ga0209296_1000046105 | F007112 | GGA | VHTLHYIAVEADSKQEAFDKVVVSLQSNEDGYRLADWSDWHVVGGGRWSTNANKKSENNLMGGYNDDPTDVIGYAENKEKFQEVVKDILRFRSQTMNRNIVEIKTDKFISQMVDYASEGGRGPWNGDTLMNVYSIKQAAEMLMGSWTSDSGFYDLQEHVSEFEYLDERLDKPDRAVRQYLVPVDFHF |
Ga0209296_1000046112 | F004132 | AGGA | MSEVMTQEQLSVPYNPNLLVTYKAIPDTYAAPESPTFMTDKVTDLEWELHQGRTNANLAAERRLDISWLEDQIPEWYDPNYSKEEVLKALTQHFNLNPVKQMSVYGTVTFSGTINIPLDEIEDFDLSSVTIDAELSSYDYDADLTVDDVSLEEN |
Ga0209296_100004612 | F038549 | N/A | MVSIVSAETGFPPLFVNAFVNSELKEFELMPTGPEPFQPFFPAQVPDSVEGIYNDIPFIRNNPDTTVIIFDRLMRFRPTPFYKHKREQLIYFIYSPNLSKLFDTTRVIIECLDREDVAAQALNSWIAENDIEDENGNVIPKNVFFHNLKVYQADESRDIIELASARTLGLNKLVIEYDYHTVSVEGSNQRYS |
Ga0209296_1000046120 | F000808 | GGA | MEYTYSLTTSYDGELIHTLRVSDMLEAVRAWDKCVDYGFAKEYATYNLSDPTGKMYTKTFYTNGEVVIK |
Ga0209296_1000046128 | F012455 | N/A | MLKEIKNKIIRIQELRRSNAATPIPNKKKYSRKVKHKNAK |
Ga0209296_1000046131 | F047494 | AGGA | MNSLLNKVCKHTANLLATSIVNDMKYSFCENCEQNIYSHYIEDSELLSYWSSWKVGK |
Ga0209296_100004631 | F004092 | AGGA | MSNNFTIYPGKFPCKTCQEEVNSLRYWRETGDATWMCSKKHISKVGLIPPKRKKKDFANE |
Ga0209296_100004668 | F018928 | AGGGGG | MFIRSLNTMEKIISKNNNLLWNGWDVIDLKESDIAKTSPMGIRVKDKWYIHKVYSPGRNGWDIPNKYRE |
Ga0209296_10000468 | F041188 | AGGAG | MNLTIEELSTKTVMALKAYAKKNKIELFEANTKLEILEILASWIPPEKTEEQVQEADKAKSMINKIALYSERNLHMDNLGALKVGYNIVSKEASEKWLTHRLVRVAPPEEVAAYYGK |
Ga0209296_100004683 | F004475 | AGAAG | MSNWTEELSDEHKEQIWHFIVETVKEIREQIAQDIQGTSDLWKAKGLNKSRRTTKAFEISAAIARGQNEI |
Ga0209296_100004692 | F034566 | AGG | MSFKLKSIIWVQSALIVILSVFMLFMGGDLKQSRKMTLDQSNYCIRYTSDIIASSRLDLIREQDAHSRTVDKANSVIDDIVNRYNSLVARYNKSTGGTNYDPLNTYSYRIRP |
Ga0209296_100004694 | F011077 | GGAGG | MHELMDTNLTWAEEDVNLWKGWTYSIEKNRYYFNDIGDESLASFWADDFLNQAYL |