NMPFamsDB

A database of Novel Metagenome Protein Families

Scaffold Ga0209400_1000029

Overview

Basic Information
Taxon OID: 3300027963
Scaffold ID: Ga0209400_1000029
Source Dataset Name: Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_130807_EF_MetaG (SPAdes)
Source Dataset Category: Metagenome
Source Dataset Use Policy: Open
Sequencing Center: DOE Joint Genome Institute (JGI)
Sequencing Status: Permanent Draft

Scaffold Components
Scaffold Length (bps): 133591
Total Scaffold Genes: 181
Total Scaffold Genes with Ribosome Binding Sites (RBS): 58 (32.04%)
Novel Protein Genes: 11
Novel Protein Genes with Ribosome Binding Sites (RBS): 4 (36.36%)
Associated Families: 11
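
The RBS percentages above follow directly from the gene counts. A minimal sketch of that arithmetic, assuming the percentage is simply genes with an RBS divided by total genes and rounded to two decimals (the function name `rbs_percentage` is hypothetical and not part of NMPFamsDB):

```python
def rbs_percentage(genes_with_rbs: int, total_genes: int) -> float:
    """Return the share of genes with a detected RBS as a percentage (2 decimals)."""
    if total_genes <= 0:
        raise ValueError("total_genes must be positive")
    return round(100 * genes_with_rbs / total_genes, 2)


# Counts taken from the Scaffold Components block of this record.
all_genes_pct = rbs_percentage(58, 181)   # all scaffold genes
novel_genes_pct = rbs_percentage(4, 11)   # novel protein genes only

print(f"All genes with RBS: {all_genes_pct}%")
print(f"Novel genes with RBS: {novel_genes_pct}%")
```

Both values reproduce the percentages reported for this scaffold (32.04% and 36.36%).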

Taxonomy
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales (Source: IMG-VR)

Ecosystem & Geography

Source Dataset Ecosystem
Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake → Freshwater Microbial Communities From Northern Lakes Of Canada To Study Carbon Cycling

Source Dataset Sampling Location
Location Name: Lake Montjoie, Canada
Coordinates: Lat. (°) 45.4091; Long. (°) -72.0994; Alt. (m) not given; Depth (m) 3

Associated Families

Family   Category                        Number of Sequences  3D Structure?
F009669  Metagenome / Metatranscriptome  314                  Y
F010817  Metagenome / Metatranscriptome  298                  Y
F032162  Metagenome / Metatranscriptome  180                  N
F034466  Metagenome / Metatranscriptome  174                  Y
F034827  Metagenome / Metatranscriptome  173                  Y
F038565  Metagenome / Metatranscriptome  165                  Y
F047507  Metagenome / Metatranscriptome  149                  N
F050171  Metagenome / Metatranscriptome  145                  Y
F078229  Metagenome / Metatranscriptome  116                  Y
F082149  Metagenome / Metatranscriptome  113                  N
F104483  Metagenome                      100                  N

Sequences

Protein ID            Family   RBS   Sequence
Ga0209400_1000029139  F050171  N/A   MGMFNYIKVEQDLPLNDELKALDIDFKKEEFQTKELEESLMSTYIIRDYRLFELKITSHWEDNPDYVKDGSRFGEFFNKNKLVKDSEEEVFRDDYTGTFTFGAYISGKTKESYDYFPDWKCVVVQGLLTEISLIKPIEKNSSCQRIEAQESFMKEIEDHERKMKCPVYSFYFKYYVKTMNLVEWKLSIGIVKIINFLNWLQWKGVRKVIRILTPR
Ga0209400_100002914   F047507  N/A   MINLNLNLALKTTKKISKLWNWKTKTLLVGVALIVSWISCLKIGFELKKYNSITNLPNSCFVDSMIYASRCNLLLISSSDTWNSVYGFTFGYKDDKEAILGHAVCVFEYNNNLWMYDPNWGTSPICKIGDRKKYKEKIKLYINKTYPIIVIEDFMLNDWTYVQKTKKNKMNKTYQEVSIHLDEDKKE
Ga0209400_1000029145  F082149  AGG   MKLGLHIDWCIGRWFGRNNNLEIRVTLPTISLGFKKEDDDKWFGFKFNLRTDISFESYEYGKIFTLILFGFGVKVSKFNL
Ga0209400_1000029150  F104483  GGAG  MNKKIIDKIDLVIEDIFNLQELIDFDTSDSSEYDLDSVVSVLAEIRGRVSNQEEEVERIDLKQL
Ga0209400_1000029158  F034827  N/A   MKYNITSQDKEALTEFDQIKEGEVFSFFTPESTDRGNHAIYMKIRVPNTTSVNILDLGDGKAYDFSAKKRNTINDPDEKQGNRVYKLNAKINIVVL
Ga0209400_1000029159  F078229  N/A   MSKIKYTYYIEQDKDKDGNLVQGWSIYKTPLVKTIKVKTFKKLKDADAFLEKNESN
Ga0209400_100002918   F034466  GGA   MRNLLLDNYEKSLPSKRDTEDFITNFHQYRAKKKAQQKTHYGLLFACILILTMIGSIVAKQTKNNLDIQTAAGLEKCNTQLMEKK
Ga0209400_10000293    F032162  AGG   MKQNLKISYQTFGENNAYLLEGSIKQINHFFNSIYNWEGTNGKLHDMGNGKAFYFYAHPDDVMHALTKVALYSLVNKINAKGRKGGLLDLAKAKAQSIVDAMGQTCFLWGASSSEGYSLGTISAEKPSDYCGAVSNGRD
Ga0209400_100002935   F009669  N/A   MKTMLSKIFGPNWRSSSSGIATVVAVCTAIAIHSDPTLVAFLPDAAEVYITGIAKLIAVVSGIVFALTVKDAAVTGGTVAQTNEAKGRTNGENI
Ga0209400_100002938   F010817  N/A   LKKFVSVLALNLFLVGCATVTPDKIQDDKSSYDATTPKQYDKDNGGLISFIGDDALITYQARERYNNLISMYRIKFKKEKAIDLKEDSGIKVYKDNFNNNLYLIDSEHLVYFGVLNSWLKEKVPQDNIIDKVLDKTN
Ga0209400_100002984   F038565  N/A   VTNKKKFNFPASLLKQIDECSFGGYILFNFSNKGEPQVYTKFDNQINAMALLYYLNTWGQSIDQLNLEATTDLIARKNEEEDSEEED
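
For downstream tools, the sequence records above are often easier to handle as FASTA. The sketch below is illustrative only. It assumes that each Protein ID is the scaffold ID followed by a numeric gene index, which matches every ID listed here but is not stated by the source. The helper names `gene_index` and `to_fasta` and the header layout are hypothetical.

```python
SCAFFOLD_ID = "Ga0209400_1000029"


def gene_index(protein_id: str, scaffold_id: str = SCAFFOLD_ID) -> int:
    """Extract the numeric suffix that follows the scaffold ID (assumed gene index)."""
    if not protein_id.startswith(scaffold_id):
        raise ValueError(f"{protein_id!r} does not belong to scaffold {scaffold_id!r}")
    suffix = protein_id[len(scaffold_id):]
    if not suffix.isdigit():
        raise ValueError(f"no numeric gene index in {protein_id!r}")
    return int(suffix)


def to_fasta(protein_id: str, family: str, sequence: str, width: int = 60) -> str:
    """Format one record as FASTA, wrapping the sequence at `width` residues."""
    header = f">{protein_id} family={family} gene_index={gene_index(protein_id)}"
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return "\n".join([header, *lines]) + "\n"


# One record copied from the Sequences table (family F078229, no RBS reported).
sequence = "MSKIKYTYYIEQDKDKDGNLVQGWSIYKTPLVKTIKVKTFKKLKDADAFLEKNESN"
record = to_fasta("Ga0209400_1000029159", "F078229", sequence)
print(record, end="")
```

Keeping the family ID in the header makes it possible to group sequences by family after export without consulting the table again.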

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice