NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300003821

3300003821: Freshwater microbial communities from Crystal Bog, Wisconsin, USA - CBE22May08



Overview

Basic Information
IMG/M Taxon OID: 3300003821
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0110170 | Gp0088305 | Ga0007843
Sample Name: Freshwater microbial communities from Crystal Bog, Wisconsin, USA - CBE22May08
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Bioenergy Institute (JBEI), DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 22602139
Sequencing Scaffolds: 19
Novel Protein Genes: 22
Associated Families: 19

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 4
All Organisms → Viruses → Predicted Viral | 3
Not Available | 7
All Organisms → cellular organisms → Eukaryota | 4
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Oligotrichia → Strombidiidae → Strombidium → Strombidium rassoulzadegani | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Freshwater Microbial Communities From Crystal Bog Lake, Wisconsin, Usa
Type: Environmental
Taxonomy: Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater → Freshwater Microbial Communities From Crystal Bog Lake, Wisconsin, Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater lake biome | freshwater lake | lake water
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Water (non-saline)

Location Information
Location: Crystal Bog, Wisconsin, USA
Coordinates: Lat. (°) 46.0072 | Long. (°) -89.6063 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000155 | Metagenome / Metatranscriptome | 1877 | Y
F000787 | Metagenome / Metatranscriptome | 891 | Y
F005903 | Metagenome / Metatranscriptome | 386 | Y
F014608 | Metagenome / Metatranscriptome | 261 | N
F030662 | Metagenome | 184 | Y
F035159 | Metagenome | 172 | Y
F036227 | Metagenome | 170 | Y
F036528 | Metagenome / Metatranscriptome | 169 | Y
F036591 | Metagenome / Metatranscriptome | 169 | Y
F038058 | Metagenome | 166 | Y
F039455 | Metagenome | 163 | Y
F048735 | Metagenome | 147 | N
F064665 | Metagenome / Metatranscriptome | 128 | N
F074449 | Metagenome | 119 | Y
F078045 | Metagenome | 116 | N
F088135 | Metagenome | 109 | N
F098526 | Metagenome | 103 | N
F102402 | Metagenome | 101 | Y
F104360 | Metagenome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0007843_100930 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 2419
Ga0007843_101191 | All Organisms → Viruses → Predicted Viral | 1973
Ga0007843_101280 | All Organisms → Viruses → Predicted Viral | 1866
Ga0007843_101993 | Not Available | 1308
Ga0007843_102140 | All Organisms → Viruses → Predicted Viral | 1244
Ga0007843_102353 | Not Available | 1159
Ga0007843_103289 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 902
Ga0007843_103494 | Not Available | 863
Ga0007843_103585 | Not Available | 845
Ga0007843_103761 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 816
Ga0007843_103833 | All Organisms → cellular organisms → Eukaryota | 807
Ga0007843_104534 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 717
Ga0007843_104566 | Not Available | 713
Ga0007843_106283 | Not Available | 589
Ga0007843_106497 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Oligotrichia → Strombidiidae → Strombidium → Strombidium rassoulzadegani | 579
Ga0007843_106852 | All Organisms → cellular organisms → Eukaryota | 562
Ga0007843_107603 | All Organisms → cellular organisms → Eukaryota | 531
Ga0007843_108069 | All Organisms → cellular organisms → Eukaryota | 514
Ga0007843_108334 | Not Available | 507

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0007843_100930 | Ga0007843_1009305 | F102402 | MDTTAENIRRVVAILTAILSEQEDTAYSMVLESNPIELFSALTGVLLSALHTLAKINKTEVGDYLQHLGMSTFDNE*
Ga0007843_101191 | Ga0007843_1011911 | F038058 | PRNNMASHGVQYTVDKTERGIVSDHAMPQPMQSVQIKGDIPTIRAYKDARTARIKAIGEANQRVFSVGGPANETSMGKGSPFHSDFL*
Ga0007843_101280 | Ga0007843_1012801 | F048735 | MEYNGFSLEQPKDTNYGIPNLIHVPQMFRELTAYRLTRGEFGRRERIKNGIKLEQSGLLNPAQHMINAFQLIYGNDVLLHSQGIPNNYAIDIIDLFCNENDWGIAGCASSGKTFSVAACIVIDWLCAPDCTSTYVASTSLDASEDRLWGKVCTLYRTAMRNIQAQYGANQSIGNLVEYRRMIVFETIDTKDTERDYTNAIKALAFPKGGEGKRSVENTRGRKNARMRLFLDELAEMDLYALDTRVNLGANPDFIFGGMANPAATANNPHTELCQPDDPMEWDAVTRYTKKWKTRTGVALHLSGEDSPNFKVPDAEIPPFDRFLTVQGEAATLKRCYGNKNALEYWRNVYGWWPDSSVELTIFSKQFIQACDINWEPVWSNRTRVVCGFDPAFTAGGNRCAATFCRFGPNDTGRNLGFYLGTREYTSSVGDVFEESIAMQLVKDCLEYGVHPRDFGLDISGDGGKMMRAIIIEWSKFHPEAMFVFPISSMGMPTERKISNLDKRTCKEAYDRLVTEYWFAVHTALSTRSLVGIDVEKHSQVVNELCSRLYYHKGRKVAVEKKLDMKHRLKKSPDLADSLTYAVQMLRR
Ga0007843_101993 | Ga0007843_1019931 | F039455 | NLGRYGDVCSFLPVLQNEYNETGTKPSLIISKDYAEILDGVSYVEPIVYDGPFEDIAGALDYAKTLGHTVTTTQVVGITDVIVSQVYGNHHGPAIICDSFQQDAWRLANKLDLWPRQLPLVFDIRDKKREKRLYRGIPKTKPWIVVSTGGNSSPFPYNDLLWEILKNCLPEFHIIDLAEIKAEKFYDILGIMDHPNTAQLILTDSGPLHLAYATTKPVHAIVTDRPSLWHGTAWRPFYASYTRYRNFPRDVTRILDLIRTPRPKPTLPNIVHVYQRMPWATGDEKRRNALAEKTWKGIGWVDLGLDDNCFVRSSAEVIPDEKRRIPMIKDMLRLACLGRDDTDVLVMTNTDTCVASNFIQRLEGTLPAYAYRHDFKRLDEPLSYDRISAGAKYAGCDLFAMRVGWWRRNHHLFPDMVLGRHSWDRIMRELFNLSQ
Ga0007843_102140 | Ga0007843_1021401 | F036528 | RKAFDAACELCDNKVYRRDVGYGVAVLREEVYEEVDGKKKLTGYRDVTSTYEVEENGKTVLKKKPYVGIISQGMRNFWNQIAVISEKYGSLNEREIEIMRQGAGTDTTYMAFALDKKEIENISTRYSKFVPDIEAFLNRIGSQEYYDAQLRGVKPEAKDDFSTPTTSKVVEDEYEDEEYVDIEEATTADRLKAKLNG*
Ga0007843_102353 | Ga0007843_1023531 | F014608 | MQINAQIGAQIRQSHRNKANARGFFSKTYTPCYRTFVASYYFFFILELLVLKINPAIAVPDPAPTLLDSNFVEPPNLQLLFRPSGKYAASTHFIHIRVPFNFSQLLATPDNIFFQYHNYIELWPEPFRTQVEEIAEISRSCIADKLNDFIDILDALPQNEVITRDKRFLDLIALGMSTAALTLSTFNSAKISKLETQIASSNKRIDHLVDITSLHEKHFKAVDQKLDDVADKVALMLKVNKVHFAKMTDFMEQKFGTAVAISERLIHTAYNNKLSPGALHHEALL
Ga0007843_102991 | Ga0007843_1029913 | F005903 | WIGQPIMTVYDPRNMTWDYYCALMAELFAQNQLGTVPEARWREWVDGMNGIGNFNNSGIPDSRGFATWQDWACQMVGIMNVEQ*
Ga0007843_103289 | Ga0007843_1032891 | F036591 | DRDMELKVAQMKMMTERNTQVLLAHINNGAKIEVARIGAQDDDGAQAYLTEEEYVKAQEHPMQPIANAIGQGNNQMAQAIAALVDTINQQHNRPKTVLRDENGKIVGVH*
Ga0007843_103457 | Ga0007843_1034571 | F000155 | MKFAAALIATVAAXRXESMNEDELLVNLESTLSSALSSEARGDADAAVAKTAAIKNIQKALTARILKRLDDGQPLVEVARKMKAIEGMQP*
Ga0007843_103494 | Ga0007843_1034942 | F098526 | MAYSIFTAAETDAFFNSIPDVGVINRQANPTGHKVVINNITYSILFDHNGAFWTIFEHPNGSFDRQSKILQRIHQSYTTYCCGVWQLGSFYCPFDAPQELEDLMIKMIASCARYKFAKGVMQGYFYKSRRKGAVYQHDRILKAFIRNGFVDNSPETFNPNSGNMIKGLVRPCQPPKAKRATRAMLQQRLDTLLGVLHGNT*
Ga0007843_103585 | Ga0007843_1035851 | F078045 | RWLKSPRTLSEAIKDAEYATSITRPEDGEYSLFWGLLGALLFVAIFGYGFWRYVNL*
Ga0007843_103761 | Ga0007843_1037611 | F074449 | FATPQGTIINPDNSEWADLFTMDIDLQNRDIDSFMEQVGDLTSIFGHCWIAVDMPQTAQGNLGRPYTCAISPIDVWDWEWEYYGGKPILKYVKVKEMEDVDYFYLKCYYLGDANTPSHWASYKLPKMSLSGQLDNEAECIGAGQFPAGMSIPLFIAFGRKDPRVIDLGVSDIDAASDAMREHYKLECEAYTALQFAHTIIRAEKGIAIPVHAGAIVRALKDQVEAIKIDTGDVEQIIKKQNDILSNLEGLVGMGGQRQDXQQVASGISIIE
Ga0007843_103833 | Ga0007843_1038331 | F088135 | IASGTTGSTSLVFNLRYASVKAMIAIFGGNAAAKSANKLMDSFDITSNNGDFQFNIGGVNYPQKSLSTLNNKAGIMMELRRAMGSILNNNVSLSVNVNEFNITDSSANPTTVVLPGKFWVAQNLQKLTVNQKAFFTGVSTQLAPINLNINIGTATTQAYTPLLVLLYDAIIEIDPATKQIIMIQ*
Ga0007843_104534 | Ga0007843_1045342 | F035159 | VKNGGDDAPWTREELXRLKGIXXKLRLAEAKRIAALKADQEARKQTITDLVNPKPVAKTQQSNIQSNQEVSVDIPSNLANIDRYIANLEQQQQDLQNAVLIRSAKLRLEQELAILEAKRQAELDDEEALLAIIL*
Ga0007843_104566 | Ga0007843_1045661 | F104360 | INSTRPPFTEIPITPAYGQDPSILSDFQTGNFQGAWPYKSIKTGSADGFVVAVAGTIYFLSIVNNIGTLYKLISGNDPTMMHTWFVQAEDWMYIQNGYQDPIAWSGDISGTPTNLQAQGDGLTTIALTWTDNAPGAKTNQIQCQYNGSVFFDVASVPYNQTRYNFSASSSTQSYAFQVRSVYPDGSFTPWSNIATTAASNISQTTAQNNTVYRLNPVKQQMPIGTIMAYAYGRVAVS
Ga0007843_106283 | Ga0007843_1062832 | F030662 | VFASTMRLKDRNGPIPQGLWYEYSDDKGNRYRVNGMEMTYGRLFSQKVASDMTINNVAVPDNLDYLIEQQICGRIPSQYCWQEAGDKVANVIHSFASLGDRVAQSLGVNSNLEQTAKGCTACQKRRQALNQVLG*
Ga0007843_106396 | Ga0007843_1063962 | F036227 | MKVSDLTKKSTTYSAIIQQMMNYQYAYLGGYIFKQQVRKKRPSEDSVLWNDLITNTVAQPICRYVVDTINDILFETGVKRDIRFATPQGTIINPDNSEWADLFTMDID
Ga0007843_106497 | Ga0007843_1064971 | F000787 | KYYSKHNKMKLFALVALFAAVEAGTPPTVGLGHAFIPANREDYDNAASLWKNNWAKYRKAHPNDQDCSISESDNWKGAQQCSQSWECRGARLCERGGWCSGYDGCEGTPLPDQAPGLSPDH*
Ga0007843_106852 | Ga0007843_1068521 | F088135 | CYNMIDMGQEVQNEIIRMNPKLRIKTSSFATSIAPNIATATSGSTSLVFNLRYASVKAMIAIFGGNDTTQSANKLFDSFDITNANGDFQFNIGGVNYPQKSLSTLNNKAGLMMELRRAMGSILTNNVSMSVNVNEFLIVDGGATTVVKPGKFWVAQNLQKLTVNQKAFFTGISTQLAPINLNINIGT
Ga0007843_107603 | Ga0007843_1076031 | F088135 | DMGQEVQNEIIRMNPKLRIKSSSFATSIAPNIASATSGSTSLVFNLRYASVKAILAVFGGTDTARSANKLFDSFDITSSNGDYQFNIGGVNYPQKSLSTLNNKGGLMMELRRAMGSILNNNVSLSVNFNEFNIADPAXAAPTSVVLPGKFWVAQNLQKLTVNQKAFFTGVSTQLAP
Ga0007843_108069 | Ga0007843_1080691 | F088135 | DMGQAVQNEIIRMNPKLRIKSSSFATSIAPNIASNTVGSTSLVYNLRYASVKAILAVFGGNDTTQSANKLFDSFDITNSNGDFQFNIGGVNYPQKSLSTLNNKGGLMMELRRAMGSILNNNVSLSVNFNEFTIVDGGNTTARLPAKFWVAQNLQKLTVNQKAFFTGVSTQL
Ga0007843_108334 | Ga0007843_1083341 | F064665 | FELYKSDLNQVFKLFRDLLASLPHVPERERRQWDVASFAIATSSLALSTYNTVQISKLESAIETXKQKTDLMADILQIHEQHLHQLDRMIEDIGNEIAKLKIQTGYYFSIDRAIAQVISDNNKLRAVVAIFERVINSAFDQKLAPGALSVDVLDTIVHHIKDTAAKNKF
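
To reuse the rows above in other tools, they can be turned into FASTA records. The following is a minimal Python sketch, not part of NMPFamsDB. The function name `row_to_fasta` and the header layout are hypothetical, and it assumes a trailing `*` marks a translated stop codon that can be dropped. The pattern accepts a row whether or not its four columns are separated by ` | `.

```python
import re

# One row of the Sequences table: scaffold ID, protein ID, family, sequence.
# Column separators (" | ") are optional, so fused rows also match.
ROW = re.compile(
    r"^(Ga\d+_\d+)\s*\|?\s*(Ga\d+_\d+)\s*\|?\s*(F\d{6})\s*\|?\s*([A-Z*]+)$"
)


def row_to_fasta(row: str, width: int = 60) -> str:
    """Convert one table row into a FASTA record (hypothetical header layout)."""
    match = ROW.match(row.strip())
    if match is None:
        raise ValueError(f"unrecognized row: {row[:40]!r}")
    scaffold, protein, family, seq = match.groups()
    # Assumption: a trailing '*' is a stop-codon marker, not a residue.
    seq = seq.rstrip("*")
    # Wrap the sequence at a fixed width, as is customary for FASTA files.
    lines = [seq[i:i + width] for i in range(0, len(seq), width)]
    header = f">{protein} scaffold={scaffold} family={family}"
    return "\n".join([header] + lines)


if __name__ == "__main__":
    example = (
        "Ga0007843_103585Ga0007843_1035851F078045"
        "RWLKSPRTLSEAIKDAEYATSITRPEDGEYSLFWGLLGALLFVAIFGYGFWRYVNL*"
    )
    print(row_to_fasta(example))
```

Residues written as `X` are kept unchanged, since the page does not say how they should be interpreted.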

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"