NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008693

3300008693: Planktonic microbial communities from coastal waters of California, USA - Canon-50



Overview

Basic Information
IMG/M Taxon OID: 3300008693
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0117987 | Gp0126532 | Ga0103943
Sample Name: Planktonic microbial communities from coastal waters of California, USA - Canon-50
Sequencing Status: Permanent Draft
Sequencing Center: University of Hawaii
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 3690756
Sequencing Scaffolds: 14
Novel Protein Genes: 18
Associated Families: 17

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Eukaryota → Sar | 2
Not Available | 7
All Organisms → cellular organisms → Bacteria | 2
All Organisms → Viruses → environmental samples → uncultured marine virus | 1
All Organisms → cellular organisms → Eukaryota | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Planktonic Microbial Communities From Coastal Waters Of California, USA
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Coastal → Unclassified → Coastal Water → Planktonic Microbial Communities From Coastal Waters Of California, USA

Alternative Ecosystem Assignments
Environment Ontology (ENVO): marine biome, coastal water body, coastal sea water
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: Pacific Ocean
Coordinates: Lat. (°) N/A | Long. (°) N/A | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000237 | Metagenome / Metatranscriptome | 1498 | Y
F001521 | Metagenome / Metatranscriptome | 679 | Y
F001608 | Metatranscriptome | 663 | Y
F004235 | Metagenome / Metatranscriptome | 447 | Y
F010768 | Metagenome / Metatranscriptome | 299 | Y
F019484 | Metagenome / Metatranscriptome | 229 | Y
F031514 | Metagenome / Metatranscriptome | 182 | Y
F037051 | Metagenome / Metatranscriptome | 168 | Y
F039167 | Metagenome / Metatranscriptome | 164 | N
F045157 | Metagenome / Metatranscriptome | 153 | Y
F059997 | Metagenome / Metatranscriptome | 133 | N
F065844 | Metagenome / Metatranscriptome | 127 | N
F090440 | Metagenome / Metatranscriptome | 108 | N
F093978 | Metagenome / Metatranscriptome | 106 | N
F099434 | Metagenome / Metatranscriptome | 103 | Y
F100952 | Metagenome / Metatranscriptome | 102 | Y
F101265 | Metagenome / Metatranscriptome | 102 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0103943_10294 | All Organisms → cellular organisms → Eukaryota → Sar | 1089
Ga0103943_10376 | Not Available | 1006
Ga0103943_10419 | Not Available | 978
Ga0103943_10771 | Not Available | 816
Ga0103943_10859 | All Organisms → cellular organisms → Bacteria | 791
Ga0103943_11225 | All Organisms → cellular organisms → Eukaryota → Sar | 716
Ga0103943_11906 | Not Available | 612
Ga0103943_11965 | All Organisms → Viruses → environmental samples → uncultured marine virus | 605
Ga0103943_12343 | All Organisms → cellular organisms → Bacteria | 573
Ga0103943_12408 | All Organisms → cellular organisms → Eukaryota | 567
Ga0103943_12488 | Not Available | 561
Ga0103943_12502 | Not Available | 560
Ga0103943_12729 | Not Available | 545
Ga0103943_13232 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 513

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0103943_10294 | Ga0103943_102941 | F010768 | MKEVLNGGYNVHPVPPPNSEIKERIAKKYERKKTKIEKLFTLGYTTSGGP
Ga0103943_10376 | Ga0103943_103761 | F019484 | FYFLTILGGSLKKIAKKITTNVPISRKVYTNASII*
Ga0103943_10419 | Ga0103943_104192 | F039167 | MPSTNGTISEVIVGTGVLYVAQIANDGNASGDYVAFPADNGSGAWAAMASGWVDVGYSEHGCTLEGKLAQACMSILLIALGGSTFTSCDCVNYSSDYASLIPPSTDDFDEKSLLLKVDGPAGADRHVEIPRAINVGAFSMAHQKAPQKVVIATEFKVLLPKAVSQHTDLFRIVDNKNDTDVFDIN*
Ga0103943_10572 | Ga0103943_105721 | F000237 | TLTIVANIF*SLFNNIYKTYYIISTNKHLNTDQLTRLMILHYFVP*YYLYLVKLHVLFCHES*DSDSGENVYEDKSGSYVS*FYDAFLKEIQDA*Y*TLYIFVYFFIHHFDASTVNYHFFER*NISELDEIRFYGVAPH*YFRPLMGLLVVSPSHYEGLM*MGLWFVLLASLPIIYNLYNSNNSYLPVIPMQSSMLQTLCFILFMLSMYCTASMLPCGRYYYAPEGGYVGNP*VKFSYQYAYLYMA*VLHHLDLVEHYGFQYVQHLMRRHSNFKLYAQARQLPR*SSNSYLSRFSKS
Ga0103943_10594 | Ga0103943_105941 | F000237 | DG*MLGGYAFLWFHYIIFLGISLSCTHLSDLTLTIVANIF*SLSNNIYKTYYIMFTNKHLNTDQLTRFMVLHYFVP*YYLYLVKMHVFFCHES*DSDSGEATYEDKTGTYVS*FYDAMLKEFQDG*Y*VMYIFIYFSLHHFNGGTVNYFFFER*NISEVDEIRFYGVAPH*YFRPLMGMLTVCPTHYEGVF*VFLYLFSLAGLPMYNNFFNSMTKHHPAIPMQNSLLQTFCFIMFMMSIYCTASILPCGRYYYEPEGGYVGNPLIKFSYQFIYLYLI*LVHHLDAIDHYIFQFS
Ga0103943_10771 | Ga0103943_107711 | F065844 | DRDDQWYYENVDYNFVGPDKKSWLYFIVKDDNIVKCGESGNPLLLETKGKNPRWKKGTFSRLGRYINGCTTDTVIREELYSAVKKGKVSVWAKQLPVTTVTVNEMGEEKIVERSIHKDLKMVYFNI*
Ga0103943_10859 | Ga0103943_108592 | F045157 | EEVRNKIKTLRKTGHIRDAQSAIADMINLKSQQRK*
Ga0103943_11225 | Ga0103943_112251 | F037051 | PQAEGSAAVQLIEAAKNKLNKFYNPTMYKAPERRELTEEERIAVANGAVDPRDAEEAAAAGQGIAGTGITVFAQVRAASDAAPPPPPETFGAYTKKSGKSNGVIKLMDMMIGDVKTDLTDGEHAEEMAQKDYENLMAASQKTRATNAESITEKESAKSEWTEKIENAKTEHASTTEALAKLAEYIAGLHASFA*
Ga0103943_11906 | Ga0103943_119061 | F004235 | PNRTVTALIGEQDFIGKAITMIAVDWDVDADASREAMEAVSNTILSRATILAAGAVYDTGTKQDFLLEGDFTSTINDFTSLDGTLAQVLVEDIINLGTVDSINFGSGTVAVTIKTTFKYA
Ga0103943_11906 | Ga0103943_119062 | F099434 | MPATKNNFSHITNVELEGVATSSFTVDFINTMASETSDLSSGSATAGLEATRAVIGSYINILSEGPLHE
Ga0103943_11965 | Ga0103943_119651 | F093978 | DEPLMKEAISYLKTRYKEEIFNTSYKDHDQRQVLWMAYNMVDKIKGHLESVMNEGKLASKELDQLQDLTK*
Ga0103943_12343 | Ga0103943_123432 | F001521 | MLAMNKLWITEYIDAHEGIAIGPYIKADTIAQASRIAIQYGLLVLGEIQELEHDDQIKKRIVH*
Ga0103943_12408 | Ga0103943_124081 | F001608 | KDFKDSAAAVAKAIDVLNEYYSSAAFVQVAAKQPDASFGGAKGDVGSTIVSVLEVAESDFTSLLAESEADESAAQEAYDKLTQENAVTKATKTGDIKAKTSEVKQIEVALGNYKENKATVTEELDAVLLYLDKLKPQCETKVMSYAERKSRRESEISGLKEALEILEG*
Ga0103943_12488 | Ga0103943_124881 | F090440 | SLGGNEISGLKLATGLGLKRVYHCRITDVGEDDDLRIIIRKSIGG*
Ga0103943_12502 | Ga0103943_125022 | F031514 | MDVTGGVDAENRNIVYADEYPEEPKKGELNKDFGLYVERPFYIVSKLSSHRYLDMINNKDFVIKTPNGRNTQVWFFDQKSYTIKTQYNKKSWDITSSGKTNKM*
Ga0103943_12729 | Ga0103943_127292 | F101265 | SLGGNKISGLKLATDLRLKRTYHCRITDVGEDYDLRIIISKSIGVYIVKTTSSEIVEAN*
Ga0103943_12887 | Ga0103943_128871 | F100952 | RHRGAKQNQRVNPLADNYTGQQISSRFVQPGVMVLNKENTQFMLPWIEKYKDIKDDRIDDGMFLNSCIVDSDVPLLDMDKKFNHKNNGEQFDYNNVYFLHCAGGKKHKRQVKIWDKLKKIYPEVNPDLSGLIQ*
Ga0103943_13232 | Ga0103943_132321 | F059997 | AKSVLNTIDSFIDRAKELTSLRLEKGKMLSKSAQDSLMQIQDRIQEVYNDLDSILGLGAEQEEAKQPSDELDKLWLTTQEVLAQSQGITIEGEKE*




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice