NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300025970

3300025970: Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - White_ThreeSqA_D1 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300025970
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0111376 | Gp0101403 | Ga0210081
Sample Name: Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - White_ThreeSqA_D1 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 119736411
Sequencing Scaffolds: 27
Novel Protein Genes: 29
Associated Families: 29

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 3
All Organisms → cellular organisms → Archaea | 1
Not Available | 6
All Organisms → cellular organisms → Bacteria → Proteobacteria | 2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Eisenbacteria → Candidatus Eisenbacteria bacterium | 2
All Organisms → cellular organisms → Bacteria | 4
All Organisms → cellular organisms → Bacteria → FCB group | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Ignavibacteriae → Ignavibacteria → Ignavibacteriales | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Phenylobacterium → Phenylobacterium soli | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae → unclassified Desulfobacteraceae → Desulfobacteraceae bacterium IS3 | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Natural And Restored Wetland Microbial Communities From The San Francisco Bay, California, Usa, That Impact Long-Term Carbon Sequestration
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands → Natural And Restored Wetland Microbial Communities From The San Francisco Bay, California, Usa, That Impact Long-Term Carbon Sequestration

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: USA: San Francisco Bay, California
Coordinates: Lat. (°) 38.131752 | Long. (°) -122.266335 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F002477 | Metagenome / Metatranscriptome | 555 | Y
F008627 | Metagenome / Metatranscriptome | 330 | Y
F015245 | Metagenome / Metatranscriptome | 256 | Y
F020261 | Metagenome / Metatranscriptome | 225 | Y
F020359 | Metagenome / Metatranscriptome | 224 | Y
F025254 | Metagenome / Metatranscriptome | 202 | Y
F047211 | Metagenome | 150 | Y
F048676 | Metagenome / Metatranscriptome | 148 | Y
F052687 | Metagenome | 142 | Y
F055469 | Metagenome / Metatranscriptome | 138 | Y
F058158 | Metagenome / Metatranscriptome | 135 | Y
F060099 | Metagenome | 133 | Y
F061999 | Metagenome / Metatranscriptome | 131 | Y
F062887 | Metagenome / Metatranscriptome | 130 | Y
F074891 | Metagenome / Metatranscriptome | 119 | Y
F076224 | Metagenome | 118 | Y
F077315 | Metagenome / Metatranscriptome | 117 | Y
F077325 | Metagenome | 117 | Y
F078289 | Metagenome / Metatranscriptome | 116 | N
F080179 | Metagenome | 115 | Y
F082716 | Metagenome | 113 | Y
F084254 | Metagenome / Metatranscriptome | 112 | Y
F090405 | Metagenome / Metatranscriptome | 108 | Y
F093897 | Metagenome | 106 | Y
F103291 | Metagenome | 101 | Y
F103506 | Metagenome / Metatranscriptome | 101 | Y
F105200 | Metagenome / Metatranscriptome | 100 | Y
F105202 | Metagenome | 100 | Y
F105217 | Metagenome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0210081_1000036 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 31964
Ga0210081_1000059 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 25049
Ga0210081_1000066 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 24209
Ga0210081_1000146 | All Organisms → cellular organisms → Archaea | 11644
Ga0210081_1000801 | Not Available | 3683
Ga0210081_1001181 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3093
Ga0210081_1006553 | Not Available | 1542
Ga0210081_1010453 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Eisenbacteria → Candidatus Eisenbacteria bacterium | 1263
Ga0210081_1012533 | All Organisms → cellular organisms → Bacteria | 1166
Ga0210081_1013932 | All Organisms → cellular organisms → Bacteria | 1108
Ga0210081_1017038 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Eisenbacteria → Candidatus Eisenbacteria bacterium | 1011
Ga0210081_1018505 | All Organisms → cellular organisms → Bacteria → FCB group | 972
Ga0210081_1020435 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Ignavibacteriae → Ignavibacteria → Ignavibacteriales | 927
Ga0210081_1021821 | All Organisms → cellular organisms → Bacteria | 899
Ga0210081_1023061 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 875
Ga0210081_1024298 | Not Available | 855
Ga0210081_1026060 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 827
Ga0210081_1030101 | All Organisms → cellular organisms → Bacteria | 774
Ga0210081_1036011 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Phenylobacterium → Phenylobacterium soli | 709
Ga0210081_1049750 | Not Available | 609
Ga0210081_1056962 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 571
Ga0210081_1057899 | Not Available | 567
Ga0210081_1058001 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium | 566
Ga0210081_1059766 | Not Available | 558
Ga0210081_1068413 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 522
Ga0210081_1071354 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae → unclassified Desulfobacteraceae → Desulfobacteraceae bacterium IS3 | 512
Ga0210081_1074166 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 501

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0210081_1000036 | Ga0210081_10000367 | F105217 | MKMASNVNSNNERNSLIDSIQYRLFYAEINLDKIPTFIPLDIFPKDKIASEVAIDGFLFYANSALDLVFVEINKKLELGLPPNQINPENIMLNFNSKKSNDSKMMLEEFQKYFQKPTHEEKIISDKEFNDGLNRYGFDVIGFHAEYEARGQEKYQHFWNRASSRLWEIRNQQDLESYDSLLKNAGKRGKDEPRNYLRVKLADDNRPSVYWDSAHYENPKRYFTNVLSLVKQFIDRILEILKPLYPSDSSLVV
Ga0210081_1000059 | Ga0210081_100005923 | F093897 | MKPGFYISMGIAIAGLIVGFVLLNDIQQEKESKNIWIQKIPIQCNDVWEREHQEFYDLNPELLNSNKEKSKEILETIIKNHYEKAGISILDLNLELDVIDEIRCESCDCLGSDRVSIKIPKNQFELISQSEGWKPLE
Ga0210081_1000066 | Ga0210081_100006613 | F077325 | MSDKGNWTEKDEVAIHAIIVNMLNQKQMFQDLGKTTILPNSFNIKNSNEYILGLFTGIVINLFANYWVGEHESGLSPEDLSYLYYKISLFREEIVSGLFD
Ga0210081_1000146 | Ga0210081_100014612 | F082716 | MISKKDIRLMPFTFIIAPILLISLTFVVVTAYYDVIEEKEFIAGLSCPELMIYTDKQTIESKLYFGNENYLIYAEERLHNMC
Ga0210081_1000801 | Ga0210081_10008013 | F103291 | MEMFGLSPVFATILIVSMGVILQNFLGWLKSKEDYDIKNALASTIIAFIVGITIIGPQIEAIQDQMLSELSELTIFASLIASIAGFDVLTKNVFKIANSKIHLQNKPI
Ga0210081_1001181 | Ga0210081_10011812 | F055469 | MLKRTVVILAAAAFFWGTVPTQTAQARDDIWDLMNPSWWADKMFDNDDDWRHYRHYAYNPYWGGPYAQRPRVIVIQPPETEAQNPDTRPPE
Ga0210081_1006553 | Ga0210081_10065533 | F062887 | MPVKQCSGGHKFGNKGKCYKGKGSKAKAARQGRAIKASQAKRGK
Ga0210081_1010453 | Ga0210081_10104531 | F025254 | MEESVREMVFLIVEQLKAYVDGDEDALLELTQVLDSGRHDADVVNQAFELIFRALEPYAREDFSEDPTTPRRSVRVPTGSERALLDAPGYQYLYGLIEQGRISPEQFEEIMSRVREDTSFLDTEASARELATTV
Ga0210081_1012436 | Ga0210081_10124363 | F105200 | MKPVQGRHRRRNVPDSFERKLGQLLEYSDAPHAEAFTMNVMRSVRREQRRRKIILWSFGLVGALFGLSGALMLTGPVSELLTFSLEMPVMKTMQVTLFVVGATA
Ga0210081_1012533 | Ga0210081_10125331 | F060099 | MGLTDDQRDDAAKALRAAQEAIEYTVKGVKGLEEAATLRGRIDEIELYLKRAKMALKFI
Ga0210081_1013932 | Ga0210081_10139321 | F020359 | MAARQSRRHKTPDEFWSKVLAAPKVKRILERRGVSPDDFQRDYEADNSRGARTPKRPTRGQIDAVEAFQKSGDFEAFKRALSTNSSAVANSALRRVVQFKAMGGSKTIRRRGGAENA
Ga0210081_1017038 | Ga0210081_10170382 | F080179 | MAVAAKSVTSTRPVDIRVRFEPGLGMALYASVAPGEVGSVTVARNLGRGRAMLRYRGFTLMTQGAPDWRDGESFSVVVKEVGPPLLLASTNRTAKDDQRVTTVDSAVVPGGGESAEAGR
Ga0210081_1018505 | Ga0210081_10185052 | F076224 | IFVTPDSIFEDSFVFDPVLMEKRIEWTEKKIKANEEKIKEVTFKVEFDSAFRYYSEEMKGHPFTVAFDSNICRVDVPNFKAYRTELPEIAAIDPLIELATNNIHFYAYKIPKIEKSNSEIKIEYYNDDSVYSYVVDYNVINIDSIVQANAGGRDLITLDKFKPVQPNDDTMLIKYQFDRDYYKRFYSDEEFKKQMEELQKELQELSKDVNNQKIRVHTEVTKTPKK
Ga0210081_1020435 | Ga0210081_10204351 | F052687 | YGLKTQYDKGTIFAKMLETVIRKSIETMVKFLTIVLLLTALGGGTYYLLSMETVEDVKVMGKLQISEQYGSYVMASQSKDANYYAVVEGKVKNNMKKPIKNVFIKYVIAGKETSATIFDLAPGQELHFNTSGVATTGSSPEYDFVGIYYD
Ga0210081_1021821 | Ga0210081_10218212 | F008627 | MMCRFLNADFELALRCPVDKCNHNVRFQAYPQLRDHALEVIACDAAGDADRLTCGKNCRALIESGHYWQSIYPESAIYSRNL
Ga0210081_1023061 | Ga0210081_10230612 | F090405 | MKAALAFVFAIGLLVPGIAASDPSDCGRLMYQINHFEGMADRAEALGRDDWAEKTQRHVDVLETRLANRCPAFSARDEQQEAARQMALLLKMAATAAAKFFTMGMY
Ga0210081_1024298 | Ga0210081_10242981 | F058158 | MGVTKKDRENRIDKRIINQKSYMSEFTKTEKDDIDIWELHVKEREYILAQIANDRRTSRIRNWIELIFMLGFIYYMLYDIYKDKDLDTIIKFLKA
Ga0210081_1026060 | Ga0210081_10260602 | F077315 | MMICRASTLFLSRLAYYQLWTLLLLGLLVPAQAAEALNPDALPQAQATLERLEEQLATARTANAQELKTLRKEVAMVRSTAQDCLQQAEPKIEILDSELAILQPAKPKDTQKKTAQETQPAEQAEAPVSPAIAGQLQELLSSKASLEGRIAICKLLLLKSNYLDSNVDDYLRTVQTRQLLARGPTLVSV
Ga0210081_1030101 | Ga0210081_10301012 | F020261 | MVVQEDLAVESFMVSSFGAFVMSLCQGLFSLRSEQTLAFLACGWALASGERQTITTYLWLSGATRVKHFSRFYVFLGGALYQARWQLWGRVIQQAAQWVPAEAVIVLEVDDSTKKK
Ga0210081_1036011 | Ga0210081_10360112 | F061999 | HFNRGILRGLEPKKIDVQMLHRNPPRLGEMLGEDDWFDKLGPKGRDYTRAAYMARLHAFTEQCRQRILRGGPPRPTR
Ga0210081_1040304 | Ga0210081_10403042 | F103506 | MGTYMINTESMLETDHFIILDESNTVRGNEFDRSYPKVQIVYPSSMNSYPDNKVTESRPENGSNYKEKEKHPFTITLNVDAD
Ga0210081_1049750 | Ga0210081_10497501 | F084254 | EVIRQSKRHPRADQVASALAIRGLPIAEKDVMTVFDQYDIEKKIADSH
Ga0210081_1056962 | Ga0210081_10569622 | F015245 | EVPESNRLIEIMFGYHPKASAAHGIADPMHWELISKMPWAGLGTPRKHPKFRHMDGSVFNGQLYIDDRLVVDKHGMLDRSLLHHPEVLEVAAEFGDPYQVLAPVSHEAHGSNTAW
Ga0210081_1057899 | Ga0210081_10578991 | F078289 | SIFILIIVVFPLYGFNSLVRHEDIPKMRELGISQEVIQYIISNQTSSVSSEDVIKMKQSGLNNNDIMSAIKSDLYRPEQKSTSMKEVELIAKLKESGMSDEAVLQFIQTVKSTRRVDSDGNVTKQYTNESQRTQYPTTGATFPKLDNYGYDPSNGRFLFFIKPQNQE
Ga0210081_1058001 | Ga0210081_10580011 | F002477 | IMAEPATGFLFAALVPAGNPSDVSYVVPLLDKVQGAIQRVTTTPKRQVHSVAGDLGINDATLRHTLHARGILTVGIPQTVEPLDPTPSPEASRAILTAASLTGKRTVHQVQGACTAGYSRPVVESYIASLLARGAGQLRYKGLAGAGVQLGMTVLAQNGATLVRIREQRLTKRAQKFRRFLRLPRPKP
Ga0210081_1059766 | Ga0210081_10597661 | F105202 | PSISALKPEEYSGEIDKYNKIIQSDLHLDIQQRAHLYLASLYFSPMNPKRDYGLALEHLETYALFDPDFANAVDPRLLLAAIIEIERFSALAEAQTKEIQALSQELDILKRQSAAFRGSRQDIQSANLKLQKRIGQLQKKVRNLETSNAKLNKTIEMLSTLDSRLKEKRINFIKTDSIEE
Ga0210081_1068413 | Ga0210081_10684131 | F074891 | MSAVTHDVVLKRKPRKKRNKGEAALRSLRRALARRKLELMREDEILHEQIYDVFEEEAGKKA
Ga0210081_1071354 | Ga0210081_10713541 | F048676 | RLCGTASDAVGAVKLEIPKGFRLLKFKMSECLKNIIKLTISKI
Ga0210081_1074166 | Ga0210081_10741661 | F047211 | EKETIDYAYEQMRQRATVDLVAPEAAIENLIKMVSYVDKRATTIDRSKLTDYSLLKELAQTGQLPAKR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice