NMPFamsDB

A database of Novel Metagenome Protein Families

7000000731: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 763536994



Overview

Basic Information
IMG/M Taxon OID: 7000000731
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0053325 | Ga0031239
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 763536994
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 124671135
Sequencing Scaffolds: 17
Novel Protein Genes: 17
Associated Families: 16

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1
Not Available | 7
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 1
All Organisms → Viruses → Predicted Viral | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F0040 | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Pseudoprevotella → Pseudoprevotella muciniphila | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: National Institutes of Health, USA
Coordinates: Lat. (°): N/A | Long. (°): N/A | Alt. (m): N/A | Depth (m): N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F046433 | Metagenome | 151 | N
F068942 | Metagenome | 124 | N
F072446 | Metagenome | 121 | N
F077404 | Metagenome | 117 | N
F080166 | Metagenome | 115 | N
F081455 | Metagenome | 114 | N
F085820 | Metagenome | 111 | N
F089057 | Metagenome | 109 | N
F095629 | Metagenome | 105 | N
F095631 | Metagenome | 105 | N
F099452 | Metagenome | 103 | N
F099453 | Metagenome | 103 | N
F099454 | Metagenome | 103 | N
F103432 | Metagenome | 101 | N
F103433 | Metagenome | 101 | N
F103436 | Metagenome | 101 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
C2885894 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 742
C2893355 | Not Available | 793
C2901718 | Not Available | 860
C2904828 | Not Available | 889
C2919020 | Not Available | 1054
C2921924 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 1099
C2952528 | All Organisms → Viruses → Predicted Viral | 2146
SRS014271_WUGC_scaffold_15102 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F0040 | 1040
SRS014271_WUGC_scaffold_17555 | All Organisms → Viruses → Predicted Viral | 2562
SRS014271_WUGC_scaffold_28266 | Not Available | 1470
SRS014271_WUGC_scaffold_30217 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 695
SRS014271_WUGC_scaffold_38209 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 723
SRS014271_WUGC_scaffold_42157 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Pseudoprevotella → Pseudoprevotella muciniphila | 553
SRS014271_WUGC_scaffold_44746 | Not Available | 1399
SRS014271_WUGC_scaffold_45032 | Not Available | 4693
SRS014271_WUGC_scaffold_8434 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 11502
SRS014271_WUGC_scaffold_8782 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1099

Sequences

Scaffold ID | Protein ID | Family | Sequence
C2885894 | C2885894__gene_153776 | F103436 | MITTSKDGWCDRSDAEILNSLKDWVSRCDVKYVKSDALKKIDSAFALWANGQYVSALRLLDENEVFLKKSDWPYYALGIEILRVRKHEFLNE
C2893355 | C2893355__gene_157637 | F089057 | MVHPIKFGCTIYIVLEANMTNIIPLIAKKYNRKGDTSGTLKSLVDDLVFIEDVDDSLLFITNIPRETKYSIEEVFNIITSNDKYSEVLSNVLGSLNIDLDYHKLLLNAIDSESYKIISLISDNIPTPDLFLSKNNYGCLTTALGKSYTIFDKVLGMVISQLLHTSSKEDKILSLFMTICIVNKDIDKLASLCTGYLAISKDEVLVKKLMNESATTAFQYMSEEDIHNVVDDINSRSVLSRYLSRM
C2901718 | C2901718__gene_161896 | F072446 | LLFGLCLYGFTACDSDHEPTKPVRPFHGDTLAQIAWNFRFIVEEHYHSIPGIVPEGTTYRVPVIPRSVEDRTKKEYNEQDVDKVAHLVFRATVHGDTINRHKKELETLARQLHALTLTTIGTSPVLCGVKSIKAVGVAENGRTYDLSWEMKLRIRDYSGRVKYNSGIVTLDCEDTESLTARYVVQLGQIREHELAEHIQPELKFYLPVKRCMDFSSIRFDITLFNGEVLSFQHKLPSKSVLQELPNNSVLENYKQYGFEREATYFTTLWPVPEKEIER
C2904828 | C2904828__gene_163426 | F103432 | MKLINSLFSLSLLLALSGLFCTTACQDDAEPTQRAGLISTDSLIHAAKVYDGKAFEHVVSTTTAGLRVSEPRRVVPMLPRQLHVEMEGKTLFRRHNLPSVSAYSFQVLAVGDTIYRQKESDAQFNADLDALFHQSIGIAPRLFGVKELSVLGIDSRGKTRDLGNYSCPLLLGKIQNVNYRTREGVFHEHYEAASVDTFSVKDDWLLKTKAEPSLYVPSFRLLVWEQPAEGCTKLRFTLTLVDGRSLVAEVPLY
C2919020 | C2919020__gene_171050 | F081455 | MEITNTENKNLFQQLSSLGFDVNESLLELEKDYSPVEEIGMQNVEIIDSAEEAIQQPLANTDSSIAVNFSQMINKPEVEEVKTEVASVPDNGETKVNVFFPKNEHILSNYVDYDSFNKIKESNTETIVRAVRLLNYKMSDQNAAMKFGQFVSEFNSECDPNKRLRYELIRHQGREKDLVVRLSTVINGTTKYYADIYPDLNKIDVDHHLISSARK
C2921924 | C2921924__gene_172699 | F099454 | MKASKLLWAVVMALTFVLTSCDRLTDEPTLEDRGYYKYFDSTAQHKSFRVVTASGKPYNHKIDWHIIGILDPKSETYLTKKVDTLSNGDLKISYDWVAFIVRENKSVIDVEVQNNETGQDRSVDFVAQDNHKGLASPSMTVIQRAK
C2952528 | C2952528__gene_193163 | F095629 | MRELIICACLFGCFGVANAAAPVDQPKEVKVVHNDDNVALHKKIYKLEQRIERLEKLLAEKEGK
SRS014271_WUGC_scaffold_15102 | SRS014271_WUGC_scaffold_15102__gene_16893 | F077404 | MAQQIIMTHKLAAAALSLKEPTQIGNTQNPFAMNTLKTIFTCFFTLCFMMVANSYAQKTDSINTEAGKSVLHRNAIYIPPALEQYADTALLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTHNDRIELSNFVQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRK
SRS014271_WUGC_scaffold_17555 | SRS014271_WUGC_scaffold_17555__gene_19538 | F095631 | EAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGINNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGANGNISYKDSIYFRVKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARRNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIGLVHRDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMNMKIFVFAKDVFGYNAGLHKTDQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITYITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA
SRS014271_WUGC_scaffold_28266 | SRS014271_WUGC_scaffold_28266__gene_31867 | F068942 | MIRKILSLPTLALCFTLGSAFFAGCNEDYIEDTETKVRWSNVKPPQYGDPINITLKAEGETFTTVGDYPWISFRSYASTLDTFTRHSFSEADKDTAYYKDIVIYLTRNKREQTATLKLVAPPNRTQQPKQFDFSIGVTPLGTYIFKVRQPALPAKAQ
SRS014271_WUGC_scaffold_30217 | SRS014271_WUGC_scaffold_30217__gene_34205 | F080166 | VNILANFENYNKVVEQIFELNYQLTLKMEVTFNNTIKRINTEIKENFHTEYVVGANKLTTNLRYRYRMRLSPRGETVGVIIDWDNYDDLCTIIDEAIDICDPGNKTSPFKRMYSTAGDLLDIKCDSLKVRYLHLEDRFGDRLDLIPFVLIDDHSGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGAMARYMSMTPLLGTNRQNMMK
SRS014271_WUGC_scaffold_38209 | SRS014271_WUGC_scaffold_38209__gene_45186 | F099453 | MLRRKDMNRFDIIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVAGYLDTKTNIARTTPYGCIYISLEAFADTVEAHKFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYRDEIELQCVKQSCQWILDNIQYIRSLGLVVIPEVYQARLANLTNEVYTPKYPMAIAMGKLEYMLGRKFREFSNNNIEIEYVDRLKTHYTFMVCENRVYINSANLNDLGE
SRS014271_WUGC_scaffold_42157 | SRS014271_WUGC_scaffold_42157__gene_52320 | F068942 | MIRKILSLPTLALCFTLCTALFAGCGENNEGFVTEVRWSNVKNPEYGEYINIRLKAEGETFTTVGDHSWISFSNDASTIDTFTRHDIPKVDKDTAYYKDIVIYLTRNEREQTTTLKLVAPPNRTQQ
SRS014271_WUGC_scaffold_44746 | SRS014271_WUGC_scaffold_44746__gene_59457 | F085820 | MRSTFYLFAVLFLATTFFSCETDEPSPRPTWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGDRSIFAKPLGETVEADYLWLPGRDVFGLVAPPNPDHLKQRIVLRLADGTEIEKELSEKGTK
SRS014271_WUGC_scaffold_45032 | SRS014271_WUGC_scaffold_45032__gene_60837 | F099452 | MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSMNIFNLYKDIINLDDISLLAELRHTEWYKDWFTSDKRNSDLIDLSKFNFRVLERFEKEEYLRDAEHYDFEGVSEVDSYDLFNTLREDEDIELFKLAAENILINHGFFNNTDYNLYEIPDEYMSNQEVCLYMCLLNTDNLDFMDKKTFDSTLLYNIVKDRICGSVYFTIFDSLNEDTRTRAR
SRS014271_WUGC_scaffold_8434 | SRS014271_WUGC_scaffold_8434__gene_9661 | F103433 | MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFVVMLAFSPWIAGYARNIYWIEPLLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFISLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTMKYE
SRS014271_WUGC_scaffold_8782 | SRS014271_WUGC_scaffold_8782__gene_10049 | F046433 | LSQAQDASRDNLMVYVKADNYLGTETSDPPFMESRYKTTEYEAINDFVQFIEMTKHYLPDYMEYCAKELIDELAFLGMPELNFAANALAKRLRHHLEVDNKPVYIDVGNSLSQCRVKNEMKSSQYILSLVLSKFLDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIINGSQVEERIAGFEVDNDPEDHEASVLVMAASSNYIDNGIGVDSLWGEATYPVEAYYRLKNDHNDWGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEKIDELSLPALANIVRPYRNGED




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice