NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000367

7000000367: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 638754422



Overview

Basic Information
IMG/M Taxon OID7000000367 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052911 | Ga0031343
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 638754422
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size97089439
Sequencing Scaffolds21
Novel Protein Genes21
Associated Families20

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium1
Not Available7
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1
All Organisms → Viruses → Predicted Viral4
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria2
All Organisms → cellular organisms → Bacteria → Proteobacteria1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4161
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F018385Metagenome235Y
F032313Metagenome180N
F036281Metagenome170N
F043990Metagenome155N
F046432Metagenome151Y
F051211Metagenome144N
F054109Metagenome140N
F054110Metagenome140N
F071327Metagenome122N
F071328Metagenome122N
F076191Metagenome118N
F081455Metagenome114N
F081510Metagenome114N
F085820Metagenome111N
F092229Metagenome107N
F095629Metagenome105N
F099453Metagenome103N
F103433Metagenome101N
F105376Metagenome100N
F105378Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C3231278All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes783Open in IMG/M
C3232338All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium794Open in IMG/M
C3243474Not Available935Open in IMG/M
C3252174Not Available1105Open in IMG/M
C3253650All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1142Open in IMG/M
C3258171Not Available1282Open in IMG/M
C3261675All Organisms → Viruses → Predicted Viral1422Open in IMG/M
C3266173All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1690Open in IMG/M
SRS048791_LANL_scaffold_12255Not Available895Open in IMG/M
SRS048791_LANL_scaffold_13752All Organisms → cellular organisms → Bacteria → Proteobacteria7362Open in IMG/M
SRS048791_LANL_scaffold_18029All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria3997Open in IMG/M
SRS048791_LANL_scaffold_21409Not Available18357Open in IMG/M
SRS048791_LANL_scaffold_22098Not Available829Open in IMG/M
SRS048791_LANL_scaffold_26346All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4161507Open in IMG/M
SRS048791_LANL_scaffold_37229All Organisms → Viruses → Predicted Viral3524Open in IMG/M
SRS048791_LANL_scaffold_62086All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales1880Open in IMG/M
SRS048791_LANL_scaffold_64669All Organisms → Viruses → Predicted Viral1898Open in IMG/M
SRS048791_LANL_scaffold_65050Not Available880Open in IMG/M
SRS048791_LANL_scaffold_65887All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes56482Open in IMG/M
SRS048791_LANL_scaffold_65957All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales113632Open in IMG/M
SRS048791_LANL_scaffold_9214All Organisms → Viruses → Predicted Viral1500Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C3231278C3231278__gene_154472F036281MITLIKVDEGPVDIYELRMQYLAKLKETDGVMLPTFIYRNKDLFVTEFKPTCDDQWIMYMTNAEGLITKMRIKNGDLMSNGSVLFLAEERKIYNAKEYYDYWAAREGKPAPFFYESRQYYVKSFMRVPGSTDLWITAERETGHWYTFRMSDNQKSKFTRHTM
C3232338C3232338__gene_154879F071327MEDLFNSVYSTHKGISFSTVVVFGAFIFLILQVHLSYKGRISDVLRKTSFFSMLLLYIQGILGVFLGIYSPEFSEASGFSSYFKLFEYGIIILTCAGMITYVYMFLKSNQIL
C3243474C3243474__gene_159618F085820MRSTFYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVVAAREEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSDKGKK
C3252174C3252174__gene_163581F105376MNEKSEVSAKEFGALQAKVEYIKDGVDKHTVMLERIENIAQANVTQAQLAKHEKESEEKYVKRTEIEGVMNFWSLVTSNLAKLFAIALVGLAIYATNNLIQQNKTVTELQEEVQQSQVKR
C3253650C3253650__gene_164305F071328MKWEFILEGEYIVEFVKLCKHLVLERTIDPNKHQAAAIYLRYSSQLLLKKRRAIRRLGIEKKYVSAILRQYGIHYIEYGDNEHRVFFLDRGINLYFSKHDQSSIYIIQRIEFSNEEYRSFILKVLPVKKSEW
C3258171C3258171__gene_166407F054109MTGRPKSKKGMKVHTAFKIYPADKARAQAMADKLDLTLSAYVNKAVLEKVARDEKSED
C3261675C3261675__gene_168064F018385MMTERATRWDPYKELSIEKDRDPVLDDEIIYGNNVKHFTLTVYSHDGRVSKYWNARVLPDQVGNCRIACPRDGKILCFNWFNWTAYMFTHDGMNELVLMPDSKRRIVSQLSFDNSGGKEV
C3266173C3266173__gene_170431F046432VISKFQMPEGRYSIEQEGSFGRGEFFWIIKNQSTNQKYLLMNTYSHHGVESELECYREEGFDNLEAIPRKIETLEIPSDAEDEISKYLFGFYSIFEIKS
SRS048791_LANL_scaffold_12255SRS048791_LANL_scaffold_12255__gene_13355F081455METTVNKNIMVNEDTTAFDDFITYALTRKLPEGYDIPPVEEIGMQNVEIIDSAEEAIQQPLANTDSSIAVNFSQMINKPEEVKTELVSTPDNGEAKVNVVFPKTEHILGNYVDYDSFNKIKESNTDKVVRAVRLLNYKMADQNAAMKFGQFVSEFNPNGDPNKRLRYELIRHQGREKDLVVRLSTVINGTTKYYADIYPDLNKIDIDHHLISSARK
SRS048791_LANL_scaffold_13752SRS048791_LANL_scaffold_13752__gene_15068F103433MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFVVMLAFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINIWKQAVPVFAATVVAFFGAYWVNFMSLTDYYGSSDKAANAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQLVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE
SRS048791_LANL_scaffold_18029SRS048791_LANL_scaffold_18029__gene_19723F051211MRVKKAIKVFEKIRDLPYGTSGSDEVWSCYQKCVLLKQELQHIGITSQLLIGVFDWQDLQIPEHILKTRRQQYERHVILRVFIDEFAYDVDPSIDIGLTPMLPMTCWDGKSSTTTMAPLRYLRVYRPHSLHERILSQLRRKIFRSNPESFYTAIDIWLATIRVS
SRS048791_LANL_scaffold_21409SRS048791_LANL_scaffold_21409__gene_23515F076191MKIIAENPAEEALLWRIKALSDELVNQDNRSTSMPVWTILDNNKAGKDYGAVMYFTGKAAEQHINENDHHYKKPMICVRSAHDNRELKDIVHLLILAGGNEIPSNHYGVLRDV
SRS048791_LANL_scaffold_22098SRS048791_LANL_scaffold_22098__gene_24369F032313MACDNDTPQEKPREQEKHEVPVPKSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVIVLHRQGIDVNHVDTVNYVYDEVGNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEEK
SRS048791_LANL_scaffold_26346SRS048791_LANL_scaffold_26346__gene_29279F099453MLRRKDMNRFDIIELAQETLIFVYNTFNGKVNTLDPYTRLNFVAGYLDTKTNIARTTPYGCIYISLEAFADTVEAHKFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYRDEIELQCVKQSCQWILDNIQYIRSFGLVVIPEVYQARLANLTNIIYTPKYPMAIAMGKLEYMLGRKFREFSNNNIEIEYVDRLKTHYTFMVCENRIYINSANLNDLGERLLNDKQYTVEYLEYGDSKLVIKITQGA
SRS048791_LANL_scaffold_37229SRS048791_LANL_scaffold_37229__gene_42213F092229MNKEYRFDHIPEVVLRNVKFIRENNIDIGTGDDVLECMMDINPVLRQRIYDDYDLAKDVAERRFNSTIEDLDLTTILQKCTTRPYIAILNNIYFRYFNSKLIDDMFKLGESIKVLDLAIEYECEYYTVNSAKTNIRRYMQQAYYDKYAADADIISSHRVLSDPQVNAVKSAEFTYDLLVAARSENFNPEMVRDIFLKYGLKTNSSRNLYNRMDNNLSLFYYLEDYLEEYVNTGKFTYGSQEYSTIKEFKYLPLMNVLTQLTRSNPSGYILNHKLELVKG
SRS048791_LANL_scaffold_62086SRS048791_LANL_scaffold_62086__gene_75143F043990MIKKLGIIFTFGVIILGIVVYADHKIERSWIEGEFGVNMSNMNIDEKYRKEEWAPNGDGEKTIILTYDQLDNSFTKLNKLPIKEDLPPNGIPKQFLNIANGYYKYIGDENDDRDFGILIVDTTRKEICIYYQIL
SRS048791_LANL_scaffold_64669SRS048791_LANL_scaffold_64669__gene_80523F054110VVLDVNYQPTIKKLLKALQMNGRRYVVDVRQSWSKFDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGD
SRS048791_LANL_scaffold_65050SRS048791_LANL_scaffold_65050__gene_81437F032313MYRFLILIFALMLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLIWVLEMRTSSIPKQRFDSLFKQTVWEIKDLCAVETDLSLAKKIPKFVGGSITKEFTCRNGVILRHMQGIDINLVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKNA
SRS048791_LANL_scaffold_65887SRS048791_LANL_scaffold_65887__gene_84480F081510MLSKTKGETKMKLPNMKAIKSAAKHSYTVSKILAKKYAPVALVTTGLVGYGVAVYKGIQSGKKLEATKAKYEAKDEAGEEYTRMDVVKDVAKDVAVPVAIAVASTAAIGLGFAIQTNRLKAVSAALTMVTEEHARYRLRAKEVLDEETFKKIDAPIETKKVEIDGKEVEVESIVPKEGDFYGRWFKYSRHYASDDPDYNEAWVKEVDNMLTQKINTQSGGGMLTFAEVLDALGFEVPKAALPFGWTDTDGFYLEWDTHEVWNEDKQEHEPQIYVRWQTPRNLYSTTNFRDIIPGRKQLV
SRS048791_LANL_scaffold_65957SRS048791_LANL_scaffold_65957__gene_85280F105378MNVKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSTHQFVTDEEKNKWNNKLNVPVPMQDHLANNQIGYDSVNSKFYIGLNNQNVLLGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTTDAISIYNTGSFTGSFQCLIVYPLGSVNE
SRS048791_LANL_scaffold_9214SRS048791_LANL_scaffold_9214__gene_9911F095629MRELIICACLLGCFGVANAAAPVEQPKEVKVVHNDDSVALHKKVYKLEQRIERLEKLLAEKEGK

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.