NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300007996

3300007996: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765620695 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300007996
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052939 | Ga0111052
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765620695 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 151884047
Sequencing Scaffolds: 23
Novel Protein Genes: 25
Associated Families: 22

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 2
Not Available | 4
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 1
All Organisms → Viruses | 1
All Organisms → cellular organisms → Bacteria | 3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → Capnocytophaga | 1
All Organisms → Viruses → Predicted Viral | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Apibacter → Apibacter adventoris | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F032313 | Metagenome | 180 | N
F033081 | Metagenome | 178 | Y
F043990 | Metagenome | 155 | N
F045567 | Metagenome | 152 | N
F051212 | Metagenome | 144 | N
F053092 | Metagenome | 141 | N
F054109 | Metagenome | 140 | N
F054111 | Metagenome | 140 | N
F057446 | Metagenome | 136 | N
F061925 | Metagenome | 131 | N
F067846 | Metagenome | 125 | Y
F072446 | Metagenome | 121 | N
F073671 | Metagenome | 120 | N
F076191 | Metagenome | 118 | N
F078842 | Metagenome | 116 | N
F085820 | Metagenome | 111 | N
F089055 | Metagenome | 109 | Y
F089056 | Metagenome | 109 | N
F092230 | Metagenome | 107 | N
F094007 | Metagenome | 106 | N
F097490 | Metagenome | 104 | N
F099454 | Metagenome | 103 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0111052_100193 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 58350
Ga0111052_100624 | Not Available | 27762
Ga0111052_101123 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 18258
Ga0111052_101155 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 17836
Ga0111052_101883 | All Organisms → Viruses | 12031
Ga0111052_103632 | All Organisms → cellular organisms → Bacteria | 6933
Ga0111052_104151 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 6142
Ga0111052_105542 | Not Available | 4749
Ga0111052_107747 | All Organisms → cellular organisms → Bacteria | 3418
Ga0111052_110430 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 2536
Ga0111052_112179 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae | 2142
Ga0111052_114213 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → Capnocytophaga | 1818
Ga0111052_116716 | All Organisms → Viruses → Predicted Viral | 1533
Ga0111052_117704 | Not Available | 1441
Ga0111052_118379 | Not Available | 1377
Ga0111052_119200 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 1314
Ga0111052_123485 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Apibacter → Apibacter adventoris | 1053
Ga0111052_123611 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 1047
Ga0111052_124222 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1020
Ga0111052_130238 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 803
Ga0111052_132049 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae | 754
Ga0111052_142573 | All Organisms → cellular organisms → Bacteria | 556
Ga0111052_143708 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 534

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0111052_100193 | Ga0111052_10019338 | F089055 | LKVEKMNSTPERVTETPEIKIAREKLAVIFSDAKRYDGNSGVKPELGKAAIDGKDIKNIIKVNSADNEAVDLCNQALGSYGKSLDRINGNPLKIVREIGDLLQSFREDKTKGSCK*
Ga0111052_100624 | Ga0111052_10062431 | F076191 | MKIIAENPAEEALLWRIKALSDELVNQDNRSTSMPVWTILDNNKAGKDYGAVMYFTGKAAEQHINENDHHYKKPMICVRSAHDNRELKDVIHLLILAGGNEIPNNHYGFLRDAGY*
Ga0111052_101123 | Ga0111052_1011231 | F072446 | MKKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFPFIVEQHYHSIPGIVPERTTYRVPVIPRSVEDKTKKEYNDMELGKEAHLVFRATVHGDTINRHKKELKALSLQLNRLTLTSIGTSPVLCGVKSIEAVGIAENGNTYDLSAEMKLRIRDYSDKWKYRSSGIVTLNCENTESKTAKYVVPLGRIREEELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGKVLSFQHKLPSKSMLQELPSKSVQQYYTPNGYEREATYFTALWPVHDKYNEREL*
Ga0111052_101123 | Ga0111052_1011239 | F032313 | MACDNNTPQEKPHEQEKHEVPVPVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMLTSKISKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPRIMGWITTTEFTCRNGVIVLHRQGIDVNHVDTVNYVYDEVGNEIVLEGTGIRWFVLRLNKNAVEFLQRGRTMWGPYDWYYGRNSGRSEVTLEAK*
Ga0111052_101155 | Ga0111052_10115516 | F033081 | MAWLFRRAMPQDTRPTFVWSRLVTEIENAGYFSRRKSSILAVGLIIMTIAMIKMLLFVPGLNQSVVSLLTRGLETFLPTGWATATAWIVGATGVFLMGNFTSYTKKQKSLQSLKATGCGMYNTLLLFALLEEQAFRSGSERWNWRERVRASVCFGLLHVTNIWYSFAAGIALSVTGFGFLMVYLWYYRKYRSQIIATAAAATVHALYNVIALSLIAVVLAIDIAKLL*
Ga0111052_101883 | Ga0111052_10188310 | F073671 | MNKEQAEHELAELHEKERSLEKALELVREKIRELVNYTDKNKVYK*
Ga0111052_101883 | Ga0111052_10188316 | F067846 | MSIIAEWERKTFNDYDKQCSKEDDYNRAVEMEIEAIKEDIANGDDDAICAFSEKMLDYDFLKAVILGTDYEEMRIKILTAMAEDRLEQLEKDYRNGYILND*
Ga0111052_103632 | Ga0111052_1036322 | F054109 | MVGRPKSKKGAKVHTAFKIYPADKERAQAMAEKLDMSLSSYINKAVLEKVARDEKSEN*
Ga0111052_104151 | Ga0111052_1041513 | F053092 | MQSDQGLILCLTHALLVLGALILEPAEMENTMDDHTVQLFGILIAKELGIATHRIKADEHVPRDHIPLTLVEGDDIGIVVMIEEVLIGLQDALITTELIAELADTAVIAGSDLTDPVAKDTLSEARLLDVFVSIVSYKLRFFRHK*
Ga0111052_105542 | Ga0111052_10554210 | F054109 | MVGRPKSKKGAKVHTAFKIYPADKERAQAMAEKLDMSLSSYINKAVLEKV
Ga0111052_107747 | Ga0111052_1077475 | F078842 | MIISSIYKIADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFMDAELYSPEVKKTYDEALRKFDKLIIPEDDILRAASECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKTHDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPIDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEVSISVGYRMFLGLLKKDDKIINQLSCDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSTLNAILGGCIIGGKGWLEMADSARIRQMINSIELDVYEVNL*
Ga0111052_110430 | Ga0111052_1104304 | F099454 | MKASKLLWAVVMALTFVLTSCDRLTDEPTLEDRGYYKYFDSTAQHKSFRVVTASGKPYNHKIDWHIIGIRDSKSDTYLTKKVDTLSNGDLKISYDWISFTVREKKSVIDVEVQKNETGEDRSVKFVAQDNHKGLASPSMKV
Ga0111052_112179 | Ga0111052_1121792 | F045567 | MRQRDGRDTLTEELEGDITPLLYRAEGEARRPWVGMVTEDVVHTSTHRVEDALLPIDGDILTPRDGTHIVQTERVVVVLVSQEDSIDTIDTETCGLVVEVRATVDEDTLPTLGDDEGRGAQTTVTSIRAMAHRAATAYLGDTSAGARTEKNYLHVSRRESYHGKEIPSLSSDRGYVVERVVR*
Ga0111052_114213 | Ga0111052_1142132 | F054111 | MNKSLESITHEEFLKLMEHLKNLQEFTFLEYIIAPEADIFYFNFMEKTVKIKWDLDYGLFLETEELSTADRDLFLNILDKEILFLI*
Ga0111052_116716 | Ga0111052_1167162 | F054109 | MVGRPKSKKGAKVHTAFKIYPADKERAQVMAEKFDMSLSSYINKAVLEKVARDEKSENQTETN*
Ga0111052_117704 | Ga0111052_1177042 | F085820 | MRSTFYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVVAAREEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSKKGKK*
Ga0111052_118379 | Ga0111052_1183792 | F032313 | MACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLIWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPRFVGGSITKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKYAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK*
Ga0111052_119200 | Ga0111052_1192001 | F097490 | MKIDENKKEVLVEIRNNTENNYYLLSPIVSIMTKHLQYIDGEMIEGQIHHKKLDSIVCSVCIWDDICKEEYYAMRDIVLLPKKSVKKIKYKYDNEEYIEIETVHIGFPYNGYYNEIGKKMQFMLKKKLDSSNIIKGYEFYNKDIETMTIKM*
Ga0111052_123485 | Ga0111052_1234851 | F089056 | PLIIKKMNVFCKIILPLLCIISCSKRKEADNTMVLEKNHTFFLWNNDSLGCKHERTIEMGEELYNTFKKSNKNDSILLKEYLGTPTRRFKDKEVIIFMYYINSCCDNGQLLEECDVSFISITFTNKNKILFGKGIQ*
Ga0111052_123611 | Ga0111052_1236111 | F051212 | MKKFFFIFVLYWLYSCNGTEKAMATSPDTQKTSISEKQNAEKIERIIYSETGGDTDGKNVHLVITKDSIIYRLTEGVTDERTIADLSLNNNNKDWEAFIDKIDLEDFEKGKPSEELIMDLPTIKIIIKTDKKEYSKTDIQANKTWDYITKQIIDIKYSQLYNHLNLEK*
Ga0111052_124222 | Ga0111052_1242221 | F092230 | MRQAHKRTVDKLKAYLLKVFFPLFIVCIILVAFFRQIGCGSDGDYAFKISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEVQSTEEKEARDKVDDAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIAMTIGYYSYLKPETVYEFYCKLRRKEKYPSDVNLVKRIGFLVIILPPC
Ga0111052_130238 | Ga0111052_1302381 | F094007 | MKSKTVEVLALARPNRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYSDKDFTKRNRIIVEMCDLFGRIRRRAGFAECHRGRGDYDRARSIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRSGWNKGCIDNNA*
Ga0111052_132049 | Ga0111052_1320492 | F057446 | MNSISFGKTTITSYPEYFEITDNKKTSKLLYLSASLVFIAIYLFDLYQNDFDFGKVAHFKTISAVLWLVIFALQFWLINTESKIEKSKIKEVVVRKNRWTSIVIHYGDKKRKIDGFSQDEAEQIIKFLMNNR*
Ga0111052_142573 | Ga0111052_1425731 | F061925 | MSWEYSINLDSEESVSSVVADLKICKLFSSSTTDYIDWKNSEPIDDIPYDARFYTDKKKTIYIAINSFSKNIFSALKSILAKHDYHITDCDTDEEVTLEHIFRSVI*
Ga0111052_143708 | Ga0111052_1437081 | F043990 | MIKKLGIIFTFGVIILGIVVYANHKIERSAIEREFGVNMSNMNIDEKYRKEEWAPNGDGEKTIILTYDQLDSSFMKLNKLPIKEDLPPNGIPKQFLNIANGYYKYVVDENDDRSFDILIVDTTRKEICIYYQIL*
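
The rows above can be converted to FASTA records for downstream tools. The following is a minimal sketch, not part of NMPFamsDB: the function names `gene_index` and `to_fasta` are hypothetical, and two points are assumptions inferred from the table rather than documented behaviour: that each Protein ID is the Scaffold ID followed by a gene number, and that a trailing `*` marks a stop codon, so a sequence without it may be truncated.

```python
# Two rows copied from the Sequences table:
# (scaffold ID, protein ID, family, sequence).
ROWS = [
    ("Ga0111052_101883", "Ga0111052_10188310", "F073671",
     "MNKEQAEHELAELHEKERSLEKALELVREKIRELVNYTDKNKVYK*"),
    ("Ga0111052_105542", "Ga0111052_10554210", "F054109",
     "MVGRPKSKKGAKVHTAFKIYPADKERAQAMAEKLDMSLSSYINKAVLEKV"),
]


def gene_index(scaffold_id: str, protein_id: str) -> int:
    """Return the gene number appended to the scaffold ID.

    Assumption: protein ID = scaffold ID + gene number (inferred from the table).
    """
    if not protein_id.startswith(scaffold_id):
        raise ValueError(f"{protein_id} does not extend {scaffold_id}")
    return int(protein_id[len(scaffold_id):])


def to_fasta(rows) -> str:
    """Format rows as FASTA text, dropping the trailing '*' if present."""
    lines = []
    for scaffold_id, protein_id, family, seq in rows:
        # Assumption: a trailing '*' denotes a stop codon.
        has_stop = "yes" if seq.endswith("*") else "no"
        gene = gene_index(scaffold_id, protein_id)
        lines.append(
            f">{protein_id} family={family} scaffold={scaffold_id} "
            f"gene={gene} stop={has_stop}"
        )
        lines.append(seq.rstrip("*"))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(to_fasta(ROWS), end="")
```

The `stop=` field in each header makes it easy to filter out possibly partial proteins before alignment or clustering.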




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice