NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000144

7000000144: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 246515023



Overview

Basic Information
IMG/M Taxon OID7000000144 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052640 | Ga0031345
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 246515023
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size116721961
Sequencing Scaffolds22
Novel Protein Genes26
Associated Families24

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria3
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae1
All Organisms → Viruses → Predicted Viral3
All Organisms → cellular organisms → Bacteria → Proteobacteria1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4165
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus influenzae1
Not Available6

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F018385Metagenome235Y
F027205Metagenome195N
F033081Metagenome178Y
F051210Metagenome / Metatranscriptome144Y
F054109Metagenome140N
F067846Metagenome125Y
F071326Metagenome / Metatranscriptome122Y
F072446Metagenome121N
F073671Metagenome120N
F080166Metagenome115N
F081455Metagenome114N
F089057Metagenome109N
F092229Metagenome107N
F092232Metagenome107N
F094007Metagenome106N
F095629Metagenome105N
F095631Metagenome105N
F099452Metagenome103N
F099453Metagenome103N
F103430Metagenome101N
F103433Metagenome101N
F103435Metagenome101N
F105376Metagenome100N
F105379Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C4806501All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria576Open in IMG/M
C4808527All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria585Open in IMG/M
C4852253All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes957Open in IMG/M
C4869098All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae1454Open in IMG/M
C4882328All Organisms → Viruses → Predicted Viral4131Open in IMG/M
C4884214All Organisms → cellular organisms → Bacteria → Proteobacteria14625Open in IMG/M
SRS045715_LANL_scaffold_10343All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41613368Open in IMG/M
SRS045715_LANL_scaffold_108822All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41617628Open in IMG/M
SRS045715_LANL_scaffold_110017All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41658693Open in IMG/M
SRS045715_LANL_scaffold_110496All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus influenzae4607Open in IMG/M
SRS045715_LANL_scaffold_18620Not Available324468Open in IMG/M
SRS045715_LANL_scaffold_23049Not Available1817Open in IMG/M
SRS045715_LANL_scaffold_33195Not Available2366Open in IMG/M
SRS045715_LANL_scaffold_37884All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria513Open in IMG/M
SRS045715_LANL_scaffold_63962All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41618723Open in IMG/M
SRS045715_LANL_scaffold_64296All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes996Open in IMG/M
SRS045715_LANL_scaffold_73382All Organisms → Viruses → Predicted Viral3406Open in IMG/M
SRS045715_LANL_scaffold_79643All Organisms → Viruses → Predicted Viral1868Open in IMG/M
SRS045715_LANL_scaffold_79697All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41618866Open in IMG/M
SRS045715_LANL_scaffold_87391Not Available21983Open in IMG/M
SRS045715_LANL_scaffold_94003Not Available985Open in IMG/M
SRS045715_LANL_scaffold_9877Not Available18155Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C4806501C4806501__gene_208905F033081GVMAWLLRRAMPRDTRPIFVWPKLVVAIEGYFHRRWLVAIAVLLIAVTIVIAKALLLVPGLDNSVVGLLTSIFETFLPARWATGAAWVAGMTGVFLIGDFTDYTPSQKSLHSLRATKWGVYNALLLFALWEEQAFRSGSEKWSWRERVRASVCFGAIHVVNIWYSFAAGIALSVTGFGFLLVYLWYYRKYR
C4808527C4808527__gene_209313F033081GVMAWLLRRVMPQDPRPVFVWPRLVAAIGNVGYFSRRGFSVLAVGLIIVTIATIKILLFVPGLNQSVVSLLTRGLETFLPTGWATIAAWVVGTTGVFLIGSFTSSYTPSQRLLYSLEATGCGVYDALLLLALIEEQAFRSGSEKWSWHGRARASVCFGLIHVTNIWYSFAAGIALSATGFGFLLVYLWYYRKYR
C4852253C4852253__gene_218222F027205MRAVKESEEFERKALAEARKRDRAEGKEPREVLHPDHKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFIVDVSQVRNRELADEIEKDLFAFMDYILDEYDIPRRIKRSAK
C4869098C4869098__gene_221338F073671MTNNKQAEHELAELHEKERSLEKALELVREKIRELVNYTDKNKEQK
C4882328C4882328__gene_223472F092232MNTQAKFIAEYNDKNRPKFNDKFFNKSDDDIIEDLKDVILSCERNKFYTIKVLNFEVIDDYNEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTASSAKTQSITLKTNSNAVKMLRNFIDLNTTNEETVRAAMFSVYLFDHKVTLFEYYLARFGWYETLDKFNFEDVIKISDHDLNDPEYYTFAIANAHMKTPFYISAVKSFMDNDRILQSFVASFARAISLYATKKTTLDQIYTTEFWICKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHVKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTQPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDENFSKMLNIYREEKGYTSAIMLADDAGLELTDTRDPAEVAFDADLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV
C4884214C4884214__gene_223830F103433MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWSGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAVDSKEMMSQDCNPSQVEPYVSQYGLQARVIAGLSPNDTSKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFVVMLSFSPWTAGYARNIYWIEPLLIAPFVISFVGYQYFKKSKKLWLFYIIESIAIFLKLLNGYEYVSTIAISVLVPIVFFELVHKNVKIINLWKQAVPVFTATVVAFFGAYWVNFVSLTDYYGSSDKAASAINAKASYRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFVQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGYALPHAHINGIIFYIPLLLFVYVLIGLWADCVVKRPVKYE
SRS045715_LANL_scaffold_10343SRS045715_LANL_scaffold_10343__gene_12882F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYMGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDIKEYDQPTRKLKPVYNGLNTDGINNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGSNGNISYKDSIYFRVKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARKNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIGLVHRDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWTEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMNMKIFVFAKDVFGYNAGLHKADQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITFITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA
SRS045715_LANL_scaffold_108822SRS045715_LANL_scaffold_108822__gene_159597F092229MTNEKYRFKHIPEVVLKNIRFIKDNNIDVGNGKDMLECMMNVNSVVRTKIYEDYEFAKDVAERRFGSTIEDLDMVTILQKCTTRPYNSILNNIYFRYFNSKLIDDLFKLAESPKILDLAIEYKCDYYAVNTAKTAVRRYIDDIYYNKFAIDSTIITSSRSLNDPQVNAVKSAEFTYELLMASRAEEFTPEIVRNIFIKYGLKPNPSRNVYNRINNNLNLFYYIEDYLEEYREEGKFIYGTKEYKISKDLRRLPLMLILTQLTRKNNSGYILNSNLELVKG
SRS045715_LANL_scaffold_110017SRS045715_LANL_scaffold_110017__gene_162077F099452MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELLLSSMNIFNLYKEIVNLDDVSLLNDLRKTPWYKDWFIDDKRNSDLIDLSKFNFRSLERFEKESYLKDVEHYDFKKVIEVDSYSLYDALAEENGVDLFKLAAENILINHGFFNNTDYNLYDIPDKYMEDIEVSLYMCLLNSGNMDFMDNKTFKSTELFYIVKNNICGTIFFTLFDRMNEDTRTRAR
SRS045715_LANL_scaffold_110496SRS045715_LANL_scaffold_110496__gene_162987F067846MESLQAQWERKTFNDWDKQCGKEDDHNRAIEMEIEAIKEDIANGDDDAICLFSEKMFDDDEFLKAVALGTDYEEMRIKILTAMAEDRLKQLEEDYRKGFILND
SRS045715_LANL_scaffold_18620SRS045715_LANL_scaffold_18620__gene_24239F051210MNNYSQIMGNSAMMDALRASSVSAEDARLRGNEYAKMFSRNEEMMNVFGLGGNNANLLQKTFSGYSETPLLSTQYFNASVASYVSSFAGYMSIERDFDQPNGLFYWFDVLGVTDLRQVLPNLGPDQYQDVQVMGAFELPVTINTGTAAYSPLVGRKLIPGTVRVKVEDGTGKKFELIDNGQGSFMAVAGVLKTGTVNYLNGKIDFELTTAISNPAGKITIVGKEDTTGTPSCTNGASNAHANDKRFIAKMQQVALNTVPDMLVAEYNIAALGAMKKATGSDMATFLFTKLRELYTKTINYRLISTLEKGYTGDVMNDLDLSNASTSLASKFQDYRSRVDLFDAYLINVETSLATRAVKGVTTTAYVAGNQAANQFQKGGVIGKFERNTKMTYISDLLGWYDGVPVLRSTDIHEEQGEGTFYAIHKTQDGQMAPLARGIYMPLTDTPTIGNYNNPTQMASGIYYQEGVRYLAPELVQKVSFKFGF
SRS045715_LANL_scaffold_18620SRS045715_LANL_scaffold_18620__gene_24257F071326MAENMASRSVERSNKFYKATLKTIKAQLAMLGTKFIVLRPKENSKWKNVFGGSYSSDSTLENDYDEFTTTLIINQNEMKDVWNRNRDSVEAITNDGSLEVGDELQYTRDKRTYRFKITLKQGYSETGDTLFSYTLMSIIETLDM
SRS045715_LANL_scaffold_23049SRS045715_LANL_scaffold_23049__gene_30439F105376MNEKPEVSAKEFGALWANVEHIKESVDRHTTTLERIENIARANVTQAQLAQHEKESEEKYVKRTEIEGVMNFWSLVTSNLAKLFAIALVGLAIYATNNLIQQNKTVTELQEEVQQTQVRR
SRS045715_LANL_scaffold_33195SRS045715_LANL_scaffold_33195__gene_44570F103430MEQITIKAFIGSNNKTKKLEVDKIISTVNTNHEAFTLQYPVIGCWKGETEQTAILYLSDERSKVMDTLGELKEVLDQEAIAYQIENKINLI
SRS045715_LANL_scaffold_37884SRS045715_LANL_scaffold_37884__gene_51262F094007YLWILWYNSYYMKSKTVEVLELARPNRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYSDEDFTKRNRIIVEMCDLFGRIRRRAGFAERHRGRGDYDRARRIERNRGSDISEVGRLAMNACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRPGWNKGCI
SRS045715_LANL_scaffold_63962SRS045715_LANL_scaffold_63962__gene_89400F089057SIPRETKYSLDDAFDIIVSNDEYSNIFRATLVFLNIDLDYHRLLLNAIKSESYTIICMINKAIPTPDLFLAKNNYECLTIALDKSYAIFDKVLGMVISQIRHTASSKEGKALGIFMTLCILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAMDAFQYMSEEDIHTVVDDINSRSVLSRYLNNM
SRS045715_LANL_scaffold_63962SRS045715_LANL_scaffold_63962__gene_89407F103435MKFVFTTEPIYQYYRAYLYPSDKDKLDKDLMVEYGDYKDYWDLKNQQDALPENIFVAELTSRDYPRNPWNYVSQLISKLTYSYLIDNPEFENVFSEILFNQSEEEFYEFYKAIDRFYNGSEIFIIVSNDEYSDMVTQMMCNVIRRNYGIHPQIIYNIDDVLSIRDDIDFSPQGAQLAYLQRSAYYKLEAKRSLEPLQIWYPFDMNTYTNALE
SRS045715_LANL_scaffold_64296SRS045715_LANL_scaffold_64296__gene_89968F018385WSLSRISGTETTMADLINSWLPYQELSIEKDRDPVTDDEIIYGNHVKHFTLTVCSPEGRVSKYWNARILKDQVGYCRVACPREKKILCFNWVNWTAYMFSHDGLNELVFMPDSRRRTVSQLSFDHVPMKEVK
SRS045715_LANL_scaffold_73382SRS045715_LANL_scaffold_73382__gene_103562F095629MTFKERMMRELIICLCLLGCFSVANANNVEQPKEVKIVHNDDSVALHKKIYQLEKRIERLEELLKKEDK
SRS045715_LANL_scaffold_79643SRS045715_LANL_scaffold_79643__gene_113053F105379MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYINPIIGVGPEKSYFQTTSYMVDLSPHINNLLVNISDLKNLGKITQLEPSIENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINIGGYMIDIPKSAIPTLKSDHVVATVYNAPNKNFNILRFKITKRNGIIVNQSMLFLPY
SRS045715_LANL_scaffold_79697SRS045715_LANL_scaffold_79697__gene_113151F081455METIAKTKNIGFINNLINTCDGYIKINHKEKLRERFPRNTIIEEKDIPPVEEIGAEKVDIIDVAEEAIQQPLQNKDSSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAIRLLNYKMSDQNAAAAFAQFVSEFNPECDPNKRLRYELIRHQGREKYLVIRLSTVVNGTTKYYADIYPDLNKIDLDHHLISSAKK
SRS045715_LANL_scaffold_79697SRS045715_LANL_scaffold_79697__gene_113152F080166VNILANFENYNKVVEQIFELNYYLTFKLEVTFNTIHKKINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLCTIIEESINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAITPLLGTNRQNMLR
SRS045715_LANL_scaffold_79697SRS045715_LANL_scaffold_79697__gene_113154F099453MLRRKDMNRFDVIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCVKQSCQWILDNIQYIRSLGLVVIPEVYQARLANLTDIIYTPKYPMAIAMAKLEYMLGKKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLG
SRS045715_LANL_scaffold_87391SRS045715_LANL_scaffold_87391__gene_124801F105376MNKNKKGGDMEPDVSAKEFGALQAKVEYIKDGVDKHTATLERIENIARANVAQAQLKTYITEHEQESEKKYVKRSEIEGVMNFWSLVTSNLAKLFAVALVGLAIYATNNLIQQNKAITELQEEVQTQVRRK
SRS045715_LANL_scaffold_94003SRS045715_LANL_scaffold_94003__gene_134994F072446MKKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGIVPEGTTYRVPVIPRSVEDRTEKEYNDMELGKEAHLVFRATVHGDTINRHKKELKALSLQLNRLTETSIGTSPVLCGVKSIEAVGIAENGNTYDLRGEMKLRIRDYSYRLKYPSGIITLDCENTESLTAKYVVPLGRIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGKVLSFQHKLPSKSVLQELPSKSVQQYYTPNGYEREATYFT
SRS045715_LANL_scaffold_9877SRS045715_LANL_scaffold_9877__gene_12227F054109MAGRPKSKKGSKVHTAFKIYPDDKARVQAMADKLGISLSLYINKAVLEKVEHDEKSEN

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.