NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008147

3300008147: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 763961826 replicate 1 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300008147
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0053171 | Ga0114367
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 763961826 replicate 1 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 130104672
Sequencing Scaffolds: 22
Novel Protein Genes: 26
Associated Families: 25

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 3
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 2
All Organisms → Viruses → Predicted Viral | 1
Not Available | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Apibacter → Apibacter adventoris | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F030786 | Metagenome | 184 | N
F033081 | Metagenome | 178 | Y
F040149 | Metagenome | 162 | N
F040685 | Metagenome | 161 | N
F043235 | Metagenome | 156 | N
F047127 | Metagenome | 150 | N
F047508 | Metagenome | 149 | N
F055792 | Metagenome | 138 | N
F061925 | Metagenome | 131 | N
F061926 | Metagenome | 131 | N
F063777 | Metagenome | 129 | N
F067846 | Metagenome | 125 | Y
F071327 | Metagenome | 122 | N
F072446 | Metagenome | 121 | N
F073671 | Metagenome | 120 | N
F078842 | Metagenome | 116 | N
F080164 | Metagenome | 115 | N
F085820 | Metagenome | 111 | N
F089056 | Metagenome | 109 | N
F095633 | Metagenome | 105 | N
F098763 | Metagenome | 103 | N
F103436 | Metagenome | 101 | Y
F105376 | Metagenome | 100 | N
F105378 | Metagenome | 100 | N
F105380 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0114367_100002 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 342074
Ga0114367_100021 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 144879
Ga0114367_100031 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 120973
Ga0114367_100047 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 104398
Ga0114367_100269 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 36810
Ga0114367_100333 | All Organisms → cellular organisms → Bacteria | 32687
Ga0114367_101939 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus | 10060
Ga0114367_105596 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 4034
Ga0114367_106562 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 3470
Ga0114367_107288 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 3119
Ga0114367_107967 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 2856
Ga0114367_108130 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 2799
Ga0114367_111059 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2026
Ga0114367_113561 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 1630
Ga0114367_113725 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 1611
Ga0114367_114889 | All Organisms → Viruses → Predicted Viral | 1474
Ga0114367_115963 | Not Available | 1371
Ga0114367_116644 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae | 1312
Ga0114367_116986 | Not Available | 1287
Ga0114367_127062 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 803
Ga0114367_130210 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 718
Ga0114367_135786 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Apibacter → Apibacter adventoris | 609
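
The per-lineage scaffold counts under Dataset Phylogeny can be cross-checked against the scaffold list. The following is a minimal sketch in Python, assuming the 22 rows have been transcribed by hand and each lineage reduced to its last node. The names `leaf_taxa` and `counts` are illustrative and not part of NMPFamsDB.

```python
from collections import Counter

# Last lineage node of each associated scaffold, transcribed by hand
# in the order the scaffolds are listed (hypothetical variable name).
leaf_taxa = [
    "Porphyromonas sp. oral taxon 279",   # Ga0114367_100002
    "Alloprevotella sp. oral taxon 473",  # Ga0114367_100021
    "Porphyromonas sp. oral taxon 279",   # Ga0114367_100031
    "Alloprevotella sp. oral taxon 473",  # Ga0114367_100047
    "Alloprevotella sp. oral taxon 473",  # Ga0114367_100269
    "Bacteria",                           # Ga0114367_100333
    "Haemophilus",                        # Ga0114367_101939
    "Streptococcus",                      # Ga0114367_105596
    "Weeksellaceae",                      # Ga0114367_106562
    "Candidatus Saccharibacteria",        # Ga0114367_107288
    "Bacteroidetes",                      # Ga0114367_107967
    "Weeksellaceae",                      # Ga0114367_108130
    "Candidatus Saccharibacteria",        # Ga0114367_111059
    "Flavobacteriales",                   # Ga0114367_113561
    "Flavobacteriales",                   # Ga0114367_113725
    "Predicted Viral",                    # Ga0114367_114889
    "Not Available",                      # Ga0114367_115963
    "Flavobacteriaceae",                  # Ga0114367_116644
    "Not Available",                      # Ga0114367_116986
    "Candidatus Saccharibacteria",        # Ga0114367_127062
    "Siphoviridae sp. ctHip2",            # Ga0114367_130210
    "Apibacter adventoris",               # Ga0114367_135786
]

# Tally scaffolds per taxonomy group.
counts = Counter(leaf_taxa)

# The grand total should equal "Sequencing Scaffolds" under Dataset Contents.
total_scaffolds = sum(counts.values())

for taxon, n in counts.most_common():
    print(f"{taxon}: {n}")
print("total:", total_scaffolds)
```

Reducing each lineage to its last node is a shortcut that happens to be unambiguous for this sample. Counting on the full lineage string would be the safer choice in general.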

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0114367_100002 | Ga0114367_100002169 | F043235 | MDEVVASDEGHLLIDLCDDDPRSLCSGLGIVTRYPEGAIALFIGLAHRDQCDIDRIDTIPKEVWEFMEVTREIVNTLIQVSGAAILVKEVKDGMYMPHHLWAEVPRLGKVQHVEGFHVREALAIIVEGFGETAGGCHSMAKDQEVPALYYRSHGFKGGRSMTSNVLLPGSAH*
Ga0114367_100002 | Ga0114367_100002233 | F098763 | MDRRASLRILSPALYLATKFFKAVVRTTISYDPSDEVEGKGGMNAIPTAVDE*
Ga0114367_100002 | Ga0114367_100002244 | F047508 | MLGHRLVEGRVKYPYLRDLGEHLRHGFDTEDVGWVVKRSELGALMEHIYYLWGDTYALSKALCTVYKAVTNGIDLIEGLYEVLFFENVEDNLYAACVVRNVKVALNLLAFGVTEGDEGVVDPYALFVPRGQDLVVGELDEGELQGGATTVEDQDFHKELYYMVRCELILSSP*
Ga0114367_100021 | Ga0114367_10002194 | F080164 | MVGLTLSAAPQVTLRERANAFPLITEKDPSEIYAPYAWRLPVVPLRLDNREIRNFAKYPALPSLSGGKLTVRVLVVGDTVAVHQDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDEQVTLHLPGFEKAEKPLLYKGQEGRLVLCEYYESHRGDLFLNAANAHPEIFGELCPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPARS*
Ga0114367_100031 | Ga0114367_10003153 | F055792 | VEIASEALDSTSAVAHRILLLTTQLGESLLASLGAENGVIAEAMVTRALERDLAIDCALEEVRPVFVDESDDSTEAGTTWSRYPLETLQKEGYILFEGSMLPCEARRVDPRCSVKSLDLEPRIIGEAIEPVALPHVARLDESIALQGIGGLRDLLVTPDVSETDYLQTPREEGTDLLQLMSIIARKYQLFHTFVS*
Ga0114367_100031 | Ga0114367_10003154 | F040149 | VADEGAEELRGEVLIEEQGIPVLFVEVEAWYDGRVSSSEILRSVGVALEREPRLAPVWSHDSEDAIDYFIYDVLVPEGHALTAVRERETVVAQLLNIHRYVYYP*
Ga0114367_100047 | Ga0114367_1000472 | F072446 | MKKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGIVPEGTTYRVPVIPRSVEDKTEKEYNDMDLGKEAHLVFRATVHGDTINRRRKELDAIALQLGRLTETSIGTSPVLCGVKSIEAVGIAENGNTYDLSWEIKLRIRDYFGRVKHRSSGIITIDCEDTRSKTAKYVVPLGRIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGKVLSFQHKLPSKSVLQELPSKSVQQYYTPNGYERESTYFTALWPVPDYKYNEMEW*
Ga0114367_100269 | Ga0114367_10026916 | F085820 | MYPRDLKVFAAGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGDRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPDHLKQRIVLRLADGTEIEKELSEKGKK*
Ga0114367_100333 | Ga0114367_10033346 | F105376 | MDNTDKEVSAKEFGALGADVIHIKESVDRHTVTLERIENIARANVTQSQLKTYIAEHEKESEEKYVKRTEIEGVMNFWSLVTSNLAKLFAIALVGLAIYATNNLIQQNKAVTELQEEVQQTVRRK*
Ga0114367_101939 | Ga0114367_10193912 | F067846 | MSIIAEWERKTFNDYDRRCCAEDAYNEAVEREIECIEEDISNGDSDALCAFSEKMFDDDEFLKAVALGTDYEEMRIKILTAMAEDRLEQLEEDYRKGYILNY*
Ga0114367_101939 | Ga0114367_1019397 | F073671 | MNKEQAEHELAELHEKERSLEKALELVREKIRELVNYTNKNKGQK*
Ga0114367_105596 | Ga0114367_1055963 | F103436 | MITTSKDGWCSKSDAEILNSLKDLVSSCDVDFKREALKKIDSAFALWGGRQYVAAVDLLDENEVFLKKSDWPYYALGIQILKVRKHEFFNE*
Ga0114367_106562 | Ga0114367_1065622 | F063777 | LIGWGIYSGISSIISAVKEDFFGIKDKTNIKTSKNILFENEQFKLIKEDYLPDENSQEYKIFNDFCVKSNEYLDDGYIFYKLTDEKSATDLNGAIISEFQEDIGNYILLQNLILEDSQLKNQLISFNKNTGKITVLADIKDFFWLDFDSETKIINGYNNKEQIEIAILE*
Ga0114367_107288 | Ga0114367_1072882 | F033081 | MHTDMHTDITVVYRPKKGVIAWLFRRAMPQDTRPTFVWSRLVTEIENAGYFSRRKFCVVAVGLIIMTIATIKMLLFVPGLNQSVVSLLTRGLETFLPTGWATVTAWTVGMAGGVFLMGDFTNYTPSQKFLHKIKATRCEVYNTILFLALLEEQAFRSGSERWNWRERVRASVCFGLLHITNIWYSFAAGIALSATGFGFLLVYLWYYRKYRNQIIATAAAATVHALYNAIALSLITVVVAVYLAIDIAKLL*
Ga0114367_107967 | Ga0114367_1079675 | F030786 | MKRTKIHNVVFQMLVVMIVTGSLQLLLKNGSAAKGGNATGAKKISLADITDGDKSKVVAIKVKYFDDEGADKIFENNNNRILNTINSQHISYNSQSAEYSKPRLFLLYQSLKVDC*
Ga0114367_108130 | Ga0114367_1081303 | F071327 | MEDLFNSVYSTHKGISFSTVVVFGAFIFLLLQVHLSYKGRISDVLRKTSFFSMILLYIQGILGVFLGIYSPEFSEASGFSSYFKLFEYGIIILACAGMITYVYMFLKNNQILTLKVLIIALAAALLFEYAYPWRIIFG*
Ga0114367_111059 | Ga0114367_1110593 | F078842 | MIISSIYKTADNDGLIAHVYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFMDAELYSPEAKKTYDEALREFDKLVIPEDDILRAAGECGIEMNRNIVEVDRSELSKKLREVQISPWCKQIDMAYRKAHDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPIDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLKKDDKIINQLSCDFLEYIKSLSLSVFCDNLQKALVRCSDNHKQVILNRSTLNAILGGCVIGGKGWLEMADSARIRQMINSIELDIYEVNS*
Ga0114367_113561 | Ga0114367_1135612 | F047127 | MKKTFAFILLSIISLVKAQQTDIRYIPVISTDTISTKANLYPHVLSKNKFNPLVFENGFKVGERRQAVRLWDKDIFYLEFTDRKMNKRVFRQIPELKKNGKLFEVMLQGDVSWYRRYFSYRADTWDANYEHEDYFVKGDEIVNIPVKGRYKKKLKALLSDKPEIAKEVDQMVGDRDIREILEKYNSK*
Ga0114367_113725 | Ga0114367_1137253 | F040685 | MKKLLFKLFFALAFASISLHGQEKIQQVEFNSFGGMALYSSQYTLNSLKKEFSAKPLMGQKEELPKEISLPNTPKNWETFTKKINLNKFKKLRDGPSEQAFGGQDKVIIIKTNKKTYRKMNASGNDHDREVWYDLLQIIAEEFGKKGVYE*
Ga0114367_114889 | Ga0114367_1148891 | F105380 | MSILQLDASQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRRKDEKFICLTNIWPDGSSHEESYFPNGDRLTLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRSKVPSHNIKRCAVFLGKITKEQENVNTYEEFMEWTKNTKWFK*
Ga0114367_115963 | Ga0114367_1159632 | F105378 | MNVKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKELNNHYVFDDSIDKKNKVITLKIVGDNVKNQDIKNLEKKLALQVHLANNQIGYDSVNSKFYIGLNNQNVLLGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTADAISIYNTGSFTGSFQCLIVYPLGSVNE*
Ga0114367_116644 | Ga0114367_1166442 | F061926 | MMIKTAKHIKTFLASVLLLIFVMNVSGLFVQLHHQETHQKTEKIAECSDKVCYHKAHLQTKSDCDCGFLCTLNYFYILPEKPQTEFHVNEYFSYFSSYKIFISERIILLWQSRAPPVLS*
Ga0114367_116986 | Ga0114367_1169862 | F061925 | MSWEYSINLDSEEAVNSVVTDLKICELFSSSTTDYIDWKNPKSIDDIPYDARFYTDKKKTIYIAINSFSKNIFSALKSVTLEHIFRSVI*
Ga0114367_127062 | Ga0114367_1270621 | F095633 | MGNYEKSTEAWRREGLTEGELRTMGALAVEATEELKKTTIRKEVVLLGGVPFNSWDEFAKAVQEMAAHSYEPIPVKINTKRLIATAFLDDEGEMSVEERFVPEEVFIDLSRTRCDAEEDRNHKSYEFTCPALMEHPDGKLYLTRKAYVISVIDVNGSQEVDFNIIYGGLN*
Ga0114367_130210 | Ga0114367_1302101 | F105380 | MEGRQTQYNINLADLDDQISDGIVYADRSGKMIYKFGAKKIIQTAITKDLTITGLDDEFKMDYYSFWVPDIYLISFKSFNPDGGLYLAYHKKDEKHICLTNIWPDSRNQDDTYFPNGKRLETKSICTGRMMDDIDSAEYSAWKNDPVTRASQYINKFINARGNADLDFVNSTLRRKVPNHNTKKFAEFLGSITKE
Ga0114367_135786 | Ga0114367_1357861 | F089056 | MVLEKNHTFFLWNNDSLGCKHERTIEMGEELYNTFKKSNKNDSILLKEYLGTPNRRFKDKEVIVFMYYINSCCDNGQLLEECDVSFISITFTNKNKILFGKGIQ*
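
Each record in this table pairs a scaffold ID with a protein ID that starts with the same string. The following is a minimal sketch in Python of how one record might be split into usable fields. It rests on two assumptions that this page does not state: that the digits after the scaffold ID are a per-scaffold gene number, and that a trailing `*` marks the end of the translated sequence. The function name `parse_record` and the returned field names are hypothetical.

```python
def parse_record(scaffold_id, protein_id, family, sequence):
    """Split one Sequences row into typed fields (illustrative only)."""
    if not protein_id.startswith(scaffold_id):
        raise ValueError("protein ID does not start with the scaffold ID")

    # Assumption: the remaining digits are a per-scaffold gene number.
    gene_number = int(protein_id[len(scaffold_id):])

    # Assumption: a trailing '*' marks the end of the translation.
    has_terminator = sequence.endswith("*")
    residues = sequence.rstrip("*")

    return {
        "scaffold": scaffold_id,
        "gene_number": gene_number,
        "family": family,
        "residues": residues,
        "length": len(residues),
        "has_terminator": has_terminator,
    }


# Second row of the table, copied verbatim.
record = parse_record(
    "Ga0114367_100002",
    "Ga0114367_100002233",
    "F098763",
    "MDRRASLRILSPALYLATKFFKAVVRTTISYDPSDEVEGKGGMNAIPTAVDE*",
)
print(record["gene_number"], record["length"], record["has_terminator"])
```

Keeping the terminator as a separate flag rather than discarding it preserves the distinction between rows that end in `*` and the one row (protein `Ga0114367_1302101`) that does not.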




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice