NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300007315

3300007315: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 159247771 reassembly



Overview

Basic Information
IMG/M Taxon OID3300007315 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052722 | Ga0104930
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 159247771 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size120670921
Sequencing Scaffolds13
Novel Protein Genes25
Associated Families23

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales1
All Organisms → cellular organisms → Bacteria2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis1
Not Available5
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F046433Metagenome151N
F054109Metagenome140N
F071328Metagenome122N
F076191Metagenome118N
F078842Metagenome116N
F080166Metagenome115N
F081455Metagenome114N
F084362Metagenome112N
F085820Metagenome111N
F089055Metagenome109Y
F089057Metagenome109N
F092229Metagenome107N
F092232Metagenome107N
F094007Metagenome106N
F095629Metagenome105N
F095631Metagenome105N
F099452Metagenome103N
F099453Metagenome103N
F103430Metagenome101N
F103435Metagenome101N
F103436Metagenome101Y
F105378Metagenome100N
F105379Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0104930_100003All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales237799Open in IMG/M
Ga0104930_100010All Organisms → cellular organisms → Bacteria122596Open in IMG/M
Ga0104930_100066All Organisms → cellular organisms → Bacteria70501Open in IMG/M
Ga0104930_100108All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis57578Open in IMG/M
Ga0104930_100239Not Available39990Open in IMG/M
Ga0104930_103704Not Available6227Open in IMG/M
Ga0104930_104038All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes5728Open in IMG/M
Ga0104930_107721Not Available2846Open in IMG/M
Ga0104930_113931Not Available1451Open in IMG/M
Ga0104930_120025All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria961Open in IMG/M
Ga0104930_122971All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales832Open in IMG/M
Ga0104930_127096Not Available705Open in IMG/M
Ga0104930_133550All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus551Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0104930_100003Ga0104930_100003115F092232MNTQAKFIADYNDKNRPKFNDIFFTKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYTEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEETFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAAAAKTQSITLKTNSNAVKMLRNFVDLNTTKEETLRMAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFEDIIKISDHDIDDPEYYTFAIANAHMKNPFYISAVKSFVDNDRILQSFIASFAKAISLYATKKTTLDQIYTTEFWVCKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDDNFSKMLNIYREEKGYTSAIMLADDAGLELTDTRDPEAVAFDAHLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV*
Ga0104930_100003Ga0104930_10000314F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYLGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELVLNTISDTFRFDRDIKIMVGDYEFHLPYDLIIKRIELPTREYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYMTYHKTIITNNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGISNFCNYTYIDSSTIRVMFDNTSYLPTANTEVTVNLYTSQGANGNISYKDSIYFRVKSDKMNYDRLNLLVIPTSDSQYGIDKKSIADLKRLIPKEALARGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPINIIPTNTIPIEAIRRDFDNISDSNYILTAGNVIKYDGTTNASIAYQASEDELNAARKNEFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFIATKMNWYRHYLSNRDTYFGDISIMQNIQSDIGLVHKDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQGTYVMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMHMKIFVFAKDVFGYNAGLHKSDHIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKVKKQDNGQISYIIDRVPVISYDYVNTEERIQEFINNLEKKRIHILDCLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLSEYIKNDIRKYIEDKSRISDIHIPNIITYITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA*
Ga0104930_100003Ga0104930_100003159F099452MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSMNVFNLYKDIINLDDVSLLAELRHTEWYKDWFTSDKRNSDLIDLSKFNFRVLERFEKEEYLRDAEHYDFEGVSEVDSYDLFDTLSEDEDLELFKLAAENILINHGFFNNTDYNLYEIPDEYMSNQEVCLYMCLLNTDNLDFMDKKTFDSTLLYNIVKDRICGSVYFTIFDSLNEDTRTRAR*
Ga0104930_100003Ga0104930_100003199F089057MTNIIPLIAKKYNRKGDTSGTLKSLVDDLVFIEDVDDSLLFITNIPRETKYSIEEVFNIISSNDKYSEVLSNVLSSLNIDLDYHKLLLNAIDSESYKIISLISDNIPTPDLFLSKNNYGCLTTALGKSYTIFDKVLGMVISQLLHTSSKEDKILSLFMTICIVNKDIDKLASLCTGYLAITKDEVLVKDLMNESAMTAFKYMSEEDIHNVVDDINSRSVLARYLSRM*
Ga0104930_100003Ga0104930_100003207F103435MKFVFCTEPIYQYYRSYLYADDKDKLDKQLMIEYGDYKDIWDLKQQQDALPENIFVAELTARDYPRNPWNYVSQLISKLTYQYLIDSPDFETIFSEVLFNQSEVEFYEFYKAIDRFYNGSEVFIIVSNDEYSDMVTQMMCNVIRRTYGIHPQIIYDMDDVYSIRDDIDFSPQGAQLAYLQRAAYYKLEAKKNFEPLQIWYPFDMNTYTNALE*
Ga0104930_100003Ga0104930_10000340F081455MEITNTENKNLFQQLSSLGFDVNESLLELEKDYSPVEEIGMQNVEIIDSAEEAIQQPLANTDSSIAVNFSQMINKPEEVKTELVSTPDNGEAKVNVVFPKNEHILGNYVDYDSFNKIKESNTDKVVRAVRLLNYKMADQNAAMKFGQFVSEFNSECDPNKRLRYELIRHQGREKDLVVRLSTVVNGTTKYYADIYPDLNKIDVDHHLISSARK*
Ga0104930_100003Ga0104930_10000341F080166VNILANFENYTKVVEQIFELNYQLTLKMEVTFNNIIKRINTEIKENFHTEYVVGANKLTTNLRYRYRMRLSPRGETVGVIIDWDNYDDLCTIIDEAIDICDPNNKTSPFKRMYSTAGDLLDIKCDSLKVRYLHLDDRFGNRLDLMPFVLIDDHNGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGAMARYMSMTPLLGTNRQNMMR*
Ga0104930_100003Ga0104930_10000344F099453MNRFDIIELAQETLIFVYNTFNGKVNTLDPYTRLNFVAGYLDTKTNIARTTPYGCIYISLEAFADTVEAHKFIDTDQIRNLALEVIIHELTHVDQLIDYKYIKFNNGYRDEIELQCVKQSCQWILDNIQYIRSFGLVVIPEVYQARLANLTNIIYTPKYPMAIAMGKLEYMLGRKFREFSNNNIEIEYVDRLKTHYTFMVCENRSYINSANLNDLGERLLNDKQYTVEYLEYGDSKLVIKITQGA*
Ga0104930_100003Ga0104930_10000365F092229MNEEYRFDHIPEVVLRNVKFIRENNIDIGTGDDVLECMMDINPVLRQRIYDDYDLAKDVAERRFHTTIEELDLTTILQKCTTRPYIAILNNIYFRYFNSKLIDDMFKLGESIKVLDLAIEYECEYYTVNSAKTNIRRYMQQAYFDKYAADADIISSHRVLTDPQVNAVKSAEFTYDLLVAARSENFNPEMVRDIFLKYGLKTNSSRNLYNRMDNNLSLYYYLEDYLEEYVNTGKFTYGSQEYSTIKEFKYLPLMNVLTQLTRSNPSGYILNHKLELVKG*
Ga0104930_100003Ga0104930_10000392F105379MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYVNPIIGVGPEKSYFQTTSYMVDLSPHINNLLVNISDLKNLGKITQLEPSKENPEIAIHKPVVSVFNWDTEYIKACMNSLREYQIDDNIIARTDEFHNTDDYNELMAGSASTGAFRINVGGYMIDIPKSAIPTLKSDHVVATVYNAPNKDFNVLRFKITKRNGIIVNQSMLFLPY*
Ga0104930_100003Ga0104930_10000398F095629MEVKTMRELIICACLFGCFGVAHADAPVEQPKEVKVVHNDDSVALHKKIYKLEQRIERLEKLLAEKEGK*
Ga0104930_100010Ga0104930_100010134F094007MKLKTVEVLELARPNRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYSDEDFTKRNRIIVEMCDLFGRIRRRAGFAERHRGRGDYDRARRIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRSGWNKGCIDNNA*
Ga0104930_100010Ga0104930_10001024F089055LKVEKMNSTPECVTKTPEIESRKKEAREKLAAIFSDAEQRDNSKVNPELGKTAIDIKMDFADNGAVDFCNQALGSYGKSLDYINNSPLETVQAIGNSLQLLREYKTKESCR*
Ga0104930_100066Ga0104930_10006639F078842MIISSIYKTADNDGLIAHIYEHLLAQYVLKSLQDNGFFVLSDIILSAKTYGDTCFMDAELYGPEAKKAYDEVLREFDKLVIPEDDILRAAGECGIEMNRNIAEVDRSELSKKLREVQISPWRKQIDMAYRKAHDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDERHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLKKDDKIINQLSYDFLEYIKSLSSSVFCDNLQKALVRCSDNHKQVILNRSTLNAILGGCIIGGKGWLEMADSARIRQMINSIELDVYEVKS*
Ga0104930_100108Ga0104930_10010825F046433MIELPTSPNALSELSPVAPPKLLSQAQDASRGNLMVYVKADNYLGTETSDPSFMKSRYKTTEYEAINDFVQFIETTKRYLPDYMEDCAKELIDELAFLGMPELNFAANALAKRLRHHLEVDNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIGKSSKILFLDDWIVSGDQVKERIAGFEVDNDPESHEASVLVMAASGDYLDNGISAYSQYGGATYSVEACYRLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEGIDELSLPALANIVRPYRNGKNFDGLSRFRQLLEKG*
Ga0104930_100239Ga0104930_10023918F103430MKQITIKAFIGSNNKTKELEVDKIISTVNTNHEAFTLQYPVIGCWRGEAEETAVLYLSDERQKVMNTLNELKEVLDQEAIAYQIENKINLI*
Ga0104930_100239Ga0104930_10023948F084362MSLMNCTFTVRWSDDKNKAHAKTYDTEDDAKRAKKWLLEHGVRSVDIAVKINNKPAGSLKDDKQSETEAEQKGFWWEK*
Ga0104930_103704Ga0104930_1037042F084362MNCTFTVRWSDDKNKPHAKTYATESDAKRAKKWLLEHGVRSVDIAVKINNKPAGSLKDDKQSETEAGQKGFWWEK*
Ga0104930_104038Ga0104930_1040388F054109MAGRPKTKRGMKVHTAFKIYPDDKARAQAMADKLELSLSAYINKAVLEKVERDEKSEA*
Ga0104930_107721Ga0104930_1077212F105378MNVKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSTHQFVTDEEKNKWNNKLNAPVPMQDHLANNQIGYDSVNSKFYIGLNNQNVLLGGSSCFDNIIVINGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTTDAISIYNTGSFTGSFQCLIVYPLGSVNE*
Ga0104930_113931Ga0104930_1139312F085820MRSTFYLFAVLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVFAAGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGDRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPDHLKQRIVLRLADGTEIEKELSEKGKK*
Ga0104930_120025Ga0104930_1200251F046433SDPSFMKSRYKTTEYEAINDFVQFIEMTKHYLPDYMEDCAKELIDELAFLGVPELNFAANALAKRLRHHLEVDNKPVYIDVGNSLSQYRAKNKMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIVSGDQVRERISVFEVDNDPEDHEASVLVMAASSNYIDNGIGVDSLWGEATYPVEAYYRLKNDHNDWGVSRVTGIHSSTDSAFGYEVDDIAYRAIEGGILKGERIDRLTLPALVNIVRPYRNGEDFDGLSRFRQLLERE*
Ga0104930_122971Ga0104930_1229711F071328MKWEFILEGEYIVEFVKLCKHLVLERTIDPKKHQAAAIYLRYSSQLLLKKRRAIRRLGIEKEYVSAILRQYGIHYREYGDNEHRVFFLDRGINLYFSKHHQSSIYIIQRIEFSNEEYRSFILKVLPVKKSEW*
Ga0104930_127096Ga0104930_1270962F076191MKIIAENPAEEALLWRIKALSDELVNQDNRSTSMPVWTILDNNKAGKDYGAVMYFTGKAAEQHINENAHHYDNPTTCVRSAHDNRELKDIVHLLILAGGNEIPNNHYGFLRDA*
Ga0104930_133550Ga0104930_1335502F103436MITTSKGGWRYKSDFEIFDSLRDWVMKCDVKYVKRDALDKIDYARSLWCRAEYVAAVHLLDENEVFLKKSDWPYYALGIQILKARKH

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.