NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008521

3300008521: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 370425937 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300008521
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0053242 | Ga0115192
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 370425937 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 94352366
Sequencing Scaffolds: 17
Novel Protein Genes: 25
Associated Families: 25

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 2
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 4
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 2
Not Available | 5
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → unclassified Lachnospiraceae → [Eubacterium] rectale | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: National Institutes of Health, USA
Coordinates: Lat. (°) N/A | Long. (°) N/A | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F040149 | Metagenome | 162 | N
F043235 | Metagenome | 156 | N
F045567 | Metagenome | 152 | N
F047508 | Metagenome | 149 | N
F053092 | Metagenome | 141 | N
F054109 | Metagenome | 140 | N
F055792 | Metagenome | 138 | N
F071328 | Metagenome | 122 | N
F072446 | Metagenome | 121 | N
F077405 | Metagenome | 117 | N
F080164 | Metagenome | 115 | N
F080166 | Metagenome | 115 | N
F081455 | Metagenome | 114 | N
F084342 | Metagenome | 112 | N
F089057 | Metagenome | 109 | N
F092229 | Metagenome | 107 | N
F092230 | Metagenome | 107 | N
F092232 | Metagenome | 107 | N
F095629 | Metagenome | 105 | N
F095631 | Metagenome | 105 | N
F097527 | Metagenome | 104 | N
F099452 | Metagenome | 103 | N
F099453 | Metagenome | 103 | N
F103436 | Metagenome | 101 | Y
F105379 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0115192_100040 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 140083
Ga0115192_100128 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 58063
Ga0115192_100341 | All Organisms → cellular organisms → Bacteria | 28524
Ga0115192_100806 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 14053
Ga0115192_102397 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 5879
Ga0115192_104126 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 3638
Ga0115192_104129 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 3637
Ga0115192_104210 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 3583
Ga0115192_106700 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 2317
Ga0115192_107842 | Not Available | 1996
Ga0115192_108130 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → unclassified Lachnospiraceae → [Eubacterium] rectale | 1927
Ga0115192_110265 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1521
Ga0115192_111236 | Not Available | 1404
Ga0115192_111851 | Not Available | 1328
Ga0115192_114687 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 1059
Ga0115192_117525 | Not Available | 882
Ga0115192_125914 | Not Available | 578

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0115192_100040 | Ga0115192_100040117 | F092232 | MNTQAKFIADYNDKNRPKFNDKFFTKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYTEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAAAAKTQSITLKTNSNAVKMLRNFVDLNTTKEKTLRMAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFEDIIKISDHDIDDPEYYTFAIANAHMKNPFYISAVKSFVDNDRILQSFIASFAKAISLYATKKTTLDQIYTTEFWVCKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDDNFSKMLNIYREEKGYTSAIMLADDAGLELTDNRDPEAVAFDANLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV*
Ga0115192_100040 | Ga0115192_10004014 | F095631 | MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYLGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELVLNTISDTFRFDRDIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYMTYHKTIITNNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGVSKYCNYTYIDSSTIRVMFDNASYLPTANTEVTVNLYTSQGANGNISYKDSIYFRVKSDKMNYDRLNLLVIPTSDSQYGIDKKSIADLKRLIPKEALARGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASIAYQASEDELNAARKNEFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFIATKMNWYRHYLTERDTYFGDISIMQNIQSDIGLVHKDDPYDPEKITGVDIKVLAVFYTDDKYQVPYRWAEAEFVNYDQGTYVMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMHMKIFVFAKDVFGYNAGLHKSDHIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKVKKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILDCLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLSEYIKNDIRKYIEDKSRISDIHIPNIVTYITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA*
Ga0115192_100040 | Ga0115192_10004040 | F081455 | METTINKKDLGFINNLIRDCDDYININHRERMAEKLLRVAIEDSDIPPVEEIGMQNIEIVDSATDAIQQPLVNTDSSIAVNFSQMINKPEEVKTEVASVPDNGETKVNVFFPKNEHILGNYVDYDSFNKIKESNTKTVVRAVRLLNYKMADQNAAMKFGQFVSEFNPECDPNKRLRYELIRHQGREKDLVVRLSTVVNGTTKYYADIYPDLNKIDIDHHLISSARK*
Ga0115192_100040 | Ga0115192_10004041 | F080166 | VNILANFDNYNKVVEQIFELNYQLTLKMEVTFNNTIKRINTEIKENFHTEYVVGANKLTTNLRYRYRMRLSPRGETVGVIIDWDNYDDLCTIVDEAIDICDPNNKTSPFKRMYSTAGDLLDIKCDSLKVRYLHLDDRFGNRLDLMPFVLIDDHNGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGAMARYMSMTPLLGTNRQNMMR*
Ga0115192_100040 | Ga0115192_10004044 | F099453 | MNRFDVIELAQETLTFVYNTFNGKVNTLDPYTRLNFVAGYLDTKTNIARTTPYGCIYISLEAFADTVEAHKFIDTDQIRNLALEIIIHELTHVDQLIDYKCIKFNNGYRDEIELQCVKQSCQWILDNIQYIRSFGLVVIPEVYQARLANLTNEVYTPKYPMAIAMGKLEYMLGKKFREFSNNNIEIEYVDRLKTHYTFMVCENRIYINSANLNDLGERLLNDKQYTVEYLEYGDSKLVIKITQGA*
Ga0115192_100040 | Ga0115192_10004066 | F092229 | MNEEYRFKHIPEVVLRNVKFIRENNIDIGTGDDVLECMMDINPVLRQRIYDDYDLAKDVAERRFNSTIEDLDLTTILQKCTTRPYIAILNNIYFRYFNSKLIEDMFKLGESIKVLDLAIEYECEYYTVNSAKTNIRRYMQQAYFDKYAADADIISSHRVLSDPQVNAVKSAEFTYDLFMAARSENFNPEMVRDIFLKYGLKTNSSRNLYTRMDNNLSLYYYMEDYLDEYVKNGKVTYGSQEYHTIKEFKYLPLMNVLTQLTSSNPSGYILNHKLELVKENK*
Ga0115192_100040 | Ga0115192_10004092 | F105379 | MVIQFQLSQSDIESLLSISKLLKCDKILYDRNYINSIIGVGPERSYFQTTSYMIDLDPSINNLLINSLDLKNLSKATDGADITKTNVPVFDWDTLYIKSCMDSLREYQVDSHIIARDDNFHESNCYSELMAGSASTGACRINVDKYLIDIPKSAMPTLKSDHVEAIVYEVPNRNFNVLRFKITKRNGIIVNQSMLFLPY*
Ga0115192_100040 | Ga0115192_10004097 | F095629 | MRELIICLCLFGCFGVANASTLVEQPKEVKVVHNDDSVALHKKVYKLEQRIERLEKLLAEKEGK*
Ga0115192_100128 | Ga0115192_10012831 | F089057 | MTNIIPLIAKKYNRKGDTSGTLKSLVDDLVFIEDVDDSLLFITNIPRETKYSIEEVFNIITSNDKYSEVLSNILSSLNIDLDYHKLLLNAIDSESYKIISLISDNIPTPDLFLSKNNYGCLTTALGKSYTIFDKVLGMVISQLLHTSSKEDKILGLFMAICIINKDIDKLASLCTGYLAITKDEVLVKKLMNESAMTAFKYMPDEDIHTVVDDINSRSVLARYLSRM*
Ga0115192_100341 | Ga0115192_10034118 | F054109 | MAGRPKSKKGAKVHTAFKIYPADKERAQAMADKLDISLSAYINRAVLEKVARDEKSED*
Ga0115192_100806 | Ga0115192_10080610 | F099452 | MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELLLSSMNVFNLYKDIINLDDVSLLAELRHTEWYRDWFTSDKRNSDLIDLSKFNFRVLERFEKEEYLRDAEHYDFEGVSEVDSYDLFDTLSEDEDIELFKLAAENILINHGFFNNTDYNLYEIPDEYMSNQEVCLYMCLLNTDNLDFMDKKTFDSTLLYNIVKDRICGSVYFTIFDSLNEDTRTRAR*
Ga0115192_102397 | Ga0115192_1023974 | F084342 | LGGYLAIPIRRPFFAKALKKTLGQERSLGDAYDKDYQNEKALQWIHRRHILGICKEE*
Ga0115192_104126 | Ga0115192_1041263 | F055792 | MEIASQALDSTSAVTHRILLLTTQLSESLLASFGAEDGVIAEAMVTRALESNLSIDCALEEVGPVLIDKSDDSTEAGTTRGRYTLEALQKEGYIIFKGSMLPCEARRVDPRSSVKSLDLEPRIGGLWDLLVAPDVSQADDL*
Ga0115192_104126 | Ga0115192_1041264 | F040149 | VADEGAKELRWKVLIEEQGIPVLFVEVEAWYDSGIGSSEILRTIRLTLEREPRLTPVWSHDSEDAIDYFIYDVSIPERHTLTAVRERETVVAQLLNIHRLF*
Ga0115192_104129 | Ga0115192_1041295 | F103436 | MITTSKDGWCDKSDAEILNSLRDWVLKCDLKYSKREALKKIDSAFALWGGRQYVAAVDLLDENEVYFSKEDWPYYALGIEILKARKYTYFY*
Ga0115192_104210 | Ga0115192_1042103 | F045567 | MCQRDRCDTLTEELEGGITPLLYRTEGEARRPWVRMVTEDVVHTGTHRVEDALLPVDGDILTPRDGTHIVQTERVVVVLVSQEDSIDTIDTETCGLVVEVRATVDEDTLPTLGDDEGRGAQTTVTSIRAMAHRAATAYLGDTSAGARTEKNYLHVSRRESYHGKEIPSLSSERGCVVERVVR*
Ga0115192_106700 | Ga0115192_1067002 | F047508 | MLGHRLVEGCVKYPYLRCIGEYLRHSFDTEDVGWVVKRSKLCALMEHIYYLWGDTYALSKALCTVYKAVTNGIDLIEGLYEVLFFENVEDNLYAACVVRNVKVALDLLSFGVTEGDEGVVDPYALFVPRGQDLVVGELDEGELQGGAATVEDQDFHKVLYYMVRCELILSSP*
Ga0115192_107842 | Ga0115192_1078423 | F080164 | MVGLALCAAPQVTLRERASAFPLITEKDPSEIYAPYAWRLPVVPLRLDNREIRNFAKYPALPSLSGGKLTVRVLIVGDTVAVHQDLMDDFAKRCRATLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDKQVTLHLPGFEKAEKPLLYKGQTGRLVLCEYYGSHRGDLFLDVANAHPEIFGELCPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPARS*
Ga0115192_108130 | Ga0115192_1081301 | F092230 | MRQAHKRMVNKLKTHLLKVFLPLFIVCIILVAFFRQIGCGSEGEYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNGSEYINMAYGEGSVEVSDSTSTGEVQNTEEKEARDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIDMAVGYYSYLKPETVYEFYCKLRRKEKYPSDVNLVKRIGFLIIILPPCMLFLLIV*
Ga0115192_110265 | Ga0115192_1102653 | F097527 | MIYFKMEKIGNSTHNKEKKTRSENLVFITIPAAGGGTARSVG
Ga0115192_111236 | Ga0115192_1112362 | F071328 | MSQKHWTFTHIIRYIEEYKRNPLLIERIKWKFILEGEYIVEFVKLCKHLVLERTIDPKKHQAAAICLRYSSQLLLKKRRAIRRLGIEKKYVSAILRQYGIHYIEYGDNEHRVFFLDRGINLYFSKHHQSSIYIIQRIEFSNEEYRSFILKVLPVKKSEW*
Ga0115192_111851 | Ga0115192_1118511 | F072446 | MTYRAMHEFDAQAHKNKFNPPTHPQMKKLVFLLFGLCLYCFTACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGRVPEGTTYRVPVIPRSVEDRTKKEYNEQDVDKVAHLVFRATVHGDTLNRHKKELETLARQLHALTLTTIGTSPVLCGVKSIKAVGVAENGRTYDLSWEMKLRIRDYSGRVKYNSGIVTLNCEDTESMTARYVVQLGQIREHELAEHIQPEMKFYLPVKRCMDFSSIRFDITLFNGEVLSFQHKLPSKSVLQELPNNSVLENYKQYGFEREATYFTTMWPLPEKEIER*
Ga0115192_114687 | Ga0115192_1146871 | F043235 | RANRSKRELHQGAVNGDDIVQLRHMDEVIPSDEGHLLIDLCDDDPRSLCGGLGIVTRHPEGAIALFVGLAHRDQCDIDRIDTIPKEVWEFMEVTREEVDSLIQVSGATILVEEVKDGMYMPRHLWAEVPRLSKVQHVEGFHVRKALAVFVEGFGEAAGGCHSMAKDQEVPALYCRSHGFEGGRSMASNVLLPGSAH*
Ga0115192_117525 | Ga0115192_1175252 | F077405 | FPRLLVAKQRGVFYGFILLCQIKFVKNFFDWLKIIQKQKVRLK*
Ga0115192_125914 | Ga0115192_1259141 | F053092 | PFTDQLMQSDQGLILCLTHALLVLGALILESAEMEDTMDDHTVQLFGILIAKELGIATHRIKADEHVPRDHIPLTLVEGDDIGIVVMIEKVLIGLQDALITTELVAELADTTVIASSDLTDPVAKDTLSEARLLDVFVSVVSYKLRFFRHK*




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice