NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008589

3300008589: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765640925 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300008589
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052956 | Ga0111083
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765640925 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 106628167
Sequencing Scaffolds: 23
Novel Protein Genes: 26
Associated Families: 21

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis | 1
All Organisms → cellular organisms → Bacteria | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 1
All Organisms → Viruses → Predicted Viral | 7
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 3
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → unclassified Candidatus Saccharimonas → Candidatus Saccharimonas sp. | 1
Not Available | 6

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816, Long. (°) -77.1012173, Alt. (m) N/A, Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F032313 | Metagenome | 180 | N
F033081 | Metagenome | 178 | Y
F046433 | Metagenome | 151 | N
F054110 | Metagenome | 140 | N
F080166 | Metagenome | 115 | N
F081455 | Metagenome | 114 | N
F081510 | Metagenome | 114 | N
F089057 | Metagenome | 109 | N
F092229 | Metagenome | 107 | N
F092232 | Metagenome | 107 | N
F094007 | Metagenome | 106 | N
F095629 | Metagenome | 105 | N
F095630 | Metagenome | 105 | N
F095631 | Metagenome | 105 | N
F095633 | Metagenome | 105 | N
F099452 | Metagenome | 103 | N
F099453 | Metagenome | 103 | N
F103433 | Metagenome | 101 | N
F103435 | Metagenome | 101 | N
F105378 | Metagenome | 100 | N
F105379 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0111083_100077 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis | 42192
Ga0111083_100310 | All Organisms → cellular organisms → Bacteria | 21785
Ga0111083_101971 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 6842
Ga0111083_102859 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 5196
Ga0111083_103832 | All Organisms → Viruses → Predicted Viral | 4193
Ga0111083_104125 | All Organisms → Viruses → Predicted Viral | 3951
Ga0111083_105429 | All Organisms → Viruses → Predicted Viral | 3240
Ga0111083_107904 | All Organisms → Viruses → Predicted Viral | 2398
Ga0111083_109511 | All Organisms → Viruses → Predicted Viral | 2048
Ga0111083_109916 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1976
Ga0111083_109958 | All Organisms → Viruses → Predicted Viral | 1970
Ga0111083_112758 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1604
Ga0111083_113666 | All Organisms → Viruses → Predicted Viral | 1519
Ga0111083_116908 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1264
Ga0111083_116995 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1258
Ga0111083_117443 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → unclassified Candidatus Saccharimonas → Candidatus Saccharimonas sp. | 1229
Ga0111083_121712 | Not Available | 1016
Ga0111083_125368 | Not Available | 888
Ga0111083_126924 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 843
Ga0111083_126932 | Not Available | 843
Ga0111083_135441 | Not Available | 658
Ga0111083_136030 | Not Available | 648
Ga0111083_138110 | Not Available | 614

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0111083_100077Ga0111083_10007741F054110VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKIYIVSRMYNEEEYKLTFPHKYKKGKTFKEKQLYKKESEYSSTKQHEVLLFLVKTYKGGD*
Ga0111083_100310Ga0111083_10031012F103433MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDTSKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGRITASVFVVMLAFSPWIAGYARNIYWIEPLLIAPFVISFVGYQYFKKSKKLWLFYIIESVDMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFISLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFVQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE*
Ga0111083_101971Ga0111083_1019712F105379MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYVNPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLGKITQLEPSKENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGASRISVGGYMIDIPKSAMPTLKSDHVVATVYKAPNKDFNVLRFKITKRNGIIVNQSMLFLPY*
Ga0111083_102859Ga0111083_1028595F094007LKTAEVLDLARPNRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYSDKDFTKRNRIIVEMCDLFGRIRRRAGFAECHRGRGDYDRARRIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRSGWNKGCIDNNA*
Ga0111083_103739Ga0111083_1037394F103435MKFVFCTEPIYQYYRSYLYADDKDKLDKQLMIEYGDYKDYWDLKNQQDALPENIFVAELTSRDYPRNPWNYVSQLISKLTYQYLIDSPEFETIFSEVLFNQSEAEFYEFYKAIDRFYNGSEVFIIIGNDEYSDMVTQMMCNVIRRTYGIHPQIIYDMDDVYSIRDDIDFSPQGAQLAYLQRAAYYKLEAKKNFEPLQIWYPFDMNTYTNALE*
Ga0111083_103832Ga0111083_1038321F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYLGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYMTYHKTIITNNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGVSKYCNYTYIDSSTIRVMFDNASYLPTANTEVTVNLYTCQGANGNISYKDSIYFRVKSDKMNYDRLNLLVIPTSDSQYGIDKKSIADLKRLIPKEALARGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNVIKYDGTTNASIAYQASEDELNAARKNEFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFIATKMNWYRHYLTERDTYFGDISIMQNIQSDIGLVHKDDPYDPEKITGVDIKVLAVFYTDDKYQVPYRWAEAEFVNYDQGTYVMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMHMKIFVFAKDVFGYNAGLHKSDHIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKVKKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILDCLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLSEYIKNDIRKYIEDKSRISDIHIPNIITYITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA*
Ga0111083_104125Ga0111083_1041256F092232MNTQAKFIADYNDKNRPKFNDKFFTKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYTEVQTLLIGDETPSISIKDSDLKILKVTYHVACAKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAAAAKTQSITLKTNSNAVKMLRNFVDLNTTKEKILRMAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFEDIIKISDHDIDDPEYYTFAIANSHMKSPFYISAVKTFVDNDRILQSFIASFAKAISLYATKKTTLDQIYTTEFWVCKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASTKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDENFSKMLNIYREEKGYTSAIMLAEDAGLELTDTRDPEAVAFDAHLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV*
Ga0111083_105429Ga0111083_1054291F081455METTNTENKNLFQQLARLGFDVNESLLELEKEYAPVEEIDMQNVDIIDAAEEAIQQPLANTDSSIAVNFSQMINKPEVEEVKTEVASVPDNGETKVNVFFPKNEHILSNYVDYDSFNKIKESNTETIVRAVRLLNYKMSDQNAAMKFGQFVSEFNSECDPNKRLRYELIRHQGREKDLVVRLSTVVNGTTKYYADIYPDLNKIDIDHHLISSARK*
Ga0111083_105429Ga0111083_1054292F080166MEVTFNNTIKRINTEIKENFHTEYVVGANKLTTNLRYRYRMRLSPRGETVGVIIDWDNYDDLCTVIDEAIDICDPGNKTSPFKRMYSTAGDLLDIKCDSLKVRYLHLEDRFGDRLDLIPFVLIDDHNGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGAMARYMSMTPLLGTNRQNMMR*
Ga0111083_105429Ga0111083_1054295F099453MLRRKDMNRFDVIELAQQTLTFVYNTFNGKVNTLDPYTRLSFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCVKQSCQWILDNIQYIRSFGLVVIPEVYQARLANLTNEVYTPKYPMAIAMGKLEYMLGRKFREFSNNNIEIEYIDRLKTHYAFMVCENRSYINSVNLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGA*
Ga0111083_107904Ga0111083_1079041F081510GAKTTYNTAKILGKKYAPIALVTTGLVGYGIAVYQGIKSGKKLEATKAKYEAKDAAGEEYTRLEVIKDVTKDVAVPVAIAVASTAAIGLGFAIQTNRLKAVSAALTAVTEEHARYRLQCKEVLDEETFKKIDTPMDQVTVEEDGKEAQSFVPKEGLMYGNWFKYSANYASDSPEYNEQWIRESIRVLEEKIARKGLLNFSDMLDQLGFDVPKAALPFGWTDTDGFYIEYDIMEVWNAEEQVHEPQIYVRWKCPRNLYATTNFRDLIPGRKELA*
Ga0111083_109511Ga0111083_1095111F092229MNDYRFDHIPEVVLRNIRFIRENNIDIGTGDDVLECMMDINPIIRTKIYDDYEFAKDVAERRFGSTIGGLDMITLLQKCNTRPYNSILNNIYFRYFNSKLIDDLFELAQSPKILDLAIEYECEYYAINTAKTSIRRYNSDAYYNKFAADSNIVSSTRVLNNPQVNAVKSAEFTHELLMASRAEKFSPENVREIFIKYGLKPNPSRNLYNRINDNLNLFYYIEDYLDEYREEGKFIYGGKE
Ga0111083_109916Ga0111083_1099163F095629MRELIICACLFGCFGVANAAAPVDQPKEVKVVHNDDSVALHKKIYKLEQRIERLEKLLAEKEGK*
Ga0111083_109958Ga0111083_1099583F092229MNKEYRFNHIPEVVLRNIRFIRDNNIDIGTGDDVLECMMDINPVVRTKIYDDYEFAKDVAERRFGSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSPKVLDLAIEYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLMASRAEEFNPEIVREIFVKYGLKPNASRNLYNRINDNLNLFYYIEDYLEEYKEEGRFIYGTKEYKIIKELRSLPLMVVLTQLTRKNDSGYILNSNLELVKG*
Ga0111083_112758Ga0111083_1127582F095630DNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDMILSAKTYGDTCFMDAELYSSEVKKTYDEVLREFDKLVIPEDDILRAAGECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKAQDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAVVIQAVALNFLVMIREKHTVYDRGDQWSEASKSVGYRTFLGISKKDDSIVHQLKNEFMEYAQFLSASPFCDNLQAALVRCSHNYEQVLLGRDTLNSILGGCVVGGRGWLEMADNTLIRQILKAIEIDVYDI*
Ga0111083_113666Ga0111083_1136662F099452MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSMNVFNLYKDIINLDDVSLLAELRHTEWYKDWFTSDKRNSDLIDLSKFNFRVLERFEKEEYLRDAEHYDFEGVSEVDSYDLFDTLREDEDIELFKLAAENILINHGFFNNTDYNLYEIPDEYMSNQEVCLYMCLLNTDNLDFMDKKTFDSTLLYNIVKDRICGSVYFTIFDSLNEDTRTRAR*
Ga0111083_116908Ga0111083_1169081F046433MIELPTSPNALSELSPVAPPKLLSQAQDASRDNLMVYVKADNYLGTETSDPSFMKSRYKTTEYEAINDFVQFIKMTECYLPDYMENCAKELIDELAFLGVPELNFAANALAKRLRHHLEVGNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIVSGDQVRERISVFEVDNDPEDHEASVLVMAASGDYLDNGISAYSQYGGTIYPVEAYYRLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEGIDELSLPALANIVRPYRNGKNFDGLSRFRQLLEKE*
Ga0111083_116995Ga0111083_1169952F033081MHTDITVVYRPKKGVMAWLFRRAMPQDTRPTFVWSRLVTEIENAGYFSRRKFSILAVGLIIMTIAMVKMLLFVPGLNQSVVSLLTRGLETFLPTRWATATAWTVGMAGVFLMGDLTNYTPSQKILHKIKATRYEVYNIILFLALLEEQAFRSGSEKWNWRERVRASVCFGLLHIMNIWYSFAAGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAVVLAIDIAKLL*
Ga0111083_117443Ga0111083_1174431F095633MRNENFTEVGRREGLTESELRTMGALAVEATEKLRKTIVSKEAVLLGSVPFGSWDEFAKAVQEMAAHKMTAYSYEPIPVKINTKRLIAIAFLDDRGEMSVEENSVLEDAFIDLSRTRCVVDADRSHKSYKFTCPVLERYPDGELYPIRGVYAISVIDVNGSQEVDFNIIYGGLN*
Ga0111083_121712Ga0111083_1217122F105378KKWIQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSTHQFVTDEEKSKWNNKLNAPVPMQDHLENNQIGYDSTNSKFYIGLNNQNVLLGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTADAISIYNTGSFTGSFQCLIVYPLGSVNE*
Ga0111083_125368Ga0111083_1253681F081455VLFKAKEKHIMETTNIQEINMKAAEKLGELFDYVFCGKKPNTEEKDIPPVEEIGAEKVDIIDVAEEAIQQPLQNKDASIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSEFNPECDPNKRLRYELIRHQGREKDLVIRLSTVINGTTKYYADIYPDLNKIDLDHHLISSAKK*
Ga0111083_126924Ga0111083_1269241F054110VLDVNYQPTIKKLLKALQMNGRRYVVDVRQSWSKFDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRT
Ga0111083_126932Ga0111083_1269321F054110VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKFDKPCKIYIVNRMYTEEEYKLTFPNKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRT
Ga0111083_135441Ga0111083_1354411F089057NRKGDTSGSLKSLISDLNCVTDNDDVLLFLSSIPRETKYSLDDAFDIIVSDDTYSNIFRTSIVFLNIDLDYHRLLLNAIKSESYTIICMINKAIPTPDLFLAKNNYECLTIALDKSYAIFDKVLGMVVGQIKHTASSKEGRALGIFMTICILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAMDAFQYMSEEDIHTVVDDINSRTVLSRYLNKM*
Ga0111083_136030Ga0111083_1360301F089057MTNIIPLIAKKYNRKGDTSGTLKSLVDDLVFIEDVDDSLLFITNIPRETKYSIEEVFNIISSNDKYSEVLSNVLSSLNIDLDYHKLLLNAIDSESYKIISLISDNIPTPDLFLSKNNYGCLTTALGKSYTIFDKVLGMVISQLLHTSSKEDKILSLFMTICIVNKDIDKLASLCTGYLAITKDEVLVKDLMNESATMAFQYMSEEDIHDAVDDIN
Ga0111083_138110Ga0111083_1381101F032313DNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVIVLHRQGINVNHVDTVNYVYDEVGNEIVLEGTGIRWFVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK*




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice