NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300029741

3300029741: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_37296R1



Overview

Basic Information
IMG/M Taxon OID: 3300029741
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0133139 | Gp0283836 | Ga0245200
Sample Name: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_37296R1
Sequencing Status: Permanent Draft
Sequencing Center: Beijing Genomics Institute (BGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 194756924
Sequencing Scaffolds: 22
Novel Protein Genes: 27
Associated Families: 27

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 5
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 10
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1
Not Available | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 2

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal distal gut

Location Information
Location: United Kingdom: London
Coordinates: Lat. (°) 51.5 | Long. (°) -0.12 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F013656 | Metagenome | 269 | Y
F029444 | Metagenome | 188 | Y
F032286 | Metagenome / Metatranscriptome | 180 | Y
F039147 | Metagenome | 164 | N
F042910 | Metagenome | 157 | N
F043945 | Metagenome | 155 | N
F045105 | Metagenome | 153 | N
F047126 | Metagenome | 150 | N
F050793 | Metagenome | 145 | N
F051934 | Metagenome | 143 | N
F051935 | Metagenome | 143 | N
F055715 | Metagenome | 138 | N
F055739 | Metagenome | 138 | N
F055775 | Metagenome | 138 | N
F056623 | Metagenome | 137 | N
F058154 | Metagenome | 135 | N
F059982 | Metagenome | 133 | N
F068811 | Metagenome | 124 | N
F070133 | Metagenome | 123 | N
F073574 | Metagenome | 120 | N
F075480 | Metagenome | 119 | N
F078693 | Metagenome | 116 | N
F087213 | Metagenome | 110 | N
F087334 | Metagenome | 110 | N
F090513 | Metagenome | 108 | N
F096287 | Metagenome | 105 | N
F101191 | Metagenome | 102 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0245200_100019 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 312040
Ga0245200_100032 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 264418
Ga0245200_100039 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 254510
Ga0245200_100055 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 206993
Ga0245200_100212 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 108983
Ga0245200_100266 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 96340
Ga0245200_100294 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 88433
Ga0245200_100378 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 72695
Ga0245200_100387 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 70945
Ga0245200_100469 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 60501
Ga0245200_100526 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 55536
Ga0245200_100549 | All Organisms → cellular organisms → Bacteria | 54013
Ga0245200_100651 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 46692
Ga0245200_100682 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 44567
Ga0245200_101135 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 26526
Ga0245200_102256 | Not Available | 12864
Ga0245200_102412 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis | 11913
Ga0245200_102603 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis | 10994
Ga0245200_102911 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 9796
Ga0245200_103070 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 9211
Ga0245200_128844 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 826
Ga0245200_131582 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 745

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0245200_100019Ga0245200_10001986F013656MKKIYKEPNKSETETTINVLYSENILSICTNKVDLQKKLNKLLGEPEKEYKIKRSIAGSTWNISLDDKTKIQKVILKANIYDM
Ga0245200_100032Ga0245200_100032124F096287MQRIQKRRAGQRQTQNALIGLLSAVSMTRIGLTQLLPLCGSAAWWLSAACMLPGLCVYGALRLLLRHTHTRTLTDCARKLLGNFGGILIMLALTLPLLLDGIASLTALITFFTEGIGARGSQFTLTLLTASVMLIALNHDGLPRGVYLLRHVLLVAAGIIAINALLDAHPDGIVPLLGEGVPSLLSGIRSAWGMSWVLLLLLEFPAEEGTRRTPAMLAGLLPSPVILLLLSLAIPPELTVPGRSLASRLALPTLFLQPAVRTLAQCLLMMTLFLSIAGSAQLAARFLTSSCQKPKKWVPYVLIGLLTLTQLFDISRLWRVLTAFTAWSLVPGVLLLLVLTIARLCRREKA
Ga0245200_100039Ga0245200_100039221F051934MGLMRRLAAAALAAVLVLSTALGERVTFRLSADIDPVQYPAQERKLAQGLKSLFRLLTVEGDVVASDGSFDARIDLGLTNAPEKTATRIRFFGLDSHWGIQSTTLGGETLMVNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGLRQAIADLTAQENDGRLENTALIACAEEIARLSEEDRALYYYIEAFGLESGADADIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEETVFSYAEADGTQVVSLHLPDLVEFSATLRRDALLFTGALSLQSDVLNADVSFSLPASYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTITIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLAALPAELVQGAMDAMEDSGLLGTLTDAILNGEAGEY
Ga0245200_100039Ga0245200_10003985F055739MTEQQLRKKLQTAYGNMPDATRAAFEHSLTHHRAPETHRNIGLSRTMRIVITAVLMALMLTAVGVAAARFFSVTDVHPAQDGTEGDYQAHYLALEERYDSDLLSVSVNDAVYDGSVLAFTMEMAAKTDDVLAVEVRICGECDGKMYRFDPLDVYGGEFQSLLMLPDLGGTFDGEKYAAEGILLDENGQMPPEGKPIAWTIEIDVLKAVWQTETMPDDLYEALSEEDDVAQYIREQAEQHIITLTDAGVEDYLLEMCGAGWDEIEQMSKADLLLRCGGFERAETYTVAFATEGNTQYVHPELAGLRIPLDGYTAVVDYVRASFLGGCVVLHCEAPNGTALPNGTALPDVWRVYRNDEQNPAGEAGWARASGYAGVPGGVVDVNQPSICLYFAPCADLTSLRIVPEGEAGFTLNLSGEKGTE
Ga0245200_100055Ga0245200_100055154F051935MKRLLGLLMAVMVMMGGISCAVAENANPLVSDWLTELKSKRLIQIVEDEIGEGWTLYQPNGREEEENFSSAENLKEMRFLPVVAQKDNQLRLLILRRQGDLWKVSEQNDRALMRDGWMLQNFSAMPYGNSDWTYIYFDFVDENQKRWNLMLNLGDGYVSSFGAISYYVEGYGTTYINMSYDRGLEFQIDVPVYSRFSYEVYPVEEHSFGVEDFDLATCPLSMQEFLVPAVVTCGEEGAGLYIMVQQDVQPIVMLADGEAIEAIPQRWQRDWVIVCYRGNYLFMKTENCKIEE
Ga0245200_100055Ga0245200_100055157F070133MKRLLGLLLAVMVVMGGMAGAQASTDNASMQLIRMNPLAFRKEPVELYSVTHTPNGSFVVIYFAEGEKAELQEIWMELFDSVGTSLLSAKLGEFDPKGEQIPHGQIILKKDRFICEYYPDITSMEVCTQTVYRYTGKRIQKPTIKKLKFGAAPYAQHVGDYMVEKQAHSEDESPFRTVKITHIASGKSKKLMIYDWSFCACSDQDGNLLIAQQNEKGNLEIRSYNAAMQESIVELSGDFLQNSYISDAACIGQTVYFDIYLTNQQSEILFYDITQQKITDSQTLLAVDDNSYIAEIKAAGAVLLSVDGYWNRELQRQKYQINLLNEHFETSRLPLQHESCLYIFTDVEQADITTIEMDEKSHSYFVCSYSISAGE
Ga0245200_100055Ga0245200_100055169F039147MRARRLLIPLMMLLLLLPQAQAERLTLYTRPGQVDEATPFQLRPTELSICSVTRAMGGVVVLANDYNYDSLSLYFWQDGMTEMRKLGGGFYWVMSSDTMETAQESCEYSMSRVPNYRMPDLTHAISNLTSDGETLYALNRINGLIFKISETKDGLQTEDVCTMANLSCLNISYRDLETDKVYTYPASLTRMHVCGSVLAISVMQENGIKVVLVDLTDGAIREIAGESLEAMYEWADGELLLWRLEGSTNEISRSSGTYTLSRYSVATGEETLLSTGVPYKKRSECGAYDPYSGSYYDVRTRQIVRTTDFVQEDPVVTFPAANVNIAVTKDSIVGVNLSSVYVRSKENGDMTVLRIQSSNGASNTALQHFAEENPEVILAQETLTKSAMNAASLAARMSASADAPDILRLGLTPDTPEADGSWPLDVLMDKGWCMDLSVYPEVSDYVSRLNGIYRDAVTRDGKIYALPIYAWSYGYFISRNVMEKLGLQESDIPTNLIDLCAFITKWNDNLTGAYAAYTPLEETESYRERVFDLMVRDWIGYCQAENIPLRFDHPVFREMMAALDAMRTDKIEQANQQVNEEISDYRECLIWTDAQAVGNFANYADAFGSRIFLPMALTPDVTTHYGIGYMTVLVVNPRTMNVDLVGKLLAQVIADQEATAKCVLVADYDEPIEDSYYLTMVNDYEKTLTELRRKQENAPVWKKQGIQERINEEEASLQRYTVRERWTIAPKTIELYQQTILPMSYLRRPGILADSDAFNALVSQVHQREISLEEFVEKADKLIEGLEQ
Ga0245200_100212Ga0245200_10021239F045105MSDKLKRVLDTNLAGLHVTDAQMNAVLRRVSQDEQRPRVIRWQAIAAVLLVLVLGAGVLLQTGILSDDSPQLTADVTRQGDFLTAAQVKKLLASAADLQFSVDEEAISALMTQVEKDGGCSLSTLLEILCPVGSSLGQWDIRAQVALSQTLERIGYQGILPGMTREPLSTEITAQDALARAVAYIRTNDDPQADFTNLDFYRVGIRFLSGVYDGTCEGAYYCVNFDALDAFGTTYEVAVNAEDGSICRMRRERGAGSNHTADEVTRGFRRIFGYDMRTWTPMQLRVYILALSRADHSSMQTVHELFLSVGRDGFPDVPAEALTREEAIDAALAIQGGSSSDVLAAEYLAGASGDFWKIAIRQPNAPSGQSIVYLELNGLTGTLLTTTYSGNLYGLPQEFFPSQLLSTLDRTDFSHGVPAVDAETAQSAASAAIERQYQRNMAEEGYTCFIESDFEDTAGGFSYYTGGGVVIFTKDAQNTSQGDIYWAALNWYGEVLDVGWNRNPLDAARFTLLMQGYLPPIYQRETVQQLQALLTGSQTDEAGLRQMETDGTLSLFNALLALEVVPDASTKSTADDVTAAALKSLNAHLCYEDSSFLVYREDGELIWHFWLSTDMGYFLMDVRDSDLSVVDSVQIPSFSALRASILLPVRVWNTLAENLRVTIFYRDANTQPGIVYGMYANRIVQRYVDLYGANILRWDQATLRSFQSAMSISGSYVGDWSVACLRQTIYPDVPDYAISQAVAAEYAARALGDDDYSLRGGVLIDPGEGDPIWKVTLDYPDGRSFNAEVDCRTGAIRTLRQQDTRAMPFYTDYLDFPDDGEYWFRNFVLDEVIEQVRTQMTGRYGNNI
Ga0245200_100212Ga0245200_1002129F056623MKSTEEMLQAAVQQLQTAEQARLTSETDPARRQLLHLLYAREAAALQSITLVRAEIPATQKRDWKPLIFPLAAAVLNLGALALWLTLPNCVTPENSGWMGRFALCIVAQSVSLCAAIGSIVSAMRRKPPKPVVVADETEFRRRLVEAEGRIALDAQTIEALFAQESVTIISTGAETAAELYASLYEMAQDARLDGNASQEKALSWPLSNAKRLLNAVGCEAVDYTPETAMFYDVMDADITQQRRPAIVQKADGIVQQRGLYLRKG
Ga0245200_100266Ga0245200_10026693F032286VTPVRHRALIFDVRLFTGNTDDDALVTGGTFRFCRLMCLCVKRRNIMLNDKRRSLLNSALFRADNRTGQKTFSPFSLALIFTFVFAALSERRSCPEDRSRRFVPVGTLTLSRSWRLLRWGTYPVRTVMQFSRFRCDYKTILAKNIALSRISKRSENAPKNADFHAYGQDRKGQEFEGRIPLCSQNSPFRNGSKTCVWSAV
Ga0245200_100294Ga0245200_10029449F087213MKKFFSLFLAVLLIFSCWTAATAESTDAVSGAPLDNDIEIVYKEHFDDMVTRYADELTEAELLVSADVYDAIGMIRFDSEDSLAARQRIYGIASADDLHAILTGYFDPNDAAYTADAALDARLQAILAACGLNPEDYDISVIRNLSGMPEPITGTNWYCTLIRKGVEVAEDETNPYDMVIVLYGDEMTLGAFVLNPEV
Ga0245200_100294Ga0245200_10029461F075480MNKMKQEKWQRAYGDTPDSFRQRVAASLPKGEESRHVAFPRRAMVLAAALVLVLTTAYAAVVTHTELVWNAGHPIENEADDRLGLLTGEAGTSGDSLTIGGVTFTVQDGIYSPETGQLFASAVISADESVQLVAVEADMEHEVRLTTPVDAKLDPSGISWAEWAEQNGKTLVPIGMEAAPTLQFLKVNGQTTDTPLIGAFLTQNPDGTVSAGFQVDLSEADVSHLKSCEVQMECRVGVFGKDGKATQWQKEILAATITFK
Ga0245200_100378Ga0245200_10037849F059982MRRMTRLLCLMLLFSLITFSCPLAEGTDAPTGTSAPTPMLTAVPESALAPFNVVLPEDAHVEMAEGRITLVRGDSRVVAMVISRVPDDDPAAALPILMQDFDPKTSETMDFDAQPGFCILGGVVNDAFDDGEDKITLMVLADSGELLILSGYNLARDHHALYLFLTELLENVSMDGAAVYVAEDAAATASPEV
Ga0245200_100387Ga0245200_10038757F055715LTKANEFDKINKLLIERTAKKFERASKNKLKKFLTNEKLCDKINELIRVGTAKILDN
Ga0245200_100469Ga0245200_10046935F050793MRETGAIAPNDAEREKTMRKYEVDVPENVEMADIHAIQTRVKKHNLLTVALLIPLFAGCALRESSETLGGIVFVLSIALMVLSFISGRDDEKRLLAYWAKYQQIHRMDNLLRANLVGVWLCAEARVAVYREGSGYRVQLGVLDEKTGAWENDEQDEVFPTLADMREYATQEGYEPMDVDFEQMTDEEFQQFLDAHRI
Ga0245200_100526Ga0245200_10052629F073574MLLPAAVRPYAADVDDTASRVRCAIALNGSKELHIQLCGRLKRGLFGRNQLLADGDVLCVALHQPDGDVPLFDSRCDGYANVLDDRQPPAPIPLHPAICPKCRNAAFQLRLCFEYPEAEELAAFANPDNMFTWVWLSLRCTRCHAVFRGNLECD
Ga0245200_100549Ga0245200_10054939F058154MKNRTFQRLFPLLRMGSILLAAVLFCAALIGVNAQLTVGTNEERNLATYLGLALTAVVTYLLQDVPKMLVGRHFGWKLVRYEAFGRVYQADGTGAFVRIQAPAQSRLRYQPYLLPPEDASRHDVTLYTLCPLLYGGALAVVFGVLTLLTLGQPVSLVLCELAMGGVVLVMTCILPMDERFIASPMAIARMLRDPEQLAEWLYFARMKANGGVEVTDVSIENIPQPATVKTCQDAICVFQRAQIALMVQCDAQKAYGLMKQVLESEARLSYTAWKLLLSDGVVAELLAGEPGEFTTLYRGKAGAAIRQVMFRMVDYALPAYAVAMLVTHEAREAEVVRNTLAPVREKMPELDALMKAIEEKAARIES
Ga0245200_100651Ga0245200_10065133F043945MKHNLRMAFLSLLLVLTLGVMPVFAESTDHAMDWTHGIGSRYRTDVTSALPFEDELFFISGSQLLRVDDPEYEPEVVCNLEDIIWSEALKQSGYYGQVMLLDGGAELVLFSANEGRLYSFVPPTDDTMVRMQMVTELDYSTVPQMGWKKFPIAKDLYDVTVEQAVYDAENGLYVIAKDKTDEYQIVRFDVDTGKGEMMEFALEDEEDSHLMPDMPGFFRWSAESSKYERVLYDGKVLTMVNLPNGEATSVVHNLVNTLTLQPIQDYDVTLGDYVTHDAYYAYAPNKDNTAMYFVHDNTVWVMEYDEENSQFSEPRGIDKLPFFAEDTMCGFYWRGFYLIQTHDGLYLCHTGDATARAYLYGGVE
Ga0245200_100682Ga0245200_1006821F042910LADRLYCALNGTTLHDLDARIHLLDVEELAPTVRTVTANRIGGGLHLLRRQREQLSLRVRFLIEEYDIAARHQLLHLVAAWAEAGGVLTLHEDGKRVLRVVCTQYPTMSTLNWLETLSLVFTAFSCPYWEDAAETSFLMPNTSDAPSKLLAVPGDAPETPLNLLIRNIGDTAITTLTISAAGKISFQGLTLAPGAAIRIHHDAGVFAAEMVSDDSTVSILPYRTPDSADDLLLRPGVLNEIRVEASAAAFVSGRCKGRYC
Ga0245200_101135Ga0245200_1011357F047126VYLKTKKMTNILQTQAGFKAKSLGILCGFQGFLTQNPAALVETDDIFDVSDTPHGFSGLNTLRAAGVPNPPPPFAQRFIARFCSQTAAASQSKSAYILSSLESPCILCSLLRHFHILSKKLQKTC
Ga0245200_102256Ga0245200_1022565F055775MAQIAQQDNLVIEVTTTAAALDGATKKKLIECIEGGTITDVILVTKEVEKKISHARVVSWLVDTIGDSPKYTIHIINANSGTVTAIALN
Ga0245200_102412Ga0245200_1024123F087334MASRYPFVGAAAHLFPKNAIKMLSFSTSGKGSILLYPFRSSPLLAITFLYYPKDFFLYRAAALFEYREKHP
Ga0245200_102603Ga0245200_1026031F090513MAKYVRKMEWKLLHIKHILYMQPFGALKIAPQRVGNKNGTKAVLQGWAAAFVPYMLSFTYGNTA
Ga0245200_102911Ga0245200_1029111F029444MEVSEQLTGFELEDLMSWTVSNLQRPFRENFSLEISGIIAEKESQIFGRRFVGFDRPKKAAPFFNF
Ga0245200_103070Ga0245200_10307012F101191LLISHHPASAECLQLGEGMLLCGFDIDKALSSRDPLDCMAEAVADDTKRIGTTCGGGIFRAVPREFDPESGSHRLPFAGSIRLIDWRVTLSGTMLDVTPENLARLLPSDTEMTERVTTLTPKQARKPLSRLCWIGTTSRGLLVIELRNPLCISGASLTSVPDGAGRLPFTFLAQNDRPGDVNLPARLYWWKEETHDAA
Ga0245200_128844Ga0245200_1288442F068811MDQDGSEHNIGSNREGLCPGKEPHGASGWKKIFQHGKEPLRNKDSVSQYCNKKAAVSLILNENVSETLCIFSIDKTNCCRI
Ga0245200_131582Ga0245200_1315823F078693AAVRGAERFFIKANCSLLMSKENQKTTSDFDALDPRERGCSPLSDPEGEVETEKS
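
Each row of the Sequences table runs its four fields together without separators. The following is a minimal Python sketch for splitting such a row back into its fields. It relies on patterns inferred only from the rows on this page, not from any NMPFamsDB specification: the scaffold ID looks like `Ga<digits>_<digits>`, the protein ID repeats the scaffold ID followed by further digits, the family ID is `F` plus six digits, and the sequence is a run of uppercase letters. The names `SequenceRow`, `ROW_PATTERN`, and `split_sequence_row` are hypothetical helpers introduced for illustration.

```python
import re
from typing import NamedTuple, Optional


class SequenceRow(NamedTuple):
    scaffold_id: str
    protein_id: str
    family: str
    sequence: str


# Assumed layout, inferred from the rows on this page only:
#   scaffold ID : "Ga" + digits + "_" + digits
#   protein ID  : the same scaffold ID followed by one or more digits
#   family ID   : "F" + exactly six digits
#   sequence    : uppercase letters up to the end of the row
ROW_PATTERN = re.compile(
    r"^(?P<scaffold>Ga\d+_\d+)"
    r"(?P<protein>(?P=scaffold)\d+)"
    r"(?P<family>F\d{6})"
    r"(?P<sequence>[A-Z]+)$"
)


def split_sequence_row(row: str) -> Optional[SequenceRow]:
    """Split one fused table row into its fields, or return None if it does not fit."""
    match = ROW_PATTERN.match(row.strip())
    if match is None:
        return None
    return SequenceRow(
        match["scaffold"], match["protein"], match["family"], match["sequence"]
    )


if __name__ == "__main__":
    # Last row of the Sequences table, copied as it appears on the page.
    example = (
        "Ga0245200_131582Ga0245200_1315823F078693"
        "AAVRGAERFFIKANCSLLMSKENQKTTSDFDALDPRERGCSPLSDPEGEVETEKS"
    )
    parsed = split_sequence_row(example)
    print(parsed.scaffold_id)  # Ga0245200_131582
    print(parsed.protein_id)   # Ga0245200_1315823
    print(parsed.family)       # F078693
```

The backreference to the scaffold group is what makes the split unambiguous: the protein ID must begin with exactly the scaffold ID that precedes it, so the remaining digits before the `F` can be read as the rest of the protein ID. Rows that do not fit the assumed layout return `None` rather than a guessed split.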




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice