NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300029706

3300029706: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_35564



Overview

Basic Information
IMG/M Taxon OID3300029706 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0133139 | Gp0283906 | Ga0245271
Sample NameHuman fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_35564
Sequencing StatusPermanent Draft
Sequencing CenterBeijing Genomics Institute (BGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size200643597
Sequencing Scaffolds20
Novel Protein Genes21
Associated Families21

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales13
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → environmental samples → Faecalibacterium sp. CAG:741
Not Available1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal distal gut

Location Information
LocationUnited Kingdom: London
CoordinatesLat. (o)51.5Long. (o)-0.12Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F039147Metagenome164N
F044554Metagenome154N
F045105Metagenome153N
F051934Metagenome143N
F052660Metagenome142N
F057385Metagenome136N
F058154Metagenome135N
F068811Metagenome124N
F068856Metagenome124N
F070133Metagenome123N
F072366Metagenome121N
F073573Metagenome120N
F073574Metagenome120N
F074898Metagenome119N
F088920Metagenome109Y
F090514Metagenome108N
F091068Metagenome108N
F092228Metagenome107N
F099451Metagenome103N
F101192Metagenome102N
F101193Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0245271_100008All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii449222Open in IMG/M
Ga0245271_100545All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii47536Open in IMG/M
Ga0245271_100913All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales31739Open in IMG/M
Ga0245271_101578All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae19305Open in IMG/M
Ga0245271_105387All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales5460Open in IMG/M
Ga0245271_106374All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales4521Open in IMG/M
Ga0245271_107234All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales3972Open in IMG/M
Ga0245271_107900All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium3581Open in IMG/M
Ga0245271_110200All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2680Open in IMG/M
Ga0245271_110673All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2546Open in IMG/M
Ga0245271_110795All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → environmental samples → Faecalibacterium sp. CAG:742511Open in IMG/M
Ga0245271_112140All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2208Open in IMG/M
Ga0245271_112727All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2101Open in IMG/M
Ga0245271_114218All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1871Open in IMG/M
Ga0245271_116451All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1608Open in IMG/M
Ga0245271_123307All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii1138Open in IMG/M
Ga0245271_127731Not Available954Open in IMG/M
Ga0245271_139467All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales672Open in IMG/M
Ga0245271_145274All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales595Open in IMG/M
Ga0245271_150372All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales542Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0245271_100008Ga0245271_100008140F092228MKKKCTLVLISVLTVACIVSAYLLFFYNPSFNMVYDSDTDLYFNNSYLSYNDGTLAAADYRKTKVTAYDSKNNSTVNLPSNGCLINDNLFYINGDKLCCLDTTTNTRKIIDTDCRSFVCNNEVIAYTKNDSVILKDSDTLENIGDIKFDNQIYYINISDGNLYIAERIFEDKTDEYGYSFKVGKQYIFKKYDLKSCKLLKSKNANYVNGIRYVTVCQDTFYFFCDETQTVNNVCLDKDVNYPTIQHPDVKFITSNNDCVYYISEKTESAIILKTVESPYNGIWKLEVGSNKPVKIADKCDCDELLATKNFLYCYTINYILPRGVANSWVKGYLIDQLAIS
Ga0245271_100545Ga0245271_1005454F090514MNLRIHALKKASRQRPGVKIAAARFFSILYYPFCAKRSSFLQQNVQPRGNFFANRPIFVYFADIPPCLTGAKWFVILLTIVPAGSRTRAGSSLPLSPASNIF
Ga0245271_100913Ga0245271_10091322F052660MCILRVLPENTPEKIGQERAGTEWTVVKNKIRLCIRNRSYGQFLHSGILMGIVLPIPSHRAKSHDFACWWPVAAGHSRSADALPGKSNS
Ga0245271_101578Ga0245271_1015787F044554VGLFHGGLPPPCIFFHTQAYVLAGRFIPVLCASIARLFPCRTEIARCLTLDFAISRYLFLSFSFSFRTNFAQALFSSLLFASDTRAKSILFLLFENEIAHLQGQYRFNSHRYCFSAFLVL
Ga0245271_105387Ga0245271_1053873F058154MKNRTFQRLFPLLRMGSILLAAVLFCAALIGVNAQLTVGTNEERNLATYLGLALTAVVTYLLQDVPKMLVGRHFGWKLVRYEAFGRVYQADGTGAIVRVPVPAQSRLRYQPYLLPPEDASRHDVTLYMLCPLLYGGTLAVVFGVLTLLTLGQPVSLVLCELAMGGVVLVMTCILPMDERFIASPMAIARMLRDPEQLAEWLYFARMKANGGVEVTDVSIENIPQPATVKTCQDAICVFQRAQIALMVQCDAQKAYGLMKQVLESEARLSYTAWKLLLSDGVVAELLAGEPGEFTALYRGKAGAAIRQVMFRMVDYALPAYAVAMLVTHEAREAEVVRNTLAPVREKMPELDALMKAIEEKAARIES
Ga0245271_105688Ga0245271_1056883F045105MSDKLKRVLDTNLAGLHVTDAQVNAVLRRVSQDEQRPRVIRWQAIAAVLLVLVLGAGVLLQTGILSDDSPQLTADVTRQGDFLTAAQVKKLLASAADLQFSVDEEAISALMTQVEKDGGCSLSTLLEILCPVGSSLGQWDIRAQVALSQTLERIGYQGILPGMTREPLSTEITAQDALARAVAYIRTNDDPQADFTNLDFYRVGIRFLSGVYDGTCEGAYYCVNFDALDAFGTTYEVAVNAENGSICRMRRERGAGSDHTADEVTRGFRRIFGYDMRTWTPMQLRVYILALSRADRSSMQTVHELFLSVGRDGFPDVPAEALTREEAIDAALAIQGGSSSEVLAAEYLAGASGDFWKIAIRQPNAPSGQSIVYLELNGLTGALLTTTYSGNLYGLPQEFFPSQLLSTLDRTDFSHGVPAVDAETAQSAASAAIERQYQRNMAEEGYTCFIESDFEDTAGGFSYYTGGGVVIFTKDAQNTSQGDIYWAALNWYGEVLDVGWNRNPLDSARFTLLMQGYLPPIYQRETVQQLQALLTGSQTDEAGLRQMETDGTLPLFNALLALEVVPDVSTKSTADDVTAAALKSLNAHLCYENSSFLVYREDGELIWHFWLSTDMGYFLMDVRDSDLSVVGSVQIPSFSALRTSILLPVRVWNTLAENLRVTIFYRDANTQPGIVYGMYANHIVQRYVDLYGANILRWDQATLRSFQSAISISGSYVGDWSVACLCQTIYPDVPDYAISQEVAAEYAARALGDDAYSLRGGVLIDPGEGDPIWKVTLDYPDGRSFNAEVDCRTGAIRTLRQQDTRVMPFYTDYLDFPDDGEYWFRNFVLDEVIEQVRTQMTGRYGNNV
Ga0245271_106374Ga0245271_1063742F070133MKRLLGLLLAMMVMMGGMAGAQASTDNVSMQLIRMNPLAFRKEPVELYSVTHTPNGSFVVIYFAEGEKTELQEMWMELFDSVGTSLLSAKLGEFDPNGEQIPHGQIILKKDRFICEYYPDITSMEVCTQTVYRYTGKRIQKPTTKKLKFGAAPYAQHVGDYMVEKQAHSEDETPFRTVKITHIASGKSKKLRIYDWSFCAFPDQDGNLLIAQQNEKGNLEIRSYNAAMQESIVELSGDFLQNENVRDAACIGQTAYMRIRLTNEKSEILLYDITQQKITDSQTLLAVDDNSYIAEIKAAGAVLLSVDGYWNRELQRQKYQINLLNEHFETSRLTLQHESCLYIFTDVEQADITTIEMDEKSHSYFVCSYSISAGE
Ga0245271_107234Ga0245271_1072344F091068MNEKNQQVSDEQIDALIRSGLWQDEQPLTADEEKLADAAFARAMAKIDRKAKRQKRRTVLHMLDRVVRVAACLIVAVGIAFPIALANSEAFREQILQLVLSINPETGMAHVGMEPAKEEQTANVPRMDVPDGWEGLFFPTFLPDSLPLVRCETTRGDQVHSEAYYADETRSLRFEEQDNLDGWNVRAADARVLTIYLHSVPGYLIDRQTEDTHEVSIIWSEGKRMLRVTSIGLPADEAVLVAQSVKKIFAE
Ga0245271_107900Ga0245271_1079004F101193MARYNWTRCPHCGKLCKPSRFKAATVLVPIEFVIFVVYIFFRNSMNDAIGWFAAWLLFVLLLFLPHYIYVRFFMPYETLSEDETRKFRDLQEH
Ga0245271_110200Ga0245271_1102002F068856MKRLTSILLALLMLVGMALAEETPDAALGDWYALNTENEAICLTLREDGTFCYDSREGTWRKTTDDEYWLTYNLPEVMERMVNSQAAEQDLTALLTETGLDVYYGSTAKGVVAHMVRDAEELQNVRTPKTDTPLEAFAGTWTMETVFAGAMEMTYTLDKGERLEFCTIDGLTMLPGAALGNFTEGTSYPMTLEDGKLHTTILMQMTEEETLDFDMTFFQTADGSLYATLQLSDVPDNPTTMFLLVPMEKE
Ga0245271_110673Ga0245271_1106732F039147MRARRLLILLMMLLLLPQAQAERLTLYTRPNNVDEAMPFQLRPTELSICSVTRAMGGVVVLANDDNYDGLSLYFWQDGMTEMRKLGGGFYWVMSSDTMETAQESCEYSMSRVPNYRMPDLTXXXXXSELTSDGETLYALNRINGLIFKISETKDGLQTEDVCTMANLSCLNVSYRDLETDKVYTYPASLTRMYVCGSVLAISVMQANGIKVVLVDLTDGAIREIADESLEAMYEWADGELLLWRLEGSPNEISRSSGTYALSRYSVATGEETLLSTGVPYKKRSECGAYDPYSGSYYDVRTRQIVRTTDFVQEDPVVTFPAANVNIAVTKDSIVGVNLSSVYVRSKENGDMTVLRIQSSNGASNTALQHFAEENPEVILAQETLAKSAMNAASLAARMSASADAPDILRLGLTPDTPEADGSWPLDVLMDKGWCMDLSVYPEVSDYVSRLNGIYRDAVTRNGKIYALPIYAWSYGYFISRNVMEKLGLQESDIPTNLIDLCAFITKWNDNLTGAYAAYTPLEETESYRERVFDLMVRDWIGYCQAENIPLRFDHPVFREMMAALDAMRTDKIEQANQQVNEEISDYRECLIWTDAQAVGNFANYADAFGSRIFLPMALTPDVTTHYGIGYMTVLVVNPRTMNAELVGKMLAQVIADQEATAKCVLLADYDEPIEDSYYLIMVNDYEKTLTELRRQQENAPVWKKQGIQERINEEEASLQRYTVRERWTIAPKTIELYQQTILPMSYLRRPGILADSDAFNALVSQVHQGEISLEEFVEE
Ga0245271_110795Ga0245271_1107952F057385MKKLLAVLLSIMMLAMPLTSMAENSVWDNAARQETTITIHDLNADLVAALGGDDTAMVAINDLLAALSLTGYQQGDEAGFDLNLSGKSVLGMASLTTAAEENQLMYVSSALLGGVIAVNSKDVEAIKEKALRATMKMSGQSDEEIDKAIEESKEQLSGNAEYTALMEASANLGSMTEEQLMEELTQADTTAFMTMMNEILSGAEMAEVTEQPGDCDAAKNYVKVTVPPEKXXXXAEMTKALLEMIHSVPSIGAYMDALFSAADTSWDDLLKELDEADLYADDIVYEYWMTEAGELVRMTASVKINNGGEEPLPISFTMTRNTADGVATWLVTIKSAEDTAATLTFAGDLENFTANLTAYAGEDTVEINVSGKGVGTDSSVVDVEIKETVDGVEQGFGVVVTTATTMDGEQGVRKVDVLVRFMGLDVVTITAETRTCDAKDALDVSKAQDLGAMTDSEFQTWFVKVXXXXXXPMTLLMSLPESMLTLLMGGSN
Ga0245271_112140Ga0245271_1121401F101192MQTEPSDKEGRNMTYRGWLLVDIALLLTALSGTQTSLCQRMRSVPVYRGLTYWEVQQLARTAPTSITPCGEDTIRRWVSGRYQIALRFNRYDVCLG
Ga0245271_112727Ga0245271_1127273F072366MLDILAIKADVYQLERQGKRLPVYRYLQEVWQKEPPSEGLTVLALQQMVDYVEYVDDLTVLGEPWEAENEYDLYQDFLLDVISWGLQKYRAKKRFLWQICYYVNAWATFYYIFGREITQDNVEQWKKTLFKEAKERYPDSLLFEFIPHVAQLDYVWFYRLTDEQRLRIRLEVGEWNLQKNNMDQAVQSYFDDAMTWYRDNGRKLLEAKKQDE
Ga0245271_114218Ga0245271_1142181F074898MRKILSLLLILALFLPCALAETPQGIDLALTSTYGDGLSLRMTAGLGETPFCSLTLPSGQIDLAFSPDKGLCVQSGGQWAQLLVEGHPVDAALLTTPLKLLDGRSPSEALGLLAEDVNKMLDAIPSLYNLPMTLIRDPAFARLFSDLYIAAQTGTISITSEELNRMAASVLGKIYSMEYLDGLNFSDIYHHLLASVVQSSEVARAYRSAMRGYLLSNFRLSGQIGIDKGELLFSQPYQEQYHLTYEQTTPNSWHYLLTTANSDTIDGVLYWRNEYDFALSGYSENQHTAFSVTCSYGSANNFVFIASVDNSFSLSIVQAGSNFSLRLNNENANVFALNANVFSLESTSDYIRVKIYYNYYTLTITPATDGLDIVAQARTFDIFTAHVRTALNGLICTGVYGDTTYRLALTETATGFSLLLSLPDGDFHLDFAALSDTSCSLTFTDAAGQQVSLTGSLCTPESPVIPDAIGTIELTQLLDGLF
Ga0245271_116451Ga0245271_1164512F051934MRRLAAAALAAVLVLSTALGERVTFRLSADIDPVQYPAQERKLAQGLKSLFRLLTVEGDVLASDGSFDARIDLGLTNAPEKTATRIRFFGLDSHWGIQATDLGGETLMVNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGVRQAIADLAAQENDGRLENTALIACAEEIARLSEDDRALYYYIEAFGLESGTDANIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEETVFSYADADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNADVSFSLPVSYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTITIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLAALPAELVQGAMDAMEDSGLLG
Ga0245271_123307Ga0245271_1233072F068811MDQDGSEHNICSNREGLCPGKEQHGASGWKKIFQHGKEPLRNKDSVSQYCNKKAAVLLILNENVSETLCIFSIDKTNCCRI
Ga0245271_127731Ga0245271_1277313F099451MEIKNVGQLRKIIENLPDDYEIEMRIRRKLTDEELKNCRYPYPYDTEYLTLEFDDIGVFDKVLCLGVTSNE
Ga0245271_139467Ga0245271_1394671F073573YAENFRRNERCYEAEQCEVTSDDGSVVLSLDVRIRPRGGEIEALPRVMLSVYSEGMLFQVRRMELRTGDTTYAILPDNVQQYTKRGDTDGGFLETMAIPLGKVGMKMLLEASDAAEARWVIQGARTAQTLPLTAAQKKAIRRFCMDCEESAITYQPAFLWYKDYCTKA
Ga0245271_145274Ga0245271_1452741F073574MLLPAAVRPYAADVDGTASRVRCAIALNGSKELHIQLCGRLKRGLFGRNQLLADGDVLCVALHQPDGDVPLFDSRCDGYENVLDGKQPPAPISLHPAICPKCRNAAFQLRLTFEYPEAEELAAFANPDNMFTWVWVTMRCTRCHAVFRGD
Ga0245271_150372Ga0245271_1503722F088920VSANLHHYPAFEAGLILHLILHLILHFSQKVAIFAPKRVILLIFVPTLFFCCGAVLSLHTSSKISGQASLHQPW

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.