NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300003144

3300003144: Marine sediment microbial communities from deep subseafloor - Sample from 18.6 mbsf



Overview

Basic Information
IMG/M Taxon OID: 3300003144
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0111471 | Gp0097590 | Ga0052244
Sample Name: Marine sediment microbial communities from deep subseafloor - Sample from 18.6 mbsf
Sequencing Status: Permanent Draft
Sequencing Center: Japan Agency for Marine-Earth Science and Technology
Published: Y
Use Policy: Open

Dataset Contents
Total Genome Size: 39626905
Sequencing Scaffolds: 56
Novel Protein Genes: 57
Associated Families: 39

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Predicted Viral | 2
Not Available | 47
All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1
All Organisms → cellular organisms → Bacteria | 5
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium RBG_13_56_8b | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Marine Sediment Microbial Communities From Deep Subseafloor
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Neritic Zone → Unclassified → Marine Sediment → Marine Sediment Microbial Communities From Deep Subseafloor

Alternative Ecosystem Assignments
Environment Ontology (ENVO): marine benthic biome, oceanic subsurface zone, marine sediment
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: Northwestern Pacific Ocean
Coordinates: Lat. (°) 41.1773 | Long. (°) 142.2013 | Alt. (m) N/A | Depth (m) 1180.5


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F016493 | Metagenome / Metatranscriptome | 246 | Y
F020845 | Metagenome | 221 | Y
F022812 | Metagenome / Metatranscriptome | 212 | Y
F031354 | Metagenome / Metatranscriptome | 182 | Y
F037999 | Metagenome / Metatranscriptome | 167 | Y
F041583 | Metagenome | 159 | Y
F046170 | Metagenome / Metatranscriptome | 151 | Y
F050157 | Metagenome | 145 | Y
F050887 | Metagenome | 144 | Y
F056273 | Metagenome | 137 | Y
F058676 | Metagenome | 134 | Y
F059633 | Metagenome | 133 | Y
F059653 | Metagenome / Metatranscriptome | 133 | Y
F069658 | Metagenome | 123 | Y
F069687 | Metagenome | 123 | Y
F070873 | Metagenome | 122 | Y
F074420 | Metagenome | 119 | N
F075650 | Metagenome | 118 | Y
F076658 | Metagenome | 118 | Y
F078009 | Metagenome | 117 | Y
F080857 | Metagenome / Metatranscriptome | 114 | Y
F082134 | Metagenome | 113 | Y
F083668 | Metagenome / Metatranscriptome | 112 | Y
F086576 | Metagenome | 110 | Y
F087951 | Metagenome | 110 | Y
F088228 | Metagenome | 109 | Y
F088254 | Metagenome | 109 | Y
F089412 | Metagenome | 109 | Y
F089742 | Metagenome | 108 | Y
F089904 | Metagenome | 108 | Y
F091295 | Metagenome / Metatranscriptome | 107 | Y
F096594 | Metagenome | 104 | Y
F098316 | Metagenome | 104 | Y
F102405 | Metagenome / Metatranscriptome | 101 | Y
F102488 | Metagenome | 101 | N
F104424 | Metagenome | 100 | Y
F104467 | Metagenome | 100 | Y
F104468 | Metagenome | 100 | Y
F105883 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0052244_1000094 | All Organisms → Viruses → Predicted Viral | 2789
Ga0052244_1000542 | Not Available | 1886
Ga0052244_1000973 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1594
Ga0052244_1002020 | Not Available | 1265
Ga0052244_1002421 | All Organisms → Viruses → Predicted Viral | 1154
Ga0052244_1002609 | Not Available | 1102
Ga0052244_1002797 | Not Available | 1048
Ga0052244_1003223 | Not Available | 922
Ga0052244_1003311 | Not Available | 902
Ga0052244_1003563 | Not Available | 868
Ga0052244_1003756 | All Organisms → cellular organisms → Bacteria | 848
Ga0052244_1004039 | Not Available | 818
Ga0052244_1005543 | Not Available | 1482
Ga0052244_1005909 | Not Available | 1414
Ga0052244_1006809 | Not Available | 1286
Ga0052244_1006834 | Not Available | 1283
Ga0052244_1006863 | Not Available | 1281
Ga0052244_1006977 | Not Available | 1267
Ga0052244_1007121 | Not Available | 1250
Ga0052244_1007153 | Not Available | 1247
Ga0052244_1007775 | Not Available | 1175
Ga0052244_1008058 | Not Available | 1145
Ga0052244_1009791 | Not Available | 969
Ga0052244_1010173 | All Organisms → cellular organisms → Bacteria | 928
Ga0052244_1011316 | Not Available | 824
Ga0052244_1011339 | Not Available | 822
Ga0052244_1011703 | Not Available | 790
Ga0052244_1012242 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium RBG_13_56_8b | 739
Ga0052244_1012326 | Not Available | 731
Ga0052244_1013264 | Not Available | 628
Ga0052244_1017057 | All Organisms → cellular organisms → Bacteria | 903
Ga0052244_1017334 | Not Available | 896
Ga0052244_1017597 | Not Available | 891
Ga0052244_1017939 | Not Available | 886
Ga0052244_1018179 | Not Available | 883
Ga0052244_1019837 | Not Available | 864
Ga0052244_1021533 | Not Available | 848
Ga0052244_1021769 | Not Available | 847
Ga0052244_1023465 | Not Available | 835
Ga0052244_1025769 | Not Available | 820
Ga0052244_1027234 | Not Available | 811
Ga0052244_1028448 | Not Available | 805
Ga0052244_1028796 | Not Available | 803
Ga0052244_1029148 | Not Available | 801
Ga0052244_1030814 | Not Available | 793
Ga0052244_1031524 | Not Available | 789
Ga0052244_1031744 | Not Available | 788
Ga0052244_1031866 | Not Available | 787
Ga0052244_1033220 | Not Available | 779
Ga0052244_1033465 | Not Available | 777
Ga0052244_1034094 | Not Available | 773
Ga0052244_1038017 | All Organisms → cellular organisms → Bacteria | 741
Ga0052244_1038646 | All Organisms → cellular organisms → Bacteria | 732
Ga0052244_1041399 | Not Available | 669
Ga0052244_1043171 | Not Available | 587
Ga0052244_1044282 | Not Available | 523

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0052244_1000094 | Ga0052244_10000942 | F088228 | VTVTESSLNELELLSNKSLVAEALQLFINTYIELSRRREVTKDFLRDLCSFSVEAQVWYILWESGGSQRFTDILLLVGCSRSKLSDVLRELLRVGLVRMVEKRYQAVSPAWLVCIREPNKQKP*
Ga0052244_1000542 | Ga0052244_10005423 | F104467 | MNCVTNILDFYNRKSKDLEDDLSGFNKGTLSKLLIKIMKNFNKDKNNDLELQNCLILLVNLFSNNESPDFYNKKGIDASDLLNCDRETFSDVLKLEFLNN*
Ga0052244_1000973 | Ga0052244_10009731 | F080857 | MSGDERGGETFLAKVYKGWRVTIYEPVREYLDLEIGDTLRVTVQKDEGSTHSRP*
Ga0052244_1002020 | Ga0052244_10020202 | F105883 | MKVQKIVSLDEKTMRISQKMENFSQWVRIGLRNYELQEDMASETMRRIRWAKVAHLLAAAIVEHSIELDAEYKGTIDDLVGKAMVEARSQSSLEEFE*
Ga0052244_1002421 | Ga0052244_10024211 | F091295 | MTNRSEYKEALGRLDKDTRVCIKYIERLINDKFNAAISTQQSPPRVNLSKIESTLDSLQKQIDGLRPKVDAAHSFVQVHDEQEKYLKKLTKNAEEVLDGVKQHFEELYDSGYYREGQRRKIREEGLI*
Ga0052244_1002609 | Ga0052244_10026092 | F104467 | MDCINNILDFYNNKSKDEEIDLSHFNKVWLSNLLIKIMKNFNEGRTNDLELQNCLILLVNLFSNDDSPDFYNKKGKNVRELLDDDRSNYINILKSEFRNN*
Ga0052244_1002797 | Ga0052244_10027971 | F102488 | MILEKLEKSPEIAITIIATEEVFKTYELICLDKLKEIGRSTARDWSFAMGYNHRSSLAKIIRRIKERYPEKLKIYENIYPRLYEAM*
Ga0052244_1003223 | Ga0052244_10032231 | F086576 | MMEPKYPEDFKDWKTLYLMLDKRYLVIESPDGAMEACIHLEDNEVVGIKDKATGKDIYDGNISLAKKIFKR*
Ga0052244_1003311 | Ga0052244_10033112 | F104424 | LNEEENQFLEFLHYQINQKNKFENVVRKVPMFKSMIQEGHKTILKMIDKFENIRRRETYDMLVHEFNNMNQKN*
Ga0052244_1003563 | Ga0052244_10035631 | F050887 | MNMSDSTPEYLKFFGEIKFAEEKSLSAIDFESSLKSYFSKEKRLPQGRYKFAELNNSSIKDFEDSFYSYFE*
Ga0052244_1003756 | Ga0052244_10037563 | F089412 | DATMAVSFTRAAEESYSGNLAYTALIATNNFFVAVLGNGNTSPKSVNGRLWGYRAKADATTYAALVQSEVLSA*
Ga0052244_1004039 | Ga0052244_10040392 | F041583 | MADKMYLVIVLRKEVPDRDTGKAIYDLVKERMADKPDVQVNGHVSNHFDLDEQ*
Ga0052244_1005543 | Ga0052244_10055431 | F102488 | MMLEKLDKSLEVAIIATEEVFKTYELICLDKLKEMGRSTARDWSFAMGYTHRSSLAKIIKRIKERYPDKLKIFDNRFPRVYEAL*
Ga0052244_1005909 | Ga0052244_10059092 | F088228 | MAESGFNKLQLLSQEDLIVETLRRLLKSYAELNRREEVTKTFLKSLCALGVESWIWFILWEGGGAQRFTDILHVANCSRSKLSNVLQELLRVGLVRMVGVRYQAISPAWLVRG*
Ga0052244_1006809 | Ga0052244_10068094 | F075650 | ENWVKLPVGKTVCMRFREYRVASREITDPFWKTPRTVESLLFLVSHVDGVPVDKSFSVVSEKLALEFKPYLDDGSYRGYEWCLVKDGEGFTAPRIASRRRV*
Ga0052244_1006834 | Ga0052244_10068341 | F082134 | MPDYDVKIKFHIFPYRKFNISARDEEEARRIANEQAQHLIDTLDFTIYSIKVKKYLMDRFL*
Ga0052244_1006863 | Ga0052244_10068633 | F083668 | MKWSPDKITALILIIGCLGLLFTGIDSEVKSILTIAAGYLFGTAIAEKKK*
Ga0052244_1006977 | Ga0052244_10069772 | F088254 | MNEKILKEILQELKAIHFHLDELSIFYKFVNRVEMKKEEVKKEIVEDKK*
Ga0052244_1007121 | Ga0052244_10071214 | F069687 | MEELAKKIEQILNQFIQEEMGNRLSQFAMIALKEMVINEIRNYKPGNIKDEVKK*
Ga0052244_1007153 | Ga0052244_10071532 | F096594 | MAYIVVTGSFPAHVGMEVGKTFLKLEKLPDYIKTEHVFNSAAGDYIFFTIYKIEDDSKYFDAIKAITKRFSGYMNIKGYKYTAHPVLEAKDSLSMVGLG*
Ga0052244_1007775 | Ga0052244_10077753 | F089742 | MAYEKFIPERGGRGSMFSTPYVTINKGGISFNRYVKEMIGEKFKYIAFYFDKESKKIGLWFWKNACPGSYSLLWFKNRETFSVNSKAFLRAYDIQEIIKKCNARHFPLERDEDNKQDKDFYCI
Ga0052244_1008058 | Ga0052244_10080581 | F091295 | VTNRSEYKENLSRLDKDIRRCLLYLEGLIEERFQLVLNAKQPPAPKVDFSKFESKVDSLQKQIDGLRPKVDAAYSLVQEHDEHEKYLKNLTKKAEELFDGLKQHFEELYNSGYYSKVKRRKILDEGLRLHKRES*
Ga0052244_1009791 | Ga0052244_10097912 | F080857 | LEKENEETFLAKIATGWRITIYEPVRESLGLEIGDLLRVTIRKDEAKR*
Ga0052244_1010173 | Ga0052244_10101733 | F089742 | GRGSMFLTPYVTINKGGMSFNRYVKEMIGDKFKYVAFYFDKETKKIGLWFWKDSCPGSYALIWFKNRETFTVNSKAFFLAYDIQEIIKKCKARHFPFERDEDNKQDKDFYCIQLKPPKGRSTIR*
Ga0052244_1011316 | Ga0052244_10113161 | F050887 | MSDTLPEYLKYFGEIKFAEEKSMSASDFERSLNKHFSIKTEKIPRGRYKFAEINNPSTDDFEESFNNYFE*
Ga0052244_1011339 | Ga0052244_10113391 | F022812 | QAMSKLAFRAQTVDIAYLPLRQDFLAKNVLMTAQRDFRLGNSPSTLKKR*
Ga0052244_1011703 | Ga0052244_10117032 | F075650 | VALENWVKLPVGKVVCMRFKEYRVTPREITDPFWKTPRTVESLLFLVSHVDGKPVDKTFSVVSEKLALEFKPYLEDGSYRGYEWCLVKDAPGFVAPRIAKRTRI*
Ga0052244_1012242 | Ga0052244_10122422 | F020845 | MNKTKQTEQKEIGRIKLGDTQDLVVSIVDDEKVDLRIFLNTDSYKGPTKRGVRFYLFDDNWTEFKKLIEKVDKVYEELA*
Ga0052244_1012326 | Ga0052244_10123263 | F086576 | DFKDWKTLYLMHDKLYLVIESPDGAMEACIHLEDNDVVGIKDKATGEDIYDGNLRLAKKILKR*
Ga0052244_1013264 | Ga0052244_10132642 | F089904 | MTVEVSEEVKKEIFKLIATPGIEIDDIVKKTKLDSETILNILSEEYLRCNLDHGKSLCCR
Ga0052244_1014009 | Ga0052244_10140091 | F037999 | KFNEMPMAERVKLIMKDPAELIDYVRKESHVVWKAVEAFIRRENNEGRDALIEGVAVLLELMSQLEDIPHRVVFIGNQGENHKENIKKSAEENEHDWMRGVSDQYIGAFAMFVKRMSAYIEQEAKKYGFEYIEMDKELFGDVTEEVMKSLGLSAR*
Ga0052244_1017057 | Ga0052244_10170571 | F016493 | MGWFGDKIEKLRRMPNPYLFMHVTGKFLFGVGLGILLAIWLPIWTGWLFIIAALLIAIPSARIILGSKAK*
Ga0052244_1017334 | Ga0052244_10173342 | F102405 | MAQEVNLLDPLGIAKAVRGQVNQMAISAKLPPLPEMPAIKVSMAGLPLLNQFNQIPDRAGMGRGERAG*
Ga0052244_1017597 | Ga0052244_10175972 | F056273 | MSWRAKVKEFGGGDITFLSSDGETIMFIVVDEPVLLHGKFKGKEQDRIGCPVVTDEGFVLFITGKRVARKLSKFEERFIDTAFMITRHGVEGDINASYDVGILSLPEKTEQLFDIAKKEYKPAMLKAAIEDAKEAMKN*
Ga0052244_1017939 | Ga0052244_10179392 | F050157 | IKPGPIDDKFKSDLLKTYENFIEIQAEVVPFGGKMIKRMAKYSLKFVDRYLDLIGAIPLIFHSCSKCDSVISSQTIVDLMSSGGSSSS*
Ga0052244_1018179 | Ga0052244_10181791 | F041583 | DKMYLIITLRKEVADRDEARAIYDLVKQKMTDRPDVTISGHCSNHFDMEEPT*
Ga0052244_1019837 | Ga0052244_10198371 | F059633 | MDSKKEKDPFDIHDRRKIRIKEFLVFKELDDDQDNYIIKEFEKTKEQTLEALEFIQKIIKNHPDMNTERIKGYLTRQRFFMKLFLKEVNREFSE*
Ga0052244_1021533 | Ga0052244_10215332 | F078009 | MPREKIFLTPEQTARLKGLADDIEWLREEIRRAEYVGIDVTELKARFEKTNTIRERMLEEYTR*
Ga0052244_1021769 | Ga0052244_10217691 | F020845 | MNKTKQTEQKEIGRIRLSDTQDLVASLVDNEKLDLRIFVKSDSYTGATKRGVRFYMFDDNWDEFKKLID
Ga0052244_1023465 | Ga0052244_10234653 | F020845 | MRKVRQTEQKEIGRIKLNDTQDLVVSIVDSEKLDLRVWKDTDRYKGWTKRGIRLYLFGDN*PEFKKLVDKVD
Ga0052244_1025769 | Ga0052244_10257691 | F087951 | KLSLKIEQLTSEISKMKEKLDKTNKFNRSFGLEAIRNSKPLEFAKEIHGLKSPTKRYILSIRTVSELWKGRKNDFLIAIRQQEKQTQKDIKALGIRIPIDDMKNLTTLAREVLSMLYISSELKGIEITEILREIISQINKEGQSMVREVKQNMLL*
Ga0052244_1027234 | Ga0052244_10272342 | F069658 | MTEEISEEVKNEIFKLVSTPGVEIDDIPDRVNKVYKTELNYEEIMKILSDEYLKYNLDYGRRLCCRF*
Ga0052244_1028448 | Ga0052244_10284481 | F080857 | MSGNERGGETFLAKVYKGWRITVYEPVREYLDLEIGDTLRVTVQKDERRARP*
Ga0052244_1028796 | Ga0052244_10287961 | F031354 | MKTTFDINDILYPMLNTALVLGKLDGGRIYRNKKPLNSELQDIVIIPLSNYVGDEIINDATFMVNCYCKNFNNGTPDIAKLRATIDEVADVIEKYKNASNYYVFDITNQILLNDIDQKSMSYINLRIKCYIEK*
Ga0052244_1029148 | Ga0052244_10291481 | F076658 | MWYRYYLTVPRGTEKTASVRKEIGLPEGVVTKVRVRFPPGPRGQVFTAVFQGTHKMWPRGEGNYAWGDDEPIEMSEHVRNITGWHYFLEGYAPLTSYDHTIWWDFNILEKEYAETWGPIQRLVELLEEFFGW*
Ga0052244_1030814 | Ga0052244_10308142 | F089412 | LSNSNTIAVAQEAIRSDAPTNGVSFIRNASESYAGNNLDYISIIATNNFFVSCLGSGNLGAKGVDGRMWGYRARADASTYAALVQSEVLSA*
Ga0052244_1031524 | Ga0052244_10315242 | F075650 | MALENWVKLPIGKTVCMHFREYRVTPRQITDPMFNVPRTVQSLVFLVDRVDGEPVDKTFSVVSEKLAQEFEPYLEDGSYRNYTWCLVKDAAGFVAPRVAKRSRV*
Ga0052244_1031744 | Ga0052244_10317442 | F059653 | MTNKEALQSQTEYSNDNLLEKLLLDRGIEAEETYATANAKDIDLCAANLYFILAAHPEYREGSYSIKYNSAQLIAMAKTILKKYDMDESTVTGEAIW*
Ga0052244_1031866 | Ga0052244_10318661 | F046170 | MIRIQDFGEIGYGGAVTLTEWWDNKRIDQGKIGPKDVFKKASFYTYLGVGLAATLMSVFGWMRRYERWSEHVSHGFLYDVPRFAYNLTKALGAGSKRGMGSESRAVQEAQRILNEKLRAKALTEGSGT
Ga0052244_1033220 | Ga0052244_10332201 | F058676 | VKERIIRLELTDLEDILKEKGSIKEEESLNDAHIEPGELVLRVLYEDEKAT*
Ga0052244_1033465 | Ga0052244_10334651 | F104468 | VSLINNTARYRDGRRFPFHLLVSPINGTAMTALNESDLWHVHNYLMPQLVLIKKQQKVIAQFHSLPRLGNWRALMNFADHCYTIKQPGQEKEYNLKGLPNIIDPDEYRPIRRGPPLKIAFAPSTQAPVGHPASKGYREVKAILNSVAEKRDVEIVWIEGRPYEANLRLKQQSHILIDDVVTGNWHRTSLEGACFGCAVLNKVMKTPFVYASIDTLEERILWLVDNPAVLSDFQERARLWVLQNWHAMDSIKEYIKAYE
Ga0052244_1034094 | Ga0052244_10340942 | F076658 | MWYRYALLVETDDTRTVPARKEIELPEGVVTGVRVRFPPGSRGQVYTAVFQGTHKMWPRGEGNYAWGDNESIDMDEHVRNITGWHYFLEGYAVNCRYDHTVWWDFNVLEKEYAETWGPIQKLVKLLEDLIGV*
Ga0052244_1038017 | Ga0052244_10380172 | F091295 | MTNRHEYKEALGRLDKDTRVCIKYLERLINDKFNAALGAQQPPPRVNLSKIESTLDSLQKQIDGLRPKVDAAHSFVQLHDEQEKYLKKLTKNAEEVLDGVKHHFEELYDSGYYSEGTRRKIRDKGLI*
Ga0052244_1038646 | Ga0052244_10386462 | F074420 | MFLRLIIYYLFNRKKGVYMKTFIVIMATVILGILIWGLILGSGENSLISQATRILDYGISQLQTIP*
Ga0052244_1041399 | Ga0052244_10413991 | F098316 | MRGAFRGGLSASRFIADMKAVGLSYRRTDMLADWRSVSGLEAKKDALKYVRKDRYPTEKVMASVTWALSKEYMYVVKVKSRLTPDVPVTERNVNIISDVPMTPAMIEAEVTERWGEWEKYAAEELVGLQVWTAVRKVME*
Ga0052244_1043171 | Ga0052244_10431712 | F070873 | MAWGGSLNKKGSDHLRNLREADKVIKFLEKFKEDLSVTEKKLFSNVIEIIVDHFTKSS*
Ga0052244_1044282 | Ga0052244_10442821 | F020845 | MNKTKQTEQKEIGRVKLSDTQELVASIVDNEKVDLRIFVKTDSYTGATKRGVRFYLFDNNWTEFK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"