NMPFamsDB

A database of Novel Metagenome Protein Families

Scaffold Ga0104054_100016


Overview

Basic Information
Taxon OID: 3300007220
Scaffold ID: Ga0104054_100016
Source Dataset Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 764305738
Source Dataset Category: Metagenome
Source Dataset Use Policy: Open
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Sequencing Status: Permanent Draft

Scaffold Components
Scaffold Length (bps): 234807
Total Scaffold Genes: 231
Total Scaffold Genes with Ribosome Binding Sites (RBS): 217 (93.94%)
Novel Protein Genes: 11
Novel Protein Genes with Ribosome Binding Sites (RBS): 10 (90.91%)
Associated Families: 11
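
The RBS percentages in this block follow directly from the gene counts. As a minimal sketch (the helper name `rbs_percentage` is hypothetical and not part of NMPFamsDB), they can be recomputed like this:

```python
def rbs_percentage(genes_with_rbs: int, total_genes: int) -> float:
    """Return the share of genes that have an RBS, as a percentage
    rounded to two decimal places."""
    if total_genes <= 0:
        raise ValueError("total_genes must be positive")
    return round(100.0 * genes_with_rbs / total_genes, 2)


# Counts taken from the Scaffold Components block of this record.
print(rbs_percentage(217, 231))  # all scaffold genes
print(rbs_percentage(10, 11))    # novel protein genes only
```

Rounding to two decimals is an assumption that matches how the percentages are displayed on this page.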

Taxonomy
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales (Source: IMG-VR)

Ecosystem & Geography

Source Dataset Ecosystem
Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, USA, HMP Production Phase

Source Dataset Sampling Location
Location Name: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816, Long. (°) -77.1012173, Alt. (m) (not provided), Depth (m) (not provided)

Associated Families

Family   Category    Number of Sequences  3D Structure?
F080166  Metagenome  115                  N
F081455  Metagenome  114                  N
F089057  Metagenome  109                  N
F092229  Metagenome  107                  N
F092232  Metagenome  107                  N
F095629  Metagenome  105                  N
F095631  Metagenome  105                  N
F099452  Metagenome  103                  N
F099453  Metagenome  103                  N
F103435  Metagenome  101                  N
F105379  Metagenome  100                  N

Sequences

Protein ID           Family   RBS     Sequence
Ga0104054_100016127  F081455  AGGAG   MENFLTKEISALISKHFEFTNNSDLIKDPIISDDNIIDIPPAEEIGAEKVNMSDIIDCVQEALQQPLENKDSSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSTFNPECNPNKRLRYELIRHQGREKYLVIRLSTVVNGTTKYYADIYPDLNKIDLDHHLISSAKK*
Ga0104054_100016128  F080166  N/A     VNILANFENYNKVVEQIFELNYYLTFKLEVTFNTIHKKINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLCTVIEEAINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAITPLLGSNRQNMLR*
Ga0104054_100016131  F099453  AGGAGG  MNRFDVIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCVKQSCQWILDNIQYIRSLGLVVIPEVYQARLANLGNIIYTPKYPIAIAMAKLEYMLGKKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGA*
Ga0104054_100016150  F092229  AGGAG   MNKEYRFNHIPEVVLRNIRFIRDNNIDIGTGDDVLECMMDINPIVRTKIYDDYEFAKDVAERRFGSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSSKVLDLAIKYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLVASRAEEFNPEIVREIFVKYGLKPNSSRNLYNRINDNLNLFYYIEDYLDEYREEGRFIYGTKEYKILKELRSLPLMVVLTQLTRKNDSGYILNSNLELVKG*
Ga0104054_10001617   F099452  AGG     MDKTYAALLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSMNIFNLYKEIVDLDDVSLLNDLRKTPWYKDWFIDDKRNSDLIDLSRFNFNSLARFEKEEYLRNVERYDFEAVNPVDGYGLFDTLTKDNDVELFKLAAENILINHGFFNKTDYNFCDVPNEYMGDKEVSVYMCLLNIENMIFVDKKTLDTTILYNIVKDHICGFIYFTLFDRLNKDTRTRAM*
Ga0104054_100016174  F105379  AGGAGG  MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYINPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLDKITQLQPSKENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINIGGYMIDIPKSAMPTLKSDHVVATVYNAPNKNFNILRFKITKRNGIIVNQSMLFLPY*
Ga0104054_100016179  F095629  AGGAG   MRELIICVCLLGCFSIANANNIEQPKEVKIVHNDDSVILHKKIYQLEKRIERLEELLKKEGK*
Ga0104054_100016198  F092232  GGAGG   MNTQAKFIAEYNDKNRPKFNDKFFNKSDDDIIEDLKDVILSCERNKFYTIKVLNFEVIDDYNEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTASSAKTQSITLKTNSNAVKMLRNFIDLNTTNEETVRAAMFSVYLFDHKVTLFEYYLARFGWYETLDKFNFEDVIKISDHDLNDPEYYTFAIANAHMKTPFYISAVKSFMDNDRILQSFVASFARAISLYATKKTTLDQIYTTEFWICKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASTKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTQPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDENFSKMLNIYREEKGYTSAIMLADDAGLELTDTRDPAAVAFDADLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV
Ga0104054_10001660   F089057  GGAGG   MINVIPLIAKKYNRKGDTSGSLKSLIDDLNCIGDTDDVLLFLTSIPRETKYSLAEAFSVIVSNDEYRNIFRTSIVFLNIDLDYHELLLTAIKSESYDIICMINKAIPTPDLFLAKNNYECLTIALDKSYTVFDKILGMVVSQIRHTASSKEGRALGIFMTLCILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAMDAFQYMSEEDIHTVVDDINSRSVLSRYLNNM*
Ga0104054_10001668   F103435  GGGGG   MKFVFTTEPIYQYYRAYLYPSDKDKLDKDLMVEYGDYKDYWDLKNQQDALPENIFVAELTSRDYPRNPWNYVSQLISKLTYSYLIDNPEFENIFSEILFNQSEEEFYEFYKAIDRFYNGSEIFIIVSNDEYSDMVTQMMCNVIRRMYGIHPQVIYDIDDILSIRDDIDFSPQGAQLAYLQRSAYYKLEAKRSLEPLQIWYPFDMNTYTNALE*
Ga0104054_10001698   F095631  GGAGG   MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYMGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGINNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGSNGNISYKDSIYFRVKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARRNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIGLVHRDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMNMKIFVFAKDVFGYNAGLHKTDQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITYITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA*
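
For downstream tools, a record from the Sequences table can be written out in FASTA format. The sketch below is illustrative only: the function name `to_fasta` and the header layout (`family=`, `rbs=`) are assumptions, not an NMPFamsDB export format. The field boundaries are read off the table's four columns (Protein ID, Family, RBS, Sequence), and the trailing `*` is treated as a stop marker and dropped.

```python
def to_fasta(protein_id: str, family: str, rbs: str, sequence: str,
             width: int = 60) -> str:
    """Format one table row as a FASTA record.

    The header layout is an illustrative assumption. A trailing '*'
    is treated as a stop marker and removed from the sequence.
    """
    seq = sequence.rstrip("*")
    header = f">{protein_id} family={family} rbs={rbs}"
    # Wrap the sequence into fixed-width lines.
    body = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([header] + body)


# Example row: the shortest sequence in the table (family F095629).
record = to_fasta(
    "Ga0104054_100016179",
    "F095629",
    "AGGAG",
    "MRELIICVCLLGCFSIANANNIEQPKEVKIVHNDDSVILHKKIYQLEKRIERLEELLKKEGK*",
)
print(record)
```

Wrapping at 60 characters is a common convention rather than a requirement. Pass a larger `width` to keep each sequence on one line.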

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice