NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026904

3300026904: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A5-10 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026904
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0055863 | Ga0207584
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A5-10 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 36137000
Sequencing Scaffolds: 26
Novel Protein Genes: 26
Associated Families: 25

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 4
Not Available | 10
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Beijerinckiaceae → Rhodoblastus → Rhodoblastus acidophilus | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2
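
As a quick consistency check, the per-group scaffold counts listed above can be summed and compared with the 26 Sequencing Scaffolds reported under Dataset Contents. The sketch below is illustrative only: the dictionary name and the shortened group labels (last lineage element only) are assumptions made here for readability, not part of NMPFamsDB.

```python
# Scaffold counts per taxonomy group, copied from the Dataset Phylogeny table.
# Keys are shortened to the last lineage element (an assumption for brevity).
scaffolds_per_group = {
    "Hyphomicrobiales": 3,
    "Rhodoplanes sp. Z2-YC6860": 1,
    "Verrucomicrobia bacterium": 4,
    "Not Available": 10,
    "Proteobacteria": 1,
    "Alphaproteobacteria": 1,
    "Actinomycetia bacterium": 1,
    "Rhodoblastus acidophilus": 1,
    "Candidatus Rokubacteria": 1,
    "Bacteria": 1,
    "Acidobacteria bacterium": 2,
}

# Total number of scaffolds across all groups.
total = sum(scaffolds_per_group.values())

# Share of scaffolds with no taxonomic assignment.
unclassified_fraction = scaffolds_per_group["Not Available"] / total

print(total)                          # 26
print(f"{unclassified_fraction:.1%}")  # 38.5%
```

The total matches the scaffold count in Dataset Contents, and roughly 38% of the scaffolds carry no taxonomic assignment.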

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.2958 | Long. (°) -89.3799 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F002103 | Metagenome / Metatranscriptome | 593 | Y
F003059 | Metagenome / Metatranscriptome | 510 | Y
F003121 | Metagenome / Metatranscriptome | 506 | Y
F003202 | Metagenome / Metatranscriptome | 501 | Y
F004992 | Metagenome / Metatranscriptome | 416 | Y
F005126 | Metagenome / Metatranscriptome | 411 | Y
F006812 | Metagenome / Metatranscriptome | 364 | Y
F012381 | Metagenome / Metatranscriptome | 281 | Y
F012989 | Metagenome / Metatranscriptome | 275 | Y
F013448 | Metagenome | 271 | Y
F015418 | Metagenome / Metatranscriptome | 255 | Y
F025530 | Metagenome | 201 | Y
F031563 | Metagenome | 182 | Y
F031960 | Metagenome / Metatranscriptome | 181 | Y
F036290 | Metagenome / Metatranscriptome | 170 | Y
F038225 | Metagenome / Metatranscriptome | 166 | Y
F040398 | Metagenome / Metatranscriptome | 162 | Y
F047897 | Metagenome | 149 | N
F049708 | Metagenome / Metatranscriptome | 146 | Y
F063749 | Metagenome / Metatranscriptome | 129 | Y
F063987 | Metagenome / Metatranscriptome | 129 | N
F068052 | Metagenome | 125 | Y
F070475 | Metagenome / Metatranscriptome | 123 | Y
F081911 | Metagenome | 114 | N
F085779 | Metagenome / Metatranscriptome | 111 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207584_1000054 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1930
Ga0207584_1000145 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1554
Ga0207584_1000191 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1459
Ga0207584_1001042 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 951
Ga0207584_1001578 | Not Available | 846
Ga0207584_1001908 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 803
Ga0207584_1002568 | Not Available | 730
Ga0207584_1003009 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 693
Ga0207584_1003022 | Not Available | 692
Ga0207584_1003147 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 683
Ga0207584_1003302 | Not Available | 672
Ga0207584_1003501 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 660
Ga0207584_1003561 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 657
Ga0207584_1004173 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 625
Ga0207584_1004184 | Not Available | 624
Ga0207584_1004189 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 624
Ga0207584_1004405 | Not Available | 614
Ga0207584_1004584 | Not Available | 606
Ga0207584_1005165 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Beijerinckiaceae → Rhodoblastus → Rhodoblastus acidophilus | 585
Ga0207584_1006111 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 554
Ga0207584_1006152 | Not Available | 553
Ga0207584_1006472 | All Organisms → cellular organisms → Bacteria | 544
Ga0207584_1006532 | Not Available | 543
Ga0207584_1007075 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 530
Ga0207584_1007435 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 522
Ga0207584_1008458 | Not Available | 501

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207584_1000054 | Ga0207584_10000541 | F040398 | AAVVSNEFEAVMPSNIKVLAKGSSAVPNFLRLCVATSGKVLSERRDDLVKFVAAEMDAYKFALANRAETIKVSQEMTHAKPDDKRAEFITDEAIKDKQIDPTLSIPLDRLDWMQNLFLKAGVIKQTVPIESIVDKSVNADAAKIAGK
Ga0207584_1000145 | Ga0207584_10001452 | F003059 | MVRAGIAAGMLAGAAIVMAAMPAAAQVRDAVYRGTLICDKLPFSAGKGREAIEVTIAGGTVRYSHVVRLRDAAEPVPEQGKGSLNGQDIELQGSWKSGNRQYEAKYSGAFVRRHADLKGTQTWTDGGKSFTRACTGTIKRPFRVFLPGEKK
Ga0207584_1000191 | Ga0207584_10001911 | F063749 | AAVAKPSPVEPVRSANLIARASAEFNIGPAGATLERNYFRAPETDSHSGLYLCRFEPSMFAKVRLTQSCK
Ga0207584_1001042 | Ga0207584_10010421 | F063987 | MKFVSRTKSMAVPHQILGASTNEKRELLMCGHSLIARLTIAVFVFQMLGVTSVVHAEGPDSTAGTSSAGTRKLVIGPSSASVALGKASLIVSPLTHRDGNYVGDYQLKVRPYFFKSEKGSLLLAASDDAVRKLQAGTAIN
Ga0207584_1001578 | Ga0207584_10015781 | F081911 | GYIENPNQRAGSVIDNTSAGLLNTQSALPGPWTLPFKSNPWIGY
Ga0207584_1001908 | Ga0207584_10019083 | F006812 | VALPFGTAIACGALMQWTGNMISGWSSIGGLALGIYFFHKCMEVLAAA
Ga0207584_1002568 | Ga0207584_10025681 | F038225 | MKRLSLALLGAVGAFFVLTPAQAADYRVIQYNDTKICQVVDMAGPFKPISSNYTVLTKKSIPTFDAAMKARADVSKKAKCTFL
Ga0207584_1003009 | Ga0207584_10030092 | F002103 | MSTDEPFRTDYEFLKGVDYIFVSLDRNLSGEECHELAEKYFETHKGMTLPGQALRVDLRPAFRKSLADVTPKFRAVSIGYTFTPQR
Ga0207584_1003022 | Ga0207584_10030222 | F015418 | MSIESRRTFTKGLLASALVPGTSAFGQPNDPASIAIIDTPQNAAKVAAKLAAQNVKVVVRFFARKPQP
Ga0207584_1003147 | Ga0207584_10031473 | F031960 | DAPDADSFKNKDYEFYALKTFITLTRKSHGVVVKTYNACWVE
Ga0207584_1003302 | Ga0207584_10033022 | F012989 | AAAGIASAQPPITGTEVSTFTESFSDEPFLCQDELYAQTVSGHSIVHFTRFPDTGAVHFHEDVHGKAVAVPLDGTGPTYTANFWFSDTESIRAVKNGDLLVEQDTDFQHVVARGSDGSRALFDFHAHFTVNANGELAAQFETERAVCT
Ga0207584_1003501 | Ga0207584_10035011 | F004992 | MSNETPAAIEPDMFAAVFSQNWDNARYIKSERISFMNAYSVICAGVLALLQSVQASDLIRIALLFFMTLFSLVGLLTSLRLKSELEECLAKIEAMTVQAKVSQFVALGQLEGKPSRYPRFRWIFPIFYTMTTAGFITLIVYRLVTG
Ga0207584_1003561 | Ga0207584_10035613 | F031960 | SFTNKDYEFYALKTFITLTRKSHGVVVKTYNACRVE
Ga0207584_1004173 | Ga0207584_10041732 | F070475 | MRIDRIERAWGAEALRPAVGWKVWRVEDGALLSVLYGDVWPVDEPVRATCRRHVHVAHEAPAIGCECGIH
Ga0207584_1004184 | Ga0207584_10041842 | F031563 | GRRGRRFPNREEWIRRLEERQRDLEQEIADLADVIKHLKSGETPEGATAL
Ga0207584_1004189 | Ga0207584_10041892 | F003121 | METIYGTQTICGTQTICGIQTISEMETIICGVTGKNTFSLTAQGVGIATGTATATIGGMATNAPSLMDRG
Ga0207584_1004405 | Ga0207584_10044052 | F049708 | MRVKWVLIGFGILYLVFALSFIPTDKILPRPIEESWELVRDILWAGMTIISIGLLEVMSPLATPFASASGLRRGDVAEILLAVILVAAAVYSALRLRRKDLTPRWRGTHTFFLLLALTLV
Ga0207584_1004584 | Ga0207584_10045842 | F025530 | RRDWEAAYPLLQEADRSGDLAPDDLELLGEAARDNIAAGIWFAVLAGSLFLYAQSILMTTGLMLELTAAYSTFVLCGKSARSPFVHAIPYAFALAGAVFLCLAPDFHNAIEASLVFLGVTALMHGSVVYSALKNPRETEDPVYASAT
Ga0207584_1005165 | Ga0207584_10051651 | F068052 | AKIDSLEGQVTSLKNREWPKLTPAAVTDLESILAPQGPHVVSVLLQDRDSVFLARDLVDAFKRIGWKAKRDTSVNDVPDGLTVWPEDDVARAICNALTMATGALVAVREDQHLKDQGTYAIGVGYKLI
Ga0207584_1006111 | Ga0207584_10061111 | F012381 | DRDVKVVALGGLDAQLAALSRKEIHAYVWGDGGAVTQLAGKSKVLMRLDKVTPRWISQIQYVSEEGIKKQGDDIRKVMKGLFSAIRFMTDQTADAAEIISKKIGWSTEAVLAAHKISGSLMSHDGTVSLEALRSMQDTLLEHGVIKKRLQLDLQSEKVEGRGRVYRIAA
Ga0207584_1006152 | Ga0207584_10061521 | F036290 | AAAVAACSNNPTATRDEAADFPVVAHWTATATPIAPATITGQLMYDQHTGFHSDVTFAVTGPPNAVFQWRIFKRDCTVNVAATNNTSPTGLVLFATTQSYPDLTLDASGKATVKVAIAGWLDSLTAYSVRIRPTATTTFNGVNPASCGTLKYAPAQ
Ga0207584_1006472 | Ga0207584_10064721 | F005126 | MDIGRSGGNGTGNRSLPGSSDPHVQEVVKAAHEELRQLMRQRADVMKRIGTVKQTIVGLANLFGDEVLSEELLELVDRKSGGRQPGFTKACRMVLMEAARPLGAREVCEHIQQRIPPV
Ga0207584_1006532 | Ga0207584_10065321 | F047897 | VGALTESLLRRKPQTGSSRAVETFRFICYLDRIPPHILEDPAFRMLAGHGEVECELEAEDFAEAVAEAIRTLEVDFVHVLGVERLI
Ga0207584_1007075 | Ga0207584_10070751 | F003202 | NKVADLTAQLQDTQGELEQMRVQLADANRKLGFLEATKAHVQVTAYALTPEFGPDPLFSNNSPARTAYAVPKHTLPAGKVLNVALSPLAERKLHANLNDTLVLTTKRGGSKYLARFVDRTAQSETRPVVDVLFADAHQARIWGRRSFYAVNISRPDSPFRAE
Ga0207584_1007435 | Ga0207584_10074351 | F013448 | MPASLSSLLASLPMSLKGLGFAFQFQLLLLLVLEEFGKRGDVYNILWALVFGFATLNILGLVARRFEPSRNRLNFGETIAILVVIVSIVLLGWEMLTLFKIF
Ga0207584_1008458 | Ga0207584_10084581 | F085779 | GVAAAVLLYSLGAAVFAQDEPRRPLNPGLVPNTGQINTGAAPQPWSKSAEVDNIPAPAEAWAALVQPISRQPSAGGGATTTGTGGQQPPASGTINASNEPPPSGPIGSFGQTIPAKFSKRNDILDHLPTMAIPLPLTQEQRKQIYDAVMAEKSQPVVGADALEPAS
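
For downstream tools, rows from the Sequences table are often more convenient in FASTA format. The following is a minimal sketch, assuming each row is available as a (protein ID, family, sequence) tuple. The function name `to_fasta`, the `family=` header annotation, and the 60-character line width are illustrative choices made here, not something NMPFamsDB prescribes. The two example records are copied from the table above.

```python
def to_fasta(records, width=60):
    """Format (protein_id, family, sequence) tuples as a FASTA string.

    The header layout '>protein_id family=FAMILY' is a hypothetical
    convention chosen for this sketch.
    """
    lines = []
    for protein_id, family, sequence in records:
        lines.append(f">{protein_id} family={family}")
        # Split the sequence into fixed-width chunks, one per line.
        for start in range(0, len(sequence), width):
            lines.append(sequence[start:start + width])
    return "\n".join(lines) + "\n"


# Two rows copied from the Sequences table.
records = [
    ("Ga0207584_10015781", "F081911",
     "GYIENPNQRAGSVIDNTSAGLLNTQSALPGPWTLPFKSNPWIGY"),
    ("Ga0207584_10035613", "F031960",
     "SFTNKDYEFYALKTFITLTRKSHGVVVKTYNACRVE"),
]

fasta = to_fasta(records)
print(fasta, end="")
```

Both example sequences are shorter than 60 residues, so each occupies a single line after its header; longer sequences would be wrapped across several lines.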




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"