NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027116

3300027116: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01A4-11 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027116
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0055661 | Ga0207539
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01A4-11 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: No
Use Policy: Open

Dataset Contents
Total Genome Size: 23624174
Sequencing Scaffolds: 17
Novel Protein Genes: 18
Associated Families: 18

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 8
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → Pseudolabrys taiwanensis | 1
All Organisms → cellular organisms → Bacteria | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 2
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome, agricultural field, agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.2958 | Long. (°) -89.3799 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000268 | Metagenome / Metatranscriptome | 1411 | Y
F001757 | Metagenome / Metatranscriptome | 641 | N
F016050 | Metagenome / Metatranscriptome | 250 | Y
F030160 | Metagenome / Metatranscriptome | 186 | Y
F032172 | Metagenome / Metatranscriptome | 180 | Y
F047321 | Metagenome / Metatranscriptome | 150 | Y
F048418 | Metagenome / Metatranscriptome | 148 | Y
F049092 | Metagenome | 147 | N
F054170 | Metagenome | 140 | Y
F058224 | Metagenome / Metatranscriptome | 135 | Y
F059114 | Metagenome | 134 | N
F066928 | Metagenome / Metatranscriptome | 126 | Y
F070493 | Metagenome / Metatranscriptome | 123 | Y
F077482 | Metagenome | 117 | N
F079256 | Metagenome / Metatranscriptome | 116 | N
F080403 | Metagenome | 115 | Y
F088977 | Metagenome / Metatranscriptome | 109 | N
F090709 | Metagenome | 108 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207539_100519 | Not Available | 926
Ga0207539_100542 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → Pseudolabrys taiwanensis | 915
Ga0207539_100555 | Not Available | 911
Ga0207539_100670 | All Organisms → cellular organisms → Bacteria | 868
Ga0207539_100684 | All Organisms → cellular organisms → Bacteria | 864
Ga0207539_101016 | Not Available | 782
Ga0207539_101191 | All Organisms → cellular organisms → Bacteria | 750
Ga0207539_101596 | Not Available | 693
Ga0207539_101903 | Not Available | 660
Ga0207539_101979 | Not Available | 652
Ga0207539_102432 | Not Available | 619
Ga0207539_103107 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 575
Ga0207539_103607 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 546
Ga0207539_104286 | Not Available | 518
Ga0207539_104470 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 511
Ga0207539_104554 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 508
Ga0207539_104606 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 506

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207539_100519 | Ga0207539_1005192 | F054170 | AGNKRQLSAIDLEETEEILSWDVSDEALEIAGTAGQEIAGAYTLQFCTSMDCALAS
Ga0207539_100542 | Ga0207539_1005421 | F048418 | ARRQAAAHAIDWLGLWALSVAFCAIVLAGIAYFMIGDNTGASCILVGAAAIIAIVVRFGTDRDEARDEN
Ga0207539_100555 | Ga0207539_1005551 | F090709 | AIDPSVTLPIDKLNWMQNELVKAGNLKAPFDLTTMTAPDIRAEAAKRATK
Ga0207539_100670 | Ga0207539_1006702 | F080403 | ANQERELAAYVAKAQTEIQRRESEWWQKQLGSEEVSAA
Ga0207539_100684 | Ga0207539_1006841 | F016050 | MDIQPDANVEDQDRAKCGKNEAGGMESSGCWVRKHVGNRAADDRSDDAEHDCPKNRHGHVQYRFRDKARD
Ga0207539_101016 | Ga0207539_1010161 | F088977 | MRTTSLAGVLSLSLSAPLFSQTFNQTATYIALMRSSVGGLPPVATSTLQGDLQDGVALAIRYGYVPSSSRMDLPSMNNFGLTAVLPTGTASTVSITGGLSSLSRGGSDAWIIGAGGDLRLTDWAFSQGRSAPHLRVAVNGQLDYSKPRESALIAGSVGLPLSIIRPNRPKQEMQVVPFVTPSFAFGNFNPDDDTGLSHESGARFMIGGGLGLY
Ga0207539_101191 | Ga0207539_1011911 | F000268 | MRVVAVMLLLSAGIAAEAMSYSFVSKASGRLGGPIRFEFYRDSTTHPKTDIETFTVSMRTADDRWKAMWSILSGRGLTQPIEYGVTPPGFTTMIQPQKLIPGRVYAGFATDGHGGTSGVTFGFDKNGRMTFPDSFDR
Ga0207539_101596 | Ga0207539_1015961 | F079256 | PGGLNKVAHNVSKTMKKAGRDTKAEVHRDASKTHQTLTKAGNDTKAQLKRTTGVTTHSPDANHKPGGVNKLARDVSHTSKTVGAKAKHSVKSASGEVHRDLTKAGKHAKEVVKDSVKKP
Ga0207539_101903 | Ga0207539_1019031 | F049092 | MKFIEPAFDTGGDLWRPPLNVPDSDAHALNEKFARQSAELRLRLTEVLELHNVQQRQANELQDRYDDIDRLSQTVSALQEAVNQYKMGTAAAEDKIIILESEKAVLQAQLDVALEESKTLADRVLAAEAAAKRREENIASSLKQIDFLNAELMAASSERFKVVAAMQGEQRR
Ga0207539_101979 | Ga0207539_1019791 | F077482 | HQWYASTLVLPIVILAYPSLRCGEAYELLLDRELSALDQFYAGWGTVIRMDENSEGQGPKDTMQDNEYRPAGGIKKFAEPTIQLRGTAEKIGLDRQSLSSFIGTKFLNEFAFLQSDFVFEKTYQTWEIGIFECETWTVGVNYPIAFHVQCAGGSMDEPREWHYASLGYGPADKTSETVRGTLDAIIQEYATFVRKASGKD
Ga0207539_102432 | Ga0207539_1024321 | F032172 | VQIIRGRFGPSGGIVPELDKDGQVVPTGHFNNRLGFHALMQAVGVDERVMTLIELFANPKLNAEITRQLAAGQRSVSISGEDAAKTGEF
Ga0207539_103107 | Ga0207539_1031072 | F066928 | MSPADRYRALAAYLRTCAAREQRPEVRTQWTKLAQCYLRLAEQADQNSRADIVYEFDRSA
Ga0207539_103607 | Ga0207539_1036072 | F058224 | MSELLGEVNRSIRDLASTVDPHDSSTWEFVCECGEEGCTERFGLPLARYDELKHTGVALLAPGHPPRRAEG
Ga0207539_103979 | Ga0207539_1039791 | F059114 | GARMRPHLIYLVSRAVLLFLSVVAAIASMQATSFGFISVTMSPSREGFLWLSLAIVLVAFAALGIRVALDWRLSYWILFPAVFATLIFATFGTSILSLL
Ga0207539_104286 | Ga0207539_1042861 | F030160 | VEQRLPVSVDALRVESYSATDDSIVILLRPKYSTAERAYSIPVECLQDLIIDLRRLSFSAPNAPYEKADSQTEPLLPLELSVAAE
Ga0207539_104470 | Ga0207539_1044701 | F070493 | MESEQVQLGSVTFVLAETILRELCAKVTHHLVARHLRDYAGSSDA
Ga0207539_104554 | Ga0207539_1045541 | F001757 | RGNFPCPVDIPMKERAIMMRVLKPLQDKATTGPGKKLVLPEPRRVRFLIRGEGSVSAGAVTIECCPESTKAGKFWMELATIPVPDNGIAQYYTEEASGLLRARISQPVSNGAVTVTPLVSRGRPDRPDRTKVV
Ga0207539_104606 | Ga0207539_1046061 | F047321 | MKNKCSTRSTDPHYSLFTPRSPAIARRRRLGNSGFINLRALFALLLCFSGVALAILAGRDVSVRRASEPERYMPVPSAKGQSEAVGLERLEQYWHDRLTFPTGRFDPAWVRAAVAQHDRMATG

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice