NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026834

3300026834: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A1-10 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026834
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0055657 | Ga0207583
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A1-10 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 33838029
Sequencing Scaffolds: 12
Novel Protein Genes: 14
Associated Families: 14

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2
All Organisms → cellular organisms → Bacteria | 1
Not Available | 7
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium → unclassified Mesorhizobium → Mesorhizobium sp. | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.2958 | Long. (°) -89.3799 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F003059 | Metagenome / Metatranscriptome | 510 | Y
F015492 | Metagenome / Metatranscriptome | 254 | Y
F021340 | Metagenome | 219 | Y
F028161 | Metagenome / Metatranscriptome | 192 | N
F030160 | Metagenome / Metatranscriptome | 186 | Y
F034995 | Metagenome | 173 | N
F037212 | Metagenome / Metatranscriptome | 168 | Y
F044018 | Metagenome / Metatranscriptome | 155 | Y
F049092 | Metagenome | 147 | N
F078970 | Metagenome / Metatranscriptome | 116 | Y
F079360 | Metagenome / Metatranscriptome | 116 | N
F082749 | Metagenome / Metatranscriptome | 113 | Y
F095664 | Metagenome / Metatranscriptome | 105 | N
F097297 | Metagenome / Metatranscriptome | 104 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207583_1000066 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1787
Ga0207583_1000142 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1556
Ga0207583_1000219 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1410
Ga0207583_1001463 | All Organisms → cellular organisms → Bacteria | 784
Ga0207583_1001536 | Not Available | 772
Ga0207583_1001832 | Not Available | 724
Ga0207583_1002194 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium → unclassified Mesorhizobium → Mesorhizobium sp. | 681
Ga0207583_1002865 | Not Available | 624
Ga0207583_1003347 | Not Available | 599
Ga0207583_1004931 | Not Available | 534
Ga0207583_1005036 | Not Available | 531
Ga0207583_1005297 | Not Available | 522

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207583_1000066 | Ga0207583_10000662 | F003059 | MMRAGITAGMLAGAVIVTAAVPAAAQVRDAVYRGTLICDKLPFSAGKGREAIEVTIAGGTVRYSHVVRLRDAAEPVSEQGKGSLNGQDIELQGSWKAGNRQYEAKYSGAFVRRHADLKGTQTWTDGGKSFTRACTGTIKRPFRVFLPGEKK
Ga0207583_1000142 | Ga0207583_10001422 | F028161 | MKASHVVTLSLAAVLAALAIQAVQAQDNKNIREDDYVRKVPLEDFKVPIVPIIPPGSSLDLRPGRTPDSSDRIYNTTPFSRDQTTPSIGLSIKSPFDDRK
Ga0207583_1000216 | Ga0207583_10002163 | F095664 | HDAMVEWLTTYAKDAVAAWHGSFDLKCSNQAKLAYLKMNVVDIAGRYIEQNTLEYLYSPVVPGGSASNIHPTQIALAVSLTTEFSRGHAHRGRFYVPMPVHVVDATTGLISVSDAIQVATAAKTFIEALADEPGPDILPGMRVCVMSQRGTGATNVVTGVDVGRVLDTQQRRRNALKETYQHVTVDQGAS
Ga0207583_1000219 | Ga0207583_10002192 | F021340 | LQTFQAQRTIARSSEARTQFWHVTIVTMFFVVVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLRDGTLCHYMIFDNKTAQTVEDRIGRCDENKAKPKQERPATFTWGK
Ga0207583_1000585 | Ga0207583_10005852 | F082749 | TQPYHCIAHSAFAANATGEGRLAPAFARRGGMDGESVDAAGKLGRKRLINHAMTLDAGLSLKGVRHDIDPVVSLPARPVPGMALMLVRFINHFEVLRRESLGQLFCDEIGGSHIARLGERSLPVNGYKQVLKASPVKAHNVRS
Ga0207583_1001463 | Ga0207583_10014632 | F030160 | MSEKVEQRVSVSVVAVRAESYSAADDSIVISLRTKYSTAERTYSVPVECLQDLIIDLRRLSFSAPNAPYEKADSQTEPLLPLELSVAAE
Ga0207583_1001536 | Ga0207583_10015361 | F097297 | MLVTERMRSLGDHAKSSRQPPLKERKLTKSDASAATQDVINEKEFKKGAAHESAAVVIVDNPAPQTVVHENRNAAVAEITPDEEAAVLATVAAVLAKIDPQPPTADDNVTRPEPVREDAELSAAKRAIIHEWESWSALHSDEL
Ga0207583_1001832 | Ga0207583_10018321 | F049092 | MKFIEPAFDTGGDLWRPPLNVPDSDAHALNEKFARQSAELRLRLTEVLELHNVQQRQANELQDRYDDIDRLSQTVSALQEAVNQYKMGTAAAEDKIIILESEKAVLQAQLDVALEESKTLADRLHAAEAASDRREATVASSI
Ga0207583_1002194 | Ga0207583_10021941 | F015492 | QVFDFGQIEEFESLGSGTQKGGSPPKTIIDDGARHTVLFTILESNTEAKIYWKSKDGSQTTIMRGQGLRAFQTVGEFRIEATGDDSRSFRYGYVLFRLKSEKSAQEDKI
Ga0207583_1002865 | Ga0207583_10028651 | F079360 | PAHWCKGDLGTSWKEPLMRQTLTVAAIAVLVFAAIAELVAAPSRHAAGRDTSVQPTISTYDLHTGYRGMNTLPVAEIPQP
Ga0207583_1003347 | Ga0207583_10033472 | F044018 | ETVMITSTSRKTQTAVCMILSAVIVSVGLSLGAFAAEHAAHHEGYSVTITQIQ
Ga0207583_1004931 | Ga0207583_10049311 | F037212 | LIAAAKIGFADWRRAMTNNEIGAADISEPADAIAWRHFWLANVVVESDGRPLVRATRPALQPRKDSLSRRLQRLRHMRAILDASRH
Ga0207583_1005036 | Ga0207583_10050361 | F078970 | SEISLWSDLATKLSSTKSVPEALDAYTKCVSQRMQMAADDGRKLAEEAQQITQKFAQSLGNGRPGMTS
Ga0207583_1005297 | Ga0207583_10052971 | F034995 | GFPAWMLKEMSVPLPQYRKGGYADYVLLAYELLAVYGSVLLGIAVHVGRAKWSGQSLFPFDLFKAGMWGVLVAASAYFVAGGVYGLISYGAFDKVIGGTLWGLLAIFWGVFGALVSAGLALLFYAWRGSAGYQPPGSAGLGTPRS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice