Basic Information | |
---|---|
IMG/M Taxon OID | 3300026834 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0055657 | Ga0207583 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A1-10 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 33838029 |
Sequencing Scaffolds | 12 |
Novel Protein Genes | 14 |
Associated Families | 14 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2 |
All Organisms → cellular organisms → Bacteria | 1 |
Not Available | 7 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium → unclassified Mesorhizobium → Mesorhizobium sp. | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | |
---|---|
Location | Wisconsin, United States |
Latitude (°) | 43.2958 |
Longitude (°) | -89.3799 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F003059 | Metagenome / Metatranscriptome | 510 | Y |
F015492 | Metagenome / Metatranscriptome | 254 | Y |
F021340 | Metagenome | 219 | Y |
F028161 | Metagenome / Metatranscriptome | 192 | N |
F030160 | Metagenome / Metatranscriptome | 186 | Y |
F034995 | Metagenome | 173 | N |
F037212 | Metagenome / Metatranscriptome | 168 | Y |
F044018 | Metagenome / Metatranscriptome | 155 | Y |
F049092 | Metagenome | 147 | N |
F078970 | Metagenome / Metatranscriptome | 116 | Y |
F079360 | Metagenome / Metatranscriptome | 116 | N |
F082749 | Metagenome / Metatranscriptome | 113 | Y |
F095664 | Metagenome / Metatranscriptome | 105 | N |
F097297 | Metagenome / Metatranscriptome | 104 | Y |
Scaffold | Taxonomy | Length (bp) |
---|---|---|
Ga0207583_1000066 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1787 |
Ga0207583_1000142 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1556 |
Ga0207583_1000219 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1410 |
Ga0207583_1001463 | All Organisms → cellular organisms → Bacteria | 784 |
Ga0207583_1001536 | Not Available | 772 |
Ga0207583_1001832 | Not Available | 724 |
Ga0207583_1002194 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium → unclassified Mesorhizobium → Mesorhizobium sp. | 681 |
Ga0207583_1002865 | Not Available | 624 |
Ga0207583_1003347 | Not Available | 599 |
Ga0207583_1004931 | Not Available | 534 |
Ga0207583_1005036 | Not Available | 531 |
Ga0207583_1005297 | Not Available | 522 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207583_1000066 | Ga0207583_10000662 | F003059 | MMRAGITAGMLAGAVIVTAAVPAAAQVRDAVYRGTLICDKLPFSAGKGREAIEVTIAGGTVRYSHVVRLRDAAEPVSEQGKGSLNGQDIELQGSWKAGNRQYEAKYSGAFVRRHADLKGTQTWTDGGKSFTRACTGTIKRPFRVFLPGEKK |
Ga0207583_1000142 | Ga0207583_10001422 | F028161 | MKASHVVTLSLAAVLAALAIQAVQAQDNKNIREDDYVRKVPLEDFKVPIVPIIPPGSSLDLRPGRTPDSSDRIYNTTPFSRDQTTPSIGLSIKSPFDDRK |
Ga0207583_1000216 | Ga0207583_10002163 | F095664 | HDAMVEWLTTYAKDAVAAWHGSFDLKCSNQAKLAYLKMNVVDIAGRYIEQNTLEYLYSPVVPGGSASNIHPTQIALAVSLTTEFSRGHAHRGRFYVPMPVHVVDATTGLISVSDAIQVATAAKTFIEALADEPGPDILPGMRVCVMSQRGTGATNVVTGVDVGRVLDTQQRRRNALKETYQHVTVDQGAS |
Ga0207583_1000219 | Ga0207583_10002192 | F021340 | LQTFQAQRTIARSSEARTQFWHVTIVTMFFVVVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLRDGTLCHYMIFDNKTAQTVEDRIGRCDENKAKPKQERPATFTWGK |
Ga0207583_1000585 | Ga0207583_10005852 | F082749 | TQPYHCIAHSAFAANATGEGRLAPAFARRGGMDGESVDAAGKLGRKRLINHAMTLDAGLSLKGVRHDIDPVVSLPARPVPGMALMLVRFINHFEVLRRESLGQLFCDEIGGSHIARLGERSLPVNGYKQVLKASPVKAHNVRS |
Ga0207583_1001463 | Ga0207583_10014632 | F030160 | MSEKVEQRVSVSVVAVRAESYSAADDSIVISLRTKYSTAERTYSVPVECLQDLIIDLRRLSFSAPNAPYEKADSQTEPLLPLELSVAAE |
Ga0207583_1001536 | Ga0207583_10015361 | F097297 | MLVTERMRSLGDHAKSSRQPPLKERKLTKSDASAATQDVINEKEFKKGAAHESAAVVIVDNPAPQTVVHENRNAAVAEITPDEEAAVLATVAAVLAKIDPQPPTADDNVTRPEPVREDAELSAAKRAIIHEWESWSALHSDEL |
Ga0207583_1001832 | Ga0207583_10018321 | F049092 | MKFIEPAFDTGGDLWRPPLNVPDSDAHALNEKFARQSAELRLRLTEVLELHNVQQRQANELQDRYDDIDRLSQTVSALQEAVNQYKMGTAAAEDKIIILESEKAVLQAQLDVALEESKTLADRLHAAEAASDRREATVASSI |
Ga0207583_1002194 | Ga0207583_10021941 | F015492 | QVFDFGQIEEFESLGSGTQKGGSPPKTIIDDGARHTVLFTILESNTEAKIYWKSKDGSQTTIMRGQGLRAFQTVGEFRIEATGDDSRSFRYGYVLFRLKSEKSAQEDKI |
Ga0207583_1002865 | Ga0207583_10028651 | F079360 | PAHWCKGDLGTSWKEPLMRQTLTVAAIAVLVFAAIAELVAAPSRHAAGRDTSVQPTISTYDLHTGYRGMNTLPVAEIPQP |
Ga0207583_1003347 | Ga0207583_10033472 | F044018 | ETVMITSTSRKTQTAVCMILSAVIVSVGLSLGAFAAEHAAHHEGYSVTITQIQ |
Ga0207583_1004931 | Ga0207583_10049311 | F037212 | LIAAAKIGFADWRRAMTNNEIGAADISEPADAIAWRHFWLANVVVESDGRPLVRATRPALQPRKDSLSRRLQRLRHMRAILDASRH |
Ga0207583_1005036 | Ga0207583_10050361 | F078970 | SEISLWSDLATKLSSTKSVPEALDAYTKCVSQRMQMAADDGRKLAEEAQQITQKFAQSLGNGRPGMTS |
Ga0207583_1005297 | Ga0207583_10052971 | F034995 | GFPAWMLKEMSVPLPQYRKGGYADYVLLAYELLAVYGSVLLGIAVHVGRAKWSGQSLFPFDLFKAGMWGVLVAASAYFVAGGVYGLISYGAFDKVIGGTLWGLLAIFWGVFGALVSAGLALLFYAWRGSAGYQPPGSAGLGTPRS |