Basic Information | |
---|---|
IMG/M Taxon OID | 3300026754 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0091552 | Ga0207631 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01K3-12 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 20461439 |
Sequencing Scaffolds | 8 |
Novel Protein Genes | 9 |
Associated Families | 8 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → Liliopsida → Petrosaviidae → commelinids → Poales → Poaceae → PACMAD clade → Panicoideae → Andropogonodae → Andropogoneae → Tripsacinae → Zea → Zea mays | 2 |
Not Available | 3 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Mycobacteriaceae → Mycobacterium → Mycobacterium tuberculosis complex → Mycobacterium tuberculosis | 1 |
All Organisms → cellular organisms → Bacteria | 2 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | |
---|---|
Location | Wisconsin, United States |
Latitude (°) | 43.3 |
Longitude (°) | -89.38 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F002471 | Metagenome / Metatranscriptome | 556 | Y |
F009131 | Metagenome / Metatranscriptome | 322 | Y |
F037171 | Metagenome / Metatranscriptome | 168 | N |
F055927 | Metagenome / Metatranscriptome | 138 | Y |
F063479 | Metagenome / Metatranscriptome | 129 | Y |
F077375 | Metagenome / Metatranscriptome | 117 | Y |
F099765 | Metagenome / Metatranscriptome | 103 | N |
F103019 | Metagenome / Metatranscriptome | 101 | Y |
Scaffold | Taxonomy | Length |
---|---|---|
Ga0207631_100003 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → Liliopsida → Petrosaviidae → commelinids → Poales → Poaceae → PACMAD clade → Panicoideae → Andropogonodae → Andropogoneae → Tripsacinae → Zea → Zea mays | 3470 |
Ga0207631_100317 | Not Available | 823 |
Ga0207631_101203 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → Liliopsida → Petrosaviidae → commelinids → Poales → Poaceae → PACMAD clade → Panicoideae → Andropogonodae → Andropogoneae → Tripsacinae → Zea → Zea mays | 621 |
Ga0207631_101529 | Not Available | 589 |
Ga0207631_102209 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Mycobacteriaceae → Mycobacterium → Mycobacterium tuberculosis complex → Mycobacterium tuberculosis | 542 |
Ga0207631_102790 | All Organisms → cellular organisms → Bacteria | 512 |
Ga0207631_102843 | All Organisms → cellular organisms → Bacteria | 510 |
Ga0207631_103051 | Not Available | 502 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207631_100003 | Ga0207631_1000031 | F037171 | MYVCARIENDPVEEPEEFAGEAPEQQSVGGGKCPLTYLCPIHYLIHLPLYTFMPKD |
Ga0207631_100025 | Ga0207631_1000252 | F103019 | MLFFHHAGKTSSIPPPAGGLSAAIVVVARRWKEYHVVAAVEGHELKTPETEHRPGPERLLETAHLELDGKLFVSTQQAPTWRANCRRFETGGSLGPTSECRVPQPRWVERVGER |
Ga0207631_100317 | Ga0207631_1003171 | F063479 | MSERNWAETHWIARSTRGGGRPKSGEVDLGPPVKSGRVRGLGELHGLLAELAEAQVGLEGGWSGLATAAVALAAMAGGNTLAGAKERWLAGEGECGAKRGAPGEAL |
Ga0207631_101203 | Ga0207631_1012031 | F002471 | VNDDLNAKLEIASKSTSCVEIVETCNRCKDFDIDACNEHLVSISKLNDEVASLNVQLKTSKNEFDKLKFARDAYTIGRHPSIKDGLGFKREAKNLTSHKAPIPAKEKGKDPMATSAKKNHAFLYHDRRQTRNAYRNYNAYDDFDSHVMFASSSSYMHGRNMSRRNAIHHVPRKNIIHAPRKVVNEPSTIYCALNASFAICRKDRKI |
Ga0207631_101529 | Ga0207631_1015291 | F099765 | APGRSQRARRAGDAVMGPKPPLVGLSDHPRASRSIQQIKAWGGLLGFVVVAGYSYMGGMGVADALLRGVIAGVAAQMLAWIAAIVLWQHLLEGEAKAAVRRARENRNRAAQRGSQGTEA |
Ga0207631_102209 | Ga0207631_1022091 | F037171 | MYVCNRIENDPVEEPEELAGEAPEQQSVGGGKCPLTYLCPIHSLIHLPLYIFMPKV |
Ga0207631_102790 | Ga0207631_1027903 | F077375 | YGSAAMVGKIARGAYVLVVGTMVVAWVISVNKEAPAKPQQTGPQVSYMYS |
Ga0207631_102843 | Ga0207631_1028431 | F055927 | IEGVFEPHYTAGCSGHDEPELDPVSNLAGSAQDITWRFVLPSNGSFPVDAVGPTFWFGGTVNDPNSLFGQAFEELQFYPNALVSNCNPNGAFVVSNVPGDYTVCSPVWSLTTTGQKPNYHEPAAFNAMLTAGPTSAPLVMHAGDAVSVHYFITSAKDGWHITVSDATTG |
Ga0207631_103051 | Ga0207631_1030511 | F009131 | KCYVKSIECVKKIKSSFASVGAFSSEESFSRGNPEGPLEWISHEAEAFEEILSSRSDICAFSGARGIASVLEKKGCEHVKFLAQSEATLSSEDIKDPSAEASAVGGKFFTDIWNDGGRGMAREIIERSEKGIHDARRVADAAERSREPERQLGTN |