Basic Information | |
---|---|
IMG/M Taxon OID | 3300026828 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0072099 | Ga0207502 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G08A5-12 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 33819262 |
Sequencing Scaffolds | 39 |
Novel Protein Genes | 41 |
Associated Families | 40 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root1462 | 2 |
Not Available | 19 |
All Organisms → cellular organisms → Bacteria | 4 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei | 1 |
All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1 |
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → unclassified Nitrosopumilales → Nitrosopumilales archaeon | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 1 |
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 2 |
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1 |
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1 |
All Organisms → cellular organisms → Archaea | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | Wisconsin, United States | |||||||
Coordinates | Lat. (o) | 43.3 | Long. (o) | -89.38 | Alt. (m) | N/A | Depth (m) | 0 | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000268 | Metagenome / Metatranscriptome | 1411 | Y |
F002103 | Metagenome / Metatranscriptome | 593 | Y |
F007000 | Metagenome / Metatranscriptome | 360 | Y |
F011852 | Metagenome / Metatranscriptome | 286 | Y |
F011965 | Metagenome | 285 | Y |
F014308 | Metagenome / Metatranscriptome | 264 | Y |
F015492 | Metagenome / Metatranscriptome | 254 | Y |
F016054 | Metagenome | 250 | Y |
F017514 | Metagenome / Metatranscriptome | 240 | Y |
F017538 | Metagenome / Metatranscriptome | 240 | Y |
F023931 | Metagenome | 208 | Y |
F024822 | Metagenome / Metatranscriptome | 204 | N |
F025096 | Metagenome / Metatranscriptome | 203 | Y |
F026499 | Metagenome | 197 | N |
F032172 | Metagenome / Metatranscriptome | 180 | Y |
F032607 | Metagenome / Metatranscriptome | 179 | Y |
F037759 | Metagenome / Metatranscriptome | 167 | N |
F038328 | Metagenome / Metatranscriptome | 166 | Y |
F038480 | Metagenome | 166 | Y |
F040813 | Metagenome / Metatranscriptome | 161 | Y |
F041300 | Metagenome / Metatranscriptome | 160 | Y |
F043066 | Metagenome / Metatranscriptome | 157 | Y |
F045732 | Metagenome / Metatranscriptome | 152 | N |
F045848 | Metagenome / Metatranscriptome | 152 | Y |
F050726 | Metagenome | 145 | Y |
F053646 | Metagenome / Metatranscriptome | 141 | Y |
F063452 | Metagenome | 129 | Y |
F064047 | Metagenome / Metatranscriptome | 129 | Y |
F068281 | Metagenome | 125 | Y |
F071766 | Metagenome | 122 | Y |
F072495 | Metagenome | 121 | N |
F077375 | Metagenome / Metatranscriptome | 117 | Y |
F081321 | Metagenome / Metatranscriptome | 114 | N |
F082749 | Metagenome / Metatranscriptome | 113 | Y |
F083408 | Metagenome / Metatranscriptome | 113 | Y |
F084203 | Metagenome / Metatranscriptome | 112 | N |
F089000 | Metagenome | 109 | N |
F089166 | Metagenome / Metatranscriptome | 109 | Y |
F099265 | Metagenome / Metatranscriptome | 103 | N |
F100610 | Metagenome | 102 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0207502_100210 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1794 | Open in IMG/M |
Ga0207502_100213 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root1462 | 1790 | Open in IMG/M |
Ga0207502_100384 | Not Available | 1539 | Open in IMG/M |
Ga0207502_100400 | All Organisms → cellular organisms → Bacteria | 1524 | Open in IMG/M |
Ga0207502_100582 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root1462 | 1371 | Open in IMG/M |
Ga0207502_100997 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei | 1155 | Open in IMG/M |
Ga0207502_101046 | Not Available | 1142 | Open in IMG/M |
Ga0207502_101216 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1084 | Open in IMG/M |
Ga0207502_101307 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → unclassified Nitrosopumilales → Nitrosopumilales archaeon | 1058 | Open in IMG/M |
Ga0207502_101360 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 1044 | Open in IMG/M |
Ga0207502_101378 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota | 1039 | Open in IMG/M |
Ga0207502_102040 | Not Available | 901 | Open in IMG/M |
Ga0207502_102461 | Not Available | 844 | Open in IMG/M |
Ga0207502_102710 | Not Available | 817 | Open in IMG/M |
Ga0207502_102724 | Not Available | 816 | Open in IMG/M |
Ga0207502_102783 | Not Available | 810 | Open in IMG/M |
Ga0207502_102925 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 795 | Open in IMG/M |
Ga0207502_102941 | Not Available | 794 | Open in IMG/M |
Ga0207502_103225 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 769 | Open in IMG/M |
Ga0207502_103521 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 745 | Open in IMG/M |
Ga0207502_103954 | Not Available | 714 | Open in IMG/M |
Ga0207502_104048 | Not Available | 709 | Open in IMG/M |
Ga0207502_104228 | Not Available | 698 | Open in IMG/M |
Ga0207502_104701 | Not Available | 671 | Open in IMG/M |
Ga0207502_104897 | All Organisms → cellular organisms → Bacteria | 661 | Open in IMG/M |
Ga0207502_105044 | Not Available | 653 | Open in IMG/M |
Ga0207502_105513 | Not Available | 631 | Open in IMG/M |
Ga0207502_106333 | All Organisms → cellular organisms → Bacteria | 602 | Open in IMG/M |
Ga0207502_106606 | Not Available | 593 | Open in IMG/M |
Ga0207502_106731 | Not Available | 589 | Open in IMG/M |
Ga0207502_106945 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 582 | Open in IMG/M |
Ga0207502_107136 | Not Available | 577 | Open in IMG/M |
Ga0207502_107712 | Not Available | 560 | Open in IMG/M |
Ga0207502_108048 | All Organisms → cellular organisms → Bacteria | 552 | Open in IMG/M |
Ga0207502_108084 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 551 | Open in IMG/M |
Ga0207502_108096 | Not Available | 551 | Open in IMG/M |
Ga0207502_108485 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 541 | Open in IMG/M |
Ga0207502_108862 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 531 | Open in IMG/M |
Ga0207502_109624 | All Organisms → cellular organisms → Archaea | 515 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207502_100102 | Ga0207502_1001023 | F082749 | RGGMDGESVDAAGKLGRKRLINHAMTLDAGLSFERLRHDIHPEVSLPARPVPGMTLVLVRFINHFEALRHESLGQLLCDEIGGSHIARLGERGLPVNGHKQVLKASPATAHNVRS |
Ga0207502_100210 | Ga0207502_1002102 | F015492 | MKRQVSIYVAALLALLYAGLQSGARAQVFDFGQIEEFESLGSGTQKGGSPPKTIIDDGARHTVLFTILESNTEAKIHWKSKDGSQTTIMRGQGLRAFQTIGEFRIEAAGDDSRSFRYGYVLFRLKSEKSAQEDKI |
Ga0207502_100213 | Ga0207502_1002133 | F064047 | MTLMKSSRAGLFALAFGILLSLFLSVPSYDVSYAQTQA |
Ga0207502_100384 | Ga0207502_1003841 | F045732 | LRLRGPNVVSKMRHRRRRVFIDTHKGRVSAGFTVAAEADAADVSMRLRERGWIAYRLRLEAEQYAWIATVIDWARRAA |
Ga0207502_100400 | Ga0207502_1004003 | F011852 | MHVSGVCVQMRMPLFSYFVVMGSTLTLGLIYISNRIEPLGSPVPTSQIVGLARPYMPEPEQSPYAVTGTNFAAANKPAAARAAAETTARRADSLQQQPAANTEVRRVPRWKHIAQNPIAALMGVH |
Ga0207502_100582 | Ga0207502_1005823 | F083408 | KRGADVLVRVEKMMDDANRALGPVAPARPTVSSGKDKQAQVPDKGEALAASLEGRAFGSH |
Ga0207502_100997 | Ga0207502_1009971 | F084203 | MRARIRRALWMFGALAFVAMPASAQESTEVAPLTTEDSALLANALVFDPGALATAPKKPLRLPGYRNNAYDITRTQKVDGSTTVVVKQPVQTEWSNSVGADLAPSKPTAYPLPLSTERNNGMPAGAAW |
Ga0207502_101046 | Ga0207502_1010463 | F068281 | RSIGRTHRLARCKIAGQHFEKPAEVRLVPIADISYAQKKVRFVAFLVGTSAAPSELIIMTYQSDARRDKRDDKFYISWIIRGGIVLVIVIAALAFTSTGNYPDLDVPQMTRTVPGPAS |
Ga0207502_101216 | Ga0207502_1012161 | F011965 | LAYHVLVVGADAWICAAAGSAGQNTYTQIYCLNPHDPSQHKFIDILKKTINGIAQHDPHWPTSAAGQTIGIHAMYGSAAGAWLDVGFVHHSWGANGEAVLNLSTNTWSLKTNADMYSSGHSSIGGKFVNGSGSINGMDSRGALLRDPNNLMDATKYSFIMQPPSTAGWYDAEHSSWFNSSTNPQAPVLFSRYNCTAPPGPLTWYGEIIAAATDGSNTVWRFAHNHNGGLVGFAGQSFAQISNDGRWALFSSYWDGTLGAAAGDFGFSKRIDTFIVDLVTSASPPTPSPGPTCLRYNPNGRCLKWSN |
Ga0207502_101307 | Ga0207502_1013071 | F099265 | MDKMNEDTILHFYRILENSLLESDISKINEEDIDAWSQSFKKVVRESREKSGKGVFVPFLMWKLGELSPVEASKYLVNRKQDECRVSYDHNNVEYILRVMA |
Ga0207502_101360 | Ga0207502_1013603 | F038480 | ILVSIVLYFAATAAHSMPISVLNANGLSATIPISDQCGDRCGSSRSYVKDRRSGVGGYSGGYVLVRDPLIQRRPFCPFGSYVACVVSGTYCVDLCH |
Ga0207502_101378 | Ga0207502_1013783 | F100610 | MMPVIISLFGNEDMEISEFPLTNKSVFYRCSYGECFRFIADKCMHCCANPIPDTEMHINYLRRLPDIKKAVTDLG |
Ga0207502_102040 | Ga0207502_1020402 | F072495 | MLANEVEWLKPPQYVAFRTMERWVVIPVIARIESREGGRLYGYWFSRFTPEGAFREVPVDYVVDLGSDDDAQKLVDLIRGGRELPAEFVQRLHVIAD |
Ga0207502_102461 | Ga0207502_1024611 | F017538 | MARTPMPASLTLVRASRNRDGEWSSDDYDVFEGKQLVGRITLTPQAPEGRPWFWTITARPESSQNQGYAVSREQAMLE |
Ga0207502_102710 | Ga0207502_1027101 | F077375 | MVGKIARGAYVLVVGTMVVAWVISVNKEAPAKPQQTGPQ |
Ga0207502_102724 | Ga0207502_1027241 | F037759 | MKGTGLLHPGKLAAVIVAGGLLSGAARAQSPELGAPSIGILPPSDILASVSYLGLDPSGEPVRRGAYYMLHAFDRAGIELLVVVDAQFGDVLFMAPALNTSLTPPYVRAARIIQVEPPESGGQQKK |
Ga0207502_102783 | Ga0207502_1027833 | F089000 | MGEPTPATPASKYFAATVAMIAGAFFFAVGAGLLPIPGGPSNLHGPLWLLLCVGLAFFLAGLAILIPMLGHANDSGDLPAGAPFWLRAMQYLIGLCIFACFG |
Ga0207502_102925 | Ga0207502_1029253 | F089166 | MCIACELGYWAMVDALEAERNAAKKNNAGDHPAFSCEPEAEPSPAPR |
Ga0207502_102941 | Ga0207502_1029411 | F025096 | MGNTNFMVLNKRGTWRLCWLFVAVCALSACSHQSLQEPVPSFSSVFPYESRDVGNYSVDDHTKAPLQWVVQAWVKTETTTLYAKSINLEGIVQHVTWNNEGAPSVGVQHIPGDVRTIPFAWMNAKEILLITEPVRVHFYTLLKEESSAPTPMDH |
Ga0207502_103225 | Ga0207502_1032251 | F007000 | MMRGPHHTLLVSILIAAPVVAQNPGSVPRAILLPDTLGANFAAADTLTGTSGPADYDFLVGTWRFTFQARRRDGSFTPAFTGHWVFTKKQTGGQGVLLEDHWRPDDATSRWEAGTWTYRAYNPERKIWEMQGINTNVGAWQPGLMWTAGESRLLTEWYGPMLVRFRYFAIQPDKFLWRADATFDRGKSWIADYWTMEVHRISR |
Ga0207502_103521 | Ga0207502_1035212 | F040813 | LAGKLSVWKQKIVVFAGTLGAPGLFLISFLDSSVLTFPVINDLLLIELS |
Ga0207502_103954 | Ga0207502_1039542 | F050726 | ETDRWTLAEELWSFGEDSLYPVALQLSDEDMVRLWLLAGGLLLKERARSSGEATALAAVAVIEGNQRPLARKRRRPQPNRLRFEQTPEERYAEISRIEDSPSFDEKWR |
Ga0207502_104048 | Ga0207502_1040481 | F041300 | LPFFSAVDYRRYAAECVRLAQQVADPDDKVRLLDMAETFRELADKNDA |
Ga0207502_104149 | Ga0207502_1041492 | F023931 | MNEAQTLSYTRAQTATRLRIYQVLFAISIIAGLLAGLWCIFDPVGFAQLVFQIDPYPQTWPRIWGATLFGLQLAYIPGVRNPSFYRWPNWASIAIKFLMTIIFLTAGSSFYLLAAWELVWFVILLVAYYRLM |
Ga0207502_104228 | Ga0207502_1042282 | F024822 | LGEQLRLQFYSLRKGTADEPEQATYKWDRGAYQRTGGGMTDISSFSVHPLARDIFVVQSAAAKRPGMFEYAVARRLVDGVYQVIAIDEADAGRVTRARFCKRASDSSCRIQTRNQLYAFARATAERRRGQGGLVLRLADGVAESSR |
Ga0207502_104701 | Ga0207502_1047011 | F043066 | MAAAEQHRYEPGHSITRFCCGESMKLVKAIPRIGSYPELQTYRCERCHNVETIEVKITGS |
Ga0207502_104897 | Ga0207502_1048971 | F032607 | ALLLRFFGFIPWAAISFLAGALVSRIGWIAVGKVSGADPEAVLASQR |
Ga0207502_105044 | Ga0207502_1050441 | F026499 | ILELHNVHQRQANELQDAHDEIDRLSQTVSALQEAVTQYQAGAAAAEDEIVLLESEKAALQAQLDGAFEESKTLADRVLAAEAAAKRREENIASSLKQIDFLNAELMAASSERFKVVAAMQGEQRRQRSVFSQQKSILEDKLQEKEALAATQGTKIKQLEGVRDELDKRVRVIEALLASEREVAERKTRRPTEILGAAG |
Ga0207502_105513 | Ga0207502_1055132 | F053646 | MTKRLILPALLLAALALAATAAAGSGQGKGKGKGHGHHGKFGPYDVVTDDHGSCSNAWAVDTEKRTFKVRRNN |
Ga0207502_106333 | Ga0207502_1063332 | F000268 | MRVVAVMLLLSAGIAAEAMSYSFVSKASGRLGGPIRFEFYRDSTTRPKTDIESFTVSMRTADDRWKAMWSILSGRGLTQPIEYGVTPRGFTTMIKPQKLIAGRVYRGSATDEHGGSSGVTFGFD |
Ga0207502_106606 | Ga0207502_1066061 | F081321 | GNARFATLSQLMRIVSRAAALVSLALMLASPARAACTGSCEPSVEVAQAAMQKIFKETFLSPYTLISFERLDGRSGERYGGAFYEMRIRAVLHYDGVRLRCRRPSCPELHHYLLENDAASKKATVAGWLFLANDGDGWKTVPLTLQSPQ |
Ga0207502_106731 | Ga0207502_1067312 | F032172 | MFVSHMDGVGRAMTMMKLEAEAPVQIIRGRFGPSGGIIPELDKDRQVIPTGYFNNRLGFHALMRAVGVDERVVTLNELFANPKLNLEITRRIEAGQKSVSISGEDAAKTGEF |
Ga0207502_106945 | Ga0207502_1069451 | F002103 | MSTDEPFRTDYEFLKGVDYIFVSLDRNLSGEECHELAKKYFETHKGMTLPGQALRVDLRPAFGKPLADVTPKFRAVSIGYTFTPQR |
Ga0207502_107136 | Ga0207502_1071361 | F017514 | MPQGKERTKEQLLKEAKRLGIKGRSRMNKGALKAAVDR |
Ga0207502_107712 | Ga0207502_1077122 | F045732 | KLRGPRAESLVSKMRHRRRKVFIDTHKGRVSAGFTVAAEADAADVSMRLRERGWIAYRLRLEAEQYAWVATVIDWTRRAA |
Ga0207502_108048 | Ga0207502_1080482 | F014308 | MFMAGSTIGIQLIMPKTGTSPKATPEQQASERDAAAQPVVKAPPPPGMGKIVDKIA |
Ga0207502_108084 | Ga0207502_1080841 | F063452 | LSKAGTMNVDPIPKNLIAVHDLLTEALAISEDSGGACRYGIALFPRAGRFEAPVFGVVTRGGESTTLTYRLLRSLMERTVLVSGRVTATLSDGMSYSSTRTAPPALLDQPVLHLSIRCVIGTTPENHDRTAPLDSSLFAGDLKAPLVLLMESRPEGWPR |
Ga0207502_108096 | Ga0207502_1080961 | F071766 | MDEDFITLEVEEEGRGRLQFELPLDITDEEIAYITRAESGLLELLDPDTGEVVFSCTPVLVH |
Ga0207502_108485 | Ga0207502_1084852 | F045848 | LANVILAGYRSTWLLPEKSREPWLAAEEETARQGLGASTQVQERSVLRATIAQVRERFAAWKLELPRIDHPEIGTI |
Ga0207502_108862 | Ga0207502_1088621 | F038328 | YQTQVPVVRWDPVPGAASYEVQVADWNGTACLWGTADYLKNTAVPEWAPLASTSADPAPWQGTLAEDVLPIITPGDYCFRVRARADRAPGNQEVWGDYTYLQNGNVDSKDPVGPAFTWTAYPSAADPTAAGACLFGYPCGSSYLGPATGATSTRTPLFTWDAISGANSYFVVVSKD |
Ga0207502_109624 | Ga0207502_1096241 | F016054 | FVGHSAWMVESLLENEKEPDVYELGVAEGIKEMLALRGFTKEKILNSTVSNLAETLQIDYYVALIIYNSAKKN |
⦗Top⦘ |