Basic Information | |
---|---|
Family ID | F086878 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 110 |
Average Sequence Length | 50 residues |
Representative Sequence | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGD |
Number of Associated Samples | 107 |
Number of Associated Scaffolds | 110 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Unclassified |
% of genes with valid RBS motifs | 100.00 % |
% of genes near scaffold ends (potentially truncated) | 6.36 % |
% of genes from short scaffolds (< 2000 bps) | 6.36 % |
Associated GOLD sequencing projects | 104 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.66 |
Hidden Markov Model |
---|
Powered by Skylign |
Most Common Taxonomy | |
---|---|
Group | Unclassified (93.636 % of family members) |
NCBI Taxonomy ID | N/A |
Taxonomy | N/A |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (12.727 % of family members) |
Environment Ontology (ENVO) | Unclassified (33.636 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (34.545 % of family members) |
⦗Top⦘ |
⦗Top⦘ |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 44.87% β-sheet: 0.00% Coil/Unstructured: 55.13% | Feature Viewer |
|
|||||
Powered by Feature Viewer |
Structure Viewer | |
---|---|
| |
Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.66 |
Powered by PDBe Molstar |
⦗Top⦘ |
Pfam ID | Name | % Frequency in 110 Family Scaffolds |
---|---|---|
PF03259 | Robl_LC7 | 75.45 |
PF00025 | Arf | 14.55 |
PF00071 | Ras | 4.55 |
PF02518 | HATPase_c | 0.91 |
COG ID | Name | Functional Category | % Frequency in 110 Family Scaffolds |
---|---|---|---|
COG2018 | Predicted regulator of Ras-like GTPase activity, Roadblock/LC7/MglB family | Signal transduction mechanisms [T] | 75.45 |
COG1100 | GTPase SAR1 family domain | General function prediction only [R] | 14.55 |
⦗Top⦘ |
Name | Rank | Taxonomy | Distribution |
Unclassified | root | N/A | 93.64 % |
All Organisms | root | All Organisms | 6.36 % |
Visualization |
---|
Powered by ApexCharts |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
3300002245|JGIcombinedJ26739_100228780 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1750 | Open in IMG/M |
3300019880|Ga0193712_1016607 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1535 | Open in IMG/M |
3300020002|Ga0193730_1025923 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1696 | Open in IMG/M |
3300020003|Ga0193739_1016318 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1938 | Open in IMG/M |
3300022531|Ga0242660_1009446 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1626 | Open in IMG/M |
3300022724|Ga0242665_10015695 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1657 | Open in IMG/M |
3300025965|Ga0210090_1004467 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1870 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
⦗Top⦘ |
Habitat | Taxonomy | Distribution |
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 12.73% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.82% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 10.91% |
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 3.64% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.64% |
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.64% |
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.64% |
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 2.73% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.73% |
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.73% |
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.73% |
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 2.73% |
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.82% |
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.82% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.82% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 1.82% |
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.82% |
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 1.82% |
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.82% |
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.82% |
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.82% |
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.91% |
Hot Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Hot Spring | 0.91% |
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.91% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.91% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.91% |
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.91% |
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.91% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.91% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.91% |
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.91% |
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.91% |
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.91% |
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.91% |
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.91% |
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.91% |
Environmental → Unclassified → Unclassified → Unclassified → Unclassified → | 0.91% | |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.91% |
Corn, Switchgrass And Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.91% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.91% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.91% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.91% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.91% |
Visualization |
---|
Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Taxon OID | Sample Name | Habitat Type | IMG/M Link |
---|---|---|---|
2124908016 | Sample 642 | Environmental | Open in IMG/M |
3300001990 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C3 | Host-Associated | Open in IMG/M |
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental | Open in IMG/M |
3300004052 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2 | Environmental | Open in IMG/M |
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental | Open in IMG/M |
3300004145 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqB_D2 | Environmental | Open in IMG/M |
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental | Open in IMG/M |
3300005347 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG | Host-Associated | Open in IMG/M |
3300005441 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG | Environmental | Open in IMG/M |
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental | Open in IMG/M |
3300005563 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 | Host-Associated | Open in IMG/M |
3300005616 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 | Host-Associated | Open in IMG/M |
3300005719 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 | Host-Associated | Open in IMG/M |
3300005875 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101 | Environmental | Open in IMG/M |
3300006041 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 | Environmental | Open in IMG/M |
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M |
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental | Open in IMG/M |
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated | Open in IMG/M |
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated | Open in IMG/M |
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated | Open in IMG/M |
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental | Open in IMG/M |
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated | Open in IMG/M |
3300009814 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60 | Environmental | Open in IMG/M |
3300009836 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_10_20 | Environmental | Open in IMG/M |
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental | Open in IMG/M |
3300010313 | Hot spring microbial communities from South Africa to study Microbial Dark Matter (Phase II) - Sagole hot spring metaG | Environmental | Open in IMG/M |
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental | Open in IMG/M |
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental | Open in IMG/M |
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental | Open in IMG/M |
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental | Open in IMG/M |
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental | Open in IMG/M |
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental | Open in IMG/M |
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental | Open in IMG/M |
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental | Open in IMG/M |
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental | Open in IMG/M |
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental | Open in IMG/M |
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental | Open in IMG/M |
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental | Open in IMG/M |
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental | Open in IMG/M |
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental | Open in IMG/M |
3300013102 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C4-5 metaG | Host-Associated | Open in IMG/M |
3300013307 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaG | Host-Associated | Open in IMG/M |
3300014326 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaG | Host-Associated | Open in IMG/M |
3300014873 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10D | Environmental | Open in IMG/M |
3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental | Open in IMG/M |
3300017656 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015 | Environmental | Open in IMG/M |
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental | Open in IMG/M |
3300017973 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_20_MG | Environmental | Open in IMG/M |
3300018051 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1 | Environmental | Open in IMG/M |
3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental | Open in IMG/M |
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
3300018469 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T | Environmental | Open in IMG/M |
3300019259 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300019789 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction) | Environmental | Open in IMG/M |
3300019880 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a1 | Environmental | Open in IMG/M |
3300019999 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a1 | Environmental | Open in IMG/M |
3300020001 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a2 | Environmental | Open in IMG/M |
3300020002 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a1 | Environmental | Open in IMG/M |
3300020003 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2a2 | Environmental | Open in IMG/M |
3300020010 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1s2 | Environmental | Open in IMG/M |
3300020067 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLIBT47_16_1Ra (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental | Open in IMG/M |
3300022506 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-26-M (Metagenome Metatranscriptome) (v2) | Environmental | Open in IMG/M |
3300022531 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2) | Environmental | Open in IMG/M |
3300022724 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2) | Environmental | Open in IMG/M |
3300023057 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S136-409B-6 | Environmental | Open in IMG/M |
3300025165 | Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1 | Environmental | Open in IMG/M |
3300025167 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 19_2 (SPAdes) | Environmental | Open in IMG/M |
3300025901 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4 (SPAdes) | Host-Associated | Open in IMG/M |
3300025935 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300025941 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300025959 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqB_D2 (SPAdes) | Environmental | Open in IMG/M |
3300025965 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2 (SPAdes) | Environmental | Open in IMG/M |
3300026355 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-A | Environmental | Open in IMG/M |
3300026360 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-B | Environmental | Open in IMG/M |
3300026361 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-B | Environmental | Open in IMG/M |
3300026371 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-B | Environmental | Open in IMG/M |
3300026446 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-B | Environmental | Open in IMG/M |
3300026469 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-B | Environmental | Open in IMG/M |
3300026496 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-A | Environmental | Open in IMG/M |
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental | Open in IMG/M |
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental | Open in IMG/M |
3300027424 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 S PM (SPAdes) | Host-Associated | Open in IMG/M |
3300027583 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes) | Environmental | Open in IMG/M |
3300027614 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant Co S AM (SPAdes) | Host-Associated | Open in IMG/M |
3300027650 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 HiSeq | Environmental | Open in IMG/M |
3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental | Open in IMG/M |
3300027915 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) | Environmental | Open in IMG/M |
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental | Open in IMG/M |
3300028673 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-B | Environmental | Open in IMG/M |
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental | Open in IMG/M |
3300028793 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159 | Environmental | Open in IMG/M |
3300031114 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental | Open in IMG/M |
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental | Open in IMG/M |
3300031854 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D1 | Environmental | Open in IMG/M |
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental | Open in IMG/M |
3300031949 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197 | Environmental | Open in IMG/M |
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
3300032828 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4 | Environmental | Open in IMG/M |
3300033814 | Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17 | Environmental | Open in IMG/M |
3300034155 | Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_05D_17 | Environmental | Open in IMG/M |
3300034165 | Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17 | Environmental | Open in IMG/M |
Geographical Distribution | |
---|---|
Zoom: | Powered by OpenStreetMap |
⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Protein ID | Sample Taxon ID | Habitat | Sequence |
OU_00374440 | 2124908016 | MAQIGGKAGLSEEERQSLSESARLCEMIIEANPADTGALETLKEVYTKLGDRERLSQVV | |
JGI24737J22298_100428411 | 3300001990 | Corn Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVV |
JGIcombinedJ26739_1002287801 | 3300002245 | Forest Soil | MAQIESXPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARV |
Ga0055490_100256834 | 3300004052 | Natural And Restored Wetlands | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLAR |
Ga0062593_1003053263 | 3300004114 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIY |
Ga0055489_100762411 | 3300004145 | Natural And Restored Wetlands | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLA |
Ga0062589_1000761414 | 3300004156 | Soil | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTGALE |
Ga0070668_1006308431 | 3300005347 | Switchgrass Rhizosphere | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRESLARVVARL |
Ga0070700_1002351601 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKL |
Ga0070708_1008858291 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MTQGSRLTSFSDDERQSLAESARLCEMIVEANPSDTGALETLKEI |
Ga0066697_103698281 | 3300005540 | Soil | VSGKASLSDEERRSLVESARLCEMIVEANPSDTGALETLKETYTKLGDRERLGQVVG |
Ga0068855_1016760182 | 3300005563 | Corn Rhizosphere | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTGALETLKEIYTKLG |
Ga0068852_1000972671 | 3300005616 | Corn Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENL |
Ga0068861_1000062381 | 3300005719 | Switchgrass Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKE |
Ga0075293_10086621 | 3300005875 | Rice Paddy Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGD |
Ga0075023_1000872181 | 3300006041 | Watersheds | MAQIESKPLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTK |
Ga0066665_112256462 | 3300006796 | Soil | VSGKASLSDEERRSLVESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGQVVGRMA |
Ga0079221_109825312 | 3300006804 | Agricultural Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLA |
Ga0075421_1021495441 | 3300006845 | Populus Rhizosphere | MAQIENKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVA |
Ga0075425_1017608931 | 3300006854 | Populus Rhizosphere | MIHHDGLMALSDDDRQSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRDRL |
Ga0075435_1020129582 | 3300007076 | Populus Rhizosphere | MAQVNAKAPISDEERQSLVESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGHVMARLATL |
Ga0099830_116224641 | 3300009088 | Vadose Zone Soil | MPKAEPKASSLSSEEKRSLSESARLCEMIVQAIPSDTGALET |
Ga0111539_107819193 | 3300009094 | Populus Rhizosphere | MPDVNSKTSFSDEERHSLVESARLCEMIIEANPSDTGALETLKEIY |
Ga0105082_10817541 | 3300009814 | Groundwater Sand | MAQIESKTLSTEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLG |
Ga0105068_10423563 | 3300009836 | Groundwater Sand | MAQASGRTSLSDEERQALAESAQVCEMIVEANPSDTGALETLKE |
Ga0126382_108298141 | 3300010047 | Tropical Forest Soil | MPDVNSKPSFSDEERHSLVESASLCEMIVEANPSDTGALETLKEIYTKLGDRERLAQVM |
Ga0116211_11340211 | 3300010313 | Hot Spring | MAQPSGKPAFSDEERQALAESARLCEMIIEANPSDTG |
Ga0126377_110374493 | 3300010362 | Tropical Forest Soil | MPDVNSKPSFSDEERHSLVESASLCEMIVEANPSDTGALETLKEIYTKLGDRERLSQVMA |
Ga0126383_109572372 | 3300010398 | Tropical Forest Soil | MAPPSGKPSFSDEERQSLAESARLCEMIIEANPADTGALETLKEI |
Ga0150983_134103703 | 3300011120 | Forest Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRE |
Ga0137389_104978321 | 3300012096 | Vadose Zone Soil | MAQASGKASFSDEARQSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGQVVG |
Ga0137383_103424783 | 3300012199 | Vadose Zone Soil | MAQIVSKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYT |
Ga0137363_115385532 | 3300012202 | Vadose Zone Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGAL |
Ga0137399_105436311 | 3300012203 | Vadose Zone Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVA |
Ga0137376_115689872 | 3300012208 | Vadose Zone Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVV |
Ga0137372_112417172 | 3300012350 | Vadose Zone Soil | MAQASGRTSLSQEERQALAESAQVCEMIVEANPSDTGALETLKEIYTKLGDRER |
Ga0137369_102847191 | 3300012355 | Vadose Zone Soil | MAQTDNKTSLSSEEKRSLVESVRLCEMIVEANPLDIGALEILKEIYTKLGS |
Ga0137358_102692243 | 3300012582 | Vadose Zone Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLSG |
Ga0137398_107814602 | 3300012683 | Vadose Zone Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLGRVVARLAG |
Ga0137359_116719981 | 3300012923 | Vadose Zone Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTG |
Ga0137404_110242531 | 3300012929 | Vadose Zone Soil | MAPVSGKAGLSEVERQSLTESAQLCEMIVEVIPSDTGALENMKEIYTKL |
Ga0164301_103465453 | 3300012960 | Soil | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLAG |
Ga0157371_109234231 | 3300013102 | Corn Rhizosphere | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTGALETLKEIYTKLADRDNLARVVARLA |
Ga0157372_130587901 | 3300013307 | Corn Rhizosphere | MAKTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALET |
Ga0157380_102639234 | 3300014326 | Switchgrass Rhizosphere | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALE |
Ga0180066_10021711 | 3300014873 | Soil | MPQSEPTVSLSSEEKRSLFESARLCEMIVQANPSDTGALETLKEIYSKLGDPENLS |
Ga0134085_105562461 | 3300015359 | Grasslands Soil | MAQASGKASFSAEERQSLMESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGPVV |
Ga0134112_102979002 | 3300017656 | Grasslands Soil | MAQASDKASFSAEERQSLVESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGQVVGRMA |
Ga0134083_104700082 | 3300017659 | Grasslands Soil | MAQASDKASFSAEERQSLVESARLCEMIVEANPSD |
Ga0187824_101073613 | 3300017927 | Freshwater Sediment | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDT |
Ga0187780_105667381 | 3300017973 | Tropical Peatland | MAQIDNKTSLSPEEKRSLSESARLCEMIVEANPSDTG |
Ga0184620_101006381 | 3300018051 | Groundwater Sediment | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVAR |
Ga0184609_104575891 | 3300018076 | Groundwater Sediment | MAQIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLAR |
Ga0066667_120803732 | 3300018433 | Grasslands Soil | VSGKASLSDEERRSLVESARLCEMIVEANPSDTGALETLKETYTRL |
Ga0190270_111350551 | 3300018469 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDT |
Ga0184646_15049063 | 3300019259 | Groundwater Sediment | MAQIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLA |
Ga0137408_11712901 | 3300019789 | Vadose Zone Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRVGRAWPG |
Ga0193712_10166074 | 3300019880 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTK |
Ga0193718_11004231 | 3300019999 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLG |
Ga0193731_10129581 | 3300020001 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRESL |
Ga0193730_10259231 | 3300020002 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRESLARVV |
Ga0193739_10163184 | 3300020003 | Soil | MAQIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLA |
Ga0193749_10208041 | 3300020010 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKE |
Ga0180109_12435373 | 3300020067 | Groundwater Sediment | MAQIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIY |
Ga0179592_101146761 | 3300020199 | Vadose Zone Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRE |
Ga0210403_114789001 | 3300020580 | Soil | MAQIEKTSLSQEEKRSLSESARLCEMIVEANPSDTGALETLKEIYSKLGDQE |
Ga0242648_10345013 | 3300022506 | Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKE |
Ga0242660_10094461 | 3300022531 | Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDR |
Ga0242665_100156951 | 3300022724 | Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGD |
Ga0247797_10114351 | 3300023057 | Soil | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTGA |
Ga0209108_102707223 | 3300025165 | Soil | MAQKDSKGSMSTEEKHSLEESARLCEMIVEANPSDTGALETLKEIYTKLGDREKLAKIVVRLA |
Ga0209642_101018434 | 3300025167 | Soil | MAQKDSKGSMSTEEKHSLEESARLCEMIVEANPSDTGALETLKEIYTKLGDR |
Ga0207688_101407244 | 3300025901 | Corn, Switchgrass And Miscanthus Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLA |
Ga0207709_102546133 | 3300025935 | Miscanthus Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLK |
Ga0207711_102518324 | 3300025941 | Switchgrass Rhizosphere | MAQTESKPLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVTPATVITTDT |
Ga0210116_11140691 | 3300025959 | Natural And Restored Wetlands | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLAG |
Ga0210090_10044671 | 3300025965 | Natural And Restored Wetlands | MAPIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKL |
Ga0257149_10097051 | 3300026355 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTK |
Ga0257173_10002764 | 3300026360 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLGRVV |
Ga0257176_10309123 | 3300026361 | Soil | MAQIDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGA |
Ga0257179_10212711 | 3300026371 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALET |
Ga0257178_10478832 | 3300026446 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLGR |
Ga0257169_10804882 | 3300026469 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDREN |
Ga0257157_10039444 | 3300026496 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEI |
Ga0209056_106134802 | 3300026538 | Soil | VSGKASLSDEERRSLVESARLCEMIVEANPSDTGALETLKETYTRLGDRERLGQVVG |
Ga0209161_103729022 | 3300026548 | Soil | MAQAGGKASLSDEERQSLAESARLCEMIVEANPSDT |
Ga0209984_10021221 | 3300027424 | Arabidopsis Thaliana Rhizosphere | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTGALETLKEIYTKLADRDN |
Ga0209527_10390583 | 3300027583 | Forest Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETL |
Ga0209970_10079981 | 3300027614 | Arabidopsis Thaliana Rhizosphere | MAQIENRTSLSPEEKRSLSESARLCEMIVEANPSDTG |
Ga0256866_11137821 | 3300027650 | Soil | MAPSDNKTLLSSEEKRSLAESARLCEMIVEANPSDT |
Ga0209583_107735032 | 3300027910 | Watersheds | MAQIEKTSLSQEEKRSLSESARLCEMIVEANPSDTGALETLKEIYSK |
Ga0209069_106728652 | 3300027915 | Watersheds | MAQIEKTSLSQEEKRSLSESARLCEMIVEANTSDT |
Ga0209526_102505561 | 3300028047 | Forest Soil | MAQTESKPLSPEEKRSLAESARLCEMIVEANPSDTGALET |
Ga0257175_10319703 | 3300028673 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLK |
Ga0307504_100587503 | 3300028792 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGAL |
Ga0307504_100862343 | 3300028792 | Soil | MARKDTGDSLSSDERSSLLESARLCEMIVEANPADTGALETLKEIYTKLDDREKLAKI |
Ga0307299_103508331 | 3300028793 | Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVV |
Ga0308187_100794571 | 3300031114 | Soil | MAQIESKPLSPEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLAGT |
(restricted) Ga0255311_10667483 | 3300031150 | Sandy Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKLGDRENLARVVARLS |
Ga0307473_114446242 | 3300031820 | Hardwood Forest Soil | MSQTSRAAPLSEAERQSLGESARLCEMIVEANPSDTGALETLKEIYTKLGDRERLGQ |
Ga0310904_109158862 | 3300031854 | Soil | MPDVNSKTSFSDEERHSLVESARLCEMIIEANPSDTGALET |
Ga0306921_114423011 | 3300031912 | Soil | MPDVNSKAPFSDEERHSLVESARLCEMIVEANPSDTGALETLKEIYTKL |
Ga0214473_119918572 | 3300031949 | Soil | MAQIDSKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLKEI |
Ga0214473_121200522 | 3300031949 | Soil | MTQADSRSSIAPEERDSLLESARLCEMIVEANPSDTGALETLKEIYTKLGDRD |
Ga0307471_1000348081 | 3300032180 | Hardwood Forest Soil | MAQIEKTSLSQEEKRSLSESARLCEMIVEANPSDTGALETLKEIYSKLGE |
Ga0307471_1034402382 | 3300032180 | Hardwood Forest Soil | MTQTSGTASFSEEERQSFAESARLCEMIIEANPFDTGALETLKEIYTKLGDRAR |
Ga0335080_121794241 | 3300032828 | Soil | MAPPTSKAPLSDDERQSLAESARLCEMILEANASDTGALETLKEIYTKL |
Ga0364930_0097245_819_1004 | 3300033814 | Sediment | MAQKDSKGSMSTEEKHSLEESARLCEMIVEANPSDTGALETLKEIYTKLGDREKLAKIVVRL |
Ga0370498_130849_470_598 | 3300034155 | Untreated Peat Soil | MAQTDNKTSLSSEEKRSLAESARLCEMIVEANPSDTGALETLK |
Ga0364942_0059717_2_148 | 3300034165 | Sediment | MAQTENKTSLSQEEKRSLAESARLCEMIVEANPSDTGALETLKEIYTKL |
⦗Top⦘ |