Basic Information | |
---|---|
Family ID | F103371 |
Family Type | Metagenome |
Number of Sequences | 101 |
Average Sequence Length | 60 residues |
Representative Sequence | MSYDDWKTHNPDDDRCEFCGAHPRECRGGWQPSACTGECGKGWRDPDAEYEKMRDEA |
Number of Associated Samples | 88 |
Number of Associated Scaffolds | 100 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | No |
Most common taxonomic group | Unclassified |
% of genes with valid RBS motifs | 100.00 % |
% of genes near scaffold ends (potentially truncated) | 0.00 % |
% of genes from short scaffolds (< 2000 bps) | 0.00 % |
Associated GOLD sequencing projects | 78 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.33 |
Hidden Markov Model |
---|
Powered by Skylign |
Most Common Taxonomy | |
---|---|
Group | Unclassified (97.030 % of family members) |
NCBI Taxonomy ID | N/A |
Taxonomy | N/A |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (12.871 % of family members) |
Environment Ontology (ENVO) | Unclassified (25.743 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (33.663 % of family members) |
⦗Top⦘ |
⦗Top⦘ |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 21.18% β-sheet: 0.00% Coil/Unstructured: 78.82% | Feature Viewer |
|
|||||
Powered by Feature Viewer |
Structure Viewer | |
---|---|
| |
Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.33 |
Powered by PDBe Molstar |
⦗Top⦘ |
Pfam ID | Name | % Frequency in 100 Family Scaffolds |
---|---|---|
PF00589 | Phage_integrase | 5.00 |
PF04851 | ResIII | 4.00 |
PF01555 | N6_N4_Mtase | 3.00 |
PF13392 | HNH_3 | 3.00 |
PF00805 | Pentapeptide | 2.00 |
PF00145 | DNA_methylase | 2.00 |
PF14354 | Lar_restr_allev | 2.00 |
PF13560 | HTH_31 | 2.00 |
PF12684 | DUF3799 | 2.00 |
PF14284 | PcfJ | 1.00 |
PF13362 | Toprim_3 | 1.00 |
PF00196 | GerE | 1.00 |
PF07120 | DUF1376 | 1.00 |
PF00959 | Phage_lysozyme | 1.00 |
PF02592 | Vut_1 | 1.00 |
PF13518 | HTH_28 | 1.00 |
PF09374 | PG_binding_3 | 1.00 |
PF08299 | Bac_DnaA_C | 1.00 |
PF00436 | SSB | 1.00 |
PF01381 | HTH_3 | 1.00 |
PF04448 | DUF551 | 1.00 |
PF05345 | He_PIG | 1.00 |
PF09588 | YqaJ | 1.00 |
PF01844 | HNH | 1.00 |
PF04466 | Terminase_3 | 1.00 |
COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds |
---|---|---|---|
COG0863 | DNA modification methylase | Replication, recombination and repair [L] | 3.00 |
COG1041 | tRNA G10 N-methylase Trm11 | Translation, ribosomal structure and biogenesis [J] | 3.00 |
COG2189 | Adenine specific DNA methylase Mod | Replication, recombination and repair [L] | 3.00 |
COG0270 | DNA-cytosine methylase | Replication, recombination and repair [L] | 2.00 |
COG1357 | Uncharacterized conserved protein YjbI, contains pentapeptide repeats | Function unknown [S] | 2.00 |
COG0593 | Chromosomal replication initiation ATPase DnaA | Replication, recombination and repair [L] | 1.00 |
COG0629 | Single-stranded DNA-binding protein | Replication, recombination and repair [L] | 1.00 |
COG1738 | Queuosine precursor transporter YhhQ, DUF165 family | Translation, ribosomal structure and biogenesis [J] | 1.00 |
COG1783 | Phage terminase large subunit | Mobilome: prophages, transposons [X] | 1.00 |
COG2965 | Primosomal replication protein N | Replication, recombination and repair [L] | 1.00 |
COG3756 | Uncharacterized conserved protein YdaU, DUF1376 family | Function unknown [S] | 1.00 |
⦗Top⦘ |
Name | Rank | Taxonomy | Distribution |
Unclassified | root | N/A | 97.03 % |
All Organisms | root | All Organisms | 2.97 % |
Visualization |
---|
Powered by ApexCharts |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
3300024310|Ga0247681_1000024 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 30541 | Open in IMG/M |
3300025900|Ga0207710_10000889 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 15992 | Open in IMG/M |
3300025941|Ga0207711_10000553 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 38209 | Open in IMG/M |
⦗Top⦘ |
Habitat | Taxonomy | Distribution |
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 12.87% |
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.94% |
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 4.95% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.95% |
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 4.95% |
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 4.95% |
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 3.96% |
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Rhizosphere | 3.96% |
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 2.97% |
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 2.97% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 2.97% |
Landfill Leachate | Engineered → Solid Waste → Landfill → Unclassified → Unclassified → Landfill Leachate | 2.97% |
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.98% |
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 1.98% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.98% |
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 1.98% |
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.98% |
Deep Subsurface Aquifer | Environmental → Terrestrial → Deep Subsurface → Aquifer → Unclassified → Deep Subsurface Aquifer | 1.98% |
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.98% |
Ectomycorrhiza | Host-Associated → Plants → Roots → Unclassified → Unclassified → Ectomycorrhiza | 1.98% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 1.98% |
Wastewater | Engineered → Wastewater → Unclassified → Unclassified → Unclassified → Wastewater | 1.98% |
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.99% |
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 0.99% |
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 0.99% |
Aquatic | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Aquatic | 0.99% |
Freshwater | Environmental → Aquatic → Freshwater → Drinking Water → Unclassified → Freshwater | 0.99% |
Freshwater | Environmental → Aquatic → Freshwater → Creek → Unclassified → Freshwater | 0.99% |
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 0.99% |
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.99% |
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.99% |
Ore Pile And Mine Drainage Contaminated Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Ore Pile And Mine Drainage Contaminated Soil | 0.99% |
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.99% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.99% |
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.99% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.99% |
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 0.99% |
Sugarcane Root And Bulk Soil | Host-Associated → Plants → Rhizome → Unclassified → Unclassified → Sugarcane Root And Bulk Soil | 0.99% |
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.99% |
Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere | 0.99% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.99% |
Activated Sludge | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Activated Sludge | 0.99% |
Anaerobic Digestor Sludge | Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge | 0.99% |
Wastewater Effluent | Engineered → Wastewater → Nutrient Removal → Unclassified → Unclassified → Wastewater Effluent | 0.99% |
Visualization |
---|
Powered by ApexCharts |
Taxon OID | Sample Name | Habitat Type | IMG/M Link |
---|---|---|---|
3300001078 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_O2 | Environmental | Open in IMG/M |
3300001963 | Marine microbial communities from Nags Head, North Carolina, USA - GS013 | Environmental | Open in IMG/M |
3300002461 | Freshwater microbial communities from a drinking water treatment plant in Ann Arbor, Michigan, USA | Environmental | Open in IMG/M |
3300003312 | Ore pile and mine drainage contaminated soil microbial communities from Mina do Sossego, Brazil - P1 sample | Environmental | Open in IMG/M |
3300003320 | Sugarcane root Sample H2 | Host-Associated | Open in IMG/M |
3300003354 | Arabidopsis root microbial communities from the University of North Carolina, USA - plate scrape MF_Cvi_mMS | Host-Associated | Open in IMG/M |
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental | Open in IMG/M |
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
3300005356 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG | Host-Associated | Open in IMG/M |
3300005367 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG | Host-Associated | Open in IMG/M |
3300005441 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG | Environmental | Open in IMG/M |
3300005458 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG | Environmental | Open in IMG/M |
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental | Open in IMG/M |
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental | Open in IMG/M |
3300005563 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 | Host-Associated | Open in IMG/M |
3300005614 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 | Host-Associated | Open in IMG/M |
3300005664 | Freshwater viral communities from Emiquon reservoir, Havana, Illinois, USA | Environmental | Open in IMG/M |
3300005987 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 9/18/14 B DNA | Engineered | Open in IMG/M |
3300006354 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 | Environmental | Open in IMG/M |
3300006358 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 | Host-Associated | Open in IMG/M |
3300006802 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 | Environmental | Open in IMG/M |
3300006805 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_0.19_<0.8_DNA | Environmental | Open in IMG/M |
3300007351 | Combined Assembly of Gp0115775, Gp0115815 | Environmental | Open in IMG/M |
3300007352 | Deep subsurface aquifer microbial community from Lead, South Dakota (DUSEL-D aquifer) | Environmental | Open in IMG/M |
3300008065 | Wastewater microbial communities from the domestic sewers in Singapore - Site 3 | Engineered | Open in IMG/M |
3300008507 | Wastewater microbial communities from the domestic sewers in Singapore - Site 2 | Engineered | Open in IMG/M |
3300009101 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG | Host-Associated | Open in IMG/M |
3300009151 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130820_MF_MetaG | Environmental | Open in IMG/M |
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental | Open in IMG/M |
3300010356 | AD_USDEca | Engineered | Open in IMG/M |
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental | Open in IMG/M |
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental | Open in IMG/M |
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental | Open in IMG/M |
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental | Open in IMG/M |
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental | Open in IMG/M |
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental | Open in IMG/M |
3300011435 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT660_2 | Environmental | Open in IMG/M |
3300012001 | Permafrost microbial communities from Nunavut, Canada - A24_80cm_12M | Environmental | Open in IMG/M |
3300012008 | Permafrost microbial communities from Nunavut, Canada - A39_80cm_12M | Environmental | Open in IMG/M |
3300012892 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1 | Environmental | Open in IMG/M |
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300014205 | Leachate microbial communities from a municipal landfill in Southern Ontario, Canada - Leachate well 162 metaG | Engineered | Open in IMG/M |
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated | Open in IMG/M |
3300018055 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_90_coex | Environmental | Open in IMG/M |
3300018481 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 T | Environmental | Open in IMG/M |
3300020020 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2a1 | Environmental | Open in IMG/M |
3300020034 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1c2 | Environmental | Open in IMG/M |
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental | Open in IMG/M |
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental | Open in IMG/M |
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental | Open in IMG/M |
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental | Open in IMG/M |
3300021082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redo | Environmental | Open in IMG/M |
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental | Open in IMG/M |
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental | Open in IMG/M |
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental | Open in IMG/M |
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental | Open in IMG/M |
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental | Open in IMG/M |
3300021440 | Freshwater microbial communities from McNutts Creek, Athens, Georgia, United States - 3-17 MG | Environmental | Open in IMG/M |
3300022173 | Freshwater viral communities from Lake Michigan, USA - Sp13.VD.MM15.D.D | Environmental | Open in IMG/M |
3300024181 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK34 | Environmental | Open in IMG/M |
3300024310 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK22 | Environmental | Open in IMG/M |
3300025302 | Arabidopsis root microbial communities from the University of North Carolina, USA - plate scrape MF_Cvi_mMS (SPAdes) | Host-Associated | Open in IMG/M |
3300025896 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_0.19_<0.8_DNA (SPAdes) | Environmental | Open in IMG/M |
3300025900 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300025914 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300025917 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes) | Environmental | Open in IMG/M |
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental | Open in IMG/M |
3300025937 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300025941 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300025942 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes) | Host-Associated | Open in IMG/M |
3300025949 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 (SPAdes) | Host-Associated | Open in IMG/M |
3300026078 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 (SPAdes) | Host-Associated | Open in IMG/M |
3300027037 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_O2 (SPAdes) | Environmental | Open in IMG/M |
3300027513 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890 (SPAdes) | Environmental | Open in IMG/M |
3300027548 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M1 (SPAdes) | Environmental | Open in IMG/M |
3300027573 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100 (SPAdes) | Environmental | Open in IMG/M |
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental | Open in IMG/M |
3300028800 | Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-21-26 metaG | Host-Associated | Open in IMG/M |
3300028804 | Activated sludge microbial communities from WWTP in Nijmegen, Gelderland, Netherland - WWTP Weurt | Engineered | Open in IMG/M |
3300031507 | Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 10_EM | Host-Associated | Open in IMG/M |
3300031681 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20 | Environmental | Open in IMG/M |
3300031730 | Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 19_EM | Host-Associated | Open in IMG/M |
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
3300032828 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4 | Environmental | Open in IMG/M |
3300032829 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3 | Environmental | Open in IMG/M |
3300032893 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1 | Environmental | Open in IMG/M |
3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental | Open in IMG/M |
3300034149 | Sediment microbial communities from East River floodplain, Colorado, United States - 20_j17 | Environmental | Open in IMG/M |
Geographical Distribution | |
---|---|
Zoom: | Powered by OpenStreetMap |
⦗Top⦘ |
Protein ID | Sample Taxon ID | Habitat | Sequence |
JGI12640J13246_1007762 | 3300001078 | Forest Soil | MELPGYDDWKTHNPDDDRCEFCGMHPREWSAGWQPTRCTGECGLSWRDPDFEYEQARDDAQFFGNDIQANDDEY* |
GOS2229_10041137 | 3300001963 | Marine | SISLYAGESMSSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPNSCTGECRQSWRDPDAEYERMRDEE* |
AADWTP_100017591 | 3300002461 | Freshwater | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPNGCTGKCNTSWRDPDHEYEKMRDEPHF* |
P12013IDBA_10544951 | 3300003312 | Ore Pile And Mine Drainage Contaminated Soil | GYDDWKTHNPDDDRCEFCGVHPRECRAGWQPDRCTGECGRAWRDPDYEYDQMRDDLTLQ* |
rootH2_1011090536 | 3300003320 | Sugarcane Root And Bulk Soil | MNYDDWKTHDPDDDRCEFCGVAPWQCRGGWQPDQCTGECQRGWRDPDAEYEKMRDEG* |
JGI25160J50197_10002946 | 3300003354 | Arabidopsis Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPNKCSGECGTGWRDPDHEYEKMRDEA* |
JGI25160J50197_10017803 | 3300003354 | Arabidopsis Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPNNCTGECGKGWRDPDHEYEKMRDEA* |
Ga0065707_100940476 | 3300005295 | Switchgrass Rhizosphere | MSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPNACTGECGRGWRDPDAEYEKRRDEQ* |
Ga0066388_1078986912 | 3300005332 | Tropical Forest Soil | WKTHNPDDDRCEFCGVHPGRQLPGWMPALCTGECGRSWRDPDAEYEAMRDDADF* |
Ga0070674_1015889712 | 3300005356 | Miscanthus Rhizosphere | MTNLPGYDEWKTHNPDDDRCEFCGAHPNESRHGWAPQACVGKCRTSWRDPDDEYDRMRDERDAP* |
Ga0070667_1014190362 | 3300005367 | Switchgrass Rhizosphere | MSNGMSLPGYDDWKTHNPDDDRCEFCGVHPRECRDGWEPLGCTGECGTVWRDPDFEYDQMRENS* |
Ga0070700_10000041226 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDAEYERMRDDHE* |
Ga0070681_101074835 | 3300005458 | Corn Rhizosphere | VSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPSQCTGECGRGWRDPDAEYERMKDET* |
Ga0070707_1002879384 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDAEYEKMRDDHE* |
Ga0070707_1009915511 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | SMSYDDWKTHNPDDDRCEFCSAGPREYRGGWQPGSCTGECNKSWRDPDAEYEKMRDEA* |
Ga0070698_1002105394 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | LDGIDERPDEESMSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPNACTGECGRGWRDPDAEYEKMRDDHE* |
Ga0068855_1022190623 | 3300005563 | Corn Rhizosphere | MNYDDWKTHNPDDDRCEFCGAAPWECRGGWQPDKCNGECGKGWRDPDHEYEKMRDEA* |
Ga0068856_1009671582 | 3300005614 | Corn Rhizosphere | MSYDDWKTHNPDDDRCEFCGVHPRDCRGGWQPNACTGECGKGWRDPDYEYEKMRDEA* |
Ga0073685_10166614 | 3300005664 | Aquatic | MATETLPGYDDWKTHNPDDDRCEFCGVHPRECRGGWQPDLCTGECGRSWRDPDYEYDKMRDERDHYI* |
Ga0075158_102608722 | 3300005987 | Wastewater Effluent | MATETLPGYDDWKTHNPDDDRCEFCGVHPRECRGGWQPDLCTGECGLSWRDPDYEYDKMRDERDHYI* |
Ga0075021_107391962 | 3300006354 | Watersheds | MMTLPGYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCTGECKTSWRDPDFEYEKMRDEDNG* |
Ga0068871_1009245452 | 3300006358 | Miscanthus Rhizosphere | MSYDDWKTHNPDDDRCEFCGAHPRECRGGWQPSACTGECGKGWRDPDAEYEKMRDEA |
Ga0070749_101943925 | 3300006802 | Aqueous | VNLPGYDDWKTHNPDDDRCEYCGAYPWQCRGGWEPDCCTGECGRKWRDPDAERDARMDR* |
Ga0075464_1000733013 | 3300006805 | Aqueous | MSYDDWKTHNPDDDRCEFCGVAPWECRGGWQPDRCTGECNRGWRDPDAEYEKMRDEA* |
Ga0075464_105116343 | 3300006805 | Aqueous | MSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPSQCTGDCGRGWRDPDAEYEAMRDER* |
Ga0104751_100976515 | 3300007351 | Deep Subsurface Aquifer | VSDMNLPGYDDWKTHNPDDDCGCEFCGASRWECRAGWQPEFCTGECGCSWRDPDYERDKMRDERMER* |
Ga0104756_10244655 | 3300007352 | Deep Subsurface Aquifer | MISHPGYDIWKTSNPDDGRCEFCGAYPRECRGGWQPDCCTGECGCSWRDPDYERDLRNDQ |
Ga0110935_10341873 | 3300008065 | Wastewater | MSYDDWKTRNPDDDRCEFCGVHPRECRAGWQPNSCTGECGQSWRDPDYEYEKMRDEDR* |
Ga0110934_10046653 | 3300008507 | Wastewater | MSYDDWKTHNPDDDRCEFCGAAPWEFKGGWQPNRCNGECRQSFRDPDAEYDRMRDEG* |
Ga0105247_1000138114 | 3300009101 | Switchgrass Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRISWRDPDAEYERMRDDHE* |
Ga0114962_104073492 | 3300009151 | Freshwater Lake | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCTGKCRTSWRDPGYEYEKMRDDRD* |
Ga0126382_113793021 | 3300010047 | Tropical Forest Soil | MDGLPGYDDWKTHNPDDDCCEFCGVHPRECRDGWQPDRCTGECGRKWRDPDYEYDQMRDEGR* |
Ga0116237_105275254 | 3300010356 | Anaerobic Digestor Sludge | MATETLPGYDDWKTHNPDVDRCEFCGVHPRECRGGRQPDLCTGECGRSWRDPDYEYDKMR |
Ga0126377_114399542 | 3300010362 | Tropical Forest Soil | MNLPGYDAWKTHNPDDDRCEYCGVHIRETRSGWKPDRCTRECGIGWFDPDRLYEEKRDEPPPPAAPEDYDT* |
Ga0134125_114249191 | 3300010371 | Terrestrial Soil | MSLPGYDEWKLASPDDGYCEFCGVHERRCRDGWRPDECTGECRQSWRDPDAEYDRMRDEQ |
Ga0134128_105860194 | 3300010373 | Terrestrial Soil | MTYDDWKTHNPDDDRCEFCGVAPWECRGGWQPDSCTGECGKGWR |
Ga0134126_116776192 | 3300010396 | Terrestrial Soil | MELPGYDDWKTHNPDDDRCEHCGADPRVYRAGWQPTPCTGECGTVWRDPDAEYEAARDNANDALT* |
Ga0134124_1001811215 | 3300010397 | Terrestrial Soil | PGYDDWKTHDPDDERCEYCGVHPREIYGGWQPTRCTGQCGLVWKDPDAEYEKMRDEGDQW |
Ga0134127_102866986 | 3300010399 | Terrestrial Soil | VSYDDWKTHNPDDDRCEFCGAAPWEYRGGWQPDKCSGECGKGWRDPDQEYEKMRDDHE* |
Ga0137426_10214302 | 3300011435 | Soil | MDLPGYDAWKTHNPDDDRCEFCGADPRYCKGWQPDACTGECGIGWRDPDAEYDRRRDDPPP* |
Ga0120167_10483312 | 3300012001 | Permafrost | MSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPDKCTGECGKGWRDPDAEYEKMRDEE* |
Ga0120174_10808932 | 3300012008 | Permafrost | MSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPDKCTGECGKGWRDPDAEYEKMRDDA* |
Ga0157294_101972222 | 3300012892 | Soil | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDAEYEK |
Ga0137404_100946906 | 3300012929 | Vadose Zone Soil | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCTGKCNTSCRDPDAEYERMRDEG* |
Ga0137404_101040966 | 3300012929 | Vadose Zone Soil | MSYDDWKTHNPDDDRCEFCGVAPWECRGGYQPNNCTGECGKGWRDPDAEYERMRDEA* |
Ga0172380_100804416 | 3300014205 | Landfill Leachate | MDGLPGYDDWKTHNPADDCCEFCGADPRYCRAGWEPDGCTGECGKVWRDPDAERERIRDESY* |
Ga0172380_107668271 | 3300014205 | Landfill Leachate | MSYDDWKTHNPDDDRCEFCGVAPWQCRGGWQPDQCTGECCKGWRDPDAEYEKMRDEA* |
Ga0172380_110455182 | 3300014205 | Landfill Leachate | MSYDDWKTYNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPD |
Ga0132258_102282369 | 3300015371 | Arabidopsis Rhizosphere | MSELPGYDEWKTHNPDDDRCEYCGAHPNESRQGWAPENCTGKCKTSWRDPDYEYDRMRDEEDRGYA* |
Ga0132258_121472022 | 3300015371 | Arabidopsis Rhizosphere | MSDLPGYDDWKTHNPDDDRCEHCGAHPNESRAGWAPQNCTGKCGTSWRDPDYEYDRMRDRRDFGR* |
Ga0184616_101590483 | 3300018055 | Groundwater Sediment | MSDLPGYDDWKTHNPDDDRCEFCGANPNASKHGWAPDECTGKCDTSWRDPDYEYDRKRDEEDRGYA |
Ga0190271_100240037 | 3300018481 | Soil | MSDLPGYDDWKTHNPDDDRCEFCGANPNASKHGWAPDECTGKCETSWRDPDYEYDRKRDEEDRGYA |
Ga0190271_129849181 | 3300018481 | Soil | RKKEGNRTMNLPGYDAWKTHNPDDDRCEFCGANPNASKHGWAPEECTGKCNTSWRDPDFEYDRMRDEEMQERNR |
Ga0193738_11397612 | 3300020020 | Soil | MSLPGYDDWKTHNPDDDRCEFCGVHPRECRAGWQPDRCTGECGQKWRDPDAEYDQMRDERDAGLE |
Ga0193753_100003892 | 3300020034 | Soil | MTYDDWKTYNPDDDRCEFCGAALWESRGGWQPNGCNGECHISWRDPDYEYEKMRDDA |
Ga0210407_113061502 | 3300020579 | Soil | MTNYDDWKTHNPDDDRCEHCGVAPWQCRGGWQPDCCTGECGRSWRDPDYEYERRRDDG |
Ga0210403_100829526 | 3300020580 | Soil | MSELPASYDQWRTHNPDDDRCEHCGAAPWECRGGWQPNNCTGECGKGFRDPDAEYEKMRDEA |
Ga0210403_107629062 | 3300020580 | Soil | MSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPSNCTGACNTSWRDPDHEYEKMRDEG |
Ga0210395_104127163 | 3300020582 | Soil | MTNYDDWKTHNPDDDHCEHCGVAPWQCRGGWQPDCCTGECGRSWRDPDYEYERRRDDG |
Ga0210401_1000062818 | 3300020583 | Soil | MSLPAYDDWKTHNPDDDRCEFCGVAPWECRGGWQPSSCTGECKKAWRDPDYEYDKMRDEA |
Ga0210380_101116793 | 3300021082 | Groundwater Sediment | MSDLPGYDDWKTHNPDDDRCEFCGANPSVSKHGWAPDECTGKCDTSWRDPDYEYDRKRDEEDGRDA |
Ga0210404_105285821 | 3300021088 | Soil | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWMPSGCTGKCNTSWRDPDYEYEKMRDDDNG |
Ga0210406_106089823 | 3300021168 | Soil | MSYDDWKTHNPDDDRCEHCGVAPWQCRGGWQPDCCTGECGRSWRDPDYEYKKMREEGP |
Ga0210400_101998133 | 3300021170 | Soil | MTYDDWKTHDPDDDRCEFCGAAPWEFEGGWQPDRCNGECRQSFRDPDREYDEMRDEG |
Ga0210396_100274772 | 3300021180 | Soil | MSYDDWKTHNPDDDRCEFCGAAPWESKGGWQPSACTGECRTSWRDPDAEYERMRDEETQ |
Ga0210394_101091352 | 3300021420 | Soil | MSYDDWKTHNPDDDRCEFCGAHPREFHGGWQPSACTGECRCSFRDPDYEYEKMRDET |
Ga0213919_10203861 | 3300021440 | Freshwater | MSYDDWKTHNPADDCCEFCGADPRYCRGGWQPNSCTGECGKGWRDPDYEYEKMRDEA |
Ga0181337_10210781 | 3300022173 | Freshwater Lake | TERAALMSNYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSSCTGECGKGWRDPDAEYEKMRDDHE |
Ga0247693_10045356 | 3300024181 | Soil | MDLPGYDDWKTHDRDAERCEYCGVHPTETPRGGWQPHCCTGQCGLIWKDPDAEYEKMRDE |
Ga0247681_10000243 | 3300024310 | Soil | MSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPDKCSGECGKSWRDPDYEYEKMRDEA |
Ga0207426_100060926 | 3300025302 | Arabidopsis Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPNNCTGECGKGWRDPDHEYEKMRDEA |
Ga0207426_100060932 | 3300025302 | Arabidopsis Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPNKCSGECGTGWRDPDHEYEKMRDEA |
Ga0208916_100182242 | 3300025896 | Aqueous | MSYDDWKTHNPDDDRCEFCGVAPWECRGGWQPDRCTGECNRGWRDPDAEYEKMRDEA |
Ga0208916_103974601 | 3300025896 | Aqueous | TRPHDSRCSMATETLPGYDDWKTHNPDDDRCEFCGVHPRECRGGWQPDLCTGECGRSWRDPDYEYDKMRDERDHYI |
Ga0207710_1000088927 | 3300025900 | Switchgrass Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRISWRDPDAEYERMRDDHE |
Ga0207671_103793051 | 3300025914 | Corn Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWECRGGWQPDKCTGDC |
Ga0207660_110748902 | 3300025917 | Corn Rhizosphere | SSVSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPSQCTGECGRGWRDPDAEYERMKDET |
Ga0207646_102511113 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDAEYEKMRDDHE |
Ga0207646_106176362 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MNYDDWKTHNPDDDRCEFCGAGPREYRGGWQPGSCTGECNKSWRDPDAEYEKMRDEA |
Ga0207669_118320031 | 3300025937 | Miscanthus Rhizosphere | MTNLPGYDEWKTHNPDDDRCEFCGAHPNESRHGWAPQACVGKCRTSWRDPDDEYDRMRDERDAP |
Ga0207711_1000055339 | 3300025941 | Switchgrass Rhizosphere | MSLPGYDDWKTHNPDDDRCEFCGVHPRECRDGWEPLGCTGECGTVWRDPDFEYDQMRENS |
Ga0207689_109647342 | 3300025942 | Miscanthus Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDAEYERMRDDHE |
Ga0207667_118648412 | 3300025949 | Corn Rhizosphere | MSSYDDWKTHNPDDDRCEFCGAAPLECRGGWQPDKCNGECGKGWRDPDHEYEKMRDDA |
Ga0207702_117307022 | 3300026078 | Corn Rhizosphere | MSYDDWKTHNPDDDRCEFCGVHPRDCRGGWQPNACTGECGKGWRDPDYEYEKMRDEA |
Ga0207702_121467562 | 3300026078 | Corn Rhizosphere | MSYDDWKTHNPADDCCEFCGADPRYCRGGWQPNACTGECNRGWLDPDAEYERMRDEA |
Ga0209005_10010075 | 3300027037 | Forest Soil | MELPGYDDWKTHNPDDDRCEFCGMHPREWSAGWQPTRCTGECGLSWRDPDFEYEQARDDAQFFGNDIQANDDEY |
Ga0208685_10982732 | 3300027513 | Soil | VSLPGYDDWKTHNPDDDRCEFCGVDPRGNNGWQPADCSGKCGIIWRDPDYEYERQRDDRA |
Ga0209523_10240664 | 3300027548 | Forest Soil | MDLPGYDDWKTHNPDDDRCEFCGVHPRECSSGWQPARCTGECGLSWRDPDFEYEQARDDAQFFGNDIQVNDDEY |
Ga0208454_11033621 | 3300027573 | Soil | PVRQPPRDSDREDLVSLPGYDDWKTHNPDDDRCEFCGVDPRGNNGWQPADCSGKCGIIWRDPDYEYERQRDDRA |
Ga0209068_106573542 | 3300027894 | Watersheds | MMTLPGYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCTGECKTSWRDPDFEYEKMRDEDNG |
Ga0265338_102481482 | 3300028800 | Rhizosphere | MSYDDWKTHNPDDDRCEFCGAAPWENRGGWMPSGCTGKCNTSWRDPDYEYEKMRDDGND |
Ga0268298_103254763 | 3300028804 | Activated Sludge | MSYDDWKTRNPDDDRCEFCGVHPRECRAGWQPNSCTGECGQSWRDPDYEYEKMRDEDR |
Ga0307509_1002090825 | 3300031507 | Ectomycorrhiza | MPGYDAWKTHNPDDDRCEFCGVHPREYRGGWQPNACTGECRKGWRDPDAEYEKMRDEP |
Ga0318572_107117032 | 3300031681 | Soil | VSYDDWKTHNPDDDRCEFCGVHPRECRGGWQPNACTGECNRGWRDPDAEYERMRDDA |
Ga0307516_106455832 | 3300031730 | Ectomycorrhiza | MSYDDWKTHNPDDDRCEFCGAAPWESRGGWQPSGCNGECRTSWRDPDHEYEKMRDDQ |
Ga0307471_1005252356 | 3300032180 | Hardwood Forest Soil | VIDMDLPGYDDWKTHNPDDDRCEFCGVHPRECRGGWQPDRCTGECGQSWRDPDFEYEQARDDAQFF |
Ga0335080_111655953 | 3300032828 | Soil | MSDLPGYDSWKTHNPDDDRCEFCGAHERECRAGWQPDCCTGECRRTWRDPDAEYEKSRDEVSTAMELDE |
Ga0335070_102958743 | 3300032829 | Soil | MSRSNGLPGYDAWLTHDPDDDGCEFCGVGKAECRSGWRPEECTGECRRVWRDPDEEYERRREEPRE |
Ga0335069_102811126 | 3300032893 | Soil | MSRLNGLPGYDDWLTHNPDDDRCEFCGVGKAERRSSWRPEECTGECRRVWRDPDEEYERRREEPRE |
Ga0335084_111255133 | 3300033004 | Soil | MTDLPGYDSWKTHNPDDDRCEFCGAHERECRAGWQPDCCTGECQRIWRDPDYEYEKARDE |
Ga0364929_0316921_2_187 | 3300034149 | Sediment | GYDDWKTHNPDDDRCEFCGANPSVSKHGWAPDECTGKCDTSWRDPDYEYDRKRDEEDGRD |
⦗Top⦘ |