Basic Information | |
---|---|
Family ID | F095088 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 105 |
Average Sequence Length | 84 residues |
Representative Sequence | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSQILRAMLRLPDESVQEPSSLEAPCPQGKSKRQRH |
Number of Associated Samples | 99 |
Number of Associated Scaffolds | 105 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Unclassified |
% of genes with valid RBS motifs | 10.48 % |
% of genes near scaffold ends (potentially truncated) | 28.57 % |
% of genes from short scaffolds (< 2000 bps) | 82.86 % |
Associated GOLD sequencing projects | 94 |
AlphaFold2 3D model prediction | No |
Hidden Markov Model |
---|
Powered by Skylign |
Most Common Taxonomy | |
---|---|
Group | Unclassified (53.333 % of family members) |
NCBI Taxonomy ID | N/A |
Taxonomy | N/A |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (25.714 % of family members) |
Environment Ontology (ENVO) | Unclassified (32.381 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (36.190 % of family members) |
⦗Top⦘ |
⦗Top⦘ |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 52.87% β-sheet: 0.00% Coil/Unstructured: 47.13% | Feature Viewer |
|
|||||
Powered by Feature Viewer |
⦗Top⦘ |
Pfam ID | Name | % Frequency in 105 Family Scaffolds |
---|---|---|
PF03050 | DDE_Tnp_IS66 | 12.38 |
PF05717 | TnpB_IS66 | 6.67 |
PF01609 | DDE_Tnp_1 | 1.90 |
PF03328 | HpcH_HpaI | 0.95 |
PF13808 | DDE_Tnp_1_assoc | 0.95 |
PF10073 | DUF2312 | 0.95 |
PF01595 | CNNM | 0.95 |
PF00174 | Oxidored_molyb | 0.95 |
PF08378 | NERD | 0.95 |
COG ID | Name | Functional Category | % Frequency in 105 Family Scaffolds |
---|---|---|---|
COG3436 | Transposase | Mobilome: prophages, transposons [X] | 19.05 |
COG3039 | Transposase and inactivated derivatives, IS5 family | Mobilome: prophages, transposons [X] | 1.90 |
COG3293 | Transposase | Mobilome: prophages, transposons [X] | 1.90 |
COG3385 | IS4 transposase InsG | Mobilome: prophages, transposons [X] | 1.90 |
COG5421 | Transposase | Mobilome: prophages, transposons [X] | 1.90 |
COG5433 | Predicted transposase YbfD/YdcC associated with H repeats | Mobilome: prophages, transposons [X] | 1.90 |
COG5659 | SRSO17 transposase | Mobilome: prophages, transposons [X] | 1.90 |
COG0469 | Pyruvate kinase | Carbohydrate transport and metabolism [G] | 0.95 |
COG2041 | Molybdopterin-dependent catalytic subunit of periplasmic DMSO/TMAO and protein-methionine-sulfoxide reductases | Energy production and conversion [C] | 0.95 |
COG2301 | Citrate lyase beta subunit | Carbohydrate transport and metabolism [G] | 0.95 |
COG3836 | 2-keto-3-deoxy-L-rhamnonate aldolase RhmA | Carbohydrate transport and metabolism [G] | 0.95 |
COG3915 | Uncharacterized conserved protein | Function unknown [S] | 0.95 |
⦗Top⦘ |
Name | Rank | Taxonomy | Distribution |
Unclassified | root | N/A | 53.33 % |
All Organisms | root | All Organisms | 46.67 % |
Visualization |
---|
Powered by ApexCharts |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
2140918013|NODE_244781_length_975_cov_8.541538 | All Organisms → cellular organisms → Bacteria | 1007 | Open in IMG/M |
2228664021|ICCgaii200_c0965927 | All Organisms → cellular organisms → Bacteria | 1652 | Open in IMG/M |
3300000033|ICChiseqgaiiDRAFT_c0622038 | Not Available | 855 | Open in IMG/M |
3300000787|JGI11643J11755_11394094 | All Organisms → cellular organisms → Bacteria | 920 | Open in IMG/M |
3300002560|JGI25383J37093_10183326 | Not Available | 551 | Open in IMG/M |
3300003319|soilL2_10098350 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4598 | Open in IMG/M |
3300003321|soilH1_10068303 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3033 | Open in IMG/M |
3300004281|Ga0066397_10021251 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 967 | Open in IMG/M |
3300004480|Ga0062592_100018236 | All Organisms → cellular organisms → Bacteria | 3112 | Open in IMG/M |
3300004778|Ga0062383_10176797 | All Organisms → cellular organisms → Bacteria | 972 | Open in IMG/M |
3300004808|Ga0062381_10267201 | Not Available | 618 | Open in IMG/M |
3300005174|Ga0066680_10670071 | Not Available | 641 | Open in IMG/M |
3300005178|Ga0066688_10477924 | All Organisms → cellular organisms → Bacteria | 804 | Open in IMG/M |
3300005180|Ga0066685_10452721 | Not Available | 891 | Open in IMG/M |
3300005330|Ga0070690_100226836 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 1311 | Open in IMG/M |
3300005332|Ga0066388_102378585 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 960 | Open in IMG/M |
3300005332|Ga0066388_106181802 | Not Available | 604 | Open in IMG/M |
3300005340|Ga0070689_101744823 | Not Available | 567 | Open in IMG/M |
3300005471|Ga0070698_100301514 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1533 | Open in IMG/M |
3300005545|Ga0070695_100149986 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 1626 | Open in IMG/M |
3300005552|Ga0066701_10237244 | All Organisms → cellular organisms → Bacteria | 1126 | Open in IMG/M |
3300005554|Ga0066661_10065224 | Not Available | 2115 | Open in IMG/M |
3300005555|Ga0066692_10583593 | All Organisms → cellular organisms → Bacteria | 703 | Open in IMG/M |
3300005586|Ga0066691_10160292 | All Organisms → cellular organisms → Bacteria | 1298 | Open in IMG/M |
3300005713|Ga0066905_101754129 | Not Available | 572 | Open in IMG/M |
3300005829|Ga0074479_10154464 | Not Available | 1144 | Open in IMG/M |
3300005829|Ga0074479_11160709 | All Organisms → cellular organisms → Bacteria | 2106 | Open in IMG/M |
3300006057|Ga0075026_100794610 | Not Available | 573 | Open in IMG/M |
3300006845|Ga0075421_101366217 | Not Available | 781 | Open in IMG/M |
3300006846|Ga0075430_100693723 | Not Available | 839 | Open in IMG/M |
3300007255|Ga0099791_10009087 | All Organisms → cellular organisms → Bacteria | 4123 | Open in IMG/M |
3300007255|Ga0099791_10015400 | All Organisms → cellular organisms → Bacteria | 3252 | Open in IMG/M |
3300007258|Ga0099793_10707389 | Not Available | 508 | Open in IMG/M |
3300007265|Ga0099794_10343665 | Not Available | 776 | Open in IMG/M |
3300009012|Ga0066710_103197275 | Not Available | 629 | Open in IMG/M |
3300009090|Ga0099827_10446702 | All Organisms → cellular organisms → Bacteria | 1108 | Open in IMG/M |
3300009090|Ga0099827_10449930 | Not Available | 1104 | Open in IMG/M |
3300009137|Ga0066709_100873252 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella → Candidatus Entotheonella factor | 1309 | Open in IMG/M |
3300009143|Ga0099792_10350333 | All Organisms → cellular organisms → Bacteria | 891 | Open in IMG/M |
3300009147|Ga0114129_12566489 | Not Available | 608 | Open in IMG/M |
3300009148|Ga0105243_11277881 | Not Available | 750 | Open in IMG/M |
3300009515|Ga0129286_10003202 | All Organisms → cellular organisms → Bacteria | 3341 | Open in IMG/M |
3300009610|Ga0105340_1073666 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 1349 | Open in IMG/M |
3300009800|Ga0105069_1041926 | Not Available | 550 | Open in IMG/M |
3300010043|Ga0126380_10129812 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1574 | Open in IMG/M |
3300010046|Ga0126384_10531511 | Not Available | 1019 | Open in IMG/M |
3300010046|Ga0126384_11111149 | Not Available | 725 | Open in IMG/M |
3300011270|Ga0137391_11007501 | Not Available | 678 | Open in IMG/M |
3300011413|Ga0137333_1058724 | Not Available | 876 | Open in IMG/M |
3300012189|Ga0137388_11416623 | Not Available | 633 | Open in IMG/M |
3300012199|Ga0137383_10167283 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1611 | Open in IMG/M |
3300012201|Ga0137365_10674787 | Not Available | 756 | Open in IMG/M |
3300012202|Ga0137363_11299061 | Not Available | 616 | Open in IMG/M |
3300012203|Ga0137399_10659429 | All Organisms → cellular organisms → Bacteria | 880 | Open in IMG/M |
3300012206|Ga0137380_10272069 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 1522 | Open in IMG/M |
3300012209|Ga0137379_10464181 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1174 | Open in IMG/M |
3300012351|Ga0137386_10054083 | All Organisms → cellular organisms → Bacteria | 2779 | Open in IMG/M |
3300012356|Ga0137371_10749811 | All Organisms → cellular organisms → Bacteria | 745 | Open in IMG/M |
3300012685|Ga0137397_10041614 | All Organisms → cellular organisms → Bacteria | 3285 | Open in IMG/M |
3300012685|Ga0137397_10753570 | Not Available | 723 | Open in IMG/M |
3300012918|Ga0137396_10331870 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1125 | Open in IMG/M |
3300012923|Ga0137359_10065453 | All Organisms → cellular organisms → Bacteria | 3173 | Open in IMG/M |
3300012925|Ga0137419_11347734 | Not Available | 601 | Open in IMG/M |
3300012929|Ga0137404_10322746 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 1344 | Open in IMG/M |
3300014271|Ga0075326_1262450 | Not Available | 533 | Open in IMG/M |
3300015054|Ga0137420_1239398 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 955 | Open in IMG/M |
3300015264|Ga0137403_10395131 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 1263 | Open in IMG/M |
3300015372|Ga0132256_100523769 | All Organisms → cellular organisms → Bacteria | 1297 | Open in IMG/M |
3300017659|Ga0134083_10304749 | Not Available | 677 | Open in IMG/M |
3300018052|Ga0184638_1040143 | Not Available | 1698 | Open in IMG/M |
3300018071|Ga0184618_10223540 | Not Available | 792 | Open in IMG/M |
3300018075|Ga0184632_10048869 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1824 | Open in IMG/M |
3300018422|Ga0190265_10808039 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → Planctomycetaceae → unclassified Planctomycetaceae → Planctomycetaceae bacterium | 1060 | Open in IMG/M |
3300018468|Ga0066662_11344749 | Not Available | 735 | Open in IMG/M |
3300019233|Ga0184645_1198654 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella → Candidatus Entotheonella factor | 1327 | Open in IMG/M |
3300020065|Ga0180113_1102269 | Not Available | 734 | Open in IMG/M |
3300021081|Ga0210379_10510134 | Not Available | 535 | Open in IMG/M |
3300021307|Ga0179585_1066271 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2364 | Open in IMG/M |
3300022209|Ga0224497_10112750 | Not Available | 1101 | Open in IMG/M |
3300022226|Ga0224512_10174819 | Not Available | 1122 | Open in IMG/M |
3300022893|Ga0247787_1068964 | Not Available | 539 | Open in IMG/M |
3300023260|Ga0247798_1001561 | All Organisms → cellular organisms → Bacteria | 2986 | Open in IMG/M |
3300025925|Ga0207650_10649360 | Not Available | 889 | Open in IMG/M |
3300026325|Ga0209152_10264869 | Not Available | 639 | Open in IMG/M |
3300026376|Ga0257167_1083383 | Not Available | 506 | Open in IMG/M |
3300026480|Ga0257177_1047580 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexi incertae sedis → SAR202 cluster → SAR202 cluster bacterium | 660 | Open in IMG/M |
3300026497|Ga0257164_1057794 | Not Available | 631 | Open in IMG/M |
3300026527|Ga0209059_1349580 | Not Available | 502 | Open in IMG/M |
3300026542|Ga0209805_1233587 | Not Available | 749 | Open in IMG/M |
3300026548|Ga0209161_10040718 | All Organisms → cellular organisms → Bacteria | 3119 | Open in IMG/M |
3300027324|Ga0209845_1029518 | Not Available | 885 | Open in IMG/M |
3300027655|Ga0209388_1140648 | Not Available | 684 | Open in IMG/M |
(restricted) 3300027799|Ga0233416_10010420 | All Organisms → cellular organisms → Bacteria | 3016 | Open in IMG/M |
3300027910|Ga0209583_10608523 | Not Available | 557 | Open in IMG/M |
3300027949|Ga0209860_1051310 | Not Available | 547 | Open in IMG/M |
3300028590|Ga0247823_11502481 | Not Available | 505 | Open in IMG/M |
3300030993|Ga0308190_1180944 | Not Available | 520 | Open in IMG/M |
3300031093|Ga0308197_10041341 | Not Available | 1145 | Open in IMG/M |
3300031226|Ga0307497_10492048 | Not Available | 603 | Open in IMG/M |
3300031548|Ga0307408_100037862 | All Organisms → cellular organisms → Bacteria | 3399 | Open in IMG/M |
3300031562|Ga0310886_10783897 | Not Available | 599 | Open in IMG/M |
3300031892|Ga0310893_10006995 | All Organisms → cellular organisms → Bacteria | 2947 | Open in IMG/M |
3300031944|Ga0310884_10463627 | Not Available | 738 | Open in IMG/M |
3300032017|Ga0310899_10008279 | All Organisms → cellular organisms → Bacteria | 3000 | Open in IMG/M |
3300032163|Ga0315281_10277180 | Not Available | 1840 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
⦗Top⦘ |
Habitat | Taxonomy | Distribution |
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 25.71% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 14.29% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 4.76% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.81% |
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.81% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.81% |
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 2.86% |
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 2.86% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.86% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.86% |
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 2.86% |
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.86% |
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.90% |
Wetland Sediment | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment | 1.90% |
Sediment | Environmental → Aquatic → Marine → Sediment → Unclassified → Sediment | 1.90% |
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 1.90% |
Sediment (Intertidal) | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal) | 1.90% |
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.90% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.90% |
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 1.90% |
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.90% |
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.90% |
Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment | 0.95% |
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.95% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.95% |
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.95% |
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 0.95% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.95% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.95% |
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.95% |
Visualization |
---|
Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Taxon OID | Sample Name | Habitat Type | IMG/M Link |
---|---|---|---|
2140918013 | Soil microbial communities from Great Prairies - Iowa soil (MSU Assemblies) | Environmental | Open in IMG/M |
2228664021 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental | Open in IMG/M |
3300000033 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental | Open in IMG/M |
3300000787 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental | Open in IMG/M |
3300002560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm | Environmental | Open in IMG/M |
3300003319 | Sugarcane bulk soil Sample L2 | Environmental | Open in IMG/M |
3300003321 | Sugarcane bulk soil Sample H1 | Environmental | Open in IMG/M |
3300004281 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 30 MoBio | Environmental | Open in IMG/M |
3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental | Open in IMG/M |
3300004778 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare3Fresh | Environmental | Open in IMG/M |
3300004808 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare1Fresh | Environmental | Open in IMG/M |
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental | Open in IMG/M |
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental | Open in IMG/M |
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental | Open in IMG/M |
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
3300005340 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG | Environmental | Open in IMG/M |
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental | Open in IMG/M |
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental | Open in IMG/M |
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental | Open in IMG/M |
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental | Open in IMG/M |
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental | Open in IMG/M |
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental | Open in IMG/M |
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental | Open in IMG/M |
3300005829 | Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.190_CBC | Environmental | Open in IMG/M |
3300006057 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012 | Environmental | Open in IMG/M |
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated | Open in IMG/M |
3300006846 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4 | Host-Associated | Open in IMG/M |
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental | Open in IMG/M |
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental | Open in IMG/M |
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental | Open in IMG/M |
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental | Open in IMG/M |
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated | Open in IMG/M |
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated | Open in IMG/M |
3300009515 | Microbial community of beach aquifer sediment core from Cape Shores, Lewes, Delaware, USA - CF-2 | Environmental | Open in IMG/M |
3300009610 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700 | Environmental | Open in IMG/M |
3300009800 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_30_40 | Environmental | Open in IMG/M |
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental | Open in IMG/M |
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental | Open in IMG/M |
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental | Open in IMG/M |
3300011413 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT231_2 | Environmental | Open in IMG/M |
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental | Open in IMG/M |
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental | Open in IMG/M |
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental | Open in IMG/M |
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental | Open in IMG/M |
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental | Open in IMG/M |
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental | Open in IMG/M |
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental | Open in IMG/M |
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental | Open in IMG/M |
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental | Open in IMG/M |
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental | Open in IMG/M |
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental | Open in IMG/M |
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental | Open in IMG/M |
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300014271 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberrySE_CattailA_D2 | Environmental | Open in IMG/M |
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental | Open in IMG/M |
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
3300015372 | Soil combined assembly | Host-Associated | Open in IMG/M |
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
3300018052 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2 | Environmental | Open in IMG/M |
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental | Open in IMG/M |
3300018075 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1 | Environmental | Open in IMG/M |
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental | Open in IMG/M |
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
3300019233 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300020065 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT499_16_1Ra (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300021081 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redo | Environmental | Open in IMG/M |
3300021307 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_06_16RNAfungal (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300022209 | Sediment microbial communities from San Francisco Bay, California, United States - SF_Jul11_sed_USGS_13 | Environmental | Open in IMG/M |
3300022226 | Sediment microbial communities from San Francisco Bay, California, United States - SF_May12_sed_USGS_13 | Environmental | Open in IMG/M |
3300022893 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S126-311R-4 | Environmental | Open in IMG/M |
3300023260 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S197-509C-6 | Environmental | Open in IMG/M |
3300025925 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300026325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes) | Environmental | Open in IMG/M |
3300026376 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-B | Environmental | Open in IMG/M |
3300026480 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-B | Environmental | Open in IMG/M |
3300026497 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-B | Environmental | Open in IMG/M |
3300026527 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes) | Environmental | Open in IMG/M |
3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental | Open in IMG/M |
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental | Open in IMG/M |
3300027324 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60 (SPAdes) | Environmental | Open in IMG/M |
3300027655 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes) | Environmental | Open in IMG/M |
3300027799 (restricted) | Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MG | Environmental | Open in IMG/M |
3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental | Open in IMG/M |
3300027949 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_40_50 (SPAdes) | Environmental | Open in IMG/M |
3300028590 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day30 | Environmental | Open in IMG/M |
3300030993 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_185 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300031093 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_198 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300031226 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_S | Environmental | Open in IMG/M |
3300031548 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-3 | Host-Associated | Open in IMG/M |
3300031562 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D3 | Environmental | Open in IMG/M |
3300031892 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D2 | Environmental | Open in IMG/M |
3300031944 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D1 | Environmental | Open in IMG/M |
3300032017 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D4 | Environmental | Open in IMG/M |
3300032163 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G07_0 | Environmental | Open in IMG/M |
Geographical Distribution | |
---|---|
Zoom: | Powered by OpenStreetMap |
⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Protein ID | Sample Taxon ID | Habitat | Sequence |
Iowa-Corn-GraphCirc_03333840 | 2140918013 | Soil | KDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSQILRAMLRLPDESVQEPSSLEAPCPQGKSKRQRH |
ICCgaii200_09659271 | 2228664021 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSQILRAMLRLPDESVQEPSSLEAPCPQGKSKRQRH |
ICChiseqgaiiDRAFT_06220382 | 3300000033 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSQILRAMLRLPDESVQEPSSLEAPCPQGKSKRQRH* |
JGI11643J11755_113940941 | 3300000787 | Soil | LKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSQILRAMLRLPDESVQEPSSLEAPCPQGKSKRQRH* |
JGI25383J37093_101833262 | 3300002560 | Grasslands Soil | MCLLWTAVATPMLQRLQLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSQILRAMLRLPDESVQEPSALEAPFPQVKSKRQRH* |
soilL2_100983508 | 3300003319 | Sugarcane Root And Bulk Soil | MGLLWTAVVTLMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSEDERNRVSQILRALLRLPDESVQEPSTLAAPFPQVQSKRQRHCAKALCRRQRPASLSHWA* |
soilH1_100683035 | 3300003321 | Sugarcane Root And Bulk Soil | MGLLWTAVVTLMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSEDERNRVSQILRAMLRLPDESVQEPSTLAAPFPQVQSKRQRHCAKALCRRQRPASLSHWA* |
Ga0066397_100212511 | 3300004281 | Tropical Forest Soil | MCLLWTAVVTPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSQILRAMLRLPDKSVQEPSALEAPFSQVKSKRQRH* |
Ga0062592_1000182364 | 3300004480 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH* |
Ga0062383_101767972 | 3300004778 | Wetland Sediment | MCSLWTAVATTMLQRLKLLKDFLGLYLRPHQGMALLACVQASDLGEADRARVSHILRAMLRLPAESLQEPSAPPAPFPQGKAKRQRH* |
Ga0062381_102672012 | 3300004808 | Wetland Sediment | TAVATTMLQRLKLLKDFLGLYLRPHQGMALLACVQASDLGEADRARVSHILRAMLRLPAESLQEPSAPPAPFPQGKAKRQRH* |
Ga0066680_106700712 | 3300005174 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNHILRAMLRLPDASLQEPSSLEAPLPQGKLKRQRH* |
Ga0066688_104779241 | 3300005178 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRHRVSHILRAMLRLPDESGQEPSSLEAPFPRVKSKCQRH* |
Ga0066685_104527212 | 3300005180 | Soil | MCLLWTAVATPMLQQLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0070690_1002268361 | 3300005330 | Switchgrass Rhizosphere | MLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH* |
Ga0066388_1023785852 | 3300005332 | Tropical Forest Soil | MCLLWTAVTTPMLQRLRLLKDLLGLYLSRRQAMALLARVQASNLSEDDRNRVSHILRAMLRLPEESFQEPSSLDAPCPQVPAKRQRH* |
Ga0066388_1061818022 | 3300005332 | Tropical Forest Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLRRRQAMALLERVQASNLSDDDRNRVSHILRAMLRLPDESVQEASSREAPLPQVKSKRQRH* |
Ga0070689_1017448231 | 3300005340 | Switchgrass Rhizosphere | LWTAVATPMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH* |
Ga0070698_1003015142 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MLQRLKLLKDVVGLYLNRQKAMALLEHVQASNLSDDDRNRVSHILRAMLRLPDKSVQEPSSREAPFPQVNSKRQRH* |
Ga0070695_1001499861 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | QPMCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH* |
Ga0066701_102372442 | 3300005552 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0066661_100652242 | 3300005554 | Soil | LLNALALVTIPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0066692_105835932 | 3300005555 | Soil | MCLLWTTVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQRH* |
Ga0066691_101602922 | 3300005586 | Soil | MCLLWTTVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNHILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0066905_1017541291 | 3300005713 | Tropical Forest Soil | YLSRRQAMALLERVQASNLSDDDRNRVSHILRAMLRLPDESMQEASSREAPLPQVKSKRQRH* |
Ga0074479_101544642 | 3300005829 | Sediment (Intertidal) | MCSLWTAVATTMLQRLKLLKDFLALYLRPHQGIALLERVQASHLGEADRARVSHILRVMLRLPAASLQEPSAPQAPFPQGKTKRQGH* |
Ga0074479_111607091 | 3300005829 | Sediment (Intertidal) | MCSLWTAVATTMLQWLQLLKDFLGLYLRRHQGMALLEHVQASNLSDDDRARVSHILRAMLRLPAESLPEPSAPQAPYPQSNAQGQRH* |
Ga0075026_1007946102 | 3300006057 | Watersheds | PMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0075421_1013662172 | 3300006845 | Populus Rhizosphere | MCLLWTAVATPMLQRLKPLKDLFGLYLSRRQAMALLERVQASNLSDDDRNRVSQILRAMLRLPDESVQEPSSLEAPCPQGKSKRQRH* |
Ga0075430_1006937231 | 3300006846 | Populus Rhizosphere | WTAVTTPMLQRLKLLKDLLGLYLSRRQAMALLARVQASNLSEDDCNRVSHILRAMLRRPEESFQEPSSLDAPCPQVPAKRQRH* |
Ga0099791_100090874 | 3300007255 | Vadose Zone Soil | MCLLWTTVATPMLQRLKLLKDLLGLSLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQRH* |
Ga0099791_100154001 | 3300007255 | Vadose Zone Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNHILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0099793_107073892 | 3300007258 | Vadose Zone Soil | MCLLWTTVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLAQVKSKRQCH* |
Ga0099794_103436652 | 3300007265 | Vadose Zone Soil | MCLLWTTVATPMLQRPKLLKDLLGLSLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQRH* |
Ga0066710_1031972752 | 3300009012 | Grasslands Soil | AVATPMLQQLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0099827_104467021 | 3300009090 | Vadose Zone Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKLKRQRH* |
Ga0099827_104499301 | 3300009090 | Vadose Zone Soil | MCLLWTAVATPMWPRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKLKRQRH* |
Ga0066709_1008732522 | 3300009137 | Grasslands Soil | VATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0099792_103503332 | 3300009143 | Vadose Zone Soil | WTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQRH* |
Ga0114129_125664891 | 3300009147 | Populus Rhizosphere | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSQILRAMLRLPDESVQEPSSLEAPFPQVTSKRQRH* |
Ga0105243_112778811 | 3300009148 | Miscanthus Rhizosphere | MCLLWTAVATLMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH* |
Ga0129286_100032025 | 3300009515 | Sediment | MYSLWTAVVTPMLQRLKRLKDVLWLYLDRGQAKALLESVQASHLSDEDRDRVSHILRVMLRLPEDPVQEPSGPEAP* |
Ga0105340_10736662 | 3300009610 | Soil | MCLLWTAVVTPMLQRLKLLKDLLGLYLNRRQAMALLERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPVPQVKSKRQRH* |
Ga0105069_10419261 | 3300009800 | Groundwater Sand | MLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0126380_101298121 | 3300010043 | Tropical Forest Soil | TPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSQILRAMLRLPDKSVQEPSALEAPFSQVKSKRQRH* |
Ga0126384_105315112 | 3300010046 | Tropical Forest Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSHILRAMLRLPDESVQEASSREAPLPQVKSKRQRH* |
Ga0126384_111111492 | 3300010046 | Tropical Forest Soil | CLLWTAVTTPMLQRLRLLKDLLRLYLSRRQAMAFLARVQASNLSEDDRNRVSHILRAMLRLPEESFQEPSSLDAPCPQVPAKRQRH* |
Ga0137391_110075011 | 3300011270 | Vadose Zone Soil | MLQRLKLLKDLLGLYLSRRQAMALLERVQTSNPSDADRNRVRHILRAMLRPPDESVQEPSSRAAPLPQVKSKRQRH* |
Ga0137333_10587241 | 3300011413 | Soil | MCLLWTAVATPMLQRLTLLKDLLGLSLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0137388_114166232 | 3300012189 | Vadose Zone Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNFSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0137383_101672834 | 3300012199 | Vadose Zone Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNCVSHILRAMLRLPDESVQELSSREAPFPQVKSKRQRH* |
Ga0137365_106747872 | 3300012201 | Vadose Zone Soil | MCLLWTVVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0137363_112990611 | 3300012202 | Vadose Zone Soil | MLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQRH* |
Ga0137399_106594292 | 3300012203 | Vadose Zone Soil | VATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQCH* |
Ga0137380_102720692 | 3300012206 | Vadose Zone Soil | MCLLWTVVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKLKRQRH* |
Ga0137379_104641812 | 3300012209 | Vadose Zone Soil | MLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0137386_100540831 | 3300012351 | Vadose Zone Soil | LKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKLKRQRH* |
Ga0137371_107498111 | 3300012356 | Vadose Zone Soil | MCLLGTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSQILRAMLRLPDESVQEPSSLEAPCPQVKSKRQRH* |
Ga0137397_100416141 | 3300012685 | Vadose Zone Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQVMALLERVQASNLSDADRNRVSQILRAMLRLPDESVQEPSALEAPFPQVQSKR* |
Ga0137397_107535702 | 3300012685 | Vadose Zone Soil | MCLLWTTVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQCH* |
Ga0137396_103318701 | 3300012918 | Vadose Zone Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSQILRAMLRLPDESVQEPSSLEAPFPQVKSKR* |
Ga0137359_100654531 | 3300012923 | Vadose Zone Soil | MCLLWTTVATPMLQRLKLLKDLLGLSLSRRQAMALLECVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQRH* |
Ga0137419_113477342 | 3300012925 | Vadose Zone Soil | CLLWTTVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQRH* |
Ga0137404_103227462 | 3300012929 | Vadose Zone Soil | MCLLWTTVATPMLQRLKLLKDLLGLSLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESMQEPSSRAAPLPQVKSKRQRH* |
Ga0075326_12624501 | 3300014271 | Natural And Restored Wetlands | MYSLWTTVATTMFQRLKLFKDLLGLYLNRRQGKELLEQVQASNLSDDDRDRVSQILRLMLRLPDESLQEPSSPEIPLPVRPTP |
Ga0137420_12393981 | 3300015054 | Vadose Zone Soil | MCLLWTAVATPMLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNHILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH* |
Ga0137403_103951312 | 3300015264 | Vadose Zone Soil | MCLLWTAVATPMLQRLKLLKDLLGLSLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQRH* |
Ga0132256_1005237692 | 3300015372 | Arabidopsis Rhizosphere | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPEESFQEPSSLDAPCPQVPAKRQRH* |
Ga0134083_103047491 | 3300017659 | Grasslands Soil | MMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0184638_10401431 | 3300018052 | Groundwater Sediment | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDHDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0184618_102235401 | 3300018071 | Groundwater Sediment | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKAKRQRHASFSHWA |
Ga0184632_100488692 | 3300018075 | Groundwater Sediment | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKAKRQRH |
Ga0190265_108080392 | 3300018422 | Soil | MCLLWTVVTTPMLQRLKLLKDLLGLYLSRRQAMALLKRVQASNLSDDDRNRVSHILRAMLRLPDESLQELSSLEAPLPQVKSKRQRH |
Ga0066662_113447492 | 3300018468 | Grasslands Soil | MCLLWTTVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQRH |
Ga0184645_11986541 | 3300019233 | Groundwater Sediment | LLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPEASLQEPSSLEAPLPQGKAKRQRH |
Ga0180113_11022692 | 3300020065 | Groundwater Sediment | LLWTAVATPMLQRLTLLKDLLGLSLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0210379_105101342 | 3300021081 | Groundwater Sediment | MCLLWTAVATPMLQRLTLLKDLLGLSLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0179585_10662713 | 3300021307 | Vadose Zone Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNHILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0224497_101127502 | 3300022209 | Sediment | MCLLREAVGSAMFQRLKLFKDLLGLYLSRRQAMALLECVDASNLSDDARKRVSHILRAMLRLPDASLQEPSSLEAPLARVKSKRQGH |
Ga0224512_101748193 | 3300022226 | Sediment | MCLLREAVGNAMFQRLKLLKDLLGLYLSRRQAVALLESVEASNLSDDDRKRVSHILRAMLRLPDTSLQEPSSLEAPLARVKSKRQGH |
Ga0247787_10689641 | 3300022893 | Soil | VATPMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH |
Ga0247798_10015611 | 3300023260 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH |
Ga0207650_106493601 | 3300025925 | Switchgrass Rhizosphere | MLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH |
Ga0209152_102648691 | 3300026325 | Soil | MLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSCAAPLPQ |
Ga0257167_10833831 | 3300026376 | Soil | LQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0257177_10475801 | 3300026480 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0257164_10577941 | 3300026497 | Soil | MLQRLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0209059_13495802 | 3300026527 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRHRVSHILRAMLRLPDESGQEPSSLEAPFPRVKSKCQRH |
Ga0209805_12335871 | 3300026542 | Soil | MCLLWTTVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRHRVSHILRAMLRLPDESGQEPSSLEAPFPRVKSKCQRH |
Ga0209161_100407185 | 3300026548 | Soil | MCLLWTAVATPMLQQLKLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0209845_10295182 | 3300027324 | Groundwater Sand | VATPMLQRLKLLKDLLGIYLNRQHGLAVLERVQASNLSDDDRDRVTHIMRAMLRLPEAPLHKPSSPEAP |
Ga0209388_11406482 | 3300027655 | Vadose Zone Soil | MCLLWTTVATPMLQRLKLLKDLLGLSLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSRAAPLPQVKSKRQRH |
(restricted) Ga0233416_100104201 | 3300027799 | Sediment | MCLLWTAVATPMLQRLKLLKDLFGLYLSRRQAMALLERVQASNLSDADRNRVSHILRAMLRLPDESVQEPSSREAPFPQVKSKRQRH |
Ga0209583_106085232 | 3300027910 | Watersheds | LWTAVATPMWQRLTLRKDLLGLSLSRRQAMALRERIHASNLSDDDRHRVTPILRAMLRLPKASWQEPSALEAPFPQGTSKRQRP |
Ga0209860_10513102 | 3300027949 | Groundwater Sand | MFQGLKLLKRLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPFPQGKSKRQRP |
Ga0247823_115024812 | 3300028590 | Soil | DLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH |
Ga0308190_11809441 | 3300030993 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQVMALLERIQASNLSDDDRNRVNHILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0308197_100413412 | 3300031093 | Soil | MCLLWTAVATPMLQRLTLLKDLLGLYLSRRQAMALLERIQASNLSDDDRNRVNQILRAMLRLPDASLQEPSSLEAPLPQGKSKRQRH |
Ga0307497_104920481 | 3300031226 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSRAAPFPQAKSKRQRH |
Ga0307408_1000378625 | 3300031548 | Rhizosphere | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALLARVQASNLSEDDRNRVSHILRAMLRLPEESLQEPSSLEAPFPQVTAKRQRH |
Ga0310886_107838972 | 3300031562 | Soil | MCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQ |
Ga0310893_100069951 | 3300031892 | Soil | QPMCLLWTAVATPMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH |
Ga0310884_104636272 | 3300031944 | Soil | ATPMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH |
Ga0310899_100082794 | 3300032017 | Soil | TAVATPMLQRLKLLKDLLGLYLSRRQAMALIERVQASNLSDDDRNRVSHILRAMLRLPDESVQEPSSREAPLPQVKSKRQRH |
Ga0315281_102771801 | 3300032163 | Sediment | MCSLWTAVATTMLQWLQLLKDFLGLYLRRHQGMALLEHVQASNLSDDDRARVSHILRAMLRLPAQSLPEPSAPQAPYPQSNAQGQRH |
⦗Top⦘ |