NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F058487

Metagenome / Metatranscriptome Family F058487

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F058487
Family Type Metagenome / Metatranscriptome
Number of Sequences 135
Average Sequence Length 93 residues
Representative Sequence MARRSGVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALSLGSVRGLDGLTDVLRAGRVPVNEIEKAWRVLAVDTTHQIPNVTLTRAVIRLLNL
Number of Associated Samples 121
Number of Associated Scaffolds 135

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 22.22 %
% of genes near scaffold ends (potentially truncated) 27.41 %
% of genes from short scaffolds (< 2000 bps) 82.22 %
Associated GOLD sequencing projects 114
AlphaFold2 3D model prediction Yes
3D model pTM-score0.72

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (60.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(19.259 % of family members)
Environment Ontology (ENVO) Unclassified
(29.630 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(41.481 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 24.39%    β-sheet: 25.20%    Coil/Unstructured: 50.41%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.72
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
d.177.1.1: FAHd1gtta11gtt0.55014
d.177.1.0: automated matchesd5d2ka_5d2k0.53873
d.189.1.0: automated matchesd4hasa_4has0.53392
d.177.1.1: FAHd1nkqa_1nkq0.53183
d.92.1.0: automated matchesd4iuwa_4iuw0.53081


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 135 Family Scaffolds
PF00072Response_reg 12.59
PF01068DNA_ligase_A_M 5.93
PF03404Mo-co_dimer 5.19
PF00753Lactamase_B 3.70
PF00691OmpA 2.96
PF02518HATPase_c 2.22
PF04255DUF433 1.48
PF13560HTH_31 1.48
PF05359DUF748 1.48
PF12850Metallophos_2 1.48
PF05494MlaC 1.48
PF14226DIOX_N 0.74
PF01975SurE 0.74
PF00571CBS 0.74
PF00392GntR 0.74
PF01842ACT 0.74
PF04972BON 0.74
PF09335SNARE_assoc 0.74
PF00589Phage_integrase 0.74
PF00528BPD_transp_1 0.74
PF060523-HAO 0.74
PF04909Amidohydro_2 0.74
PF07589PEP-CTERM 0.74
PF01208URO-D 0.74

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 135 Family Scaffolds
COG1423ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) familyReplication, recombination and repair [L] 5.93
COG1793ATP-dependent DNA ligaseReplication, recombination and repair [L] 5.93
COG2442Predicted antitoxin component of a toxin-antitoxin system, DUF433 familyDefense mechanisms [V] 1.48
COG2854Periplasmic subunit MlaC of the ABC-type intermembrane phospholipid transporter MlaCell wall/membrane/envelope biogenesis [M] 1.48
COG2982Uncharacterized conserved protein AsmA involved in outer membrane biogenesisCell wall/membrane/envelope biogenesis [M] 1.48
COG0398Uncharacterized membrane protein YdjX, related to fungal oxalate transporter, TVP38/TMEM64 familyFunction unknown [S] 0.74
COG0407Uroporphyrinogen-III decarboxylase HemECoenzyme transport and metabolism [H] 0.74
COG0496Broad specificity polyphosphatase and 5'/3'-nucleotidase SurEReplication, recombination and repair [L] 0.74
COG0586Membrane integrity protein DedA, putative transporter, DedA/Tvp38 familyCell wall/membrane/envelope biogenesis [M] 0.74
COG1238Uncharacterized membrane protein YqaA, VTT domainFunction unknown [S] 0.74


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms60.00 %
UnclassifiedrootN/A40.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2162886012|MBSR1b_contig_12683030All Organisms → cellular organisms → Bacteria → Proteobacteria955Open in IMG/M
3300000550|F24TB_10587115All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium1546Open in IMG/M
3300000559|F14TC_100001465All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium597Open in IMG/M
3300000956|JGI10216J12902_105803683All Organisms → cellular organisms → Bacteria1929Open in IMG/M
3300002245|JGIcombinedJ26739_100833947All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria803Open in IMG/M
3300003347|JGI26128J50194_1007353All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium710Open in IMG/M
3300003911|JGI25405J52794_10123841Not Available584Open in IMG/M
3300004139|Ga0058897_10812338All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1350Open in IMG/M
3300004156|Ga0062589_100992256Not Available783Open in IMG/M
3300004463|Ga0063356_100013818All Organisms → cellular organisms → Bacteria → Proteobacteria7374Open in IMG/M
3300004479|Ga0062595_100076651All Organisms → cellular organisms → Bacteria1670Open in IMG/M
3300005174|Ga0066680_10137072All Organisms → cellular organisms → Bacteria1524Open in IMG/M
3300005293|Ga0065715_10072444All Organisms → cellular organisms → Bacteria609Open in IMG/M
3300005295|Ga0065707_10208229All Organisms → cellular organisms → Bacteria1278Open in IMG/M
3300005332|Ga0066388_103334223Not Available820Open in IMG/M
3300005341|Ga0070691_10776914Not Available581Open in IMG/M
3300005437|Ga0070710_10387850All Organisms → cellular organisms → Bacteria → Proteobacteria933Open in IMG/M
3300005445|Ga0070708_100003573All Organisms → cellular organisms → Bacteria12188Open in IMG/M
3300005467|Ga0070706_100612612All Organisms → cellular organisms → Bacteria1011Open in IMG/M
3300005471|Ga0070698_100755516All Organisms → cellular organisms → Bacteria915Open in IMG/M
3300005563|Ga0068855_102417011All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium524Open in IMG/M
3300005568|Ga0066703_10398678Not Available826Open in IMG/M
3300005586|Ga0066691_10170958All Organisms → cellular organisms → Bacteria1257Open in IMG/M
3300005713|Ga0066905_100002890All Organisms → cellular organisms → Bacteria → Proteobacteria6598Open in IMG/M
3300005875|Ga0075293_1002916All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1581Open in IMG/M
3300005876|Ga0075300_1023949Not Available788Open in IMG/M
3300005881|Ga0075294_1038684Not Available512Open in IMG/M
3300005937|Ga0081455_10001029All Organisms → cellular organisms → Bacteria → Proteobacteria35164Open in IMG/M
3300006047|Ga0075024_100449758Not Available665Open in IMG/M
3300006163|Ga0070715_10200848All Organisms → cellular organisms → Bacteria → Proteobacteria1013Open in IMG/M
3300006755|Ga0079222_10410330All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales948Open in IMG/M
3300006804|Ga0079221_10489860All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria794Open in IMG/M
3300006806|Ga0079220_10422708All Organisms → cellular organisms → Bacteria880Open in IMG/M
3300006854|Ga0075425_100461052All Organisms → cellular organisms → Bacteria1463Open in IMG/M
3300006914|Ga0075436_100257330All Organisms → cellular organisms → Bacteria → Proteobacteria1244Open in IMG/M
3300007255|Ga0099791_10328823Not Available730Open in IMG/M
3300007265|Ga0099794_10237595All Organisms → cellular organisms → Bacteria938Open in IMG/M
3300009038|Ga0099829_10034029All Organisms → cellular organisms → Bacteria3661Open in IMG/M
3300009038|Ga0099829_10442626Not Available1076Open in IMG/M
3300009089|Ga0099828_10096877All Organisms → cellular organisms → Bacteria2544Open in IMG/M
3300009143|Ga0099792_10910164Not Available583Open in IMG/M
3300009148|Ga0105243_12611269Not Available545Open in IMG/M
3300010047|Ga0126382_10065655All Organisms → cellular organisms → Bacteria2200Open in IMG/M
3300010362|Ga0126377_11097803All Organisms → cellular organisms → Bacteria864Open in IMG/M
3300011270|Ga0137391_11109194Not Available639Open in IMG/M
3300012202|Ga0137363_10642624Not Available897Open in IMG/M
3300012203|Ga0137399_10130328All Organisms → cellular organisms → Bacteria1987Open in IMG/M
3300012205|Ga0137362_10153830Not Available1968Open in IMG/M
3300012205|Ga0137362_10743027Not Available842Open in IMG/M
3300012211|Ga0137377_10156279Not Available2186Open in IMG/M
3300012211|Ga0137377_11247620Not Available673Open in IMG/M
3300012349|Ga0137387_10843852All Organisms → cellular organisms → Bacteria663Open in IMG/M
3300012355|Ga0137369_10214235All Organisms → cellular organisms → Bacteria1480Open in IMG/M
3300012361|Ga0137360_11886345Not Available503Open in IMG/M
3300012363|Ga0137390_11061080All Organisms → cellular organisms → Bacteria760Open in IMG/M
3300012685|Ga0137397_10043464All Organisms → cellular organisms → Bacteria3214Open in IMG/M
3300012906|Ga0157295_10025716All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium1224Open in IMG/M
3300012912|Ga0157306_10091115All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium864Open in IMG/M
3300012922|Ga0137394_10036498All Organisms → cellular organisms → Bacteria4015Open in IMG/M
3300012929|Ga0137404_10975192All Organisms → cellular organisms → Bacteria775Open in IMG/M
3300012929|Ga0137404_11136053Not Available717Open in IMG/M
3300012944|Ga0137410_10020270All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium4574Open in IMG/M
3300015077|Ga0173483_10131885All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium1081Open in IMG/M
3300015241|Ga0137418_11009220Not Available601Open in IMG/M
3300015264|Ga0137403_10893171Not Available739Open in IMG/M
3300015371|Ga0132258_12293281Not Available1354Open in IMG/M
3300017936|Ga0187821_10078494Not Available1201Open in IMG/M
3300017993|Ga0187823_10074464Not Available975Open in IMG/M
3300017994|Ga0187822_10024305All Organisms → cellular organisms → Bacteria1572Open in IMG/M
3300017997|Ga0184610_1068333All Organisms → cellular organisms → Bacteria1081Open in IMG/M
3300018000|Ga0184604_10020435All Organisms → cellular organisms → Bacteria1560Open in IMG/M
3300018027|Ga0184605_10153116All Organisms → cellular organisms → Bacteria1038Open in IMG/M
3300018028|Ga0184608_10412221Not Available585Open in IMG/M
3300018052|Ga0184638_1128391Not Available924Open in IMG/M
3300018053|Ga0184626_10101016All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1222Open in IMG/M
3300018054|Ga0184621_10067231Not Available1227Open in IMG/M
3300018075|Ga0184632_10098083Not Available1284Open in IMG/M
3300018075|Ga0184632_10334984All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium651Open in IMG/M
3300018076|Ga0184609_10042738Not Available1910Open in IMG/M
3300018078|Ga0184612_10043236Not Available2341Open in IMG/M
3300018422|Ga0190265_10121216All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Tepidamorphaceae → Lutibaculum2499Open in IMG/M
3300018429|Ga0190272_10078005All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi2039Open in IMG/M
3300018429|Ga0190272_10558414All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium991Open in IMG/M
3300018429|Ga0190272_13150965Not Available513Open in IMG/M
3300019458|Ga0187892_10003929All Organisms → cellular organisms → Bacteria → Proteobacteria26092Open in IMG/M
3300020003|Ga0193739_1014147All Organisms → cellular organisms → Bacteria2085Open in IMG/M
3300020003|Ga0193739_1155935Not Available542Open in IMG/M
3300020006|Ga0193735_1137925Not Available647Open in IMG/M
3300020170|Ga0179594_10415996Not Available510Open in IMG/M
3300020580|Ga0210403_10238978Not Available1491Open in IMG/M
3300021073|Ga0210378_10330537Not Available570Open in IMG/M
3300021088|Ga0210404_10208290Not Available1051Open in IMG/M
3300021171|Ga0210405_10359892All Organisms → cellular organisms → Bacteria → Proteobacteria1148Open in IMG/M
3300021406|Ga0210386_11542553Not Available553Open in IMG/M
3300021432|Ga0210384_10059625Not Available3450Open in IMG/M
3300021972|Ga0193737_1057595Not Available552Open in IMG/M
3300022726|Ga0242654_10026815All Organisms → cellular organisms → Bacteria1472Open in IMG/M
3300023066|Ga0247793_1006172All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium1625Open in IMG/M
3300025905|Ga0207685_10819116Not Available514Open in IMG/M
3300025910|Ga0207684_10005007All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium12350Open in IMG/M
3300025910|Ga0207684_10268421All Organisms → cellular organisms → Bacteria1472Open in IMG/M
3300025910|Ga0207684_10444562All Organisms → cellular organisms → Bacteria1113Open in IMG/M
3300025912|Ga0207707_10342441All Organisms → cellular organisms → Bacteria1289Open in IMG/M
3300025915|Ga0207693_10266435All Organisms → cellular organisms → Bacteria1343Open in IMG/M
3300026005|Ga0208285_1003731Not Available993Open in IMG/M
3300026285|Ga0209438_1139177Not Available645Open in IMG/M
3300026360|Ga0257173_1049138Not Available595Open in IMG/M
3300026371|Ga0257179_1017596Not Available807Open in IMG/M
3300026482|Ga0257172_1016120All Organisms → cellular organisms → Bacteria1283Open in IMG/M
3300026514|Ga0257168_1003028All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2577Open in IMG/M
3300026514|Ga0257168_1077225All Organisms → cellular organisms → Bacteria737Open in IMG/M
3300026535|Ga0256867_10018752Not Available2977Open in IMG/M
3300026793|Ga0207441_104456All Organisms → cellular organisms → Bacteria594Open in IMG/M
3300027552|Ga0209982_1017678All Organisms → cellular organisms → Bacteria1088Open in IMG/M
3300027617|Ga0210002_1018952Not Available1102Open in IMG/M
3300027651|Ga0209217_1075253All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria987Open in IMG/M
3300027846|Ga0209180_10018571All Organisms → cellular organisms → Bacteria3665Open in IMG/M
3300028814|Ga0307302_10479695All Organisms → cellular organisms → Bacteria617Open in IMG/M
3300028819|Ga0307296_10535196All Organisms → cellular organisms → Bacteria641Open in IMG/M
3300028828|Ga0307312_10300775All Organisms → cellular organisms → Bacteria1045Open in IMG/M
3300028884|Ga0307308_10469957All Organisms → cellular organisms → Bacteria603Open in IMG/M
3300031152|Ga0307501_10088149All Organisms → cellular organisms → Bacteria765Open in IMG/M
(restricted) 3300031197|Ga0255310_10058172Not Available1014Open in IMG/M
3300031229|Ga0299913_10181175All Organisms → cellular organisms → Bacteria → Proteobacteria2083Open in IMG/M
(restricted) 3300031248|Ga0255312_1054354Not Available960Open in IMG/M
3300031716|Ga0310813_10115553All Organisms → cellular organisms → Bacteria2111Open in IMG/M
3300031720|Ga0307469_10382174All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → unclassified Acidimicrobiaceae → Acidimicrobiaceae bacterium1195Open in IMG/M
3300031720|Ga0307469_11381759Not Available671Open in IMG/M
3300031944|Ga0310884_10540868Not Available689Open in IMG/M
3300032180|Ga0307471_100054248All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium3333Open in IMG/M
3300032180|Ga0307471_100888238All Organisms → cellular organisms → Bacteria → Proteobacteria1058Open in IMG/M
3300032180|Ga0307471_102407454All Organisms → cellular organisms → Bacteria665Open in IMG/M
3300032421|Ga0310812_10580376All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria502Open in IMG/M
3300033513|Ga0316628_101506443Not Available896Open in IMG/M
3300034257|Ga0370495_0127299Not Available800Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil19.26%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil14.07%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment8.15%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil8.15%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere8.15%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.70%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment2.22%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil2.22%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.22%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil2.22%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil2.96%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil2.96%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere2.96%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.48%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.48%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.48%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.48%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere1.48%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere1.48%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.48%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.74%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.74%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.74%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.74%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.74%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.74%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil0.74%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.74%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil0.74%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.74%
Bio-OozeEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze0.74%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.74%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.74%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.74%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2162886012Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1Host-AssociatedOpen in IMG/M
3300000550Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemlyEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300003347Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Rhizosphere soil Co-N PMHost-AssociatedOpen in IMG/M
3300003911Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300004139Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF230 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005293Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1Host-AssociatedOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005563Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2Host-AssociatedOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005875Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101EnvironmentalOpen in IMG/M
3300005876Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_401EnvironmentalOpen in IMG/M
3300005881Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_202EnvironmentalOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300006047Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012906Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S212-509R-1EnvironmentalOpen in IMG/M
3300012912Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S163-409C-2EnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015077Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S178-409R-2 (version 2)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300017993Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3EnvironmentalOpen in IMG/M
3300017994Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2EnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300020003Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2a2EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021972Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2m2EnvironmentalOpen in IMG/M
3300022726Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300023066Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S223-509R-6EnvironmentalOpen in IMG/M
3300025905Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026005Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101 (SPAdes)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026535Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq)EnvironmentalOpen in IMG/M
3300026793Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G06A3a-12 (SPAdes)EnvironmentalOpen in IMG/M
3300027552Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027617Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300031152Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 15_SEnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031944Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D1EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032421Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3EnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300034257Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
MBSR1b_0087.000083502162886012Miscanthus RhizosphereMARRSRLGRLRVTRIRPAVDPVNAEYLVTFGSSGTGDITLNLGSIHGLDGLTDVLRAGRVPISEIEKAWRVLAVETTHQIPNVMLTRALLRLLNL
F24TB_1058711523300000550SoilMGRLYITRLRPAVDPVRAEYLLIFGSPTTAEVNLNLGSVRGLDTLTEVLRAARVPLGEIERAWSALVNDTVHEITGVTLTPAVIRLLGF*
F14TC_10000146513300000559SoilMGRLYITRLRPAVDPVRAEYLLIFGSPTTAEVNLNLGSVRGLDTLTEVLRAARVPLGEIERAWSALVNDTVHEITGVTL
JGI10216J12902_10580368343300000956SoilMPMARRSVGRLYITRLRPAVDPVGAEYLLTFGSPTTSEVNLNLGSVRGLDTLTEVLRAAHVSIREIERAWCTLNADTVHEIAGVTLTPTVLRMLAL*
JGIcombinedJ26739_10083394713300002245Forest SoilMARRSRLGRLRVTRIRPAVDPVNAEYLVTFGSSATGDITLNLGSVYGLDGLTDVLRAGRVPLSEIEKAWRVLAVETTHQIPNVMLTRALLRLLNL*
JGI26128J50194_100735323300003347Arabidopsis Thaliana RhizosphereMGRLYITRLRPAVDPVRAEYLLTFGSPITSEVNLNLGSVRGLDTLTDLLRAARVPIGEIERAWSALVSDTVHEIAGVTLTPAAIRILGL*
JGI25405J52794_1012384113300003911Tabebuia Heterophylla RhizosphereMARRSTMGRLYITRLRPAVDPVRAEYLLTFGSPTTAEVNLNLGSVRGLDTLTEVLRAARVPIGEIERAWSTLAADTMHEIPGVILTPAVLRILGL*
Ga0058897_1081233813300004139Forest SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDIMHQIPGIRLTRALIRLLNL*
Ga0062589_10099225613300004156SoilVDPVNAEYLLTFGSPSSAEIALSLGSVRGLDGLTEVLRAGRVPLDEIEKAWRVLAVDIRHQIPNVTLTRALMRLLNL*
Ga0063356_10001381883300004463Arabidopsis Thaliana RhizosphereMARRTNVGRVYVVRLRPAVDPVNAEYLLTFGSPSSAEIALSLGSVRGLDGLTEVLRAGRVPLDEIEKAWRVLAVDIRHQIPNVTLTRALMRLLNL*
Ga0062595_10007665123300004479SoilMARRSRLGRLRVTRIRPAVDPVNAEYLVTFGSSGTGDITLNLGSIHGLDGLTDVLRAGRVPISEIEKAWRVLAVETTHQIPNVMLTRALLRLLNL*
Ga0066680_1013707213300005174SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAEDIKHQIPDVRLTRALIRLLNL*
Ga0065715_1007244423300005293Miscanthus RhizosphereMARRSRLGRLRVTRIRPAVDPVNAEYLVTFGSSGTGDITLNLGSIHGLDGLTDVLRAGRVPISEIEKAWRVLAVETTHQIPNVMLTRALLRLLNX*
Ga0065707_1020822923300005295Switchgrass RhizosphereMAGRNSVGRLYIVRLRPAADPVNAEYLLTFGSPSSAEIALDLGSVRGLDGLTDVLRAGYVPVNEIERAWRVLAVDITHQIPSITLTRAVIRLLNF*
Ga0066388_10333422323300005332Tropical Forest SoilMARRSGVGRLYITRLRPAVDPVGAEYLLTFGSPTTSEINLNLGSVRGLDGLTEVLRAARVPLEEIERAWRVLAADTQHYVSNVTLTRAVLRLLGL*
Ga0070691_1077691423300005341Corn, Switchgrass And Miscanthus RhizosphereMARRSTAGRLYITRLRPAQDPVGATYLLTFGSAGSREIALSLGSVRGLDGLTEVLRAGRVPTGEIEKAWRVLAVETTHQVPNVALTRAMMRRLDL*
Ga0070710_1038785013300005437Corn, Switchgrass And Miscanthus RhizosphereMARRSRLGRLRVTRVRPAVDPVSAEYLVTFGSSATGDITLNLGSVYGLDGLTDVLRAGRVPLSEIEKAWRVLAVETMHQIPNVMLTRALLRLLNL*
Ga0070708_100003573113300005445Corn, Switchgrass And Miscanthus RhizosphereMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVSEIEKAWRVLAVDIMHQVPGIRLTRALIRLLNL*
Ga0070706_10061261223300005467Corn, Switchgrass And Miscanthus RhizosphereMPRRSSVGRLYIRRLRSAVDPVNAEYLLTFGSPTTSEIALNLGTVRGLDGLTSVLRAGRVSLSAIETAWRVLAVEITHQIPNVTLTRAVIQLLNL*
Ga0070698_10075551623300005471Corn, Switchgrass And Miscanthus RhizosphereMPRRSSVGRLYIRRLRPAVDPVNAEYLLTFGSPTTSEIALNLGTVRGLDGLTSVLRAGRVSLSAIETAWRVLAVEITHQIPNVTLTRAVIQLLNL*
Ga0068855_10241701113300005563Corn RhizosphereMGRLYITRLRPAVDPVRAEYLLTFGSPITSEVNLNLGSVRGLDTLTDLLRAARVPIGEIERAWSALVSDTVHEIAGVTLTPAAIRILG
Ga0066703_1039867813300005568SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDIMHQIPDIRLTRALIRLLNL*
Ga0066691_1017095813300005586SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDI
Ga0066905_10000289093300005713Tropical Forest SoilMARLAMGRLYITRLRPAVDPVRAEYLLIFGSPTTAEVNLNLGSVRGLDTLTEVLRAARVPLGEIERAWSALVNDTVHEITGVTLTPAVIRLLGL*
Ga0075293_100291643300005875Rice Paddy SoilMARRSSVGRLYIARVRPAVDPVNAEYVLAFGSPGTREIAFSLGSIRGLDGLTDVLRAGRVPVSEIEKAWRVLAVEMTHLIPNVTLTPALIRLLGL*
Ga0075300_102394923300005876Rice Paddy SoilMASGTRLASPMARRSTAGRLYITRLRPAQDPVGATYLLTFGSAGSREIALSLGSVRGLDGLTEVLRAGRVPTGEIEKAWRVLVVETTHQVPNVALTRALMRRLDL*
Ga0075294_103868423300005881Rice Paddy SoilMARRRSVGRLYITRVRPAVDPVNAEYVLAFGSPGTREIAFSLGSIRGLDGLTDVLRAGRVPVSEIEKAWRVLAVEMTHLI
Ga0081455_10001029213300005937Tabebuia Heterophylla RhizosphereMGRLYITRLRPAVDPVRAEYLLTFGSPTTAEVNLNLGSVRGLDTLTEVLRAARVPIGEIERAWSTLAADTMHEIPGVILTPAVLRILGL*
Ga0075024_10044975813300006047WatershedsMARRSSVGRLYIVRLRPAVDPVNAEYLLTYGAPATTEIALNLGSVRGLDELTDVLRAGRVPVSEIEKAWRVLAVEITHQIPNVTLTRAVIRLLNL*
Ga0070715_1020084813300006163Corn, Switchgrass And Miscanthus RhizosphereMARRSRLGRLRVTRVRPAVDPVSAEYLVTFGSSATGDITLNLGSVYGLDGLTDVLRAGRVPVSEIEKAWRVLAVETMHQIPNVMLTRALLRLLNL*
Ga0079222_1041033033300006755Agricultural SoilMARRSRLGRLRVTRVRPAVDPVSAEYLVTFGSSATGDITLNLGSIHGLDGLTDVLRAGRVPLSEIEKAWRVLAVETTHQIPNVMLTRALLRLLNL*
Ga0079221_1048986023300006804Agricultural SoilMARRSRLGRLRVTRIRPAVDPVNAEYLVTFGSSATGDITLNLGSIHGLDGLTDVLRAGRVPLSEIEKAWRVLAVETTHQIPNVMLTRALLRLLNL*
Ga0079220_1042270823300006806Agricultural SoilMARRSTVGRLYITRIRPAQDPVDATYLLTFGSAGSRDIALSLGSVRGLDGLTGILRAGRVPTGEIEKAWRVLAVETTHQVPNVALTRAMMRRLDL*
Ga0075425_10046105223300006854Populus RhizosphereMARRSRLGRLRVTRIRPAVDPVNAEYLVTFGSSGTGDITLNLGSVYGLDGLTDVLRAGRVPLSEIEKAWRVLAVETTHQIPNVMLTRALLRLLNL*
Ga0075436_10025733013300006914Populus RhizosphereRHYADPQMARRSRLGRLRVTRIRPAVDPVNAEYLVTFGSSGTGDITLNLGSVYGLDGLTDVLRAGRVPLSEIEKAWRVLAVETTHQIPNVMLTRALLRLLNL*
Ga0099791_1032882313300007255Vadose Zone SoilMARRSSVGRLYIRRLRPAVDPVNAEYLLTFGSPTTSEIALNLGTVRGLDGLTSVLRAGRVSLSAIETAWRVLAVEITHQIPNVTLTRAVIQLL
Ga0099794_1023759523300007265Vadose Zone SoilMARRSSVGRLYIRRLRPAVDPVNAEYLLTFGSPTTSEIALNLGTVRGLDGLTSVLRAGRVSLSAIETAWRVLAVEITHQIPNVTLTRAVIQLLNL*
Ga0099829_1003402963300009038Vadose Zone SoilMARRSSVGRLYIVRFRPAVDPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGRVPVKEIEKAWRVLAVEFRHQIPNVMLTRALIQLLNL*
Ga0099829_1044262613300009038Vadose Zone SoilMARCSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGGVPVNEIEKAWRVLAVDIMHQIPGIRLTRALIRLLNL*
Ga0099828_1009687733300009089Vadose Zone SoilMARRSIVGRLYIVRFRPAVDPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGRVPVKEIEKAWRVLAVEFRHQIPNVMLTRALIQLLNL*
Ga0099792_1091016423300009143Vadose Zone SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGGVPVNEIEKAWRVLAVDIMHQIPGIRLTRALIRLLNL*
Ga0105243_1261126913300009148Miscanthus RhizosphereVRLRPAADPVNAEYLLTFGSPSSAEIALDLGSVRGLDGLTDVLRAGYVPVNEIERAWRVLAVDITHQIPSITLTRAVIRLLNF*
Ga0126382_1006565513300010047Tropical Forest SoilMARRLAMGRLYITRLRPAVDPVRAEYLLIFGSPTTAEVNLNLGSVRGLDTLTEVLRAARVPLGEIERAWSALVNDTVHEITGV
Ga0126377_1109780323300010362Tropical Forest SoilMARRSVGRLYITRLRPAVDPVGAEYLLTFGSPTTSEVNLNLGSVRGLDTLTEVLRAAHVSIREIERAWCTLAADTVHEIAGVTLTPTVLRMLAL*
Ga0137391_1110919413300011270Vadose Zone SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLKDVLRAGRVPVNEIEKAWRVLAVDIMHQIPDIRLTRALIRLLNL*
Ga0137363_1064262413300012202Vadose Zone SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDIKHQIPDVRLTRALIRLLNL*
Ga0137399_1013032823300012203Vadose Zone SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDITHQIPNITLTRAVIRLLNF*
Ga0137362_1015383023300012205Vadose Zone SoilMARRSNVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDIKHQIPDVRLTRALIRLLNL*
Ga0137362_1074302713300012205Vadose Zone SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDIMHQIPGIRLPRALIRLLNL*
Ga0137377_1015627933300012211Vadose Zone SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDIMHQIPDVRLTRALIRLLNL*
Ga0137377_1124762023300012211Vadose Zone SoilMARRSSVGRLYIRRLRPAVDPVNAEYLLTFGSPKTSEIALNLGTVRGLDGLTSVPRAGRVSLSAIETAWRVLAVEITHQIPNVTLTRAVIQLLNL*
Ga0137387_1084385223300012349Vadose Zone SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDLTHQIQSMTLTRAVI
Ga0137369_1021423533300012355Vadose Zone SoilMPGRDSGGRLYIVRLRPAADPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGRVPLNEIERAWRVLAADITHQVPNITLTSAVIRLLNL*
Ga0137360_1188634513300012361Vadose Zone SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGGVPVNEIEKAWRVLAVDIMHQIPGI
Ga0137390_1106108013300012363Vadose Zone SoilMARRSSVGRLYIRRPRPAVDPVNAEYLLSFGSPRTSEIALNLGTVRGLDGLITVFRAGRVSLNEIETAWRVLAVEITHQIPNVMLTRAGIQLLNL*
Ga0137397_1004346423300012685Vadose Zone SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFDDPSTAEIALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDITHQIPNITLTRAVIRLLNF*
Ga0157295_1002571623300012906SoilMGRLYITRLRPAVDPVRAEYLLTFGSPITSEVNLNLGSGRGLDTLTDLLRAARVPIGEIQRAWSALVSDTVHEIAGVTLTPAAIRILGL*
Ga0157306_1009111533300012912SoilMGRLYITRLRPAVDPVRAEYLLTFGSPITSEVNLNLGSVRGLDTLTDLLRAARVPIGEIERAWSALVSDTVHEIAGVTL
Ga0137394_1003649843300012922Vadose Zone SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDITHQIPNVTLTRAVIRLLNF*
Ga0137404_1097519223300012929Vadose Zone SoilMAGRNSVGRLYIVRLRPAADPVNAEYLLTFGSPSSAEIALDLGSVRGLDGLTDVLRAGYVPVNEIERAWRVLAVDITHQIPNVTLTRAVIRLLNF*
Ga0137404_1113605323300012929Vadose Zone SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDIM
Ga0137410_1002027013300012944Vadose Zone SoilMARRSSVGRLYIVRLRPAVDPVNAEYFLTFGSPSIAEIALNLGSVHGLDGLTDVLRAGGVPVNEIEKAWRVLAVDIMHQIPGIRLTRALIRLLNL*
Ga0173483_1013188513300015077SoilMGRLYITRLRPAVDPVRAEYLLTFGSPITSEVNLNLGSVRGLDTLTDLLRAARVPIGEIERAWSALVSDTVHEIAGVT
Ga0137418_1100922013300015241Vadose Zone SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKARRVLAVDIMHQIPDIKPTRALIRLLNL*
Ga0137403_1089317123300015264Vadose Zone SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDIMHQIPGIRLTRALIRLLNL
Ga0132258_1229328113300015371Arabidopsis RhizosphereAQDPVTATYLLTFGSPGSRDIALSLGSVRGLDGLTEVLRAGRVPTGEIEKAWRVLAVETSHQVPNVALTRAMMRRLDL*
Ga0187821_1007849433300017936Freshwater SedimentMARRSTVGRLYITRIRPAQDPVDATYLLTFGSAGSRDIALSLGSVRGLDGLTGILRAGRVPTGEIEKAWRVLAVETTHQVPNVALTRAMMRRLDL
Ga0187823_1007446433300017993Freshwater SedimentVGRLYITRIRPAQDPVDATYLLTFGSPGSRDIALSLGSVRGLDGLTGILRAGRVPTGEIEKAWRVLAVETTHQVPNVALTRALMRRLDL
Ga0187822_1002430533300017994Freshwater SedimentMARRSTVGRLYITRIRPAQDPVDATYLLTFGSPGSRDIALSLGSVRGLDGLTGILRAGRVSTGEIEKAWRVLAVETTHQVPNVALTRALMRRLDL
Ga0184610_106833313300017997Groundwater SedimentMARRSSVGRLYIVRFRPAVDPVNAEYLLTFGPPSTAEIALNLGSVRGLDGLTDVLRAGRVPVNEIEKAWRVLAVEFRHQIPNVTLTRALIQLLNL
Ga0184604_1002043513300018000Groundwater SedimentMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEVALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDITHQISNITLTRAVIRLLNF
Ga0184605_1015311613300018027Groundwater SedimentMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEVALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDITHQIPNITLTRAVIRLLNF
Ga0184608_1041222113300018028Groundwater SedimentMARRSAVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALSLGSIRGLDGLTDVLRAGRVPVNEIEKAWRVLAVDTTHQIPNVTLTRAVIRLLNL
Ga0184638_112839113300018052Groundwater SedimentMARRRGVGRLYIVRLRPAADPVNAEYLLSFGSPSTAEVALNLGSVRGLDGLTDVLRAGRVPVNEIEKAWRVLAVEFRHLIPNVTLTPALIRLLNL
Ga0184626_1010101613300018053Groundwater SedimentMARRSSVGRLYIVRFRPAVDPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGRVPVKEIEKAWRVLAVEFRHQIPNVTLTRALIQLLNL
Ga0184621_1006723123300018054Groundwater SedimentMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEVALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDITHQIPSITLTRAVIRLLNF
Ga0184632_1009808323300018075Groundwater SedimentMARRRGVGRLYIVRLRPAADPVNAEYLLSFGSPSTAEVALNLGSVRGLDGLTDVLRAGRVPVNEIEKAWRVLAVEFRHLIPSVTLTPALIRLLNL
Ga0184632_1033498413300018075Groundwater SedimentMARRSSVGRLYIVRFRPAVDPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGRVPVDEIEKAWRVLAVELRHQIPNVTLTHALIRLLNL
Ga0184609_1004273823300018076Groundwater SedimentMARRSSVGRLYIVRLRPATDPVNARYLLTFGSRSTDEIALNLGSVRGLDGVTDVLRAGRVPVDEIEKAWRVLAVELRHQIPNVTLTHALIRLLNL
Ga0184612_1004323653300018078Groundwater SedimentMARRSSVGRLYIVRFRPAVDPVNAEYLLTFGPPSTAEIALNLGSVRGLDGLTDVLRAGRVPVKEIEKAWRVLAVEFRHQIPNVTLTRALIQLLNL
Ga0190265_1012121643300018422SoilMARRRNAGRLYIMRLRPATDPVTAEYLLTFGSRSTAEIALNLGSVRGLDGLTDVLRAGRVPVGEIEKAWRVLAVEITHQIPNVTLTRAVIQLLNL
Ga0190272_1007800543300018429SoilMARRTSIGRLYIVRLRPATDPVNAEYLLTFGSPSAAETALNLGSVRGLDGLTDVLRAGRVPLGEIEKAWRVLAVEFRHQVPNVTLTRALIRLLNL
Ga0190272_1055841413300018429SoilMARRSGVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALSLGSVRGLDGLTDVLRAGRVPVNEIEKAWRVLAVDTTHQIPNVTLTRAVIRLLNL
Ga0190272_1315096513300018429SoilMARRSSVGRLYIVRFRPATDPVSAEYLLTFGSPSTAEVALNLGSVRGLDGLTDVLRAGRVPVREIEKAWRVLAVEFRHQVPNVTLTRALIQLLNL
Ga0187892_10003929223300019458Bio-OozeMLIAMARRPAVGRLYITRLRPAVDPVSAEYLLTFGSPARSEVALLLGSVRGLDSLTAVLRAARVPVDEIERAWRVLAVDTSHEIASATLTRAVIRVLDL
Ga0193739_101414713300020003SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSAAEIALSLGSVRGLDGLTDVLRAGRVPVNEIEKAWRVLAVDTTHQISNVTLTRAVIRVLNL
Ga0193739_115593513300020003SoilMARRRGVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEVALNLGSVRGLDGLTDVLRAGHVPVNEIEKAWRVLAVEFRHLIPNVTLTPALIRLLNL
Ga0193735_113792513300020006SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEVALNLGSVRGLDGLTDVLRAGHVPINEIERAWRVLAVDITHQISNITLTRAVIRLLNF
Ga0179594_1041599613300020170Vadose Zone SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDITHQIPNVTLTRAVIRLLNF
Ga0210403_1023897823300020580SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTYGAPATTEIALNLGSVRGLDELTDVLRAGRVPVSEIEKAWRVLAVEITHQIPNVTLTRAVIRLLNL
Ga0210378_1033053713300021073Groundwater SedimentMARRSSVGRLYIVRLRPATDPVNARYLLTFGSRSTDEIALNLGSVRGLDGVTDVLRAGRVPVDEIEKAWRVLAVELRHQIPNVTLTHALIRL
Ga0210404_1020829033300021088SoilGRLYIVRLRPAVDPVNAEYLLTYGAPATTEIALNLGSVRGLDELTDVLRAGRVPVSEIEKAWRVLAVEITHQIPNVTLTRAVIRLLNL
Ga0210405_1035989213300021171SoilPQMARRSRLGRLRVTRVRPAVDPVSAEYLVTFGSSATGDITLNLGSVYGLDGLTDVLRAGRVPVSEIEKAWRVLAVETMHQIPNVMLTRALLRLLNL
Ga0210386_1154255323300021406SoilMARRSRLGRLRVTRVRPAVDPVSAEYLVTFGSSATGDITLNLGSVYGLDGLTDVLRAGRVPVSEIEKAWRVLAVETMHQIPNVMLTRALLRLLNL
Ga0210384_1005962553300021432SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPTTAEIALHFGSVRGLDELTDVLRAGRVPVSEIEKAWRVLAVEIMHQIPNVTLTSAVIRRLNL
Ga0193737_105759513300021972SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEVALNLGSVRGLDGLTDVLRAGYVPVNEIERAWRVLAVDITHQIPSITLTRAVIRLLNF
Ga0242654_1002681513300022726SoilMARRSRLGRLRVTRVRPAVDPVSAEYLVTFGSSATGDITLNLGSVYGLDGLTDVLRAGRVPLSEIEKAWRVLAVETMHQIPNVMLTRALLRL
Ga0247793_100617233300023066SoilMGRLYITRLRPAVDPVRAEYLLTFGSPITSEVNLNLGSVRGLDTLTDLLRAARVPIGEIERAWSALVSDTVHEIAGVTLTPAAIRILGL
Ga0207685_1081911613300025905Corn, Switchgrass And Miscanthus RhizosphereMARRSRLGRLRVTRVRPAVDPVSAEYLVTFGSSATGDITLNLGSVYGLDGLTDVLRAGRVPVSEIEKAWRVLAVETMHQIPNVMLTRALLR
Ga0207684_1000500723300025910Corn, Switchgrass And Miscanthus RhizosphereMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVSEIEKAWRVLAVDIMHQVPGIRLTRALIRLLNL
Ga0207684_1026842133300025910Corn, Switchgrass And Miscanthus RhizosphereMARRSSVGRLYIRRLRSAVDPVNAEYLLTFGSPTTSEIALNLGTVRGLDGLTSVLRAGRVSLSAIETAWRVLAVEITHQIPNVTLTRAVIQLLNL
Ga0207684_1044456223300025910Corn, Switchgrass And Miscanthus RhizosphereMARRSSVGRLYIRRLRPAVDPVSAEYLLTFGSPKTSEIALNLGTVRGLDGLTTVLRAGRVSLSEIETAWRVLAVEITHQIPNVMLTRAVIQLLNL
Ga0207707_1034244133300025912Corn RhizosphereMARRSGVGRMYITRLRPAVDPVGAEYLLTFGSGRSSEIALNLGSVKGLDTLTDIFRAGHVPVAEIERAWRVLATE
Ga0207693_1026643523300025915Corn, Switchgrass And Miscanthus RhizosphereMARRSRLGRLRVTRVRPAVDPVSAEYLVTFGSSATGDITLNLGSVYGLDGLTDVLRAGRVPLSEIEKAWRVLAVETTHQIPNVMLTRALLRLLNL
Ga0208285_100373133300026005Rice Paddy SoilMASGTRLASPMARRSTAGRLYITRLRPAQDPVGATYLLTFGSAGSREIALSLGSVRGLDGLTEVLRAGRVPTGEIEKAWRVLAVETTHQVPNVALTRAMMRRLDL
Ga0209438_113917713300026285Grasslands SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGGVPVNEIEKAWRVLAVDIMHQIPGIRLTRALIRLLNL
Ga0257173_104913813300026360SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGGVPVNEIEKAWRVLAVDIMHQIPGIRLTRAL
Ga0257179_101759613300026371SoilMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDIKHQIPDVRLTRALIRLLNL
Ga0257172_101612033300026482SoilMARRSSVGRLYIRRLRPAVDPVNAEYLLTFGSPTTSEIALNLGTVRGLDGLTSVLRAGRVSLSAIETAWRVLAVEITHQIPNVTLTRAVIQLLNL
Ga0257168_100302843300026514SoilSMARRSSVGRLYIVRLRPAVDPVNAEYLLTFGSPSTAEIALNLGSVHGLDGLTDVLRAGRVPVNEIEKAWRVLAVDIMHQIPGIRLTRALIRLLNL
Ga0257168_107722523300026514SoilMARRSSVGRLYIRRLRPAVDPVNAEYLLSFGSPRTSEIALNLGTVRGLDGLITVFRAGRVSLNEIETAWRVLAVEITHQIPNVMLTRAGIQLLNL
Ga0256867_1001875233300026535SoilMARRSGVGRLYIVRLRPAADPVSAEYLLTFGSPSAAEIALNLGSVRGLDVLTDVLRAGRVPVSEIEKAWRVLAVDFRHQIPNVTLTRALIRLLNL
Ga0207441_10445623300026793SoilPGGPHGYATSMARRSAMGRLYITRLRPAVDPVRAEYLLTFGSPITSEVNLNLGSVRGLDTLTDLLRAARVPIGEIERAWSALVSDTVHEIAGVTLTPAAIRILGL
Ga0209982_101767813300027552Arabidopsis Thaliana RhizosphereSMARRSAMGRLYITRLRPAVDPVRAEYLLTFGSPITSEVNLNLGSVRGLDTLTDLLRAARVPIGEIERAWSALVSDTVHEIAGVTLTPAAIRILGL
Ga0210002_101895233300027617Arabidopsis Thaliana RhizosphereMGRLYITRLRPAVDPVRAEYLLTFGSPITSEVNLNLGSVRGLDTLTDLLRAARVPIGEIERAWSALVSDTVHEIAGVTLTPAAIRILGLSTTAMTK
Ga0209217_107525333300027651Forest SoilMARRSRLGRLRVTRIRPAVDPVNAEYLVTFGSSATGDITLNLGSVYGLDGLTDVLRAGRVPLSEIEKAWRVLAVETTHQIPNVMLTRALLRLLNL
Ga0209180_1001857163300027846Vadose Zone SoilMARRSSVGRLYIVRFRPAVDPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGRVPVKEIEKAWRVLAVEFRHQIPNVMLTRALIQLLNL
Ga0307302_1047969513300028814SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEVALNLGSVRGLDGLTDVLRAGYVPVNEIERAWRVLAVDITHQIPSIT
Ga0307296_1053519613300028819SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSSAEIALDLGSVRGLDGLTDVLRAGYVPVNEIERAWRVLAVDITH
Ga0307312_1030077523300028828SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSSAEIALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDLTHQIQSMTLTRAVIRLLNF
Ga0307308_1046995723300028884SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEVALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDLTHQIQSMTLTRAVIRLLNF
Ga0307501_1008814923300031152SoilMAGRSSVGRLYIVRLRPAADPVNAEYLLTFGSPSTAEIALNLGSVRGLDGLTDVLRAGHVPVNEIERAWRVLAVDITHQIPNITLTRAVIRLLNF
(restricted) Ga0255310_1005817213300031197Sandy SoilITRIRPAQDPVDATYLLTFGSPGSRDIALSLGSVRGLDGLTGVLRAGRVPTGEIEKAWRVLAVETTHQVPNVALTRALMRRLDL
Ga0299913_1018117533300031229SoilMARRSGVGRLYIVRLHPAADPVSAEYLLTFGSPSAAEIALNLGSVRGLDVLTDVLRAGRVPVSEIEKAWRVLAVDFRHQIPNVTLTRALIRLLNL
(restricted) Ga0255312_105435413300031248Sandy SoilPAQDPVDATYLLTFGSPGSRDIALSLGSVRGLDGLTGVLRAGRVPTGEIEKAWRVLAVETTHQVPNVALTRALMRRLDL
Ga0310813_1011555343300031716SoilMARRSTTGRLYITRLRPAQDPVNATYLLTFGSPGSRDIALSLGSVRGLDGLTEVLRAGRVATGEIEKAWRVLAVETTHQVPNVALTRAMMRRLDL
Ga0307469_1038217423300031720Hardwood Forest SoilMGRLYITRLRPAVDPVRAEYLLIFGSPTTAEVNLNLGSVRGLDTLTEVLRAARVPLGEIERAWSALVNDTVHEITGVTLTPAVIRLLGL
Ga0307469_1138175923300031720Hardwood Forest SoilVDPVNAEYLLTYGAPATTEIALNLGSVRGLDELTDVLRAGRVPVSEIEKAWRVLAVEITHQIPNVALTRAVIRLLNL
Ga0310884_1054086823300031944SoilMASGTRLASPMARRSTAGRLYITRLRPAQDPVGATYLLTFGSAGSREIALSLGSVRGLDGLTEVLRAGRVPTGEIEKAWRVLAVETTHQVPSVALTRAMMRRLDL
Ga0307471_10005424833300032180Hardwood Forest SoilMALCCFAMARRSSVGRLYIVRLRPVVDPVNAEYLLTYGAPATTEIALNLGSVRGLDELTDVLRAGRVPVSEIEKAWRVLAVEITHQIPNVTLTRAVIRLLNL
Ga0307471_10088823813300032180Hardwood Forest SoilRSRLGRLRVTRIRPAVDPVNAEYLVTFGSSGTGDITLNLGSIHGLDGLTDVLRAGRVPISEIEKAWRVLAVETTHQIPNVMLTRALLRLLNL
Ga0307471_10240745413300032180Hardwood Forest SoilSRGPPPGLSRRAPRVYATPMARRLAMGRLYITRLRPAVDPVRAEYLLIFGSPTTAEVNLNLGSVRGLDTLTEVLRAARVPLGEIERAWSALVNDTVHEITGVTLTPAVIRLLGL
Ga0310812_1058037623300032421SoilMARRSTTGRLYITRLRPAQDPVNATYLLTFGSPGSRDIALSLGSVRGLDGLTEVLRAGRVATGEIEKAWRVLAVETTHQVPN
Ga0316628_10150644313300033513SoilMARRSTVGRLYITRLRPAQDPVNATYFLTFGSPGTREVALSLGSVRGLDGLTDVLRAGRVPTGEIEKAWRVLAVETTHQVPNVTLTRAMMRRLDL
Ga0370495_0127299_440_7273300034257Untreated Peat SoilMARSRAVGRLYIVRFRPAADPVNAEYLLTFGSPSTAEATLNLGSVLGLDGLTDVLRAGRVPVNEIEKAWRVLAVEYRHLIPNVTLTPALLRLLNT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.