NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F078936


Overview

Basic Information
Family ID F078936
Family Type Metagenome / Metatranscriptome
Number of Sequences 116
Average Sequence Length 188 residues
Representative Sequence MGIDHEETVDESDAILGMPRVLADLADGKLSEMETDAVVDWLSALADDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Number of Associated Samples 70
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 72.41 %
% of genes near scaffold ends (potentially truncated) 46.55 %
% of genes from short scaffolds (< 2000 bps) 85.34 %
Associated GOLD sequencing projects 64
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.60


Most Common Taxonomy
Group Unclassified (77.586 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere
(18.103 % of family members)
Environment Ontology (ENVO) Unclassified
(43.103 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(53.448 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 13.06%    β-sheet: 30.63%    Coil/Unstructured: 56.31%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.60

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
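
The low-confidence rule quoted above (pTM < 0.7) can be written as a small check. This is only a sketch: the names `PTM_CUTOFF` and `is_low_confidence` are hypothetical, and the 0.7 cut-off is taken from the sentence above, not from any documented NMPFamsDB interface.

```python
# Cut-off quoted on this page: models with pTM < 0.7 are flagged as low confidence.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a predicted model falls below the pTM cut-off."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError(f"pTM-score must lie in [0, 1], got {ptm}")
    return ptm < cutoff


# Family F078936 reports a pTM-score of 0.60.
print(is_low_confidence(0.60))  # True
```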



Gene Neighborhood

Neighboring Pfam domains

Pfam ID    Name            % Frequency in 116 Family Scaffolds
PF04542    Sigma70_r2      8.62
PF04545    Sigma70_r4      6.03
PF00106    adh_short       3.45
PF12770    CHAT            2.59
PF13424    TPR_12          1.72
PF02776    TPP_enzyme_N    0.86
PF02518    HATPase_c       0.86
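
Because the frequencies in the Pfam table are percentages of the 116 family scaffolds, the underlying scaffold counts can be recovered by rounding. The sketch below assumes each percentage was computed as count / 116 and rounded to two decimals; the helper name `percent_to_count` is hypothetical.

```python
FAMILY_SCAFFOLDS = 116  # "Number of Associated Scaffolds" reported for F078936

# Pfam name -> % frequency, copied from the Neighboring Pfam domains table.
pfam_percent = {
    "Sigma70_r2": 8.62,
    "Sigma70_r4": 6.03,
    "adh_short": 3.45,
    "CHAT": 2.59,
    "TPR_12": 1.72,
    "TPP_enzyme_N": 0.86,
    "HATPase_c": 0.86,
}


def percent_to_count(percent: float, total: int) -> int:
    """Recover the nearest whole number of scaffolds behind a rounded percentage."""
    return round(percent * total / 100)


pfam_counts = {
    name: percent_to_count(percent, FAMILY_SCAFFOLDS)
    for name, percent in pfam_percent.items()
}
print(pfam_counts)
# {'Sigma70_r2': 10, 'Sigma70_r4': 7, 'adh_short': 4, 'CHAT': 3,
#  'TPR_12': 2, 'TPP_enzyme_N': 1, 'HATPase_c': 1}
```

The same conversion applies to the other percentages on this page that are expressed relative to the 116 family members.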

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name    Functional Category    % Frequency in 116 Family Scaffolds
COG0568    DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)    Transcription [K]    8.62
COG1191    DNA-directed RNA polymerase specialized sigma subunit    Transcription [K]    8.62
COG1595    DNA-directed RNA polymerase specialized sigma subunit, sigma24 family    Transcription [K]    8.62
COG4941    Predicted RNA polymerase sigma factor, contains C-terminal TPR domain    Transcription [K]    8.62



Phylogeny

NCBI Taxonomy

Name             Rank    Taxonomy         Distribution
Unclassified     root    N/A              77.59 %
All Organisms    root    All Organisms    22.41 %

Associated Scaffolds


Scaffold    Taxonomy    Length (bp)
3300003321|soilH1_10366946    All Organisms → cellular organisms → Bacteria    1409
3300004157|Ga0062590_100491574    Not Available    1042
3300004157|Ga0062590_100832121    Not Available    854
3300004463|Ga0063356_103607084    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Caldilineae → Caldilineales → Caldilineaceae → Caldilinea → Caldilinea aerophila    667
3300004799|Ga0058863_11948139    Not Available    1433
3300004800|Ga0058861_11933830    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Caldilineae → Caldilineales → Caldilineaceae → Caldilinea → Caldilinea aerophila    1104
3300004800|Ga0058861_11997895    All Organisms → cellular organisms → Bacteria    1293
3300005529|Ga0070741_10341180    Not Available    1393
3300005535|Ga0070684_102272849    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Caldilineae → Caldilineales → Caldilineaceae → Caldilinea → Caldilinea aerophila    511
3300005764|Ga0066903_101486378    All Organisms → cellular organisms → Bacteria    1278
3300005843|Ga0068860_100700671    Not Available    1022
3300006038|Ga0075365_11190694    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Caldilineae → Caldilineales → Caldilineaceae → Caldilinea → Caldilinea aerophila    536
3300006046|Ga0066652_100546148    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Pseudonocardiales → Pseudonocardiaceae → Saccharothrix → Saccharothrix espanaensis    1084
3300006177|Ga0075362_10172990    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Caldilineae → Caldilineales → Caldilineaceae → Caldilinea → Caldilinea aerophila    1044
3300006844|Ga0075428_100015681    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi    8396
3300006845|Ga0075421_100154210    All Organisms → cellular organisms → Bacteria    2869
3300006880|Ga0075429_100819296    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Caldilineae → Caldilineales → Caldilineaceae → Caldilinea → Caldilinea aerophila    815
3300006954|Ga0079219_11018134    Not Available    690
3300006969|Ga0075419_10031798    All Organisms → cellular organisms → Bacteria    3274
3300007004|Ga0079218_12169709    Not Available    644
3300009032|Ga0105048_10034993    All Organisms → cellular organisms → Bacteria → Terrabacteria group    7389
3300009147|Ga0114129_10015354    All Organisms → cellular organisms → Bacteria    10896
3300009685|Ga0116142_10293169    Not Available    804
3300009687|Ga0116144_10495475    Not Available    602
3300010038|Ga0126315_10658557    Not Available    680
3300010040|Ga0126308_10157499    Not Available    1438
3300010044|Ga0126310_10055825    Not Available    2216
3300010044|Ga0126310_10309799    Not Available    1091
3300010044|Ga0126310_10646461    Not Available    795
3300010044|Ga0126310_11059751    Not Available    642
3300010045|Ga0126311_10122552    Not Available    1816
3300010141|Ga0127499_1015302    Not Available    584
3300010356|Ga0116237_10101420    All Organisms → cellular organisms → Bacteria → Terrabacteria group    2945
3300011332|Ga0126317_10647754    Not Available    615
3300012204|Ga0137374_10252623    Not Available    1477
3300012212|Ga0150985_100212554    Not Available    1912
3300012212|Ga0150985_100265208    Not Available    1674
3300012212|Ga0150985_100376842    Not Available    1339
3300012212|Ga0150985_103716611    Not Available    1012
3300012212|Ga0150985_104284552    Not Available    1675
3300012212|Ga0150985_104610729    Not Available    1195
3300012212|Ga0150985_104891134    All Organisms → cellular organisms → Bacteria    8173
3300012212|Ga0150985_106617266    All Organisms → Viruses → Predicted Viral    1607
3300012212|Ga0150985_109038931    Not Available    1731
3300012212|Ga0150985_110203008    Not Available    1630
3300012212|Ga0150985_111015923    Not Available    1058
3300012212|Ga0150985_111280867    Not Available    502
3300012212|Ga0150985_113748379    Not Available    1669
3300012212|Ga0150985_115392068    Not Available    951
3300012212|Ga0150985_115687219    Not Available    868
3300012212|Ga0150985_115760928    Not Available    1261
3300012212|Ga0150985_115845521    All Organisms → cellular organisms → Bacteria    3702
3300012212|Ga0150985_118478383    Not Available    1083
3300012212|Ga0150985_121159748    Not Available    1406
3300012212|Ga0150985_121299682    Not Available    723
3300012212|Ga0150985_121315470    Not Available    521
3300012350|Ga0137372_10132653    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria    2052
3300012358|Ga0137368_10052138    All Organisms → cellular organisms → Bacteria → Terrabacteria group    3460
3300012360|Ga0137375_10416196    Not Available    1169
3300012360|Ga0137375_10876072    Not Available    715
3300012384|Ga0134036_1087547    Not Available    856
3300012388|Ga0134031_1302773    Not Available    582
3300012399|Ga0134061_1194470    Not Available    721
3300012410|Ga0134060_1090436    Not Available    661
3300012469|Ga0150984_101592674    Not Available    2423
3300012469|Ga0150984_106176240    Not Available    531
3300012469|Ga0150984_106787230    Not Available    710
3300012469|Ga0150984_108597200    Not Available    1783
3300012469|Ga0150984_109100038    Not Available    1761
3300012469|Ga0150984_113065754    Not Available    558
3300012469|Ga0150984_114228393    Not Available    1134
3300012469|Ga0150984_114765277    Not Available    873
3300012469|Ga0150984_115402332    Not Available    1706
3300012469|Ga0150984_116897458    Not Available    1641
3300012469|Ga0150984_117346019    Not Available    1371
3300012469|Ga0150984_118825884    Not Available    1402
3300012469|Ga0150984_123655183    Not Available    3286
3300012684|Ga0136614_10030521    All Organisms → cellular organisms → Bacteria → Terrabacteria group    4015
3300015371|Ga0132258_10125709    All Organisms → cellular organisms → Bacteria → Terrabacteria group    6105
3300017789|Ga0136617_10152210    All Organisms → cellular organisms → Bacteria → Terrabacteria group    1974
3300017789|Ga0136617_10297601    Not Available    1329
3300017789|Ga0136617_10937643    Not Available    659
3300018465|Ga0190269_11110175    Not Available    611
3300018465|Ga0190269_11606078    Not Available    543
3300018466|Ga0190268_10264447    Not Available    1001
3300020063|Ga0180118_1349318    Not Available    580
3300020067|Ga0180109_1334114    Not Available    1035
3300020070|Ga0206356_11798165    Not Available    1211
3300020082|Ga0206353_10215392    Not Available    553
3300022530|Ga0242658_1161601    Not Available    585
3300024430|Ga0196962_10239411    Not Available    589
3300025861|Ga0209605_1275572    Not Available    606
3300027878|Ga0209181_10221823    Not Available    1677
3300027880|Ga0209481_10528513    Not Available    610
3300027909|Ga0209382_10060819    All Organisms → cellular organisms → Bacteria    4479
3300027909|Ga0209382_10096233    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexia → Herpetosiphonales → Herpetosiphonaceae → unclassified Herpetosiphonaceae → Herpetosiphonaceae bacterium    3473
3300028381|Ga0268264_10678422    Not Available    1022
3300030917|Ga0075382_10926176    Not Available    644
3300030959|Ga0102747_11006838    Not Available    586
3300030990|Ga0308178_1090908    Not Available    635
3300031058|Ga0308189_10078050    Not Available    997
3300031058|Ga0308189_10203753    Not Available    718
3300031058|Ga0308189_10249272    Not Available    670
3300031082|Ga0308192_1075055    Not Available    544
3300031091|Ga0308201_10009785    Not Available    1729
3300031091|Ga0308201_10042996    Not Available    1095
3300031092|Ga0308204_10010434    Not Available    1680
3300031093|Ga0308197_10022940    Not Available    1385
3300031094|Ga0308199_1202298    Not Available    501
3300031096|Ga0308193_1006368    Not Available    1258
3300031100|Ga0308180_1025661    Not Available    593
3300031114|Ga0308187_10048749    Not Available    1157
3300031114|Ga0308187_10335525    Not Available    578
3300031123|Ga0308195_1012758    Not Available    945
3300031421|Ga0308194_10101925    Not Available    826
3300031469|Ga0170819_12161762    Not Available    758
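
Each entry in the Scaffold column appears to join a sample Taxon OID and a scaffold name with a `|`; the numeric prefixes match the Taxon OIDs listed under Associated Samples. A minimal sketch of splitting such entries and counting scaffolds per sample follows. The helper name `split_scaffold_id` is hypothetical, and only three entries from the table are used.

```python
from collections import Counter


def split_scaffold_id(entry: str) -> tuple[str, str]:
    """Split 'TaxonOID|ScaffoldName' into its two parts."""
    taxon_oid, sep, scaffold_name = entry.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold entry: {entry!r}")
    return taxon_oid, scaffold_name


# Three entries copied from the Associated Scaffolds table.
entries = [
    "3300003321|soilH1_10366946",
    "3300004157|Ga0062590_100491574",
    "3300004157|Ga0062590_100832121",
]

# Count how many family scaffolds come from each sample.
per_sample = Counter(split_scaffold_id(entry)[0] for entry in entries)
print(per_sample)  # Counter({'3300004157': 2, '3300003321': 1})
```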




Environmental Properties

Associated Habitat Types

Habitat    Taxonomy    Distribution
Avena Fatua Rhizosphere    Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere    18.10%
Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil    16.38%
Avena Fatua Rhizosphere    Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere    11.21%
Populus Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere    6.90%
Serpentine Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil    6.03%
Vadose Zone Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil    4.31%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil    4.31%
Polar Desert Sand    Environmental → Aquatic → Freshwater → Ice → Unclassified → Polar Desert Sand    3.45%
Anaerobic Digestor Sludge    Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge    3.45%
Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil    2.59%
Host-Associated    Host-Associated → Human → Digestive System → Large Intestine → Fecal → Host-Associated    2.59%
Groundwater Sediment    Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment    1.72%
Freshwater    Environmental → Aquatic → Freshwater → Ice → Glacial Lake → Freshwater    1.72%
Corn, Switchgrass And Miscanthus Rhizosphere    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere    1.72%
Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil    1.72%
Agricultural Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil    1.72%
Switchgrass Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere    1.72%
Populus Endosphere    Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere    1.72%
Soil    Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil    0.86%
Surface Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil    0.86%
Sugarcane Root And Bulk Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil    0.86%
Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil    0.86%
Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil    0.86%
Tropical Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil    0.86%
Corn Rhizosphere    Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere    0.86%
Soil    Environmental → Terrestrial → Soil → Sand → Desert → Soil    0.86%
Arabidopsis Rhizosphere    Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere    0.86%
Arabidopsis Thaliana Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere    0.86%
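
The habitat paths above share a common first level (Host-Associated, Environmental, Engineered), so the distribution can be rolled up by splitting each path on the arrow separator. The sketch below uses only four rows from the table for brevity; the function name `top_level_totals` is hypothetical, and the totals it prints describe this subset, not the whole family.

```python
from collections import defaultdict

# (habitat path, % of family members) for four rows of the habitat table.
rows = [
    ("Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere", 18.10),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 16.38),
    ("Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere", 11.21),
    ("Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge", 3.45),
]


def top_level_totals(rows):
    """Sum the percentages of all rows that share the same first path level."""
    totals = defaultdict(float)
    for path, percent in rows:
        top_level = path.split(" → ")[0]
        totals[top_level] += percent
    # Round to two decimals to hide floating-point noise from the summation.
    return {name: round(total, 2) for name, total in totals.items()}


print(top_level_totals(rows))
# {'Host-Associated': 29.31, 'Environmental': 16.38, 'Engineered': 3.45}
```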




Associated Samples

Taxon OID    Sample Name    Habitat Type
3300003321    Sugarcane bulk soil Sample H1    Environmental
3300004157    Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2    Environmental
3300004463    Combined assembly of Arabidopsis thaliana microbial communities    Host-Associated
3300004799    Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-3 (Metagenome Metatranscriptome)    Host-Associated
3300004800    Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-1 (Metagenome Metatranscriptome)    Host-Associated
3300005529    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1    Environmental
3300005535    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.2-3L metaG    Environmental
3300005764    Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)    Environmental
3300005843    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2    Host-Associated
3300006038    Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-5    Host-Associated
3300006046    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101    Environmental
3300006177    Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-2    Host-Associated
3300006844    Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2    Host-Associated
3300006845    Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5    Host-Associated
3300006880    Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3    Host-Associated
3300006954    Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control    Environmental
3300006969    Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3    Host-Associated
3300007004    Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost    Environmental
3300009032    Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-05    Environmental
3300009147    Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)    Host-Associated
3300009685    Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC033_MetaG    Engineered
3300009687    Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC035_MetaG    Engineered
3300010038    Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot106    Environmental
3300010040    Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55    Environmental
3300010044    Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot60    Environmental
3300010045    Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot61    Environmental
3300010141    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_8_2 metaT (Metagenome Metatranscriptome)    Environmental
3300010356    AD_USDEca    Engineered
3300011332    Soil microbial communities from California, USA to study soil gas exchange rates - SR-CA-SC2 metaT (Metagenome Metatranscriptome)    Environmental
3300012204    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG    Environmental
3300012212    Combined assembly of Hopland grassland soil    Host-Associated
3300012350    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG    Environmental
3300012358    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG    Environmental
3300012360    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG    Environmental
3300012384    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_1 metaT (Metagenome Metatranscriptome)    Environmental
3300012388    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_24_2 metaT (Metagenome Metatranscriptome)    Environmental
3300012399    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_2 metaT (Metagenome Metatranscriptome)    Environmental
3300012410    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_2 metaT (Metagenome Metatranscriptome)    Environmental
3300012469    Combined assembly of Soil carbon rhizosphere    Host-Associated
3300012684    Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ279 (21.06)    Environmental
3300015371    Combined assembly of cpr5 and col0 rhizosphere and soil    Host-Associated
3300017789    Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ322 (21.06)    Environmental
3300018465    Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 IS    Environmental
3300018466    Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 T    Environmental
3300020063    Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT730_16_1Ra (Metagenome Metatranscriptome)    Environmental
3300020067    Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLIBT47_16_1Ra (Metagenome Metatranscriptome)    Environmental
3300020070    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-1 (Metagenome Metatranscriptome) (v2)    Environmental
3300020082    Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-4 (Metagenome Metatranscriptome) (v2)    Environmental
3300022530    Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-O (Metagenome Metatranscriptome) (v2)    Environmental
3300024430    Soil microbial communities from Anza Borrego desert, Southern California, United States - S3+v_20    Environmental
3300025861    Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC035_MetaG (SPAdes)    Engineered
3300027878    Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-05 (SPAdes)    Environmental
3300027880    Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes)    Host-Associated
3300027909    Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)    Host-Associated
3300028381    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)    Host-Associated
3300030917    Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FB5 Emin (Eukaryote Community Metatranscriptome)    Environmental
3300030959    Forest soil microbial communities from USA, for metatranscriptomics studies - Jemez Pines Pi 2A (Eukaryote Community Metatranscriptome)    Environmental
3300030990    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_149 (Metagenome Metatranscriptome)    Environmental
3300031058    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_184 (Metagenome Metatranscriptome)    Environmental
3300031082    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_193 (Metagenome Metatranscriptome)    Environmental
3300031091    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_355 (Metagenome Metatranscriptome)    Environmental
3300031092    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_367 (Metagenome Metatranscriptome)    Environmental
3300031093    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_198 (Metagenome Metatranscriptome)    Environmental
3300031094    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_203 (Metagenome Metatranscriptome)    Environmental
3300031096    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_194 (Metagenome Metatranscriptome)    Environmental
3300031100    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_151 (Metagenome Metatranscriptome)    Environmental
3300031114    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome)    Environmental
3300031123    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_196 (Metagenome Metatranscriptome)    Environmental
3300031421    Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)    Environmental
3300031469    Fir Spring Coassembly Site 11 - Champenoux / Amance forest    Environmental


Family Sequences

Protein ID    Sample Taxon ID    Habitat    Sequence
soilH1_103669462    3300003321    Sugarcane Root And Bulk Soil    MNDTYDESVDKMSPVDLDELDDVGMPRILADLADGALDDDEANAVADWLLATADEAPPSWVVNRAVRIAGQAVGKDAPRPSIWRHLVAALVYDNRLQPRVAGARAVAIDHPRLMYQAGGIEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLISGIHRMEIGLAYELIEIPSVQI*
Ga0062590_1004915742    3300004157    Soil    MDKEDDMSVDQASVESFEELGMPRILADLADGDLGDEEANAVADWLLATAEEEPPSWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRMAGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLISGIHRMEIGLAYELIEIPS
Ga0062590_1008321212    3300004157    Soil    MDYRTPHGDAAAEHVDVALGMPRILADLADGALSDQEATAVAAWLLSAAEEETPPWMVNRAARIVGQAFGQGVPRPSIWRRLVAALVYDTRLQPRIAGARAVANEHPRLMYQAGGVEIDLEVGPSTIAGRLRMLGQVTASEPDLTRAWVIADGPSGRLETEVDALGQFSLDGLVSGVHRLEVGLAHALIEISSIQL*
Ga0063356_1036070841    3300004463    Arabidopsis Thaliana Rhizosphere    MDKEDDMSVDQASVESFEELGMPRILADLADGDLGDEEANAVADWLLATAEEEPPSWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRMAGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLISGIHRME
Ga0058863_119481392    3300004799    Host-Associated    MGIDHEERVDESDAILGMPRILADLADGKLSLDETDAVVDWLGELGDEDEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSVALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0058861_119338301    3300004800    Host-Associated    MGIDHEETVDESEVLLGMPRVLADLADGKLNEVETDAVVDWLSALADEEEPPHWLVNRAVRIAGHSLGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSLALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0058861_119978951    3300004800    Host-Associated    MGIDHEERVDESDAILGMPRILADLADGKLSLDETDAVVDWLGELGDEDEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSVALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVS
Ga0070741_103411802    3300005529    Surface Soil    MGIDHEETVDQSDVILGMPRVLADLADGRLGEAETDAVVDWLSALGEEEEPPHWLVNRAVRIAGQVLGKDAPRPAIWRRLVAALVYDNRLQPRVVGARSVALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGVHRMEIGLAYELIEIPSVQI*
Ga0070684_1022728491    3300005535    Corn Rhizosphere    TMNNDFDKAVDKVSTEDHDELGMPRILADLADGALDDDEANAVANWLLATAEEEPPSWIINRAVRIAGQSVGQDAPRPSIWRKLVAALVYDNRLQPRVAGARAVAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTAAEPDLTRAWVIAEGPSGRLETEVDDLGQF
Ga0066903_1014863782    3300005764    Tropical Forest Soil    MNNDYDKVANKLSADDLDELGMPRILADLADGALDDDEATAIADWLMATADEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRKLVAALVYDNRLQPRVAGARAIAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVRI*
Ga0068860_1007006711    3300005843    Switchgrass Rhizosphere    NSVADWLMSTVEDEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRISGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFALDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0075365_111906941    3300006038    Populus Endosphere    PTDDLDELGMPRILADLADGALDDDEANAVADWLMATADEEPPSWVVNRAVRIAGQSVGQDAPRPSIWRKLVAALVYDNRLQPRVAGARAVAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGVHRMEIGLAYELI
Ga0066652_1005461481    3300006046    Soil    MGIDHDETVEESGYVLGMPRILADLADGKLSASETGMVVDWLSASATEEPPHWLVNRAVRVAGQAIGKDAPRPAMWRRLVAALVYDNRVQPKIAGARSAALEAPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQF
Ga0075362_101729901    3300006177    Populus Endosphere    MGTNNSTDDEMNMHMKQLADLDDELGMPRILADLADGNLSDEEANSVADWLMSTVEDEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRISGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFALDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0075428_1000156815    3300006844    Populus Rhizosphere    MEKDKDMIVEQAPAESFEDLGMPRILADLANGDLDEDEANAVADWLMATADEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRKLVAALVYDTRLQPRIAGARAIAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVSI*
Ga0075421_1001542103    3300006845    Populus Rhizosphere    MDKDKDTDNDATVHRASADGFEDLGMPRILADLADGALGDEEANAVADWLLTTAEEEPPSWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRVAGARAIAIEHPRLMYQAGGVEIDLEVGQSKIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVSI*
Ga0075429_1008192961    3300006880    Populus Rhizosphere    DGCAMPRVWHREEGEGWSVVASGRPAFPGRTGRSRPYDRMKSLGEGESTMDKDKDTDNDATVHRASADGFEDLGMPRILADLADGALGDEEANAVADWLLTTAEEEPPSWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRVAGARAIAIEHPRLMYQAGGVEIDLEVGQSKIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVSI*
Ga0079219_110181341    3300006954    Agricultural Soil    MGIDHEETVDESDVILGMPRVLADLADGKLGEVETDAVVDWLSALADDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRRMGARSPVLDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVKI*
Ga0075419_100317983    3300006969    Populus Rhizosphere    MEKDRDMIVEQAPAESFEDLGMPRILADLANGDLDEDEANAVADWLMATADEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRKLVAALVYDTRLQPRIAGARAIAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRVETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVSI*
Ga0079218_121697091    3300007004    Agricultural Soil    AVADWLLATAQEEPPSWVINRAVRIAGQARTHEAPRPSTWRRLVTALVYDTRLQPRPAGARAVAVEQRRLRYQAGGTEIDLEVGGSQIAGRLRMLGQVTAQWPDLARAWAIADGPSGRLEAELDALGQFAFDGLVSGVHRLEIGLASELIEIPAVPI*
Ga0105048_100349934    3300009032    Freshwater    MTDDKLSVDELDALADLGMPRILADLADGALSDDEANAVADWLMSTAEEEPAGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRMTGARAIAIDQPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0114129_100153544    3300009147    Populus Rhizosphere    MEKDKDMIVEQAPAESFEDLGMPRILADLANGDLDEDEANAVADWLMATADEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRKLVAALVYDTRLQPRIAGARAIAINHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVSI*
Ga0116142_102931692    3300009685    Anaerobic Digestor Sludge    DEANAVADWLMATSEEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRVSGARAIAIDQPRLMYQASGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0116144_104954751    3300009687    Anaerobic Digestor Sludge    MSKDKERTDDELSVDEFDALEDLGMPRILADLADGSLSDDEANAVADWLMATSEEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRVSGARAIAIDQPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIG
Ga0126315_106585571    3300010038    Serpentine Soil    KAVDNASADGDDALGMPRILADLADGALDDVEANALADWLMTTADEEPPSWIVNRAVRIAGQSVGQDAPRPSIWRKLVAALVYDNRLQPRVAGARAVAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTAAEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0126308_101574992    3300010040    Serpentine Soil    MRVIDEAVEPIGMPRILSDLADGRLDLAETEAVVDWLRASGEDEPPAWVVNRAVRIPRQATGGRKSRPAVWRRLVAALVYDNRLQPRAAGARAITFDQPRLMYQAGGVEIDLEVSESSISGRLRMLGQVTAEEPDLARAWVVAEGPGGRTEGEVDELGQFVLDGLVGGRHKMEIGLTYELIEIPELEL*
Ga0126310_100558252    3300010044    Serpentine Soil    MATNDDKPVGRAPAGGDLDELGMPRILADLAAGALDEDEANAVAAWLVATADEVPSIWDVNRAVRIAGQVVGQDEPRPSTWRRLVATLAYDNRYQPAPVGARGVMMDQPRLMYQAGGVEIDLEVGRSTIAGRLRMLGQVTATEPDLTRAWVLAEGPSGRFETEIDDLGQFSLDGLMSGVHRMEIGLAYELIEIPSVEI*
Ga0126310_103097992    3300010044    Serpentine Soil    MRVIDEAVEPIGMPRILSDLADGRLDLAETEAVVDWLRASGEDEPPAWVVNRAVRIPRQATGGRKSRPAVWRRLVAALVYDNRLQPRAAGARAITFDQPRLMYQAGGVEIDLEVSESSISGRLRMLGQVTAEEPDLARAWVVAEGPGGRTEGEVDELGQFVLDGLLGGRHKMEIGLTYELIEIPELEL*
Ga0126310_106464612    3300010044    Serpentine Soil    AEGALDEDEANAVADWLLATADDEPPGWVVNRAVRVAGQAVGQDDPRPSTWRRLVAALVYDNRFQPAPAGARGVIMDQPRLMYQAGGVEIDLEVGRSTIAGRLRMLGQVTAAEPDLTRAWVLAEGPSGRLETEVDDLGQFSLDGLVSGVHRMEIGLAYELIEIPSVQV*
Ga0126310_110597511    3300010044    Serpentine Soil    LADLAEDALDDDEANAVVDWLLATADEAPPSWVVNRAVRIAGQAVGQDAPRPSTWRRLVAALVYDNRAQPRVAGARGVMMDQPRLMYQAGGVEIDLEVGRSTIAGRLRMLGQVTATEPDLVRAWVLAEGPSGRLETEVDHLGQFSLDGLISGIHRMEIGLAYELIEIPSVQV*
Ga0126311_101225522    3300010045    Serpentine Soil    MATNDDKPVGRAPAGGDLDELGMPRILADLAAGALDEDEANAVAAWLVATADEVPSIWDVNRAVRIAGQVVGQDEPRPSTWRRLVATLAYDNRYQPAPVGARGVMMDQPRLMYQAGGVEIDLEVGRSTIAGRLRMLGQVTAAEPDLARAWVLAEGPSGCFETEVDDLGQFSLDGLMSGVHRMEIGLAYELIEI
Ga0127499_101530213300010141Grasslands SoilVDESDVILGMPRVLADLADGKLSEAETDAVVDWLSALADDEEPPHWLVNRAVRVAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0116237_1010142023300010356Anaerobic Digestor SludgeMSKDKERTDDELSVDEFDALEDLGMPRILADLADGSLSDDEANAVADWLMATSEEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRVSGARAIAIDQPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0126317_1064775413300011332SoilMDKGNDMAINLASAEDLNALGMPRILADLADGELDEDEANAVADWLMATADEEPPSWVINRAVRIAGQAVGKDAPRPSIWRKLVAALVYDNRLQPRIAGARAIAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGLHRMEIGLAYELIEIP
Ga0137374_1025262323300012204Vadose Zone SoilMSDDETPQDEKTTDPVEMPRILIDLAQGRLDGTETEAVVSWLGKSAEPEPAPWLVNRAVRIPRQAAGDRPARPAAWRRLVAALVYDNRLQPRRAGARAVGAEQPRLRYQAAGIEIDLEVGESSIAGRLRMLGQVSAAEADLAKAWVAVEGPSGREETDVDEHGQFVLDGLAPGRHRMEIGLAYELIEIPELEI*
Ga0150985_10021255413300012212Avena Fatua RhizosphereMNNDDGKMVDKLSAHDVDDLDLGMPRILADLADGVLGDDEANAVADWLMATADEEPPSWLVNRAVRIAGQAVGQDAPRPSIWRKLVAALVYDNRAQPRMVGARAVAFDHPRLMYQAGGVEIDLEVGNSTIAGRLRMLGQVTAAEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPAVQI*
Ga0150985_10026520823300012212Avena Fatua RhizosphereMNNDFDKGVGNLPTDDLGMPRILADLADGALDDDETNAVANWLLATTEEEPPSWVVNRAVRIAGQSVGQDAPRPSIWRKLVAALVYDNRVQPRVAGARAVAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0150985_10037684213300012212Avena Fatua RhizosphereMSIDHRETVDESEVLLGMPRVLADLADGKLDEGETDAVIDWLSALADEEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIE
Ga0150985_10371661113300012212Avena Fatua RhizosphereMNDKYDETVDTVSAEEFDALGMPRILADLADGALDDDEANAVADWLLSTSDEAPPSWVVNRAVRIAGQVVGQDAPRPSIWRHLVAALVYDNRVQPRAIGARAVAIDHPRLMYQAGGIEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRMETEVDDLGQFSLDGLISGIHRMEIGLAYELIEIPSVQI*
Ga0150985_10428455223300012212Avena Fatua RhizosphereMSDTYDDAAGKLSAEELEDLGMPRILVDLADGVLDDDEAGSVADWLVATADEVPPTWVVNRAVRIAGQAVGKDTPRPSIWRHLVAALVYDNRAQPRMAGARAVAFDHPRLMYQAGGVEIDLEVGQSTIAGKLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0150985_10461072933300012212Avena Fatua RhizosphereMNDNYDELVDKLSAEELEDFGMPRILADLADGALDGDEANAVADWLVATADEAPPSWVVNRAVRIAGQAVGQDAPRPSIWRHLVAALVYDNRVQPRAIGARAVAIDHPRLMYQAGGIEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRMETEVDNLGQFSLDGLIS
Ga0150985_10489113423300012212Avena Fatua RhizosphereMGIDHDETVEESGYVLGMPRILADLADGKLSASETGMVVDWLSASATEEPPHWLVNRAVRVAGQAIGKDAPRPAMWRRLVAALVYDNRVQPKIAGARSVALEAPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFTLDGLVSGAHRMEIGLAYELIEIPSVQI*
Ga0150985_10661726613300012212Avena Fatua RhizosphereMNDKYDESVDKMSPVDLDELDDIGMPRILADLADGALDDDEANAVADWLAATADETPPGSVVNRAVRIAGQVVGKDAPRPSIWRHLVAALVYDNRLQPRVAGARAVAIDHPRLMYQAGGIEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLG
Ga0150985_10903893123300012212Avena Fatua RhizosphereMSNEHDKTVDERPAYDDEDLGMPRILADLADGMLDDDEADAVADWLTSTAEEEPPGWVVNRAVRIAGQVVGKEAPRPSIWRRLVAALVYDNRLQPRISGARAVAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0150985_11020300823300012212Avena Fatua RhizosphereMNDNYDELVDTLSAEERDDLGMPRILADLADGTLDDDEANAVADWLLATADEAPPSWVVNRAVRVAGQAVGQDAPRPSIWRHLVAALVYDNRVEPRAVGARAVAIDHPRLMYQAGGIEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLISGIHRMEIGLAYELIEIPSVQI*
Ga0150985_11101592323300012212Avena Fatua RhizosphereGGGGRAGGDLAELGTPRTLADLAAGALDEDEGNAVAAWLVATADEVPSSWDVNRAVRIAGQAVGQDEPRPSTWRRLVATLAYDNRYQPAPVGARGVMMDQPRVMYQAGGVEIDLEVGRSTIAGRLRMLGQVTAAEPDLARAWVLAEGPSGCFETEVDDLGQFSLDGLMSGVHRMEIGLAYALIEIPSVQI*
Ga0150985_11128086713300012212Avena Fatua RhizosphereDLADGTLSETEMDAVVDWLSVAEDEGPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDNPRLMYQAGGVEIDLEVGDSMIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPS
Ga0150985_11374837913300012212Avena Fatua RhizosphereMGIDHEETVDESDALLGMPRVLADLADGKLDMGETDAVVDWLSALADEEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0150985_11539206813300012212Avena Fatua RhizosphereMKNDFDKEIGKLPIHDLDELGMPRILADLADGALDDDEANAVANWLLATAEEEPPSWVVNRAVRIAGQSVGQDAPRPSIWRKLVAALVYDNRVQPRVAGARAVAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0150985_11568721913300012212Avena Fatua RhizosphereMGIDHIETVDESDVLLGMPRVLADLADGRLDENETDAVVNWLSALGDEEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRITGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGVHRIEIVLAYELIEIPSVQI*
Ga0150985_11576092823300012212Avena Fatua RhizosphereMSNDNDMTVDERSTYDADEIGMPRILADLADGVLGDEEANAVADWLMSTAEEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRIAGARAVPIENPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEI
Ga0150985_11584552123300012212Avena Fatua RhizosphereMYMGIDHDETVDESDVILGMPRVLAELADGKLSAAETDAVVDWLSAMEDEEPPHWLVNRAVRIAGQSLGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSLALDNPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRIESEVDDLGQFSIDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0150985_11847838313300012212Avena Fatua RhizosphereANDVAHWLMATADEAPPSWVVNRAVRIAGQAVGQDAPRPSTWRRLVAALVYDNRSQLRVAGARGVMIDQPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVLADGPSGRMEAEVDNLGQFSLDGLISGIHRMEIGLAYELIEIPSVQI*
Ga0150985_12115974823300012212Avena Fatua RhizosphereMGKDDDMTIDPPAVDNLEALGMPRILADLADGELGDEEANAVADWLLATAEEEPPSWVVNRAVRIAGQAVGKDAPQPSIWRRLVAALVYDNRLQPRVAGARSVANDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPAVNI*
Ga0150985_12129968213300012212Avena Fatua RhizosphereMNDNYDEMVDKLSAEELEDLGMPRILADLADGALDDDEASAVADWMVATADEAPPSWVVNRAVRIAGQAVGQDAPRPSIWRHLVAALVYDNRVQPRAIGARAVAIDHPRLMYQAGGIEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRMETEVDDLGQFSLDGLISGIHRMEIGLAYELIEIPSV
Ga0150985_12131547013300012212Avena Fatua RhizosphereGKLPIHDFDDLGMPRILADLADGALDDDEANAVANWLMATAEEEPPSWVVNRAVRIAGQSVGQDAPRPSIWRKLVAALVYDNRVEPRVAGARAVAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRME
Ga0137372_1013265313300012350Vadose Zone SoilMGIDHEETVDESYAILGMPRVLADLADGTLSEAETGAVVDWLSALADDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIELEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0137368_1005213823300012358Vadose Zone SoilMSDDETPQDEKTTDRVEMPRILIDLAQGRLDGTETEAVVSWLGKSAEPEPAPWLVNRAVRIPRQAAGDRPARPAAWRRLVAALVYDNRLPPRLAGARAVGAEQPRLRYQAAGIEIDLEVGESSIAGRLRMLGQVSAAEADLAKAWVAVEGPSGREETDVDEHGQFVLDGLAPGRHRMEIGLAYELIEIPELEI*
Ga0137375_1041619613300012360Vadose Zone SoilMSDDETPQDEKTTDPVEMPRILIDLAQGRLDGTETEAVVSWLGKSAEPEPAPWLVNRAVRIPRQAAGDRPARPAAWRRLVAALVYDNRLPPRLAGARAVGAEQPRLRYQAAGIEIDLEVGESSIAGRLRMLGQVSAAEADLAKAWVAVEGPSGREETDVDEHGQFVLDGLAPGRHRMEIGLAYELIEIPELEI*
Ga0137375_1087607213300012360Vadose Zone SoilMGIDHEETVDESYAILGMPRVLADLADGTLSEAETGAVVDWLSALADDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0134036_108754713300012384Grasslands SoilEETVDESDAILGMPRVLADLADGKLDEAETDAVVDWLSALADDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0134031_130277313300012388Grasslands SoilTVDESDAILGMPRVLADLADGKLDEAETDAVVDWLSALADDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0134061_119447013300012399Grasslands SoilMGIDHEETVDESDVILGMPRVLADLADGKLSEEETDAVVDWLSALADDEEPPHWLVNRAVRVAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0134060_109043613300012410Grasslands SoilMGIDHEETVDESDVILGMPRVLADLADGKLSEAETDAVVDWLSALADDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGMHRMEIGLAYELIEIPSVKI*
Ga0150984_10159267423300012469Avena Fatua RhizosphereMSDTYDDAAGKLSAEELEDLGMPRILVDLADGVLDDDEAGSVADWLVATADEVPPTWVVNRAVRIAGQAVGKDTPRPSIWRHLVAALVYDNRAQPRMAGARAVAFDHPRLMYQAGGVEIDLEVGQSTISGKLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0150984_10617624013300012469Avena Fatua RhizosphereGALDEDEANAVAAWLVATADEVPSSWDVNRAVRIAGQAVGRDEPRPSTWRRLVATLAYDNRYQPAPVGARGVMMDQPRVMYQAGGVEIDLEVGRSTIAGRLRMLGQVTASEPDLTRAWVLAEGPSGRFETEVDDLGQFSLDGLVSGVHRMEIGLAYELIEIPSVQV*
Ga0150984_10678723023300012469Avena Fatua RhizosphereMNDNYDEMVDKLSAEGLDDLRMPRILADLADGVLDDDEANAVADWLLATADEAPPSWVVNRAVRIAGQAVGQDAPRPSIWRHLVAALVYDNRVQPRAIGARAVAIDHPRLMYQAGGIEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSWTG*
Ga0150984_10859720033300012469Avena Fatua RhizosphereMNDNYDELVDKLSAEELEDFGMPRILADLADGALDGDEANAVADWLVATADEAPPSWVVNRAVRIAGQAVGQDAPRPSIWRHLVAALVYDNRVQPRAIGARAVAIDHPRLMYQAGGIEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIADGPSGRMETEVDNLGQFSLDGLISGIHRMEIGLAYELIEIPSVQI*
Ga0150984_10910003823300012469Avena Fatua RhizosphereMDKDNEATVDNISADGFEEFGMPRILADLADGALGAEEANAVAHWLLATADEEPPGWVVNRAVRIAGQAVGRDAPQPSIWRRLVAALVYDNRLQPRVAGARALAVEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGVHRMEIGLAYELIEIPSVQI*
Ga0150984_11306575413300012469Avena Fatua RhizosphereMGKDDDMTIDPPAVDNLEALGMPRILADLADGELGDEEANAVADWLLATAEEEPPGWVINRAVRIAGQAVGRDAPRPSIWRRLVAALVYDNRLQPRVAGARSVAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDL
Ga0150984_11422839323300012469Avena Fatua RhizosphereDDLGMPRILADLADGALDDDEANAVADWLLATADEAPPNWVVNRAVRIAGQAVGQDAPRPSIWRHLVAALVYDNRVQPRAVGARAVAIDHPRLMYQAGGIEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLISGLHRMEIGLAYELIEIPSVQI
Ga0150984_11476527713300012469Avena Fatua RhizosphereMGHETPGTLAVLDATDATLGMPRILIELADGNLGTEAANAVADWLLAAADEEPPSWAVNRSVRIAGQSRGQEAPQPPAWRRIVAALVYDTRLQPRFAGARAVAVERRRMRYQAGGTEIDLEVGGSQMTGRLRMLGQVTAGETGLARACVITEGPSGRFETKVDALGQFSFDGLVSAVHRLEIGLAHELIEIPAIHL*
Ga0150984_11540233223300012469Avena Fatua RhizosphereMSNEHDKTVDERPAYDDEDLGMPRILADLADGMLDDDEADAVADWLTSTAEEEPPGWVVNRAVRIAGQVVGKEAPRPSIWRRLVAALVYDNRLQPRISGARAVAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIG
Ga0150984_11689745813300012469Avena Fatua RhizosphereMGIDHIETVDESDVLLGMPRVLADLADGRLDENETDAVVNWLSALGDEEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRITGARSIALDHPRLMYQAGGVDIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGVHRMEIGLAYELIEIPSVQI*
Ga0150984_11734601923300012469Avena Fatua RhizosphereMYMGIDHDETVDESDVILGMPRVLAELADGKLSAAETDAVVDWLSAMEDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSLALDNPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRIESEVDDLGQFSIDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0150984_11882588413300012469Avena Fatua RhizosphereMSNDNDMTVDERSTYDADEIGMPRILADLADGVLGDEEANAVADWLMSTAEEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRIAGARAVAIENPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI*
Ga0150984_12365518333300012469Avena Fatua RhizosphereMQRREQMGIDHDETVEESGYVLGMPRILADLADGKLSASETGMVVDWLSASATEEPPHWLVNRAVRVAGQAIGKDAPRPAMWRRLVAALVYDNRVQPKIAGARSVALEAPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFTLDGLVSGAHRMEIGLAYELIEIPSVQI*
Ga0136614_1003052113300012684Polar Desert SandMRVIDEAAEPIGMPRILIDLAEGRLDPAETETVVDWLRASGNDEPPAWVVNRAVRIPRQAADGRKSRPAAWRRLVAALVYDNRLQPRVAGARAITFDQPRLMYQAGGVEIDLEVSESSISGRLRMLGQVTAEEPDLARAWVVAEGPGGRTEGEVDELGQFVLDGLVGGRHKMEIGLTYELIEIPELEL*
Ga0132258_1012570943300015371Arabidopsis RhizosphereMYMGLDHDETVDESDVFLGMPRVLADLADGKLSATETDAVVDWLSAMEDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVAGARSLALDNPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGVHRMEIGLAYELIEIPSVNI*
Ga0136617_1015221013300017789Polar Desert SandMRVIDEAAEPIGMPRILIDLAEGRLDPAETETVVDWLRASGNDEPPAWVVNRAVRIPRQAADGRKSRPAAWRRLVAALVYDNRLQPRVAGARAITFDQPRLMYQAGGVEIDLEVSESSISGRLRMLGQVTAEEPDLARAWVVAEGPGGRTEGEVDELGQFVLDGLVGGRHKMEIGLTYELIEIPELEL
Ga0136617_1029760113300017789Polar Desert SandMVDVKGTTAARARAAGRYIKWSVDDDERRAIMDQDHMTEGDELDDGLAVPRILRDLAAGHLAEAEADTVVAWLEATGLEEAPSWLVNRAVRIAGQALGGETPRPAMWRRLVAALVFDNRLQPRLAGSRALGLDHPRLMYEAGGIEIDLEVGDSSIAGRLRMLGQVTASEPDLVRAWVVVDGPSGRLETEVDAMGQFAVDGLASGAHRMEIGLAYELIEIPEVRL
Ga0136617_1093764313300017789Polar Desert SandRIVRDLAEGRLPETEADTVVAWLEATGIEEAPPWLVNRAVRIAGQAMGGATPRPAIWRRLVAALVYDNRLQPRLAGVRALGLDHPRLMYEAGGIEIDLEVGDSSIAGRLRVLGQVTASEPDLVRAWVVVDGPSGRLETEVDAMGQFAVDGLASGAHRMEIGLAYELIEIPEVRL
Ga0190269_1111017513300018465SoilVPMHDETAGMAAVLDADDTPLGMPRILLELADGGLGDADANAVADWLLATADEAPPSWVVNRAVRIAGRARAHEAPRPSTWRRLIAALVCDTRLQPRLAGARAVALEQHRLLYQAGGTEIDLEVGDSRITGRLRLLGQVTATGSDLAHAWVVAEGPTGRFEAEIDALGQFALDGLEPGIHRLEIGLAYELIEIPSVPL
Ga0190269_1160607813300018465SoilGVPRVLHDLAEGLLTEADTDALVAWLEAEGLEEAPPWVVNRAVRIAGQALGGDAPRPAMWRRLVAALVYDNRLQPRLAGARSVSLEHPRLMYEAGGIEIDLEVGDSSIAGRLRMLGQVTASEPDLVRAWVAVDGPSGRLETEVDDLGQFSVDGLASGAGCVVAPCRAGMALMRDALGRDA
Ga0190268_1026444713300018466SoilMNDETAGTAAMLSAADASLGMPRILLELADGGLGDEDANAVADWLLATADEAPPSWVVNRAVRIAGQARSHEAPRPAAWRRLVAALVCDTRLQPRPVGARAVAAEQHRLLYQAGGTEIDLEVGESQISGRLRMLGQVTAREPDLARAWAVADGPSGRLEAEIDALGQFAFDGLVSGIHRLEIGLDDELIEIPAVPI
Ga0180118_134931813300020063Groundwater SedimentVADWLMSTAEEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRISGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0180109_133411413300020067Groundwater SedimentVRYDKDKMVDNSSADELDDLGMPRILADLADGALSDEEASSVANWLMTTAEEEPPGWVINRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRVSGARAVAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0206356_1179816513300020070Corn, Switchgrass And Miscanthus RhizosphereMGIDHEETVDESEVLLGMPRVLADLADGKLNEVETDAVVDWLSALADEEEPPHWLVNRAVRIAGHSLGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSLALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0206353_1021539213300020082Corn, Switchgrass And Miscanthus RhizosphereRVLADLADGKLNEVETDAVVDWLSALADEEEPPHWLVNRAVRIAGHSLGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSLALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDRTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0242658_116160113300022530SoilGIDHEETVDESDALLGMPRVLADLADGKLDEGETDAVVDWLSALADEEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0196962_1023941113300024430SoilMDKDNDMTVDHAAADGFEELGMPRILADLADGALGEDEANAVANWLLATAEEEPPSWVVNRAVRIAGQAVGRDAPRPSIWRRLVAALVYDNRLQPRIAGARAIAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLD
Ga0209605_127557213300025861Anaerobic Digestor SludgeMSKDKERTDDELSVDEFDALEDLGMPRILADLADGSLSDDEANAVADWLMATSEEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRVSGARAIAIDQPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIE
Ga0209181_1022182313300027878FreshwaterMTDDKLSVDELDALADLGMPRILADLADGALSDDEANAVADWLMSTAEEEPAGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRMTGARAIAIDQPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTATEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0209481_1052851313300027880Populus RhizosphereMEKDRDMIVEQAPAESFEDLGMPRILADLANGDLDEDEANAVADWLMATADEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRKLVAALVYDTRLQPRIAGARAIAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRVETEVDD
Ga0209382_1006081933300027909Populus RhizosphereMDKDKDTDNDATVHRASADGFEDLGMPRILADLADGALGDEEANAVADWLLTTAEEEPPSWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRVAGARAIAIEHPRLMYQAGGVEIDLEVGQSKIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVSI
Ga0209382_1009623343300027909Populus RhizosphereMEKDKDMIVEQAPAESFEDLGMPRILADLANGDLDEDEANAVADWLMATADEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRKLVAALVYDTRLQPRIAGARAIAIDHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVSI
Ga0268264_1067842213300028381Switchgrass RhizosphereNSVADWLMSTVEDEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRISGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFALDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0075382_1092617613300030917SoilMGIDHEETVDESDAILGMPRVLADLADGKLSEMETDAVVDWLSALADDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0102747_1100683813300030959SoilDVILGMPRVLADLADGKLNEAETDAVVDWLSALADDEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0308178_109090813300030990SoilMGTHKSTDDEMNMYMQQLADLDDELGMPRILADLADGNLSDEEANSVADWLMSTVEDEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRMQPRISGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDL
Ga0308189_1007805013300031058SoilMGIDHELTVDESEILLGMPRVLADLADGKLGEGETDAVVDWLSALADEEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRITGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEI
Ga0308189_1020375313300031058SoilMKYDDGKMVDELSADDVDALGMPRILADLADGALDDDEANAVADWLMATADEEPASWLVNRAVRIAGQAVGQDAPRPSIWRKLVAALVYDNRVQPRMVGARAVAFDHPRLMYQAGGVEIDLEVGNSTIAGRLRMLGQVTASEPDLTRAWVIAVGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIE
Ga0308189_1024927213300031058SoilMGTHKSTDDEMNMHMQQLADLDDELGMPRILADLADGNLSDEEANSVADWLMSTVEDEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRMQPRISGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFALD
Ga0308192_107505513300031082SoilEILLGMPRVLADLADGKLGEGETDAVVDWLSALADEEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRITGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSV
Ga0308201_1000978523300031091SoilMKYDDGKMVDELSADDVDALGMPRILADLADGVLDDDEANAVADWLMATADEEPASWLVNRSVRIAGQAVGQDAPRPSIWRKLVAALVYDNRVQPRMVGARAVAFDHPRLMYQAGGVEIDLEVGNSTIAGRLRLLGQVTAAEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0308201_1004299623300031091SoilMVTDDDKPVGRALTGDDLNELGMPRILADLAEGALDDDEANAVAHWLLATADDEPPGWVVNRAVRIAGQAVGQDSPRPSTWRRLVAALIYDNRLQPAPVGARGVLMDQPRLMYQAGGVEIDLEVGRSTIAGRLRMLGQVIAAEPDLTRAWVLAEGPSGRLEAEVDDLGQFSLDGLVSGLHRMEIGLAYELIEIPSVQI
Ga0308204_1001043423300031092SoilMKYDDGKMVDELSADDVDALGMPRILADLADGALDDDEANAVADWLMATADEEPASWLVNRAVRIAGQAVGQDAPRPSIWRKLVAALVYDNRVQPRMVGARAVAFDHPRLMYQAGGVEIDLEVGNSTIAGRLRMLGQVTAAEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0308197_1002294023300031093SoilMSNDNDKTVDKRPEYDADDLGMPRILADLADGVLDGEEANAVADWLMSTAEEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRISGARAVAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYE
Ga0308199_120229813300031094SoilDESDVILGMPRVLADLADGKLSEAETDAVVDWLSTSADDDEPPHWLINRAVRIAGQVLGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSMALDHPRLMYQAGGVEIDLEVGDSSIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIH
Ga0308193_100636813300031096SoilMGIDHEETVDESDVILGMPRVLADLADGKLSEAETDAVVDWLSTSADDDEPPHWLINRAVRIAGQVLGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSSIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0308180_102566113300031100SoilMGTHKSTDDEMNMHMQQLADLDDELGMPRILADLADGNLSDEEANSVADWLMSTVEDEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRMQPRISGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFALDGLV
Ga0308187_1004874913300031114SoilMSKDTDKTVDMQSADELDDLGMPRILADLADGALDDEEANAVADWLMSTAEEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRISGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDL
Ga0308187_1033552513300031114SoilVDKRPEYDADDLGMPRILADLADGVLDGEEANAVADWLMSTAEEEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRLQPRISGARAVAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0308195_101275813300031123SoilMGTHKSTDDEMNMHMQQLADLDDELGMPRILADLADGNLSDEEANSVADWLMSTVEDEPPGWVVNRAVRIAGQAVGKDAPRPSIWRRLVAALVYDNRMQPRISGARAIAIEHPRLMYQAGGVEIDLEVGHSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLETEVDDLGQFALDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0308194_1010192513300031421SoilMGIDHEETVDESGVVLGMPRVLADLADGKLSETETEAVVNWLTAMEDEGPPHWIVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDNPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI
Ga0170819_1216176213300031469Forest SoilHMGIDHEEAVDESEVLLGMPRVLADLADGKLDEGETDAVIDWLSALADEEEPPHWLVNRAVRIAGQALGKDAPRPAIWRRLVAALVYDNRLQPRVTGARSIALDHPRLMYQAGGVEIDLEVGDSTIAGRLRMLGQVTASEPDLTRAWVIAEGPSGRLESEVDDLGQFSLDGLVSGIHRMEIGLAYELIEIPSVQI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"