NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F039751

Overview

Basic Information
Family ID: F039751
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 163
Average Sequence Length: 116 residues
Representative Sequence: LKAKTIGKLPISVVLIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSESFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESFARVEGEK
Number of Associated Samples: 141
Number of Associated Scaffolds: 163

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 40.49 %
% of genes near scaffold ends (potentially truncated): 44.79 %
% of genes from short scaffolds (< 2000 bps): 74.23 %
Associated GOLD sequencing projects: 131
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.27

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
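
The quality percentages above can be converted back into approximate gene counts. The following is a minimal sketch, assuming each percentage is computed over the 163 sequences in the family; the page does not state the denominator, and the dictionary labels and function name are illustrative only.

```python
# Assumption: each percentage is taken over the 163 family sequences.
N_SEQUENCES = 163

# Values copied from the Quality Assessment block.
quality_percentages = {
    "valid RBS motif": 40.49,
    "near scaffold end": 44.79,
    "short scaffold (< 2000 bp)": 74.23,
}


def percent_to_count(percent: float, total: int) -> int:
    """Return the whole-number count closest to `percent` of `total`."""
    return round(percent * total / 100)


counts = {
    label: percent_to_count(pct, N_SEQUENCES)
    for label, pct in quality_percentages.items()
}

for label, count in counts.items():
    # Recompute the percentage to see whether it rounds back to the reported value.
    print(f"{label}: ~{count} of {N_SEQUENCES} ({100 * count / N_SEQUENCES:.2f} %)")
```

If the recomputed percentages match the reported ones to two decimals, the assumed denominator is at least consistent with the page.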
Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group: Bacteria (100.000 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (22.086 % of family members)
Environment Ontology (ENVO): Unclassified (36.196 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (45.399 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (163 sequences; interactive view rendered with MSAViewer).



Structure & Topology

Predicted Topology & Secondary Structure

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 50.70%, β-sheet: 2.82%, Coil/Unstructured: 46.48%
Feature Viewer tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score

Predicted 3D Structure

Structure Viewer (PDBe Molstar)
Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.27

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization (chart): All Organisms → Unclassified: 100.0%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization (chart) habitat labels: Groundwater Sediment; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Grasslands Soil; Grass Soil; Forest Soil; Hardwood Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Groundwater Sand; Arabidopsis Rhizosphere; Arabidopsis Thaliana Rhizosphere; Miscanthus Rhizosphere; Switchgrass Rhizosphere; Populus Rhizosphere; Corn Rhizosphere; Avena Fatua Rhizosphere
Chart percentages: 17.2%, 22.1%, 3.1%, 7.4%, 14.7%, 8.0%, 3.7%, 3.1%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
GPIPI_00134720 | 2088090014 | Soil | REQSFGLPRQPYFKAKSYRQAPLPLVLLDYLRPEALRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK
GPIPI_03016970 | 2088090014 | Soil | ASSARVSPAGPIQQRKLSARPIPVALIDYLRPAVFRRAAHRAFINCESLLRPAGVSPPFFAEGAAFVPAPFLLAAQRAFISSESFLRPAAVSAPFFLVGAGFVPPFNLDQRALAAAESLARVEGEK
F47_05320410 | 2170459009 | Grass Soil | VLIDYLRAAFRRAAQRAFINCESLLRPAGVSPPFLAEGAGFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFGPPFSFAQRALAAAESFARVEGEK
F62_08359770 | 2170459010 | Grass Soil | YLEPKTIGKARIPVVLIVIDYLRPAALRRAAQRAFINCDSLFRPAGVSPPFFAEGEGAALVPAPFLLAAQRAFISSESFLRPAGVSAPFFLVGAGFVPPFNLAQRALAAAASLARVEGEK
JGI11643J12802_116041581 | 3300000890 | Soil | RTAQDRMGKPVEVRVLSRAVLWSPRQPYLKAKSYRQAPLPLLLLNYLRPEALRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK*
JGI25381J37097_10024414 | 3300002557 | Grasslands Soil | VLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK*
JGI25389J43894_10049985 | 3300002916 | Grasslands Soil | VLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRAL
Ga0063356_1057109941 | 3300004463 | Arabidopsis Thaliana Rhizosphere | SGPYGETRGGSSPLASSAWVPATYLEANTIGKLRIPVVLIDYLRPAAFRRAAQRAFINCDSLLRPAGVSPPFFAEGAALVPAVFLLAAQRALISSESFLRPAGVSAPFFLVGVGFVPPFSLAQRALAAAASLARVEGEK*
Ga0066683_100786021 | 3300005172 | Soil | VLDSATYLEAKTIGKLRIPVVLIDYLRPAVFRRAAQRAFINCDSLFRPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGFVPPLSLAQRALAAAASLARVEGEK*
Ga0066673_105854072 | 3300005175 | Soil | VLDPATYLEAKTIGKLRIPVILIYYLRPAAFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK*
Ga0066684_104313531 | 3300005179 | Soil | EVRVLSRAELGFLATGPYPEVKTSGKPPTSIILLDYLRAAFLRAAQRAFINCESLLRPAGVSPPFFAEGAFFVPAVFLLAAQRAFISSESFRRPAAVSAPFFLAGAAFVPPFNLAQRALAAAESFARVEGEK*
Ga0066671_105655961 | 3300005184 | Soil | LPDYLRAAFFRAAQRAFINCESLLRPAGVSPPFFAEGAFFVPAVFLLAAQRAFISSESFRRPAAVSAPFFLEGAGFVPPFHLAQRALAAAESFARVEGEK*
Ga0066676_100011619 | 3300005186 | Soil | LKAKTIGKLPIPIASIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESLARVEGEK*
Ga0066675_100315063 | 3300005187 | Soil | LEAKTIGKLRIPVVLIYYLRPAALRRAAQRAFINCESLFRPAGVSPPFFAEGAAFVPPVFLLAAQRAFISSESFLRPAGVNAPFFLVGVGAGFVPPFNLAQRALAAAASFARVEGEK*
Ga0070663_1015958321 | 3300005455 | Corn Rhizosphere | MVDYLRAVFRRAAQRAFINCESLLRPAAVSPPFLAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAHRALAAAESLARVEGEKYRP
Ga0070707_1014718231 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | SSPLASSAWAPATYLEAKTIRKLRILVVLIDYLRPAAFRRAAQRAFINCDSLLRPAGVRPPFFAEGAAFFVPAVFLLAAQRALMSSESLLRPAGVSAPFFLVAVGFVPPFSLAQRALAAAASLARVEGEK*
Ga0066695_102305063 | 3300005553 | Soil | VVLTDYLRPVFLRAAQRAFINCDSLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFINSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK*
Ga0070664_1006996902 | 3300005564 | Corn Rhizosphere | VLSRAVLWSPRQPYLKAKSYRQAPLPLVLLNYLRPEALRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGEK*
Ga0066702_109766921 | 3300005575 | Soil | TRTAQDRMGKPVEVRVLSRAVLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK*
Ga0066708_101427862 | 3300005576 | Soil | IGRRVRLRTVWGNPWRFESSREQLNSATYLEAKTIGKLRIPVVLIYYLRPAALRRAAQRAFINCESLFRPAGVSPPFFAEGAAFVPPVFLLAAQRAFISSESFLRPAGVNAPFFLVGVGAGFVPPFNLAQRALAAAASFARVEGEK*
Ga0066708_101707062 | 3300005576 | Soil | PFLLAAQRAFISWESRFRPAGVSPPFFGAGAVFVPPAFLRAAQRALISSESLLRPAGVSAPFFLAGAGFAVVFCFAQRALAASESLARVVGEK*
Ga0066905_1001643032 | 3300005713 | Tropical Forest Soil | LNDYLRAAFRRAAQRAFINCESLLRPAGVSPPFFAEGAAFVAPAFLLAAQRAFMSSESFLRPAAVSTPFFLVGAGFVPPFNLAQRALAAAESLARVEGEK*
Ga0066903_1000525425 | 3300005764 | Tropical Forest Soil | PALTLTKTIGKLPISVLLTDYFRPAVFLRAAQRAFINCESLLRPAAVSPPFFFDEAAFVPPPFLLAAQRALMSSESFFRPAGVSAPFFWAAAGLLPPFSLAHLALAAAESLARVEGEK*
Ga0066903_1007417003 | 3300005764 | Tropical Forest Soil | LRLPAFLLAAQRAFINCESFLRPAAVSPPFFTEGAFFVPAAFLLAAQRAFISSESFLRPAAVRAPLFLVGAAFIGPFSLAQRALAAAESLARVEGEK*
Ga0066652_1000041676 | 3300006046 | Soil | VKTSGKPPTSIILLDYLRAPFLRAAQRAFINCESLLRPAGVSPPFFAEGAFFVPAVFLLAAQRAFISSESFRRPAAVSAPFFLAGAAFVPPFNLAQRALAAAESFARVEGEK*
Ga0066652_1013202342 | 3300006046 | Soil | VLDPATYLEAKTIGKLRIPVILIYYLRPAAFRRAAQRAFINCDSLFRPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK*
Ga0070712_1000125813 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | LRPAVLRRAAQRAFINCESLFRPAGVRPPFFAEGPALAVLVPAPFLRAAQRAFISSESFLRPAGVSAPFFLVGAGFVPPFSLAQRALAAAASLARVEGEK*
Ga0070712_1004815843 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | VLAFSLPTLSRREKHRQAADCVCIDRDYLRPLAFLRAAQRAFINCESLLRPAGVSPPFFAEAVVFAPPAFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESFSRVEGEK*
Ga0101567_108900771 | 3300006624 | Soil | VRLRTVWGNLWRFESSREHSFGLLPSTLSESESYQQPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGEK*
Ga0075433_107067691 | 3300006852 | Populus Rhizosphere | VLDCPRRPYLEAKTVGKLPILGALIDYLRAAFRRAAQRAFINCESLLRPAAVSPPFFAEGVALVPPPFLLAAQRAFINSDSFLRPAAVRAPFFLAGAGFVPPFSRAQRALAAAESLARVEGEK*
Ga0075434_1007942673 | 3300006871 | Populus Rhizosphere | LRPAAFRRAAQRAFINCESLLRPAGVRPPFFAVGAFLVPPPFLLAAQRAFISSESFFRPAGVRAPFFRGAAGFVAPPLSLDQRALAAAESLARVAGE
Ga0099791_100041244 | 3300007255 | Vadose Zone Soil | LEAKTIGKLRIPVVLIDYLRPEAFRRAAQRAFINCDSLLRPAGVSPPFFAEGAALVPAVFLLAAQRALISSESFLRPAGVSAPFFLVGADFVPPFSLAQRALAAAASLARVEGEK*
Ga0099793_104952911 | 3300007258 | Vadose Zone Soil | YLEPKTIGELPIPVVLIDYLRPAALRRAAQRAFINCESLLRPAGVSPPFLAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLTGAGFVPPFSFAQQALAAVESFARVEGEK*
Ga0066710_1009184631 | 3300009012 | Grasslands Soil | KDYFAPPFLLAAQRAFISWESRFRPAGVSPPFFGAGAVFVPPAFLRAAQRALISSESLLRPAGVSAPFFLAGAGFAVVFCFAQRALAASESLARVVAEK
Ga0066710_1022554582 | 3300009012 | Grasslands Soil | VSQRPYLEAKTIDKLSISVVLTDYLRPVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFINSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK
Ga0066709_1000216115 | 3300009137 | Grasslands Soil | LTDYLRPVFLRAAQRAFISCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFINSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK*
Ga0066709_1024447671 | 3300009137 | Grasslands Soil | KPPTSIILLDYLRAPFLRAALRAFINCDSLLRPAGVSPPFFAEGAFFVPAVFLLAAQRAFISSESFRRPAAVSAPFFLAGAAFVPPFNLAQRALAAAESFARVEGEK*
Ga0114129_116482283 | 3300009147 | Populus Rhizosphere | VLDCPRRPYLEAKTVGKLPILGALIDYLRAAFRRAAQRAFINCESLLRPAAVSPPFFAEGVALVPPPFLLAAQRAFINSDSFLRPAAVSAPFFLGGAGFVPPFSLAQRA
Ga0105249_100368044 | 3300009553 | Switchgrass Rhizosphere | LVSPPTLFESESYRQSPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK*
Ga0126380_100104484 | 3300010043 | Tropical Forest Soil | LVGKTVGSLQFQAVLNDYLRAAFRRAAQRAFINCESLLRPAGVSPPFFAEGAAFVAPAFLLAAQRALMSSESFLRPAAVSTPFFLVGAGFVPPFNLAQRALAAAESLARVEGEK*
Ga0126380_111972991 | 3300010043 | Tropical Forest Soil | RGGSSPLASSAKSPSGLYPEQKTIGKLPMSVLLTDHFRPAVFLRAAQRAFINCESLLRPAAVSPPFFFDEAAFVPPPFLLAAQRALISSESFFRPAGVSAPFFWAAAGLLPPFSLAHLALAAAESLARVEGEK*
Ga0134070_100175221 | 3300010301 | Grasslands Soil | SATYLEAKTIGKLRIPVVLIYYLRPAALRRAAQRAFINCESLFRPAGVSPPFFAEGAAFVPPVFLLAAQRAFISSESFLRPAGVNAPFFLVGVGAGFVPPFNLAQRALAAAASFARVEGEK*
Ga0134086_104859831 | 3300010323 | Grasslands Soil | VLDSATYLEAKTIGKLRIPVVLIDYLRPAVFRRAAQRAFINCDSLFRPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK*
Ga0134064_101934971 | 3300010325 | Grasslands Soil | DYLRPVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK*
Ga0134065_102047311 | 3300010326 | Grasslands Soil | TRTAQDRMGKPVEVRVLSRAVLDSATYLEAKTIGKLRIPVVLIDYLRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK*
Ga0134080_101463622 | 3300010333 | Grasslands Soil | WRFESSREQCLGVPATLFGSENYRFRVVSTDYLRPVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK*
Ga0134063_100638402 | 3300010335 | Grasslands Soil | VLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPFSLAQRALAAAASLARVEGEK*
Ga0134062_103799771 | 3300010337 | Grasslands Soil | VLDPATYLEAKTIGKLRIPVILICYLRPAAFRRAAQRAFINCDSLFRPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGFVPPLSLAQRALAAAASLARVEGEK*
Ga0134066_100113483 | 3300010364 | Grasslands Soil | RVLSRAVLDSATYLEAKTIGKLRIPVVLIYYLRPAALRRAAQRAFINCESLFRPAGVSPPFFAEGAAFVPPVFLLAAQRAFISSESFLRPAGVNAPFFLVGVGAGFVPPFNLAQRALAAAASFARVEGEK*
Ga0134066_100947251 | 3300010364 | Grasslands Soil | LSRAELGFLATGPYPEVKTSGKPPTSIILLDYLRAPFLRAAQRAFINCESLLRPAGVSPPFFAEGAFFVPAVFLLAAQRAFISSESFRRPAAVSAPFFLAGAAFVPPFNLAQRALAAAESFARVEGEK*
Ga0126379_124581071 | 3300010366 | Tropical Forest Soil | MDYLRPAAFLRAAQRAFINCESLLRPAGVSPPFFAEGAVFVPAAFLLAAQRAFISSESFLRPAGVSAPFFLVGAGFPPPFSFAQRAFAAAESLARVEGEK*
Ga0134125_109603311 | 3300010371 | Terrestrial Soil | MVDYLRAVFRRAAQRAFINCESLLRPAAVSPPFLAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAHRALAAAESLARVEGEK*
Ga0126383_126085301 | 3300010398 | Tropical Forest Soil | LSRAVPSCLPALSLTKTIGKLPISVLLTDYFRAAVFLRAAQRAFINCESLLRPAAVSPPFFFDEAAFVPPPFLLAAQRALISSESFFRPAGVSAPFFWAAAGLLPPFSLAHLALAAAESLARVEGEK*
Ga0137383_100036624 | 3300012199 | Vadose Zone Soil | VYPRPYVEAKTIGKLPILVASIDYLRAAFRRAAQRAFINCESLLRPAGVSPPFFAEGAALVPPAFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLDQRALAAAESLARVEGEK
Ga0137382_101303602 | 3300012200 | Vadose Zone Soil | VLYSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK*
Ga0137382_106456331 | 3300012200 | Vadose Zone Soil | LKAKAIGQPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEAAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK*
Ga0137365_100377821 | 3300012201 | Vadose Zone Soil | AFLRAAQRAFINCESLLRPAGVSPPFFAEGAALVPADFLLAAQRAFINSESFLRPAAVSAPFFFAGAGFVPPFSFAQRALAAAESLARVEGEK*
Ga0137363_112241372 | 3300012202 | Vadose Zone Soil | LEAKTIDKLSISVVLTDYLRPVFFRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSRAQRALAAAASFARVEGEK*
Ga0137363_116279371 | 3300012202 | Vadose Zone Soil | APATYLEAKTIRKLRILVVLIDYLRPAAFRRAAQRAFINCDSLLRPAGVSPPFFAEGAALVPAVFLLAAQRALISSESFLRPAGVSAPFFLVEAGFVPPFNLAQRALAAAVTLARVEGEK
Ga0137374_101541783 | 3300012204 | Vadose Zone Soil | YGQPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSFAQRALAADESFARVEGEK*
Ga0137362_114209661 | 3300012205 | Vadose Zone Soil | LIDYLRAVFRRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK*
Ga0137381_103056943 | 3300012207 | Vadose Zone Soil | LTDYLRPVFLRAAQRAFINCDSLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSRAQRALAAAASFARVEGEK*
Ga0137376_100394133 | 3300012208 | Vadose Zone Soil | LTDYLRPVFLRAAQRAFINCDSLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFINSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK*
Ga0137376_106740141 | 3300012208 | Vadose Zone Soil | LKAKAIGQPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEAAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAAEESFARVDGEK*
Ga0137377_114675121 | 3300012211 | Vadose Zone Soil | RAAQRAFINCDSLLRPAGVSPPFFAEGAAFVPAPFLLAAQRAFISSESFLRPAGVSAPFFLAGAGFVLPFNLAQRALAAAASFARVEAEK*
Ga0137370_102095612 | 3300012285 | Vadose Zone Soil | KAKTIGKLPIPIVLIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEAAAFVPPDFLLAAQRAFISSESFLRPAAVSAPFFLLGAAFVPPFSLAQRALAAAESLARVEGEK*
Ga0137372_108347641 | 3300012350 | Vadose Zone Soil | LEAKTIDKLSISVALTDYLRPVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSRAQRALAAAASFARVEGEK*
Ga0137367_104841311 | 3300012353 | Vadose Zone Soil | LEAKTIDKLSISVALTDYLRPVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLLAAQRAFISSESFLRPAAVSAPFFLAGAGFVPPFSFAQRALAADESFARVEGEK*
Ga0137366_101041814 | 3300012354 | Vadose Zone Soil | VLDSATYLEAKTIGKLRIPVVLIYYLRPAVFRRAAQRAFINCDSLLRPAGVSPPFFAEGAAFVPPVFLLAAQRAFMSSESFLRPAGVSAPFFLVGVGAGFVLPFNLAQRALAAAASLARVEGEK*
Ga0137371_111205831 | 3300012356 | Vadose Zone Soil | LEAKTIGKLRIPVVLIYYLRPAALRRAAQRAFINCESLFRPAGVSPPFFAEGATFVPPVFLLAAQRAFISSESFLRPAGVNAPFFLVGVGAGFVPPFNLAQRALAAAASFARVEGEK*
Ga0137368_100982163 | 3300012358 | Vadose Zone Soil | LKAKTIGKLPIPIVLIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLDQRALAAAESLARVEGEK*
Ga0137385_111166852 | 3300012359 | Vadose Zone Soil | LEAKTIDKVSISVVLTDYLRPVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK*
Ga0137375_100463215 | 3300012360 | Vadose Zone Soil | VSQGPISDVLIDYLRAVFLRAAQRAFINCESLLRPAAVSPPFFAEGAAFVPADFLLAAQRAFISSESFLRPAAVSAPFFLAGAGFVPPFSFAQRALAADESFARVEGEK*
Ga0137360_101797303 | 3300012361 | Vadose Zone Soil | LEANTIGKLRIPVVLIDYLRPAAFRRAAQRAFINCDSLLRPAGVSPPFFAEGAALVPAVFLLAAQRALISSESFLRPAGVSAPFFLVGVGFVPPFSLAQRALAAAASLARVEGEK*
Ga0134041_11906862 | 3300012405 | Grasslands Soil | VRVLSRAVLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK*
Ga0150984_1201638922 | 3300012469 | Avena Fatua Rhizosphere | LVSPPTLFESKSYRQPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVRPPFFAEAAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK*
Ga0137373_103035201 | 3300012532 | Vadose Zone Soil | VYPRPYVEAKTIGKLPILVASIDYLRAAFRRAAQRAFINCESLLRPAGVSPPFFAEGAALVPPAFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLALWALAADESFALVEGEK
Ga0137358_100898872 | 3300012582 | Vadose Zone Soil | VLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVGAGFVPPFSLAQRALAAAASLARVEGEK*
Ga0157285_102232661 | 3300012897 | Soil | LLLNYLRPEALRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAADESFARVDGEK*
Ga0157298_100223692 | 3300012913 | Soil | VEVRVLSRAVLGRSPNRPYLKAKTIGKLPIQVVLIDYLRAAFRRAAQRAFINCESLLRPAGVSPPFLAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLGGAGFVRPFSLAQRALAAAESLARVEGEK*
Ga0137394_106186512 | 3300012922 | Vadose Zone Soil | LIDYLRAVFRRAAQRAFINCDSLLRPAGVSPPFFAEGAALVPAVFLLAAQRALISSESFLRPAGVSAPFFLVGADFVPPFSLAQRALAAAASLARVEGEK*
Ga0137419_102208241 | 3300012925 | Vadose Zone Soil | LEAKTIGKLRIPVVLIDYLRPEAFRRAAQRAFINCDSLLRPAGVSPPFFAEGAALVPAVFLLAAQRALISSESFLRTAGVSAPFFLVGADFVPPFSLAQRALAAAASLARVEGEK*
Ga0137416_111420291 | 3300012927 | Vadose Zone Soil | GGSSPLASSAWVSQRPYLEAKTIDKLSISVVLTDYLRPVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLEGDGFVPPFSLAQRALAAAASFARVEGEK*
Ga0137404_100222993 | 3300012929 | Vadose Zone Soil | LVSPPTLFESKSYRQPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEAAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVEGEK*
Ga0137404_101273793 | 3300012929 | Vadose Zone Soil | VSQRPYLEAKTIDKLSISVVLTDYLRPVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLEGDRFVPPFSLAQRALAAAASFARVEGEK
Ga0137407_100529733 | 3300012930 | Vadose Zone Soil | LIDYLRPAAFRRAAQRAFINCDSLLRPAGVSPPFFATLAALVPAPFLLAAQRALISSESFLRPAGVSAPFFLVGAGFVPPFNLAQRALAAAASLARVEGEK*
Ga0137407_106304493 | 3300012930 | Vadose Zone Soil | LEAKTIDKLSISVVLTDYLRPVFLRAAQRAFINCDSLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLEGDGFVPPFSRAQRALAAAASFARVEGEK*
Ga0134110_100639372 | 3300012975 | Grasslands Soil | VLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLAPVEGEK*
Ga0157307_10089771 | 3300013096 | Soil | LKAKSYRQAPLPLLLLNYLRPEALRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK*
Ga0134081_101057341 | 3300014150 | Grasslands Soil | VLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFRPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK*
Ga0137409_102403541 | 3300015245 | Vadose Zone Soil | LETKAIDRLSISVVLTDYLRPVFLRAAQRAFINCESLFRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSRAQRALAAAASFARVEGEK*
Ga0137403_100269381 | 3300015264 | Vadose Zone Soil | GSGPYGETRGGSSPLASSPLVSPPTLFESKSYRQPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEAAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVEGEK*
Ga0132255_1008108051 | 3300015374 | Arabidopsis Rhizosphere | LVSPPTLFESESYRQPSLPIILVDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGEK*
Ga0163161_120372061 | 3300017792 | Switchgrass Rhizosphere | MVDYLRAVFRRAAQRAFINCESLLRPAAVSPPFLAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK
Ga0184604_100070492 | 3300018000 | Groundwater Sediment | LVCQGPYLKAKTIGKLPISVVLIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGE
Ga0184638_10506382 | 3300018052 | Groundwater Sediment | LVSPPTLFESESYRQPLLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEAAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK
Ga0184621_102788671 | 3300018054 | Groundwater Sediment | VSQGPISGVLIDYLRAVFLRAAQRAFINCESLFRPAAVSPPFFAEGAAFVPPDFLLAAQRAFISSESFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESFARVEGEK
Ga0184635_100153371 | 3300018072 | Groundwater Sediment | DYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEAAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK
Ga0066655_101308412 | 3300018431 | Grasslands Soil | VLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK
Ga0066655_102934521 | 3300018431 | Grasslands Soil | LLDYLRAAFLRAAQRAFINCESLLRPAGVSPPFFAEGAFFVPAVFLLAAQRAFISSESFRRPAAVSAPFFLAGAAFVPPFNLAQRALAAAESFARFEGEK
Ga0066667_113952321 | 3300018433 | Grasslands Soil | VKTSGKPPTSIILLDYLRAPFLRAAQRAFINCESLLRPAGVSPPFFAEGAFFVPAVFLLAAQRAFISSESFRRPAAVSAPFFLAGAAFVPPFNLAQRALAAAESFARVEGEK
Ga0066662_101846624 | 3300018468 | Grasslands Soil | VLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLA
Ga0066662_127329171 | 3300018468 | Grasslands Soil | YPQAADFGCLIDYLCPAAFRRAAHRAFINCDSLLRPAGVSPPFFAEGAAFFVPAVFLLAAQRALMSSESFLRPAGVSAPFFLLPVGLVPPFSLAQRALAAAASLARVEGEK
Ga0066669_100842684 | 3300018482 | Grasslands Soil | VKTSGKPPTSIILLDYLRAPFLRAAQRAFINCESLLRPAGVSPPFFAEGAFFVPAVFLLAAQRAFISSESFRRPAAVSAPFFLAGAAFVPPFNLAQRALAAAESFARVEGET
Ga0066669_122342181 | 3300018482 | Grasslands Soil | VSQRSYLEAKTIDKLSISVVLTDYLRPAFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFINSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASLARVEGEK
Ga0137408_13559223 | 3300019789 | Vadose Zone Soil | LVSPPTLFESKSYRQPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEAAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVEGEK
Ga0193704_10342301 | 3300019867 | Soil | LKAKAIGNPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEAAAFVPPDAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGE
Ga0193720_10004094 | 3300019868 | Soil | LKAKTIGKLPISVVLIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK
Ga0193715_10257591 | 3300019878 | Soil | LVSPPTLFESESYRQLSLPIILVDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGE
Ga0193707_11034861 | 3300019881 | Soil | VSQRPYLEAKTIDKLSISVVLTDYLRPVFLRAAQRAFINCESLFRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK
Ga0193728_12669361 | 3300019890 | Soil | WRFESSREQSFGLPRQPYLKAKAIGNPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEAAAFVPPDAFVPPDLLLAAQRAFISSDSFLRPAAVSAPFFLTGAGFVPPFSFAQRALAADESFARVEGEK
Ga0193693_10205371 | 3300019996 | Soil | PYGETRGGSSPLASSAWVSQGPISGVLIDYLRAVFLRAAQRAFINCESLFRPAAVSPPFFAEGAAFVPPDFLLAAQRAFISSESFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESFARVEGEK
Ga0193732_10792111 | 3300020012 | Soil | PPTLFESESYRQLSLPIILVDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGEK
Ga0193734_10385091 | 3300020015 | Soil | SSPLVSPPTLFESESYRQPSLPIILVDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEDAAFVPPDAFVPLDLLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGEK
Ga0193721_10045263 | 3300020018 | Soil | LVSPPTLFESESYRQPSLPIILVDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAEAGFVPPFSLAQRALAAAESLARVEGE
Ga0193745_10283562 | 3300020059 | Soil | DYLRAVFLRAAQRAFINCESLFRPAAVSPPFFAEGAAFVPPDFLLAAQRAFISSESFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESFARVEGEK
Ga0210381_102285281 | 3300021078 | Groundwater Sediment | LVSPPTLFESESYRQPSLPIILVDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEDAAFVPPDAFVPLDLLLAAQRAFISSDSFLRPAAVSAPFFLAGAGF
Ga0210382_104399261 | 3300021080 | Groundwater Sediment | ARDLYLKAKTIGKLPISVVLIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK
Ga0210382_104755981 | 3300021080 | Groundwater Sediment | VSQGPISGVLIGYLRAVFLRAAQRAFINCESLFRPAAVSPPFFAEGAAFVPPDFLLAAQRAFISSESFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESFARVEGEK
Ga0210402_119893931 | 3300021478 | Soil | LIDYLRPAALRRAAQRAFINCDSLFRPAGVSPPFFAEGEGAAFVPAPFLLAAQRAFISSESFLRPAGVSAPFFLVGAGFGPPFNLAQRALAAAASLARVEGEK
Ga0126371_122845071 | 3300021560 | Tropical Forest Soil | MLIDYLRPPALRRAAQRAFINCESLRRPAGVSPPFFAEGAAFVPEVLLLAAHRAFINSESFLRPAAVSTPFFLLGAGLAPPFSLAHRALAAAESLARVEGEK
Ga0222622_101431953 | 3300022756 | Groundwater Sediment | PYGETRGGSSPLASSPLVSPPTLFESESYRQLSLPIILVDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGEK
Ga0222622_101850451 | 3300022756 | Groundwater Sediment | VSQGPISGVLIDYLRAVFLRAAQRAFINCESLLRPAAVSPPFFAEGAAFVPPDFLLAAQRAFISSESFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESFARVEGEK
Ga0193714_10055123 | 3300023058 | Soil | SLPIILVDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAEAGFVPPFSLAQRALAAAESLARVEGEK
Ga0207693_100102317 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | LRPAAFRRAAQRAFINCDSLLRPAGVRPPFFAEGAAFFVPAVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGFVPPFSLAQRALAAAASLARVEGEK
Ga0207657_105915072 | 3300025919 | Corn Rhizosphere | LKAKSYRQAPLPLLLLNYLRPEALRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK
Ga0207646_101292773 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | RQAADCVCIDRDYLRPLAFLRAAQRAFINCESLLRPAGVSPPFFAEAVVFAPPAFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGEK
Ga0207651_111315811 | 3300025960 | Switchgrass Rhizosphere | YGSGPYGETRGGSSPLASSAWVSPATYLKAKTIGKLPIPVVLIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGEK
Ga0207677_107188483 | 3300026023 | Miscanthus Rhizosphere | PVEVRVLSRAVLWSPRQPYLKAKSYRQAPLPLLLLNYLRPEALRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK
Ga0207678_107674753 | 3300026067 | Corn Rhizosphere | MVDYLRAVFRRAAQRAFINCESLLRPAAVSPPFLAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAHRALAAAESLARVEGEKYRPAA
Ga0209688_10009712 | 3300026305 | Soil | LEAKTIGKLRIPVVLIYYLRPAALRRAAQRAFINCESLFRPAGVSPPFFAEGAAFVPPVFLLAAQRAFISSESFLRPAGVNAPFFLVGVGAGFVPPFNLAQRALAAAASFARVEGEK
Ga0209687_10525851 | 3300026322 | Soil | PDYLRAAFFRAAQRAFINCESLLRPAGVSPPFFAEGAFFVPAVFLLAAQRAFISSESFRRPAAVSAPFFLEGAGFVPPFNLAQRALAAAESFARVEGEK
Ga0209470_11817621 | 3300026324 | Soil | VLGCSPNRPYLKAKTIGKLPIPIASIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESLARVEGEK
Ga0209470_13346591 | 3300026324 | Soil | DYLRPVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK
Ga0209473_10034905 | 3300026330 | Soil | VLDSATYLEAKTIGKLRIPVVLIDYLRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK
Ga0209158_10621653 | 3300026333 | Soil | GETRGGSSPLASTAWAPATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK
Ga0209057_10423841 | 3300026342 | Soil | VLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEG
Ga0209057_10669473 | 3300026342 | Soil | KPVEVRVLSRAVLDSATYLEAKTIGKLRIPVVLIDYLRPAVFRRAAQRAFINCDSLFRPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPLSLAQRALAAAASLARVEGEK
Ga0257163_10799161 | 3300026359 | Soil | AQRAFINCDSLLRPAGVRPPFFAEGAAFFVPAVFLLAAQRAFMSSESFLRPAGVSAPFFPVAVGFVPPFSLAQRALAAAASLARVEGEK
Ga0257157_10875291 | 3300026496 | Soil | FDYLRPAAFRRAAQRAFINCDSLLRPAGVRPPFFAEGAAFFVPAVFLLAAQRAFMSSESFLRPAGVSAPFFPVAVGFVPPFSLAQRALAAAASLARVEGEK
Ga0209808_10349721 | 3300026523 | Soil | VLDSATYLEAKTIGKLRIPVVLIDYFRPAVFRRAAQRAFINCDSLFLPAGVSPPFFAEGAAFVPPVFLLAAQRALMSSESFLRPAGVSAPFFLVAVGLVPPPLSLAQRALAAAASLARVEGEK
Ga0209577_104401601 | 3300026552 | Soil | APPFLRAAQRAFISWESRFRPAAVSPPFFGAGAVFVPPAFLRAAQRALISSESLLRPAGVSAPFFLAGAGFAVVFCFAQRALAASESLARVVGEK
Ga0207497_1010611 | 3300026786 | Soil | VLGYPQRSCFEAKTIGKPPIPIVLIDYLRAAFRRAAQRAFINCESLLRPAGVSPPFLAEGAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLGGAGFVPPFSLAQRALAAAESLARVEGEK
Ga0209858_10188051 | 3300027948 | Groundwater Sand | PIPLVLIDYLRPAALRRAAQRAFINCDSLLRPAGVSPPFFAEGAAFVPAPFLLAAQRAFISSESFLRPAGVSAPFFLVGAGFVPPFNLAQRALAAAASLARVDGEK
Ga0137415_100122541 | 3300028536 | Vadose Zone Soil | GGSSPLASSAWVSQRPYLEAKTIDKLSISVVLTDYLRPVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK
Ga0307299_100219372 | 3300028793 | Soil | LKAKTIGKLPISVVLIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDFLLAAQRAFISSESFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESFARVEGEK
Ga0307284_102678641 | 3300028799 | Soil | SGVLIDYLRAVFLRAAQRAFINCESLFRPAAVSPPFFAEGAAFVPPDFLLAAQRAFISSESFLRPAAVSAPFFLAGAGFVPPFSFAQRALAAAESFARVEGEK
Ga0307302_100380811 | 3300028814 | Soil | LVSPPTLFESKSYRQPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEAAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK
Ga0307312_106658191 | 3300028828 | Soil | VWLDYLRPEALRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK
Ga0307304_106001051 | 3300028885 | Soil | PLVCQGPYLKAKTIGKLPISVVLIDYLSAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK
Ga0138303_14320631 | 3300030939 | Soil | LINYNAQRSRRRINPGAFRRAAQRAFINCDSLFRPAGVSPPFFAEGEGAAFVPAPFLLAAQRAFISSESFLRPAGVSAPFFLVGAGFVPPFNLAQRALAAAASLARVEGEK
Ga0075375_104148501 | 3300030971 | Soil | RAAQRAFINCDSLFRPAGVSPPFFAEGEGAAFVPAPFLLAAQRAFISSESFLRPAGVSAPFFLAGAGFVPPFNLAQRALAAAASLARVEGEK
Ga0068589_116437251 | 3300030979 | Soil | GFSVDFFFMMLDGDAETAQRAFINCDSLFRPAGVSPPFFAEGEGAAFVPAPFLLAAQRAFISSESFLRPAGVSAPFFLVAAGFVPPFNLAQRALAAAASLARVEGEK
Ga0308189_102852341 | 3300031058 | Soil | LVSPLTLFESESYRQPSLPIILVDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEDAAFVPPDAFVPLDLLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGEK
Ga0308193_10535572 | 3300031096 | Soil | LVSPPTLFESESYRQPSLPIILVDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDAFVPLDLLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAA
Ga0308187_103031922 | 3300031114 | Soil | LVSPPTLFESKSYRQPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVRPPFFAEAAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGLAPPFSLAQRALAADESFARVDGEK
Ga0308194_103106541 | 3300031421 | Soil | LKAKAIGNPPLPLVLLDYLRPEAFRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADESFARVDGEK
Ga0308194_103698041 | 3300031421 | Soil | LASSVLVCQGPLFESQNYRQAPISVVLIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPADFLRAAQRAFISSESFLRPAAVSAPFFLAGDGFVPPFSLAQRALAAAASFARVEGEK
Ga0170818_1055153981 | 3300031474 | Forest Soil | VLGCSPKPALLKAKTIGKLPIYVVLIDYLRAAFRRAAQRAFINCESLFRPAGVSPPFLAEGAGFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFGPPFSFAQRALAAAESFARVEGEK
Ga0310813_104278191 | 3300031716 | Soil | VLGYPQRPYLEWKTIGKPLSIPVLIDYLRPEALRRAAQRAFINCESLLRPAGVRPPFFADGAAFVPAAFLLAAQRAFINSDSFLRPAAVSAPFFLVGAGLVPPFSLAQRALAAAESLARVEGEK
Ga0307468_1000892782 | 3300031740 | Hardwood Forest Soil | LKAKTIGKLPIPVVLIDYLRAVFLRAAQRAFINCESLLRPAGVSPPFFAEGAAFVPPDLLLAAQRAFISSDSFLRPAAVSAPFFLAGAGFVPPFSLAQRALAAAESLARVEGEK
Ga0310810_100348911 | 3300033412 | Soil | VLGYPQRPYLEWKTIGKPLSIPVLIDYLRPEALRRAAQRAFINCESLLRPAGVRPPFFADGAAFVPADFLLAAQRAFINSDSFLRPAAVSAPFFLVGAGLVPPFSLAQRALAAAESLARVEGEK
Ga0310811_103932101 | 3300033475 | Soil | VLGSPRQPYLKAKSYRQAPLPLLLLNYLRPEALRRAAQRAFINCESLLRPAAVSPPFFAEPAAFVPPDFLLAAQRAFISSDSFLRPAAVSAPFFLVGAGFAPPFSLAQRALAADE
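
Some sequences in the table end in `*` and others do not. In translated-protein output a trailing `*` conventionally marks a stop codon, but the page does not define it, so that reading is an assumption here. A minimal sketch for normalizing sequence strings before computing lengths (the helper names are hypothetical):

```python
def clean_sequence(seq: str) -> str:
    """Strip surrounding whitespace and a single trailing '*', if present."""
    seq = seq.strip()
    return seq[:-1] if seq.endswith("*") else seq


def has_terminal_marker(seq: str) -> bool:
    """True when the raw sequence string ends in '*' (assumed stop-codon marker)."""
    return seq.strip().endswith("*")


# Two sequences copied from the table, one with and one without the marker.
rows = {
    "Ga0137377_114675121": (
        "RAAQRAFINCDSLLRPAGVSPPFFAEGAAFVPAPFLLAAQRAFISSESFLRPAGVSAPFF"
        "LAGAGFVLPFNLAQRALAAAASFARVEAEK*"
    ),
    "Ga0257163_10799161": (
        "AQRAFINCDSLLRPAGVRPPFFAEGAAFFVPAVFLLAAQRAFMSSESFLRPAGVSAPFFP"
        "VAVGFVPPFSLAQRALAAAASLARVEGEK"
    ),
}

for protein_id, raw in rows.items():
    residues = clean_sequence(raw)
    # Report the residue count without the marker, and whether the marker was present.
    print(protein_id, len(residues), has_terminal_marker(raw))
```

Counting residues on the cleaned string keeps the `*` from inflating lengths by one.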




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"