NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F069381

Metagenome / Metatranscriptome Family F069381

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F069381
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 207 residues
Representative Sequence MTRFRVRRGQVVAVLLLLAFGVAAALGIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPKPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPLHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTAVGFEAKPAALRPLVAEATEKFALLPRPL
Number of Associated Samples 101
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction Yes
3D model pTM-score0.69

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(34.677 % of family members)
Environment Ontology (ENVO) Unclassified
(33.871 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(50.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 48.12%    β-sheet: 15.48%    Coil/Unstructured: 36.40%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.69
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 124 Family Scaffolds
PF06496DUF1097 17.74
PF02585PIG-L 6.45
PF00903Glyoxalase 1.61
PF07884VKOR 0.81
PF00171Aldedh 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 124 Family Scaffolds
COG2120N-acetylglucosaminyl deacetylase, LmbE familyCarbohydrate transport and metabolism [G] 6.45
COG0014Gamma-glutamyl phosphate reductaseAmino acid transport and metabolism [E] 0.81
COG1012Acyl-CoA reductase or other NAD-dependent aldehyde dehydrogenaseLipid transport and metabolism [I] 0.81
COG4230Delta 1-pyrroline-5-carboxylate dehydrogenaseAmino acid transport and metabolism [E] 0.81
COG4243Vitamin K epoxide reductase (VKOR) family protein, predicted involvement in disulfide bond formationGeneral function prediction only [R] 0.81


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil34.68%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil20.97%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere11.29%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil9.68%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.23%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment2.42%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.42%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.42%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.61%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.61%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.61%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.61%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.81%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.81%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.81%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.81%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.81%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.81%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.81%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.81%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005526Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1EnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018064Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300022195Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300024246Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK21EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027633Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033812Sediment microbial communities from East River floodplain, Colorado, United States - 65_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25383J37093_1000801943300002560Grasslands SoilMTRFRVRRGAVVAVLLVLAFGVAVALAVQAYQTAGDHRAQADRVLRDYARLAAARVAIRTATDMYYAVKSPLKALQLAHETAPHHATPKPRDLHFDVMEHEFSIAPYIRFTFRMDLTTKDLQTSGEQVPAPVRTWLIDTLPVHTRTVYDTSWHMGTVLGQPGGDRRYVAYTVLRDRDG
Ga0062589_10152994113300004156SoilKADRVLRDYAALAAARVALAAAREIYFAVTPPLKALEHAHDGDKPLPDPKHLHFDPMEKEFSIAPYLRFTFSLDLKTGELKTWGQPVPPHVRQWMADTLPGHTRVVYDTSWHMGSVLGQPDGERRYVVYTVLRDADGVLRTAIGFEAKPAALRPLVEQAVDKFPLLPRPLIGSVQYDSMGSVIVTDRFGIEIYRSAAQYTSPFTARDTIGTDMGDLFA
Ga0066677_1070855413300005171SoilYGVARYHRAQADRVLRDYAALAAARVAQRSAIEIYYAVLPPLKALQHAHEMAPHARLPALKELRIDAMEHEFSLSPYIRFTFRMDFKTRRLETAGEPVPAAVRTWLIDTLPVHTRTVYDTSWNMHMGTVLGRPAGERRYVAYTVLRDSTGVLRTALGFDTKPEVLQPLVVHATEKFPLLPRPLTGGV
Ga0066673_1034838813300005175SoilMSRFRVRRGAVVALLLVLAFGVAVVLAVQAYRTAGDHRAQADRVLRDYARLGASRVARQTAVYVYYAVNAPLKTLQHAHEMAPFDAPPKPRNLHFDTMEHEFSMAPYIRFTFWMDLTTKRLKTSGEPLSAPVRKWLLDTLPIHLRTVYDTSWHMGTILGQPGGDRRYVGYTAVRDRDGVLRTALGFEVNPTALTPLIVQATDTQRFALLP
Ga0066676_1000333013300005186SoilMTRFRVRRGAVVAVLLVLAFGVAAALAVQAYRTARYHRAQADRVLRDYARLAASRVAVRSATDIYYAVVPPLKALKHAHEMAPHKALPVPKDLHFDTMEHEFSLAPYIRFTFRMDLKSGELTTSGQKLPTAVRRWLIDTLPLHTRTVYDTSWHMGTVLGQPKGDRRYVAYTVLRDSDGVLRTALGFEANPTALTPLVIQATDTQKFALLPRPLTSGDSSDPSRRRVQYDSMGSIIITDSYGVDIYRSAVQYTSPFTARDTIGTDMGALFAH
Ga0066676_1010538823300005186SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHELAPHTPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLETGALTTSGQTVPERVRRWLVDTLPQHTRAVYDTSWHMGTVLGQLEATGATSRTRCCVTPMGCCAPRSGSKRSRPR
Ga0066676_1067333413300005186SoilRFRVRRGQVVAVLLLLAFGVAVALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPRHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALGFEAKPAALRPLVAEATEKFALLPRPLTGGVQYDSMGSIIITDRYGVEIYRS
Ga0066675_1034538013300005187SoilMSRLRVRRGAVVALLLVLAFGVAVVLAVQAYRTAGDHRAQADRVLRDYARLGASRVARQTAVYVYYAVNAPLKTLQHAHEMAPFDAPPKPRNLHFDTMEHEFSMAPYIRFTFWMDLTTKRLETSGEPLPAPVRKWLLDTLPIHLRTVYDTSWHMGTILGQPGGDRRYVGYTAVRDR
Ga0066682_1034654613300005450SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHELAPHTPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLETGALTTSGQTVPERVRRWLVDTLPQHTRAVYDTSWHMGTVLG
Ga0066681_1037899513300005451SoilMEGWGGNLQSMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHEIAPDKPLPRPKDLHFDMMEHEFSLAPYIRFTFRMNLETGALTTSGQTVPERVRRWLVDTLPQHTRVVYDTSWHMGTVLGQLGGDRRYIAYTVLRDADGMLRTALGFEAKPAALRP
Ga0066687_1023122413300005454SoilMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYRTAGDHRAQADRVLRDYARLAAARVAQWTATYIYYAVKPPLKALQHAHEMAPHKPLPVPKDLHFDIMEHEFSIAPFIRFTFRMDLKTGQLQTSGQTLPPVIRRWLSDTLPVHTRTVYDTSWHMGTVLGQPEGDRRYIAYTVLRDSDGTLRTALGFEAKPAALTPLVVQATDTQKFALLPRPLTGGVQYDSMGSIIIVDSYGVEIYRSAAQYKSPFTARDTIGTDMGALFAWATLRESMADKLIIGG
Ga0073909_1070234413300005526Surface SoilVTATLAVQAYTNARHHRAQADRVLHDYAALAAARVANAAAREIYWAVTPPLKVLQHAHEAARGKPLPDPKHLHFDPMEKEFSIAPYIRFTFSMDLKTGELKTWGEQVPANVRQWMTDTLPVHTRVVYDSSWHMGSVLGQPDGERRYVVYTVLRDGDGSLRTAIGFETK
Ga0070696_10097211313300005546Corn, Switchgrass And Miscanthus RhizosphereAAALGIQAYGVSRYHRRQADRVLRDYAALAAARVAERSRVAIYYALTPPLKAIQHAHEMSPGKLPAPNELHVDAMERVFSLTPYIRFTFRMDLVNNRLETAGELLPATVRAWIVDTLPLHARTVYDTAWHMGDMGTILGQPAGDRRYLAYKLLRDKNGAAQTALGFETKPAALRPLVIQATERNFELLPRPLTGGVQYDSMGSIIITDRFGVEIYRTAVQYTSPFTARDTIGTDM
Ga0066701_1078807813300005552SoilLAVQAYRTAGDHRAQADRVLRDYARLAAARVAIRTATDIYYAVRPPLKALQHAHETAPHKAPPKPKDLHFDTMEHEFSIAPYIRFTFWMDLKNKHLKTSGQELPPPVRKWLVDTLPLHTRTVYDTSWHMGTVLGQPDGDRRYVAYTVLRDSDGVLRTALGFEVKPIVLKPLVVQATEKFELLPRPLTGG
Ga0066707_1091779713300005556SoilRAQADRVLRDYASLAAARVAQRSAILIYYAVDAPLKALQHAYEKAPHKPLPRPGDLHFDVMEHEFSLAPYIRFTFRLDLKTGALQTSGHELPNAVRKWLIDTLPVHTRTVYDTSWHMGSVLGQPAGGWDRRYVVYTVLRNSEGTLRTALGFEAKPAALRPLVAQVTEKFPLLPRPLTG
Ga0066698_1052923113300005558SoilMTRFRVRRGAVVAVLLVLAFGVAVALAVQAYQTAGDHRAQADRVLRDYARLAAARVAIRTATDMYYAVKSPLKALQLAHETAPHHATPKPRDLHFDVMEHEFSIAPYIRFTFRMDLTTKDLKTSGEQVPAPVRTWLIDTLPVHTRTVYDTSWHMGTVLGQPGGDRRYVAYTVLRDRDGVLRTALGFEANPTALTPLIVQATDTQKFALLPRPLTGGVGYDSMGSIIITDSYGVDVYRSAVQYTSPFTARDTIGTDMGALFAH
Ga0066698_1070452513300005558SoilVLLVLAFAVAAALGVQAYETARYHRAQADRVLRDYAALAAARVAQRSAQEIYYAVVPPLKALQHAHEMAPHQPLPRPADLHFDVMEHEFSLAPYIRFTFRMDLKTRQLQTSGQALPPFVRQWLLDTLPVHTRAVYDSSWHMGTVLGQPGGDWDRRYVAYTVLRDAEGTLRTALGFEAKPAALRPLVVQATEKFPLLPRPLTGGVQYDSMGSIIITDRYGVE
Ga0066698_1086397313300005558SoilQAYETARYHRAQADRVLRDYAALAAARVAQICAREIYYAVIPPLKALQHAQGMAPHKSLPRPADLHFDVMEHEFSLAPFLRFTFRMDLKARVLQTSGQPVPPPVRKWLIDTLPVHTRTVYDTSWHMGSILGQPDWDRRYVVYTVLRDSDGSLRTAIGFEAKPAALRPLVVQATEKFPLLPRPLTGGVQYDSMG
Ga0066694_1021557013300005574SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPRHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALGFEAKPAALRPLVAEATEKFALLPRPLTGGV
Ga0066694_1024766313300005574SoilMEGWGGNLQSMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHEIAPDKPLPRPKDLHFDMMEHEFSLAPYIRFTFRMNLETGALTTSGQTVPERVRRWLVDTLPQHTRVVYDTSWHMGTVLGQLGGDRRYIAYTVLRDADGMLRTALGF
Ga0066708_1098188213300005576SoilRAQADRVLRDYARLAAARVAQWTATYIYYAVKPPLKALQHAHEIAPHKPLPVPKDLHFDIMEHEFSIAPFIRFTFRMDLKTGQLQTSGQALPPVIQRWLSDTLPVHTRTVYDTSWHMGTVLGQPEGDRRYVAYTVLRDSDGTLRTALGFEAKPAALTPLVVQATDTQKFALLPRP
Ga0066651_1007468133300006031SoilMIRFRVRRGAVVALLLVLAFGVAVALAIQAYQTAGDHRAQADRVLRDYAHLAASRVAMRSATDIYFAVTPPLKALQRAHKALPNAAPPKPKDLHFDMMEHEFSLAPYIRFTFRMDLKTRELTTSGQAPPRAVRKWLIDTLPIHARTVYDTLWSANMGTVLGQPQGDRRYVAYTLLRDSDGALRTALGFEVKPMALRPLLIQATDTQKLALLPPP
Ga0066652_10000902613300006046SoilMTRFRVRRGAVVALLLVLAFGVAVALAIQVYQTAGDHRAQADRVLRDYARLAAARVARQSAIGMYYAVTPPLKALQHAHERAPHHGPPKPRDLHFDVMENDFSIAPYIRFTFRMDLMTKQLQTAGEVPVPVRNWLIDTLPIHTRAVYDTSWHMGTVLGQPGGDRRYVAYTVLRDRHGVLRTALGFESNPSAFRPIVVQATDTQKFALLPRPLTGGIGYDSMGSFVITDRYGVELYSSGVQYTSPFTARDTIGTDLGDLYAQATLR
Ga0070715_1051044213300006163Corn, Switchgrass And Miscanthus RhizosphereSMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHEMAPHKPLPSPANLHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQKVPDRVRRWLVDTLPLHTRTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDADGVLRTAIGLEAKPAALRPLVAEATDKFALLPRPLTGGVQYDSMGSIIITDRY
Ga0066653_1000781943300006791SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPRHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALGFEAKPAALRPLVAEATEKFALLPRPLTGGVQYDSMGSII
Ga0075421_10018392613300006845Populus RhizosphereMTSFPTSARFRVRRGAVVAVLLLLAFGVAAALGIQAYGVARYHRAQADRVLRDYAALAASRVAVRSASEIYYAVTPPLKSIKHAHEMVPGKLPRPSDLGGDAMEREFSLTPYIRFTFRMDLRDKKLETAGARLPPTVRSWLIDTLPTHTRVVYDTSWHMGTVLGQPGGERRYLVYTVLRDEEGTLRTALGFEAKPEALRPLVIQAAKKYPLLPRPLTGGVQYDSMGSFIITDRYGVEIYRTAVQYTSP
Ga0075425_10309063113300006854Populus RhizosphereVLLVLAFGVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHEMAPHQPLPSPANLHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQKVPDRVRRWLVDTLPLHTRTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDADGMLR
Ga0075426_1002856313300006903Populus RhizosphereMSLLDRIPRFRVRRAAVVAVLLVLTFGVAAVLGVQAYGVARYHRAQADRVLRDYAKLAAARVAQRSAIEIYEAVMPRLKAIQHAHEMSPGKLPAPSELRVDAMEGSFSLTPYVRFTFQMDLKEGRLETAGELLPDAVRSWIIDTLPIHTRAVYDTSWHMGTVLGQPGGERRYVAYTVLRDSKGVLRTALGFEAKPEALRPLVIQATEKFPLLPRPLTGGVQYDSMGSIIISDRYGVEIYR
Ga0075426_1027945423300006903Populus RhizosphereMARFRVRRGAVVAVLLVMAFAVAAALGVQAYETAHDHRAQADRVLRDYASLAAARVAQRSAQEIYYAVVPPLKALQHANEIGPHAALPKPADLHFDVMEHEFSIAPYIRLTFRLDLKTGELQSSGKPVSPVVRKWLLDTLPAHTRAVYDTSWHMGTILGQPDWQRRYIVYTVLRDRDGALRTVLGYEANPAALQPLVVQATEKFPLLPRPLTGGVLYDSMGSIIITDRYGVEIYR
Ga0075424_10001783083300006904Populus RhizosphereMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAYEVAPHKPLPAPKELHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQTVPDRVRRWLVDTLPAHTRTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDADGMLRTAIGVEAKPVALRPLVAEATDKFPLLPRPLTGGIQYDSMGSIIITDRYGVEIYRSAVQYTSPYTGSDTVGTDMGDLAAQTTLRASMADKLII
Ga0075424_10010051613300006904Populus RhizosphereMSRFPVRRGPVVAVLLVLAFGVAVALAVQAYQMAGNYRAQADRVLRDYARLAASRVARQSATFVYWAVNPPITALQHSYETAPHAPPPQPQHLHVDMMDHELSVVPSIRFTFRMDLKTKELQTTGQQVPVSVRKWLTDTLPIHARSVFDTSWHMGTVLGQPDGDRRYVGYTILPDRDGVLRTALGFEMKP
Ga0075424_10113958413300006904Populus RhizosphereRFRVRRGAVVAVLLVLAFGVAVALAIQAYQTAGDHRAQADRVLRDYARLAAARVAVRTATDMYYAVKSPLKSLEHAHETAPRSAPPKPRDLHFDTMEHEFSIAPYIRFTFRMDLTTKDLKTSGEKVPAQVRKWLIDTLPAHTRTVYDTSWHMGTVLGQPGGDRRYVGYTVLRDPDGVLRTALGFEANPTALTPLIVQATDTQKFALLPRPLTGGVGYDSMGSIIITDSYGVDVYRSAVQYTSPFTARDTIGTDMGAFFAHATLRAEVADQLIIGGL
Ga0075436_10011831013300006914Populus RhizosphereMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAYEVAPHKPLPAPKELHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQTVPDRVRRWLVDTLPAHTRTVYDTSWHMGTVLGQPGGD
Ga0075436_10022741323300006914Populus RhizosphereMERRGGNLQSMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAYEMAPHKPLPAPKDLHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGPNVPDRVRRWLVDTLPPHTRTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDADGMLRTAIGVEAKPVALRPLVAEATDKFPLLPRPLTGGIQYD
Ga0075436_10094562513300006914Populus RhizosphereMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAYEMAPHKPLPAPKDLHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQTVPDRVRRWLVDTLPAHTRTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDADGMLRTAIG
Ga0075435_10028866723300007076Populus RhizosphereMERRGGNLQSMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAYEMAPHKPLPAPKDLHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGPNVPDRVRRWLVDTLPPHTRTVYDTSWHMGTVLGQPGGDR
Ga0075435_10031087013300007076Populus RhizosphereMSRFPVRRGPVVAVLLVLAFGVAVALAVQAYQMAGNYRAQADRVLRDYARLAASRVARQSATFVYWAVNPPITALQHSYETAPHAPPPQPQHLHVDMMDHELSVVPSIRFTFRMDLKTKELQTTGQKVPVSVRKWLTDTLPVHARSVFDTSWHMGTVLGQPDGDRRYVGYTILPDRDGVLRTALGFEMKPAALTPLVVQATDTQKFALLPRPLTGG
Ga0099791_1011619223300007255Vadose Zone SoilMSRLDRIPRFRVRRGAVIAVLLVLVFAVTATPAVQAYTNARHHRAQADRVLRDYAALAAARVAQTAAREIYWAVTPPLKALEHAHEAAPGKPLPEPKHLHFDANMEREFSLAPYIRFTFSMDLKSGQLQTWGQQVPPQVRHWMSDTLPAHTRVVYDTSWHMGTVLGQPGGERRYIVY
Ga0099794_1007737213300007265Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHIQEMAPHKPLPRPKDLHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQTVPERVRRWLVDTLPPHTRAVYDTSWHMGTVLGQPGGDRRYLAYTV
Ga0099829_10000229383300009038Vadose Zone SoilMILRVRRGAVVGTLLVLAFAVAVVLAVQAYGVARYHRAQADRVLRDYAALAAARVAQRSAIQIYDAVSPPLKAVQHAYEMAPHAPPPAPKDLRVDEMEHEFSLKPYIRFTFRMDFKTHRLQTAGAPLPAAVRTWLIDTLPVHTRTVYDTSWHMGTVLGRPAGERRYVAYTVLRDSTGVLRTALVFDVKPEVLRPLVVHATEKLPLLPRPLTGGVQYDSMGSIII
Ga0066709_10127656313300009137Grasslands SoilMTRFRVGRGAVVAVLLVLAFGVAVALAVQAYRTAGDHRAQADRVLRDYARLAAARVAIRTATDIYYAVRPPLKALQHAHETAPHKAPPKPKDLHFDTMEHEFSIAPYIRFTFWMDLKNKHLKTSGQELPPPVRKWLVDTLPLHTRTVYDTSWHMGTVLGQPDGDRRYVAYTVLRDSDGVLRTALGFEVKPIVLKPLVVQATEKFELLPRPL
Ga0099792_1087394013300009143Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPALKALQHAHELAPHQPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLATGALTTSGQTVPERVRRWLVDTLPQHTRAVYDTSWHMGTVLGQLGGDRRYIAYTVLRDADGMLRTAIGLEAKPA
Ga0134070_1000246313300010301Grasslands SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPRHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALGFE
Ga0134109_1036732513300010320Grasslands SoilMSRFRVRRAAVVAVLLVLVFGLAVALAIQAYQMAGNYRAQADRVLRDYARLAASRVARQSATYVYWAVNPPLTALQHAYEMAPHAPPPRPRDVHVDMMEHELSVVPAIRFTFRMDLMTKDLQTTGEKLPVSVRQWLVDTLPVHARTVYDTSWHMGTVLGQPE
Ga0134065_1025483113300010326Grasslands SoilGAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLSKRVQRWLVDTLPRHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALGFEAKPAALRPLVAEATEKFALLPRPLTGGVQYDSMGSIIITDRYGVEIYRSAVQ
Ga0134111_1004584613300010329Grasslands SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHNPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPRHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALGFEAKPAALRPLVAEATEKFALLPRPLTGGVQYDSMGSIIITDRYGVEIYRSAVQYTSPFTGSDTVGTDMGDLYAQTTLRASMADKLIIG
Ga0134111_1036370613300010329Grasslands SoilVVAVLLVLAFGVAAALGIQAYRTARYHRAQADRVLSDYAALAAARVAQRSAYEIYWAVMPPLKAIQHAHEVAPHKPLPTPGALHFDAMEREFSLAPYIRFTFRMDLKSGRLEPSGPTIPSSVRAWLVDTLPVHARTIYDTSWHMGTVLGQPGGDRRYVAYTLLRDSQGTLRTALGFEAKPAALRPLVIQATEKFPLLPRPLTGG
Ga0134071_1000248173300010336Grasslands SoilMTRFRVGRGAVVAVLLVLAFGVAAALAVQAYRTAGDHRAQADRVLRDYARLAAARVAIRTATDIYYAVRPPLKALQHAHETAPHKAPPKPKDLHFDTMEHEFSIAPYIRFTFWMDLKNKHLKTSGQELPPPVRKWLVDTLPLHTRTVYDTSWHMGTVLGQPDGDRRYVAYTVLRDKDGTLPTALGFEAKPTALTPLVVQATDTQKFALLPRPLTGGVQYDSMGSMIITDRYGVEIYRSPV
Ga0126377_1138388013300010362Tropical Forest SoilMRLRDRLPSSRFRVRRTAVVAVLLVLSLGVAAVLGIQAYGVARYHRAQADRVLRDYAALAAARVAERTRVYIYWAVAPPLKAIKHATEKALGKLPAPSEVRVDVMEEEFSLTPYIRFTFRMDLKDKRLETAGEPLPAAVRSWLIDTLPTHARAVYDTLWKPNMGTVLGQPGGERRYVAYTLLRDSKGVL
Ga0137389_1150995113300012096Vadose Zone SoilAALGVQAYRTAGDHRAQADRVLRDYARLAAARVALRSATDLYYAVMPPLKAVQHAYEMAPHKAPPLPKDLHFETMEHQLSLAPYIRFTFRMDLKSGQLATSGQELPPAVRRWLIDTLPVHTRAVYDTSWHMGTVLGQPSGDRRYVAYTVLRDSDGVLRTALGFEANPTLLKPLVVQATDAQKFALLPRP
Ga0137364_1017141123300012198Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVMPPLKALQHANELAPHKPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLETGALTTSGQTVPERVRRWLVDTLPQHTRAVYDTSWHMG
Ga0137382_1077883713300012200Vadose Zone SoilMTRFRVRRGAIVAVLLVLSFGVAVALAIQAYETAGDHRAQADRVLRDYARLAAARVALRTAQGIYYAVTPPLKSLQHAREVAPHDAPPKPQDLHFDVMEDEFSIAPYIRFTFWMDLKTKELKTSGQALPPNVRQWLIDTLPLHARFIYDTAWGGHMGTVLGQPGGDRRYIAYALLRDKDGTRRTALGFEAKPAALRPLVAEATEKFALL
Ga0137365_1013874523300012201Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHELAPHKPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLATGALTTSGQTVPERVRRWLVDTLPQHTRAVYDTSWHMGTVLGQPGGDRRYIAYTVLRDADGMLRTAIGLEAKPAALRPLVAEATDKFPLLPRPLTGGVQYDSMGSIIITDRYGVEIYRSAVQYTSPFTG
Ga0137365_1024275213300012201Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHEMAPHRPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLATGALTTSGQTVPERVRRWLVDTLPQHTRAVYDTSWHMGTVLGQLGGDRRYIASTVVRDAD
Ga0137363_1058090913300012202Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHEMAPHQPLPRPKDLHFDMMEHEFSLAPYIRFTFRMDLQTGALTTYGQTVPERVRRWLVDTLPQHTRAVYDTSWHMGTVLGQLGGDRRYIAYTVLRDAAGV
Ga0137399_1101545613300012203Vadose Zone SoilRGAVVAVLLLLAFGVAAALGVQASRTAGDHRDQADRVLRDYAKLAAARVGQRSATDIYYAVMPPLKALQHTYERNPHKPPPGPRDLHFETMEHAFSLAPYIRFTFRMDLKTGQLMLAGQAPKIVRRWLTDTLPVHTHTVYDTSWHMGTVLGRPAGDRRYVVYTVLRDADGVLRTALGFEAKPTALEPLLVEATSTAKFALLPRPLTGGVQYDSMGSIIITDSYGVEIYRSPV
Ga0137362_1027729013300012205Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHIQEMAPHKPLPRPKDLHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQTVPERVRRWLVDTLPLHTRTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALGFEAKPAALLSLVAEATEKFALLPRPLTGGVQ
Ga0137381_1067037913300012207Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHEMAPHKPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLETGALTTSGQTVPERVRRWLVDTLPQHTRAVYDSSWHMGTVLGQLGGDRRYIAYTVLRGADGMLRTAIGLEAKP
Ga0137376_1018317413300012208Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAVALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGVLTTSGQTLPKRVRHWLVDTLPLHTRTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVL
Ga0137376_1073705523300012208Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHELAPHKPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLETGALTTSGQTVPERVRRWLVDTLPQHTRAVYDTSWHMGTVLGQLGGD
Ga0137376_1090686913300012208Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLSKRVQRWLVDTLPRHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALGFEAKPAALRPLVAEATEKFALLPRPLTGGVQYDS
Ga0137376_1090703213300012208Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPRHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSGGVLRTALGFEAKPAALRPLVAEATEKFALLPRPLTGGVQYDS
Ga0137378_1147145013300012210Vadose Zone SoilHRAQADRVLRDYAALAAARVAQRSATEIYWAVTPPLKAIQHAHEMAPEKPPQPPAMLHFDAMEHEFSLAPFIRFTFRMDLKSGKLETSGQTVPPSISAWLIDTLPVHTRTVYDTSWHMGTVLGQPGGDRRYVAYTVLRDANGMLRTALGFEAKPAALRPLVVQAMEKTPLLPRPLTGPVIFTSDASTGEPALAVVLA
Ga0137377_1073717313300012211Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAVALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGVLTTSGQTLPKRVRHWLVDTLPLHTRTVYDTSWHMGTVLGQPGGDRRYVAYTVLRDADGMLRTALGFEAKPAALRPLVAEATEKFALLPRPLT
Ga0137387_1109818413300012349Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYRTARYHRDQADRVLRDYARLAASRVAIRSATDIYYAVMPPLKAVKQAHEMAPHQAPPAPKDLHFDTMEHEFSLRPYIRFTFRMDLKSGQLVTAGQEVPPAVRRWLVDTLPVHTRTVYDTSWHMGTV
Ga0137386_1009357323300012351Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYRTARYHRDQADRVLRDYARLAASRVAIRSATDIYYAVMPPLKAVKQAHEMAPHQAPPAPKDLHFDTMEHEFSLRPYIRFTFRMDLKSGRLVTAGQQVPPAVRRWLIDTLPVHTRIVYDTSWHMGTVLGQPHGDRRYIAYTVLRDSDGLLRTALGFEAKPSALT
Ga0137367_1063599713300012353Vadose Zone SoilMTRFRLRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHEMAPHRPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLETGALTTSGQTVPERVRRWLVDTLPQHTRAVYDTSWHMGTVLGQLGGDRRYI
Ga0137369_1034787013300012355Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFGVAAALAVQAYGTARYHRAQADRVLRDYARLAAARVALRTATDIYYAVSPPLKALQHAHETAPHKAPPKPRDLHFDTMERDFSIAPYIRFTFWMDLKNKHLQTSGQELPPRVRKWLVDTLPLHTRTVYDTSWHMGTVLGQPDGDRRYVAYTVLRDKDGTLRSALGFEAKPTALTPLVVQATDTQKFALLPRPLTGGVQYDS
Ga0137369_1045882413300012355Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAVALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPLHTRTVYDTSWHMGT
Ga0137384_1044766413300012357Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHELAPHKPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLATGALTTSGQTVPERVRRWLVDTLPQHTRAVYDTSWHMGTVLGQLGGDRRYIAYTVLRDADGMLRTAIGLEAKPAALRPLVAEATDKFPLLPRPLTGGVQYDSMGSIIITDRFGVEIYRSAVQYTSPFTGSDTVGTD
Ga0137368_1003922113300012358Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFGVAAALAVQAYGTARYHRAQADRVLRDYARLAAARVALRTATDIYYAVSPPLKALQHAHETAPHKAPPKPRDLHFDTMERDFSIAPYIRFTFWMDLKNKHLQTSGQELPPRVRKWLVDTLPLHTRTVYDTSWHMGTVLGQPDGDRRYVAYTVLRDKDGTLRSALGFEAKPTALTPLVVQATDTQK
Ga0137375_1068887513300012360Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAVALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPLHTRTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDADGVLRTALGFEAKPAALRPLVAEATEKFALLPRPLTGGVQYDSMGSIIITDRFGVEIYSSAVQYTSPFTGSD
Ga0137390_1110086713300012363Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFGVAVALAVQAYQTAGDHRAQADRVLRDYARLAAARVAIRTATDMYYAVKSPLKALQLAHATAPHHAPPKPRDLHFDVMEHEFSIAPYIRFTFRMDLTTKDLQTSGEQIPAPVRKWLIDTLPVHTHTVYDTSWHMGTVLGQPGGDRRYVAYTVLRDRDG
Ga0137398_1068614223300012683Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLADTLPLHTRTVYDTSWHMGTVLGQPGGDRRYIAYTVLRD
Ga0137395_1129078913300012917Vadose Zone SoilGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHEIAPHKPLPRPKDLHFDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPERVRRWLVDTLPSHTRAVYDTSWHMGTVLGQPGGDRRYIAYTVLRDADGMLRTAVGLEAKPAALR
Ga0137396_1009938813300012918Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPLHTLTVYATSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALGFEAKPAALWSLVAEATEKFALLPRPLTGGVQYDSMGSIIITDRYGV
Ga0137394_1004365913300012922Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPLHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALG
Ga0137394_1122568313300012922Vadose Zone SoilTARYHRAQADRVLRDYAALAAARVAQRSAIEIYYAVSPPLKAIQHAHEMAPRKPLPPPGTLHFDAMEHEFSLAPYIRFTFRMDLKSGALETSGPTVPPSIRVWIVDILPVHTRTVYDTSWHMGTVLGQPGGDRRYVAYTVLRDSQGTLRTALGFEAKPAALRPLVIQATEKFPLLPRPLTGGVQYDSMGSVIITDRWGIEIYRS
Ga0137359_1007907033300012923Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHIQEMAPHKPLPRPKDLHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPPHTRAVYDTSWHMGTVLGQPGGDRRYLAYTVLRDADGMLRTAVGLEAKPAALRPLVAEA
Ga0137413_1041997723300012924Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPLHTLTVYDTSWHMG
Ga0137419_1145969913300012925Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPLHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTAL
Ga0137416_1090844113300012927Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYRTAGDHRAQADRVLRDYARLAASRVAVWSATRIYYAVNPPLKALQHAHEKAPHKPLPVPKDLHFDTMEHEFSIAPYIRFTFRMDLESGKLQTSGQEVPPVVRHWLVDTLPVHTRTVYDTSWHMGTVLGQPNGDRRYVAYTVLRDSDGVLRTALGFETKPTVLTPLVVEATD
Ga0134079_1002671723300014166Grasslands SoilMALRVRRGAVVATLLVLALVVAGMLAVQAYGVARYHRAQADRVLRDYAALAAARVAQRSASELYYAIIPNLKAIQHAHEMAPRGPLPLPRAIHVDIMEHEIPLTPSIRFTFRLDLPSGRLETAGEPVPNTVRTWLIDTLPIHTRTVYDTSWHMG
Ga0134079_1003940523300014166Grasslands SoilMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYRTAGDHLAQADRVLRDYARLAAARVAQWTATYIYYAVKPPLKALQHAHEMAPHKLLPVPKDLHFDIMEHEFSIAPFIRFTFRMDLKTGQLQTSGQALPPVIQRWLSDTLPVHTRTVYDTSWHMGTVLGQPEGDRRYVAYTVLRDSDGTLRTALGFEAKPAALTPLVVQATDTQKFALLPRPLTGGVQYDSMGSIIIVDSYGVEIYRSAAQYKSPFTARDTIGTDMGALFAWATLRESMADKLII
Ga0137405_139807113300015053Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPLHTRTVYDTSWHMGTVLGQPGGDRRGTSRTRCCATQTGCCAPRSASKRSPPRS
Ga0134089_1011411123300015358Grasslands SoilMTRFRVGRGAVVAVLLVLAFGVAVALAVQAYRTAGDHRAQADRVLRDYARLAAARVAIRTATDIYYAVRPPLKALQHAHETAPHKAPPKPKDLHFDTMEHEFSIAPYIRFTFWMDLKNKHLKTSGQELPPPVRKWLVDTLPLHTRTVYDTSWHMGTVLGQPDGDRRYVAYTVLRDSDGVLRTALGFEVKPIVLKPLVVQATEKFELLPRPLTGGVQYDSMGSIIITDRYGVEIYRSAVQYTSPFTARDTIGTDMGG
Ga0134089_1026198913300015358Grasslands SoilMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYRTARYHRNQADRVLRDYARLAASRVAIRSATDIYYAVIPPLKAVKHAHEMAPHKAPPAPKELHFDTMEHEFSLRPYIRFTFRMDLKSGQLITAGQEVPPAVRRWLVDTLPVHTRIVYDTSWHMGTVLGQPHGDRRYIAYTVLRDSDGVLRTALGFEAKPTALTPL
Ga0134089_1040879213300015358Grasslands SoilSMTRFRVRRGAVMAALLVLTFGVAATLGVQAYRTARYHRDQADRVLRDYAKLAAARLGQRSATDIYYAVMPPLKALQHTYEKNPHKPPPGPRDLHFETMEHAFSLAPYIRFTFRMDLKTGQLTLAGQAPPTVRHWLIDTLPVHTRTVYDTSWHMGTVLGQPNSDRRYIAYTVLRDSNGLLRTALGFEAKPAAL
Ga0134085_1010362113300015359Grasslands SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAHELAPHTPLPRPKDLHFDMMEHEFSLAPYIRFTFRLDLETGALTTSGQTVPERVRRWLVDTLPQHTRAVYDTSWHMGTVLGQLGGDRRYIAYTVLRDADGMLRTAIGLEAKPAALRPLVAEATDKFPLLPRPLTGGVQYDSMGSIIITDRYGVEI
Ga0132256_10224128913300015372Arabidopsis RhizosphereLAIQAYTNARHHRAQADRVLRDYAALAAARVALAAAREIYFAVTPPLKALEHAHEGEKPLPDPKHLHFDPMEKEFSIAPYIRFTFSLDLKTGELKTWGQQVPPHVRQWMADTLPGHTRVVYDTSWHMGSVLGQPDGERRYVVYTVLRDADGVLRTAIGFEAKPVALRPLVEQAVDKFPLLPRPLIGWASFLGTLVTLGFVVALIADFDTGTAGLQH
Ga0184610_108792913300017997Groundwater SedimentMKLFPTSTRFRVRRGAVIAVLLVLVFVVAAALAVQAYGNARYHRAQADRVLRDYAALAAARVAQRSATEIYYAVIPPLKAIQHAHEAAPHKALPGPKDLHFDPMEREFSLAPYIRYTFRMDLKSGQLKTSGQAVPWAVHKWMVDTLPAHTRTVYDTSWHMGSVLGQPGGERRYVVYTVLRDSDGALRTALGFEAKPAALRPLVEQGTQKFPLLPRPLTGGVQYDSMGSVIITDRYGVEIYRSAVQYSSPFTARDTIGTDMGDLYAQ
Ga0184623_1000903743300018056Groundwater SedimentMRFSPSTWRFRVRRGAVVAALLVLAFGVAVMLAIQAYGVARYHRAQADRVLRDYAALAAARVAQRSASEIYYAIIPPLKAIQRAHEKAPRTPLPRPAALRVEVMDRDVSLTPYIRFTFRLDLPTGRLETSGKSVPEPVRAWLTDTLPVHTRTVYDTSWHMGAVLGQPGGDRRYVAYTVLRDSTGALRTALGFEAQPEGLRPLVVHATEKFPLLPRPLTGGVQYDSMGSIIITDRYGVELYRSAVQYSSPFSARDTVGTDMGDLYAH
Ga0187773_1097642913300018064Tropical PeatlandRQSATYVYWAVNPPLTALQHAYETAPHAPPPRPRDVHVDMMPQEHDLSVVPAIRFTFRMDLITKDLRTTGEELPVWVRKWLVDTLPVHARTVYDTSWHMGTVLGQPEGDRRYIGYTIVRDRDSVLRTALGFEMKPVALSPLVVQATDTQRFALLPRPLTGGGLSYIGGTFYLAGGSTSAYGSVN
Ga0184609_1026556023300018076Groundwater SedimentMTRFRVRRGQVVAVLLLLAFGVAAALGIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPKPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPLHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTAVGFEAKPAALRPLVAEATEKFALLPRPL
Ga0066655_1008179113300018431Grasslands SoilMTRFRVGRGAVVAVLLVLAFGVAAALAVQAYRTAGDHRAQADRVLRDYARLAAARVAIRTATDIYYAVRPPLKALQHAHETAPHKAPPKPKDLHFDTMEHEFSIAPYIRFTFWMDLKNKHLKTSGQELPPPVRKWLVDTLPLHTRTVYDTSWHMGTVLGQPDGDRRYVA
Ga0066669_1104673713300018482Grasslands SoilVARYHRAQADRVLRDYAALAAARVAQRSAQEIYYAVLAPLKAVQHAHEMAPQARLPAPKELRIDVMEHEFSLSPYIRFTFRMDFKTRRLETAGEPVPAAVRTWLIDTLPVHTRTVYDTSWHMGTVLGRPAGERRYVAYTVLRDSTGVLRTALGFDTKPEVLQPLVVHATEKFPLLPRPLTGGVQYDSIGSIIITDRYGYEIYRSAVQYTSPFTAHATVGRDLGELYAQTTLRASVADQLITGRSP
Ga0137408_122371933300019789Vadose Zone SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVRHWLVDTLPLHTRTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTALGFEAKPAALRPLVAEATEKFALLPRPLTGGVQYDSMGSIIITDRYGVEIYSSAVQYTSPFTGSDTVG
Ga0210382_1032549213300021080Groundwater SedimentMTRFRVRRGQVVAVLLLLAFGVAAALGIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPKPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLSKRVQRWLVDTLPLHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLR
Ga0179596_1047272113300021086Vadose Zone SoilMTRFRVRRGAVVAVLLVLAFAVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAHEMAPHKPLPRPKDLHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQTVPERVRRWLVDTLPPHTRAVYDTSWHMGTVLGQPGGDRRYL
Ga0222625_157872113300022195Groundwater SedimentMTRFRVRRGQVVAVLLLLAFGVAAALGIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPKPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPLHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTAVGFEAKPAALRPLVAEATEKFALLPRPLTGGVQYDSMGSIIITDRYGVEIYSSAVQYTSPFTGSDTVGTDMGDLYAQTTLRASMADKLII
Ga0224452_113000113300022534Groundwater SedimentMTRFRVRRGQVVAVLLLLAFGVAAALGIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPKPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPLHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDGVLRTAVGFEAKPAALRPLVAEATEKFALLPRPLTGGVQYDSMGSIIITDRYGVEIYSSA
Ga0222623_1001109713300022694Groundwater SedimentMTRFRVRRGQVVAVLLLLAFGVAAALGIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPKPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPLHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSD
Ga0247680_102049213300024246SoilMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVTPPLKALQHAYEMAPHKPLPAPNDLHYDMMEHEFSLAPYIRFTFRMDLKTGALTTSGQTVPDRVRRWLVDTLPAHTRTVYDTSWHMGTVLGQPGGDRRYIAY
Ga0207707_1028982913300025912Corn RhizosphereMRFFPTSTRFRVRRSAVIAVLLLLVFAVTATLAVQAYTNSRHHRAQADRVLNDYAALAAARVASQSAREIFWAVTPPLKALEQALEDAPGKPLPDPEHLRFDPMEKDFSIAPYIRFTFSMDLNTHELKTWGQQVPASVRQWMMDTLPGHTRFVYDTAWEGNMGTVLGQPGGERRYVVYRLLRDEHGSVRTAIGYEAKPAALRPLVEKAQTRPYFSLLPRPLVGNVPDDSMGSVIVTDRFGVEIYRSAAQYASPFTARDTIGTDMGD
Ga0207646_1139744113300025922Corn, Switchgrass And Miscanthus RhizosphereRHHRAQADRVLRDYAALAAARVALAAAREIYFAVTPPLKALEHAHDGDKPLPDPKHLHFDPMEKEFSIAPYIRFTFSLDLKTGELKTWGQQVPSHVRQWMADTLPGHTRVVYDTSWHMGSVLGQPDGERRYVVYTVLRDADGVLRTAIGFEAKPAALRPLVEQAVDKFPLLPRPLIGSVQYDSMGSVIVTDRFGIEIYRSAA
Ga0209468_107717923300026306SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPRHTLTVYDTSWHMGTVLGQPGGDRRYIAYTVLRDSDG
Ga0209472_109559213300026323SoilMTRFRVRRGQVVAVLLLLAFGVAAALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALTTSGQTLPKRVQRWLVDTLPRHTLTVYDTSWHMGTVLGQPGG
Ga0209470_126184113300026324SoilVVAVLLVLVFAVAAALAIQAYETARYHRAQADRVLRDYAALAAARVAQICAREIYYAVIPPLKALQHAQGMAPHKPLPRPADLHFDVMEHEFSLAPFLRFTFRMDLKARVLQTSGQPVPPPVRKWLIDTLPVHTRTVYDTSWHMGSILGQPDWDRRYVVYTVLRDS
Ga0209152_1002259633300026325SoilMTRFRVRRGAVVAVLLVLAFGVAAALGVQAYRTAGDHRAQADRVLRDYARLAASRVAVWSATRIYYAVNPPLKALQHAHEKAPHKPLPVPKDLHFDTMEHEFSIAPYIRFTFRMDLESGELQTSGEEVPPAVRHWLVDTLPVHTRTVYDTSWHMGTVLGQPNGDRRYVAYTVLRDSDGVLRTALGFETKPTVLTPLVVEATDSKKFALLPRPLT
Ga0209378_116138113300026528SoilAALSGNRPARFENLPVPQRRGILQAMTRFRVGRGAVVAVLLVLAFGVAAALAVQAYRTAGDHRAQADRVLRDYARLAAARVAIRTATDIYYAVRPPLKALQHAHETAPHKAPPKPKDLHFDTMEHEFSIAPYIRFTFWMDLKNKHLKTSGQELPPPVRKWLVDTLPLHTRTVYDTSWHMGTVLGQPDGDRRYVAYTVLRDSDGVLRTALGFEVKPIVLKPLVVQATEKFELLPRPLTGGVQYDSMGSIIITDRYGVEIYRSAVQYTSPFTARDTIGTDMGGLYAQAT
Ga0209156_1046641213300026547SoilMSRLRVRRGAVVALLLVLAFGVAVVLAVQAYRTAGDHRAQADRVLRDYARLGASRVARQTAVYVYYAVNAPLKTLQHAHEMAPFDAPPKPRNLHFDTMEHEFSMAPYIRFTFWMDLTTKRLETSGEPLPAPVRKWLLDTLPIHLRTVYDTSWHMGTILG
Ga0209577_1071829913300026552SoilLLVLAFAVAVVLAVQAYGVARYHRAQADRVLRDYAALAAARVAQRSAIEIYYAVLPPLKALQHAHEMAPHARLPALKELRIDAMEHEFSLSPYIRFTFRMDFKTRRLETAGEPVPAAVRTWLIDTLPVHTRTVYDTSWNMHMGTVLGRPAGERRYVAYTVLRDSTGVLRTALGFDTKPEVLRPLVVH
Ga0179587_1082487913300026557Vadose Zone SoilTLCAAFVRERSSGVSKPLPCGHPRENLQSMTRFRVRRGQVVAVLLLLAFGVAVALAIQAYGTARYHRAQADRVLRDYAKLAASRVALQSARGIYFAVVPPLKALQHAQERAPHKPLPEPKDLHFDTMEHEFSLAPYIRFTFRMDLKTGALMTSGQTVPERVRRWLVDTLPQHARRL
Ga0208988_112247213300027633Forest SoilVRRGAVVAVLLVLAFVVAAALGVQAYRTAGDHRDQADRVLRDYARLAASRVAIYSASGIYYAVKEPLKALQHAHEKAPHKAPPAPKDLHFDIMEHQFSIAPYIRFTFRMDLKSGQLTTSGQALPPAVRQWLIDTLPAHTRAVYDTSWHMGTVLGQPNGDRRYVAYTVLRDSDGVLRTALGFEAQPTALKPLVVQATDAQKFALLPRPLTGGV
Ga0209076_110199013300027643Vadose Zone SoilPCLHGEMLRTMKVRVVTALLVLAFGVAAVLAIDGYRVGRSHHDQADRVLRDYARLAAARVAIRTATDIYYAVSPPLKALLHAHETAPHKAPPKPKDLHFDTMEHEFSIAPYIRFTFWMDLKNKHLQTSGQELPPPVRKWLVDTLPLHTRTVYDTSWHMGTVLGQPNGDRRYIAYTVLRDKDGTLRTALGFEAKPTALTPLVVQATDTQKFALLPRPLTGGVQYDSMGSIIITDRYGVEIYRTAVQYTSPFSARDTIGTDMGDLYAQVALRE
Ga0209588_111037313300027671Vadose Zone SoilMSRFRVRRGAVVAVLLVLAFAVAVALAVQAYGTARYHRAQADRVLRDYARLAAARVALRTATDIYYAVNPPLKALKHAHETAPHRAPPKPKDLHFDTMEHEFSIAPYIRFTFWMGLKNKHLRASGQELPPPVRKWLVDTLPIHTRAVYDTSWHMGTVLGQPNGDRRYIAYTVLRDTDGVLRTALGFEANPTALTPLVVQATDTQKFALLPRPLTGGVQYDSMGSIIITDRYGVEIYRTAVQYTSPFSARDTIGTDMGDLYAQVALREAMADKLIIGG
Ga0209689_126603113300027748SoilRAQADRVLRDYAALAAARVAQRSAMEIYDAVSPPLRAIQHSHEMAPRARLPAPKELRIDAMEHEFSLMPYIRFTFRMDFKTRRLETSGEPLPAAVRSWLIDTLPVHTRTVYDTSWHMGTVLGHPGGERRYVAYTVLRDSTGVLRTALGFDTKPEVLRPLVVHATEKFPLLPRPLTGGVQYDSMGSIIITDRYGYEIYRSAVQYTSPFTARDTVGKDMGDLHAEATLRA
Ga0209382_1093689413300027909Populus RhizosphereMRLRDRLPSSRFRVRRAAVVAVLLVLSFGVAAVLGIQAYGVARYHRAQADRVLSDYAALAASRVAWRSAIAIYYAVTRPLKAIKHANEKAPGQLPAPSELRVDVMEGEFSLTPYIRFTFRMDLNDKRLETAGEQLPATVRSWLIDTLPAHTRAVYDTSWSGNMGTVLGQPGGERRYVAYTVLRDSKGILSTALGFEVKPTALRPLVDE
Ga0209382_1110451813300027909Populus RhizosphereMTSFPTSARFRVRRGAVVAVLLLLAFGVAAALGIQAYGVARYHRAQADRVLRDYAALAASRVAVRSASEIYYAVTPPLKSIKHAHEMVPGKLPRPSDLGGDAMEREFSLTPYIRFTFRMDLRDKKLETAGARLPPTVRSWLIDTLPTHTRVVYDTSWHMGTVLGQPGGERRYLVYTVLRDEEGTLRTALGFEAKPEALRPLVIQAAKKYPLLPRPLTGGVQYDSMGSFIITDRYGVEIYRTAVQYT
Ga0307469_1204953013300031720Hardwood Forest SoilLQSMARFRVRRGGVVALLLVLAFGVAAALAVQAYRTARYHRDQADRVLRDYARLAASRVAIRSATDIYYAVMPPLKAVQHAHEMAPHKAPPAPKDLHFDMMEREFSLRPYIRFTFRMDLKSGQLVTAGQDVPPAVRRWLVDTLPAHTRAVYDTSWHMGTVLGPPHGDRRYIAYTVLRDSNGMLRT
Ga0307473_1132742313300031820Hardwood Forest SoilFAVAAALGVQAYETARYHRAQADRVLRDYASLAAARVAQRSAILIYYAVDAPLKALQHAYEKAPHKPLPRPGDLHFDVMEHEFSLAPYIRFTFRLDLKTGALQTSGHELPNAVRKWLIDTLPVHTRTVYDTSWHMGSVLGQPAGGWDRRYVVYTVLRNSEGTLRTALGFEAKPAALRPL
Ga0307471_10225021513300032180Hardwood Forest SoilRGGVVALLLVLAFGVAAALAVQAYRTARYHRDQADRVLRDYARLAASRVAIRSATDIYYAVMPPLKAVQHAHEMAPHKAPPAPKDLHFDMMEREFSLRPYIRFTFRMDLKSGQLVTAGQGVPPAVRRWLVDTLPAHTRAVYDTSWHMGTVLGPPHGDRRYIAYTVLRDSTGMLRTALGFEAKPSALEPLILQATTDTQKFALLPRPLTTGDPSDPSRRRVRYDSMGSII
Ga0364926_072298_1_6903300033812SedimentLAVQAYGTARYHRAQADRVLRDYAALAAARVAQRSATEIYYAVIPPLKAIQHAHEAAPHKALPGPKDLHFDPMEREFSLAPYIRYTFRMDLKSGQLKTSGQAVPRAVHKWMVDTLPVHTRTVYDTSWHMGSVLGQPGGERRYVVYTVLRDSDGALRTALGFEAKPAALRPLVEQGTEKFPLLPQPLTGGVQYDSMGSVIITDRYGVEIYRSAVQYTSPFTARDTIGTDMG
Ga0364926_124217_3_5303300033812SedimentVQAYYTARYHRAQADRVLRDYAALAASRVAWRSAIQILDAVMPPLKALQHAYEAAPDKPLPGPKELHFDPRERKFTLAPYIRYTFLMDLKTRELNTWGMPVSPQVRQWMTDTLPVHARTVYDTLWYLGSVLGQPGGERHYIVYSQQRDRDGVMRTAIGFEAKPAALRPLVAEATED


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.