NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F070198


Overview

Basic Information
Family ID F070198
Family Type Metagenome / Metatranscriptome
Number of Sequences 123
Average Sequence Length 231 residues
Representative Sequence MSGLMKTLHRLTLSWPRAVSLAFALLCFVGSASSAHAASAGPQSWPVSSTQVTSQFAIADFDGDRRPDLATVQAGQVGSLDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDGQGNFRTSSPSAFPGAFTTSEKSWASNTDKITDATAVLLSRYPTGNCSEGSRFSSPRNVTGLLVLWASRNSHFSSVVSFLGRAPPSSVLHI
Number of Associated Samples 85
Number of Associated Scaffolds 123

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 87.50 %
% of genes near scaffold ends (potentially truncated) 0.81 %
% of genes from short scaffolds (< 2000 bps) 0.81 %
Associated GOLD sequencing projects 70
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.54

Hidden Markov Model (logo rendered by Skylign)

Most Common Taxonomy
Group Unclassified (93.496 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (40.650 % of family members)
Environment Ontology (ENVO) Unclassified (39.024 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (45.528 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 7.36%, β-sheet: 26.74%, Coil/Unstructured: 65.89%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.54

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
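
The cutoff quoted above (pTM < 0.7) and the pLDDT legend bins can be applied programmatically when screening many families. The following is a minimal sketch, not NMPFamsDB code: the function names are hypothetical, and the handling of fractional pLDDT values (assigned to the first bin whose upper bound they do not exceed) is an assumption, since the page only lists integer bin edges.

```python
from typing import List, Tuple

# Cutoff quoted on this page: models with pTM < 0.7 are flagged as low confidence.
PTM_CUTOFF = 0.7

# pLDDT bins as listed in the structure legend: (inclusive upper bound, label).
PLDDT_BINS: List[Tuple[int, str]] = [
    (50, "0-50"),
    (70, "51-70"),
    (90, "71-90"),
    (100, "91-100"),
]


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when the pTM-score falls strictly below the cutoff."""
    return ptm < cutoff


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value (0 to 100) to its legend bin label."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    for upper, label in PLDDT_BINS:
        # Assumption: a fractional value such as 50.5 belongs to the next bin up.
        if plddt <= upper:
            return label
    raise AssertionError("unreachable: every value in 0-100 has a bin")


if __name__ == "__main__":
    # pTM-score reported for family F070198 on this page.
    print(is_low_confidence(0.54))
```

With the value reported for this family (0.54), `is_low_confidence` returns `True`, which matches the "Low Quality Model" note.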



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 123 Family Scaffolds
PF02321 | OEP | 22.76
PF01081 | Aldolase | 15.45
PF16576 | HlyD_D23 | 4.88
PF00582 | Usp | 2.44
PF00873 | ACR_tran | 2.44
PF07238 | PilZ | 2.44
PF03551 | PadR | 1.63
PF07676 | PD40 | 1.63
PF00106 | adh_short | 0.81
PF13519 | VWA_2 | 0.81
PF13561 | adh_short_C2 | 0.81
PF02371 | Transposase_20 | 0.81
PF05163 | DinB | 0.81
PF13377 | Peripla_BP_3 | 0.81
PF13414 | TPR_11 | 0.81
PF00464 | SHMT | 0.81
PF01904 | DUF72 | 0.81
PF01654 | Cyt_bd_oxida_I | 0.81
PF00480 | ROK | 0.81
PF13424 | TPR_12 | 0.81
PF13520 | AA_permease_2 | 0.81
PF01839 | FG-GAP | 0.81
PF12704 | MacB_PCD | 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 123 Family Scaffolds
COG1538 | Outer membrane protein TolC | Cell wall/membrane/envelope biogenesis [M] | 45.53
COG0800 | 2-keto-3-deoxy-6-phosphogluconate aldolase | Carbohydrate transport and metabolism [G] | 15.45
COG1695 | DNA-binding transcriptional regulator, PadR family | Transcription [K] | 1.63
COG1733 | DNA-binding transcriptional regulator, HxlR family | Transcription [K] | 1.63
COG1846 | DNA-binding transcriptional regulator, MarR family | Transcription [K] | 1.63
COG1940 | Sugar kinase of the NBD/HSP70 family, may contain an N-terminal HTH domain | Transcription [K] | 1.63
COG0112 | Glycine/serine hydroxymethyltransferase | Amino acid transport and metabolism [E] | 0.81
COG0156 | 7-keto-8-aminopelargonate synthetase or related enzyme | Coenzyme transport and metabolism [H] | 0.81
COG1271 | Cytochrome bd-type quinol oxidase, subunit 1 | Energy production and conversion [C] | 0.81
COG1801 | Sugar isomerase-related protein YecE, UPF0759/DUF72 family | General function prediction only [R] | 0.81
COG2318 | Bacillithiol/mycothiol S-transferase BstA/DinB, DinB/YfiT family (unrelated to E. coli DinB) | Secondary metabolites biosynthesis, transport and catabolism [Q] | 0.81
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 0.81
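
The frequencies in the two neighborhood tables are percentages of the 123 family scaffolds, so each can be converted back to an approximate scaffold count. The sketch below is plain arithmetic under that assumption; the page itself does not list the counts, and `percent_to_count` is a hypothetical helper name.

```python
FAMILY_SCAFFOLDS = 123  # "in 123 Family Scaffolds", from the table headers


def percent_to_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a percentage of scaffolds to the nearest whole scaffold count."""
    return round(percent * total / 100)


def count_to_percent(count: int, total: int = FAMILY_SCAFFOLDS) -> float:
    """Convert a scaffold count back to a percentage with two decimals."""
    return round(100 * count / total, 2)


if __name__ == "__main__":
    # Values taken from the Pfam and COG tables on this page.
    for name, percent in [
        ("COG1538 (TolC)", 45.53),
        ("PF02321 (OEP)", 22.76),
        ("PF01081 (Aldolase)", 15.45),
        ("PF03551 (PadR)", 1.63),
        ("PF00106 (adh_short)", 0.81),
    ]:
        count = percent_to_count(percent)
        print(f"{name}: {percent}% ~ {count} scaffolds "
              f"(round trip {count_to_percent(count)}%)")
```

Under this reading, 0.81 % corresponds to a single scaffold and 45.53 % to 56 scaffolds; converting each count back reproduces the listed percentage, which supports the assumption that all values share the same denominator.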



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 93.50 %
All Organisms | root | All Organisms | 6.50 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300009038|Ga0099829_10011662 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 5729
3300009089|Ga0099828_10091245 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2619
3300012096|Ga0137389_10570382 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 972
3300012203|Ga0137399_10053605 | All Organisms → cellular organisms → Bacteria | 2944
3300020579|Ga0210407_10001619 | All Organisms → cellular organisms → Bacteria | 20680
3300021478|Ga0210402_10121110 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2365
3300026551|Ga0209648_10000007 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 108347
3300027846|Ga0209180_10000902 | All Organisms → cellular organisms → Bacteria | 14383




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 40.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 12.20%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 12.20%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 10.57%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 10.57%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 6.50%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 2.44%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 2.44%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.63%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.81%
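
The habitat paths above are hierarchical, so the listed percentages can be rolled up to a coarser ecosystem level. The following sketch sums the distribution by path prefix; `roll_up` and the `depth` parameter are hypothetical names, and the rows are copied from the table above rather than fetched from the database.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

SEP = " → "

# (GOLD ecosystem path, % of family members), copied from the habitat table.
HABITAT_ROWS: List[Tuple[str, float]] = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 40.65),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 12.20),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 12.20),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 10.57),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 10.57),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 6.50),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds", 2.44),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil", 2.44),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil", 1.63),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 0.81),
]


def roll_up(rows: List[Tuple[str, float]], depth: int) -> Dict[str, float]:
    """Sum percentages over the first `depth` levels of each ecosystem path."""
    totals: Dict[str, float] = defaultdict(float)
    for path, percent in rows:
        prefix = SEP.join(path.split(SEP)[:depth])
        totals[prefix] += percent
    # Round to two decimals to hide floating-point noise from the summation.
    return {prefix: round(total, 2) for prefix, total in totals.items()}


if __name__ == "__main__":
    for prefix, total in roll_up(HABITAT_ROWS, depth=2).items():
        print(f"{total:6.2f}%  {prefix}")
```

At depth 2 this gives 97.57 % Terrestrial and 2.44 % Aquatic. The total is 100.01 % rather than 100 % because the table values are themselves rounded.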




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001089 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 | Environmental
3300001162 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 | Environmental
3300001593 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental
3300006354 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022531 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2) | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027583 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes) | Environmental
3300027587 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes) | Environmental
3300027635 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes) | Environmental
3300027645 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027674 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) | Environmental
3300027678 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027884 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes) | Environmental
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300030991 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome) | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental





Family Sequences
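
Rows in the table below run the protein ID, the sample taxon ID, the habitat, and the sequence together without separators. The following sketch shows one way to split such a row. It rests on assumptions that hold for the rows shown here but are not documented by the database: every taxon ID is ten digits starting with `3300`, habitat names are capitalized words, and sequences are uppercase letters with an optional trailing `*`. `parse_row` and `FamilyRow` are hypothetical names.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Assumed layout of a fused row:
# <protein ID><10-digit taxon ID starting with 3300><Habitat Words><SEQUENCE>[*]
ROW_PATTERN = re.compile(
    r"(?P<protein_id>\S+?)"
    r"(?P<taxon_id>3300\d{6})"
    r"(?P<habitat>[A-Z][a-z]+(?: [A-Z][a-z]+)*)"
    r"(?P<sequence>[A-Z]+)"
    r"(?P<stop>\*?)"
)


@dataclass
class FamilyRow:
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str
    has_stop: bool  # True when the sequence ended with '*'


def parse_row(line: str) -> Optional[FamilyRow]:
    """Split one fused table row into its four columns, or return None."""
    match = ROW_PATTERN.fullmatch(line.strip())
    if match is None:
        return None
    return FamilyRow(
        protein_id=match["protein_id"],
        taxon_id=match["taxon_id"],
        habitat=match["habitat"],
        sequence=match["sequence"],
        has_stop=bool(match["stop"]),
    )


if __name__ == "__main__":
    # First row of the table, with the sequence truncated here for brevity.
    example = "JGI12683J13190_100016253300001089Forest SoilMSRRIEALHQLNLSWPRAVSLPFALLCLVGSS"
    print(parse_row(example))
```

The split is unambiguous under these assumptions because the habitat must begin immediately after the ten digits, so only the last ten digits before the habitat name can be the taxon ID.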

Protein ID Sample Taxon ID Habitat Sequence
JGI12683J13190_100016253300001089Forest SoilMSRRIEALHQLNLSWPRAVSLPFALLCLVGSSAPAQAASVGPQTLSSPPQVRSQFAIADFDGDRRPDLATVQVGQSSSWDTHYWIAFQLSSGPRQTLGITAPTGGVQITSRDVNGDDFLDVIVTTAWTNQPVAVLLNDGKGNFRASSPSAFRGAFSTSEKSWALTTDELKDASAILISRYPGNCSEGSRFSSPRNVTGLLVLWASRNSHFSPVVSFLGRAPPSFVLHI*
JGI12683J13190_100109243300001089Forest SoilVVCDKANKCRPVDCLSSRCWLRNGSDHAGLTLAQRPGTAMSGPMKTSHHLSLSCSRAVSLAFALLCFTGSAGSAHAASVGPQSRPSSSTQVTSQFAIADFDGDTRPDLATVQAGVNSSWDTHYWIAFQLSSGPRQTLSITVPTGGLQITSRDVNGDYFLDVIVTAAWTNRPVAVLLNDGQGNFRAFSPSAFPGAFSTSEKSGVSTRDEVRDATAVLLSRYPTGNCSERTRFSSPRNVTRQLVLRPSRNLPYCPVVSFLGRAPPPSFFS*
JGI12714J13572_100004823300001162Forest SoilMCGLMKTLHQLNRSWPRAASLAFAFLCVILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWICITDEITDAVALLLSRYPTGNCQAVSRFSSPRKVTGRFVLRASRNSLSSAVVSFLGRAPPSFILHI*
JGI12635J15846_1000158983300001593Forest SoilMEALHQLNLSWPRAVSLPFALLCLVGSSAPAQAASVGPQTLSSPPQVRSQFAIADFDGDRRPDLATVQVGQSSSWDTHYWIAFQLSSGPRQTLGITAPTGGVQITSRDVNGDDFLDVIVTTAWTNQPVAVLLNDGKGNFRASSPSAFRGAFSTSEKSWALTTDELKDASAILISRYPGNCSEGSRFSSPRNVTGLLVLWASRNSHFSPVVSFLGRAPPSFVLHI*
JGI12635J15846_1002269963300001593Forest SoilVVCDKANKCRPVDCLSSRCWLRNGSDHAGLTLAQRPGTAMSGPMKTSHHLSLSCSRAVSLAFALLCFTGSAGSAHAASVGPQSRPSTSTQVTSQFAIADFDGDTRPDLATVQAGVNSSWDTHYWIAFQLSSGPRQTLSITVPTGGLQITSRDVNGDYFLDVIVTAAWTNRPVAVLLNDGQGNFRAFSPSAFPGAFSTSEKSGVSTRDEVRDATAVLLSRYPTGNCSERTRFSSPRNVTRQLVLRPSRNLPYCPVVSFLGRAPPPSFFS*
JGI12635J15846_1002973023300001593Forest SoilMSGLMKTLHQLHRSWPRAVRLAFAFLCLILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAILLNDGQGNFKASNPSAFPGAFRTSEESWICITDEIKDAVALLLSRYPTRNGPEVSRFSSPRNVTRRLVLRASRNSLSSAVVSFLGRAPPSFVLHI*
JGIcombinedJ26739_10001162673300002245Forest SoilMCGLMKTLHQLNRSWPRAASLAFAFLCVILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEXSWICITDEIXDAVALLLSRYPTGXCQEVXRFSXXRKVTGRFVLRASRNSLXSAVVSFLGRAPPSFILHI*
JGI25385J37094_1005139623300002558Grasslands SoilMKTPRQRHLSWSRAISAAFAFLCMFLGFAPCGNAAPGVPQSGPASFPQVTSQFAIADFDGDRRPDLATVQVGQGNSWDTYYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSTSEKSRVSTKDEVEDATAVLLSRYPTGNCPEDSRFSSPRNVRGRVVLRASCNLLSSAVVFFLGRAPPSFVLHI*
JGI25382J43887_1007174023300002908Grasslands SoilMKTLRQRHLSWSRAISAAFAFLCMVLGFAPCGNAAPGVPQSGPASFPQVTSQFAIADFDGDRRPDLATVQVGQGSSWDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTDRPVAVLLNDGQGNFSASSPSAFPGAFSTSEKSRVSTKDEVEDATAVLLSRYPTGNCPEDSRFSSPRNVTGLLVLRASRNSHSSAVVSFLGRAPPSFVFHI*
JGI25617J43924_1000604133300002914Grasslands SoilMKTPRQMHFSWSKAVSTAFAFLCLALGFASCGNAAPGVPQSGAASSPQVTSQFAIADFDGDRRPDLATVQVGQDNSRDTHYWIAFQLSGGTRQTLGITAPTGGLQIASRDVNGDSFLDVVITTAWTNLPVAVLLNDGQGNFRATSPSAFPGAFATSERSWASTVDEIKDATAVLLSRYPTGNCSEGNRFSSPRNVTGLLILRASRNLPYRLVVAFLGRAPPSFILHI*
JGI25617J43924_1001209433300002914Grasslands SoilMKTLHRLNLSWSRAVRLALALLCFVGSAGPAHAASTDPQSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDAHYWIAFQLSSGPRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPGAFTTSEKSWACTTDEVKDATAVLLSRYPTGNCSAGSRFSPPQNAAGLLLQWAPRSSALSVVVSFLGRAPPSFVLHI*
JGI25617J43924_1012332213300002914Grasslands SoilLCTPWRGWYPPDVGCANVQVMQDGRLTHRLEIAMSGLMKTLRHLNRSWPRAVGSAFALLCFAASAASVHAASTGPQSLPSPPQVRSQFAIADFDGDRRPDLATVHVAQSSSGDTHYWIAFQLSGGSRQTLGITAPIGGLQLTSRDVNGDDFLDVIITTAWTNQPVAVLLNDGRGNFRASSPSAFPGAFTTSEKSWACTTDELKDATAVLLSRYPTGNCPEGGRFSSPRNVTGLLVLRASRNSL
Ga0066680_1007625133300005174SoilMKTLRQRHLSWSRAISAAFAFLCMVLGFAPCGNAAPGVPQSGPASFPQVTSQFAIADFDGDRRPDLATVQVGQGSSWDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSTSEKSRVSTKDEVEDATAVLLSRYPTGNCPEDSRFSSPRNVTGLLVLRASRNSHSSAVVSFLGRAPPS
Ga0066685_1024750823300005180SoilSAHGGSTDPKSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDTHYWIAFQLSSGPRQTLGITAPNGGLHITSRDVNGDDFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPGAFTPSEKSWACTTDEVKDVTAVLLSRYPTGNCPEGSRFSQLQTAAGLLLRWAPRSSALSVVVSFLGRAPPSFVLHI*
Ga0066676_1005769223300005186SoilMKTLHRLNPSWSRAVRLALALLCFVGSAGSAHGGSTDPKSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDTHYWIAFQLSSGPRQTLGITAPNGGLHITSRDVNGDDFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPGAFTPSEKSWACTTDEVKDVTAVLLSRYPTGNCPEGSRFSQLQTAAGLLLRWAPRSSALSVVVSFLGRAPPSFVLQI*
Ga0070730_1001606023300005537Surface SoilMPGLMKILHQLNRSWLRAMSLAFAFSCVILGFVACGNAASTGTQSWPVSSTQATSQTAIADFDGDDRPDLATIQVGRDSSPNTQYWIAFQLSRGSRQILGITAPTGGLQVTSLDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFRASNPSSFPGAFTTSEKSWDCIPDEVKEATAALLSRYPAGSYPDSSRFPSPRNVVGLLVLRTSRNSLSSAVVSFLGRAPPSFVLYI*
Ga0066692_1004252723300005555SoilMKTLRQRHLSWSRAISAAFAFLCMVLGFAPCGNAAPGVPQSGPASFPQVTSQFAIADFDGDRRPDLATVQVGQGNSWDTYYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSTSEKSRVSTKDEVEDATAVLLSRYPTGNCPEDSRFSSPRNVRGRVVLRASCNLLSSAVVSFLGRAPPSFVLHI*
Ga0066692_1080997013300005555SoilSWSRAVRLALALLCFVGSAGSAHGGSTDPKSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDTHYWIAFQLSSGPRQTLGITAPNGGLHITSRDVNGDDFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPGAFTPSEKSWACTTDEVKDVTAVLLSRYPTGNCPEGSRFSQLQTAAGLLLR
Ga0066707_1041959613300005556SoilMKTLHRLNPSWSRAVRLALALLCFVGSAGSAHGGSTDPKSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDTHYWIAFQLSSGPRQTLGITAPNGGLHITSRDVNGDDFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPGAFTPSEKSWACTTDEVKDVTAVLLSRYPTGNCPEGSRFSQLQTAAGLLLRWAPRSSALSVVVSFLGRAPPSFVLHI*
Ga0066691_1047901513300005586SoilMKTLHRLNLSWSRAVRLALALLCFVGSSGSAHAAPTDPQSWPPPSTQITSQFAIADFDGDNRPDLATVQAGVSSSWDTQYWIAFQLSRGPRQTLGITAPNGGLHITSRDVNGDDFLDVVVTTAWTNRPVAVLLNDGQGNFRTSSPSSFPGAFTASEKSWACTTDEVKDATVVLLSRYPTRNCAEARGFSPLQTAAGLLLPWAPHSWVLSVSVSFLGR
Ga0070762_1004477723300005602SoilMKRLQQSKLSWSRAVGVAFVFLCLVLAFAACGNAASSEPQNWPVSSAQVTTQFALADFDGDNRPDLATVQAGRGNSSDTYYWIAFQLSSGPRQTLGVRAPNGGLHIATRDVNGDDFLDVIVTTAWTNRPVAVFLNDGRGNFRVSSPSGFPGAFTTSEKSWASSADEVRDVTAVLLSRYPTGNCSEAGTFFSPRNVNGLLTLWNFRSWHVAAVVSFLDRAPPSFVPHI*
Ga0075028_10007547323300006050WatershedsMSRLMKTLGPLNRSWRGIIALTMAFLCVILGFAAYGSAAPTASQTRPFSSTQATSRFAIADFDGDNRPDLATVQVGDSNALDTHYWIAFQLSSGARRTLGITAPAGGLRITSQDVNGDDFLDIIVTTAWANRPVAILLNDGQGNFRVSDPSAFSAAFTTSDKSWASSADEATDATAILLSRYPTGNCPEGNEFSSPRNVTGLLVLRASRNSLSSPVVSFLGRAPPSFVLHN*
Ga0075021_1005712923300006354WatershedsVNAFETAMSRLMKTLGPLNRSWRGIIALTMAFLCVILGFAAYGSAAPTASQTRPFSSTQATSRFAIADFDGDNRPDLATVQVGDSNALDTHYWIAFQLSSGARRTLGITAPAGGLRITSQDVNGDDFLDIIVTTAWANRPVAILLNDGQGNFRVSDPSAFSAAFTTSDKSWASSADEATDATAILLSRYPTGNCPEGNEFSSPRNVTGLLVLRASRNSLSSPVVSFLGRAPPSFVLHN*
Ga0099793_1050166213300007258Vadose Zone SoilGSARAAPAGPQSWPSSPQATSQFAIADFDGDRGPDLATVQAGQSSSLDTQYWIALQLSSGPRRTLGITAPNGGLHITSRDVNGDHFLDIIVTAASTNRRVAVLLNYGQGNFRAFGPSAFPGAFTSSEKSCTSTTDEIKDATAFLLSRHPTGNCSEGSRFSSPRSMKGLLVLWTSHNSPLSSVVSFLGRAPPSFVLHI*
Ga0066710_10001918523300009012Grasslands SoilMKTLRQRHLSWSRAISAAFAFLCMVLGFAPCGNAAPGVPQSGPASFPQVTSQFAIADFDGDRRPDLATVQVGQGSSWDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTDRPVAVLLNDGQGNFSASSPSAFPGAFSTSEKSRVSTKDEVEDATAVLLSRYPTGNCPEDSRFSSPRNVTGLLVLRASRNSHSSAVVSFLGRAPPSFVFHI
Ga0066710_10007399523300009012Grasslands SoilMKTLHRLNPSWSRAVRLALALLCFVGSAGSAHGGSTDPKSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDTHYWIAFQLSSGPRQTLGITAPNGGLHITSRDVNGDDFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPGAFTPSEKSWACTTDEVKDVTAVLLSRYPTGNCPEGSRFSQLQTAAGLLLRWAPRSSALSVVVSFLGRAPPSFVLHI
Ga0099829_1000579443300009038Vadose Zone SoilMSLLMKTLHQLTLSWPRAASLAFALLCFAGSAAPAHAASAGPQSLPSPPPQVRSQFAIADFDGDRRPDIATVHVGQSSSWETHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWANQPVAVLLNDGQGNFRASSPSAFPGAFATSEKSWALTTDEVKDATAALFSRYPTGNCSEGSRFSSPRNVTGLLVLWASRNSPFRPVISFLGRAPPSFVPHI*
Ga0099829_1001166263300009038Vadose Zone SoilMSGLMKALHQLSLSCSRAVSLSFALLCFVGSAGSAHAACAGPSSSPQVTPQFAIADFDGDRRPDLATVQVGQGSSWDTQYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDSQGNFRASGPSAFPGAFTTSEKSCASTTDETKDATAVLLSRYPTGNCLEGSRFSSPRNITGLLVLWTSHNSPFSSVVSFLGRGPPSFVLHI*
Ga0099829_1035654523300009038Vadose Zone SoilMKTLHRLNLSWSRAVRLAFVLLCFVGSAGSAHAGSTDPKSWPSTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDTHYWIAFQLSSGPRQTLGITAPNGGLHITSRDVNGDDFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPRAFTTSEKSWICITDEIKDAVALLLSRYPTVNCPEVSRFSSPRNVTGLLVLQASRNSHSSAVVSFLGRAPPSVVLHI*
Ga0099829_1043712823300009038Vadose Zone SoilMSGLMKTLHRLTLSWPRAVSLAFALLCFVGSASSAHAASAGPQSWPVSSTQVTSQFAIADFDGDRRPDLATVQAGQVGSLDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDGQGNFRTSSPSAFPGAFTTSEKSWASNTDKITDATAVLLSRYPTGNCSEGSRFSSPRNVTGLLVLWASRNSHFSSVVSFLGRAPPSSVLHI*
Ga0099829_1109113813300009038Vadose Zone SoilALALLCFVGSAGSAHAAATDPQSWLVSSTQATSQFAIADFDGDSRPDLATVQAGVSSSWDTHYWIAFQLSRGPRQTLSITAPNGGLHISSRDVNGDDFLDVVVTTAWTNRPVAVLLNDGHGNFRASSPSAFPGAFTTSEKSWAYATDEVKDATAVLFSRYPPRDCSEASRFSPPQDAAGLLLPWAPRSSVLSVVVSFLGRAPPSLVPNI*
Ga0099830_1001582933300009088Vadose Zone SoilMSLLMKTLHQLTLSWPRAVSLAFALLCIAGSAAPAHAASAGPQSLPSPPPQVRSQFAIADFDGDRRPDIATVHVGQSSSWETHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWANQPVAVLLNDGQGNFRASSPSAFPGAFATSEKSWALTTDEVKDATAALFSRYPTGNCSEGSRFSSPRNVTGLLVLWASRNSPFRPVISFLGRAPPSFVPHI*
Ga0099830_1010025623300009088Vadose Zone SoilMSGLMKTLHRLTHSWPRAVSLAFALLCFVGSASSAHAASAGPQSWPVSSTQVTSQFAIADFDGDRRPDLATVQAGQVGSLDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDGQGNFRTSSPSAFPGAFTTSEKSWASNTDKITDATAVLLSRYPTGNCSEGSRFSSPRNVTGLLVLWASRNSHFSSVVSFLGRAPPSSVLHI*
Ga0099830_1048666713300009088Vadose Zone SoilVGCANVQVMQDGRLTHRLEIAMSGLMKTLRHLNRSWPRAVGSAFALLCFAASAGSVLAASTGPQSLPSPPQVRSQFAIADFDGDRRPDLATVHVAQSSSGDTHYWIAFQLSGGSRQTLGITAPIGGLQLTSRDVNGDDFLDVIITTAWTNQPVAVLLNDGRGNFRASSPSAFPGAFTTSEKSWACTTDELKDATAVLLSRYPTGNCPEGGRFSSPRNVTGLLVLRASRNSLSSAVVSFLGRAPPSFVLHI*
Ga0099828_1009124553300009089Vadose Zone SoilVHSLTWMVSSRCWLRNRSDHAGLTLGHRLETAMSGLMKTLHQLSLSCSRAVSLAFALLCFVGSAGSAHAACAGPSSSPEVTPQFAIADFDGDRRPDLATVQVGQGSSWDTQYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDSQGNFRASGASAFPGAFTTSEKSCASTTDETKDATAVLLSRYPTGNCLEGSRFSSPRNITGLLVLWTSHNSPFSSVVSFLGRGPPSFVLHI*
Ga0099827_1002188443300009090Vadose Zone SoilMSGLMKTLRHLNRSWPRAVGSAFALLCFAASAGSVLAASTGPQRLPSPPQVRSQFAIADFDGDRRPDLATVHVAQSSSGDTHYWIAFQLSGGSRQTLGITAPIGGLQLTSRDVNGDDFLDVIITTAWTNQPVAVLLNDGRGNFRASSPSAFPGAFTTSEKSWACTTDELKDATAVLLSRYPTGNCPEGGRFSSPRNVTGLLVLRASRNSLSSAVVSFLGRAPPSFVLHI*
Ga0099827_1090906223300009090Vadose Zone SoilMSGLMKTLHQLSLYCSRAVSLAFALLCFVGSAGSAHAACAGPSSSPQVTPQFAIADFDGDRRPDLATVQVGQGSSWDTQYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDSQGNFRASGASAFPGAFTTSEKSCASTTDETKDATAVLLSRYPTRNCAEGSRFSSPR
Ga0099792_1007948523300009143Vadose Zone SoilMSGLMKTLRHLNRSWPRAVGSAFALLCFAASAGSVHAASTGPQSLPSPPQVRSQFAIADFDGDRRPDLATVHVAQSSSGDTHYWIAFQLSGGSRQTLGITAPIGGLQLTSRDVNGDDFLDVIITTAWTNQPVAVLLNDGRGNFRASSPSAFPGAFTTSEKSWACTTDELKDATAVLLSRYPTRNCPEGSRFSSPRSVTGLLVLRASRNSLSSAVVSFLGRAPPSFVLHI*
Ga0137392_1009184823300011269Vadose Zone SoilVHSLTWMVSSRCWLRNRSDHAGLTLAHRLETAMSGLMKTLHQLSLYCSRAVSLAFALLCFVGSAGSAHAACAGPSSSPEVTPQFAIADFDGDRRPDLATVQVGQGSSWDTQYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDSQGNFRASGASAFPGAFTTSEKSCASTTDETKDATAVLLSRYPTGNCLEGSRFSSPRNITGLLVLWTSHNSPFSSVVSFLGRGPPSFVLHI*
Ga0137392_1070631213300011269Vadose Zone SoilMKALRQMHFSWSKAVTTAFAFLCLVLVFASRGSAAPGVAQSGPASSLQVTSQFAIADFDGDRRPDLATVQVGQGSSWDTHYWIAFQLSGGSRQTLAITAPTGGLQIASRDVNGDSFLDVVITTAWTNQPVAVLLNDGQGNFRATSPSAFPGAFTTSEKSWASTADEIKDATAVLLSRYPTGSCSEGGRFSSPRNVTGLRILRASRNLPYRPAVSFLGRAPPSFVLHI*
Ga0137391_1005766513300011270Vadose Zone SoilVHSLTWMVSSRCWLRNRSDHAGLTLAHRLETAMSGLMKTLHQLSLYCSRAVSLAFALLCFVGSAGSAHAACAGPSSSPQVTPQFAIADFDGDRRPDLATVQVGQGSSRDTHYWVAFQLSGGSRQTVGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDSQGNFRASGASAFPGAFTTSEKSCASTTDETKDATAVLLSRYPTGNCLEGSRFSSPRNITGLLVLWTSHNSPFSSVVSFLGRGPPSFVLHI*
Ga0137391_1015656523300011270Vadose Zone SoilMKTLHRLNLSWSRALIRLAFVLLCFVGSAGSAHAGSTDPKSWPSTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDTHYWIAFQLSSGPRQTLGITAPNGGLHITSRDVNGDDFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPRAFTTSEKSWICITDEIKDAVALLLSRYPTVNCPEVSRFSSPRNVTGLLVLQASRNSHSSAVVSFLGRAPPSVVLHI*
Ga0137393_1006290673300011271Vadose Zone SoilVHSLTWMVSSRCWLRNRSDHAGLTLAHRLETAMSGLMKTLHQLSLYCSRAVSLAFALLCFVGSAGSAHAACAGPSSSPQVTPQFAIADFDGDRRPDLATVQVGQGSSWDTQYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDSQGNFRASGPSAFPGAFTTSEKSCASTTDETKDATAVLLSRYPTGNCLEGSRFSSPRNITGLLVLWTSHNSPFSSVVSFLGRGPPSFVLHI*
Ga0137389_1057038223300012096Vadose Zone SoilCAGPSSSPQVTPQFAIADFDGDRRPDLATVQVGQGSSWDTQYWVAFQLSGGSRQTLGITAPTGGLQITSPDVNGDDFLDVIVTTAWTNRPVAVLLNDSQGNFRASGASAFPGAFTTSEKSCASTTDETKDATAVLLSRYPTGNCLEGSRFSSPRNITGLLVLWTSHNSPFSSVVSFLGRGPPSFVLHI*
Ga0137388_1022436813300012189Vadose Zone SoilMSGLMKTLHQLSLYCSRAVSLAFALLCFVGSAGSAHAACAGPSSSPQVTPQFAIADFDGDRRPDLATVQVGQGSSWDTQYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDSQGNFRASGPSAFPGAFTTSEKSCASTTDETKDATAVLLSRYPTGNCLEGSRFSSPRNITGLLVLWTSHNSPFSSVVSFLGRGPPSFVLHI*
Ga0137388_1032050323300012189Vadose Zone SoilMKTLHRLNLSWSRAVRLALALLCFVGSAGPAHAASTDPQSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDAHYWIAFQLSSGPRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWICITDEIKDAVALLLSRYPTGNCQEVSRFSSPRKVTGRFVLRASRNSLSSAVVSFLGRAPPSFILHI*
Ga0137388_1060924513300012189Vadose Zone SoilMSGLMKTLHPLNRSWPSTVSLSFAFLCLILGFAASGNAASAGPQSWPVSSTQTTSQFSIADFDGDNRPDLATVQVGHGSSSDTQYWIAFQLSRGSRQILGIIAPTGGLQVRSRDVNGGSFLDVVVTTAWTDRPVAVLLNDGQGNFRASSPSAFPGAFTTSEKSWACIADEVKEATAVLLARYPTGNCPESSRPSSPRNVT
Ga0137383_1024342823300012199Vadose Zone SoilMKTLHRLNLSWSRAVRLALAVLCSVGSAGSAHAASTDPQSWPPTSTQITSQLAIADFDGDNRPDFATFQVGHSSSWDTQYWIAFQLSRGSRQILGITAPTGGLQVTSRDVNGDSFFDVVVTTEWTNQPVAVLLNDGQGRFAVSSPSAFPGAFPISEKSRICITDEIKEAVALLLSRYPTVNCSETSRFSPPQNAAGLLLPWAPRSSVLSVVVSFLGRAPPSSVPHI*
Ga0137363_1007349023300012202Vadose Zone SoilMFRLMRTFNQLSVSWSRAVRLAFALLCFVGSARCAGAASTGSQSWPPTSIQVTSQFAIADFDGDRRPDLATVQVGQGSSWETHYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSASEKSRVSTKDEVKDPIAVLLSRFPTGNCPENSRFSSPRNVTGRVVPRASRNLLSSAVVSFLGRAPPSFVLHI*
Ga0137399_1005360533300012203Vadose Zone SoilMSGLLKTLHQASLSWSRAVSLAFALLCFVGSIGSARAAPAGPQSWPSSPQATSQFAIADFDGDRGPDLATVQAGQSSSLDTQYWIALQLSSGPRRTLGITAPNGGLHITSRDVNGDHFLDIIVTAASTNRPVAVLLNYGQGNFRAFGPSAFPGAFTSSEKSCTSTTDEIKDATAFLLSRHPTGNCSEGSRFSSPRSMKGLLVLWTSHNSPLSSVVSFLGRAPPSFVLHI*
Ga0137362_1009618123300012205Vadose Zone SoilMFRLMRTFNQLSVSWSRAVRLAFALLCFVGSARCAGAASTGSQSWPPTSIQVTSQFAIADFDGDRRPDLATVQVGQGSSWETHYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSASEKSRVSTKDEVKDATAVLLSRFPTGNCPENSRFSSPRNVTGRVVPRASRNLLSSAVVSFLGRAPPSFVLHI*
Ga0137380_1016995923300012206Vadose Zone SoilMKTLRQRHLSWSRAISAAFAFLCMFLGFAPCGNAAPGVPQSGPASFPQVTSQFAIADFDGDRRPDLATVQVGQGNSWDTYYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSTSEKSRVSTKDEVEDATAVLLSRYPTGNCPEDSRFSSPRNVRGRVVLRASCNLLSSAVVSFLGRAPPSFVLHI*
Ga0137378_1095614423300012210Vadose Zone SoilMKTLHRLIVSWSKAVWLAFALLCFVGAAGSAHAASAGPQRWLVSSTQATSQFAIADFDGDNRPDLATVHVGQSTSWDTHYWIAFQLSRGSRQILGITAPTGGLQVTSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGHGNFRASSPSAFPGAFSTSEKSWACTTDEVKDAAAVLFSRYAPGNCSEASR
Ga0137387_1011907923300012349Vadose Zone SoilMKTLRQRHLSWSRAISAAFAFLCMFLGFAPCGNAAPGVPQSGPASFPQVTSQFAIADFDGDRRPDLATVQVGQGNSWDTYYWIAFQLSGGSRQTLGITAPTGRLKITSRDVTGDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSTSEKSRVSTKDEVEDATAVLLSRYPTGNCPEDSRFSSPRNVRGRVVLRASCNLLSSAVVSFLGRAPPSFVLHI*
Ga0137384_1007080353300012357Vadose Zone SoilMKTLRQSNLSWFRAVRAPFALLCLGFAACGNAASPGPQSWPVSSTQARSQFAIADFDGDNRPDLATVQVGQGSSWDTRYWIAFQLSRGSRQVLGVTAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAVPVAFTTSETLGPPLLMSLKDATAVLLSRHPTGNCSEAVGFRHLEK*
Ga0137384_1012958623300012357Vadose Zone SoilMKTLHRLNLSWSRAVRLALAVLCSVGSAGSAHAASTDPQSWPPTSTQITSQFAIADFDGDNRPDFATVQVGHSSSWDTQYWIAFQLSRGSRQILGITAPTGGLQVTSRDVNGDSFLDVVVTTEWTNQPVAVLLNDGQGRFTVSSPSAFPGAFPISEKSRICITDEIKDAVALLLSRYPTVNCSEASRFSPPQNAAGLLLPWAPRSSVLSVVVSFLGRAPPSFVPHI*
Ga0137361_1079632813300012362Vadose Zone SoilMKTLRQMHFSWSKAVSTPFAFLCFVLGFASCANAAPGVPQSGAASSPQVTSQFAIADFDGDRRPDLASVQVGQGSSWDTHYWIAFQLSGGSRQTLGITGPTGGLQITSRDVNGDSFLDVVITTAWTNQPVAVLLNDGQGNFRATSPSAFPGAFTTSEESWASTADEIKDATAVLLSRYPTGNCLEGSRFSSPRNVTGLLVLRASRNLPYRPVVSFLGRAPPSFILHI*
Ga0137390_1003995743300012363Vadose Zone SoilMKTLHRLNLSWSRAVRLALALLCFVGSAGPAHAASTDPQSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDAHYWIAFQLSSGPRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPGAFSTSEKSWACTTDEVKDATAVLLSRYPTGDCSAGSRFSPPQNAAGLLLQWAPRSSALSVVVSFLGRAPPSFVLHI*
Ga0137390_1007943733300012363Vadose Zone SoilMVLGFATCGNAAPGVPQSGPASSLLVTSQFAIADFDGDRRPDLATVQVGQGSSWDTHYWIAFQLSGGSRQTLAITAPTGGLQIASRDVNGDSFLDVVITTAWTNQPVAVLLNDGQGNFRATSPSAFPGAFTTSEESWASTFDEIKDATAALLSRYPTGNCSEGSKFSSPRNVTGLLVLRASRNLPYRTVVSFLGRAPPSFILHI*
Ga0137358_1005138023300012582Vadose Zone SoilMFRLMRTFNQLSVSWSRAVRLAFALLCFVGSARCAGAASTGSQSWPPTSIQVTSQFAIADFDGDRRPDLATVQVGQGSSWETHYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNWPVAVLLNDGQGNFSASSPSAFPGAFSASEKSRVSTKDEVKDATAVLLSRFPTGNCPENSRFSSPRNVTGRVVPRASRNLLSSAVVSFLGRAPPSFVLHI*
Ga0137396_1000984143300012918Vadose Zone SoilMSGLMKTLRQLNRSWPRAVSLAFAFLCVILDFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGHRGSSDTQYWIAFQLSRGSRQILGITGPTGGLQVTSRDVNGDSFLDVVVTTTWTNRPVAVLLNDGQGNFRASSPSAFPGVFTTSEKSWACIADEVKEATAVLLSRYPTGNCPESSRPSSPRNVTGLLVLRTSRNSLSSAVASFLGRAPPSFVLHI*
Ga0137396_1002459823300012918Vadose Zone SoilMLAAKPFRSCRGLALRLEIAMSGLLKTLHQASLSWSRAVSLAFALLCFVGSIGSARAAPAGPQSWPSSPQATSQFAIADFDGDRGPDLATVQAGQSSSLDTQYWIALQLSSGPRRTLGITAPNGGLHITSRDVNGDHFLDIIVTAASTNRPVAVLLNYGQGNFRAFGPSAFPGAFTSSEKSCTSTTDEIKDATAFLLSRHPTGNCSEGSRFSSPRSMKGLLVLWTSHNSPLSSVVSFLGRAPPSFVLHI*
Ga0137396_1016576823300012918Vadose Zone SoilMPRRMKTLHQLTLSWPRAVGLAFALLCFAGSADSAHAASADPQSLSSPPQVRSQFAIADFDGDRRPDLATVQVGQSSSWDTHYWIAFQLSSGPRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDGQGNFRASDPSAFPGAFTTSEKSLASTADEIKDATAVLPSRDPAGNCSNGSRFWSPQKVRRLFVPWASRNLPFSVIVSFLGRAPPSFVLHI*
Ga0164301_1007919923300012960SoilVIHSTAIVLAQYTETEDTGGPRQSEQISARWTAYPPDADYETVRIIHAGLTLALMSGLMKTSHQLSLSCSRAVSLAFALLCFVGSASSSQAASTDPRTWPPTAAQVTSQFAFVDFDGDRRPDLATVQAGQGSSWATQYWIAFQLSRGSPQTLGVIAPTGGLQITSRDVNSDSFLDVVVTTEWTNQPVAVLLNDGQGNFTASDPSAFPGAFRTSEKSWICITDEIKDAVALLLSRYRTGNCPASSSFSSPRNLMRLLVMWASRNSLSSIVVSFLGRAPPSLVLHI*
Ga0066662_1077393413300018468Grasslands SoilMKTLHRLNLSWSRAVRLALALLCFVGSAGSAHGGSTDPKSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDTHYWIAFQLSSGPRQTLGITAPNGGLHITSRDVNGDDFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPGAFTPSEKSWACTTDEVKDVTAVLLSRYPTGNCPEGSRFSQLQTAAGLLLRWAPRSSALSVVVSFLGRAPPSFVLHI
Ga0179592_1002385423300020199Vadose Zone SoilMFRLMRTFNQLSVSWSRAVRLAFALLCFVGSARCAGAASTGSQSWPPTSIQVTSQFAIADFDGDRRPDLATVQVGQGSSWETHYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNDDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSASEKSRVSTKDEVKDATAVLLSRFPTGNCPENSRFSSPRNVTGRVVPRASRNLLSSAVVSFLGRAPPSFVLHI
Ga0210407_10001619163300020579SoilVVRDKANKCRPVDCLSPGSWQRKRSDPAGLMLADYLETVMSGLVKTSAQFSLCCSRTVSLAFTLCLVGAAGSTHAASAGPQGWPSSSPQATSQFAIADFDGDRQPDLATIQAVQSSSPDTEYWIAFQLSSGPQQTLGIAAPPGGLQVTSRDVNGDNFLDVLVTTAWTNRPVAVLINDGQGKFRASSPCAFPGAFATSEKSFVYATDETRDATAILFSRYPTSNCSEGSGGSPRNVMGLLVLKASRNSLSSAVVCFLGRAPPTCVCHD
Ga0210407_1010095623300020579SoilMQYRRLTHRLEIAMSGLMKTSHQLNRSWPGAVGLAFAFLCVVLGFATYGNAASAGPQSWPVSSARTTSQFAIADFDGDNRPDLATIQVGHGSSSDTQYWIAFQLSRGSRQILGITAPTGGLQVTSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPRAFTTSEESWACINDEVKEANAALLLRYPTGNCPESRRFSSPQYVTGLLVLWASRNSRFSPVVSFLGRAPPSFVLHI
Ga0210403_1012251113300020580SoilRLEIAMSRLMKTLHQLNRSWPRAVSSAFAFLCVILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWICITDEITDAVALLLSRYPTGNCQEVSRFSPPRKVTGRFVLRASRNSLSSAVVSFLGRAPPSFILHI
Ga0210403_1017481423300020580SoilVVRDKAKKCRPVDCLCPGSWLRKRSDPAGLMLADYLETVMSGLVKTSAQFSLCCSRTVSLAFTLCLVGAAGSTHAASAGPQGWPSSSPQATSQFAIADFDGDRQPDLATIQAVQSSSPDTEYWIAFQLSSGPQQTLGIAAPPGGLQVTSRDVNGDNFLDVVVTTAWTNRPVAVLINDGQGKFWASSPCAFPGAFATSEKSFVYATDETRDATAILFSRYPTSNCSEGSRGSPRNVMGLLVLKASRNSLSSAIVCFLGRAPPTCVCHD
Ga0210399_1001796843300020581SoilMCGLMKTLHQLNRSWPRAASLAFAFLCVILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWICITDEITDAVALLLSRYPTGNCQEVSRFSSPRKVTGRFVLRASRNSLSSAVVSFLGRAPPSFILHI
Ga0210404_1000183573300021088SoilVVRDKAKKCRPVDCLCRGSWLRKRSDPAGLMLADYLETVMSGLVKTSAQFSLCCSRTVSLAFTLCFVGAAGSTHAASAGPQGWPSSSPQATSQFAIADFDGDRQPDLATIQAVQSSSPDTEYWIAFQLSSGPQQTLGIAAPPGGLQVTSRDVNGDNFLDVLVTTAWTNRPVAVLINDGQGKFRASSPCAFPGAFATSEKSFVYATDETRDATAILFSRYPTSNCSEGSRGSPRNVMGLLVLKASRNSLSSAVVCFLGRAPPTCVRHD
Ga0210404_1001341433300021088SoilMSGLMKTLHRLNRSWPRAASLAFAFLCVILGFAACGNAASTGQQSWPVSSTQATSQFAIADFDGDSRPDLATIQVGQGSSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWICITDEITDAVALVLSRYPTGNCQEVSRFSSPRRVTGRFVLRASRNSLSSAVVSFLGRAPPSFILHI
Ga0210400_1003799153300021170SoilVVRDKAKKCRPVDCLCRGSWLRKRSDPAGLMLADYLETVMSGLVKTSAQFSLCCSRTVSLAFTLCFVGAAGSTHAASAGPQGWPSSSPQATSQFAIADFDGDRQPDLATIQAVQSSSPDTEYWIAFQLSSGPQQTLGIAAPPGGLQVTSRDVNGDNFLDVVVTTAWTNRPVAVLINDGQGKFRASSPCAFPGAFATSEKSFVYATDETRDATAILFSRYPTSNCSEGSRGSPRNVMGLLVLKASRNSLSSAVVCFLGRAPPTCVRYD
Ga0210394_1009513723300021420SoilMCGLMKTLHQLNRSWPRAASLAFAFLCVILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWICITDEITDAVALVLSRYPTGNCQEVSRFSSPRRVTGRFVLRASRNSLSSAVVSFLGRAPPSFILHI
Ga0210384_1012757033300021432SoilMSGAMRTSNQLNVSCSRALRLAFVLLCFFAFAISARAASTGSQSWSPTSTQVTSQFAIADFDGDNRPDLATVHVAQSSSRYSHYWIAFQLSGGSRQTLGIIAPAGGLQIVSRDVNGDSFVDVIVTTAWTNRPVAVLLNDGRGNFSASSPSAFPGAFAASEKSWIGITDEIKDGVALLFSRYPAGDCSEVSRFSSPRNVAGLLFLWAPRRSALSAVVSFLGRAPPSFVLHI
Ga0210402_1012111023300021478SoilVVRDKANKCRPVDCLSPGSWQRKRSDPAGLMLADYLETVMSGLVKTSAQFSLCCSRTVSLAFTLCLVGAAGSTHAASAGPQGWPSSSPQATSQFAIADFDGDRQPDLATIQAVQSSSPDTEYWIAFQLSSGPQQTLGIAAPPGGLQVTSRDVNGDNFLDVVVTTAWTNRPVAVLINDGQGKFWASSPCAFPGAFATSEKSFVYATDETRDATAILFSRYPTSNCSEGSGGSPRNVMGLLVLKASRNSLSSAVVCFLGRAPPTCVCHD
Ga0210410_1026541823300021479SoilMSGLMKTLHRLNRSWPRAASLAFAFLCVILGFAACGNAASTGQQSWPVSSTQATSQFAIADFDGDSRPDLATVQVGQGSSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWICITDEITDAVALLLSRYPTGNCQEVSRFSSPRRVTGRFVLRASRNSLSSAVVSFLGRAPPSFILHI
Ga0210409_1001915943300021559SoilMSGLMRTSNQLSVSCSRAPRLAFALLCFLGFAISARATSTGSQSWTPTSTLVTSQFAIADFDGDSRPDLATVHVAESSFRYSHYSIAFQLSGGSRQTLGIIAPTGGLQIISRDVNGDSFLDVIVTTAWTNRPVAVLLNDGRGNFSAASPSAFPGAFTASEKSWICITDQIKDAVALLSSRYPAGECSEVSRFSSPRKVTRLLFLWAPRSSALSAVVSFLGRAPPSFVLHT
Ga0242660_105946423300022531SoilDGRLTHRLEIAMCGLMKTLHQLNRSWPRAASLAFAFLCVIILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGGSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWIFITDEITDAVALLLSRYPTGNCQEVSRFSSPRRVTGRFVLRASRNSLSSAVVSFLGRAPPSFILHI
Ga0209237_100821233300026297Grasslands SoilMKTPRQRHLSWSRAISAAFAFLCMFLGFAPCGNAAPGVPQSGPASFPQVTSQFAIADFDGDRRPDLATVQVGQGNSWDTYYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSTSEKSRVSTKDEVEDATAVLLSRYPTGNCPEDSRFSSPRNVTGLLVLRASCNLLSSAVVFFLGRAPPSFVLHI
Ga0209236_103182423300026298Grasslands SoilMKTPRQRHLSWSRAISAAFAFLCMFLGFAPCGNAAPGVPQSGPASFPQVTSQFAIADFDGDRRPDLATVQVGQGNSWDTYYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSTSEKSRVSTKDEVEDATAVLLSRYPTGNCPEDSRFSSPRNVRGRVVLRASCNLLSSAVVFFLGRAPPSFVLHI
Ga0209377_113748313300026334SoilLSWSRAISAAFAFLCMVLGFAPCGNAAPGVPQSGPASFPQVTSQFAIADFDGDRRPDLATVQVGQGNSWDTYYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSTSEKSRVSTKDEVEDATAVLLSRYPTGNCPEDSRFSSPRNVRGRVVLRASCNLLSSAVVSFLGRAPPSFVLHI
Ga0209648_10000007683300026551Grasslands SoilMKTPRQMHFSWSKAVSTAFAFLCLALGFASCGNAAPGVPQSGAASSPQVTSQFAIADFDGDRRPDLATVQVGQDNSRDTHYWIAFQLSGGTRQTLGITAPTGGLQIASRDVNGDSFLDVVITTAWTNLPVAVLLNDGQGNFRATSPSAFPGAFATSERSWASTVDEIKDATAVLLSRYPTGNCSEGNRFSSPRNVTGLLILRASRNLPYRLVVAFLGRAPPSFILHI
Ga0209648_1001496633300026551Grasslands SoilMKTLHRLNLSWSRAVRLALALLCFVGSAGPAHAASTDPQSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDAHYWIAFQLSSGPRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPGAFTTSEKSWACTTDEVKDATAVLLSRYPTGNCSAGSRFSPPQNAAGLLLQWAPRSSALSVVVSFLGRAPPSFVLHI
Ga0209648_1011011323300026551Grasslands SoilMKDGRLAHRLKTAMSGLMKTLHQLNRSWPRAASLAFAFLCVILGFAACGNAASTGPQSWPDSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDNFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWICITDEIKDAVALLLSRYPTGNCQEVSRFSSPRKVTGRFVLRASRNSLSSAVVSFLGRAPPSFILHI
Ga0179587_1010823913300026557Vadose Zone SoilLMRTFNQLSVSWSRAVRLAFALLCFVGSARCAGAASTGSQSWPPTSIQVTSQFAIADFDGDRRPDLATVQVGQGSSWETHYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNDDSFLDVVVTTAWTNRPVAVLLNDGQGNFSASSPSAFPGAFSASEKSRVSTKDEVKDATAVLLSRFPTGNCPENSRFSSPRNVTGRVVPRASRNLLSSAVVSFLGRAPPSFVLHI
Ga0209527_100006453300027583Forest SoilMCGLMKTLHQLNRSWPRAASLAFAFLCVILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWICITDEITDAVALLLSRYPTGNCQAVSRFSSPRKVTGRFVLRASRNSLSSAVVSFLGRAPPSFILHI
Ga0209220_100913543300027587Forest SoilVVCDKANKCRPVDCLSSRCWLRNGSDHAGLTLAQRPGTAMSGPMKTSHHLSLSCSRAVSLAFALLCFTGSAGSAHAASVGPQSRPSTSTQVTSQFAIADFDGDTRPDLATVQAGVNSSWDTHYWIAFQLSSGPRQTLSITVPTGGLQITSRDVNGDYFLDVIVTAAWTNRPVAVLLNDGQGNFRAFSPSAFPGAFSTSEKSGVSTRDEVRDATAVLLSRYPTGNCSERTRFSSPRNVTRQLVLRPSRNLPYCPVVSFLGRAPPPSFFS
Ga0209625_100041363300027635Forest SoilMCGLMKTLHQLNRSWPRAASLAFAFLCVILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWETHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASSPSAFPGAFRTSEESWICITDEIKDAVALLLSRYPTGDCQEVSRFSPARKVTGRFVLRASRNSLLSAVVSFLGRAPPSFILHI
Ga0209117_1000387113300027645Forest SoilMSRRMEALHQLNLSWPRAVSLPFALLCLVGSSAPAQAASVGPQTLSSPPQVRSQFAIADFDGDRRPDLATVQVGQSSSWDTHYWIAFQLSSGPRQTLGITAPTGGVQITSRDVNGDDFLDVIVTTAWTNQPVAVLLNDGKGNFRASSPSAFRGAFSTSEKSWALTTDELKDASAVLISRYPGNCSEGSRFSSPRNVTGLFVLWASRNSPFSPVVSFLGRAPPSFVLHI
Ga0209588_103880423300027671Vadose Zone SoilMSGLMKTLRHLNRSWPRAVGSAFALLCFAASAGSVLAASTGPQSLPSPPQVRSQFAIADFDGDRRPDLATVHVAQSSSGDTHYWIAFQLSGGSRQTLGITAPIGGLQLTSRDVNGDDFLDVIITTAWTNQPVAVLLNDGRGNFRASSPSAFPGAFTTSEKSWACTTDELKDATAVLLSRYPTGNCPEGGRFSSPRNVTGLLVLRASRNSLSSAVVSFLGRAPPSFVLHI
Ga0209118_100639733300027674Forest SoilMSGLMKTLHQLNRSWPRAVRLAFAFLCLILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAILLNDGQGNFKASSPSAFPGAFRTSEKSWICITDEIKDAVALLLSRYPTRNGPEVSRFSSPRNVTRRLVLRASRNSLSSAVVSFLGRAPPSFVLHI
Ga0209118_106067713300027674Forest SoilMSRRIEALHQLNLSWPRAVSLPFALLCLVGSSAPAQAASVGPQTLSSPPQVRSQFAIADFDGDRRPDLATVQVGQSSSWDTHYWIAFQLSSGPRQTLGITAPTGGVQITSRDVNGDDFLDVIVTTAWTNQPVAVLLNDGKGNFRASSPSAFRGAFSTSEKSWALTTDELKDASAILISRYPGNCSEGSRFSSPRNVTGLLVLWASRNSHFSPVVSFLGRAPPSFVLHI
Ga0209011_100003523300027678Forest SoilVVCDKANKCRPVDCLSSRCWLRNGSDHAGLTLAQRPGTAMSGPMKTSHHLSLSCSRAVSLAFALLCFTGSAGSAHAASVGPQSRPSSSTQVTSQFAIADFDGDTRPDLATVQAGVNSSWDTHYWIAFQLSSGPRQTLSITVPTGGLQITSRDVNGDYFLDVIVTAAWTNRPVAVLLNDGQGNFRAFSPSAFPGAFSTSEKSGVSTRDEVRDATAVLLSRYPTGNCSERTRFSSPRNVTRQLVLRPSRNLPYCPVVSFLGRAPPPSFFS
Ga0209180_1000090273300027846Vadose Zone SoilMSGLMKALHQLSLSCSRAVSLSFALLCFVGSAGSAHAACAGPSSSPQVTPQFAIADFDGDRRPDLATVQVGQGSSWDTQYWVAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDSQGNFRASGPSAFPGAFTTSEKSCASTTDETKDATAVLLSRYPTGNCLEGSRFSSPRNITGLLVLWTSHNSPFSSVVSFLGRGPPSFVLHI
Ga0209180_1000296733300027846Vadose Zone SoilMSLLMKTLHQLTLSWPRAASLAFALLCFAGSAAPAHAASAGPQSLPSPPPQVRSQFAIADFDGDRRPDIATVHVGQSSSWETHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWANQPVAVLLNDGQGNFRASSPSAFPGAFATSEKSWALTTDEVKDATAALFSRYPTGNCSEGSRFSSPRNVTGLLVLWASRNSPFRPVISFLGRAPPSFVPHI
Ga0209166_1000326153300027857Surface SoilMPGLMKILHQLNRSWLRAMSLAFAFSCVILGFVACGNAASTGTQSWPVSSTQATSQTAIADFDGDDRPDLATIQVGRDSSPNTQYWIAFQLSRGSRQILGITAPTGGLQVTSLDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFRASNPSSFPGAFTTSEKSWDCIPDEVKEATAALLSRYPAGSYPDSSRFPSPRNVVGLLVLRTSRNSLSSAVVSFLGRAPPSFVLYI
Ga0209701_1001707033300027862Vadose Zone SoilMSLLMKTLHQLTLSWPRAVSLAFALLCIAGSAAPAHAASAGPQSLPSPPPQVRSQFAIADFDGDRRPDIATVHVGQSSSWETHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWANQPVAVLLNDGQGNFRASSPSAFPGAFATSEKSWALTTDEVKDATAALFSRYPTGNCSEGSRFSSPRNVTGLLVLWASRNSPFRPVISFLGRAPPSFVPHI
Ga0209701_1007478823300027862Vadose Zone SoilMSGLMKTLHRLTHSWPRAVSLAFALLCFVGSASSAHAASAGPQSWPVSSTQVTSQFAIADFDGDRRPDLATVQAGQVGSLDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDDFLDVIVTTAWTNRPVAVLLNDGQGNFRTSSPSAFPGAFTTSEKSWASNTDKITDATAVLLSRYPTGNCSEGSRFSSPRNVTGLLVLWASRNSHFSSVVSFLGRAPPSSVLHI
Ga0209283_1004542433300027875Vadose Zone SoilMKTLHRLNLSWSRAVRLALALLCFVGSAGPAHAASTDPQSWPPTSTQVTSQFAIADFDGDSRPDLATVQAGVSSSWDAHYWIAFQLSSGPRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPGAFTTSEKSWACTTDEVKDATAVLLSRYPTGNCSEVSRFSPPQNAAGLLLPWAPRSSALSVVVSFLGRAPPSFVLHI
Ga0209590_1006365923300027882Vadose Zone SoilMSGLMKTLRHLNRSWPRAVGSAFALLCFAASAGSVLAASTGPQSLPSPPQVRSQFAIADFDGDRRPDLATVHVAQSSSGDTHYWIAFQLSGGSRQTLGITAPIGGLQLTSRDVNGDDFLDVIITTAWTNQPVAVLLNDGRGNFRASSPSAFPGAFTTSEKSWACTTDELKDATAVLLSRYPTRNCAEGSRFSSPRSVTGLLGLRASRNSLSSAVVSFLGRAPPSFVLHI
Ga0209275_1002431433300027884SoilMKRLQQSKLSWSRAVGVAFVFLCLVLAFAACGNAASSEPQNWPVSSAQVTTQFALADFDGDNRPDLATVQAGRGNSSDTYYWIAFQLSSGPRQTLGVRAPNGGLHIATRDVNGDDFLDVIVTTAWTNRPVAVFLNDGRGNFRVSSPSGFPGAFTTSEKSWASSADEVRDVTAVLLSRYPTGNCSEAGTFFSPRNVNGLLTLWNFRSWHVAAVVSFLDRAPPSFVPHI
Ga0209068_1009496623300027894WatershedsVNAFETAMSRLMKTLGPLNRSWRGIIALTMAFLCVILGFAAYGSAAPTASQTRPFSSTQATSRFAIADFDGDNRPDLATVQVGDSNALDTHYWIAFQLSSGARRTLGITAPAGGLRITSQDVNGDDFLDIIVTTAWANRPVAILLNDGQGNFRVSDPSAFSAAFTTSDKSWASSADEATDATAILLSRYPTGNCPEGNEFSSPRNVTGLLVLRASRNSLSSPVVSFLGRAPPSFVLHN
Ga0209488_1013283523300027903Vadose Zone SoilMSGLMKTLRHLNRSWPRAVGSAFALLCFAASAGSVHAASTGPQSLPSPPQVRSQFAIADFDGDRRPDLATVHVAQSSSGDTHYWIAFQLSGGSRQTLGITAPIGGLQLTSRDVNGDDFLDVIITTAWTNQPVAVLLNDGRGNFRASSPSAFPGAFTTSEKSWACTTDELKDATAVLLSRYPTGNCPEGSRFSSPRSVTGLLGLRASRNSLSSAVVSFLGRAPPSFVLHI
Ga0209488_1060406713300027903Vadose Zone SoilMKTLRQRHLSWSKAVSAAFASLCMVLGFATCGNAAPGVPQSGPASSLQVTSQFAIADFDGDRRPDLATVQVGQGSSWDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVITTAWTNQPVAVLLNDGQGNFRATSPSAFPGAFTTSEESWASTFDEIKDATAVLLSRYPTGNCSEGSKFSSPRNVTGLLVLRASRNLPYRPVVSFLGRAPPSFILHI
Ga0209526_1009262923300028047Forest SoilFAFLCLILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGQGSSWDTHYWIAFQLSGGSRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFKASNPSAFPGAFRTSEESWICITDEIKDAVALLLSRYPTGDCQEVSRFSPARKVTGRFVLRASRNSLLSAVVSFLGRAPPSFILHI
Ga0137415_1112652213300028536Vadose Zone SoilQSGPASSLQVTSQFAIADFDGDRRPDLATVQVGQGSSWDAHYWIAFQLSGGSRQTLGITAPTGGLHITSRDVNGDSFLDVVITTAWTNQPVAVLLNDGQGSFRATSPSAFPGAFTTSEESWASTFDEIKDATSVLLSRYPTGNCSEGSKFSSPRNVTGLLVLRASRNLPYRPVVSFLGRAPPSFILHI
Ga0308309_1001903633300028906SoilVAFVFLCLVLAFAACGNAASSEPQNWPVSSAQVTTQFALADFDGDNRPDLATVQAGRGNSSDTYYWIAFQLSSGPRQTLGVRAPNGGLHIATRDVNGDDFLDVIVTTAWTNRPVAVFLNDGRGNFRVSSPSGFPGAFTTSEKSWASSADEVRDVTAVLLSRYPTGNCSEAGTFFSPRNVNGLLTLWNFRSWHVAAVVSFLDRAPPSFVPHI
Ga0073994_1005717923300030991SoilMSGLMKTLRQLNRSWPRAVSLAFAFLCVILGFAACGNAASTGPQSWPVSSTQATSQFAIADFDGDNRPDLATVQVGHRGSSDTQYWIAFQLSRGSRQILGITGPTGGLQVTSRDVNGDSFLDVVVTTTWTNRPVAVLLNDGQGNFRASSPSAFPGVFTTSEKSWACIADEVKEATAALLSRYPTGNCPESSRPSSPRNVTGLLVLRTSRNSLSSAVASFLGRAPPSFVLHI
Ga0307469_1101518913300031720Hardwood Forest SoilVIHSTAIVLAQHTETENTGGPRQSEQISARLTAYPPDADYETVRIIHAGLTLALMSGLMKTSHQLSLSCSRAASLAFALLCFVGSASSSQAASTDPRTWPPTAAQATSQFAFADFDGDRRPDLATVQAGQGSSWATQYWIAFQLSRGSPQTLGVIAPTGGLQITSRDVNSDSFLDVVVTTEWTNQPVAILLNDGQGNFTASGPSAFPGAFRTSEKSWICITHEIKDAVALLLSRYP
Ga0307475_1039391913300031754Hardwood Forest SoilMKTLRQRHLSWSKAVRGAFACLCVVLGFAPCGNAASGVPQSGPASVPQVTSQFAIADFDGDRRPDLATVQSGEGSSWDTHYWIAFQLSRGPRQILGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNQPVAVLLNDGKGNFRASSPSAFPRAFTASEKSRVSTTDEVKDATAVLLSRYPTGNCPEDSRFSSPRNVTGLLVLRASRNLPYCPVVSFLGRAPPSFVLHI
Ga0307475_1043423713300031754Hardwood Forest SoilNTGESTEKWTFSTADSPLRIRLCTCQRYDCVALDADGILDQKGWLVVSGFESKFMRTLRHRHLSWSHAISAAFALLCAIPGFAPCTNAAPSIPQSDPALSMQVTSQFAIADFDGDKRPDVATIQAGAGSSRDTHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNQPVAVLLNDGQGNFRTSKPSAFPGAFGTSEESWVSTADEIKDATAFFLSRYPTGNCVEGGRFSSPRNVTGLLAVWTSRGLRCRPVVSFLGRAPPCFFLHA
Ga0307473_1022280413300031820Hardwood Forest SoilMVGDKANTCRPVDCLSSGRWLRNRSDHAGLTLAHRPEPAMSGLMKTSHQLSLCCSKAVSLAFALLCFVGAAGSAHAASAGPQNWPSSSPQVTSQFAIADFDGDRRPDLATVQAGQSSSVDTQYWIAFQLSSGPRRTLGITAPSGGLQVTSVDVNGDDFLDVIVTAAWTNRPVAVLLNDGQGNFRASSPSAFPGAFTNSKKSCVYTTDETKDATAILLSRYPSGNCSDGSRFSLRRNVTGL
Ga0307479_1000939843300031962Hardwood Forest SoilMKTLRQRHLSRSKAVSGAFAFLCVVLGFAPCGNAASGVPQSGPASVPQVTSQFAIADFDGDRRPDLATVQSGEGSSWDTHYWIAFQLSRGPRQILGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNQPVAVLLNDGKGNFRASSPSAFPRAFTASEKSRVSTTDEVKDATAVLLSRYPTGNCPEDSRFSSPRNVTGLLVLRASRNLPYCPVVSFLGRAPPSFVLHI
Ga0307479_1073680913300031962Hardwood Forest SoilDIENGLMKTLCQKHLSWSKAVGAAFALLCVVLGFAPGGKAASGVPKSGSGSSLQVPSQFAIADFDGDKRPDLATVHAGVSSPWDTQYWIAFQLSGGPRQTLGITAPTGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAFPEAFTTSEKSWACTPDEINDPSAVLVSRYPTGDCPESSSFSPPRNLAWLLVPWASRNSLSSATVSFLGRAPPAFFIV
Ga0307471_10000322063300032180Hardwood Forest SoilMRLQNTGKSAEKWTFSTAASPLRIRLCTCQRYDCVALDADGILDQKGWLVVSGFESKFMRTLRHRHLSWSHAISAAFALLCAIPGFAPCTNAAPSIPQSDPALSMQVTSQFAIADFDGDKRPDVATIQAGAGSSRDTHYWIAFQLSGGSRQTLGITAPTGGLRITSRDVNGDSFLDVVVTTAWTNQPVAVLLNDGQGNFRTSKPSAFPGAFGTSEESWVSTADEIKDATAFFLSRYPTGNCVEGGRFSSPRNVTGLLAVWTSRGLRCRPVVSFLGRAPPCFFLHA
Ga0307471_10001744343300032180Hardwood Forest SoilMKTLRQSNLSWFRAVRAPLALLCLGFAACGNAASPGPQSWPVSSTQARSQFAIADFDGDNRPDLATVQVVQGSSWDTHYWIAFQLSRGSRQVLGVTAPTGGLQITSRDVNGDHFLDVVVTTAWTNRPVAVLLNDGQGNFRASSPSAVPVAFTTSENSWASTTNQFKDATAVLLSRHPTGNCSEGSRFSPSRKVTGLLVLRASRNSLFSPSVSFLGRAPPSFVPHI
Ga0307471_10042432923300032180Hardwood Forest SoilMVSSRMPVATRFRSCRIEGWSIALRLLSKFMKTFRQLNLSWPRAVSLAFALLCFVGSTAPAHAASSGPQSWPATAPLVTSQFAIADFDGDNRPDLATVQVGHGSSSDTQYWIAFQLSRGSRQILGITAPTGGLQVTSRDVNGDNFLDVVVTTAWTNLPVAVLLNDGQGSFRASSPSAFPGAFTTCQKSWACINDQVKEATAALLSRYPTGNCPEGSRFSSPRNVTGLFVLRTSRNSLSSAIVTFLGRAPPSFVLHI
Ga0307471_10142044313300032180Hardwood Forest SoilMHARLRLALRLEITVSWHMKTLHQLRLSWSRAVSLAFALLCLVGSSGSAHAASAGPPSWPATAPRVTSQFAIADYDEDSRPDLATVQAGQIGSSDTRYLIGFQLSTGQRQTIGITAPAGGLQITSRDVNGDSFLDVVVTTAWTNRPVAVLLNDGQGNFRASNPSAFPEAFTTSEKSWASTTDEVKGATAVLLSRYPTGNCPEGSRLSSPRN
Ga0307471_10152158113300032180Hardwood Forest SoilAHAASAGPPSWPASAPRVTSQFAIADYDGDGRPDLATVQAGQIGSSDTRYWIGFQLSRGPRQSLAITALAGGLQITSRDVNGDNFLDVIVTRAWTNQPVAVLLNDGQGNFRASSPSAFPGAFTLSEKSCTSTTDETKDATAVLLSRNPMGDCSRGRFPSPRNVMALLVPPPSRNLFSFVVVSFFGRAPPSFIFHI
Ga0307471_10287935513300032180Hardwood Forest SoilSQSWPRTSTHVTSQFAIADFDGDNRPDIATVQVGQGSSWDTHYWVAFQLSGGSRQTLGITAPTGGLQVTSRDANGDNFLDVVLTTAWTNRPVAVLLNDGLGNFRPSSPSAFPGAFTTSEKSWACSNDEVQDATAALLSHYQTDNFLEGRRFSPPRNVTGLLALRTSGNLLSSTVLSFLDRAPPSFAL
Ga0307472_10138028113300032205Hardwood Forest SoilTLHQLSLSCSRAVCLAFALLCFVGSAGSAHAASAGRSSSPQVTSQFAIADFDGDRRPDLATVQAGQSSSTDTQYWIAFQLSRGPRQTLGITAPTGGLQITSCDVNGDDFLDVIVTTTWTKRPVAVLLNDGQGNFRASSLSAFPGAFTTSEKSCAPRTDETKDATAVLLLRDPTGNCLEGSRFSSPRNITGLLVLWTSHNSPFSTVVSFLGRAPPSFIFYI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice