NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F057232

Metagenome Family F057232

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F057232
Family Type Metagenome
Number of Sequences 136
Average Sequence Length 374 residues
Representative Sequence MCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK
Number of Associated Samples 101
Number of Associated Scaffolds 136

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 22.06 %
% of genes near scaffold ends (potentially truncated) 36.03 %
% of genes from short scaffolds (< 2000 bps) 45.59 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score0.85

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.265 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(43.382 % of family members)
Environment Ontology (ENVO) Unclassified
(43.382 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(48.529 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 59.66%    β-sheet: 5.87%    Coil/Unstructured: 34.47%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.85
Powered by PDBe Molstar

Potential Novel Structural Fold:

This family has a high confidence model (pTM >=0.7) with no significant hits to either SCOPe or PDB biological assemblies. It is, therefore, classified as a potential novel structural fold.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 136 Family Scaffolds
PF03572Peptidase_S41 6.62
PF04389Peptidase_M28 4.41
PF01156IU_nuc_hydro 3.68
PF02597ThiS 3.68
PF00486Trans_reg_C 2.94
PF04255DUF433 2.21
PF01431Peptidase_M13 2.21
PF02887PK_C 2.21
PF05231MASE1 2.21
PF02852Pyr_redox_dim 1.47
PF04956TrbC 1.47
PF00589Phage_integrase 1.47
PF02824TGS 1.47
PF03602Cons_hypoth95 1.47
PF00069Pkinase 1.47
PF05649Peptidase_M13_N 1.47
PF13432TPR_16 0.74
PF01713Smr 0.74
PF00080Sod_Cu 0.74
PF12796Ank_2 0.74
PF07992Pyr_redox_2 0.74
PF00535Glycos_transf_2 0.74
PF12770CHAT 0.74
PF00271Helicase_C 0.74
PF00005ABC_tran 0.74
PF00144Beta-lactamase 0.74
PF07973tRNA_SAD 0.74
PF13673Acetyltransf_10 0.74
PF01261AP_endonuc_2 0.74
PF12969DUF3857 0.74
PF10282Lactonase 0.74
PF00999Na_H_Exchanger 0.74
PF00006ATP-synt_ab 0.74
PF01266DAO 0.74
PF13188PAS_8 0.74
PF05960DUF885 0.74
PF00884Sulfatase 0.74
PF12867DinB_2 0.74
PF10518TAT_signal 0.74
PF01425Amidase 0.74
PF13365Trypsin_2 0.74
PF01676Metalloenzyme 0.74

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 136 Family Scaffolds
COG0793C-terminal processing protease CtpA/Prc, contains a PDZ domainPosttranslational modification, protein turnover, chaperones [O] 6.62
COG0515Serine/threonine protein kinaseSignal transduction mechanisms [T] 5.88
COG3590Predicted metalloendopeptidasePosttranslational modification, protein turnover, chaperones [O] 3.68
COG1957Inosine-uridine nucleoside N-ribohydrolaseNucleotide transport and metabolism [F] 3.68
COG1977Molybdopterin synthase sulfur carrier subunit MoaDCoenzyme transport and metabolism [H] 3.68
COG2104Sulfur carrier protein ThiS (thiamine biosynthesis)Coenzyme transport and metabolism [H] 3.68
COG3447Integral membrane sensor domain MASE1Signal transduction mechanisms [T] 2.21
COG3851Signal transduction histidine kinase UhpB, glucose-6-phosphate specificSignal transduction mechanisms [T] 2.21
COG0469Pyruvate kinaseCarbohydrate transport and metabolism [G] 2.21
COG0642Signal transduction histidine kinaseSignal transduction mechanisms [T] 2.21
COG2442Predicted antitoxin component of a toxin-antitoxin system, DUF433 familyDefense mechanisms [V] 2.21
COG2890Methylase of polypeptide chain release factorsTranslation, ribosomal structure and biogenesis [J] 1.47
COG3838Type IV secretory pathway, VirB2 component (pilin)Intracellular trafficking, secretion, and vesicular transport [U] 1.47
COG074216S rRNA G966 N2-methylase RsmDTranslation, ribosomal structure and biogenesis [J] 1.47
COG109223S rRNA G2069 N7-methylase RlmK or C1962 C5-methylase RlmITranslation, ribosomal structure and biogenesis [J] 1.47
COG2242Precorrin-6B methylase 2Coenzyme transport and metabolism [H] 1.47
COG2265tRNA/tmRNA/rRNA uracil-C5-methylase, TrmA/RlmC/RlmD familyTranslation, ribosomal structure and biogenesis [J] 1.47
COG3004Na+/H+ antiporter NhaAEnergy production and conversion [C] 0.74
COG3263NhaP-type Na+/H+ and K+/H+ antiporter with C-terminal TrkAC and CorC domainsEnergy production and conversion [C] 0.74
COG4651Predicted Kef-type K+ transport protein, K+/H+ antiporter domainInorganic ion transport and metabolism [P] 0.74
COG4805Uncharacterized conserved protein, DUF885 familyFunction unknown [S] 0.74
COG0025NhaP-type Na+/H+ or K+/H+ antiporterInorganic ion transport and metabolism [P] 0.74
COG0154Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidaseTranslation, ribosomal structure and biogenesis [J] 0.74
COG0475Kef-type K+ transport system, membrane component KefBInorganic ion transport and metabolism [P] 0.74
COG1680CubicO group peptidase, beta-lactamase class C familyDefense mechanisms [V] 0.74
COG1686D-alanyl-D-alanine carboxypeptidaseCell wall/membrane/envelope biogenesis [M] 0.74
COG2032Cu/Zn superoxide dismutaseInorganic ion transport and metabolism [P] 0.74
COG2367Beta-lactamase class ADefense mechanisms [V] 0.74


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms99.26 %
UnclassifiedrootN/A0.74 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2088090014|GPIPI_17336618All Organisms → cellular organisms → Bacteria → Acidobacteria4663Open in IMG/M
3300001661|JGI12053J15887_10044514All Organisms → cellular organisms → Bacteria2487Open in IMG/M
3300002908|JGI25382J43887_10085217All Organisms → cellular organisms → Bacteria → Acidobacteria1692Open in IMG/M
3300002914|JGI25617J43924_10117747All Organisms → cellular organisms → Bacteria938Open in IMG/M
3300004092|Ga0062389_100952044All Organisms → cellular organisms → Bacteria1042Open in IMG/M
3300005166|Ga0066674_10000145All Organisms → cellular organisms → Bacteria17797Open in IMG/M
3300005174|Ga0066680_10137850All Organisms → cellular organisms → Bacteria1520Open in IMG/M
3300005179|Ga0066684_10195458All Organisms → cellular organisms → Bacteria1306Open in IMG/M
3300005434|Ga0070709_10145177All Organisms → cellular organisms → Bacteria1634Open in IMG/M
3300005436|Ga0070713_100011447All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium6461Open in IMG/M
3300005437|Ga0070710_10132679All Organisms → cellular organisms → Bacteria1520Open in IMG/M
3300005439|Ga0070711_100012980All Organisms → cellular organisms → Bacteria5222Open in IMG/M
3300005451|Ga0066681_10009486All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Terriglobus → unclassified Terriglobus → Terriglobus sp. TAA 434702Open in IMG/M
3300005537|Ga0070730_10147477All Organisms → cellular organisms → Bacteria1598Open in IMG/M
3300005540|Ga0066697_10031974All Organisms → cellular organisms → Bacteria2920Open in IMG/M
3300005555|Ga0066692_10062957All Organisms → cellular organisms → Bacteria2102Open in IMG/M
3300005556|Ga0066707_10034515All Organisms → cellular organisms → Bacteria2833Open in IMG/M
3300005560|Ga0066670_10114307All Organisms → cellular organisms → Bacteria → Acidobacteria1531Open in IMG/M
3300005568|Ga0066703_10249682All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Kitasatospora → unclassified Kitasatospora → Kitasatospora sp. SolWspMP-SS2h1077Open in IMG/M
3300005574|Ga0066694_10007275All Organisms → cellular organisms → Bacteria4601Open in IMG/M
3300005586|Ga0066691_10055123All Organisms → cellular organisms → Bacteria2141Open in IMG/M
3300005587|Ga0066654_10111237All Organisms → cellular organisms → Bacteria1340Open in IMG/M
3300006031|Ga0066651_10034282All Organisms → cellular organisms → Bacteria2292Open in IMG/M
3300006796|Ga0066665_10023210All Organisms → cellular organisms → Bacteria3937Open in IMG/M
3300006797|Ga0066659_10005706All Organisms → cellular organisms → Bacteria6258Open in IMG/M
3300007258|Ga0099793_10053324All Organisms → cellular organisms → Bacteria1790Open in IMG/M
3300009012|Ga0066710_100477289All Organisms → cellular organisms → Bacteria → Acidobacteria1875Open in IMG/M
3300009038|Ga0099829_10064644All Organisms → cellular organisms → Bacteria2757Open in IMG/M
3300009038|Ga0099829_10165332All Organisms → cellular organisms → Bacteria1774Open in IMG/M
3300009088|Ga0099830_10038296All Organisms → cellular organisms → Bacteria → Acidobacteria3301Open in IMG/M
3300009088|Ga0099830_10218924All Organisms → cellular organisms → Bacteria1496Open in IMG/M
3300009089|Ga0099828_10080481All Organisms → cellular organisms → Bacteria2779Open in IMG/M
3300009089|Ga0099828_10146996All Organisms → cellular organisms → Bacteria2080Open in IMG/M
3300009137|Ga0066709_100004898All Organisms → cellular organisms → Bacteria10845Open in IMG/M
3300009137|Ga0066709_101417361All Organisms → cellular organisms → Bacteria1009Open in IMG/M
3300009137|Ga0066709_101417362All Organisms → cellular organisms → Bacteria1009Open in IMG/M
3300010326|Ga0134065_10004901All Organisms → cellular organisms → Bacteria3370Open in IMG/M
3300011269|Ga0137392_10065587All Organisms → cellular organisms → Bacteria2779Open in IMG/M
3300011269|Ga0137392_10142108All Organisms → cellular organisms → Bacteria1931Open in IMG/M
3300011270|Ga0137391_10008771All Organisms → cellular organisms → Bacteria7986Open in IMG/M
3300011270|Ga0137391_10020404All Organisms → cellular organisms → Bacteria5466Open in IMG/M
3300012189|Ga0137388_10226813All Organisms → cellular organisms → Bacteria1693Open in IMG/M
3300012202|Ga0137363_10006821All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium7128Open in IMG/M
3300012202|Ga0137363_10231241All Organisms → cellular organisms → Bacteria1496Open in IMG/M
3300012203|Ga0137399_10004213All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae7626Open in IMG/M
3300012203|Ga0137399_10005077All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium7159Open in IMG/M
3300012203|Ga0137399_10112845All Organisms → cellular organisms → Bacteria → Acidobacteria2123Open in IMG/M
3300012205|Ga0137362_10010437All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium6766Open in IMG/M
3300012359|Ga0137385_10276722All Organisms → cellular organisms → Bacteria → Acidobacteria1448Open in IMG/M
3300012361|Ga0137360_10182733All Organisms → cellular organisms → Bacteria1686Open in IMG/M
3300012361|Ga0137360_10215598All Organisms → cellular organisms → Bacteria1561Open in IMG/M
3300012361|Ga0137360_10246633All Organisms → cellular organisms → Bacteria1465Open in IMG/M
3300012362|Ga0137361_10000569All Organisms → cellular organisms → Bacteria21094Open in IMG/M
3300012363|Ga0137390_10133725All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2460Open in IMG/M
3300012363|Ga0137390_10291107All Organisms → cellular organisms → Bacteria1616Open in IMG/M
3300012363|Ga0137390_10291266All Organisms → cellular organisms → Bacteria1616Open in IMG/M
3300012582|Ga0137358_10005466All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium7644Open in IMG/M
3300012582|Ga0137358_10014966All Organisms → cellular organisms → Bacteria4910Open in IMG/M
3300012683|Ga0137398_10008117All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia5148Open in IMG/M
3300012685|Ga0137397_10003027All Organisms → cellular organisms → Bacteria11720Open in IMG/M
3300012918|Ga0137396_10012498All Organisms → cellular organisms → Bacteria5249Open in IMG/M
3300012918|Ga0137396_10112229All Organisms → cellular organisms → Bacteria1957Open in IMG/M
3300012923|Ga0137359_10469063All Organisms → cellular organisms → Bacteria1113Open in IMG/M
3300012925|Ga0137419_10150416All Organisms → cellular organisms → Bacteria1679Open in IMG/M
3300012925|Ga0137419_10169912All Organisms → cellular organisms → Bacteria1591Open in IMG/M
3300012925|Ga0137419_10317503All Organisms → cellular organisms → Bacteria1196Open in IMG/M
3300012927|Ga0137416_10046993All Organisms → cellular organisms → Bacteria → Acidobacteria3000Open in IMG/M
3300012927|Ga0137416_10246622All Organisms → cellular organisms → Bacteria1457Open in IMG/M
3300012927|Ga0137416_10327075All Organisms → cellular organisms → Bacteria1278Open in IMG/M
3300012927|Ga0137416_10339394All Organisms → cellular organisms → Bacteria1257Open in IMG/M
3300012929|Ga0137404_10065534All Organisms → cellular organisms → Bacteria2833Open in IMG/M
3300012929|Ga0137404_10120007All Organisms → cellular organisms → Bacteria2148Open in IMG/M
3300012929|Ga0137404_10246706All Organisms → cellular organisms → Bacteria1531Open in IMG/M
3300012975|Ga0134110_10003626All Organisms → cellular organisms → Bacteria5645Open in IMG/M
3300012977|Ga0134087_10237344All Organisms → cellular organisms → Bacteria831Open in IMG/M
3300014150|Ga0134081_10047163All Organisms → cellular organisms → Bacteria1280Open in IMG/M
3300014501|Ga0182024_10220362All Organisms → cellular organisms → Bacteria2572Open in IMG/M
3300015241|Ga0137418_10141322All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2129Open in IMG/M
3300015241|Ga0137418_10159673All Organisms → cellular organisms → Bacteria1979Open in IMG/M
3300015241|Ga0137418_10170773All Organisms → cellular organisms → Bacteria1901Open in IMG/M
3300015264|Ga0137403_10012695All Organisms → cellular organisms → Bacteria → Acidobacteria9227Open in IMG/M
3300017970|Ga0187783_10016616All Organisms → cellular organisms → Bacteria5434Open in IMG/M
3300018431|Ga0066655_10005745All Organisms → cellular organisms → Bacteria5069Open in IMG/M
3300018433|Ga0066667_10012791All Organisms → cellular organisms → Bacteria4137Open in IMG/M
3300020170|Ga0179594_10017163All Organisms → cellular organisms → Bacteria2141Open in IMG/M
3300020199|Ga0179592_10006240All Organisms → cellular organisms → Bacteria → Acidobacteria5093Open in IMG/M
3300020199|Ga0179592_10011198All Organisms → cellular organisms → Bacteria3884Open in IMG/M
3300020199|Ga0179592_10038931All Organisms → cellular organisms → Bacteria2153Open in IMG/M
3300020580|Ga0210403_10091530All Organisms → cellular organisms → Bacteria2461Open in IMG/M
3300020583|Ga0210401_10004054All Organisms → cellular organisms → Bacteria15399Open in IMG/M
3300020583|Ga0210401_10171228All Organisms → cellular organisms → Bacteria2026Open in IMG/M
3300021046|Ga0215015_10857375All Organisms → cellular organisms → Bacteria2497Open in IMG/M
3300021168|Ga0210406_10149439All Organisms → cellular organisms → Bacteria1961Open in IMG/M
3300021168|Ga0210406_10195051All Organisms → cellular organisms → Bacteria1681Open in IMG/M
3300021170|Ga0210400_10032944All Organisms → cellular organisms → Bacteria4021Open in IMG/M
3300021180|Ga0210396_10013759All Organisms → cellular organisms → Bacteria → Acidobacteria7545Open in IMG/M
3300021374|Ga0213881_10000169All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae53317Open in IMG/M
3300021406|Ga0210386_10014277All Organisms → cellular organisms → Bacteria6205Open in IMG/M
3300021406|Ga0210386_10053100All Organisms → cellular organisms → Bacteria3223Open in IMG/M
3300021406|Ga0210386_10484326All Organisms → cellular organisms → Bacteria1069Open in IMG/M
3300021407|Ga0210383_10199842All Organisms → cellular organisms → Bacteria1709Open in IMG/M
3300021420|Ga0210394_10056515All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae → Dyella → Dyella thiooxydans3420Open in IMG/M
3300021433|Ga0210391_10523232All Organisms → cellular organisms → Bacteria931Open in IMG/M
3300021474|Ga0210390_10336789All Organisms → cellular organisms → Bacteria1277Open in IMG/M
3300021478|Ga0210402_10467187All Organisms → cellular organisms → Bacteria1172Open in IMG/M
3300024347|Ga0179591_1217242All Organisms → cellular organisms → Bacteria1783Open in IMG/M
3300025898|Ga0207692_10091702All Organisms → cellular organisms → Bacteria1649Open in IMG/M
3300025916|Ga0207663_10224976All Organisms → cellular organisms → Bacteria1367Open in IMG/M
3300025928|Ga0207700_10494170All Organisms → cellular organisms → Bacteria1082Open in IMG/M
3300026277|Ga0209350_1011140All Organisms → cellular organisms → Bacteria2899Open in IMG/M
3300026309|Ga0209055_1095193All Organisms → cellular organisms → Bacteria1203Open in IMG/M
3300026320|Ga0209131_1018938All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4161Open in IMG/M
3300026323|Ga0209472_1003690All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae8343Open in IMG/M
3300026325|Ga0209152_10066414All Organisms → cellular organisms → Bacteria1297Open in IMG/M
3300026333|Ga0209158_1082832All Organisms → cellular organisms → Bacteria1248Open in IMG/M
3300026482|Ga0257172_1002299All Organisms → cellular organisms → Bacteria → Acidobacteria2522Open in IMG/M
3300026532|Ga0209160_1042078All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2740Open in IMG/M
3300026540|Ga0209376_1000016All Organisms → cellular organisms → Bacteria178432Open in IMG/M
3300026548|Ga0209161_10058982All Organisms → cellular organisms → Bacteria2468Open in IMG/M
3300026550|Ga0209474_10056084All Organisms → cellular organisms → Bacteria → Acidobacteria2797Open in IMG/M
3300026551|Ga0209648_10009011All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae8638Open in IMG/M
3300026551|Ga0209648_10145595All Organisms → cellular organisms → Bacteria → Acidobacteria1878Open in IMG/M
3300026557|Ga0179587_10016844All Organisms → cellular organisms → Bacteria3884Open in IMG/M
3300027643|Ga0209076_1033468All Organisms → cellular organisms → Bacteria1437Open in IMG/M
3300027663|Ga0208990_1020362Not Available2166Open in IMG/M
3300027835|Ga0209515_10059247All Organisms → cellular organisms → Bacteria2848Open in IMG/M
3300027857|Ga0209166_10018417All Organisms → cellular organisms → Bacteria4388Open in IMG/M
3300027862|Ga0209701_10036011All Organisms → cellular organisms → Bacteria → Acidobacteria3204Open in IMG/M
3300027862|Ga0209701_10128568All Organisms → cellular organisms → Bacteria1563Open in IMG/M
3300028536|Ga0137415_10176944All Organisms → cellular organisms → Bacteria1960Open in IMG/M
3300028536|Ga0137415_10185235All Organisms → cellular organisms → Bacteria1906Open in IMG/M
3300028536|Ga0137415_10341721All Organisms → cellular organisms → Bacteria1300Open in IMG/M
3300031708|Ga0310686_108030848All Organisms → cellular organisms → Bacteria1602Open in IMG/M
3300031754|Ga0307475_10445195All Organisms → cellular organisms → Bacteria1041Open in IMG/M
3300031962|Ga0307479_10045733All Organisms → cellular organisms → Bacteria4205Open in IMG/M
3300032174|Ga0307470_10060437All Organisms → cellular organisms → Bacteria → Acidobacteria1993Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil43.38%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil17.65%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil11.76%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil8.82%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.15%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.94%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.21%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.47%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.47%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.47%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil0.74%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.74%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.74%
PermafrostEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost0.74%
Exposed RockEnvironmental → Terrestrial → Rock-Dwelling (Subaerial Biofilms) → Unclassified → Unclassified → Exposed Rock0.74%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014501Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017970Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021374Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R08EnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021433Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-OEnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300024347Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025898Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027663Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027835Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_026227902088090014SoilLFTIFLSIQPGAQERPAENQQNAENAAVKYLRADASLRQSYALPPDASEKLQRAVELPLDIEDEKLVAAADEALIEFHHGATIQRCDWLMSAGDGLFATTAHRGAIKELVAVAEIRARLRFRDGNSPGAIADALAAMAAARHLSVDGSIASVLFAYRLENSVTGILVQNLLRLSRLQLQGLASGLNSLPDGSNLITAFESEKLSRNDLLVVVQDATSRDELIEHLLHNIPVLQSNRELAAQVVDGCGGSVKGYENCVDQQHSFYVSWAPRFKLPPEQFEKAYKIEFEEISKTNPVVQQFTPALPRIRWSEAYEQTRRAMLRAAIAVQLEGPRAVNQRLDPYDKKPFIXXXXXXXAGGGGFRLESRLTADGIPISLAILPNSEERKATPK
JGI12053J15887_1004451413300001661Forest SoilMFRLLPMRTSRAVAMFVAMCSPTWLGAQLPPAVNQQDTENAAVKYLRADAALRQSYALAPDSAAKLEKALELPLDEEDKKLVGAAEDALVEFHHGATIKRCDWAMSVEDGPLASTAXRGAIRELVAVSGLRARLRFRDGDTXGAXGDALAAMAAARHLSVDGSLASVLIAYKLEKAITGVLAQNLLRLSPAQLNELASGLDALPSGFSLGTAFKSEKLSRNDLLSVAQVAKSRDELIEQLSNKIPALQSNKGLAGEIVDGCGGSVKGFVNCVDQQRSFYTSWAPRFTLPPEQFEKAYKAEIEDLSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAIRLEGPRALNQHLDPYDKNPFSYIPVNGGFRLESRLSDGGIPISLSIVPNSEERK*
JGI25382J43887_1008521723300002908Grasslands SoilMRTATALAALVAMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
JGI25617J43924_1011774713300002914Grasslands SoilLDGEDEKLVAAAEDALVEFHHGTTIKRCDWAVSAEDGPLANTAHRGAIKELVAVSGLRARLRFRDGDIPGAMGDALAAMAAARHLSVDGSLASVLFAYKLENAITGVLVQNLLGFSPAELNELASGLDALPSGSSLGTAFESEKVSRNDLLAIVQVAKSRDELIERLLNKVPALQSNRGLAGEIVDGCGGSVKGFLNCVDQRQSFYTSWAPRFALPPEQFESAYKAEIEEVSRSNPVIRVFTPALPRFRWAEAYNQTRRVLLHAAIAVRLDGPRALNQHLVNSG*
Ga0062389_10095204413300004092Bog Forest SoilMKPIMVLLLLFLVFPSFWLAAQTPSGGNHQSVENAAVKYLRADASLRQSYELPPDAAAKLQKALESPLDIEDEKLVTAAEDALVEFHHGATIERCDWVMSSEDGPFANTAHRGAIKELVAVAEIRSRLRFRDGDMTGGIDDAVAAMAAARHLSVDGSLASALFAYRIENSVTTILARNLLRLSTAQLRELENAINALPRGSDLGTAFESEKLNRNDILASVQGATTRDDLIEGLLRNIPVLKSNRELAEQIVDGCGGSVKGFTACVNQQQSFYVSWAPRFTLPPEEFEKVYKVEFDSLSKANPVVRQFTPALPRFRWAEAYQQTRRALLHSAIAVQLDGPKAVSLAP
Ga0066674_1000014533300005166SoilMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0066680_1013785013300005174SoilAVKYLRADAALRQSYALAPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLKKVPALQSNKGLAGEIVDGCGGSIKGFVNCVDQQQSFYAAWARRFALPPEQFEKTYKGEFEELARANPFVRQFTPDLSRFRWAEAYNQTRRALLQAAIAVRLDGPKALNQHPDPYNKTTFSYIPVDGGFRLESLLREGGIPISLSIVPNSER*
Ga0066684_1019545813300005179SoilPDAAAKLEKALASPLDEEDEKLVAAAEDALVEFHHGAASKGCDWTMSDEDGALANTAHRGAITELVAVSGLRARLRFRDGNTPGAIDDALAAIAAVRHLSVDGSLASVLFAYKLERAITAVLAQNLLRFSPAELNELASGLGALPSGFDLGTAFESEKVRRNDFLAIAQATKSRDELIEQLLNKVPVLRSNKELAGEIVDGCGGSVKGFVNCVEQQQSFYTSWAPRFALSPEQFEKAYKAEIEELARVNPVIRQFTPPLPRFRWAEAYNETRRALLHAAIVVRLDGPKGLNQHLDPFDQNPFSYIPIDGGFRLESRLTEGGIPISISIVPNSEERKASPR*
Ga0070709_1014517713300005434Corn, Switchgrass And Miscanthus RhizosphereLVAWAVLLPSRLAAQTASAGNHQSAENAAVKYLRADASLRQSYALPADAVPMLQKSLESPLDLEDEKLITAADEALVEFHHGTASSRCDWVMSSQDGPLTNTAHRGAMKELVAVVEIRSRLRFRDGDMPGAMDDAVAAIAAARHLSVDGSLASVLFAYKMENSVMGILSRNLLRLSSEQLRELATAINALPNGSDLKIALESEKLNRNDILDSVQGAKTRDDLIARLLRNIPFLKSNRELATQIVDGCGGSVKGFRDCVDQQQAFYISWASRFALAPEEFEKAYKLEFEKLSTANPLVWQFTPFLPRFRWTEAYEQTHRALLHTAIAVRLDGPRAVSVSPDPYDRKPFTYVALGEGFRLESRLVEGGIPISLSIMPSAEDRKPGPK*
Ga0070713_10001144713300005436Corn, Switchgrass And Miscanthus RhizosphereMSGNVYRAVDSLTRRRRDLTSLLIPQGKLSDVKPIIAPLILCVLLPPSWLKAQTPSAGNHQSAENAAVKYLRADASLRQSYPLPADAMPNLQKSVESPLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSSEDGPLTNTAHRGAIKELVAVAEIRSRLRFRDGDIPGAIDDAVAAIAAARHLSLDGSLASVLFAYRIENSVTGIVARNLLRLSTVQLRELESAINGLPNGSDLKIAFEAEKLDRNDILASVQGATTRDELIEGLLRNIPVLKSNRVLAAQIVDGCGGSVRGFTHCVDQQQSFYVSWVPRFTLSPEEFEKAYKVEFDSLSKANPVVWQFTPALPRFRWTEAYEETRRALLHTAIAVRLDGPKAVNLSLDPYNGKPF
Ga0070710_1013267923300005437Corn, Switchgrass And Miscanthus RhizosphereLRADASLRQSYPLPADAMPNLQKSVESPLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSSEDGPLTNTAHRGAIKELVAVAEIRSRLRFRDGDIPGAIDDAVAAIAAARHLSLDGSLASVLFAYRIENSVTGIVARNLLRLSTVQLRELESAINGLPNGSDLKIAFEAEKLDRNDILASVQGATTRDELIEGLLRNIPVLKSNRVLAAQIVDGCGGSVRGFTHCVDQQQSFYVSWVPRFTLSPEEFEKAYKVEFDSLSKANPVVWQFTPALPRFRWTEAYEETRRALLHTAIAVRLDGPKAVNLSLDPYNGKPFTFIALGEGFRLESQLVDGGIPISLSVVPGAEDRKTVSK*
Ga0070711_10001298023300005439Corn, Switchgrass And Miscanthus RhizosphereMSGNVYRAVDSLTRRRRDLTSLLIPQGKLSGVKPIIAPLILCVLLPPSWLKAQTPSAGNHQSAENAAVKYLRADASLRQSYPLPADAMPNLQKSVESPLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSSEDGPLTNTAHRGAIKELVAVAEIRSRLRFRDGDIPGAIDDAVAAIAAARHLSLDGSLASVLFAYRIENSVTGIVARNLLRLSTVQLRELESAINGLPNGSDLKIAFEAEKLDRNDILASVQGATTRDELIEGLLRNIPVLKSNRVLAAQIVDGCGGSVRGFTHCVDQQQSFYVSWVPRFTLSPEEFEKAYKVEFDSLSKANPVVWQFTPALPRFRWTEAYEETRRALLHTAIAVRLDGPKAVNLSLDPYNGKPFTFIALGEGFRLESQLVDGGIPISLSVVPGAEDRKTVSK*
Ga0066681_1000948633300005451SoilMRTATALAALVAMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGTLASVLFASKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0070730_1014747723300005537Surface SoilMKTLIVSVVLFTMPPAIWLKAQNPPIEDEESTKNAAVKYLRAEASLRQSYALPPNAAANLQKALESPLDAEDEKLVAAADRALIEFRRGASIKRCDWAMSAEDGPLANTAHRGAVKELVAVAGIRARLRFRDGNIPGAMDDVLAAMAAARHLSVDGSLASVLIAYKLEDSVTGVLVQNLLRFSPAQLRELSNGLAGLPGGSNLGAALESEKLGRNDFVAIIQSAKTRDDLIEQLLQDIPALQSDRGLAAQIVDGCGGSVKGFVNCVDQQHSFYESWAPRFALPPEQFEMDYKVEFDEISKMNPVARQFIPALPRFRWAEADEQTRRALLQTAIAVRLLGPEALNQHIDPYDKKPFAYTAVDGGFHLESRLTDGVIPISLSIMPSSKEQKAIPK*
Ga0066697_1003197423300005540SoilMKTGLGLVFVVIVSAATVLAEGISRSGGQEETENAAVKYLRADAALRQSYALPPDAATELLKALDSPLNGEDEKLIAAAADALVEFHHGADLKRCDWTMSVEDGPLANTAHRGAVKDLVAVAGLRARLRFRDGDTEGAVSDALAAMAATRHLSVDGSLASVLFAYRLEDAVTRVLTQNLYRLSPVQLNQLASGLEVLPSGWSLAIAFKSEKVNRNDLFLTLADEARSRDDLVARLLKKVPVLESDSVRAGEIVNACGGSVNGFRTCAQQQQSFYTSWSSRFTLPPNQFESVCKSEMEGVAGANPLIRLFTPNLPRFRWADLYRQTRRALLNAAIAVRLNRVSAVNQHPDPYDGKPFSYFPVDGGFRLESRLTEGSIPIALLIPKSSSDPNGNHD*
Ga0066692_1006295723300005555SoilMKAAIALAVLAEMFCSIRLGAQTLPDGNQQDIENAAVKYLRADASLRQSYALPSDAAAKLQKALDSPLNEEDERLVAAADEALTEFGHGAASRRCNWEMSTEDGPLASTAHRGAIMELVSVSGLRARLRFRDGDTPGAMDDLLAAMAAARHLSVDGSLASVLFAYKLENALTRVLALNLYHFSSGQLKELKSRLDDLPTGSSLGAAFAAEKVGRNNVLDIAQRAKSRDELIEMLLKNVPILESNRGLAIEVVDGCGGTVKDFLNCVDQQQSLYNAWASRFNLAPEQFEREYKAEIEKVSKENPVIRQFTPALPRFRWTEAYCQTRRALLQAAIAVEL
Ga0066707_1003451533300005556SoilMKAAIALAVLAEMFCSIRLGAQTLPDGNQQDIENAAVKYLRADASLRQSYALPSDAAAKLQKALDSPLNEEDERLVAAADEALTEFGHGAASRRCNWEMSTEDGPLASTAHRGAIMELVSVSGLRARLRFRDGDTPGAMDDLLAAMAAARHLSVDGSLASVLFAYKLENALTRVLALNLYHFSSGQLKELKSRLDDLPTGSSLGAAFAAEKVGRNNVLDIAQRAKSRDELIEMLLKNVPILESNRGLAIEVVDGCGGTVKDFLNCVDQQQSLYNAWASRFNLAPEQFEREYKAEIEKVSKENPVIRQFTPALPRFRWTEAYCQTRRALLQAAIAVELDGASTLSRHLDPYDRNRFSYGPVDRGFRLQSQLSDNGIPISLLVVTKPTNDALSPD*
Ga0066670_1011430723300005560SoilELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0066703_1024968213300005568SoilLQKALDSPLNEEDERLVAAADEALTEFGHGAASRRCNWEMSTEDGPLASTAHRGAIMELVSVSGLRARLRFRDGDTPGAMDDLLAAMAAARHLSVDGSLASVLFAYKLENALTRVLALNLYHFSSGQLKELKSRLDDLPTGSSLGAAFAAEKVGRNNVLDIAQRAKSRDELIEMLLKNVPILESNRGLAIEVVDGCGGTVKDFLNCVDQQQSLYNAWASRFNLAPEQFEREYKAEIEKVSKENPVIRQFTPALPRFRWTEAYCQTRRALLQAASAVELDGASTLSRHLDPYDRNRFSYGPVDRGFRLQSQLSDNGIPISLLVVTKPTNDALSPD*
Ga0066694_1000727533300005574SoilMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0066691_1005512333300005586SoilDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0066654_1011123723300005587SoilPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0066651_1003428223300006031SoilMRTATALAALVAMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWSPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0066665_1002321023300006796SoilMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTALESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0066659_1000570643300006797SoilMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLTRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0099793_1005332423300007258Vadose Zone SoilMFVAMCSPTWLGAQLPPAVNQQDTENAAVKYLRADAALRQSYALAPDSAAKLEKALELPLDEEDKKLVGAAEEALVEFDHGATIKRCDWAMSVEDGPLASTAHRGAIRELVAVSGLRARLRFRDGDTRGAIGDALAAMAAARHLSVDGSLASVLIAYKLEKAITGVLAQNLLRLSPAQLNELASGLDALPSGFSLGTAFKSEKLSRNDLLSVAQVAKSRDELIEQLSNKIPALQSNKGLAGEIVDGCGGSVKGFVNCVDQQRSFYTSWAPRFTLPPEQFEKAYKAEIGDLSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAIRLEGPRALNQHLDPYDKNPFSYIPVNGGFRLESRLSDGGIPISLSIVPNSEERK*
Ga0066710_10047728913300009012Grasslands SoilMENAAVKYLRADAALRQSYALAPDAAARLEKALESPLDVDDEKLVAAAEDALVEFHHGAASKRCDWTMSDEDGALANTAHRGAITELVAVSGLRARLRFRDGNTPGAIDDALAAIAAARHLSVDGSLASVLFGYKLERTITGVLAQSLLRLSPAQLNELASGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLKKVPALQSNKGLAGEIVDGCGGSIKGFVNCVDQQQSFYAAWARRFALPPEQFEKTYKGEFEELARANPFVRQFTPDLSRFRWAEAYNQTRRALLQAAIAVRLDGPKALNQHPDPYNKTTFSYIPVDGGFRLESLLREGGIPISLSIVPNSER
Ga0099829_1006464423300009038Vadose Zone SoilMKATIVLVVLFTMFPAIWLGAHNPPIENQQNTENAAVKYLRADASLRQSYALPPDASAKLQKALESPLDVEDEKLVAAAEEALVEFRHGATINRCDWVMSAEDGPLANTAHRGAIRELVAVAEIRARLRFRDGNMTGAMEDALAAMAAARHLSVDGSLASVLVAYKLEKSVTGVLTQNLFRFSPAQLHELERGLNALPSGSNLSTAFGSEKLSRNDLLSVVQDAKNREELIEQLLHRIPALESNRGLAVEIVDGCGGSIKGYVNCVDQQHSFYVSWAPRFTLPPEQFEKAYKVEFDELSKTNPVVRQFTPALPRFRWAEAYEQTRRALLHSAIAVRLEGPKVLNQHLDPYDQKPFTYTALDGGFRLESRLTDGEIPISLSILPNSEERKTIPK*
Ga0099829_1016533213300009038Vadose Zone SoilMKTSLSLGLLVAMCSPTWLGAQFPPGGHQLDRENAAVKYLRADASLRQSYVLAPDAAAKLLQAVESPLDGEDEKLVAAAEDALVEFHHGAALKRCDWAMSEEDGPLANTAHRGAIMELVAVSGLRARLRFRDGNSPGATDDALAAMAAARHLSVDGSLASVLIAYKLENALTGILARNLHRFSPAQLNELASGLDSLPNGSSLATAFESEKVRRNDLLDIVEGAKSRDGLIVLLLNKLPILQSNRALAAEIVDGCGGSVTGFVTCANRQHSFYKAWASRFSLPPEQFERAYKAEIEEVSRANPVIQQFTPALPRFRWAEAYSQTRRALLQTAIAVRLDGPSALNRQLDPYDRNPFSYIPVDGGFRLESRLREGGTPISLSIVPSL*
Ga0099830_1003829633300009088Vadose Zone SoilMKTSLSLGLLVAMCSPTWLGAQFPPGGHQLDRENAAVKYLRADASLRQSYVLAPDAAAKLLQAVESPLDGEDEKLVAAAEDALVEFHHGAALKRCDWAMSEEDGPLANTAHRGAIMELVAVSGLRARLRFRDGNSPGATGDALAAMAAARHLSVDGSLASVLIAYKLENALTGILARNLHRFSPAQLNELASGLDSLPNGSSLATAFESEKVRRNDLLDIVEGAKSRDGLIVLLLNKLPILQSNRALAAEIVDGCGGSVTGFVTCADRQHSFYKAWASRFSLPPEQFERAYKAEIEEVSRANPVIQQFTPALPRFRWAEAYSQTRRALLQTAIAVRLDGPSALNRQLDPYDRNPFSYIPVDGGFRLESRLREGGTPISLSIVPSL*
Ga0099830_1021892413300009088Vadose Zone SoilMFPAIWLGAHNPPIENQQNTENAAVKYLRADASLRQSYALPPDASAKLQKALESPLDVEDEKLVAAAEEALVEFRHGATINRCDWVMSAEDGPLANTAHRGAIRELVAVAEIRARLRFRDGNMTGAMEDALAAMAAARHLSVDGSLASVLVAYKLEKSVTGVLTQNLFRFSPAQLHELERGLNALPSGSNLSTAFGSEKLSRNDLLSVVQDAKNREELIEQLLHRIPALESNRGLAVEIVDGCGGSIKGYVNCVDQQHSFYVSWAPRFTLPPEQFEKAYKVEFDELSKTNPVVRQFTPALPRFRWAEAYEQTRRALLHSAIAVRLEGPKVLNQHLDPYDQKPFTYTALDGGFRLESRLTDGEIPISLSILPNSEERKTIPK*
Ga0099828_1008048123300009089Vadose Zone SoilMKATIVLVVLFTMFPAIWLGAHNPPIENQQNTENAAVKYLRADASLRQSYALPPDASAKLQKALESPLDVEDEKLVAAAEEALVEFRHGATINRCDWVMSAEDGPLANTAHRGAIRELVAVAEIRARLRFRDGNMTGAMEDALAAMAAARHLSVDGSLASVLVAYKLEKSVTGVLTQNLFRFSPAQLHELERGLNALPSGSNLSTAFGSEKLSRNDLLSVVQDAKNRGELIEQLLHRIPALESNRGLAVEIVDGCGGSIKGYVNCVDQQHSFYVSWAPRFTLPPEQFEKAYKVEFDELSKTNPVVRQFTPALPRFRWAEAYEQTRRALLHSAIAVRLEGPKVLNQHLDPYDQKPFTYTALDGGFRLESRLTDGEIPISLSILPNSEERKTIPK*
Ga0099828_1014699613300009089Vadose Zone SoilMMFPSNRLGAQVPPVKNEQNTENAAVKYLRADASLRQSYPLPLDAAAKLQKALESSLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSAEDGPLANTAHRGAIKELVAVAGIRSRLRFRDDNLPGAIADAVAAMAAARHLSVDGSLASVLFAYKMENSVTGILVQNLLRLSPAQLQELTTGINALPSGSDIRIAFESEKLSRNDILAGLQGAKARDELIEGLLRNIPFLKSNRVLAAQIVDGCGGSVKGFTDCVDQQQSFYLSWALRFTLPPEQFEKAYKAEFDELSKANPVVRQFTPALPRFRWAEAYEETRRALFHTAIAVRLDGTKALSVCLDPYDQKPFTYTALDGGFRLESRLTDGGIPISLSIVPSAEDRKAVSK*
Ga0066709_10000489873300009137Grasslands SoilVKYLRADAALRQATPLAPDAAGKLEKALELPLDGEDEKLVAAAKDALVEFRHGAASKRCDWAMSVEDGPLANTTHRGAIMELVAVSGLRARLRFRDGETPGAMGDALAAMAAARHLSVDGSLASVLFADKLEKAIIGVLAHNLLRFSSARLNELASGLDALPSGSSLSTAFEAEKVSRNDLLAIAQVARTRDELIGRLLNKIPVLQSNRRLAAEIVDGCGGSVKGFVDCIDQQQSFYPSWAPRFTLPPEQFDRVYRAEIEELSRTNPVIRQFTPALPRFRWAEAYNQTRRALLQAAIAVRLDGSRALNQHLDPFDRNPFSYILVDGGFRLESRLSEGEIPISLSIVPSSEERKANPR*
Ga0066709_10141736113300009137Grasslands SoilASPLDEEDEKLVAAAEDALVEFHHGAASKGCDWTMSDEDGALANTAHRGAITELVAVSGLRARLRFRDGNTPGAIDDALAAIAAARHLSVDGSLASVLFGYKLERTITGVLAQSLLRLSPAQLNELASGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLKKVPALQSNKGLAGEIVDGCGGSIKGFVNCVDQQQSFYAAWARRFALPPEQFEKTYKGEFEELARANPFVRQFTPDLSRFRWAEAYNQTRRALLQAAIAVRLDGPKALNQHPDPYNKTTFSYIPVDGGFRLESLLREGGIPISLSIVPNSER*
Ga0066709_10141736213300009137Grasslands SoilASPLDEEDEKLVAAAEDALVEFHHGAASKGCDWTMSDEDGALANTAHRGAITELVAVSGLRARLRFRDGDTPRAMDDALAAMAAARHLSVDGSLASVLFGYKLERTITGVLAQNLLRFSPAELNELASGLGALPSGFDLGTAFESEKVRRNDFLAIAQAAKSRDELIEQLLNKVPVLRSNKELAGEIVDGCGGSLKGFVNCVDQQQSFYTSWAPRFALPPEQFEKAYKAEIEELARVNPVIRQFTPALPRFRWAEAYNQTRRALLHAAIAVRLDGPKALNQHLDPFDQNAFSYIPVDGGFRLESLLREGGIPISLSIVPNSER*
Ga0134065_1000490133300010326Grasslands SoilMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGTLASVLFASKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0137392_1006558723300011269Vadose Zone SoilMFPAIWLGAHNPPIENQQNTENAAVKYLRADASLRQSYALPPDASAKLQKALESPLDVEDEKLVAAAEEALVEFRHGATINRCDWVMSAEDGPLANTAHRGAIRELVAVAEIRARLRFRDGNMTGAMEDALAAMAAARHLSVDGSLTSVLVAYKLEKSVTGVLTQNLFRFSPAQLHELERGLNALPSGSNLSTAFGSEKLSRNDLLSVVQDAKNREELIEQLLHRIPALESNRGLAVEIVDGCGGSIKGYVNCVDQQHSFYVSWAPRFTLPPEQFEKAYKVEFDELSKTNPVVRQFTPALPRFRWAEAYEQTRRALLHSAIAVRLEGPKVLNQHLDPYDQKPFTYTALDGGFRLESRLTDGEIPISLSILPNSEERKTIPK*
Ga0137392_1014210813300011269Vadose Zone SoilMKVAIVLAVAFVMFPSIWLGAQTPPVGNQQNTENAAVKYLRADVSLRQSYALPPDAAAKLQKALESPLDAEDEKLVAAADEALVEFRHGASIKRCDWVMSAEDGPLANTAHRGAIKELVAVAGIRSRLRFRDVNTPGAMDDALAAMAAARHLSVDGSLASVLFAYKLENSVTVVLVQNLLRLSPTLLQELASGLNGLPSGSNLGTALESEKLSRNELLAIARNAKTRDELIEQLLHNIPALQSNRGLAVEIVDGCGGSVKGFVNCVDQQHSLYVSWAPRFTLPPEQFEEAYKIEFDELSKANPVVRQFTPALPRFRWAEAYEQTRRALLHAAVAVRLDGPKALNQHFDPFDKKPFTYTAVDGGFRLESRLTDGGIPIMLSIVPTSEEARVIPK*
Ga0137391_1000877163300011270Vadose Zone SoilMKTSLSLGLLVAMCSPTWLGAQFPPGGHQLDRENAAVKYLRADASLRQSYVLAPDAAAKLLQAVESPLDGEDEKLVAAAEDALVEFHHGAALKRCDWAMSEEDGPLANTAHRGAIMELVAVSGLRARLRFRDGNSPGATGDALAAMAAARHLSVDGSLASVLIAYKLENALTGILARNLHRFSPAQLNELASGLDSLPNGSSLATAFESEKVRRNDLLDIVEGAKSRDGLIVLLLNKLPILQSNRALAAEIVDGCGGSVTGFVTCANRQHSFYKAWASRFSLPPEQFERAYKAEIEEVSRANPVIQQFTPALPRFRWAEAYSQTRRALLQTAIAVRLDGPSALNRQLDPYDRNPFSYIPVDGGFRLESRLREGGTPISLSIVPSL*
Ga0137391_1002040443300011270Vadose Zone SoilMGKLQKALESPLDVEDEKLVAAADEALVEFHHGATIERCDWVMSAEDGPRANTAHRGAMKELVAVAEIRARLRFRDGNVPGAIEDALAAVAAARHLSVDGSLASVLFAYKLENSVTGVLAQNLLRLSPAQLQELAKGLNTLPSGSNIRTAFESEKLGRNDILASVQGAKTRGELIEQLLLNLPFLQSNRGLAEQIVDGCGGSVKGFVTCVDQQQTFYLSWAPRFALPPEQFERAYNVEFDELSKANPVIGQFAPALPRFRWAEAYEQTRRALFRAAIAIRLDGPKALNLNLDPYDQKPFTYAAVNGGFRLESRLTDTAIPISLSIEPSAGERKAIPK*
Ga0137388_1022681313300012189Vadose Zone SoilMMFPSNRLGAQVPPVKNEQNTENAAVKYLRADASLRQSYPLPLDAAAKLQTALESSLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSAEDGPLANTAHRGAIKELVAVAGIRSRLRFRDVNTPGAMDDALAAMAAARHLSVDGSLASVLFAYKLENSVTVVLVQNLLRLSPALLQELASGLNGLPSGSNLGTALDSEKLSRNELLAIARNAKTRDELIEQLLHNIPALQSNRGLAVEIVDGCGGSVKGFVNCVDQQHSLYVSWAPRFTLPPEQFEEAYKIEFDELSKANPVVRQFTPALPRFRWAEAYEQTRRALLHAAVAVRLDGPKALNKHFDPFDKKPFTYTAVDGGFRLESRLTDGGIPIMLSIVPTSAEARVIPK*
Ga0137363_1000682153300012202Vadose Zone SoilMKMGLALVLVAATGVTEGFSPTGGQEAENATVKYLRADAALRQSYALPPDAATQLLKALDSPLNGEDEKLVAAATEALVEFHHGAGLKRCDWTMSVEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYRETRRALLKAAIVVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD*
Ga0137363_1023124123300012202Vadose Zone SoilVRRRLRMKVIIVLVVLFTMSPSSWLKAQNSPVENQENTKNAAVKYLRADASLRQSYALPPDAAAKLQKALESPLDADDEKLVVAADEALVEFRHGASIKRCDWVMSAEDGALANTSHRGAIKELVAVAGIRSRLRFRDGNTQGAMEDALAAMAAPRHLSVDGSLASVLIAYKLENSAARILAQNLHRLASPQLHELANGLNNLPGGSNLGTALESEKLSRNEFLTLAQNAKTRDELIEQLLQNIPVLQSNRELAAQIIDGCGGSVKGFVNCVDQQRSFYESWAPRFTLPPEQFEKAFKVEFTELSKKNPVVRQFTPALPRFRWVVADQQTHRALLQAAIAVRSRSFESA*
Ga0137399_1000421323300012203Vadose Zone SoilMKAAIVLVVQFMMFPSIWLGAQTPPVGNQQNTENAAVKYLRADASLRQSYALPPDAPAKLQKALESPLDAEDEKLVAAADEALVEFRHGAFIKRCDWVMSAEDGPLASTAHRGAIKELVAVAGIRSRLRFRDGNTPGAMDDALAAMAAARHLSVDGSLASVLFAYKLENSVTLVLVQNLLRLSPAQLHELASGLTGLPSGSNLGTALESEKLSRNELLAIAQNAKTRDELIEQLLHNIPALQSNRGLAVEIVDGCGGSVKGFVNCVDQQRSLYVSWAPRFTLPPDQFEKAYKVEFDELSKANPVVRQFTPALPRFRWAEAYEQTRRALLHAAVAVRLDGPKALNQHFDPFDKKPFTYTTVDGGFRLESRLADGGIPISLSIVPSSEEARVIPK*
Ga0137399_1000507753300012203Vadose Zone SoilMKIGLALVLVAATGVTEGFSPTGGQEAENATVKYLRADAALRQSYALPPDAATQLLKALDSPLNGEDEKLVAAATEALVEFHHGAGLKRCDWTMSIEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYRETRRALLKAAIVVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD*
Ga0137399_1011284523300012203Vadose Zone SoilMFVAMCSPTWLGAQLPPAVNQQDTENAAVKYLRADAALRQSYALAPDSAAKLEKALELPLDEEDKKLVGAAEEALVEFHHGATIKRCDWAMSVEDGPLASTAHRGAIRELVAVSGLRARLRFRDGDTRGAIGDALAAMAAARHLSVDGSLASVLIAYKLEKAITGVLAQNLLRLSPAQLNELASGLDALPSGFSLGTAFKSEKLSRNDLLSVAQVAKSREELIEQLSNKIPALQSNKGLAGEIVDGCGGSVKGFVNCVDQQRSFYTSWAPRFTLPPEQFEKAYKAEIGDLSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAIRLEGPRALNQHLDPYDKNPFSYIPVNGGFRLESRLSDGGIPISLSIVPNSEERK*
Ga0137362_1001043743300012205Vadose Zone SoilMKIGLALVLVAATGVTEGFSPTGGQEAENATVKYLRADAALRQSYALPPDAATQLLKALDSPLNGEDEKLVAAATEALVEFHHGAGLKRCDWTMSIEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYRETRRALLKAAIAVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD*
Ga0137385_1027672213300012359Vadose Zone SoilHGAAIKRCAWIISDEDGALANTAHRGAITELVAVSGLRARLRFRDGDTPGAMGDALAAMAAARHLSVDGSLASVLFGYKLEREITGVLAQNLLRFSPAQLNGLANGLGVLPSGFSLSTAFESEKVRRNDFLAIVQIAKTRDELIAQLLKKVPALQSNKGLAGEIVDGCGGSVTGFVNCVGQQQSFYASWAPRFALPPEQFEEAYHAEIEELARVNPVIRQFTPALPRFRWAEAYNQTRRALLRAAIAVRLDGPKALSRHLDPFDQNAFSYIPVDGGFRLESRLREVGIPISLSIVANSEDRKPSPR*
Ga0137360_1018273323300012361Vadose Zone SoilMKVIIVLVVLFTMSPSSWLKAQNSPVENQENTKNAAVKYLRADASLRQSYALPPDAAAKLQKALESPLDADDEKLVVAADEALVEFRHGASIKRCDWVMSAEDGALANTSHRGAIKELVAVAGIRSRLRFRDGNTQGAMEDALAAMAAPRHLSVDGSLASVLIAYKLENSAARILAQNLHRLASPQLHELANGLNNLPGGSNLGTALESEKLSRNEFLTLAQNAKTRDELIEQLLQNIPVLQSNRELAAQIIDGCGGSVKGFVNCVDQQRSFYESWAPRFTLPPEQFEKAFKVEFTELSKKNPVVRQFTPALPRFRWVVADQQTHRALLQAAIAVRSRSFESA*
Ga0137360_1021559823300012361Vadose Zone SoilMKATIVLVVLFTMFPSIWLGAQVPPAENQPYAENAAVKYLRADVSLRQSYALPPDAAAKLQKTLESPLDMEDEKLVAAADEALIEFRHGAATKRCDWVMSVEDGPFANTAHRGAIKELVAVAGIRARLRFRDGNMPGAIDDALAAMAAARHLSVDGSLASVLFAYKLEDSVTGVLVQNLLRPSPAQLQELASSLNALPSGSNLSTAFESEKLSRNDLLAVVQDAKSRDEVIEHLLHDIPALQSNRGLAAQIVDGCGGSVKGYVNCVDQQHSFYVSWVPRFRLPPEQFEKTFKIEFDELSKTNPVLRQFTPALPRIRWAEAYEQTRRALLHAAIAIQLEGPKVLNQQLDPYDKKPFTYTAADGGFRLESRLADGGIPISLSILLNSEERKAIPK*
Ga0137360_1024663323300012361Vadose Zone SoilLALVLVAATGVTEGFSPTGGQEAENATVKYLRADAALRQSYALPPDAATQLLKALDSPLNGEDEKLVAAATEALVEFHHGAGLKRCDWTMSVEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQLESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYRETRRALLKAAIVVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD*
Ga0137361_10000569123300012362Vadose Zone SoilMENAAVKYLRADAALRQSYALAPDAAAKLEKALALPLDGEDEKLVAAAEDALVEFHHGAAIKRCDWIMSDEDGALANTAHRGAITELVGVSGLRARLRFRDGDTPGAMGDALAAIAAARHLSVDGSLASVLFGYKLEREITGVLARNLLRFSPTQLNELASGLGVLPSGFSLSTAFESEKVRRNDFLAVVQGAKSRDELIERLLKRAPALRSNKELAGEIVDGCGGSVKGFVNCVDQQQSFYASWAPRFALPPEQFEKAYKSEIEEFARVNPVIRQFTPALPRFRWAEAYNQTRRALLHAAIAVRRDGPKALHQHPDPFDQNAFSYMPVDGGFRLESRLSEGGIRISLLIAANSEERKPSPR*
Ga0137390_1013372523300012363Vadose Zone SoilMKTSLSLGLLVAMCSPTWLGAQFPPGGHQLDRENAAVKYLRADASLRQSYVLAPDAAAKLLQAVESPLDGEDEKLVAAAEDALVEFHHGAALKRCDWAMSEEDGPLANTAHRGAIMELVAVSGLRARLRFRDGNSPGATGDALAAMAAARHLSVDGSLASVLIAYKLENALTGILARNLHRFSPAQLNELASGLDSLPNGSSLATAFESEKVLRNDLLDIVEGAKSRDGLIVLLLNKLPILQSNRALAAEIVDGCGGSVTGFVTCANRQHSFYKAWASRFSLPPEQFERAYKAEIEEVSRANPVIQQFTPALPRFRWAEAYSQTRRALLQTAIAVRLDGPSALNRQLDPYDRNPFSYIPVDGGFRLESRLREGGTPISLSIVPSL*
Ga0137390_1029110723300012363Vadose Zone SoilMENAAVKYLRADAALRQSTALAPDAAAKLEKALQSPLDGEDEKLVAAAEDALVEFHHGTTIKWCDWAVSAEDGPLANTAHRGAIKELVAVSGLRARLRFRDGDTPGAMGDALAAMAAARHLSVDGSLASVLFAYKLENTITGVLAQNLLRFSPAQLNELASGLDALPSGSSLSTAFESEKVNRNDLLAIAQPAKSRDELIERLLNKVPTLQSNRGVAAEIVDGCGGSVKGFLNCVDQQQSFYTSWAPRFTLPPEQFEKAYKAEIEELSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAVRLDGPKALNQHRDPFDKNPFSYIPVDGGFRLESRLSEGGIPISLSIMPSSEERRANPR*
Ga0137390_1029126613300012363Vadose Zone SoilMGKLQKALESPLDVEDEKLVAAADEALVEFHHGATIERCDWVMSAEDGPRANTAHRGAMKELVAVAEIRARLRFRDGNVPGAIEDALAAVAAARHLSVDGSLASVLFAYKLENSVTGVLAQNLLRLSPAQLQELAKGLNTLPSGSNIRTAFESEKLGRNDILASVQGAKTRGELIEQLLLNLPFLQSNRGLAEQIVDGCGGSVKGFVTCVDQQQTFYLSWAPRFALPPEQFERAYNVEFDELSKANPVIGQFAPALPRFRWAEAYEETRRALFHTAIAVRLDGTKALSVCLDPYDQKPFTYTALDGGFRLESRLTDGGIPISLSIVPSAEDRKAVSK*
Ga0137358_1000546653300012582Vadose Zone SoilMKIGLALVLVAATGVTEGFSPTGGQEAENATVKYLRADAALRQSYALPPDAATQLLKALDSPLNGEDEKLVAAATEALVEFHHGAGLKRCDWTMSVEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYRETRRALLKAAIVVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD*
Ga0137358_1001496623300012582Vadose Zone SoilMKATIVLVVLFTMFPSIWLGAQVPPAENQPYAENAAVKYLRADVSLRQSYALPPDAAAKLQKALESPLDMEDEKLVAAADEALIEFRHGAATKRCDWVMSVEDGPFANTAHRGAIKELVAVAGIRARLRFRDGNMPGAIDDALAAMAAARHLSVDGSLASVLFAYKLEDSITGVLVQNLLRPSPAQLQELASSLNALPSGCNLSTAFESEKLSRNDLLAVVQDAKSRDEVIEHLLHDIPALQSNRGLAAQIVDGCGGSVKGYVNCVDQQHSFYVSWVPRFRLPPEQFEKTFKIEFDELSKTSPVLRQFTPALPRIRWAEAYEQTRRALLHAAIAIQLEGPKVLNQQLDPYDKKPFTYTATDGGFRLESRLADGGIPISLSILLNSEERKAIPK*
Ga0137398_1000811733300012683Vadose Zone SoilMENAAVKYLRADAALRQSTALAPDAAAKLEKALELPLDGEDEKLVAAAEEALVEFHHGTTIKRCDWAVSAEDGPLANTAHRGAIRELVAVSVLRARLRFRDGDTPGAMGDALAAMAAARHLSVDGSLASVLFAYKLENTITGVLAQNLLRFSPAQLNELASGLDALPSGSSLSTAFESEKVSRNDLLAIAQPAKSRDELIERLLNKVPTLQSNRGVAAEIVDGCGGSVKGFLNCVDQQQSFYTSWAPRFTLPPEQFEKAYKAEIEEFSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAVRLDGPKALNLHRDPFDKNPFSYIPVDGGFRLESRLSEGGIPISLSIIPSSEGRRANPR*
Ga0137397_1000302773300012685Vadose Zone SoilMKIGLALVLVAATGVTEGFSPTGGQEAENATVKYLRADAALRQSYALPPDAATQLLKALDSPLNGEDEKLVAAATEALVEFHHGAGLKRCDWTMSIEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGLSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYKETRRALLKAAIAVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD*
Ga0137396_1001249823300012918Vadose Zone SoilMKMGLALVLVAATGVTEGFSPTGGQEAENATVKYLRADAALRQSYALPPDAATQLLKALDSPLNGEDEKLVAAATEALVEFHHGAGLKRCDWTMSIEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYRETRRALLKAAIAVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD*
Ga0137396_1011222923300012918Vadose Zone SoilMCSPTWLGAQLPPAVNQQDTENAAVKYLRADTALRQSYALAPDSAAKLEKALELPLDEEDKKLVGAAEEAFVEFDHGATIKRCDWAMSVEDGPLASTAHRGAIRELVAVSGLRARLRFRDGDTRGAIGDALAAMAAARHLSVDGSLASVLIAYKLEKAITGVLAQNLLRLSPAQLNELASGLDALPSGFSLGTAFKSEKLSRNDLLSVAQVAKSRDELIEQLSNKIPALQSNKGLAGEIVDGCGGSVKGFVNCVDQQRSFYTSWAPRFTLPPEQFEKAYKAEIGDLSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAIRLEGPRALNQHLDPYDKNPFSYIPVNGGFRLESRLSDGGIPISLSIVPNSEERK*
Ga0137359_1046906313300012923Vadose Zone SoilALESPFDGEVEKLVAAAEDALVEFRHGAAIKRCDWSMSLEDGPLANTAHRGAIMELVAVSGLRARLRFRDGDTPGALGDVLAAIAAARHLSVDGSLASVLFAYKLENAVAAILAQNLHGFSPAQLNELSTKLDALPKGFSLGTALESEKLGRNDLLTASQGAKDRDDLIGRLVNKIPVLQSKPELAREIVDGCGSSVVGFVNCVNQQQSFYASWASRFTLPPEQFEMSYKAEIQELSRTNPVVREFTPNLPRLRWAEAYSQTRRALLRAAIAVRMEGPDALNRHLDPYDGNPFPYAPVGSGFKLQSQLSEGGIPISLSILPGAENRKASPNQLVPPPEEPQSPFRWQALPVFKQRHSKKIGDFAGAAIA*
Ga0137419_1015041613300012925Vadose Zone SoilMFVAMCSPTWLGAQLPPAVNQQDTENAAVKYLRADTALRQSYALAPDSAAKLEKALELPLDEEDKKLVGAAEEAFVEFDHGATIKRCDWAMSVEDGPLASTAHRGAIRELVAVSGLRARLRFRDGDTRGAIGDALAAMAAARHLSVDGSLASVLIAYKLEKAITGVLAQNLLRLSPAQLNELASGLDALPSGFSLGTAFKSEKLSRNDLLSVAQVAKSRDELIEQLSNKIPALQSNKGLAGEIVDGCGGSVKGFVNCVDQQRSFYTSWAPRFTLPPEQFEKAYKAEIGDLSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAIRLEGPRALNQHLDPYDKNPFSYIPVNGGFRLESWLSDGGIPISLSIVPNSEERK*
Ga0137419_1016991213300012925Vadose Zone SoilMKATIVSVVLFMMVPSIWLRSQTPPAGNQQQTENAAVKYLRADAALRQSYALPPDAAEKLQKAVESPLDAEDERLVAAADEALVELRHGAAIERCDWLMSAEDGALASTAHRGAIKELVAIAEIRARLRFRDGNTPGAIGDVLAAMSAARHLSVDGSLASVLFANKLENSVTGVLVQNLPRLSSAQLHELSSGLKALPRGSNLSTAFESEKLSRNNVLLALVEGAKTRDELTEQLLHNIPALGSNRGLAAEIVDGCGGSVKGYVSCVDRQHSFYASWAPRFVLPPEEFEKAYKVEFDGLSKTNPVVRQFTPALWRFRWAEAYEQTRRALLHTAIAVQLEGPRVLNQHLDPYDQRPFTYTAVDGGFRLESRLADGGVPISLVILPNSEERKTIPK*
Ga0137419_1031750313300012925Vadose Zone SoilMKAAIVLVVQFMMFPSIWLGAQTPPVGNQQNTENAAVKYLRADASLRQSYALPPDAPAKLQKALESPLDAEDEKLVAAADEALVEFRHGAFIKRCDWVMSAEDGPLASTAHRGAIKELVAVAGIRSRLRFRDGNTPGAMDDALAAMAAARHLSVDGSLASVLFAYKLENSVTLVLVQNLLRLSPAQLHELASGLTGLPSGSNLGTALESEKLSRNELLAIAQNAKTRDELIEQLLHNIPALQSNRGLAVEIVDGCGGSVKGFVNCVDQQRSLYVSWAPRFTLPPDQFEKAYKVEFDELSKANPVVRQFTPALPRFRWAEAYEQNRRALLHAAVAVRL
Ga0137416_1004699313300012927Vadose Zone SoilALDSPLNGGDEKLVAAATEALVEFHHGAGLKRCDWTMSVEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYRETRRALLKAAIVVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD*
Ga0137416_1024662213300012927Vadose Zone SoilIIALVALFTMSTSIWLGAHDPPVENQQNTENAAVKYLRADASLRQSYALAPDAPANLQKALESPLDAEDEKLVAAADEALVEFRHGASIQRCDWVMSAEDGPLASTAHRGAIKELVAVAGIRSRLRFRDGNTPGAVDDALAAMAAARHLSVDGSLASVLFAYKLENSVTLVLVQNLLRLSPAQLHELASGLTGLPSGSNLGTALESEKLSRNELLAIAQNAKTRDELIEQLLHNIPALQSNRGLAVEIVDGCGGSVKGFVNCVDQQRSLYVSWAPRFTLPPDQFEKAYKVEFDELSKANPVVRQFTPALPRFRWAEAYEQTRRALLHAAVAVRLDGPKALNQHFDPFDKKPFTYTTVDGGFRLESRLADGGIPISLSIVPSSEEARVIPK*
Ga0137416_1032707513300012927Vadose Zone SoilALELPLDEEDKKLVGAAEEAFVEFDHGATIKRCDWAMSVEDGPLASTAHRGAIRELVAVSGLRARLRFRDGDTRGAIGDALAAMAAARHLSVDGSLASVLIAYKLEKAITGVLAQNLLRLSPAQLNELASGLDALPSGFSLGTAFKSEKLSRNDLLSVAQVTKSRDELVEQLSNKIPALQSNKGLAGEIVDGCGGSVKGFVNCVDQQRSFYTSWAPRFTLPPEQFEKAYKAEIGDLSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAIRLEGPRALNQHLDPYDKNPFSYIPVNGGFRLESRLSDGGIPISLSIVPNSEERK*
Ga0137416_1033939413300012927Vadose Zone SoilVLVVLFTMFPSIWLGAQVPPAENQPYAENAAVKYLRADVSLRQSYALPPDAAAKLQKALESPLDMEDEKLVAAADEALIEFRHGAATKRCDWVMSVEDGPFANTAHRGAIKELVAVAGIRARLRFRDGNMPGAIDDALAAMAAARHLSVDGSLASVLFAYKLEDSVTGVLVQNLLRPSPAQLQELASSLNALPSGSNLSTAFESEKLSRNDLLAVVQDAKSRDEVIEHLLHDIPALQSNRGLAAQIVDGCGGSVKGYVNCVDQQHSFYVSWAPRFRLPPEQFEKTFKIEFDELSKTNPVLRQFTPALPRIRWAEAYEQTRRALLHAAIAIQLEGPKVLNQQLDPYDKKPFTYTATDGGFRLESRLADGGIPISLSILLNSEERKAIPK*
Ga0137404_1006553413300012929Vadose Zone SoilMKATIVLVVLFTMFPSIWLGAQVPPAENQPYAENAAVKYLRADVSLRQSYALPPDAAAKFQKALESPLDMEDEKLVAAADEALIEFRHGAATKRCDWVMSVEDGPFANTAHRGAIKELVAVAGIRARLRFRDGNMPGAIDDALAAMAAARHLSVDGSLASVLFAYKLEDSVTGVLVQNLLRPSPAQLQELASSLNALPSGSNLSTAFESEKLSRNDLLAVVQDAKSRDEVIEHLLHDIPALQSNRGLAAQIVDGCGGSVKGYVNCVDQQHSFYVSWAPRFRLPPEQFEKTFKIEFDELSKTNPVLRQFTPALPRIRWAEAYEQTRRALLHAAIAIQLEGPKVLNQQLDPYDKKPFTYTATDGGFRLESRLADGGIPISLSILLNSEERKAIPK*
Ga0137404_1012000733300012929Vadose Zone SoilVKYLRADAALRQSYALAPDAPTKLQKALESPFDGEVEKLVAAAEDALVEFRHGAAIKRCDWSMSLEDGPLANTAHRGAIMELVAVSGLRARLRFRDGDTPGALGDVLAAIAAARHLSVDGSLASVLFAYKLENAVAAILAQNLHGFSPAQLNELSTKLDALPKGFSLGTALESEKLGRNDLLTASQGAKDRDDLIGRLVNKIPVLQSKPELAREIVDGCGSSVVGFVNCVNQQQSFYASWASRFTLPPEQFEMSYKAEIQELSRTNPVVREFTPNLPRLRWAEAYSQTRRALLRAAIAVRMEGPDALNRHLDPYDGNPFPYAPVGSGFKLQSQLSEGGIPISLSILPGAENRKASPNQLVPPPEEPQSPFRWQALPVFKQRHSKKIGDFAGAAIA*
Ga0137404_1024670613300012929Vadose Zone SoilMKATIVSVVLFMMVPSIWLRSQTPPAGNQQQTENAAVKYLRADAALRQSYALPPDAAEKLQKAVESPLDAEDERLVAAADEALVELRHGAAIERCDWLMSAEDGALASTAHRGAIKELVAIAEIRARLRFRDGNTPGAIGDVLAAMSAARHLSVDGSLASVLFANKLENSVTGVLVQNLPRLSSTQLHELSSGLKALPRGSNLSTAFESEKLSRNNVLLALVEGAKTRDELTEQLLHNIPALGSNRGLAAEIVDGCGGSVKGYVSCVDRQHSFYASWAPRFVLPPEEFEKAYKVEFDGLPKTNPVDRQFTPALWRFRWAEAYEQTRRALLHTAIAVQLEGPRVLNQHLDPYDQRPFTYTAVDGGFRLESRLADGGVPISLVILPNSEERKTIPK*
Ga0134110_1000362633300012975Grasslands SoilMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDVPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK*
Ga0134087_1023734413300012977Grasslands SoilLEKALESPLDVDDEKLVAAAEDALVEFHHGAASKRCDWTMSDEDGALANTAHRGAITELVAVSGLRARLRFRDGNTPGAIDDALAAIAAARHLSVDGSLASVLFGYKLERTITGVLAQSLLRLSPAQLNELASGLGALPSGFSLGTAFESEKVRRNDFLTIVQAAKTRDELIEQLLKKVPALQSNKGLAGEIVDGCGGSIKGFVNCVDQQQSFYAAWARRFALPPEQFEKTYKGEFEELARANPFVRQFTPDLSRFRWAEAYNQTRRALLQAAIAVR
Ga0134081_1004716313300014150Grasslands SoilMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALSPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGP
Ga0182024_1022036233300014501PermafrostMRVLKVIVLVALSLPTGSAAQQKSENAAVKYLRADLSLRQTYPLAPDAAVKLEKALESPLDGEDEKLVAAADDALVEFNHGTALTRCDWAMSSEDGPFANTSHRGAIEELVAVSGLRARLRFHAGNVHGAISDALASLTAARHLSVDGSIASVLFANKLENEIAGVLAQNLEQLSRTQLKELTISLDGLPMGSSLSNAFEAEKVRRNDLLPIAEGATTRDELIEHLLNGIPFLQSNKAVAGEIVDGCGGSVRGFLNCVNQQQSFYTSWVARFGFSPEQFETEYKAEIEELSRANPVIRLLTPNLPRLRWTEAYTQTRRALLYAAIDVRLDGPRAVNGHLDPYDRSPFSYSSVDDGFRLVSRLKDQQGIPFSLTIAPGARDGSAGEK*
Ga0137418_1014132223300015241Vadose Zone SoilMMVPSIWLRSQTPPAGNQQQTENAAVKYLRADAALRQSYALPPDAAEKLQKAVESPLDAEDERLVAAADEALVELRHGAAIERCDWLMSAEDGALASTAHRGAIKELVAIAEIRARLRFRDGNTPGAIGDVLAAMSAARHLSVDGSLASVLFANKLENSVTGVLVQNLPRLSSAQLHELSSGLKALPRGSNLSTAFESEKLSRNNVLLALVEGAKTRDELTEQLLHNIPALGSNRGLAAEIVDGCGGSVKGYVSCVDRQHSFYASWAPRFVLPPEEFEKAYKVEFDGLSKTNPVVRQFTPALWRFRWAEAYEQTRRALLHTAIAVQLEGPRVLNQHLDPYDQRPFTYTAVDGGFRLESRLADGGVPISLVILPNSEERKTIPK*
Ga0137418_1015967323300015241Vadose Zone SoilMKAAIVLVVQFMMFPSIWLGAQTPPVGNQQNTENAAVKYLRADASLRQSYALPPDAPAKLQKALESPLDAEDEKLVAAADEALVEFRHGAFIKRCDWVMSAEDGPLASTAHRGAIKELVAVAGIRSRLRFRDGNTPGAMDDALAAMAAARHLSVDGSLASVLFAYKLENSVTLVLVQNLLRLSPAQLHELASGLTGLPSGSNLGTALESEKLSRNELLAIAQNAKTRDELIEQLLHNIPALQSNRGLAVEIVDGCGGSVKGFVNCVDQQRSLYVSWAPRFTLPPDQFEKAYKVEFDELSKANPVVRQFTPALPRFRWAEAYEQNRRALLHAAVAVRLDGPKALNQHFDPFDKKPFTYTTVDGGFRLESRLADGGIPISLSIVPSSEEARVIPK*
Ga0137418_1017077323300015241Vadose Zone SoilMFVAMCSPTWLGAQLPPAVNQQDTENAAVKYLRADAALRQSYALAPDSAAKLEKALELPLDEEDKKLVGAAEEAFVEFDHGATIKRCDWAMSVEDGPLASTAHRGAIRELVAVSGLRARLRFRDGDTRGAIGDALAAMAAARHLSVDGSLASVLIAYKLEKAITGVLAQNLLRLSPAQLNELASGLDALPSGFSLGTAFKSEKLSRNDLLSVAQVAKSRDELIEQLSNKIPALQSNKGLAGEIVDGCGGSVKGFVNCVDQQRSFYTSWAPRFTLPPEQFEKAYKAEIGDLSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAIRLEGPRALNQHLDPYDKNPFSYIPVNGGFRLESRLSDGGIPISLSIVPNSEERK*
Ga0137403_1001269543300015264Vadose Zone SoilMKATIVLVVLFTMFPSIWLGAQVPPAENQPYAENAAVKYLRADVSLRQSYALPPDAAAKFQKALESPLDMEDEKLVAAADEALIEFRHGAATKRCDWVMSVEDGPFANTAHRGAIKELVAVAGIRARLRFRDGNMPGAIDDALAAMAAARHLSVDGSLASVLFAYKLEDSVTGVLVQNLLRPSPAQLQELASSLNALPSGSNLSTAFESEKLSRNDLLAVVQDAKSRDEVIEHLLHDIPALQSNRGLAAQIVDGCGGSVKGYVNCVDQQHSFYVSWAPRFRLPPEQFEKTFKIEFDELSKTNPVLRQFTPALPRIRWAEAYEQTRRALLHAAIAIQLEGPKVLNQQLDPYDKKPFTYTAADGGFRLESRLADGGIPISLSILLNSEERKAIPK*
Ga0187783_1001661633300017970Tropical PeatlandVVVRTVIAVAALFALFFPPWFGSRLVNAPNQENTENAAVKYLRADAALRQSYALPPDAAPKLEKALESPLDGEDRKLVAAAEDALVELDHGASDKRCDWAVSVEDGPLANTAHRGAIRELVAVSGLRARLRFVAGDTPGAMSDALAAIAAARHLSMDRSIASVLIAYKLENMTATVLAQNLGQFSPAQLRELVSGLDALPHGSSLSSALESEKLNRNDLSAIVQGAKTRDELVGRLLARVPTLQSNRVLAEQIVDGCGGSVEGFLKCADQQISLYALWEARFSWSPEHFESAYNADLAEVSKTNPVIRQFTPSLTHLRWAEAYCQTRRALLQAAVFILLDGQSALNRYLDPYDARPFSSTSVDGGFRLGSRLTENGIPLSLSAVPNSDHVVTNPK
Ga0066655_1000574543300018431Grasslands SoilMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTALESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK
Ga0066667_1001279143300018433Grasslands SoilMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK
Ga0179594_1001716313300020170Vadose Zone SoilTGLLMEGLDSLARSECKLLSMKATIVLVVLFTMFPSIWLGAQVPPAENQPYAENAAVKYLRADVSLRQSYALPPDAAAKLQKALESPLDMEDEKLVAAADEALIEFRHGAATKRCDWVMSVEDGPFANTAHRGAIKELVAVAGIRARLRFRDGNMPGAIDDALAAMAAARHLSVDGSLASVLFAYKLEDSVTGVLVQNLLRPSPAQLQELASSLNALPSGSNLSTAFESEKLSRNDLLAVVQDAKSRDEVIEHLLHDIPALQSNRGLAAQIVDGCGGSVKGYVNCVDQQHSFYVSWAPRFRLPPEQFEKTFKIEFDELSKTNPVLRQFTPALPRIRWAEAYEQTRRALLHAAIAIQLEGPKVLNQQLDPYDKKPFTYTAADGGFRLESRLADGGIPISLSILLNSEERKAIPK
Ga0179592_1000624033300020199Vadose Zone SoilMENAAVKYLRADAALRQSTALAPDAAAKLEKALELPLDGEDEKLVAAAEEALVEFHHGTTIKRCDWAVSAEDGPLANTAHRGAIRELVAVSVLRARLRFRDGDTPGAMGDALAAMAAARHLSVDGSLASVLFAYKLENTITGVLAQNLLRFSPAQLNELASGLDALPSGSSLSTAFESEKVSRNDLLAIAQPAKSRDELIERLLNKVPTLQSNRGVAAEIVDGCGGSVKGFLNCVDQQQSFYTSWAPRFTLPPEQFEKAYKAEIEEFSRANPVIRQFTPALPRFRLAEAYNQTRRALLHTAIAVRLDGPKALNLHRDPFDKNPFSYIPVDGGFRLESRLSEGGIPISLSIIPSSEGRRANPR
Ga0179592_1001119833300020199Vadose Zone SoilVLACDIPQKPSTRSEKAIRHLCYRLVCMKIGLALVLVAATGVTEGFSPTGGQEAENATVKYLRADAALRQSYALPPDAATQLLKALDSPLNGEDEKLVAAATEALVEFHHGAGLKRCDWTMSIEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYKETRRALLKAAIAVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD
Ga0179592_1003893113300020199Vadose Zone SoilSGRWPAIPDRRARRHFPRPPQKGRNRDAIREVHRQLPDARKQTVEPQRLAAECTLPFGCRVLPMRTAAAVAGLISICLPAWLGAQSPPLGNQQDTENAAVKYLRADAALRQSYALAPDAPTKLQKALESPFDGEVEKLVAAAEDALVEFRHGAAIKRCDWSMSLEDGPLANTAHRGAIMELVAVSGLRARLRFRDGDTPGALGDVLAAIAAARHLSVDGSLASVLFAYKLENAVAAILAQNLHGFSPAQLNELSTKLDALPKGFSLGTALESEKLGRNDLLTASQGAKDRDDLIGRLVNKIPVLQSKPELAREIVDGCGSSVVGFVNCVNQQQSFYASWASRFTLPPEQFEMSYKAEIQELSRTNPVVREFTPNLPRLRWAEAYSQTRRALLRAAIAVRMEGPDALNRHLDPYDGNPFPYAPVGSGFKLQSQLSEGGIPISLSILPGAENRKASPN
Ga0210403_1009153043300020580SoilMKTKEFTQRPRRKRAEFNEKRNQRKTRTLKGEGSSTRLRMKAITGLVVLFTMSAAIWLKAQNPPIENEKSTKNAAVKYLRAEASLRQSYALPPNAAANLQKALESPLDAEDEKLVAAADEALVEFHHGASLKRCDWVMSAEDGPLANTAHRGAVKELVAVAGIRARLRFRDGNIPGAMDDVLAAMAAARHLSVDGSLASVLFDYKLENSVTGVLARNLLQLSPAQLRELASGLNDLPSGSNLGTALVSEKLGRNELLAIAQNAKTPDELVEQLLHNVPALQSNRELAVQIVDGCGGSLKGFVNCVDQQHFFYESWGPRFALPPEQFEKTYKIEFDELSKTNPVVRQFTPALPRFRWAEADEQTRRALLQAAVATRLEGSQALNQHFDPYDKKPFTYTAVDGGFRLESRLTDGGIPISLLIVPSSEERKTIAK
Ga0210401_1000405463300020583SoilMKTKEFTQRPRRKRAEFNEKRNQRKTRTLKGEGSSTRLRMKAITGLVVLFTMSPAIWLKAQNPPIENEESTKNAAVKYLRAEASLRQSYALPPNAAANLQKALESPLDAEDEKLVAAADEALVEFHHGASLKRCDWVMSAEDGPLANTAHRGAVKELVAVAGIRARLRFRDGNIPGAMDDVLATMAAARHLSVDGSLASVLFDYKLENSVTGVLARNLLQLSPAQLRELASGLNDLPSGSNLGTALVSEKLGRNELLAIAQNAKTPDELVEQLLHNVPALQSNRELAVQIVDGCGGSLKGFVNCVDQQHFFYESWGPRFALPPEQFEKTYKIEFDELSKTNPVVRQFTPALPRFRWAEADEQTRRALLQAAVATRLEGSQALNQHFDPYDKKPFTYTAVDGGFRLESRLTDGGIPISLLIVPSSEERKTIAK
Ga0210401_1017122833300020583SoilMTAVVFVALCTGTGFGAERGAENAAVKYLRADVALRQSYPLAPDAASNLEKALESPLDAEDQKLVAAADEALVEFHNGSTLKTCDWTLSFQDGPFADTSHRGAIKELVAVAGLRARLRFRDGDIQGAMNDALAAMAAARHLSVDGTLASVLFAYKLENAISGVLARNLDQFSLAQLNELAIGLDALPSGSTLGSAFEAEKVRRNDLLPIAQGARTHDELIEHLLNGIPFLQSNKALAAEMVDGCGGTVNGFVNCVNQQQSFYTSWAPRFGFSPEQFETEYQTEIAELSKANPVIRVLTPALPRFRWAEAYCQTRRALLQAAIAVRRDGPSALNRHLDPYDGNPFSYISVDEGFRLQSRLKDNGIDLFLTIVPSGEDRSR
Ga0215015_1085737513300021046SoilMFPSIWLEAQTPSVGNQQNTENAAVKYLRADASLRQSYALPPDAVAKLQKALESPLDVEDEKLVVAADEALVEFHHGASIARCDWVMSAGDGPLANTAHRGAVKELVAVAGIRSRLRFRDGNTPGAIGDALAAMAAARHLSVDGSLASVLFAYKLENSITGVLAQNLFRLSPAQLHEFASGLNALPSGSDLSTAFESEKLSRNDLLAIVQDAKTRDELIEQLLHNIPVLESNRALAVEIVDGCGSSVKGFVNCVDQQHSFYVSWAPRFTLAPEQFEKAYKVEFDELSKANPVVREFTPALPLLRWAEAYERTRRALLQTAIAVRLEGPKSLNQHFDPYDQKPFTYTARDGGFRLESRLTDGGIPISLSIVSSAEERKAIPK
Ga0210406_1014943913300021168SoilGCLFTMSPAIWLKAQNPPIENEESTKNAAVKYLRAEASLRQSYALPPNAAANLQKALESPLDAEDEKLVAAADEALVEFHHGASLKRCDWVMSAEDGPLANTAHRGAVKELVAVAGIRARLRFRDGNIPGAMDDVLAAMAAARHLSVDGSLASVLFDYKLENSVTGVLARNLLQLSPAQLRELASGLNDLPSGSNLGTALVSEKLGRNELLAIAQNAKTPDELVEQLLHNVPALQSNRELAVQIVDGCGGSLKGFVNCVDQQHFFYESWGPRFALPPEQFEKTYKIEFDELSKTNPVVRQFTPALPRFRWAEADEQTRRALLQAAVATRLEGSQALNQHFDPYDKKPFTYTAVDGGFRLESRLTDGGIPISLLIVPSSEERKTIAK
Ga0210406_1019505123300021168SoilLARSGCRLLRMKAILGLVVLFTMSPSSWLAAQNPPIENEESTKNAAVKYLRAEASLRQSYALPPNAAANLQTALESPLDGEDEKLVAAADEALVEFHHGASIKRCDWAMSAEDGPLANTAHRGAVKELVAVAGIRARLRFRDGNIPGAMDDVLAAMAAARHLSVDGSLASVLFAYKLENSVTGVLARNLLRLSPAQLQELASGLNGLPSGSSLGTALESEKLRRNEFFAIAQNAKTRDELIEQLVHNIPALQSNRELAVQIVDGCGGTLKGFVNCVDQQHSLYVSWATRFALPPEQFEKTYKVEFDELSKTNPVVRQFTPALPRFRWAEADEQTRRGLLQAVVAVRLEGPQALNQHFDPYDKKPFTYTAVDGGFRLESRLTDGGIPISLSIMPRSEEQKAIPK
Ga0210400_1003294423300021170SoilMKATIVLVALFTIFPSIWLEAQVPPAENQLYTENAAVKYLRADASLRQSYPLPPDAAAKLQKALESPLDVEDEKLVAATDEALIEFRHGAAIKRCDWVMSVVDGPFANTAHRGAIKELVAVSGIRARLRFRDGNMPGAIEDALAAMAAARHLSVDGSLASVLFAYKLEDSVTGVLVQNLLRPSPAQLQELASGLNALPSGSNLSTAFESEKLSRNDLLAVVQDATSRDELIEHLLRNIPALQSNRGLSAQIVDGCGGSVKGYVNCVDQQHSFYVSWATRFKFPPEQFEKTYKAEFDELSKTNAVVRQFTPALPRIRWAEAYEQTRRALLHAAIAVQLEGPKVLNQQLDPYDKKPFTYTAVDGGFRLESLLTDGGIPISLSILPNSEERKAIPK
Ga0210396_1001375983300021180SoilRGEGSSTRLRMKAITGLVVLFTMSAAIWLKAQNPPIENEKSTKNAAVKYLRAEASLRQSYALPPNAAANLQKALESPLDAEDEKLVAAADEALVEFHHGASLKRCDWVMSAEDGPLANTAHRGAVKELVAVAGIRARLRFRDGNIPGAMDDVLAAMAAARHLSVDGSLASVLFDYKLENSVTGVLARNLLQLSPAQLRELASGLNDLPSGSNLGTALVSEKLGRNELLAIAQNAKTPDELVEQLLHNVPALQSNRELAVQIVDGCGGSLKGFVNCVDQQHFFYESWGPRFALPPEQFEKTYKIEFDELSKTNPVVRQFTPALPRFRWAEADEQTRRALLQAAVATRLEGSQALNQHFDPYDKKPFTYTAVDGGFRLESRLTDGGIPISLLIVPSSEERKTIAK
Ga0213881_10000169193300021374Exposed RockMLSAFSLGASPARTKQERTENAAVKYLRADAALRQAYPLPPDAGAKLERSLESPLNADDQKLIAAAQDALAEFEHGASLANCDWAMSFEDGPLANTAHRGAVRELVAVAGLRARLRFQSGDTPGALKDALAAIAGARHLSVDGSLASVLIAHKLEKELTELLAQNLDRFSPDELNQLTTGLDALPRGSNVSRAFETEKLQRNDLLPLAEVANSREGLIEELLKRIPALQSNNALAVEIVDGCGGSVNGFATCVRQQESFYASWAPLFTLPPDQFESRYKSEIEVVSKTNAVIRVFTPNLPRFRWAEAYCETRRALLKVAINIRLNGPNALKQVLDPYDRTLFSYVPVEGGFQLKSHLNDGGTPISLVIGSR
Ga0210386_1001427723300021406SoilMKTKEFTQRPRRKRAEFNEKRNQRKTRTLKGEGSSTRLRMKAITGLVVLFTMSPAIWLKAQNPPIENEESTKNAAVKYLRAEASLRQSYALPPNAAANLQKALESPLDAEDEKLVAAADEALVEFHHGASLKRCDWVMSAEDGPLANTAHRGAVKELVAVAGIRARLRFRDGNIPGAMDDVLATMAAARHLSVDGSLASVLFDYKLENSVTGVLARNLLQLSPAQLRELASGLNDLPSGSNLGTALVSEKLGRNELLAIAQNAKTPDELVEQLLHNVPALQSNRELAVQIVDGCGGSLKGFVNCVDQQHFFYESWAPRFALPPEQFEKTYKIEFDELSKTNPVVRQFTPALPRFRWAEADEQTRRALLQAAVATRLEGSQALNQHFDPYDKKPFTYTAVDGGFRLESRLTDGGIPISLLIVPSSEERKTIAK
Ga0210386_1005310023300021406SoilMKPIIAMLVLCVLFPSFWLGAQTPSGGNHQSAENAAVKYLRADASLRQSYPLPTDAVPNLQKSVESPLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSSEDGPLTSTAHRGAIKELVAVAEIRSRLRFRDGDIPGAIDDAVAAMAAARHLSVDGSLASVLFAYRIENSVTGIVARNLLRLSTAQLRELENAINGLPNGSDLKIAFEAEKLDRNDILASVQGATTRDELIEGLLRNIPVLKSNRELAVQIVDGCGGSVRGFTHCVDQQQSFYVSWAPRFTLPPEEFEKAYKVEFDSLSKANPVVWQFTPSLPRFRWAEAYEQTRRALLHTAIAVRLDGPKAVSLSADPYNRKPFTYIALGEGFKLESQLLDGGVPISLSIVPGAEDRKPVLK
Ga0210386_1048432613300021406SoilGAERGAENAAVKYLRADVALRQSYPLAPDAASNLEKALESPLDAEDQKLVAAADEALVEFHNGSTLKTCDWTLSFQDGPFADTSHRGAIKELVAVAGLRARLRFRDGDIQGAMNDALAAMAAARHLSVDGTLASVLFAYKLENAISGVLARNLDQFSLAQLNELAIGLDALPSGSTLGSAFEAEKVRRNDLLPIAQGARTRDELIEHLLNGIPFLQSNKALAAEMVDGCGGTVNGFVNCVNQQQSFYTSWAPRFGFSPEQFETEYQTEIAELSKANPVIRVLTPALPRFRWAEAYCQTRRALLQAAIAVRRDGPSALNRHLDPYDGNPFSYISVDEGFRLQSRLKDNGIDLFLTIV
Ga0210383_1019984223300021407SoilALESPLDGEDEKLVAAADEALVEFHHGAALKECDWELSFEDGPFANTSHRGAIKELVAVSGIRARLRFREGNLQGAMNDALAAMAAARHLSVDGTLASVIFAYKLENAIAGVLARNLGQFSAAQLNELADGLNALPPGSNLPTALESEKVGRNDLLPIAQGAKTREELIERLASGIPFLQSNRAQATELVDGCGGSVAGFVSCLNQEQSFYNSWVPRFGTSPEQFEEEYKAEIKDLSRTNPVVRLLTPSLPRFRWEEAYCQTRRALLRAAIAVRMDGPTSLNRHLDPFDGTPFSYVSASEGFRLESRLKDHGIPLFLTIVPGTEGRNVNEK
Ga0210394_1005651513300021420SoilMKTKEFTQRPRRKRAEFNEKRNQRKTRTLKGEGSSTRLRMKAITGLVVLFTMSPAIWLKAQNPPIENEESTKNAAVKYLRAEASLRQSYALPPNAAANLQKALESPLDAEDEKLVAAADEALVEFHHGASLKRCDWVMSAEDGPLANTAHRGAVKELVAVAGIRARLRFRDGNIPGAMDDVLAAMAAARHLSVDGSLASVLFDYKLENSVTGVLARNMLQLSPAQLRELASGLNDLPSGSNLGTALVSEKLGRNELLAIAQNAKTPDELVEQLLHNVPALQSNRELAVQIVDGCGGSLKGFVNCVDQQHFFYESWGPRFALPPEQFEKTYKIEFDELSKTNPVVRQFTPALPRFRWA
Ga0210391_1052323213300021433SoilSPLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSSEDGPLTSTAHRGAIKELVAVAEIRSRLRFRDGDIPGAIDDAVAAMAAARHLSVDGSLASVLFAYRIENSVTGIVARNLLRLSTAQLWELENAINGLPNGSDLKIAFEAEKLDRNDILASVQGATTRDELIEGLLRNIPVLKSNRALAEQIVDGCGGSVRGFTHCVDQQQSFYVSWAPRFTLPPEEFEKAYKVEFDSLSKANSVVWQFTPSLPRFRWAEAYEQTRRALLHTAIAVRLDGPKAVSLSADPYNRKPFTYIALGEGFKLESQLLDGG
Ga0210390_1033678913300021474SoilMLVPCVLLSSFWLGAQTPSAGNHQSAENAAVKYLRADASLRQSYPLPTDAVPNLQKSVESPLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSSEDGPLTSTAHRGAIKELVAVAEIRSRLRFRDGDIPGAIDDAVAAMAAARHLSVDGSLASVLFAYRIENSVTGIVARNLLRLSTAQLRELENAINGLPNGSDLKIAFEAEKLDRNDILASVQGATTRDELIEGLLRNIPVLKSNRELAVQIVDGCGGSVRGFTHCVDQQQSFYVSWAPRFTLPPEEFEKAYKVEFDSLSKANPVVWQFTPSLPRFRWAEAYEQTRRALLHTAIAVRLD
Ga0210402_1046718713300021478SoilRQSYPLPPDAAEKLQKAVESPLDAEDERLVAAADEALVEFRHGAAIERCDWVISAEDGPVANTAHRGAIKELIAVAEIRARLRFRDGDTPGAMSDVLAAMAAARHLSVDGSLASVLFAYKLENSVTGVLVQNLLRLSPAQVHELASGLNALPRGSNLSTAFESERLGRNDFFLAIVQGAKTRGELIEQLLHNIPALDSNKVLAAEIVDGCGGSVKGYVDCVGQQHSFYVSWASRFILPPEQFEKAYRTEFDEVSKTNPVVRQFTPALWRFRWTEAYEQTRRALLNAAIAVRLEGPNALNQHFDPYDKKPFAYAVVDGGFRLESLLTEGGIPLSLSIVLSSEERKGIPK
Ga0179591_121724213300024347Vadose Zone SoilTLPPDAAMQLQKALESPLDTEDEKLVTAGSEALIEFHHGARGSRCDWAMSAGDGPLANTAHRGAIRELVALAGIRARQRFRDGDISGALDDALAAIAAARHLSTDGSLASVLIAYRLENSVTGILSQNLLRLSPAQLHDLASGLNGLPSGSNLGAALESEKLNRNDLLAVVQGAKTRDELIEQLLAKLPVLKSNRALTVEIVDGCGGSVKGYVDCVDQQHSFYEVWATRFALPPSQFEKDYKTEFDEISRTNPVVRQFTPALPRLRWAEAVEQTRRAMLQTAIAVRLESPQTLNQHLDPYDKTPFTYTVVDGGFCLNSQLSDSGVPISLSVLPSSADQKTIPK
Ga0207692_1009170213300025898Corn, Switchgrass And Miscanthus RhizosphereAVKYLRADASLRQSYPLPADAMPNLQKSVESPLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSSEDGPLTNTAHRGAIKELVAVAEIRSRLRFRDGDIPGAIDDAVAAIAAARHLSLDGSLASVLFAYRIENSVTGIVARNLLRLSTVQLRELESAINGLPNGSDLKIAFEAEKLDRNDILASVQGATTRDELIEGLLRNIPVLKSNRVLAAQIVDGCGGSVRGFTHCVDQQQSFYVSWVPRFTLSPEEFEKAYKVEFDSLSKANPVVWQFTPALPRFRWTEAYEETRRALLHTAIAVRLDGPKAVNLSLDPYNGKPFTFIALGEGFRLESQLVDGGIPISLSVVPGAEDRKTVSK
Ga0207663_1022497613300025916Corn, Switchgrass And Miscanthus RhizosphereAWSTPGRSLPEMSGNVYRAVDSLTRRRRDLTSLLIPQGKLSGVKPIIAPLILCVLLPPSWLKAQTPSAGNHQSAENAAVKYLRADASLRQSYPLPADAMPNLQKSVESPLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSSEDGPLTNTAHRGAIKELVAVAEIRSRLRFRDGDIPGAIDDAVAAIAAARHLSLDGSLASVLFAYRIENSVTGIVARNLLRLSTVQLRELESAINGLPNGSDLKIAFEAEKLDRNDILASVQGATTRDELIEGLLRNIPVLKSNRVLAAQIVDGCGGSVRGFTHCVDQQQSFYVSWVPRFTLSPEEFEKAYKVEFDSLSKANPVVWQFTPALPRFRWTEAYEETRRALLHTAIAVRLDGPKAVNLSLDPYNGKPFTFIALGEGFRLESQLVDGGIPISLSVVPGAEDRKTVSK
Ga0207700_1049417013300025928Corn, Switchgrass And Miscanthus RhizosphereLSDVKPIIAPLILCVLLPPSWLKAQTPSAGNHQSAENAAVKYLRADASLRQSYPLPADAMPNLQKSVESPLDVEDEKLVAAADEALVEFHHGAASNRCDWVMSSEDGPLTNTAHRGAIKELVAVAEIRSRLRFRDGDIPGAIDDAVAAIAAARHLSLDGSLASVLFAYRIENSVTGIVARNLLRLSTVQLRELESAINGLPNGSDLKIAFEAEKLDRNDILASVQGATTRDELIEGLLRNIPVLKSNRVLAAQIVDGCGGSVRGFTHCVDQQQSFYVSWVPRFTLSPEEFEKAYKVEFDSLSKANPVVWQFTPALPRFRWTEAYEETRRALLHTAIAVRLDGPKAVNLSLDPYNGKPFTF
Ga0209350_101114033300026277Grasslands SoilMRTATALAALVAMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK
Ga0209055_109519313300026309SoilALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK
Ga0209131_101893823300026320Grasslands SoilVLACDIPQKPSTRSEKAIRHLCYRLVCMKMGLALVLVAATGVTEGFSPTGGQEAENATVKYLRADAALRQSYALPPDAATQLLKALDSPLNGEDEKLVAAATEALVEFHHGAGLKRCDWTMSVEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYRETRRALLKAAIVVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD
Ga0209472_100369073300026323SoilMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGTLASVLFASKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK
Ga0209152_1006641413300026325SoilALRQSYALAPDAAAKLEKALASPLDEEDEKLVAAAEDALVEFHHGAASKGCDWTMSDEDGALANTAHRGAITELVAVSGLRARLRFRDGNTPGAIDDALAAIAAVRHLSVDGSLASVLFAYKLERAITAVLAQNLLRFSPAELNELASGLGALPSGFDLGTAFESEKVRRNDFLAIAQATKSRDELIEQLLNKVPVLRSNKELAGEIVDGCGGSVKGFVNCVEQQQSFYTSWAPRFALSPEQFEKAYKAEIEELARVNSVIRQFTPPLPRFRWAEAYNETRRSLLHAAIVVRLDGPKGLNQHLDPFDQNPFSYIPIDGGFRLESRLTEGGIPISISIVPNSEERKASPR
Ga0209158_108283213300026333SoilLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK
Ga0257172_100229923300026482SoilMFVAMCSPTWLGAQLPPAVNQQDTENAAVKYLRADAALRQSYALAPDSAAKLEKALELPLDEEDKKLVGAAEEAFVEFDHGATIKRCDWAMSVEDGPLASTAHRGAIRELVAVSGLRARLRFRDGDTRGAIGDALAAMAAARHLSVDGSLASVLIAYKLEKAITGVLAQNLLRLSPAQLNELASGLDALPSGFSLGTAFKSEKLSRNDLLSVAQVAKSRDELIEQLSNKIPALQSNKGLAGEIVDGCGGSVKGFVNCVDQQRSFYTSWAPRFTLPPEQFEKAYKAEIGDLSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAIRLEGPRALNQHLDPYDKNPFSYIPVNGGFRLESRLSDGGIPISLSIVPNSEERK
Ga0209160_104207823300026532SoilMKAAIALAVLAEMFCSIRLGAQTLPDGNQQDIENAAVKYLRADASLRQSYALPSDAAAKLQKALDSPLNEEDERLVAAADEALTEFGHGAASRRCNWEMSTEDGPLASTAHRGAIMELVSVSGLRARLRFRDGDTPGAMDDLLAAMAAARHLSVDGSLASVLFAYKLENALTRVLALNLYHFSSGQLKELKSRLDDLPTGSSLGAAFAAEKVGRNNVLDIAQRAKSRDELIEMLLKNVPILESNRGLAIEVVDGCGGTVKDFLNCVDQQQSLYNAWASRFNLAPEQFEREYKAEIEKVSKENPVIRQFTPALPRFRWTEAYCQTRRALLQAAIAVELDGASTLSRHLDPYDRNRFSYGPVDRGFRLQSQLSDNGIPISLLVVTKPTNDALSPD
Ga0209376_1000016723300026540SoilMRTATALAALVAMCFPTRLGAQVRPAVNNQDMENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTALESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK
Ga0209161_1005898223300026548SoilMENAAVKYLRADAALRQSYALAPDAAARLEKALESPLDVDDEKLVAAAEDALVEFHHGAASKRCDWTMSDEDGALANTAHRGAITELVAVSGLRARLRFRDGDTPRAMDDALAAIAAARHLSVDGSLASVLFGYKLERTITGVLAQSLLRLSPAQLNELASGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLKKVPALQSNKGLAGEIVDGCGGSIKGFVNCVDQQQSFYAAWARRFALPPEQFEKTYKGEFEELARANPFVRQFTPDLSRFRWAEAYNQTRRALLQAAIAVRLDGPKALNQHPDPYNKTTFSYIPVDGGFRLESLLREGGIPISLSIVPNSER
Ga0209474_1005608433300026550SoilVATCSSTWLGAQSSLPAVNLQRTENAAVKYLRADAALRQSYALSPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFRHGAATKRCDWEVSVEDGPLANTAHRGAIKELIAVSGLRARLRFRDGDMPGATGDALAAMAAARHLSVDGSLASVLFAYKLETTVTGVLAQNLLRLSPAQLNELAGGLGALPSGFSLGTAFESEKVRRNDFLAIVQAAKTRDELIEQLLNKVPVLQSNRGLASEIVDGCGGSVKGFVNCVDQQQSFYRSWAPRFALPPEQFEKTYKGEIEEFARSNPVIRLFTPSLPRFRWAEAYNQTRRTLLQTAIAVRLDGPRALNQHLDPYDKKPFSYIPIDGGFRLESRLSEGAIPISLSIVANSEARKTSPK
Ga0209648_1000901113300026551Grasslands SoilMMCSQTSLGAQSPPAANQQYMENAAVKYLRADAALRQSYTLAPDAAAKLEKALELPLDGEDEKLVAAAEDALVEFHHGTTIKRCDWAVSAEDGPLANTAHRGAIKELVAVSGLRARLRFRDGDIPGAMGDALAAMAAARHLSVDGSLASVLFAYKLENAITGVLVQNLLGFSPAELNELASGLDALPSGSSLGTAFESEKVSRNDLLAIVQVAKSRDELIERLLNKVPALQSNRGLAGEIVDGCGGSVKGFLNCVDQRQSFYTSWAPRFALPPEQFESAYKAEIEEVSRSNPVIRVFTPALPRFRWAEAYNQTRRVLLHAAIAVRLDGPRALNQHLVNSG
Ga0209648_1014559523300026551Grasslands SoilMKVAIVLVVPFTMFPSIWLGAQTPPVGNQQNTENAAVKYLRADASLRQSYALPPDAAAKLQKALESPLDAEDEKLVAAADEALVEFRHGASIKRCDWVMSAEDGPLANTAHRGAIKELAAVAGIRSRLRFRDGNTPGAMDDALAAMAAARHLSVDGSLASVLFAYKLENSVTGVLVQNLLRLSPAQLHELASGLNGLPSGSNLGTALELEKLSRNELWAIAQNAKTRDELIEQLLHNIPALQSNRGLAVEIVDGCGGSVKGFVNCVDQQHSLYVSWAPRFALPPEQFEKAYKVEFDELSKANPVVRQFTPALPRFRWAEAYEQTRRALLHAAVAVRLDGPKALNQHFDPFDKKPFTYTAVDGGFRLESR
Ga0179587_1001684433300026557Vadose Zone SoilVLACDIPQKPSTRSEKAIRHLCYRLVCMKMGLALVLVAATGVTEGFSPTGGQEAENATVKYLRADAALRQSYALPPDAATQLLKALDSPLNGEDEKLVAAATEALVEFHHGAGLKRCDWTMSIEDGPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYKETRRALLKAAIAVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD
Ga0209076_103346823300027643Vadose Zone SoilMFRLLPMRTSRAIAMFVAMCSPTWLGAQLPPAVNQQDTENAAVKYLRADAALRQSYALAPDSAAKLEKALELPLDEEDKKLVGAAEEAFVEFDHGATIKRCDWAMSVEDGPLASTAHRGAIRELVAVSGLRARLRFRDGDTRGAIGDALAAMAAARHLSVDGSLASVLIAYKLEKAITGVLAQNLLRLSPAQLNELASGLDALPSGFSLGTAFKSEKLSRNDLLSVAQVAKSRDELIEQLSNKIPALQSNKGLAGEIVDGCGGSVKGFVNCVDQQRSFYTSWAPRFTLPPEQFEKAYKAEIGDLSRANPVIRQFTPALPRFRWAEAYNQTRRALLHTAIAIRLEGPRALNQHLDPYDKNPFSYIPVNGGFRLESRLSDGGIPISLSIVPNSEERK
Ga0208990_102036223300027663Forest SoilMKAIIVLVVLFTMSPSSWLKAQNSPVENQENAKNAAVKYLRADASLRQSYALPPDAAAKLQKALESPLDADDEKLVAAGDEALVEFRHGASIKRCDWVMSAEDGALANTSHRGAIKELVAVAGIRSRLRFRDGNTQGAMEDALAAMAAARHLSVDGSLASVLIAYKLENSAAGILAQNLHRLPSPQLHELANGLSNLPGGSNLGTALESEKLSRNEFLTLAQNAKTRDELIEQLLQNIPVLQSNRELAAQIIDGCGGSVKGFVNCVDQQRSFYESWAPRFTLPPEQFEKAFKVEFTELSKKNPVVRQFTPALPRFRWVEADQQTRRALLQAAIAVRLEGPEALNQHSDPYDKRPFTYTVVDGGFRLESRLTDDGIPISLLIVPSSEEQKAVPK
Ga0209515_1005924723300027835GroundwaterMRTATALAVFVAMCCPTRLGAHFPPAGNQQDTENAAVKYLRADAALRQSYALAPDAAAKLQKALESPLDGEDEKLVAAAEDALVEFHRGATIKRCDWVMSKEDGPLANTAHRGAIRELAAVSGLRARLRFRDGDTPGAMGDALAAMAAARHLSVDGSLASVLFAYRLENALTGILARNLHRFSAPQLNELASGLDALPSGSSLSTAFESEKVRRNELLDIAQGAKSRDDLIERLLNKAPALKSNRGLAGEIVDGCGGSVKGFVTCANQQQSFYTSWAPRFTLPPEQFEKAYKAEIEELSRANPVIRQFTPALQRFRWAEAYNQTRRALLQAAIAVRLDGARALNQHLDPYDRNPFSYIPVDGGFRLESRLSEGGTPISLSSVPSSEDRKASSK
Ga0209166_1001841733300027857Surface SoilMKTLIVSVVLFTMPPAIWLKAQNPPIEDEESTKNAAVKYLRAEASLRQSYALPPNAAANLQKALESPLDAEDEKLVAAADRALIEFRRGASIKRCDWAMSAEDGPLANTAHRGAVKELVAVAGIRARLRFRDGNIPGAMDDVLAAMAAARHLSVDGSLASVLIAYKLEDSVTGVLVQNLLRFSPAQLRELSNGLAGLPGGSNLGAALESEKLGRNDFVAIIQSAKTRDDLIEQLLQDIPALQSDRGLAAQIVDGCGGSVKGFVNCVDQQHSFYESWAPRFALPPEQFEMDYKVEFDEISKMNPVARQFIPALPRFRWAEADEQTRRALLQTAIAVRLLGPEALNQHIDPYDKKPFAYTAVDGGFHLESRLTDGVIPISLSIMPSSKEQKAIPK
Ga0209701_1003601133300027862Vadose Zone SoilMKTSLSLGLLVAMCSPTWLGAQFPPGGHQLDRENAAVKYLRADASLRQSYVLAPDAAAKLLQAVESPLDGEDEKLVAAAEDALVEFHHGAALKRCDWAMSEEDGPLANTAHRGAIMELVAVSGLRARLRFRDGNSPGATGDALAAMAAARHLSVDGSLASVLIAYKLENALTGILARNLHRFSPAQLNELASGLDSLPNGSSLATAFESEKVRRNDLLDIVEGAKSRDGLIVLLLNKLPILQSNRALAAEIVDGCGGSVTGFVTCADRQHSFYKAWASRFSLPPEQFERAYKAEIEEVSRANPVIQQFTPALPRFRWAEAYSQTRRALLQTAIAVRLDGPSALNRQLDPYDRNPFSYIPVDGGFRLESRLREGGTPISLSIVPSL
Ga0209701_1012856823300027862Vadose Zone SoilLPRSECKLPSMKATIVLVVLFTMFPAIWLGAHNPPIENQQNTENAAVKYLRADASLRQSYALPPDASAKLQKALESPLDVEDEKLVAAAEEALVEFRHGATINRCDWVMSAEDGPLANTAHRGAIRELVAVAEIRARLRFRDGNMTGAMEDALAAMAAARHLSVDGSLASVLVAYKLEKSVTGVLTQNLFRFSPAQLHELERGLNALPSGSNLSTAFGSEKLSRNDLLSVVQDAKNREELIEQLLHRIPALESNRGLAVEIVDGCGGSIKGYVNCVDQQHSFYVSWAPRFTLPPEQFEKAYKVEFDELSKTNPVVRQFTPALPRFRWAEAYEQTRRALLHSAIAVRLEGPKVLNQHLDPYDQKPFTYTALDGGFRLESRLTDGEIPISLSILPNSEERKTIPK
Ga0137415_1017694423300028536Vadose Zone SoilPLANTAHRGAVKELVAVAGLRARLRFRDGDTDGAVSDALAAMAAARHLSGDGSLASVLFAYGLEDAVTRVLAQNLYRLSAGELNRLASGLDALPSGSSLAIAFKSEKVDRNDLFLRLADGATSRDDLVARLLKKVPILESDRARASKIVDGCGSCVSGFRTCAQQQQSFYASWSSRFTMPPKQFESTYKSEMEGIAGANPLIRLFTPNLPRFRWADSYRETRRALLKAAIVVRLDGPSAVNQHPDPSDGNPFSYIPVGGGFQLESRQNEAGSPLALSIPTSPAGDSGSPEQSVPTPNMD
Ga0137415_1018523533300028536Vadose Zone SoilSECKLLSMKATIVLVVLFTMFPSIWLGAQVPPAENQPYAENAAVKYLRADVSLRQSYALPPDAAAKLQKALESPLDMEDEKLVAAADEALIEFRHGAATKRCDWVMSVEDGPFANTAHRGAIKELVAVAGIRARLRFRDGNMPGAIDDALAAMAAARHLSVDGSLASVLFAYKLEDSVTGVLVQNLLRPSPAQLQELASSLNALPSGSNLSTAFESEKLSRNDLLAVVQDAKSRDEVIEHLLHDIPALQSNRGLAAQIVDGCGGSVKGYVNCVDQQHSFYVSWAPRFRLPPEQFEKTFKIEFDELSKTNPVLRQFTPALPRIRWAEAYEQTRRALLHAAIAIQLEGPKVLNQQLDPYDKKPFTYTATDGGFRLESRLADGGIPISLSILLNSEERKAIPK
Ga0137415_1034172113300028536Vadose Zone SoilAAAKLQKALESPLDAEDEKLVAAADEALVEFRHGASIQRCDWVMSAEDGPLASTAHRGAIKELVAVAGIRSRLRFRDGNTPGAMDDALAAMAAARHLSVDGSLASVLFAYKLENSVTLVLVQNLLRLSPAQLHELASGLTGLPSGSNLGTALESEKLSRNELLAIAQNAKTRDELIEQLLHNIPALQSNRGLAVEIVDGCGGSVKGFVNCVDQQRSLYVSWAPRFTLPPDQFEKAYKVEFDELSKANPVVRQFTPALPRFRWAEAYEQTRRALLHAAVAVRLDGPKALNQHFDPFDKKPFTYTTVDGGFRLESRLADGGIPISLSIVPSSEEARVIPK
Ga0310686_10803084823300031708SoilVRYLRADVALRQSYPLPPDAGSKLEKALESPLDGEDEKLVAAADEAPVEFHHGAALKTCDWELSIEDGPLADTSHHGAIKELVAVSGLRARLRFRDGNLPGAMNDALAAMAAARHLSVDGTLASVLFAYKLERMISGVLERNLDQFSPSQLNELAIGLDALPSGSSLGGAFEAEKVLGKDLLPMAQGAKTRSELIERLRNGIPFLQSNEVLAAEIVDGCSGFVNGFVNCVNQQQSFYISWAPRFGLSPEQFETEYKVEIAELSKKNPLIRMLTPALQRFRWEEAYCQTRRALLHAAIAVRLHGTGALSRHLDPYDGIPFLYISVDGGFRLESLLKDNGVSLSLTIVPGTEDRSAIEK
Ga0307475_1044519513300031754Hardwood Forest SoilFWLGAQTPCAGNHQSAENAAVKYLRADASLRQSYALPADAVPKLQKSLESPLDGDDEKLVAAADEALVEFHHGAASNRCDWVMSAEDGPLANTAHRGAIKELVAVAEIRSRLRFRDGDMPGAIDDAVAAMAAARHLSVDGSLASVLFAYKLEDSVTGILARNLLRLSSTQLRELESAINGLPSGSDLRAAFESEKLSRDDILASVQGAKTRDDLIAGLLRNIPILGSNRELAAQIVDGCGGSVKGFTACVDQQQSFYVSWAPRFALPPEEFDKAYKVEFDKLSKANPVVWQFTPNLARFRWTEAHEQTRRALLQTAIAVRLDGPQALNQHFDPYDQKLFVYTVLDEG
Ga0307479_1004573333300031962Hardwood Forest SoilMQAKIFLVMLFTIFSSIWLGAQSLPVEKQLIAENAAVKYLRADASLRQSYPLPPDAAEKLQKAVESPLDAEDERLVAAADEALVEFRHGAAIERCDWVISAEDGPVANTAHRGAIKELVAVAEIRARLRFRDGDTPGAMSDVLAAMAAARHLSVDGSLASVLFAYKLENSVTGVLVQNLLRLSPAQVHEFASGLNALPRGSNLSTAFESERLGRNDFFLAIVQGAKTRGELIEQLLHNIPALDSNKVLAAEIVDGCGGSVKGYVDCVGQQHSFYVSWASRFILPPEQFEKAYRTEFDEVSKTNPVVRQFTPALWRFRWTEAYEQTRRALLNAAIAVRLEGPNALNQHFDPYDKKPFAYAVVDGGFRLESLLTEGGIPLSLSIVLSSEKRKGIPK
Ga0307470_1006043713300032174Hardwood Forest SoilALVCSPVYIAGQAPAGHNESVENAAVKYLRADASLRQSYPLPPDSIAKLEAALNEPLDLEDEKLVVAADAALAEFSHGAAIKHCDWAISVEDGPFANTAHRGVIRELAAVAGLRARLRFRQGDTPGAMADLLAALAGARHLSMDGSLASVLFAYKLETLLTGVLARNLHRFSPAQLTELARGLDALPGGFSMGAAFEAEKVRRDDLFFLAILQGTKTRDELIERLVNKVPALQSNKTLAGKLVDGCGGSLAGFVNCLNQKHSFYESWASRFTLPPEQFETDYKVEIEKASTENPLVREFTPNLPRFRWAETYSQTRRALLRAAVAVRLNGPAVLNQNLDPYDRNPFSYVPLDGGFRLESRLRDGGVPLTLSIVASSDEKN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.