NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F072996

Metagenome Family F072996

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F072996
Family Type Metagenome
Number of Sequences 120
Average Sequence Length 452 residues
Representative Sequence MPASALHSFSKAIEIPARLHNQPLDAWPTSARLTTLLSRFGIRVLGDLHGRKIVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGRSAFELLQYKACGWGTISEVQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSMEAHVRLIAALDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFEDPEVAMVSLRPSNPGSLFRDVPGQNGKSSTAAKVHSNLPAIQFFGTKNAFCERHAVGRR
Number of Associated Samples 97
Number of Associated Scaffolds 120

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 39.17 %
% of genes near scaffold ends (potentially truncated) 40.83 %
% of genes from short scaffolds (< 2000 bps) 38.33 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score0.26

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.167 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(24.167 % of family members)
Environment Ontology (ENVO) Unclassified
(49.167 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(38.333 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 44.93%    β-sheet: 3.98%    Coil/Unstructured: 51.09%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.26
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 120 Family Scaffolds
PF02518HATPase_c 31.67
PF05922Inhibitor_I9 3.33
PF13462Thioredoxin_4 2.50
PF00486Trans_reg_C 2.50
PF11396PepSY_like 2.50
PF04545Sigma70_r4 2.50
PF00082Peptidase_S8 1.67
PF03413PepSY 1.67
PF00127Copper-bind 0.83
PF12390Se-cys_synth_N 0.83
PF10262Rdx 0.83
PF12705PDDEXK_1 0.83
PF02397Bac_transf 0.83
PF00586AIRS 0.83
PF13361UvrD_C 0.83
PF09699Paired_CXXCH_1 0.83
PF12801Fer4_5 0.83
PF03841SelA 0.83
PF07994NAD_binding_5 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 120 Family Scaffolds
COG1404Serine protease, subtilisin familyPosttranslational modification, protein turnover, chaperones [O] 3.33
COG1260Myo-inositol-1-phosphate synthaseLipid transport and metabolism [I] 0.83
COG1921Seryl-tRNA(Sec) selenium transferaseTranslation, ribosomal structure and biogenesis [J] 0.83
COG2148Sugar transferase involved in LPS biosynthesis (colanic, teichoic acid)Cell wall/membrane/envelope biogenesis [M] 0.83


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms99.17 %
UnclassifiedrootN/A0.83 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2088090014|GPIPI_16557131All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2031Open in IMG/M
2088090014|GPIPI_17070215All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus7521Open in IMG/M
2124908045|KansclcFeb2_ConsensusfromContig1201830All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1528Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_100841345All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1299Open in IMG/M
3300000596|KanNP_Total_noBrdU_T14TCDRAFT_1003532All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1825Open in IMG/M
3300000709|KanNP_Total_F14TBDRAFT_1000198All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3109Open in IMG/M
3300000955|JGI1027J12803_100168822All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2675Open in IMG/M
3300002899|JGIcombinedJ43975_10000852All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia4409Open in IMG/M
3300002899|JGIcombinedJ43975_10004009All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → unclassified Chthoniobacterales → Chthoniobacterales bacterium2370Open in IMG/M
3300003911|JGI25405J52794_10000542All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia5499Open in IMG/M
3300003911|JGI25405J52794_10005474All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2291Open in IMG/M
3300004114|Ga0062593_100216533Not Available1544Open in IMG/M
3300004643|Ga0062591_100131328All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1701Open in IMG/M
3300005186|Ga0066676_10115445All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1649Open in IMG/M
3300005293|Ga0065715_10146984All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1777Open in IMG/M
3300005332|Ga0066388_100000292All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia22180Open in IMG/M
3300005332|Ga0066388_100008315All Organisms → cellular organisms → Bacteria7754Open in IMG/M
3300005332|Ga0066388_100335852All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2157Open in IMG/M
3300005332|Ga0066388_100836941All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1508Open in IMG/M
3300005471|Ga0070698_100001580All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia25326Open in IMG/M
3300005764|Ga0066903_100130413All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3533Open in IMG/M
3300005764|Ga0066903_100355400All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2372Open in IMG/M
3300005764|Ga0066903_100772889All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1711Open in IMG/M
3300005937|Ga0081455_10000566All Organisms → cellular organisms → Bacteria48189Open in IMG/M
3300005937|Ga0081455_10001168All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia32847Open in IMG/M
3300005937|Ga0081455_10009495All Organisms → cellular organisms → Bacteria9998Open in IMG/M
3300005983|Ga0081540_1001303All Organisms → cellular organisms → Bacteria21760Open in IMG/M
3300009137|Ga0066709_100216171All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2534Open in IMG/M
3300009137|Ga0066709_100903772All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1287Open in IMG/M
3300009553|Ga0105249_10261187All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1721Open in IMG/M
3300010043|Ga0126380_10024169All Organisms → cellular organisms → Bacteria → Proteobacteria2974Open in IMG/M
3300010046|Ga0126384_10203882All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1568Open in IMG/M
3300010047|Ga0126382_10075265All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2087Open in IMG/M
3300010362|Ga0126377_10127760All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2358Open in IMG/M
3300010371|Ga0134125_10051483All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia4581Open in IMG/M
3300010396|Ga0134126_10048032All Organisms → cellular organisms → Bacteria5350Open in IMG/M
3300010398|Ga0126383_10300699All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1603Open in IMG/M
3300012199|Ga0137383_10041488All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3267Open in IMG/M
3300012200|Ga0137382_10207773All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1347Open in IMG/M
3300012201|Ga0137365_10007686All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus8611Open in IMG/M
3300012201|Ga0137365_10015955All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus5886Open in IMG/M
3300012201|Ga0137365_10147119All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1770Open in IMG/M
3300012204|Ga0137374_10212930All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1652Open in IMG/M
3300012208|Ga0137376_10261621All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1501Open in IMG/M
3300012210|Ga0137378_10011167All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus7786Open in IMG/M
3300012285|Ga0137370_10113325All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1539Open in IMG/M
3300012353|Ga0137367_10085411All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2332Open in IMG/M
3300012355|Ga0137369_10073306All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2891Open in IMG/M
3300012357|Ga0137384_10004361All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus11329Open in IMG/M
3300012358|Ga0137368_10109091All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2126Open in IMG/M
3300012360|Ga0137375_10020591All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus7668Open in IMG/M
3300012360|Ga0137375_10192050All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1942Open in IMG/M
3300012582|Ga0137358_10080040All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2198Open in IMG/M
3300012685|Ga0137397_10087969All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2265Open in IMG/M
3300012922|Ga0137394_10130295All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2134Open in IMG/M
3300012924|Ga0137413_10002139All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus8141Open in IMG/M
3300012925|Ga0137419_10024975All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3568Open in IMG/M
3300012929|Ga0137404_10034933All Organisms → cellular organisms → Bacteria3751Open in IMG/M
3300012930|Ga0137407_10028532All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus4315Open in IMG/M
3300012971|Ga0126369_10524569All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1246Open in IMG/M
3300013296|Ga0157374_10195714All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1978Open in IMG/M
3300013306|Ga0163162_10112502All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2821Open in IMG/M
3300015241|Ga0137418_10058001All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3548Open in IMG/M
3300015241|Ga0137418_10089850All Organisms → cellular organisms → Bacteria2776Open in IMG/M
3300015242|Ga0137412_10002912All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus13619Open in IMG/M
3300015264|Ga0137403_10083698All Organisms → cellular organisms → Bacteria3220Open in IMG/M
3300015371|Ga0132258_10042076All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus10382Open in IMG/M
3300015371|Ga0132258_10935733All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2189Open in IMG/M
3300015373|Ga0132257_100160046All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2644Open in IMG/M
3300018027|Ga0184605_10010094All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus3584Open in IMG/M
3300018028|Ga0184608_10021784All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2334Open in IMG/M
3300018028|Ga0184608_10028009All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2102Open in IMG/M
3300018071|Ga0184618_10011298All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus2708Open in IMG/M
3300018071|Ga0184618_10030961All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1839Open in IMG/M
3300018072|Ga0184635_10000478All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus10180Open in IMG/M
3300018075|Ga0184632_10027252All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2434Open in IMG/M
3300018081|Ga0184625_10014050All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus3732Open in IMG/M
3300019356|Ga0173481_10100701All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1110Open in IMG/M
3300019868|Ga0193720_1001383All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus3173Open in IMG/M
3300019875|Ga0193701_1002337All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3380Open in IMG/M
3300019875|Ga0193701_1015662All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1530Open in IMG/M
3300019877|Ga0193722_1009827All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2430Open in IMG/M
3300019878|Ga0193715_1000103All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus22363Open in IMG/M
3300019879|Ga0193723_1000023All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus41828Open in IMG/M
3300019881|Ga0193707_1004083All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus5205Open in IMG/M
3300019881|Ga0193707_1054178All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1272Open in IMG/M
3300019883|Ga0193725_1005678All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus3606Open in IMG/M
3300019885|Ga0193747_1002755All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus4411Open in IMG/M
3300019887|Ga0193729_1017370All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus3153Open in IMG/M
3300019887|Ga0193729_1048819All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1724Open in IMG/M
3300019888|Ga0193751_1023594All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2990Open in IMG/M
3300019890|Ga0193728_1059841All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1834Open in IMG/M
3300020001|Ga0193731_1037975All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1272Open in IMG/M
3300020004|Ga0193755_1009175All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus3230Open in IMG/M
3300020022|Ga0193733_1001053All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus8551Open in IMG/M
3300021080|Ga0210382_10046765All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1690Open in IMG/M
3300021178|Ga0210408_10080814All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2548Open in IMG/M
3300021344|Ga0193719_10051082All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1790Open in IMG/M
3300021411|Ga0193709_1022828All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1523Open in IMG/M
3300022756|Ga0222622_10219523All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1272Open in IMG/M
3300025315|Ga0207697_10002974All Organisms → cellular organisms → Bacteria8555Open in IMG/M
3300025915|Ga0207693_10225609All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1471Open in IMG/M
3300025922|Ga0207646_10064883All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3260Open in IMG/M
3300025930|Ga0207701_10000687All Organisms → cellular organisms → Bacteria35358Open in IMG/M
3300026324|Ga0209470_1065093All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1709Open in IMG/M
3300026508|Ga0257161_1017517All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1352Open in IMG/M
3300027717|Ga0209998_10024331All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1313Open in IMG/M
3300028536|Ga0137415_10154526All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2130Open in IMG/M
3300028784|Ga0307282_10097931All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1358Open in IMG/M
3300028819|Ga0307296_10123477All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1393Open in IMG/M
3300028828|Ga0307312_10007697All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales5870Open in IMG/M
3300028828|Ga0307312_10150779All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1475Open in IMG/M
3300028875|Ga0307289_10087748All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1264Open in IMG/M
3300028885|Ga0307304_10107091All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1124Open in IMG/M
3300031231|Ga0170824_104762134All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1801Open in IMG/M
3300031446|Ga0170820_11154048All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1247Open in IMG/M
3300031474|Ga0170818_108707903All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1824Open in IMG/M
3300032421|Ga0310812_10062785All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1459Open in IMG/M
3300033412|Ga0310810_10054756All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia4852Open in IMG/M
3300033412|Ga0310810_10506487All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1198Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil24.17%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil22.50%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment6.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil5.83%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil5.83%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil5.00%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere5.00%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil2.50%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.50%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere2.50%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.67%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.67%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.67%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.67%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.67%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.83%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.83%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere0.83%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere0.83%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.83%
Corn, Switchgrass And Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere0.83%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.83%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.83%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.83%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
2124908045Soil microbial communities from Great Prairies - Kansas assembly 1 01_01_2011EnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000596Amended soil microbial communities from Kansas Great Prairies, USA - Total DNA no BrdU F1.4TCEnvironmentalOpen in IMG/M
3300000709Amended soil microbial communities from Kansas Great Prairies, USA - Total DNA F1.4 TB amended with BrdU and acetate no abondanceEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002899Soil microbial communities from Manhattan, Kansas, USA - Combined assembly of Kansas soil 100-500um Nextera (ASSEMBLY_DATE=20140607)EnvironmentalOpen in IMG/M
3300003911Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005293Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1Host-AssociatedOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300005983Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S2T1R1Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018072Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b1EnvironmentalOpen in IMG/M
3300019356Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S073-202C-2 (version 2)EnvironmentalOpen in IMG/M
3300019868Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2s1EnvironmentalOpen in IMG/M
3300019875Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3s2EnvironmentalOpen in IMG/M
3300019877Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m1EnvironmentalOpen in IMG/M
3300019878Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2m2EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2EnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1m2EnvironmentalOpen in IMG/M
3300019887Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c2EnvironmentalOpen in IMG/M
3300019888Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c2EnvironmentalOpen in IMG/M
3300019890Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c1EnvironmentalOpen in IMG/M
3300020001Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300021411Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3c2EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025315Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes)Host-AssociatedOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025930Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026508Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-AEnvironmentalOpen in IMG/M
3300027717Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PM (SPAdes)Host-AssociatedOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028784Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028875Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_143EnvironmentalOpen in IMG/M
3300028885Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031474Fir Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300032421Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_011455802088090014SoilMPALALHSFSKPIEIPVGLRNQPLDAWQTSARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASRNGHRRDYIDASHDFTATDSEGATAKIRKESFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYXXCGWGTISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSATCPLTPQLLEKWVGSPAAFSSRPATGDCFNSFRLSTEAHVRLIAALDKSIPCWPETNHKLPRSYDPVGQFDLTLARIVREAGGQITVAEAYRRFSHPGGRHCRRLTVETFLQMLCSVECTVVEVKNPEVPIVSLRSSNGGVLFRDVPRQHGKSSTAENVHSNSPAIQFFKNSFCESQAVGRH
GPIPI_030480102088090014SoilMKAARQPIDPGCVRDWPYTGTAHLKTAVPQLGLHSFSKAIEIPASLREQPLEIWQTSARLTAVLNRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLSRRARFQNGKASRSHRRGGAHASHGSGAAGTEAATAAMQENGANFAIPESVCHLSFHELPITTRLANIVRSIDARRLGDLHGRNAFELLQHKACGWRTISEIQQLIERAVSGEFDLGQIKETTAAAELLSLLEQSLAKLPLRDRQFLLARIGAEMGRAGGPRADLLYLSYAEIGRRYGLTRARVQKVFANTLDSLRKIWGPRVPRLLEVIKWRCLSMICPLTPELLEKWINVPAASPARRATRGRFSSFRLSTKAYLRLIAALDETIPCWLERHHKPQRIDDFVRQFDLALAVVVREGGRHITVAEAYRRLSHPAGRDYRRLPIENFLQMLRSVECTVIEFKDPQSPIIRLRPANAGAFLCEVPSQNGKSSTARKIHSNSPTIQLFGITSMFGEHAGIGPR
KansclcFeb2_133987302124908045SoilMPALALHSFSKAIEIPAGLHNQPLDGWQTSARLSTLLSRFGIRVLGDLHGRKVVDFAWEKNCGSKTLYELDLLTRRARFRNGKASCKNHLRECVYATPDFTETGSEGATAKMQEDAASFAIPESICHLSFDELPITRRLANVVRSIGARSLGDLNGRNAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATAAAELLSLLEQGLAKLPLRDRQFVLARIGAQTGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDSLRKIWGPKVPRLLEVIKWRCLSTICPLTPQLLEKWVDSRAASPPRPTTRDYFSTFQLSSKAHVRLIMALDKSIPCWLGTNHKPRRIDGSVGQFDLALARVVRGAGGHISVVEAYRRLSHPTERNCRRLTVENFLRMLRSVERTVIEFKDPQSPIIRLRPSNAEVFLRDVPTQNGKSSTARKVHSNLSAIQ
INPhiseqgaiiFebDRAFT_10084134513300000364SoilYELDLLARRAQFQNGKASCSHGGWANAARSADAAGSEAAVAEMQEDAGTFVIPESVRHLSFDELPLTTRLANVARSISARSLGDLNGRNTFELLQYKSCGWCTISEIQQLIERAVSGEFDVGQIEEATAATELLSLLEQGLTKLPLRDRQFVLARIGGEMGSARNPRGDRLCLSYAEIGRPYGLTRARVHMVFANTLVTLRKTWGPRVPRLLEVIKWRCLSMICPLTSQLLEKWIDSRAACSGSRATRDHFTSFRLSMEAHVRLIAAMDKSIPCWLEINHEPRRIDDSAGQFDLALAHVVREAGGHITVAQAYRQLSHPGGREYRRPRIENFLRMLLSVDFTVIEFKDPQFPIVRLRASNGENLRAQIRSDQPSAPGKTSPEVIPFFSAKRIFREHAAVGPC*
KanNP_Total_noBrdU_T14TCDRAFT_100353223300000596SoilVPALALHSFSRAIEIPASLRDQPLDVWETSARLTAVLDRFGIRALGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNRKASCNXHRRGFVHTSHTPSHSFSATGSQRAMAKMQEDASSFAIPESICHLSFSELPITTRLANVFRAIGARSLGDLNGRSAFEMLQYKACGWGTISEIQHLIERAFSGEFDLEQIEEAAAAGELLSLLEQGLAKLPLRDREFVLARIGAQTGSGCSRDADLLCLSYAEIGRRYGLTRARVHKVFANSLNTLRKNWGPRVPRLLEVIKWRCLSMICPLTPQLLGKRIDSPAASPPRLTTRDYFSTFQLSMEAHVRLIAALDKNIPCWPETNHKPRCIGDSVAQFDLALAHVVREARGHISVVEAYRQLSHPTERNCRKLTVENFLRMLRSVERTVIEFKNPHSPIIRLRPSNAGVFLRDVPTQNGKSSTARKVHSNLSAIQFFGTKNGSERHAVARR*
KanNP_Total_F14TBDRAFT_100019823300000709SoilVPALALHSFSRAIEIPASLRDQPLDVWETSARLTAVLDRFGIRALGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNRKASCNGHRRGFVHTSHTPSHSFSATGSQRAMAKMQEDASSFAIPESICHLSFSELPITTRLANVFRAIGARSLGDLNGRSAFEVLQYKACGWGTISEIQHLIERAFSGEFDLEQIEEAAAAGELLSLLEQGLAKLPLRDREFVLARIGAQTGSGCSRDADLLCLSYAEIGRRYGLTRARVHKVFANSLNTLRKNWGPRVPRLLEVIKWRCLSMICPLTPQLLGKWIDSPAASPPRPTTRDYFSTFQLSMEAHVRLIAALDKNIPCWPETNHKPRCIGDSVAQFDLALAHVVREARGHISVVEAYRQLSHPTERNCRKLTVENFLRMLRSVERTVIEFKNPHSPIIRLRPSNAGVFLRDVPTQNGKSSTARKVHSNLSAIQFFGTKNGSERHAVARR*
JGI1027J12803_10016882223300000955SoilMKALRQTVAENPSGTGCMRETAHLKIAVPQLAVHSFSKAIEIPVSLRDQPLDAWQTSARLTALLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQFQNGKASCSHGGWANAARSADAAGSEAAVAEMQEDAGTFVIPESVRHLSFDELPLTTRLANVARSISARTLGDLNGRNTFELLQYKSCGWCTISEIQQLIERAVSGEFDVGQIEEATAATELLSLLEQGLTKLPLRDRQFVLARIGGEMGSARNPRGDRLCLSYAEIGRPYGLTRARVHMVFANTLVTLRKTWGPRVPRLLEVIKWRCLSMICPLTSQLLEKWIDSRAACSGSRATRDHFTSFRLSMEAHVRLIAAMDKSIPCWLEINHEPRRIDDSAGQFDLALAHVVREAGGHITVAQAYRQLSHPGGREYRRPRIENFLRMLLSVDFTVIEFKDPQFPIVRLRASNGENLRAQIRSDQPSAPGKTSPEVIPFFSAKRIFREHAAVGPC*
JGIcombinedJ43975_1000085233300002899SoilVPALALHSFSRAIEIPASLRDQSLDVWETSARLTAVLDRFGIRALGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNRKASCNSHRRGCVHTSHTPSHSFSATGTQRAMAKMQEDASSFAIPESICHLSFSELPITTRLANVFRAIGARXLGDLNGRSAFEMLQYKACGWGTISEIQXLIERAFSGEFXLEQIEEAAAAGELLSLLEQGXAKLPLRDREFVLARIGAQTGSGCSRDADLLCLSYAEIGRRYGLTRARVHKVFANSLNTLRKNWGPRVPRLLEVIKWRCLSMICPLTPQLLGKWIDSPAASPPRLTTRDYFSTFQLSMEAHVRLIAALDKNIPCWPETNHKPRCIGDSVAQFDLALAHVVREARGHISVVEAYRQLSHPTERNCRKLTVENFLRMLRSVERTVIEFKNPHSPIIRLRPSNAGVFLRDVPTQNGKSSTARKIHSNLSAIQFFGTKNGSERHAVARR*
JGIcombinedJ43975_1000400913300002899SoilSNENGMPDRRHEYVRPRPFTAKSNLSTAMPTLALHSFSKAIDIPSKLHNQPLNEWQTSARLTTLLSRFGIRVLGXLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASGNGHRGHRVHPSHDFSVTDSGEANAKVKEDAASFAIPESICHLALNELPLTTRLANVVRSMGARSLGDLNRRSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEANAATELLSLLEQALAKLPLRDRQFVLARIGAQPGSSCSPGVDLFCLSYAEIGRRYGLTRARVHKVLENTLDSLRKIWGPRVPRLLETMKWRCLSTICPLTPQLLGKWVGSCPAFASQPTTRDCLNTLRLSMEAHVRLIAALDKNIPCWPETNHKLRRIYDPVGQFDLTLARVVREAGGQITVAEAYHRFSHPGRRQYRRVTVEKFLQMLRSVEYTVVDFKDPEVPIVSLRPSNGGNLFRDVPGKNGKSSTAGRVPSNLAAIQFFGTKNGFCERQAVFPLKQGRDLKHDPRPKTN*
JGI25405J52794_1000054223300003911Tabebuia Heterophylla RhizosphereMPALALHSFSKAIEIPPRLHNQPLDVWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARIRNGTSLRRNGVHASRNSPATDHEGAISMMREDAASFAIPESICHLAFRELPISTRLANVVRLMGARNLGDLNGRSAFELLQYKACGWSTISEIQQLVERAISGEFDVVQIKEAMAAAELLTLLVQGMAKLPLRERQFLLARIGAHTGRGRGSGADPVWLSYEEIGRRYGLTRARIHKVLEHTLDSLRKTWGPRVPRLLETLKWRCLSAICPLTPQLLEKWIDSPAAFSSRRTRVDHFNSFQLSMEAYVRLIAALDKSIPCWPETNHKPRRTYDPSGQFDLTLARVVAEAGGKMTVAEAYRRFSNPAKRNYRRLTVEMFLQMLRSVELTVIDFKDPEVPVMSLNPSNSERLFRNVPEQDGKSSIGRKSHSNSPTIQLFAAKNGFCERRAAAVR*
JGI25405J52794_1000547423300003911Tabebuia Heterophylla RhizosphereVKRRPERFWNSSQTRTNDLEIAVPALALHSFSQAIEIPTSLRDQPLDEWQTSARLTALLSRSGIRVLGXLHGRKVVDFVWEKNCGPKTLYELDLLARRARFRNGKTSCNGHRRGHAHPSHDVAATGGERAAAGMQEDGASFAIPESICHLSFNELPITRRLANVVRTMGARRLGDLNGRGCFELLQYKQCGWSTISEIQQLIERAVSGEFDITQIEEVNAPAELSSLLEQALAKLPLRERQFVLARIGAQPGNSRSPGADLLCPSYAEIGRRYGLTRARVHKVFVNTLDSLRKTWGPRIPRLLEVIKWRCLSTISPLTPQLLERWINSAEVSARRVMTRDNFSGCCRLSAQARVRLIAALDKNIPCWPETHHNRRWIDGSVDQFDLTLARIVRVAGGHMTVAEAYRQLLRWGGRDYRRLTVEXFLRMLRNVKCTTVEFKDSEIPIVGLSPLNGGVFLRMVPAKTANHQALAKFNLLRQQFSSLG*
Ga0062593_10021653313300004114SoilSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASRNGHRRDYIDASHDFTATDSEGATAKIRKESFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWGTISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSATCPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSTEAHVRLIAALDKSIPCWPETNHKLPRSYDPVGQFDLTLARIVREAGGQITVAEAYRRFSHPAGRHCRRLTVETFLQMLCSVECTVVEVKNPEVPIVSLRSSNGGVLFRDVPRQHGKSSTAENVHSNSPAIQFFKNSFCESQAVGRH*
Ga0062591_10013132813300004643SoilMPALALHSFSKAIEIPASLRNQPLDAWQTSARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASRNGHRRDYIDASHDFTATDSEGATAKIRKESFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWGTISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSATCPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSTEAHVRLIAALDKSIPCWPETNHKLPRSYDPVGQFDLTLARIVREAGGQITVAEAYRRFSHPAGRHCRRLTVETFLQMLCSVECTVVEVKNPEVPIVSLRSSNGGVLFRDVPRQHGKSSTAENVHSNSPAIQFFKNSFCESQAVGRH*
Ga0066676_1011544513300005186SoilMPALALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGSKTLYELDLLTRRARFRNGKASCNGHLRECVYASPDFTATDSEGATAKMQENAASFAIPESICHLAFNELPITTRLANVVRSIGARSLGDLNRRSAFELLQYRACGWGTISEIQQLIERGVSGEFDVAQIEEAAAASELLSLVEQGLAKLPLRDRQFVLARIGAQTGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDSLRKSWGPRVPRLLEVIKWRCVSAIFPLTPQLLEKWVGSRAAFSSRPTTGNYLNSVRLSMEAHVRLIVALDKNIPCWPETNHKLRRIDEPVGRFDLTLAHVVREAGGQLTVAEAYRRLSHPRKRDYCRLTVEKFLQMLPSVECTVVEFKDPEVPIVSLHPSDRRVLFRHVPGQNGKSSIARKVRSNLPAIQFFKNGFCERQAIGRR*
Ga0065715_1014698413300005293Miscanthus RhizosphereMPALALHSFSRAIEIPERLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHQSECVHASDDFIATDSERATAKMQEDAASFAIPESICHLAFNELPLTTRLANVVRSMGARSLDDLNGRSAFELLQYRACGWGTISEIQELIERAVSGEFDVAPIEEAAAAAELLSLLEQGLAMLPLRDRQFVLARIGAQTGSGRRPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKWRCVSAICPLTHQLLEKWVGSRAAFSSRPTSGEYLNSVRLSMEAQVRLIAALDKNIPCWPETNHKLQRIGEPAGQFDLTLAHVVREAGGQITVAEAYGRLSHPGRRDYRRLTVERFLQMLRTAECTVVEVKNPEVPVIRLRSSDGGVLSRHVPGENGKSSTAGKVHSNLAAIQFFKNGFCERQAV*
Ga0066388_100000292123300005332Tropical Forest SoilMPALALHSFSKAIEIPTRLHNQPLDTWQTSARLTTLLDRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNAKASGNGHSRDRVHASHHLIATGSKRSAAKIQEDAASFAIPESIYHLSFSELPITMRLANVVRSIGARRLGDLNGRSAFELLQRRACGWRTISEIQKLIERAVSGEFDVGQIEETKAAAELSSLLEQGLLKLSLRDRQFVLARIGAEIGSGRSPGADLLYLSFAEIGRRYGLTRACVHKVFANSLDTLRKIWGPRIPRLLEVIKWRCLSTICPLTPQLLRMWVDSAAASSSQAMRRDNLSSNFRLSIEAHVRLIAALDKSIPYWLSANHRPRRIDDRASQFDLALAHTVREAGGQIAVAEAYRELSHPAKPNCPQLTIENFLQMLRSVETTVVDFEDPEVPMISLRSSNGEVFHRDFPSQNGRSTRARKVDSNLPAIQLFGSKNGFCEGRAIGRR*
Ga0066388_10000831563300005332Tropical Forest SoilVPALALHSFSKSIEIPANLRRQPLDACQTSARLGALLERSGIHVLGDLHGRKVVDFAWERNCGPKTLYELDLLTRRARFQNGKASGNPRPRACVDASQRSSRSFSATGTEVATANVQEDDGSFAIPESIRWLSFNELPLTTRLTNVVRSIGAQSLGDLNGRNAFELLQYKACGWGTISEIQHLIERAVSGEFDVGQIEETTAAAELLNLLEQGLEKLPLRDTEFILARIGAPIGSARSPRTYPICPTYAEIGRRYGLTRARVHKVFGKTLDSLRKIWGPRVPRLLEVVKWRCLLSICPLTPQLLEKWVDIAGAFSSRQATRDCFNGFRLSMEAHVRLIAALDKNIPCWPEPKYRPPHFGDSVRQFDLILAHVVREASGRITVAEAYRKLSHRGGRNHRGLPIESFLQMLRSVERTVVEFKDPEIPILRLSPLNAAVLLNEVPSQNGKPSTAGKIHSSSRAIQFFEPKTALCGRHAVSRR*
Ga0066388_10033585223300005332Tropical Forest SoilVPTLALHSFSQAIEIPVSLRDQPVDMLHTSARLTSVLGRYGIRLLGDLHGRKVVDFAWEKNCGAKTLYELDLLARRARFQNGCASAKPCCSAESEAAAQGAQWDAGSFAIPDSVGHLSFGELPLTTRLANVVRSIGAQSLGDLNGCNAFELLQYRACGWGMISEIQQLIERAVSGEFNVGPIEQKTVAAELVSLLEQGLAKLPLRDTQFVLARIGGATTSARSPSPNSVCLSHAEIGRRYGLTRARIHKVFVNTLDSLRKIWGPRVPRLLEVIKRRCLSTICPLTPELLGKWVGLSAISSSRPATRDSLNSARLSMAAHVRLIAGLDKSIPCWPEMNRRPRGTHAPGRFDLALAYVVREAGGHITVAEAYRILSDRGGRHFRRLPVESFLQMLRSVECTAVEFNNPEIPIVRVRSLKAEASTGDSSNQNCKASPTGMFHSNSPVIELFRPKRSFCERHAVARR*
Ga0066388_10083694113300005332Tropical Forest SoilMPALALHSFSKAIEIPASLRDQPLDAWQTSARLTAVLERSGIHVLGDLHGRKVVDFACEKNCGPKTLYEFDLLARRAQSRNGKASCNGHRHGRAHASHSFNATSSERGTAKMQEDAASFVIPESICHLVLNELPLTTRLANVVRSMGARRLGDLDGRSAFELLQYKACGWGTISEIQQLIERAVSGEFDVGQIEQATAAAKLLRLLEQGLATLPLRDRQFVLARIGAQIGGCRSPGADLFCVTYAEIGQRYGLTRARVHKVFANRIDSLRKVWGPRVPRLLEAIKWRCLSEICPLTPQLLEKWVGSTPSFSSRPTTRDYLNGFQLSMEAHVRLIAALDKSIPCWPDTNHKLRRVDDPVGRFDLGLAHVVREADGQITVAEAYRRLSHPGRHDYRRLTVEKFLQMLPGVQCTVVEFKDPQSPIVRLHPSIPAVFPRRVRSENGKPSTARKVHSNLSAIQFFKNGSCEGHAVERR*
Ga0070698_100001580203300005471Corn, Switchgrass And Miscanthus RhizosphereMPALALHSFSRAIEIPERLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHQSECVHASDDFIATDSERATAKMQEDAASFAIPESICHLAFNELPLTTRLANVVRSMGARSLDDLNGRSAFELLQYRACGWGTISEIQELIERAVSGEFDVAPIKEAAAAAELLSLLEQGLAMLPLRDRQFVLARIGAQTGSGRRPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKWRCVSAICPLTHQLLEKWVGSRAAFSSRPTSGEYLNSVRLSMEAQVRLIAALDKNIPCWPETNHKLQRIGEPAGQFDLTLAHVVREAGGQITVAEAYGRLSHPGRRDYRRLTVERFLQMLRTAECTVVEVKNPEVPVIRLRSSDGGVLSRHVPGENGKSSTAGKVHSNLAAIQFFKNGFCERQAV*
Ga0066903_10013041313300005764Tropical Forest SoilMPALALHSFAKAIQIPAGLRDQPLGAWQTSARLSALLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQFRNGQASGNGRPRGSAQPSHTPLRSLGRTGSGVAAEKFQKDGASFAIPDSVRHLSLQELPITTRLANVVRSIGARTLGDLDGRNAFELLQYKACGWRTISEIQQLIDRAVCGEFDLGQIKETTAVAELLSLIEQGLAKLPHRDRQFVLAKIGGETGGTRSPGANLLCLSYAEIGRRYGLTRARVHKVFGNTLDTLKKAWGPRVPRLLEVIRWRCLSKICPLTPQLLEKWADRPGNSSSRSMTRAFLANVRLSMEAHVRLIAALDKSIPCWPTTNHTLRRVDESIWQFDLALTNVVRLAGGRIQVAEAYRKVSHPGRDGQQPTIQNFLSMLRRAECTVIEFKDPETPIVRLRPSKPVALPNTAPSRNGKSSVARKIHSDSSAIRLFEPKTAFFERHAVSRR*
Ga0066903_10035540023300005764Tropical Forest SoilMPALALHSFSQAIEIPTSLRDQPLDAWPTSARLTTLLDRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARLRNGEACGVQREGRAHVSHNFIAPGDERTTVKMQEDAANFAIPESVCHLSFKELPITKRLANVVRSIGAGRLGDLNGRSAFELLQCKACGWRTISQIQQLIERAASGEFDVRQIEETDAAAELLRLLEQGLAKLPLRDRELVRARIGGQIRNGRSPGADLLYLSYAEIGRRHGLTRARVHKVFANTVDTLRKIWGPRIPRLLEIMKWRCLSTICPLTPQLLGRWIDLAATCSQQATTRNNLRSNFRLSNEAHVRLIAALDKGIPCWLQTNQKPRCMQDSFSQFDLALAYVVREGGGQLTIAEAYHQLTCWVGRDYRQLTIENFLRMLRSVECSIIEFKNPQSPTIRVNELNVGVLLGEGPTKDGKPLNGLKVHSNLSATQFFSTKNGSCERRAVAAR*
Ga0066903_10077288913300005764Tropical Forest SoilMPALALHSFSKAIEIPASLRDQPLDAWQTSARLTAVLERSGIHVLGDLHGRKVVDFACEKNCGPKTLYEFDLLARRAQSRTGKASCNGHRHDSVHASHSFSATGSERGTAKMQEDAASFVIPESIYHLVLNELPLTTRLANVVRSMGARRLGDLDGRSAFELLQYKACGWGTISEIQQLIERAVSGEFDVGQIEQATAAAKLLRLLEQGLATLPLRDRQFVLARIGAQIGGCRSPGADLFCVTYAEIGQRYGLTRARVHKVFANRIDSLRKVWGPRVPRLLEAIKWRCLSEICPLTPQLLEKWVGSTPSFSSRPTARDYLNGFQLSMEAHVRLIAALDKSIPCWPDTNHKLRRVDDPVGRFDLGLAHVVREANGQITVAEAYRRLSHPGRHDYRRLTVEKFLQMLPGVQCTVVEFKDPQSPIVRLHPSIPAVFPRRVRSENGKPSTARVHSNLSAIQFFKNGSCEGHAVERR*
Ga0081455_1000056613300005937Tabebuia Heterophylla RhizosphereVPALALHSFSQAIEIPTSLRDQPLDEWQTSARLTALLSRSGIRVLGDLHGRKVVDFVWEKNCGPKTLYELDLLARRARFRNGKTSCNGHRRGHAHPSHDVAATGGERAAAGMQEDGASFAIPESICHLSFNELPITRRLANVVRTMGARRLGDLNGRGCFELLQYKQCGWSTISEIQQLIERAVSGEFDITQIEEVNAPAELSSLLEQALAKLPLRERQFVLARIGAQPGNSRSPGADLLCPSYAEIGRRYGLTRARVHKVFVNTLDSLRKTWGPRIPRLLEVIKWRCLSTISPLTPQLLERWINSAEVSARRVMTRDNFSGCCRLSAQARVRLIAALDKNIPCWPETHHNRRWIDGSVDQFDLTLARIVRVAGGHMTVAEAYRQLLRWGGR
Ga0081455_1000116823300005937Tabebuia Heterophylla RhizosphereVKSRPERFRNSSQTRTIELETNVPALALHSFSQAIEIPTSLRDRPLDEWQTSARLTALLSRSGIRVLGDLHGRKVVDFVWEKNCGPKTLYELDLLARRARVRNGKTSCDGHRRGRAHPSHNVTATDGERATTRMQEDAASFAVPESICHLSFNELPITRRLANVVRTMGARRLGDLNGRGCFELLQYNQCGWSTISEIQQLIERAVSGDFDIRQIEEVNAAAELLSLLEQALAKLPLRERQFVLARIGAQPGSSRSPGADLLCLSYAEIGRRYGLTRARVHKVLVNTLDSLKKTWGPRIPRLLELIKWRCLSTISPLTPQLLERWVNLAEVSARRVMTRDNFSGCCRLSSQAHVRLIAALDKNIPCWPETHHKPRWIDGSLDQFDLTLARIVRVAGGHMTVAEAYRQLLRWGGRDYRRLTVERFLRMLRNVKCTTVEFKDSEIPIVGLSPLNGGVFLRMVPAKTANHQALAKFNLLRQQFSSLG*
Ga0081455_10009495103300005937Tabebuia Heterophylla RhizosphereMPALALHSFSKAIEIPPRFHNQPLDEWQTSARLTTVLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARIRNGTMSRNGLRHNGVHASRNSTATDREGAISMMREDAASFAIPESICHLAFSELPVTTRLGNVVRSMGARNLGDLNGRSAFELLQHKACGWSTIGEIQQLIERAISGEFDVVQIEEAMAAAELLSLLEKGMAKLPLRERQFLLARIGAQTGKGLGSGADLLWLSYAEIGRRYGLTRARIHKVLVNTVDSLRKTWGPRVPRLLEILRWRCLSSICPLTPQLLEKWSDSPATFSSRRTKVDHFNSFQLSMKAYVRLIAALDKNIPCWPEPNYKLRRSYDPVGQFDLTLARVVCEAGGKITVAEAYRRFTNPAKRDYRRLTVEMFLQMLRSVELTVIDFKDPEVPVVSLNPSNSERFFRDVSEQNGKSSIGRKSHSSSSTIQLLPGKNGFCERRAVAGR*
Ga0081540_1001303193300005983Tabebuia Heterophylla RhizosphereMKAALQTVAPHASGRGRILEQPSKNAVPSLALHSFSKAIEIPAKLRDQPLDVWQTSARLTALLDRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASSNGHRNGHASHSVSVTGADLATARIQEDAATFAIPESIYHLALNELPLTTRLANVVRSMGARSLGDLNGRSAFELLQYKACGWGTISEIQQLIERAVSGEFDVGQIEQATAAAELLRLLEQGLPKLPLRDRQFVLARIGAQIRGRRSPGADLFCLTYAEIGQRYGLTRARVHKIFANTVESLRKVWGPRVPRLLEVIKWRCLSAVCPLTPQLLEKWVVSSAGFSSRPTTRDYLNGFRLSMEAHVRLIAALDKNIPCWPETNHKLRRIDDSVDRFDLALAHVVREAGGQISVAEAYRKLSHPGRREHRHLTVEKFLRMLRSVEYTVVDFKDPQSPIVRLPPSIAGVFPRTLPSENGKPSTARTIHSNSPAVRLFEPKTAFCERRAVAKR*
Ga0066709_10021617113300009137Grasslands SoilVPALALHSFSRAIEIPASLRDQPLDAWQTSARLSAVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASCGHYRRGRAHASHIPSRSFGGTGSEVATAKMQEDAGSFAIPESIRRLSFNELPITMRLANVICSIGARSLGDLNERNAFELLQYKACGWGTISEIQQLIERAISGEFNVGRIEQTTAPAELLSLVEQGLAMLPLRDRQFVLARIGAETASARSPRVDLLRPSYAEIGRQYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKWRCLWTICPLTPQLLSKWVDSPAAFSQRQTARDYFSTFRLPREAHVRLIAALDKTIPCWLETNHKPQRIDGSIEQFDLAVAQIAQRAGGH
Ga0066709_10090377213300009137Grasslands SoilSFSKAIEIPARLHNQPLDAWQPSARLTTLLSRFGIRVLGDLHGRKVVEFAWEKNCGAKTLYELDLLARRARFRNGEACCNGHRRDCVHASHDLTATDGEGTIAKTQEGAASFAIPESICRLAFNELPITTRLANVVRSIGARSLGDLNGRGAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATAAAELLSLLEQGLAKISLRDRQFVLARIGGEMGSLRSRRVDLLCLSYAEIGRRYGLTRARVQKVFANKLDSLRKIWGPRVPRLLEIIKWRCLSLVCPLTPELLQQWIGNSRGSFQLSTKAQVRLIAILDETIPCWLDKHDAAERINPSERDKLDLALAQLVCEAGGRIAVAEAYRKLNRQLPDHRRLTIGEFFRILRGVRCTIVDFKDPDIPVARLRAWNGENLRVHVRTDQSSAPRRA
Ga0105249_1026118713300009553Switchgrass RhizosphereIPVGLRNQPLDAWQTSARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGSGHRHDYIHASNGFTATDSEGPTAKIRKESFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLDLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIAALDKSIPCWPETNHKLPRSYDPVGQFDLTLARIVREAGGQITVAELIAGSHIPVDGIVAG*
Ga0126380_1002416923300010043Tropical Forest SoilLERSGIHVLGDLHGRKVVDFAWERNCGPKTLYELDLLTRRARFQNGKASGNPRPRACVDASQRSSRSFSATGTEVATANVQEDDGSFAIPESIRWLSFNELPLTTRLTNVVRSIGAQSLGDLNGRNAFELLQYKACGWGTISEIQHLIERAVSGEFDVGQIEETTAAAELLNLLEQGLEKLPLRDTEFILARIGAPIGSARSPRTYPICPTYAEIGRRYGLTRARVHKVFGKTLDSLRKIWGPRVPRLLEVVKWRCLLSICPLTPQLLEKWVDIAGAFSSRQATRDCFNGFRLSMEAHVRLIAALDKNIPCWPEPKYRPPHFGDSVRQFDLILAHVVREASGRITVAEAYRKLSHRGGRNHRRLPVESFLQMLRSVERTVVEFKDPEIPIVRLSPLNAAVLLNEVPNQNGKPSTAGKIHSSSRAIQFFEPKTALCGRHAVSRR*
Ga0126384_1020388213300010046Tropical Forest SoilLERSGIHVLGDLHGRKVVDFAWERNCGPKTLYELDLLTRRARFQNGKASGNPRPRACVDASQRSSRSFSATGTEVATANVQEDDGSFAIPESIRWLSFNELPLTTRLTNVVRSIGAQSLGDLNGRNAFELLQYKACGWGTISEIQHLIERAVSGEFDVGQIEETTAAAELLNLLEQGLEKLPLRDTEFILARIGAPIGSARSPRTYPICPTYAEIGRRYGLTRARVHKVFGKTLDSLRKIWGPRVPRLLEVVKWRCLLSICPLTPQLLEKWVDIAGAFSSRQATRDCFNGFRLSMEAHVRLIAALDKNIPCWPEPKYRPPHFGDSVRQFDLILAHVVREASGRITVAEAYRKLS
Ga0126382_1007526513300010047Tropical Forest SoilMTALTLHSFSKAIEIPAGLHNQPLDAWQTSARLTTLLDRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKESCNAHRRDRVHISHNFIATGCEKAPAKMQKDAASFAIPESICHLSFSELPITTRLTNVVRSISAWRLGDLNGSSAFELLQCKACGWRTISEIQQLIERAVSGEFDIGQVEETNAAAELLSLLERGLAKLPLRDREFVFARVGAEIGSGRSPGAGLLFLSYADIGRRYGLTRARVHKVFANSLDILTKIWGPRIPRLLEVIKWRCLSTICPLTPALLQRWVDSPAASSPRAMTGDNFSNSFRLSIEAHVRLIAALDKSIPCWLETSQKPRRIHDSFGQFDLALAHVVRGAGEPITIAEAYRRLSRPTKPNCPQLTIENFLRMLRSVERTVIEFKDPHSPMIQLNALNAGVFLGEASNQNGKSSTARKVHS*
Ga0126377_1012776013300010362Tropical Forest SoilMTALALHSFSKAIEIPAGLHNQPLDAWQTSARLTTLLDRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARLRNGKESCNAHRGDRVHASHNLIGTGRERAAAKMQQNAASFAIPESICNLSFNELPITTRLTNVVRSISARRLGDLNGRSAFELLQYKACGWRTISEIQQLIERAVSGEFDIGQVEETNAAAELLSLLQRGLAKLPLRDREFVLARIGAEIGSGRSPGADLLFLSYADIGRRYGLTRARVHKVFANSLDILTKIWGPRIPRLLEVIKWRCLSTICPLTPQLLQRWVDSAATSSPRAMTGDNFSSSFRLSIEAHVRLIAALDKSIPCWLETNQKPRRIRDSFGQFDLALAHVVRGAGEPITIAEAYRRLS
Ga0134125_1005148313300010371Terrestrial SoilINLGRAMPALAIHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASRNGHRHGRLHASHSFTATGSELATAKIQEDAATFAIPESICHLALNELPLTTRLANLVRFMGARSLGDFNGRSAFELLQYKACGWGTISEIQQLIERAVSGEFDIGQIEQGSAAAELLRLLEQGLPKLPLRDRQFVFARIGAQIGGGRSPGADLFCLTYAEIGQRYGLTRARVHKVFANTLDSLRKVWGPRIPRLLEVIKWRCLSEICPLTPPLLEKWIGSPAGFSSLPTTRDDLNGFRLSMEAYVRLIAALDENIPCWPETNRKLRRIDDSVGRFDLALAHVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLRMLRSVEYTVVEFKDPQSPIVRLPASIAGIFPGTVPSESGKPSPARKIHSNSPAIRLFEPNTAFCERRAVAKH*
Ga0134126_1004803253300010396Terrestrial SoilMKAALQTVVPHTSGRGRILEQPSKNAVPSLALYSFSKAIEIPAKLRDQPLDVWQTSARLTSLLDRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASRNGHRHGRLHASHSFTATGSELATAKIQEDAATFAIPESICHLALNELPLTTRLANLVRFMGARSLGDFNGRSAFELLQYKACGWGTISEIQQLIERAVSGEFDIGQIEQGSAAAELLRLLEQGLPKLPLRDRQFVFARIGAQIGGGRSPGADLFCLTYAEIGQRYGLTRARVHKVFANTLDSLRKVWGPRIPRLLEVIKWRCLSEICPLTPPLLEKWIGSPAGFSSLPTTRDDLNGFRLSMEAYVRLIAAQDENIPCWPETNRKLRRIDDSVGRFDLALARVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLRMLRSVEYTVVEFKDPQSPIVRLPASIAGIFPRTVPCENGKPSPAPKIHSNSPAIRLFEPSTAFGERRAVAKR*
Ga0126383_1030069913300010398Tropical Forest SoilLERSGIHVLGDLHGRKVVDFAWERNCGPKTLYELDLLTRRARFQNGKASGNPRPRACVDASQRSSRSFSATGTEVATANVQEDDGSFAIPESIRRLSFNELPLTTRLTNVVRSIGAQSLGDLNGRNAFELLQYKACGWGTISEIQHLIERAVSGEFDLGQIEETTAAAELLNLLEQGLEKLPLRDTEFILARIGAPIGSARSPRTYPICPTYAEIGRRYGLTRARVHKVFGKTLDSLRKIWGPRVPRLLEVVKWRCLLSICPLTPQLLEKWVDIAGAFSSRQATRDCFNGFRLSMEAHVRLIAALDKNIPCWPEPKYRPPHFGDSVRQFDLILAHVVREASGRITVAEAYRKLSHRGGRNHRGLPIESFLQMLRSVERTVVEFKDPEIPIVRLSPLNAAVLLNEVPNQNGKPSTAGKIHSSSRAIQFFEPKTALCGRHAVSRR*
Ga0137383_1004148833300012199Vadose Zone SoilMPALALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGSKTLYELDLLTRRARFRNGKASCNGHLRECVYASPDFTETDSEGATAKMQENAASFAIPESICHLAFNELPITTRLANVVRSIGARSLGDLNGRSAFELLQYRACGWGTISEIQQLIERGVSGEFDVAQIEEAAAAPELLSLLEQGLAKLPLRDRQFVLARIGAQTGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFGNTLDNLRKIWGPRVPRLLKAIKWRCVSAICPLTPQLLEKWVGSRAAFSSRQTTGDYLNSVRLSMEAHVRLIAALDKNIPCWPEMNHKLRRIDEPVGQFDLTLGHVVREAGGQITVAEAYRRLSHFGKRDYCRLTVEKFLQMLRSVECTVVEFKDPEVPIVSLHPSDRRVLFRHVPDQNGKSSIAWKVRSNLPAIQFFKNGFCERQAVGRR*
Ga0137382_1020777313300012200Vadose Zone SoilLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGETAKIHKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLSPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIAALDKSIPCWPETNHKFPRTDDPVGQFDLTLARIVREAGGKITVAEAYRRFSHPSGRHCPRLTVETFLQMLRSVECTVVEVKNPEVPIVSLRSSIGGVLFRDVPGQNGKSSTPRKDSSKLTAIQLFGPKNSFCERQAVRRR*
Ga0137365_1000768693300012201Vadose Zone SoilLDVWKTSARLTAVLDRFGIRLLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNGKASCNGHRRGCVHASHTPSHSFSATASERATAKMQEDVASFAIPESICHLSFDELPITRRLANVVRCIGAQSLGDLNGRNAFELLQYKACGWGMISELQQVIERAVSGEFDVGQTEETTAAAELLSLLEQGLTKLPLRNRQFVLARIGAEIGSARIPRADLLCLSYAEIGRRYGLTRARVHKVFANSLNTLRKNWGPRVPRLLEVIKWRCLSTICPMTPQLLEKWVDSCPASPPRPTTRDYFSTFQLSREAHVRLIMALDKSIPCWLGTNHKPRRIDASVGEFDLALARVVRGAGGHISVVEAYRRL*
Ga0137365_1001595513300012201Vadose Zone SoilMPALALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGSKTLYELDLLARRARFRNGKASCNGHRRACVHASHDFTATEATAKMQENAASFAIPESIGHLAFNELPITTRLANVVRSMGARSLGDLNGRSAFELLQYRACGWSTISEIQHLIERAVSGEFDVAQIKEAAAAPELLTLLEQGLAKLPLRDRQFVLARIGAQTGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDSLRKSWGPRVPRLLEVIKWRCLSAIFPLTPQLLEKWVGSRAAFSSRPTTGDYLNSVRLSMEAHVRLIVALDKNIPCWPETNHKLRRIDEPVGRFGLTLAHVAREAGGQITVAEAYRRLSHPGRQDYRWLTVEKFLQMLRSVECTVVE
Ga0137365_1014711913300012201Vadose Zone SoilMPALALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGSKTLYELDLLTRRARFRNGKASCNGHLRECVYASPDFTETDSEGATAKMQENAASFAIPESICHLAFNELPITTRLANVVRSIGARSLGDLNGRSAFELLQYRACGWGTISEIQQLIERGVSGEFDVAQIEEAAAAPELLSLLEQGLAKLPLRDRQFVLARIGAQTGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFGNTLDNLRKIWGPRVPRLLKAIKWRCVSAICPLTPQLLEKWVGSRAAFSSRQTTGDYLDSVRLSMEAHVRLIAALDKNIPCWPEMNHKLRRIDEPVGQFDLTLGHVVREAGGQITVAEAYRRLSHFGKRDYCRLTVEKFLQMLRSVECTVVEFKDPEVPIVSLHPSDRRVLFRHVPDQNGKSSIAWKVRSNLPAIQFFKNGF
Ga0137374_1021293013300012204Vadose Zone SoilMLNRFGVRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNGKASGNGHSGGRVHASHTPSHSFSAIGSERVTANMQQDAASFVIPESICHLSFSELPLTTRLANVVRSMGARSVGDLNGRSAFELLQYKACGWSTISEIQQLIERAVSGEFDVAQIKEATVAAELLNLLEQGLAKLPLRDRQFALARIGAQIGSGRSPGANLLCLSYAEIGRRYGLTRARVHKVFANSLDTLRKIWGPRVPRLLEVMKWRCLSMICPLTPQLLEKWIDSAAASSLRTTRRDYFSSFRLSSEAYVRLIAALDKTIPCLLGTNHKPRRSDGSVGQFDLALAHVVREAGGRISVVEAYRQLSHPEGRDHHWLTIENFLRMLPGVECTVIEFKDPEVPIVSLRAWNGENLRPRTRRDQSSATGETSLEAIPFFSA
Ga0137376_1026162113300012208Vadose Zone SoilTGKINLGTAMPASALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVEFAWEKNCGAKTLYELDLLARRARFRNGKASGNGHRGDCIPASHVFSATDSGRASARMQKDAAGFAVPESICHLALNELPLTTRLANVVRSIGARILGDLNGRSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATAAAELLSLLEQGLAKISLRDRQFVLARIGGEMGSARSRRVDLLCLSYAEIGRRYGLTRARVQKVFANKLDSLRKIWGPRVPRLLEIIKWRCLSLVCPLTPELLQQWIGNSRGSFQLSTKAQVRLIATLDETIPCWLDKHDAAERINPTERDNLDLALAHLVCEAGGRIAVAEAYRKLTRQLPDHRRLTIGEFFRILRGVRCTVVDFKDPDIPVARLRAWNGENLRVHVRTDQPSAPRRAANREAIPFFNTKNGFCKRKAGAVR*
Ga0137378_1001116753300012210Vadose Zone SoilMPALALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGSKTLYELDLLTRRARFRNGKASCNGHLRECVYASPDFTETDSEGATAKMQENAASFAIPESICHLAFNELPITTRLANVVRSIGARSLGDLNGRSAFELLQYRACGWGTISEIQQLIERGVSGEFDVAQIEEAAAAPELLSLLEQGLAKLPLRDRQFVLARIGAQTGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFGNTLDNLRKIWGPRVPRLLKAIKWRCVSAICPLTPQLLEKWVGSRAAFSSRQTTGDYLNSVRLSMEAHVRLIAALDKNIPCWPETNHKLRRIDEPVGRFDLTLGHVVREAGGQITVAEAYRRLSHPGKRDYCRLTVEKFLQMLRSVECTVVEFKDPEVPIVSLHPSDRRVLFRHVPDQNGKSSIAWKVRSNLPAIQFFKNGFCERQAVGRR*
Ga0137370_1011332513300012285Vadose Zone SoilLDVSQTSARLTALLSRFGIRVFGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKTFCVSYRRYGVDASHNFITTGWEMATAKVQEDAASFVIPESICHLSFNELPITTRLANVVRSIGARRLGDLNGRSPFELLQYRACGWGTISEIQRLIERAVSGEFAVAQIEATDAAAELVSLLEQGLATLPLRDRQFVLARIGAQIGSGRSPSAHRLSLSYAEIGQRYGLTRARVHKVFANSLDTLRKIWGPRIPRLLEAIKWRCLSTICPLTPRLLERWIDSGAGSSPRAMTRNNFSSNFRLSTEAHVRLMAALDKSIPCWPETHLKPRHIDDSFGQFDLALARIVREAGGYITVANA
Ga0137367_1008541113300012353Vadose Zone SoilMLNRFGVRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNGKASGNGHSGGRVHASHTPSHSFSAIGSERVTANMQQDAASFVIPESICHLSFSELPLTTRLANVVRSMGARSVGDLNGRSAFELLQYKACGWSTISEIQQLIERAVSGEFDVAQIKEATVAAELLNLLEQGLAKLPLRDRQFALARIGAQIGSGRSPGANLLCLSYAEIGRRYGLTRARVHKVFANSLDTLRKIWGPRVPRLLEVMKWRCLSMICPLTPQLLEKWIDSAAASSLRTTRRDYFSSFRLSSEAYVRLIAALDKTIPCLLGTNHKPRRSDGSVGQFDLALAHVVREAGGRISVVEAYRQLSHPEGRDHHWLTIENFLRMLPGVECTVIEFKDPQSPIVRLRAWNGENLRPRIRRDQSSATGETSLEAIPFFSVNTISANMPPSDRPEFAQSSRTSFPRK*
Ga0137369_1007330623300012355Vadose Zone SoilMLNRFGVRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNGKASGNGHSGGRVHASHTPSHSFSAIGSERVTANMQQDAASFVIPESICHLSFSELPLTTRLANVVRSMGARSVGDLNGRSAFELLQYKACGWSTISEIQQLIERAVSGEFDVAQIKEATVAAELLNLLEQGLAKLPLRDRQFALARIGAQIGSGRSPGANLLCLSYAEIGRRYGLTRARVHKVFANSLDTLRKIWGPRVPRLLEVMKWRCLSMICPLTPQLLEKWIDSAAASSLRTTRRDYFSSFRLSSEAYVRLIAALDKTIPCLLGTNHKPRRSDGSVGQFDLALAHVVREAGGRISVVEAYRQLSHPEGRDHHWLTIENFLRMLPGVECTVIEFKDPEVPIVSLRAWNGENLRPRIRRDQSSATGETSLEAIPFFSVNTISANMPPSDRPEFAQSSRTSFPRK*
Ga0137384_1000436113300012357Vadose Zone SoilMPALALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGSKTLYELDLLTRRARFRNGKASCNGHRRERVYASHDFTATDSEEATAKMQEDAASFAIPESICHLAFNELPITTRLANVVRSIGARSLGDLNGRSAFELLQYRACGWGTISEIQQLIERGVSGEFDVAQIEEAAAAPELLSLLEQGLAKLPLRDRQFVLARIGAQTGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFGNTLDNLRKIWGPRVPRLLKAIKWRCVSAICPLTPQLLEKWVGSRAAFSSRQTTGDYLNSVRLSMEAHVRLIAALDKNIPCWPETNHKLRRIDEPVGQFDLTLGHVVREAGGQITVAEAYRRLSHFGKRDYCRLTVEKFLQMLRSVECTVVEFKDPEVPIVSLHPSDRRVLFRHVPDQNGKSSIAWKVRSNLPAIQFFKNGF
Ga0137368_1010909113300012358Vadose Zone SoilMPALALHSFSKVIEIPTSLRDQPLDVWETSARLTALLGRFGIHVLGDLHGRKVIDFAWEKNCGPKTLYELDLLARRAQSRNGTASCNGHRRGCVHASHTSSHSFSATNSERATAKIQEDAASFAVPESVCHLSFNELPLTTRLANVVRAIGARSLGDLNERGAFELLQWKACGWGTISEIQQLIERAVSGEFDLGQIEETTAAAELLNLLEQGLAKLPLRDREFVLARIGAQIGSGCSPGADLLCPSYAEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVMKWRCLSMICPVTSQLLEKWIDSAAASSLRTTRRDYFSSFRLSSEAHVRLIAALDKSIPCWLGTNHKPRRIDGSVGQFDLALAHVVREAGGRISVVEAYHQLSYPAGRDHHWLTIETFLRMLPGVECTVIEFKDPQSPIV
Ga0137375_1002059153300012360Vadose Zone SoilMPALALHSFSKAIEIPARLHSQPLDAWPTSARLTTLLSRFGIRVLGDLHGRKIVDFAWEKNCGSKTLYELDLLARRARFRNVKVAGNGHRRDRVHASHGFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSMGARSLGDLNGRSAFELLQYKACGWGTISEVQQLIERAVSGEFDVEQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGTDLLCLSYAEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTACDYLNSFRLSMEAHVRLIAALDKNIPCWPETNHKLRRIDDRVGQFDLTLAHVVREAGGQITVAEAYHRLSHPGRRDYRRLTVEEFLQMLRSVECTVVEVKNPEVPVVTLRSSNGGILFRDVPGQNGKSPTAREVHSNLPAIQFFGTKNGFYERQAVGRR*
Ga0137375_1019205013300012360Vadose Zone SoilMPALALHSFSRAIEIPASLRNQPLDVWETSARLTAMLNRFGVRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNGKASGNGHSGGRVHASHTPSHSFSAIGSERVTANMQQDAASFVIPESICHLSFSELPLTTRLANVVRSMGARSVGDLNGRSAFELLQYKACGWSTISEIQQLIERAVSGEFDVAQIKEATVAAELLNLLEQGLAKLPLRDRQFALARIGAQIGSGRSPGANLLCLSYAEIGRRYGLTRARVHKVFANSLDTLRKIWGPRVPRLLEVMKWRCLSMICPLTPQLLEKWIDSAAASSLRTTRRDYFSSFRLSSEAYVRLIAALDKTIPCLLGTNHKPRRSDGSVGQFDLALAHVVREAGGRISVVEAYRQLSHPEGRDHHWLTIENFLRMLPGVECTVIEFKDPEVPIVSLRAWNGENLRPRIRRDQSSATGETSLEAIPFFSVNTISANMPPSDRPEFAQSSRTSFPRK*
Ga0137358_1008004023300012582Vadose Zone SoilMPALALHSFSKAIKIPASLRDQPLDTWQTSARLTAVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASCNGHRRECVHASHSFSATGSEWATAQMREDAASFAIPESICHLDFNELPITTRLANVVRSMGARSLGDLNGRSAFELLQYKACGWGTISEVQQLIERAVSGEFDVGQIEQATAAAELLRLLEQGLAKLPLRDRQFVLARIGAQIGGGRSPGADLFCLSYAEIGQRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVLKWRCVSAICPLTPQLLEKWVGSPAGFSSRRTTRDYLNGFQLSMEAHVRLIAALDKSIPCWPDRNHKLRRVDDPVCRFDLTLAHVVREAGGQITMAEAYRRLSHPGRRDYRRLTVEKFLQMLPSVECAVVEFEDPEVPIVSLRLSTGGALFGDVPSQNGKSSTHRKVHSNLSAIQFFKNGSCEREAVGRG*
Ga0137397_1008796923300012685Vadose Zone SoilVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASGNGHCHGRVHASHSFSATDSERATAKMQEDAASFAIPESICHLAFNELPITTRLANVVHSMGARSLGDLNGRSAFELLQYKACGWGTVSEVQQLIERAVSGEFDVGQIEQATAAPELLRLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLFCLSYAEIGQRYGLTRARVHKVFANTLDSLRKVWGPRVSRLLEVIKWRCVSAICPLTPQLLEKWVGSPAGFSSRPTTRDYLNGFRLSMEAHVRLIAALDKSIPCWPDTNHKLRRVDDPVGRFDLTLAHVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLPSVECTVVEFEDPEVPIVSLRLSTGGALFGDVPSQNGKSSTHRKVHSNLSAIQFFKNGSCERQAVGRG*
Ga0137394_1013029513300012922Vadose Zone SoilTLYEFDLLARRAQSRNGKASCNGHRLGRVHASHSFSATGSERATAPMQEDAASFAIPESICHLDFNELPITTRLANVVRSMGARTLGDLNGRSAFELLQYKACGWGTISEVQQLIERAVSGEFDVAQIEEAAAAAELLRLLEQGLAKLPLRDRQFVLARIGAQIGGGRSPGADLFCLTYAEIGQRYGLTRARIHKVFANTLDSLRKIWGPRVPRLLEVLKWRCVSAICPLTPQLLEKWVGSPAGFSSRRTTRDYLNGFRLSMEAHVRLIAALDKSIPCWPDTNHKLRRVDDPVGRFDLTLAHVVREACGQITMAEAYRRLSHPGRRDYRRLTVEKFLQMLPSVECTVVEFEDPEVPIVSLRLSTGGALFGDVPSQNGKSSTHRKVHSNLSAIQFFKNGSCERQAVGRG*
Ga0137413_1000213973300012924Vadose Zone SoilMPASALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVEFAWEKNCGAKTLYELDLLARRARFRNGEASCNGHRRDCVHASHDFSATDSEGTIAKTQEGAASFAIPESICHLAFNELPITTRLANVVRSMGARSLGDLNGRGAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATAAAELLTLLEQGLAKISLRDRQFVLARIGGEMGSASSGADILCLSYAEIGRQYGLTRARVHKVFADTLDNLRKIWGPRVPRLLEVIKWRCLSTICPLTPQLLENWVGPPAAFSVRPTTRDYLSSFRLSMEAHVRLIAALDKNIPCWPETNPKRPRIDDPVGQFDLTLAHVVREAGGQITVAEAYRRLSHPGRRDYRRVTVENFLQLLRSTDYTAIEFTDPQSPIIRLRAWNGENFRARIRSDQPSAPSRTDPATIPFFGSNSMCGKHAAIGPR*
Ga0137419_1002497553300012925Vadose Zone SoilVPALALHSFSKAIDIPASLRDQPLDASHTSARLAAVLDRFGIRMLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNGKASGNGRRRGCVHASHTPSHGFIATGNERATAKMQEDAASFAIPESVCHLSFHELPITTRLANVVRAIGARSLGDLNGRSAFEMLQYKACGWSTISEIQQLIERAVSGEFDVAQIEEATAAAELLSLLERGLAKLALRDRQFVLARIGGEMANARSWGADLLCPSYAEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKWRCLSTICPLTPQLLGKWVDSRAAFSSRPKTREFLNNVRLSMEAHVRLIAALDKNIPCWPETNHEPRHIDDSVHQFDLALAFLVREKGGQITIAEAYRKLSHPAGMDSRRLTIESFLRMLRPVEYTVVDFKDPEIPIA
Ga0137404_1003493323300012929Vadose Zone SoilMLDRTSNERDRSNSQPECFWVCPQTRKIDPRSAMPALALHSFSKAIEIPASLRNQPLDAWQTSARLSTVLERFGIHFLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGATAKIHKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARSLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIAALDKSIPCWPEANHKLPRTYDPAGQFDLTLARIVREAGGKITVAEAYRRFSHPGGRHCRRLTVETFLQMLRGVECTVVEVKNPEVPIVTLRSSNGGGLFRDVPRQNGKSSTPPKGPSSNLAALQFFGPKNSFCERQAVRRR*
Ga0137407_1002853223300012930Vadose Zone SoilMLDRTSNERDRSNSQPECFWVWPQARKINLRSAMPALALHSFSKAIEIPASLRNQPLDTWQTSARLSTVLERFGIHLLGDLHGRKVVEFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGATAKIHKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARSLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGPGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIAALDKSIPCWPEANHKLPRTYDPAGQFDLTLARIVREAGGKITVAEAYRRFSHPGGRHCRRLTVETFLQMLRGVECTVVEVKNPEVPIVTLRSSNGGGLFRDVPRQNGKSSTPPKGPSSNLAALQFFSPKNSFCERQAVRRR*
Ga0126369_1052456913300012971Tropical Forest SoilDFAWEKNCGPKTLYELDLLTRRARFQNGKASGNPRPRACVDASQRSSRSFSATGTEVATANVQEDDGSFAIPESIRWLSFNELPLTTRLTNVVRSIGAQSLGDLNGRNAFELLQYKACGWGTISEIQHLIERAVSGEFDVGQIEETTAAAELLNLLEQGLEKLPLRDTEFILARIGAPIGSARSPRTYPICPTYAEIGRRYGLTRARVHKVFGKTLDSLRKIWGPRVPRLLEVVKWRCLLSICPLTPQLLEKWVDIAGAFSSRQATRDCFNGFRLSMEAHVRLIAALDKNIHCWPEPKYRAPRFGDSVRQFDLILAHVVREASGRITVAEAYRKLSHRGGRNHRRLPIESFLQMLRSVERTVVEFKDPEIPILRLSPLNAAVLLNEVPSQNGKPSTAGKIHSSSRAIQFFEPKTA
Ga0157374_1019571423300013296Miscanthus RhizosphereMPALALHSFSKPIEIPVGLRNQPLDAWQTSARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASRNGHRRDYIDASHDFTATDSEGATAKIRKESFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWGTISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSATCPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIAALDKSIPCWPETNHKLPRTYDPVGQFDLTLARIVREAGGQITVAEAYRRFSHPAGRHCCRLTVETFLQMLRSVECTVVEIKNPEVPIVSLRSSNGGVLFRDVPRKNGKSSTPPKGPSNLDAIQFFGTKNSFCERQAVRRR*
Ga0163162_1011250223300013306Switchgrass RhizosphereMPALALHSFSKPIEIPVGLRNQPLDAWQTSARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASRNGHRRDYIDASHDFTATDSEGATAKIRKESFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWGTISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEALKWRCLSATCPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSTEAHVRLIAALDKSIPCWPETNHKLPRSYDPVGQFDLTLARIVREAGGQITVAEAYRRFSHPAGRHCRRLTVETFLQMLCSVECTVVEVKNPEVPIVSLRSSNGGVLFRDVPRQHGKSSTAENVHSNSPAIQFFKNSFCESQAVGRH*
Ga0137418_1005800143300015241Vadose Zone SoilVPALALHSFSKAIDIPASLRDQPLDASHTSARLAAVLDRFGIRMLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNGKASGNGRRRGCVHASHTPSHGFIATGNERATAKMQEDAASFAIPESVCHLSFHELPITTRLANVVRAIGARSLGDLNGRSAFEMLQYKACGWSTISEIQQLIERAVSGEFDVAQIEEATAAAELLSLLERGLAKLPLRDRQFVLARIGGEIASARSWGADLLCPSYAEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEGIKWRCLSTICPLTPQLLGKWVDSRAAFSSRPKTREFLNNVRLSMEAHVRLIVALDKNIPCWPETNHEPRHIDDSVHQFDLALAFLVREKGGQITIAEAYRKLSHPAGMDSRRLTIESFLRMLRPVEYTVVDFKDPEIPIARLRPLNGGVFFDGVPSQNGKPSTARKIHSNSPAIRLFGPKAVFCERHAIARR*
Ga0137418_1008985023300015241Vadose Zone SoilMPASALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVEFAWEKNCGAKTLYELDLLARRARFRNGKASCNGHRRECVHASHRFSATSSERATAQMQEDAAGFAIPESICHLDFNELPITTRLANVVRSMGARSLGDLHGRSAFELLQYKACGWGTISEVQQLIERAVSGEFDVAQIEEAAAAAELLRLLEQGLAKLPLRDRQFVLARIGAQIGGDRSPGADLFCLSYAEIGQRYGLTRARVHKVFANTLDSLRKVWGPRVPRLLEVIKWRCVSAICPLTPQLLEKWVGSRAAFSPRATTRDYLNSFRLSMEPHVRLIAALDKTIPCWPDRNHKLRRVDDPVCRFDLTLAHVVREACGQITMAEAYRRLSHPGRRDYRRLTVEKFLQMLPSVECAEVEFEDPEVPIVSLRLSTGGALFGDVPSQNGKSSTHRKVHSNLSAIQFFKNGSCERQAVGRG*
Ga0137412_1000291263300015242Vadose Zone SoilMPASALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVEFAWEKNCGAKTLYELDLLARRARFRNGEASCNGHRRDCVHASHDFSATDSEGTIAKTQEGAASFAIPESICHLAFNELPITTRLANVVRSMGARSLGDLNGRGAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATAAAELLSLLEQGLAKISLRDRQFVLARIGAQTGSASFGADILCLSYAEIGRRYGLTRARVHKVFANTLDNLRKIWGPRVPRLLEVIKWRCLSTICPLTPQLLENWVGPPAAFSVRPTTRDYLSSFRLSMEAHVRLIAALDKNIPCWPETNPKRPRIDDSVGQFDLTLAHVVREAGGQITVAEAYRRLSHPGRRDYRRVTVENFLQLLRSTDYTAIEFTDPQSPIIRLRAWNGENFRARIRSDQPSAPSRTDPATIPFFGSNSMCGKHAAIGPR*
Ga0137403_1008369823300015264Vadose Zone SoilMLDRTSNERDRSNSQPECFWVCPQTRKIDPRSAMPALALHSFSKAIEIPASLRNQPLDAWQTSARLSTVLERFGIHFLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGATAKIHKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARSLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIAALDKSIPCWPEANHKLPRTYDPVGQFDLTLARVVREAGGPITVAEAYRRFSHPGGRHCRRLTVETFLQMLRGVECTVVEVKNPEVPIVTLRSSNGGGLFRDVPRQNGKSSTPPKGPSSNLAALQFFGPKNSFCERQAVRRR*
Ga0132258_1004207643300015371Arabidopsis RhizosphereMKAALQTVVPHTSGRGRILEQPSKNAVPSLALYSFSKAIEIPAKLRDQPLDVWQTSARLTSLLDRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASRNGHRHGRLHASHSFTATGSELATAKIQEDAATFAIPESICHLALNELPLTTRLANLVRFMGARSLGDFNGRSAFELLQYKACGWGTISEIQQLIERAVSGEFDIGQIEQGSAAAELLRLLEQGLPKLPLRDRQFVFARIGAQIGGGRSPGADLFCLTYAEIGQRYGLTRARVHKVFANTLDSLRKVWGPRIPRLLEVIKWRCLSEICPLTPPLLEKWIGSPAGFSSLPTTRDDLNGFRLSMEAYVRLIAALDENIPCWPETNRKLRRIDDSVGRFDLALAQVVREAGGQITVAEAYRRLSHPGRRDYRRLTAEKFLRMLRSVEYTVVEFKDPQSPIVRLPASIAGIFPRTVPCENGKPSPAPKIHSNSPAIRLFEPSTAFGERRAVAKR*
Ga0132258_1093573323300015371Arabidopsis RhizosphereMPALALHSFSKPIEIPVGLRNQPLDAWQTSARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASRNGHRRDYIDASHDFTATDSEGATAKIRKESFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWGTISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTATGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSATCPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSTEAHVRLIAALDKSIPCWPETNHKLPRSYDPVGQFDLTLARIVREAGGQITVAEAYRRFSHPGGRHCRRLTVETFLQMLCSVECTVVEVKNPEVPIVSLRSSNGGVLFRDVPRQHGKSSTAENVHSNSPAIQFFKNSFCESQAVGRH*
Ga0132257_10016004623300015373Arabidopsis RhizosphereMKAALQTVVPHTSGRGRILEQPSKNAVPSLALYSFSKAIEIPAKLRDQPLDVWQTSARLTSLLDRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASRNGHRHGRLHASHSFTATGSELATAKIQEDAATFAIPESICHLALNELPLTTRLANLVRFMGARSLGDFNGRSAFELLQYKACGWGTISEIQQLIERAVSGEFDIGQIEQGSAAAELLRLLEQGLPKLPLRDRQFVFARIGAQIGGGRSPGADLFCLTYAEIGQRYGLTRARVHKVFANTLDSLRKVWGPRIPRLLEVIKWRCLSEICPLTPPLLEKWIGSPAGFSSLPTTRDDLNGFRLSMEAYVRLIAALDENIPCWPETNRKLRRIDDSVGRFDLALAQVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLRMLRSVEYTVVEFKDPQSPIVRLPASIAGIFPRTVPCENGKPSPAPKIHSNSPAIRLFEPSTAFGERRAVAKR*
Ga0184605_1001009423300018027Groundwater SedimentMPALALHSFSKAIEIPASLRNQPLDAWQTSARLSTVLERFGIHFLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGETAKIRKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLDGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMDVHVRLIAALDKSIPCWPETNHKLPRTDDAVGQFDLTLARVVREAGGQITVAEAYRRLSHPDGRHCRRLTVETFLQMLRSVEGTVVEVKNPEVPIVSLRSSNGGVLFRDVPGQNGKSSTPRKGSSNLAALQFFGTKNSFCEFQAVRRR
Ga0184608_1002178413300018028Groundwater SedimentMPALALHSFSKAIEIPARLHNQPLDGWPTSARLTTLLSRFGIRVLGDLHGRKIVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKIQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWLETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTVVEFKDPEVPIVSLRPSNGGVFFRDVAGQNGKPSTARKVH
Ga0184608_1002800923300018028Groundwater SedimentMPALALHSFSKAIEIPVGLRNQPLDAWQTSARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGETAKIHKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIAALDKSIPCWPETNHKSPRSYDPVGQFDLILARIVREAGGQITVAEAYRRFSHPGGR
Ga0184618_1001129823300018071Groundwater SedimentMLDRTSNENDRSNSQPECFRVCPQTRKINLRNAMPALALHSFSKAIEIPASLRNQPLDAWQTSARLSTVLERFGIHFLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGETAKIRKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMDVHVRLIAALDKSIPCWPETNHKLPRTDDAVGQFDLTLARVVREAGGQITVAEAYRRLSHPDGRHCRRLTVETFLQMLRSVEGTVVEVKNPEVPIVSLRSSNGGVLFRDVPGQNGKSSTPRKGSSNLAALQFFGTKNSFCEFQAVRRR
Ga0184618_1003096123300018071Groundwater SedimentMPASALHSFSKAIEIPARLHNQPLDAWPTSARLTTLLSRFGIRVLGDLHGRKIVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGRSAFELLQYKACGWGTISEVQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSMEAHVRLIAALDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFEDPEVAMVSLRPSNPGSLFRDVPGQNGKSSTAAKVHSNLPAIQFFGTKNAFCERHAVGRR
Ga0184635_1000047873300018072Groundwater SedimentMPALALHSFSKAIEIPAGLRNQPLDAWQTSARLSTVLERFGIHFLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFIVTDSEGATAKTPKDPTSFVIPESICYLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIAALDKSIPCWPETNHKSPRSYDPVGQFDLILARIVREAGGQITVAEAYRRFSHSGGRHCRRLTVETFLQMLRGVECTVVEVKNPEVPIVTLRSSNGRVLFRDVPGQNGKSSTVGKVHSNLRAIQFFKNGCCEHQAAGRR
Ga0184632_1002725213300018075Groundwater SedimentMPALALHSFSKAIEIPAGLRNQPLDAWQTSARLSTVLERFGIHFLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFIVTDSEGATAKTPKDPTSFVIPESICYLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIGALDKSIPCWPEANHKLPRTYDPAGQFDLTLARIVREAGGPITVAEAYRRFSHSGGRHCRRLTVVTFLQMLRGVECTVVEVKNPEVPIVTLRSSNGRVLFRDVPGQNGKSSTVGKVHSNLRAIQFFKNGCCEHQAAGRR
Ga0184625_1001405023300018081Groundwater SedimentMPALALHSFSKAIEIPAGLRNQPLDAWQTSARLSTVLERFGIHFLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFIVTDSEGATAKTPKDPTSFVIPESICYLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIGALDKSIPCWPEANHKLPRTYDPAGQFDLTLARIVREAGGPITVAEAYRRFSHSGGRHCRRLTVETFLQMLRGVECTVVEVKNPEVPIISLRSSNGGVFFRDVPGQNGKSSTVGKVHSNLRAIQFFKNGCCEHQAAGRR
Ga0173481_1010070113300019356SoilFAWEKNCGPKTLYELDLLARRARFQNGKASCNCHRCDCVHASHDFSATDSEGATAKMQEDAASFAIPKSICHFAFSELPLTTRLANVVRSMGARSLGDLNGRSAFELLQYRACGWGTISEIQQLIERAVSGEFDVSQIQEAKAAAELLSLLEQGLTKLPLRNRQFVLARIGAEIGSARIPRADLLCPSYAEIGRRYGLTRARVHKVFANTLESLRKIWGPKVPRLLEVIKSRCLSMICPLTPQLLEKWVDSRAASPPRPTTRDYFSTFQLSSKAHVRLIMALDKSIPCWLGTNHKPRRIDASVGEFDLALARVVRGAGGYISVVEAYRRLSHPTERNCRRLTVENFLRMLRSVERTVIEFKDPQSPIIRL
Ga0193720_100138323300019868SoilMPALALHSFSKAIEIPASLRNQPLDAWQTSARLSTVLERFGIHFLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGETAKIRKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMEAHVRLIAALDKSIPCWPETNHKLPRTDDAVGQFDLTLARVVREAGGQITVAEAYRRLSHPDGRHCRRLTVETFLQMLRSVEGTVVEVKNPEVPIVSLRSSNGGVLFRDVPGQNGKSSTPRKGSSNLAALQFFGTKNSFCEFQAVRRR
Ga0193701_100233723300019875SoilMPALALHSFSKAIEIPVGLRNQPLDAWQTSARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGETAKIRKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMDVHVRLIAALDKSIPCWPETNHKLPRTDDAVGQFDLTLARVVREAGGQITVAEAYRRLSHPDGRHCRRLTVETFLQMLRSVEGTVVEVKNPEVPIVSLRSSNGGVLFRDVPGQNGKSSTPRKGSSNLAALQFFGTKNSFCEFQAVRRR
Ga0193701_101566223300019875SoilMPALALHSFSKAIEIPARLHSQTLDAWPTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWLETNHKLRRIDDRVGQFDLTLANVVREAGVQITVAEAYRRLSHRGRRDYRRLTVEKCL
Ga0193722_100982723300019877SoilRPECFGVWPQTGKINLRSAMPALALHSFSKAIEIPARLHSQTLDAWPTSARLTTLLSRFGIRVLGDLHGRKIVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAYVRLIASLDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGVQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFKDPEVPMVSLRPSNPGGLFRDVPGQNGKSSTAGKVHSNLPAIQFFGTKNAFCERHAVGRR
Ga0193715_100010383300019878SoilMPALALHSFSKAIEIPASLRNQPLDAWQTSARLSTVLERFGIHFLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGETAKIRKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMDVHVRLIAALDKSIPCWPETNHKLPRTDDAVGQFDLTLARVVREAGGQITVAEAYRRLSHPDGRHCRRLTVETFLQMLRSVEGTVVEVKNPEVPIVSLRSSNGGVLFRDVPGQNGKSSTPRKGSSNLAALQFFGTKNSFCEFQAVRRR
Ga0193723_100002363300019879SoilMPALALHSFSKAIEIPARLHSQTLDAWPTSARLTTLLSRFGIRVLGDLHGRKIVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWLETNHKLRRIDDRVGQFDLTLANVVREAGVQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFKDPEVPMVSLRPSNPGGLFRDVPGQNGKSSTAGKVHSNLPAIQFFGTKNAFCERHAVGRR
Ga0193707_100408323300019881SoilMPALALHSFSKAIKIPASLRDQPLDTCQTSARLTAVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASCNGHRRECVHASHSFSATGSERATAQMQEDAASFAIPESICHLDFNELPITTRLANVVRSMGARSLGDLNGRSAFELLQYKACGWGTISEVQQLIERAVSGEFDVGQIEQATAAAELLRLLEQGLAKLPLRDRQFVLARIGAQIGGGRSSGADHFCLTYAEIGQRYGLTRARVHKVFANTLDSLRKVWGPRVPRLLEVIKWRCVSAICPLTPQLLEKWVGSPAGFSSRRTTRDYLNSFRLSMGAHVRLIAALDRNIPCWPDTNHKLRRVDDPVGRFDLTLAHVVREAGGQITMAEAYRRLSHPGRRDYRRLTVEKFLQMLPSVECVVEFEDPEVPIVSLRLSNGGALFGDVPSQNGKSPTHRKVHSNLSAIQFFKNGSCERQAVGRG
Ga0193707_105417813300019881SoilSRFGIRVLGDLHGRRVVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFKDPEVPMVSLRPSNPGGLFRDVPGQNGKSSTAGKVHSNLPAIQFF
Ga0193725_100567823300019883SoilMPALALHSFSKAIKIPASLRDQPLDTCQTSARLTAVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASCNGHRHGRVHASHRFSATGSERATAKMQEDAARFAIPESICHLDFNELPITTRLANVVRSMGARSLGDLNGRSAFELLQYKACGWGTISEIQQLIERAVSGEFDVGQIEQATVAAELLRLLEQGLAKLPLRDREFVLARIGAQNRGGRSPGADLFCLTYAEIGQRYGLTRARVHKVFAKTLDSLRKVWGPRVPRLLEVIKWRCVSAICPVTPQLLEKWVGSPAAFSSRPTTRDYLNGFRLSMEAHVRLIAALDKSIPCWPDTNHKLRRVDDPAGRFDLTLAHVVREAGGQITIAEAYRRLSHPGRRDYRRLTVEKFLQMLPSVECTVLEFKDPEVPVVSLRPSNRGVLFRHVPGQNGKSSTCRKVHSNLPAIQFFKNGSCERHAVGRR
Ga0193747_100275553300019885SoilMPALALHSFSKAIKIPASLRNQPLDTWQTSARLTAVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASCNGHRHGRVHAAHSFSATSSERATAKMQEDAARFAIPESICHLDFNELPITTRLANVVRSMGARSLGDLNGRSAFELLQYKACGWGTISEVQQLIERAVSGEFDVGQIEHATAAAELLRLLEQGLAKLPVRDREFVLARIGAQIGGGRSPGADLFCLTYAEIGQRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKLRCVSAISPLTPQLLEKWVGSPPSFSSRRPTRDYLNGFRLSMEAHVRLIAALDKSIPCWPDTNHKLRRVEDPVGRFDLTLAHVVREAGGQITMAEAYRRLSHPGRRDYRRLTVEKFLQMLPSVECTVVEFEDPEVPVVSLRLSNGGALFGDVPSQNGKSSTHRKVHSNLSAIQFF
Ga0193729_101737023300019887SoilMPASALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVEFAWEKNCGAKTLYELDLLARRARFRNGEASCNGHRRDCVHASHDFTATDSEGTIAKTQEGAASFAIPESICHLAFNELPITARLANVVRSMGARSLGDLNGRGAFELLRYRACGWGTISEIQQLIERAVSGEFDVAQIEEATAAAELLSLLEQGLAKLPLRDRQFVLARIGGEMGSARSRRVDLLCLSYAEIGRRYGLTRARVHKVFANTLDNLRKIWGPRVPRLLEVIKWRCLSTICPLTPQLLENWVGPPAAFSVRPTTRDYLSSFRLSMEAHVRLIAALDKNIPCWPETNPKRPRIDDPVGQFDLTLAHVVREAGGQITVAEAYRRLSHPGRRDYRRVTVENFLQLLRSTDYTAIEFTDPQSPIIRLRAWNGENFRARIRSDQPSAPSRTDPATIPFFGSNSMCGKHAAIGPR
Ga0193729_104881923300019887SoilMPALALHSFSKAIEIPPRLHTQPLDAWPTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGAKTLYELDLLARRARSRNGKASGSGHRRNCVLASHEFTATDNEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSMGARSLGDINGRSAFELLQYKACGWGTISEVQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPTAFSSRPTTRDYLNSFRLSMEAHVRLIAALDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGVQITVAEAYRRLSH
Ga0193751_102359433300019888SoilVPALALHSFFKAIEIPTSLRGQPLDGWQTSARLSAVLDRFGIRVLGDLHGRKVVDFAWERNCGPKTLYELDLLARRARFRNGKASCTGHQRGCAHAARMPAHSFSATRGEEASAKTQEDAASFAIPESVCHLSFNELPITTRLANVVRLIGTRTLGDLNGRSAFELLQYKACGWRTLSEIQQLIERAVSREFDLGQIEQATAVGELLSLLEQGLAKLPLRDRQFVLARIGAEIRGARSPGADLLCLTYAEIGRRYGLTRARVHKVFANTLKSLRKIWGPRVPRLLEVIKWRCLSTICPLTPQLLRKWIDSPAASSSRPMTRDFLNSFRLSMEAHVRLIAALDKSIPCWPETNHKLRRIDDPIGQFDLTLAQVVSEAGGHISVAEAYRKLSQPGGRDYRRLTIQNFLWMLCGVESTVVEFKDPEIPIIRLRPSNNENLRAQVRTGQLSAAQETGLEAIPFFGAKSIVRECAGIGPR
Ga0193728_105984113300019890SoilHSCACDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSQNGKASCNGHRRGYVRASYTPSHGFSATGTERATARMQQDAASFAIPESICHFSFSELPITTRLANVVRAIGARSLGDLNGRSAFEMLQYKACGWGTISEIQQLIERAGSGEFDIDEVEETTAAAELLSLLEQGLAKLPLRDRQFVLARIGGEIVSARSPRADLLCLSYAEIGRRYGLTRARVHKVFANTLDRLRKIWGPRVPRLLEVIKWRCLSEICPLTPQFLEKWVGSPAAFSSRRTTRDYLTGSRLSMEAHVRLIAALDKNIPCWPETNHKLRRIDHPVGQFDLTLAHVVREAGGQITVAEAYRSLSHPGGRDYRRLTVEKFLQMLRSVEGTAVEFKDPKVPIVSLRPSNRGVLFRDVPGHGKSSTARKVHSNLSAIQFFKNGSCERHAVGRR
Ga0193731_103797513300020001SoilMPALALHSFSKAIEIPARLHSQTLDAWPTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWPETNHKLRRIDDRV
Ga0193755_100917523300020004SoilMPALALHSFSKAIEIPERLHSQPLDAWQTSARLTTLLSRFGIRLLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASCNGHRRECVHASHDFIAADSEGATAKMQEDAASFAIPESICHLAFNELPITTRLANVVRSMGGRSLGDLNGRSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEAAAAAELLSLLEQALAKLPLRDRQFVLARIGAQTGNGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSRAAFSSRPTTRDYLNGFRLSMEAHVRLIAALDKNIPCRPETNHKLPRSDDPVGRFDLTLAHVVREAGGQITVAEAYRRLSHPGGRHYRRLTVEKFLQMLRCVECTVVEFEDPEVPIVSLRPSNGGALFGDVPGQNGKSSTHRKVHSNLSAIQFLKNGFCERQAVGRG
Ga0193733_100105363300020022SoilMPALALHSFSKAIEIPARLHSQTLDAWPTSARLTTLLSRFGIRVLGDLHGRKIVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGVQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFKDPEVPMVSLRPSNPGGLFRDVPGQNGKSSTAGKVHSNLPAIQFFGTKNAFCERHAVGRR
Ga0210382_1004676513300021080Groundwater SedimentSKAIEIPARLHNQPLNAWQTSARLTTLLSRFGIRVLGDLHGRKVVEFAWEKNCGAKTLYELDLLARRARFRNGEASCNGHRRDCVHASHDFTATDSEGMIAKTQEGAASFAIPESICHLAFNELPITTRLANVVRSMGARSLGDLNGRGAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFKDPEVPMVSLRPSNPGGLFRDVPGQNGKSSTAGKVHSNLPAIQFFGTKNGFCERKAVGRR
Ga0210408_1008081423300021178SoilMPALALHSFSKAIKIPASLRDQPLDTWQTSARLTAVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARRAQSRNGKASCNGHRRECVHASHDFIAADSEGATAKMQEDAASFAIPESICHLAFNELPITTRLANVVRSMGARSLGDLDGRSAFELLQYKACGWGTISEIQQLIERAVSGEFDVAQIEEAAAAAELLSLLEQGLAKLPLRDRQFVLARFGAQTGNGRSPGADLLCLTYAEIGRQYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKWRCLSTICPLTPQLLEKWVGSRAVCSSRPTTRDCLNSFRLSMEAYVRLIAALDKNIPCWPETNHELRRTDDPVGQFDLTLAHVVREAGGRTTVSDAYRRLSHPRSRDYRRLTVERFLQMLCTVECTVLEFKDPEVPIVSLRPSNRGVLFRHVPGQNGKSSTGRKVHSNLAAIQFFGTKPGFSERQAVERR
Ga0193719_1005108213300021344SoilMPALALHSFSKAIEIPARLHSQTLDAWPTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPFRDRQFVLARIGAQIRSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFKDPEVPMVSLRPSNPGGLFRDVPGQNGKSSTAGKVHSNLPAIQFFGTKNAFCERHAVGRR
Ga0193709_102282813300021411SoilMPALALHSFSKAIEIPARLHSQTLDAWPTSARLTTLLSRFGIRVLGDLHGRKIVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKIQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFKDPEVPMVSLRPSNPGGLFRDVPGQNGKSSTAGKVHSNLPAIQFFGTKNAFCERHAVGRR
Ga0222622_1021952313300022756Groundwater SedimentKGRPECFGVRPQTGKINLRNAMPALALHSFSKAIEIPARLHNQPLDGWPTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHPRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFL
Ga0207697_1000297463300025315Corn, Switchgrass And Miscanthus RhizosphereMPALALHSFSKPIEIPVGLRNQPLDAWQTSARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASRNGHRRDYIDASHDFTATDSEGATAKIRKESFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWGTISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSATCPLTPQLLEKWEGSSAVFSSRPTTGDCFNSFRLSMEAHVRLIAALDKSIPCWPETNHKLPRSYDPVGQFDLTLARIVREAGGQITVAEAYRRFSHPG
Ga0207693_1022560923300025915Corn, Switchgrass And Miscanthus RhizosphereTLYEFDLLARRAQSRNGKASCNGHCNGRVHASHSFSATGSELATAKMQEDAAAFAIPESICHLSFGELPITTRLSNVVRAIGARSLGDLNGRGAFEMLQYKACGWGTISEIQQLIERAVSGEFDVRQVDETTAPAELLNLLEQGLAKLPLRDRQFVLARIGAQAGSGRSPAADLLCPSYEEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKWRCLSTICPLTPQLLGKWLDSPAASLPRPKTRDFLNRVRLSMEAHVRLITALDKNIPCWPEPNHKLRRIDDSLGQFDLALASAVREAGGHISVVEAYRQLSHPTKRNCRPLTIERFLQMLRSVESTVIDFKDPQSPIVRLHPWNAGNIRVHVGTNQPSAPGRTNRQATPLFRNGFCERQAV
Ga0207646_1006488333300025922Corn, Switchgrass And Miscanthus RhizosphereMPALALHSFSRAIEIPERLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHQSECVHASDDFIATDSERATAKMQEDAASFAIPESICHLAFNELPLTTRLANVVRSMGARSLDDLNGRSAFELLQYRACGWGTISEIQELIERAVSGEFDVAPIEEAAAAAELLSLLEQGLAMLPLRDRQFVLARIGAQTGSGRRPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLEVIKWRCVSAICPLTHQLLEKWVGSRAAFSSRPTSGEYLNSVRLSMEAQVRLIAALDKNIPCWPETNHKLQRIGEPAGQFDLTLAHVVREAGGQITVAEAYGRLSHPGRRDYRRLTVERFLQMLRTAECTVVEVKNPEVPVIRLRSSDGGVLSRHVPGENGKSSTAGKVHSNLAAIQFFKNGFCERQAV
Ga0207701_10000687173300025930Corn, Switchgrass And Miscanthus RhizosphereMPALALHSFSKPIEIPVGLRNQPLDAWQTSARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASRNGHRRDYIDASHDFTATDSEGATAKIRKESFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWGTISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSATCPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSTEAHVRLIAALDKSIPCWPETNHKLPRSYDPVGQFDLTLARIVREAGGQITVAEAYRRFSHPG
Ga0209470_106509313300026324SoilMPALALHSFSKAIEIPARLHNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGSKTLYELDLLTRRARFRNGKASCNGHLRECVYASPDFTATDSEGATAKMQENAASFAIPESICHLAFNELPITTRLANVVRSIGARSLGDLNRRSAFELLQYRACGWGTISEIQQLIERGVSGEFDVAQIEEAAAASELLSLVEQGLAKLPLRDRQFVLARIGAQTGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDSLRKSWGPRVPRLLEVIKWRCVSAIFPLTPQLLEKWVGSRAAFSSRPTTGNYLNSVRLSMEAHVRLIVALDKNIPCWPETNHKLRRIDEPVGRFDLTLAHVVREAGGQLTVAEAYRRLSHPRKRDYCRLTVEKFLQMLPSVECTVVEFKDPEVPIVSLHPSDRRVLFRHVPGQNGKSSIARKVRSNLPAIQFFKNGFCERQAIGRR
Ga0257161_101751723300026508SoilVPALALHSFSKAIKIPASLRDQPLDTWQTSARLTAVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYEFDLLARHAQSRNGKASCNGHRHGRVHASHSFSATNSEGATAKMQEDTASFAIPKSICHLAFNELPITTRLANVVRSIGARNLGDLNGRSAFELLQYRACGWGTISEIQQLIERAVTGEFDVAQIEETTAPAELLSLLEQGLSKLPLRDRQFVLARIGGEIVSARSPRADPLCPSYAEIGRRYGLTRARVHKAFANTLDSLRKIWGPRVPRLLEVIKWSCLSTICPLTPQLLGKWLDSPAASSPRPKTRDFLNRLRLPMEAHVRLIAALDKNIPCWPETNRKLRRIDDSLGQFDLALAHVVREAGRQITMAEAYRKLSHPGGRD
Ga0209998_1002433113300027717Arabidopsis Thaliana RhizosphereFRDLSQTRKVNLKTAVPALALHSFSRAIEIPVSLRDQPLDVWKTSARLTAMLDRFGIRLLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNGKASCNGHRRGCVHASHTPSHSFSATASERATAKMHEDVASFAIPESICHLSFDELPITRRLANVVRCIGAQSLSDLNGRNAFELLQYKACGWGMISELQQVIERAVSGEFDVGQIEETTAAAELLSLLEQGLTKLPLRNRQFVLARIGAEIGSARIPRADLLCPSYAEIGRRYGLTRARVHKVFANTLESLRKIWGPKVPRLLEVIKSRCLSMICPLTPQLLEKWVDSRAASPPRPTTRDYFSTFQLSSKAHVRLIMALDKSIPCWLGTNHKPRRIDASVGEFDLALARVVRGAGGHISVVEAYRRLSHPTERNCRRLTVENFLRMLRSVERTVIEFKDP
Ga0137415_1015452633300028536Vadose Zone SoilVIEIPASLRNQPLDAWQTSARLTTLLSRFGIRVLGDLHGRKVVEFAWEKNCGAKTLYELDLLARRARFRNGEASCNGHRRDCVHASHDFTATDSEGTIAKTQEGAASFAIPESICHLAFNELPTTTRLANVVRSMGARSLGDLNGRSAFELLQHRACGWGTISEIQQLIERAVSGEFDVARIEEATAAAELVSLLEQGLAKLPLRDRQFVLARIGAQTGSASPGADILCLSYAEIGRRYGLTRARVHKVFANTLDNLRKIWGPRVPRLLEVIKWRCLSTICPLTPQLLENWVGPPAAFSVRPTTRDYLSSFRLSMEAHVRLIAALDKNIPCWPETNPKLPRIDDLVGQFDLTLAHVVHEAGGQITVAEAYRRLSHPGRRDYRRVTVENFLQILRSTDYTAIEFTDPQSPIIRLRAWNGENFRARIRSDQPSAPSRTDPATIPFFGSNSMFRKHAAIGPR
Ga0307282_1009793123300028784SoilMPALALHSFSKAIEIPARLHSQTLDAWPTSARLTTLLSRFGIRVLGDLHGRKIVDFAWEKNCGPKTLYELDLLARRARFQNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEF
Ga0307296_1012347713300028819SoilGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWLETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFKDPEVPMVSLRPSNPGGLFRDVPGQNGKSSTAGKVHSNLPAIQFFGTKNAFCERHAVGRR
Ga0307312_1000769713300028828SoilSKAIEIPARLHSQTLDAWPTSARLTTLLSRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPAAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGGQITVAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVECTAVEFKDPEVPMVSLRPSNPGGLFRDVPGQNGKSSTAGKVHSNLPAIQFFGTKNAFCERHAVGRR
Ga0307312_1015077913300028828SoilRRARFRNGTASGNGHRRDYIHASNGFTATDSEGETAKIRKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMDVHVRLIAALDKSIPCWPETNHKLPRTDDAVGQFDLTLARVVREAGGQITVAEAYRRLSHPDGRHCRRLTVETFLQMLRSVEGTVVEVKNPEVPIVSLRSSNGGVLFRDVPGQNGKSSTPRKGSSNLAALQFFGTKNSFCEFQAVRRR
Ga0307289_1008774813300028875SoilAWEKNCGLKTLYELDLLARRARFRNGTASGNGHRRDYIHASNGFTATDSEGETAKIRKDATSFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATAIAELLSLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSAICPLTPQLLEKWVGSSAAFSSRPATGDCFNSFRLSMDVHVRLIAALDKSIPCWPETNHKLPRTDDAVGQFDLTLARVVREAGGQITVAEAYRRLSHPDGRHCRRLTVETFLQMLRSVEGTVVEVKNPEVPIVSLRSSNGGVLFRDVPGQNGKSSTPRKGSSNLAALQFFGTKNSFCEFQAVRRR
Ga0307304_1010709113300028885SoilMPALALHSFSKAIEIPARLHSQTLDAWPTSARLTTLLSRFGIRVLGDLHGRKIVDFAWEKNCGPKTLYELDLLARRARFRNGKASRSGHRRNCVPASHDFTATDSEGATAKMQEDAASFAIPESICHLAFSELPLTTRLANVVRSIGARSLGDLNGLSAFELLQYRACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQIGSGRSPGADLLCLSYAEIGRRYGLTRARVHKVFANTLDGLRKIWGPRVPRLLEVIKWRCLSAICPLTPQLLEKWVGSPTAFSSRPTTRDYLNSFRLSTEAHVRLIASLDKNIPCWPETNHKLRRIDDRVGQFDLTLANVVREAGV
Ga0170824_10476213413300031231Forest SoilLALHSFSKAIEIPTTLRDQPLDVWPTSARLSAVLGRFGIRVLGDLQGRKVVDFAWERNCGPKTLYELALLARRAQSRNGKASCNGNRRGCVNTSHTPSHSFTATGSERATAKMQEDAASFAIPESICHLSFSELPITTRLANVVRAIGARSLGDLNGRGAFEMLQYKACGWGTISEIQQLIERADFGEFDVLPMEETTAPAELLSLLEQGLAKLPLRDRQFVLARIGGEIGSARSPAAYLLCPSYEEIGRRYGLTRARVHKVFANTLDSLRKIWGPRVPRLLELIKWRCVSTICPLTPQLLGKWLDIPAASSQRPKTRDFLNRVRLSMEAHVRLIAALDKNIPCWPETNHKLRRIDDSVGQFDLSLAHVVRQAGGQITVAEAYRRLSHPGRRDYRRLTIEKFLQMLRSVECTVIEFKDPEIPIVRLSTCSDENMRTRVQTSQPSAPGKIDPDAIQFFGTKNGFCERQAVAGR
Ga0170820_1115404813300031446Forest SoilFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLSRRAQSRNGKASCNGHRRGCVNTSHTPSHSFSVTASERATAKIQEDVASFAIPESICHLSFSELPITTRLANVVRTIGARSLGDLNGRSAFEMLQYKACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQTRNGRSPGADPLCLSHAEIGRRYGLTRARVHKVFANSLNTLRKIWGPRVPRLLEVIKWRCLSMICPLTPQLLEKWVDSRAAFSSRPKNRDFLNSVRLSMEAHVRLIAALDKNIPCWPGTNHKLRCIGDSVAQFDLAVARLVREAGGQISVAEAYRKLSHPGSRDYRRLTVERFLQMLRTVECTLLEFKDPEVPVVSVRPSNRGVLFRHVPGQNGKSSTARKV
Ga0170818_10870790323300031474Forest SoilVPALALHSFSRAIEVPASLRDQPLDVWETSARLTAVLDRFGIRVLGDLHGRKVVDFAWEKNCGPKTLYELDLLSRRAQSRNGKASCNGHRRGCVNTSHTPSHSFSVTASERATAKIQEDVASFAIPESICHLSFSELPITTRLANVVRTIGARSLGDLNGRSAFEMLQYKACGWGTISEIQQLIERAVSGEFDVAQIEEATVAAELLSLLEQGLAKLPLRDRQFVLARIGAQTRNGRSPSADPLCLSHAEIGRRYGLTRARVHKVFANSLNTLRKIWGPRVPRLLEVIKWRCLSMICPLTPQLLEKWVDSRAAFSSRPKNRDFLNSVRLSMEAHVRLIAALDKNIPCWPGTNHKLRCIGDSVAQFDLAVARLVREAGGQISVAEAYRKLSHPGSRDYRRLTVERFLQMLRTVECTLLEFKDPEVPVVCLRPPNRDVLFRHVPGQNGKSSTARKVHSNLPAIQIFENGFCERRAVERR
Ga0310812_1006278513300032421SoilLHSFSRAIEIPVSLRDQPLDVWKTSARLTAVLDRFGIRLLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRAQSRNGKASCNGHRRGCVHSHTPSHSFSATASERATAKMQEDVASFAIPESICHLSFDELPITKRLANVVRCIGAQSLGDLNGRNAFELLQYKACGWGMISELQQVIERAVSGEFDVGQIEETTAAAELLSLLEQGLTKLPLRNRQFVLARIGAEIGSARIPRADLLCPSYAEIGRRYGLTRARVHKVFANTLESLRKIWGPKVPRLLEVIKSRCLSMICPLTPQLLEKWVDSRAASPPRPTMRDYFSTFQLSSKAHVRLIMALDKSIPCWLGTNHKPRRIDASVGEFDLALARVVRGAGGHISVVEAYRRLSHPTERNCRRLTVENFLRMLRSVERTVIEFKDPQSPIIRLRPSNAEVFLRDVPTQNGKSSTARKVHSNLSAIQFFGTKYGSCERRAVARR
Ga0310810_1005475623300033412SoilVPALALHSFSKAIEIPASLRNQPLDAWLTSARLTAVLDRFGIRVLGDLHGRKVVDFAWEKNCGSKTLYELDLLARRARLRNGKVSCNGHRRGCVHASHNFIATGSERVTAKMPEDAASFAIPESICHLSFNELPITTRLANVVRSIGAKSLGDLNGRSAFELLQYRACGWGTISEIQQLIERAVSGEFDIGQIEETTAATELLSLLEQGLAKLPLRDRQFVLARIGAQKGSGRGLCANLLCLSYAEIGRRYGLTRARVHKVFANSLDTLRKIWGPRVPRLLEVIKWRCLSMICPLTPQLLEKWVGLSAAFSSRPTTRDYLNSVRLSMEAHVRLIAALDKNIPCWPETNHKLRRIDDPVGQFDLTLAHVVREADGQTTAAEAYRRLSHPGRRDYRRLTVEKFLQMLRSVQCTVAEFKDPELPTVSLRPSNRGVLFRDVPGQNGKSSTARKVHSDLPALNKAGVV
Ga0310810_1050648713300033412SoilARLSTVLERFGIHVLGDLHGRKVVDFAWEKNCGPKTLYELDLLARRARFRNGTASRNGHRRDYIDASHDFTATDSEGATAKIRKESFVIPESICHLAFNELPITTRLGNVVRSIGARNLGDLNGRSAFELLQYKACGWATISEIQQLIERAISGEFDVAQIEEATATAELLNLLEQGLAKLPLRERQFLLARIGAQTGTGRSPGADLLCLTYAEIGRRYGLTRARIHKVLVNTLDSLRRTWGPRVPRLLEVLKWRCLSATCPLTPQLLEKWVGSPAAFSSRPATGDCFNSFRLSTEAHVRLIAALDKSIPCWPETNHKLPRSYDPVGQFDLTLARIVREAGGQITVAEAYRRFSHPGGRHCRRLTVETFLQMLCSVECTVVEVKNPEVPIVSLRSSNG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.