NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F037139

Metagenome Family F037139

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F037139
Family Type Metagenome
Number of Sequences 168
Average Sequence Length 235 residues
Representative Sequence MNENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPQISSYVSLNVRVFRETPKEIRDRSLINRLRTTLKVGTRLGVR
Number of Associated Samples 95
Number of Associated Scaffolds 168

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 64.67 %
% of genes near scaffold ends (potentially truncated) 45.24 %
% of genes from short scaffolds (< 2000 bps) 57.14 %
Associated GOLD sequencing projects 74
AlphaFold2 3D model prediction Yes
3D model pTM-score0.57

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (92.857 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(35.714 % of family members)
Environment Ontology (ENVO) Unclassified
(62.500 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(63.095 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 25.29%    β-sheet: 17.12%    Coil/Unstructured: 57.59%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.57
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 168 Family Scaffolds
PF00534Glycos_transf_1 20.83
PF08484Methyltransf_14 12.50
PF13489Methyltransf_23 7.14
PF01329Pterin_4a 4.76
PF13692Glyco_trans_1_4 2.98
PF13401AAA_22 2.38
PF06745ATPase 2.38
PF00270DEAD 1.79
PF05050Methyltransf_21 1.79
PF08241Methyltransf_11 1.79
PF13191AAA_16 1.19
PF01022HTH_5 1.19
PF01835MG2 1.19
PF01916DS 1.19
PF12847Methyltransf_18 0.60
PF14520HHH_5 0.60
PF13439Glyco_transf_4 0.60
PF00271Helicase_C 0.60
PF13659Obsolete Pfam Family 0.60
PF08071RS4NT 0.60
PF00158Sigma54_activat 0.60
PF01370Epimerase 0.60

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 168 Family Scaffolds
COG2154Pterin-4a-carbinolamine dehydrataseCoenzyme transport and metabolism [H] 4.76
COG1899Deoxyhypusine synthaseTranslation, ribosomal structure and biogenesis [J] 1.19
COG2373Uncharacterized conserved protein YfaS, alpha-2-macroglobulin familyGeneral function prediction only [R] 1.19
COG1471Ribosomal protein S4ETranslation, ribosomal structure and biogenesis [J] 0.60


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms99.40 %
UnclassifiedrootN/A0.60 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002557|JGI25381J37097_1039750All Organisms → cellular organisms → Archaea → TACK group789Open in IMG/M
3300002558|JGI25385J37094_10001641All Organisms → cellular organisms → Archaea7214Open in IMG/M
3300002558|JGI25385J37094_10004843All Organisms → cellular organisms → Archaea4720Open in IMG/M
3300002558|JGI25385J37094_10009554All Organisms → cellular organisms → Archaea3452Open in IMG/M
3300002560|JGI25383J37093_10043060All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1481Open in IMG/M
3300002561|JGI25384J37096_10037250All Organisms → cellular organisms → Archaea1891Open in IMG/M
3300002561|JGI25384J37096_10049060All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1609Open in IMG/M
3300002562|JGI25382J37095_10012011All Organisms → cellular organisms → Archaea3264Open in IMG/M
3300002908|JGI25382J43887_10001116All Organisms → cellular organisms → Archaea10465Open in IMG/M
3300002908|JGI25382J43887_10004301All Organisms → cellular organisms → Archaea6637Open in IMG/M
3300002908|JGI25382J43887_10004784All Organisms → cellular organisms → Archaea6375Open in IMG/M
3300002908|JGI25382J43887_10005846All Organisms → cellular organisms → Archaea5914Open in IMG/M
3300002912|JGI25386J43895_10167012All Organisms → cellular organisms → Archaea → TACK group552Open in IMG/M
3300002916|JGI25389J43894_1015685All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1299Open in IMG/M
3300005167|Ga0066672_10144096All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1490Open in IMG/M
3300005174|Ga0066680_10096830All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1800Open in IMG/M
3300005175|Ga0066673_10005804All Organisms → cellular organisms → Bacteria4979Open in IMG/M
3300005176|Ga0066679_10008225All Organisms → cellular organisms → Bacteria5026Open in IMG/M
3300005177|Ga0066690_10208559All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1302Open in IMG/M
3300005179|Ga0066684_10001235All Organisms → cellular organisms → Archaea9954Open in IMG/M
3300005180|Ga0066685_10043476All Organisms → cellular organisms → Archaea2857Open in IMG/M
3300005180|Ga0066685_10049721All Organisms → cellular organisms → Archaea2689Open in IMG/M
3300005180|Ga0066685_10131884All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1689Open in IMG/M
3300005180|Ga0066685_10182249All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1436Open in IMG/M
3300005180|Ga0066685_10319897All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1075Open in IMG/M
3300005181|Ga0066678_10263070All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1119Open in IMG/M
3300005186|Ga0066676_10156456All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1437Open in IMG/M
3300005186|Ga0066676_10206571All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1265Open in IMG/M
3300005446|Ga0066686_10055026All Organisms → cellular organisms → Archaea → TACK group2442Open in IMG/M
3300005446|Ga0066686_10511723All Organisms → cellular organisms → Archaea → TACK group817Open in IMG/M
3300005450|Ga0066682_10741908All Organisms → cellular organisms → Archaea → TACK group599Open in IMG/M
3300005468|Ga0070707_100659074All Organisms → cellular organisms → Archaea → TACK group1009Open in IMG/M
3300005536|Ga0070697_100403065All Organisms → cellular organisms → Archaea → TACK group1187Open in IMG/M
3300005540|Ga0066697_10021071All Organisms → cellular organisms → Archaea3530Open in IMG/M
3300005540|Ga0066697_10364676All Organisms → cellular organisms → Archaea → TACK group840Open in IMG/M
3300005540|Ga0066697_10654557All Organisms → cellular organisms → Archaea → TACK group578Open in IMG/M
3300005552|Ga0066701_10026123All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon2977Open in IMG/M
3300005553|Ga0066695_10392903All Organisms → cellular organisms → Archaea → TACK group862Open in IMG/M
3300005554|Ga0066661_10025053All Organisms → cellular organisms → Bacteria3226Open in IMG/M
3300005555|Ga0066692_10276038All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1064Open in IMG/M
3300005556|Ga0066707_10402151All Organisms → cellular organisms → Archaea → TACK group891Open in IMG/M
3300005556|Ga0066707_10622347All Organisms → cellular organisms → Archaea → TACK group686Open in IMG/M
3300005557|Ga0066704_10207773All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1325Open in IMG/M
3300005558|Ga0066698_10003641All Organisms → cellular organisms → Bacteria7394Open in IMG/M
3300005558|Ga0066698_10322147All Organisms → cellular organisms → Archaea → TACK group1070Open in IMG/M
3300005559|Ga0066700_10030028All Organisms → cellular organisms → Bacteria3166Open in IMG/M
3300005568|Ga0066703_10007453All Organisms → cellular organisms → Archaea5058Open in IMG/M
3300005568|Ga0066703_10146104All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1419Open in IMG/M
3300005569|Ga0066705_10368920All Organisms → cellular organisms → Archaea → TACK group903Open in IMG/M
3300005575|Ga0066702_10219804All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1155Open in IMG/M
3300005586|Ga0066691_10254157All Organisms → cellular organisms → Archaea → TACK group1032Open in IMG/M
3300005586|Ga0066691_10532204All Organisms → cellular organisms → Archaea → TACK group701Open in IMG/M
3300006034|Ga0066656_10030516All Organisms → cellular organisms → Archaea2991Open in IMG/M
3300006034|Ga0066656_10163402All Organisms → cellular organisms → Archaea1399Open in IMG/M
3300006034|Ga0066656_10296740All Organisms → cellular organisms → Archaea → TACK group1042Open in IMG/M
3300006796|Ga0066665_10260453All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1377Open in IMG/M
3300006797|Ga0066659_10215201All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1416Open in IMG/M
3300006797|Ga0066659_10289532All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1244Open in IMG/M
3300006797|Ga0066659_10563041All Organisms → cellular organisms → Archaea → TACK group922Open in IMG/M
3300007258|Ga0099793_10212230All Organisms → cellular organisms → Archaea → TACK group930Open in IMG/M
3300009012|Ga0066710_100003659All Organisms → cellular organisms → Archaea13221Open in IMG/M
3300009012|Ga0066710_101582654All Organisms → cellular organisms → Archaea1005Open in IMG/M
3300009012|Ga0066710_102789726All Organisms → cellular organisms → Archaea → TACK group692Open in IMG/M
3300009038|Ga0099829_10010407All Organisms → cellular organisms → Bacteria6012Open in IMG/M
3300009088|Ga0099830_11136450All Organisms → cellular organisms → Archaea → TACK group648Open in IMG/M
3300009090|Ga0099827_10008888All Organisms → cellular organisms → Archaea6352Open in IMG/M
3300009090|Ga0099827_10011553All Organisms → cellular organisms → Archaea5739Open in IMG/M
3300009090|Ga0099827_10163482All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1830Open in IMG/M
3300009137|Ga0066709_100013473All Organisms → cellular organisms → Bacteria7713Open in IMG/M
3300010304|Ga0134088_10085940All Organisms → cellular organisms → Archaea → TACK group1472Open in IMG/M
3300010304|Ga0134088_10421528All Organisms → cellular organisms → Archaea → TACK group652Open in IMG/M
3300010329|Ga0134111_10129395All Organisms → cellular organisms → Archaea → TACK group987Open in IMG/M
3300010335|Ga0134063_10330463All Organisms → cellular organisms → Archaea → TACK group737Open in IMG/M
3300011270|Ga0137391_10526969All Organisms → cellular organisms → Archaea → TACK group998Open in IMG/M
3300012189|Ga0137388_11285301All Organisms → cellular organisms → Archaea → TACK group670Open in IMG/M
3300012199|Ga0137383_10009218All Organisms → cellular organisms → Bacteria6540Open in IMG/M
3300012199|Ga0137383_10087203All Organisms → cellular organisms → Archaea2259Open in IMG/M
3300012201|Ga0137365_10000100All Organisms → cellular organisms → Archaea43053Open in IMG/M
3300012201|Ga0137365_10135946All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1848Open in IMG/M
3300012203|Ga0137399_10023571All Organisms → cellular organisms → Archaea4090Open in IMG/M
3300012203|Ga0137399_10628818All Organisms → cellular organisms → Archaea → TACK group903Open in IMG/M
3300012203|Ga0137399_10877834All Organisms → cellular organisms → Archaea → TACK group755Open in IMG/M
3300012206|Ga0137380_10005195All Organisms → cellular organisms → Archaea11563Open in IMG/M
3300012206|Ga0137380_10006118All Organisms → cellular organisms → Archaea10724Open in IMG/M
3300012206|Ga0137380_10029522All Organisms → cellular organisms → Archaea5056Open in IMG/M
3300012206|Ga0137380_10289071All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1470Open in IMG/M
3300012206|Ga0137380_10497271All Organisms → cellular organisms → Archaea → TACK group1074Open in IMG/M
3300012206|Ga0137380_10529931All Organisms → cellular organisms → Archaea → TACK group1035Open in IMG/M
3300012206|Ga0137380_10765427All Organisms → cellular organisms → Archaea → TACK group835Open in IMG/M
3300012207|Ga0137381_10020324All Organisms → cellular organisms → Archaea5247Open in IMG/M
3300012207|Ga0137381_10021295All Organisms → cellular organisms → Archaea5137Open in IMG/M
3300012207|Ga0137381_10039905All Organisms → cellular organisms → Bacteria3833Open in IMG/M
3300012207|Ga0137381_10658192All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon911Open in IMG/M
3300012207|Ga0137381_11044127All Organisms → cellular organisms → Archaea → TACK group704Open in IMG/M
3300012209|Ga0137379_10005631All Organisms → cellular organisms → Archaea11830Open in IMG/M
3300012209|Ga0137379_10008957All Organisms → cellular organisms → Archaea9512Open in IMG/M
3300012209|Ga0137379_10018626All Organisms → cellular organisms → Archaea6662Open in IMG/M
3300012209|Ga0137379_10026517All Organisms → cellular organisms → Archaea5572Open in IMG/M
3300012209|Ga0137379_10036827All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon4720Open in IMG/M
3300012209|Ga0137379_10057552All Organisms → cellular organisms → Archaea3742Open in IMG/M
3300012209|Ga0137379_10133756All Organisms → cellular organisms → Archaea2382Open in IMG/M
3300012210|Ga0137378_10048363All Organisms → cellular organisms → Archaea3822Open in IMG/M
3300012210|Ga0137378_10182347All Organisms → cellular organisms → Archaea1952Open in IMG/M
3300012210|Ga0137378_10506308All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1115Open in IMG/M
3300012210|Ga0137378_10617688All Organisms → cellular organisms → Archaea → TACK group994Open in IMG/M
3300012210|Ga0137378_10633612All Organisms → cellular organisms → Archaea → TACK group979Open in IMG/M
3300012349|Ga0137387_10003041All Organisms → cellular organisms → Archaea8855Open in IMG/M
3300012356|Ga0137371_10016230All Organisms → cellular organisms → Archaea5717Open in IMG/M
3300012356|Ga0137371_10179780All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1658Open in IMG/M
3300012357|Ga0137384_10909463All Organisms → cellular organisms → Archaea → TACK group709Open in IMG/M
3300012359|Ga0137385_10009423All Organisms → cellular organisms → Archaea8627Open in IMG/M
3300012359|Ga0137385_10037122All Organisms → cellular organisms → Archaea4368Open in IMG/M
3300012362|Ga0137361_11810557All Organisms → cellular organisms → Archaea → TACK group528Open in IMG/M
3300012918|Ga0137396_10423124All Organisms → cellular organisms → Archaea987Open in IMG/M
3300012918|Ga0137396_10853167All Organisms → cellular organisms → Archaea → TACK group669Open in IMG/M
3300012927|Ga0137416_10689788All Organisms → cellular organisms → Archaea → TACK group896Open in IMG/M
3300012927|Ga0137416_10951468All Organisms → cellular organisms → Archaea → TACK group765Open in IMG/M
3300012972|Ga0134077_10033691All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1832Open in IMG/M
3300012977|Ga0134087_10097471All Organisms → cellular organisms → Archaea → TACK group1227Open in IMG/M
3300015359|Ga0134085_10177223All Organisms → cellular organisms → Archaea → TACK group912Open in IMG/M
3300017656|Ga0134112_10129272All Organisms → cellular organisms → Archaea → TACK group963Open in IMG/M
3300017657|Ga0134074_1109432All Organisms → cellular organisms → Archaea → TACK group952Open in IMG/M
3300017659|Ga0134083_10050954All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1559Open in IMG/M
3300018431|Ga0066655_10008581All Organisms → cellular organisms → Archaea4330Open in IMG/M
3300018431|Ga0066655_10272007All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1092Open in IMG/M
3300018433|Ga0066667_10143810All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1671Open in IMG/M
3300018433|Ga0066667_10219079All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1415Open in IMG/M
3300018482|Ga0066669_10021461All Organisms → cellular organisms → Bacteria3622Open in IMG/M
3300025922|Ga0207646_11226631All Organisms → cellular organisms → Archaea → TACK group657Open in IMG/M
3300026295|Ga0209234_1008133All Organisms → cellular organisms → Archaea3988Open in IMG/M
3300026296|Ga0209235_1000417All Organisms → cellular organisms → Archaea21168Open in IMG/M
3300026296|Ga0209235_1004060All Organisms → cellular organisms → Archaea8150Open in IMG/M
3300026296|Ga0209235_1006470All Organisms → cellular organisms → Bacteria6557Open in IMG/M
3300026296|Ga0209235_1034624All Organisms → cellular organisms → Archaea2572Open in IMG/M
3300026297|Ga0209237_1007673All Organisms → cellular organisms → Archaea6480Open in IMG/M
3300026297|Ga0209237_1028606All Organisms → cellular organisms → Archaea3073Open in IMG/M
3300026298|Ga0209236_1007387All Organisms → cellular organisms → Archaea6606Open in IMG/M
3300026301|Ga0209238_1000321All Organisms → cellular organisms → Archaea16351Open in IMG/M
3300026309|Ga0209055_1138518All Organisms → cellular organisms → Archaea → TACK group874Open in IMG/M
3300026313|Ga0209761_1033696All Organisms → cellular organisms → Archaea3043Open in IMG/M
3300026313|Ga0209761_1148411All Organisms → cellular organisms → Archaea → TACK group1108Open in IMG/M
3300026314|Ga0209268_1099421All Organisms → cellular organisms → Archaea → TACK group800Open in IMG/M
3300026324|Ga0209470_1001047All Organisms → cellular organisms → Archaea21030Open in IMG/M
3300026330|Ga0209473_1084074All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1339Open in IMG/M
3300026331|Ga0209267_1003614All Organisms → cellular organisms → Archaea9564Open in IMG/M
3300026342|Ga0209057_1111123All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1063Open in IMG/M
3300026342|Ga0209057_1133338All Organisms → cellular organisms → Archaea → TACK group881Open in IMG/M
3300026524|Ga0209690_1079295All Organisms → cellular organisms → Archaea1386Open in IMG/M
3300026529|Ga0209806_1000242All Organisms → cellular organisms → Archaea36019Open in IMG/M
3300026532|Ga0209160_1136210All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1169Open in IMG/M
3300026536|Ga0209058_1001667All Organisms → cellular organisms → Archaea18780Open in IMG/M
3300026536|Ga0209058_1019973All Organisms → cellular organisms → Archaea4417Open in IMG/M
3300026540|Ga0209376_1038484All Organisms → cellular organisms → Archaea2885Open in IMG/M
3300026540|Ga0209376_1252429All Organisms → cellular organisms → Archaea → TACK group753Open in IMG/M
3300026548|Ga0209161_10001884All Organisms → cellular organisms → Archaea17220Open in IMG/M
3300026548|Ga0209161_10026646All Organisms → cellular organisms → Archaea4054Open in IMG/M
3300026548|Ga0209161_10126641All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1503Open in IMG/M
3300026552|Ga0209577_10137289All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1938Open in IMG/M
3300027846|Ga0209180_10005761All Organisms → cellular organisms → Archaea6256Open in IMG/M
3300027862|Ga0209701_10524821All Organisms → cellular organisms → Archaea → TACK group640Open in IMG/M
3300027882|Ga0209590_10016814All Organisms → cellular organisms → Archaea3599Open in IMG/M
3300027882|Ga0209590_10086454All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1843Open in IMG/M
3300027882|Ga0209590_10187797All Organisms → cellular organisms → Archaea1301Open in IMG/M
3300027882|Ga0209590_10208348All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1237Open in IMG/M
3300028536|Ga0137415_10145566All Organisms → cellular organisms → Archaea → TACK group2207Open in IMG/M
3300028536|Ga0137415_10292572All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1433Open in IMG/M
3300028536|Ga0137415_10784375All Organisms → cellular organisms → Archaea → TACK group763Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil35.71%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil35.12%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil20.83%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil5.95%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.79%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.60%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cmEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026314Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25381J37097_103975013300002557Grasslands SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYXHEIRXLLKEYSNYDXECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLI
JGI25385J37094_1000164133300002558Grasslands SoilMKGNIVAVCISGKQEAWSISEIPFYCHTEGFKDFPHYPPVKTRERVHYLATRRNMAHRRALELNPNTEHFLSIDSYYLTSVHEIRHLLEEYSDYDVECILGATNWFPDYSRVPLKLRYWDIWATPEMKGRKYDYYPRHEGLPEGWESVRGCGGFTVYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIPSYVTLNVKALRETPKEIVNRSMINRLRTTLKVRTRLGLKRPLSSYGSEKRGDSR*
JGI25385J37094_1000484343300002558Grasslands SoilMKENITAVCISGKEEAWNVPEIPFYCHTAGFNKFPHYPPLKTRERVQYLSTRRNEANRRALEPNPTTEHFLSIDSYYLNQTTEIRKLIKEYSYYDDDCVLGATNWFLDYSKFPSKVRYWDIWATPEMKGKSYDYQPKNEGMPEGWERVRGCGGFTLYPRWLWERRGYGIPEPFPEAGNEVNYLCNYPGISTYVTFNVKAHRETPEELLKRSFARRLRTTVGLRSRLGLRQLEHKGHESN*
JGI25385J37094_1000955423300002558Grasslands SoilMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPKHERLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRETPSEIANRSLIKRLRTTIKVRTRLGMKRPLRSLSQGT*
JGI25383J37093_1004306023300002560Grasslands SoilVAVCISGKQEAWAIPEIPFYCHTEGFKEFPHYPPVKTSERIQYLAIRRNMAHKRALELNPKAEHFLSIDSYYLTKINEIQHLLKEYSEYEADCILGATNWFPDYSRVPLKLRYWDTWATPEMKDRKYDYYPRHEGLPQGWEVVRGCGGFTAYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGILSYVTLNVKALRETPKEIVNRSMINRVRTTIKLRTRLRMKRSPLLMGLITVHASETKPGPERRGSLSPACSRLDDLN*
JGI25384J37096_1003725013300002561Grasslands SoilMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPKHEGLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRETPSEIA
JGI25384J37096_1004906013300002561Grasslands SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNTTHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSHYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKSFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNHLCQCPGISSYVSLNVRVLRETPKEI
JGI25382J37095_1001201123300002562Grasslands SoilMKRNIVAVCISGKQEAWAIPEIPFYCHTEGFKEFPHYPPVKTSERIQYLAIRRNMAHKRALELNPKAEHFLSIDSYYLTKINEIQHLLKEYSEYEADCILGATNWFPDYSRVPLKLRYWDTWATPEMKDRKYDYYPRHEGLPQGWEVVRGCGGFTAYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGILSYVTLNVKALRETPKEIVNRSMINRVRTTIKLRTRLRMKRSPLLMGLITVHASETKPGPERRGSLSPACSRLDDLN*
JGI25382J43887_1000111683300002908Grasslands SoilMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPKXEGLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRETPSEIANRSLIKRLRTTIKVRTRLGMKRPLRSLSQGT*
JGI25382J43887_1000430133300002908Grasslands SoilMKQNIVAVCISGKQEPWSISEIPFYCHTEGFKEFPHYPPVKTRERIQYLAMRRNTANKQALELSPKAEHFLSIDSYYLAQVDEIRHLLKEYSDFNAECILGATNWFPDYSRVPLKLRYWDTWATPEMKDRKYDYYPRHEGLPEGWESVRGCGGFTLYPRWVWERRGYGVPEPFPDAGNEVSYLCQCPGIPSYVTLNVKAVRETPKEIVNRSMINRVRTTIKLRTRLRMKRSSLLVGLIMMHTSETKPDPERRGSLSQWPFAS*
JGI25382J43887_1000478473300002908Grasslands SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNTTHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSHYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKSFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNHLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLISRASEMHRLGPEPIRENVLAAPTI*
JGI25382J43887_1000584643300002908Grasslands SoilMKGNIVAVCISGKQEAWSISEIPFYCHTEGFKDFPHYPPVKTRERVHYLATRRNMAHRRALELNPNTEHFLSIDSYYLTSVHEIRHLLEEYSDXDVECILGATNWFPDYSRVPLKLRYWDIWATPEMKGRKYDYYPRHEGLPEGWESVRGCGGFTVYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIPSYVTLNVKALRETPKEIVNRSMINRLRTTLKVRTRLGLKRPLSSYGSEKRGDSR*
JGI25386J43895_1016701213300002912Grasslands SoilPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNXXHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSHYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKSFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNHLCQCPGISSYVSLNVRVLRETPK
JGI25389J43894_101568513300002916Grasslands SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRYLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRXRSLINRLRTTLRVGTRLGMRRPLISRASEMHGFGPEXIREXALDEPTIRKFNVT*
Ga0066672_1014409623300005167SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVL
Ga0066680_1009683023300005174SoilMKGNIVAVCISGKQEAWSISEIPFYCHTEGFKDFPHYPPVKTRERVHYLATRRNMAHRRALELNPNTEHFLSIDSYYLTSVHEIRHLLEEYSDYDVECILGATNWFPDYSRVPLKLRYWDIWATPEMKGRKYDYYPRHEGLPEGWESVRGCGGFTVYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIPSYVTLNVKALRETPKEIVNRSMINRLRTTLKVRTRLGLKRPLSSYGSEKRGDRTGSREGKNHSFYIESGHRIAMREVWTREQT*
Ga0066673_1000580443300005175SoilMNENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPQISSYVSLNVRVFRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLIGEMHRLGPEPMREDSLD*
Ga0066679_1000822553300005176SoilMKRNIVAVCISGKQEPWNLPEIPFYCHTEGFQEFPHYPPLKTRERIQYLAVRRNTAHKRALELNPNAEHFLSIDSYYLPSIDEIRRLLKEYSDYGEECILGATNWFEDYSRVPLKLRYWDTWATPEMKDKKYDYYPKHEGLPQGWEKVRGCGGFTVYPKWVWERRGYGVPDPFPEAGNEVNYLCQCPGISSYVTLNVKALRETPKEIVNRSMINRVRTTVKLGSRLRLKRTPFS*
Ga0066690_1020855913300005177SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLRVGTRLGMRRPLTSRASEMHGFGPEPIREDALDEPTIRKFNVT*
Ga0066684_1000123543300005179SoilMKQNIAAVCISGREENWSIGEVPFYCHTEGYKEFAHYPPVKTWERVKYLANRRNTANKKLLKLHPDTEHFLSIDSYYLDQVHEIRLLIKEYANYNMDCVLGATNWFLDSSKYPSRVRYWDTWATPEMKRKSYHYYPRHEQIPAGWERVAGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCNYPGIPSYVTFNVKARRETPKELLDRSFIKKLRTTVGLRSRLGLRRS*
Ga0066685_1004347623300005180SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRHLLQEYSNYNAECALGATNWFSDFSRIPLKLRYWDTWATPEMKDRGFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLINRASEMHRLGPEPIRENVLAAPTI*
Ga0066685_1004972123300005180SoilMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPRHEGLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRETPSEIANRSLIKRLRTTIKVRTRLGMKRPLRSLSQGT*
Ga0066685_1013188423300005180SoilMKENIVAVCISGKQEAWNIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAKRRNAAHKRALELHPNTEHFLSIDSYYLSYVREIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKEKRFDYYPRHEGLPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPGAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGISRPLISRASEMHDLGPEPIRQDILAEPTIRKFNATQLLD*
Ga0066685_1018224923300005180SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPKHEGLPEGWESVRGCGGFAVYPRWVWERRGYGMPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLISRASEMHLLGPEPIREDILD*
Ga0066685_1031989713300005180SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVL
Ga0066678_1026307013300005181SoilMKENIVAVCISGKQEAWGIPEIPFYCRAEGFKEFPRYPPVKTRERIQYLAMRRNMAHKQALELNPNAEHFLSIDSYYLTGINEIWHLLKEYVDYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKKYDYYPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIV
Ga0066676_1015645633300005186SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPKHEGLPEGWESVRGCGGFAVYPRWVWERRGYGMPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLINRASEMHRLGPEPIRENVLAAPTI*
Ga0066676_1020657113300005186SoilMKANITAVCISGKKEPWNVAEIPFYCHIEGFKEFPHYPPVKTRERVQYLSMRRNDANRRGLELNPDTEHFLSIDSYYLNQVGEIRKLVKEYSAHNADCVLGATSWFLDYSKFPSKVRYWDTWATPEMLGKSYNYYPRNKGMPEGWEKVKGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNFLCQYPGIPSFVSSNVKAHRDTPVELLSRSLVSRIRTTVGLRSRLGLRD*
Ga0066686_1005502613300005446SoilVKAHITAVCISGKEESWNVAEIPFYCHTEGFKEFPHYPPVKTRERVQYLSMRRNDANRRALELNPDTEHFLSIDSYYLNQVGEIRGLVKEYNAYTADCVLGATNWFLDYSRFPSKLRYWDSWATPEMLGKSYNYYPRNKGMPEGWEKVRGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNYLCQYPGIPSFVTSNVKAHRDTPVELLSRSLVSRIRTTVGLRSRLGLRG*
Ga0066686_1051172313300005446SoilMKANITAVCISGKKEPWNVAEIPFYCHIEGFKEFPHYPPVKTRERVQYLSMRRNDANRRGLELNPDTEHFLSIDSYYLNQVGEIRKLVKEYSAHNADCVLGATSWFLDYSKFPSKVRYWDTWATPEMLGKSYNYYPRNKGMPEGWEKVKGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNFLCQYPGIPSFVSSNVKAHRDTPVELLSRSLVSR
Ga0066682_1074190813300005450SoilWSIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPGAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINR
Ga0070707_10065907423300005468Corn, Switchgrass And Miscanthus RhizosphereMKENIVAVCISGKQEAWSTLEIPFYCYTEGFKDFLHYPPVKTRERIQYLATRRNMAHERALELNPNAEHFLSIDSYYLTCTIEIRRLLKEYSDYNAECILGATNWFPDYSRVPLKLRYWDTWATPEMKDRKYDYYPKHEGLPEGWESVGGCGGFTLYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVTLNVKALRETPKEIVNRPLINRLRTTLGLRTKLGLKK
Ga0070697_10040306513300005536Corn, Switchgrass And Miscanthus RhizosphereMKGNIVAVCISGKQEAWSIPEISFYCHTGGFKEFPHYPPVKTRERIQYLAMRRNVAHKRALELNPNAEHFLSIDSYYLTSIDQIRHLLKEYSDYDAECILGATNWFLDYSRIPLRLRYWDTWATPEMKERKYEYYPRHEGLPPGWEMVRGCGGFTVYPRWVWERRGYGVPEPFPEAGNEVNYLCQYPGIPSYVTLNVKSLRETPKEIVNRSMINRVRTTIRLRSRLRMKRTPLLVGLMMSHTSETKPDPERRGSISPYHFAS*
Ga0066697_1002107113300005540SoilNDNTAFGETAQMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPKHEGLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRETPSEIANRSLIKRLRTTIKVRTRLGMKRPLRSLSQGT*
Ga0066697_1036467623300005540SoilEGYRDFPHYPPVKTRERIGYLAKRRNAAHKRALELHPNTEHFLSIDSYYLSYVREIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKEKRFDYYPRHEGLPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPGAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGISRPLISRASEMHDLGPEPIRQDILAEPTIRKFNATQLLD*
Ga0066697_1065455713300005540SoilGFKEFPHYPPVKTRERVQYLSMRRNDANRRALELNPDTEHFLSIDSYYLNQVGEIRGLVKEYNAYTADCVLGATNWFLDYSRFPSKLRYWDSWATPEMLGKSYNYYPRNKGMPEGWEKVRGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNFLCQYPGIPSFVSSNVKAHRDTPVELLSRSLVSRIRTTVG
Ga0066701_1002612323300005552SoilMKTNITAVCISGRKEPWNVAEIPFYCHTDGFKEFPHYPPVKTRERVQYLSKRRNDANRTALKLNPATEHFLSIDSYYLNQVGEIRKLVKEYRAYNSDCVLGATNWFLDYSKFPSKVRYWDAWATPEMLGKSYNYRPRNKGMPEGWEKVRGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNYLCQYPGIPCFVTSNVKAHRDTPVELLNRSLVNRIRTTVGLRSRLGLRG*
Ga0066695_1039290313300005553SoilIIGDIKKMKANITAVCISGKKEPWNVAEIPFYCHIEGFKEFPHYPPVKTRERVQYLSMRRNDANRRGLELNPDTEHFLSIDSYYLNQVGEIRKLVKEYSAHNADCVLGATSWFLDYSKFPSKVRYWDTWATPEMLGKSYNYYPRNKGMPEGWEKVKGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNFLCKYPGIPSFVSSNLTAHRDTPVELLSRSLVSRIRTTVGLRSRLGLRD*
Ga0066661_1002505313300005554SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLRVGTRLGMRRPLTSRA
Ga0066692_1027603823300005555SoilHYPPVKTRERVHYLATRRNMAHRRALELNPNTEHFLSIDSYYLTSVHEIRHLLEEYSDYDVECILGATNWFSDYSRVPLKLRYWDIWATPEMKGRKYDYYPRHEGLPEGWESVRGCGGFTVYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPWIPSYVTLNVKALRETPKEIVNRSMINRLRTTLKVRTRLGLKRPLSSYGSEKRGDRTGSREGKNHSFYIESGHRIAMREVWTREQT*
Ga0066707_1040215113300005556SoilMKENIVAVCISGKQEAWGIPEIPFYCRAEGFKEFPRYPPVKTRERIQYLAMRRNMAHKQALELNPNAEHFLSIDSYYLTGINEIWHLLKEYVDYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKKYDYYPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIVNRSMINRVRTTVKLRTRLRMKWNPLLAGTMITHDVRNETESTETKITLSPAVSPLTSLV*
Ga0066707_1062234713300005556SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRD
Ga0066704_1020777313300005557SoilMRLVHGKGVERMKESITAVCISGKEEAWSVPEIPFYCHTAGFKESPHYPPVMTRERVQYLSTRRNEANRRALELNPNTEHFLSIDSYYLNQTTEIRKLVKEYGYYDADCVLGATNWFLDHSKLPSKMRFWDTWATPEMRKKSYNYQPRNEGIPEGWERVRGCGGFTLYPRWLWEKRGYGIPEPFPKAGNEVNYLCNYPWIPTYVTFNVKAHRETPEELLNRSFARRLRTTIGLRSRLWHRRSEPRRLE
Ga0066698_1000364123300005558SoilMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPKHEGLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRETPSEIANRSLIKRLRTTIKVRTRLGMKRPLRSLSQGT*
Ga0066698_1032214713300005558SoilMKANITAVCISGKKEPWNVAEIPFYCHIEGFKEFPHYPPVKTRERVQYLSMRRNDANRRGLELNPDTEHFLSIDSYYLNQVGEIRKLVKEYNAYNSDCVLGATNWFLDYSRFPSKIRYWDSWATPEMLGKSYNYRPRNKGMPYGWEKVRGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNYLCQYPGIPCFVTSNVKAHRDTPVELLNRSLVNRIRTTVGLRSRLGLRG*
Ga0066700_1003002843300005559SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLR
Ga0066703_1000745323300005568SoilMRLVHGKGVERMKESITAVCISGKEEAWSVPEIPFYCHTAGFKESPHYPPVMTRERVQYLSTRRNEANRRALELNPNTEHFLSIDSYYLNQTTEIRKLVKEYGYYDADCVLGATNWFLDHSKLPSKMRFWDTWATPEMRKKSYNYQPRNEGIPEGWERVRGCGGFTLYPRWLWEKRGYGIPEPFPKAGNEVNYLCNYPWIPTYVTFNVKAHRETPEELLNRSFARRLRTTIGLRSRLWHRRSEPRRLESR*
Ga0066703_1014610413300005568SoilMKRNIVAVCISGKQEPWNLPEIPFYCHTEGFQEFPHYPPLKTRERIQYLAVRRNTAHKRALELNPNAEHFLSIDSYYLPSIDEIRRILKEYSDYGEECILGATNWFEDYSRVPLKLRYWDTWATPEMKDKKYDYYPKHEGLPQGWEKVRGCGGFTVYPKWVWERRGYGVPDPFPEAGNEVNYLCQCPGISSYVTLNVKALRETPKEIVNRSMINRVRTTVKLGSRLRLKRTPFS*
Ga0066705_1036892013300005569SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPGWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLRVGTRLGMRRPLTSRASEMHGFGPEPIR
Ga0066702_1021980423300005575SoilYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNWFLDFSRIPLKLRYWDTWATPEMKDRRFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGTEVNYLCQCREISSYVSLNVRVLRETPKEIRDRSLINRLRTTLRVGTRLGMRRPLTSRASEMHGFGPEPIREDALDEPTIRKFNVT*
Ga0066691_1025415723300005586SoilIVAVCISGKQEAWGIPEIPFYCRAEGFKEFPRYPPVKTRERIQYLAMRRNMAHKQALELNPNAEHFLSIDSYYLTGINEIWHLLKEYVDYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKKYDYYPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIVNRSMINRVRTTVKLRTRLRMKWNPLLAGTMITHDVRNETESTETKITLSPAVSPLTSLV*
Ga0066691_1053220413300005586SoilEYPHYPPVKTRERVQYLSGRRNAANRRLLELHPDTEHFLSIDSYYLDQVDNIRQMIAEYSVYVADCILGATNWFLDCSKYPSRVRYWDTWATPEMRGKTFDYYPMHKQIPRGWEPVGGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCNYPGIPAYVTFNVKAHRETPEELLNRSFARRLRTTIGLRSRLWHRRSEPRRLESR*
Ga0066656_1003051613300006034SoilGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPKHEGLPEGWESVRGCGGFAVYPRWVWERRGYGMPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLISRASEMHLLGPEPIREDILD*
Ga0066656_1016340223300006034SoilSGKEESWNVAEIPFYCHTEGFKEFPHYPPVKTRERVQYLSMRRNDANRRALELNPDTEHFLSIDSYYLNQVGEIRGLVKEYNAYTADCVLGATNWFLDYSRFPSKLRYWDSWATPEMLGKSYNYYPRNKGMPEGWEKVRGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNYLCQYPGIPSFVTSNVKAHRDTPVELLSRSLVSRIRTTVGLRSRLGLRG*
Ga0066656_1029674023300006034SoilVKAHITAVCISGKEESWNVAEIPFYCHTEGFKEFPHYPPVKTRERVQYLSMRRNDANRRGLELNPDTEHFLSIDSYYLNQVGEIRKLVKEYSAHNADCFLGATSWFLDYSKFPSKVRYWDTWATPEMLGKSYNYYPRNKGMPEGWEKVKGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNFLCQYPGIPSFVSSNGKAHRDTPVELLSRSLVSRIRTTVGLRSRLGLRD*
Ga0066665_1026045323300006796SoilWMKENIVAVCISGKQEAWGIPEIPFYCRAEGFKEFPRYPPVKTRERIQYLAMRRNMAHKQALELNPNAEHFLSIDSYYLTGINEIWHLLKEYVDYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKKYDYYPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALREPPKEIVNRSMINRVRTTVKLRTRLRMKWNPLLAGTMITHDVRNETESTETKITLSPAVSPLTSLV*
Ga0066659_1021520113300006797SoilIPFYCRAEGFKEFPRYPPVKTRERIQYLAMRRNMAHKQALELNPNAEHFLSIDSYYLTGINEIWHLLKEYVDYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKKYDYYPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIVNRSMINRVRTTVKLRTRLRMKWNPLLAGTMITHDVRNETESTETKITLSPAVSPLTSLV*
Ga0066659_1028953223300006797SoilMKQNIAAVCISGKEESWSVNGVPFYCHTEGHREYPHYPPVKTRERVQYLSGRRNAANRRLLELHPDTEHFLSIDSYYLDQVDNIRQMIAEYSVYVADCILGATNWFLDCSRYPSRVRYWDTWATPEMRGKTFDYYPMHKQIPRGWEPVGGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCNYPGIPAYVTFNVKAHRETPEELLNRSFARRL
Ga0066659_1056304123300006797SoilMTSSVEKMKANITAVCISGKKEPWNVAEIPFYCHTDGFKEFPHYPPVKTRERVQYLSKRRNDANRTALKLNPDTEHFLSIDSYYLNQVGEIRKLVKEYSAYNSDCVLGATNWFLDYSRFPSKIRYWDSWATPEMLGKSYNYRPRNKGMPYGWEKVRGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNYLCQYPGIPCFVTSNVKAHRDTPVELLNRSLVNRIRTTV
Ga0099793_1021223023300007258Vadose Zone SoilMKENITAVCISGKEEAWNVPEIPFYCHTAGFNKFPHYPPLKTRERVQYLSTRRNEANRRALEPNPTTEHFLSIDSYYLTQTTEIRKLIKEYSYYDDDCVLGATNWFLDYSKFPSKVRYWDIWATPEMKGKSYDYQPKNEGMPEGWERVRGCGGFTLYPRWLWERRGYGIPEPFPEAGNEVNYLCNYPGISTYV
Ga0066710_10000365963300009012Grasslands SoilMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPKHEGLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRETPNEIANRSLIKRLRTTIKVRTRLGMKRPLRSLSQGT
Ga0066710_10158265413300009012Grasslands SoilGREENWSIGEVPFYCHTEGYKEFAHYPPVKTWERVKYLANRRNTANKKLLKLHPDTEHFLSIDSYYLDQVHEIRLLIKEYANYNMDCVLGATNWFLDSSKYPSRVRYWDTWATPEMKRKSYHYYPRHEQIPAGWERVAGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCNYPGIPSYVTFNVKARRETPKELLDRSFIKKLRTTVGLRSRLGLRRS
Ga0066710_10207164513300009012Grasslands SoilAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRHLLQEYSNYNAECALGATNWFSDFSRIPLKLRYWGTWATPEMKDRGFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLINRASEMHRLGPEPIRENVLAAPTI
Ga0066710_10278972613300009012Grasslands SoilPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECILGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGLPEGWESVRGCGGFALYPRWIWERRGYGVPEPFPEAGNEVNYLCQCPGIPSYVSLNVKVLRETPKEVRDRSLINRLRTTLKVGTRLGVRRP
Ga0099829_1001040733300009038Vadose Zone SoilMKGNIVAVCISGKQEAWAIPEIPFYCHTEGFKEFPHYPPVKTSERIQYLAIRRNMAHKRALELNPKAEHFLSIDRYYLTKINEIQHLLKEYSEYEADCILGATNWFPDYSRVPLKLRYWDTWATPEMKDRKYDYYPRHEGLPQGWEVVRGCGGFTAYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGILSYVTLNVKALRETPKEIVNRSMINRVRTTIKLRTRLRMKRSPLLMGLITVHASETKPGPERRGSLSPACSRLDDLN*
Ga0099830_1113645013300009088Vadose Zone SoilIPFYCHTEGFKQFPHYPPVKTRERVQYLSKRRNDANSRALELNPDTQHVLSIDSYYLNQVGEIRQLVKEYSAYNADCVLGATNWFLDYSKFPSKVRYWDSWATPEMLGKPYNYYPSNEGMPEGWEKVRGCGGFTLYPRWLWESRGYGIPEPFPDAGNEVNYLCHYPGIPSFVTFNVKAHRDTPAELLSKSLVSRIRTTVGLRSRLGLRG*
Ga0099827_1000888833300009090Vadose Zone SoilMKRNITAVCISGKKETWSFDGIPFYCHTDGYKEFPNYPPAKTRERVQYLSNRRNAANRKLLELHSETEHFLSIDSYYLDQITEIRQLLREYTDFNADCVLGATNWFLDYSKYPVRVRYWDTWATPEMRGKAYDYYPRREQIPEGWERVRGCGGFTLYPRWLWEKRGYGMPEPFPGAGNEVNYLCNFPGIPSYVTFNVKAHRETPKELLDRSLVNRIRTTIGLRSRLVFRP*
Ga0099827_1001155323300009090Vadose Zone SoilMKQNIIAVCISGKQEAWGIPEIPFYCHAEGFKEFPHYPAMKTRERIQYLAVRRNMAHKQALELNPNAEHFLSIDSYYLTRINEIWHLLKEYADYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKKYDYYPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIVNRSMINRVRTTVKLRTRLRMKWNPLLAGTMTTLDVRNETESTETKITLSPAVSPLTFLV*
Ga0099827_1016348213300009090Vadose Zone SoilMKQNVAAVCISGKRENWNLGGVPFYCHIEGYKAFPHYPPVKTRERVQYLSSRRNTANKRLLELHPDTEHFLTIDSYYLHQVNEIRQLIREYTNHIADCVLGATNWFLDRSKYPSRVRYWDTWATPEMKGKAYDYYPRHKQIPEGWEQVRGCGGFALYPRWLWEKRGYGVPEPFPDAGNEVNYLCNFPGILAYVTFNVKAHRETPSELLNRSFVKRLRTTVGLRSRLGLRR*
Ga0066709_10001347323300009137Grasslands SoilMTSSVEKMKANITAVCISGKKEPWNVAEIPFYCHTDGFKEFPHYPPVKTRERVQYLSNRRNDANRTALKLNPDTEHFLSIDSYYLNQVGEIRKLVKEYSAYNSDCVLGATNWFLDYSRFPSKIRYWDSWATPEMLGKSYNYRPRNKGMPYGWEKVRGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNYLCQYPGIPCFVTSNVKAHRDTPVELLNRSLVNRIRTTVGLRSRLGLRG*
Ga0134088_1008594023300010304Grasslands SoilMKDHITAVCISGVQEYWRLPEIPFYCHTGGYRPYPHFPPQVTRDRLLYLSSRRNAANSFALQIHPETEHILSIDSYYLGYVSEIRQLVKEYVGYPDACILGATNWYLDKAKIPARIRYWDGWGTPEMLRRSYDFNPRHKGLPPGWEPVGGCGGFTLYPRWIWERQGYGIPEPFPEAGNEVNYLCRCPGVSSFVTLNVKVLRQTPPEVMNRSLLRRIRTTIGLRTRLGSGPRTTA*
Ga0134088_1042152813300010304Grasslands SoilRERIRYLATRRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPKHEGLPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLISRASEMHLLGPEPIREDILD*
Ga0134111_1012939513300010329Grasslands SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPKHEGLPEGWESVRGCGGFAVYPRWVWERRGYGMPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLIGEMHRLGPEPMREDSLD*
Ga0134063_1033046313300010335Grasslands SoilMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPKHEGLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRETPNEIANRSLIKRLRTTIK
Ga0137391_1052696913300011270Vadose Zone SoilMKRNITAVCISGKKETWSFDGIPFYCHTDGYKEFPNYPPAKTRERVQYLSNRRNAANRKLLELHSETEHFLSIDSYYLDQITEIRQLLREYTDFNADCVLGATNWFLDHSKYPVRVRYWDTWATPEMRGKAYDYYPRREQIPEGWERVRGCGGFTLYPRWLWEKRGYGMPEPFPGAGNEVNYLCNFPGIPSYVTFNVKAHRETPKELLDRSLVNRIRTTIGLRSRLGFRP*
Ga0137388_1128530113300012189Vadose Zone SoilCISGKAEHWKVSEIPFYCHTEGFKDFPHYPRVKTRERVQYLSERRNAANERLLQLYPETKHFLSIDSYYLNQVDEIRELITEYVRSNADCILGATNWFLDYSRFPARQTYWDTWATPEMRQRSYNYYPRHEGLPEGWERVGGCGGFTLYPRWLWEKRRYGIPEPFPDAGNEVNYLCQYSGIASFVTFNVKAHRETPQEILNKPLVSRIRTTIGLRSRLGIRTL
Ga0137383_1000921883300012199Vadose Zone SoilMKQNIAAVCISGKEESWSVNGVPFYCHTEGHREYPHYPPVKTRERVQYLSGRRNAANRRLLELHPDTEHFLSIDSYYLDQVDNIRQMIAEYSVYVADCILGATNWFLDCSKYPSRVRYWDTWATPEMRGKTFDYYPMHKQIPRGWEPVGGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCNYPGIPAYVTFNVKAHRETPEELLNRSFARRLRTTIGLRSRLGLRKSEPRRLESR*
Ga0137383_1008720323300012199Vadose Zone SoilMKENIVAVCISGKQEAWGIPEIPFYCHAEGFKEFPHYPPVKTRERIQYLAMRRNMAHKQALERNPHAEHFLSIDSYYLTGIKEIWHLLKEYAEYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKNYDYNPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIVNRSIINRVRTTVKLRTRLRMKWNPLLAGTMITHDVRNETESTETKITLSPAVSPLTFLV*
Ga0137365_10000100503300012201Vadose Zone SoilMKENIVAVCISGKQEAWGIPEIPFYCRAEGFKEFPRYPPVKTRERIQYLAMRRNMAHKQALELNPNAEHFLSIDSYYLTGINEIWHLLKEYVDYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKKYDYYPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIP
Ga0137365_1013594623300012201Vadose Zone SoilMKENIVAVCISGKEEAWSIPEIPFYCHAEGYKDFPHYPPVKTRERIRYLATRRNAAHKRTLELHPNTEHFLSIDSYYLSYAHEIRYLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWTTPEMRDKRFDYYPRHEGLPEGWESVRGCGGFAAYPRWIWERRGYGVPEPFPEAGNEVNYLCQCPGIPSYVSLNVKVLRETPKEIRDRTLINRLRTTLRVGTRLGVRRPLISRANETHGLGPEPFQEDILAEPTIRKFNVT*
Ga0137399_1002357153300012203Vadose Zone SoilMKRNIVAVCISGKQEAWSIPEIPFYCYTEGFRDFPHYPPVKTKERIQYLATRRNMAHKRALELNPNAEHFLSIDSYYLTNINEIRHLLNEYSDYDADCILGATNWFPDYSRVPLKLRYWDTWATPEMRDRKYDYYPRHEGLPQGWESVRGCGGFTLYPRWVWERRGYGVPEPFPETGNEVNYLCQCPGIPSYVTLNVKALR
Ga0137399_1062881813300012203Vadose Zone SoilMKQKIAAVCISGVEENWNVAEVPFYCHNEGYKDFPHYPPVKTRQRVQYLAARRNTANKRLLELHPDTEHILSIDSYYLIQVDEIRKLLNEYASYVADCVLGASNWFMDYSRLPRKMRYWDTWATPEMKGKPYDYYPRHEGMPAGWEQVMGCGGFSLYPRWLWEKRGYGIPEPFPKSGNEVNYLCQYSGIQSYVTLNVKAHRETPEELINRSLVNRLRTTIGLRSRLGLRRR*
Ga0137399_1087783413300012203Vadose Zone SoilNVPEIPFYCHTAGFNKFPHYPPLKTRERVQYLSTRRNEANRRALEPNPTTEHFLSIDSYYLNQTTEIRKLIKEYSYYDDDCVLGATNWFLDYSKFPSKVRYWDIWATPEMKGKSYDYQPKNEGMPEGWERVRGCGGFTLYPRWLWERRGYGIPEPFPEAGNEVNYLCNYPGISTYVTFNVKAHRETPEELLKRSFARRLRTTVGLRSRLGLRQLEHKGHESN*
Ga0137380_1000519593300012206Vadose Zone SoilMKGNIVAVCISGKQEAWSIPKIPFYCHTEGFKEFPHYPPVKTRERIQYLAMRRNMGHKRALELNPNAEHFLSIDSYYLTSTNEIRHLLKEYSDFDAECILGATNWFPDYSRVPVKLRYWDTWATPEMKDKKYDYYPRHDGLPEGWERVRGCGGFTLYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPEITSYVTLNVKAIRETPKEIVKRSMINRVRTTIKLRTRLRMKRSPLLVGLATVHTPETKPGLEGRGSLSPCQFAS*
Ga0137380_1000611883300012206Vadose Zone SoilMESLSAINDISSFWETSKMKGNIVAVCISAKQEAWSIPEIPFYCHTEGFKDFPHYPPVKTRERIQYLATRRNMAHKRALEMSPNAEHFLSIDSYYLTSINEIRHLLKEYSDFDAECILGATNWFPDYSRVPLKLRYWDTWATPEMKGRKYDYYPRHEGLPDGWEIVRGCGGFTVYPRWVWERRGYGVPEPFPDAGNEVNYLCQCPGIPSYVTLNVKALRETPKEIVNRSMINRVRTTIKLRTRLRMNRPLCS*
Ga0137380_1002952263300012206Vadose Zone SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRHLLREYTNYNAECVLGATNWFSDFSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRET
Ga0137380_1028907123300012206Vadose Zone SoilMKQKITAVCISGVEETWNVSEVPFYCHTEGHRDFPHYPPVRTKQRVQYLSTRRNAANKRLLEVHPDTEHFLSIDSYYLNQVEEIRKLVGEYVNYKPDCVLGASNWFRDYSKIPSKVRYWDTWATPEMKGKHHDYYPRHEGIPEGWERVSGCGGFTLYPRWLWEKRRYGIPEPFPESGNEVNYLCQYPGIRSYVTLNVKAHRETPMELRNRPFANRLRTTVGLRSRLGLRR*
Ga0137380_1049727113300012206Vadose Zone SoilMKQNIAAVCISGKEESWSVNGVPFYCHTEGHGEYPHYPPVKTRERVQYLSGRRNAANRRLLELHPDTEHFLSIDSYYLDQVDNIRQMIAEYSVYVADCILGATNWFLDCSKYPSRVRYWDTWATPEMRGKTFDYYPMHKQIPRGWEPVGGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCNYPGIPAYVTFNVKAHRETPEELLNRSFARRLRTTIGLRSRLGLRKSEPRRLESR*
Ga0137380_1052993123300012206Vadose Zone SoilMKENIVAVCISGKEEAWSIPEIPFYCHAEGYKDFPHYPPVKTRERIRYLATRRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRYLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWTTPEMRDKRFDYYPRHEGLPEGWESVRGCGGFAAYPRWIWERRGYGVPEPFPEAGNEVNYLCQCPGIPSYVSLNVKVLRETPKEIRDRTLINRLRTTLKVGTRLGVRRPLISRANETHGLGPEPFQEDILAEPTIRKFNVT*
Ga0137380_1076542723300012206Vadose Zone SoilMKVNIIAVCISGKEERWNVAEIPFYCHTAGFREFPHYPPVKTRERVEYLSTRRNDANRRALELNPDTEHFLSIDSYYLNQFSEIRKLVREYSTYDPDCVLGATNWFLDYSKFPSKIRYWDTWATPEMLGRSYNYYPKNKGMPEGWKSVRGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCQYPGIPSFITFNVRAHRDTPIELLRRSLVSR
Ga0137381_1002032423300012207Vadose Zone SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRHLLREYTNYNAECVLGATNWFSDFSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRMPLISRASEMHRLGPEPIRENVLAAPTN*
Ga0137381_1002129513300012207Vadose Zone SoilGTALSEWERLEITMESLSAINDISSFWETSKMKGNIVAVCISAKQEAWSIPEIPFYCHTEGFKDFPHYPPVKTRERIQYLATRRNMAHKRALEMSPNAEHFLSIDSYYLTSINEIRHLLKEYSDFDAECILGATNWFPDYSRVPLKLRYWDTWATPEMKGRKYDYYPRHEGLPDGWEIVRGCGGFTVYPRWVWERRGYGVPEPFPDAGNEVNYLCQCPGIPSYVTLNVKALRETPKEIVNRSMINRVRTTIKLRTRLRMNRPLCS*
Ga0137381_1003990543300012207Vadose Zone SoilMKENIVAVCISGKQEAWGIPEIPFYCHAEGFKEFPHYPPVKTRERIQYLAMRRNMAHKQALERNPHAEHFLSIDSYYLTGIKEIWHLLKEYAEYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKNYDYNPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIVNRSIINRVRTTVKLRTRLRMKWNPLLAGTMVTHNVRSETESTETKITLSPAVSLPTFLV*
Ga0137381_1065819233300012207Vadose Zone SoilVKADLKKNITAVCISGVQETWNVPEIPFYCYTEGYKYFPHYPPQATRERILYLSLRRNAAHRATLANYPETEHFLCIDSYYLGYVSEIRYLVKEYSGYGPDCILGATNWFRDYGRIPLRTKYWDGWATPEMVRRRYDYYPRHEGLPEGWEKVRGCGGFALYPRWVWEKRGYGIPEPFPEAGNEVNYLC
Ga0137381_1104412713300012207Vadose Zone SoilQGATQKMKVNIIAVCISGKEERWNVAEIPFYCHTAGFREFPHYPPVKTRERVEYLSTRRNDANRRALELNPDTEHFLSIDSYYLNQFSEIRKLVREYSTYDPDCVLGATNWFLDYSKFPSKIRYWDTWATPEMLGRSYNYYPKNKGMPEGWKSVRGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCQYPGIPSFITFNVRAHRDTPIELLRRSLVSRIRTTVGLRSRLGL
Ga0137379_10005631153300012209Vadose Zone SoilMKVNIIAVCISGKEERWNVAEIPFYCHTAGFREFPHYPPVKTRERVEYLSTRRNDANRRALELNPDTEHFLSIDSYYLNQFSEIRKLVREYSTYDPDCVLGATNWFLDYSKFPSKIRYWDTWATPEMLGRSYNYYPKNKGMPEGWKSVRGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCQYPGIPSFITFNVRAHRDTPIELLRRSLVSRIRTTVGLRSRLGLRA*
Ga0137379_10008957123300012209Vadose Zone SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRHLLREYTNYNAECVLGATNWFSDFSRIPLKLRYWDTSATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRET
Ga0137379_1001862623300012209Vadose Zone SoilMESLSAINDISSFWETSKMKGNIVAVCISARQEAWSIPEIPFYCHTEGFKDFPHYPPVKTRERIQYLATRRNMAHKRALEMSPNAEHFLSIDSYYLTSINEIRHLLKEYSDFDAECILGATNWFPDYSRVPLKLRYWDTWATPEMKGRKYDYYPRHEGLPDGWEIVRGCGGFTVYPRWVWERRGYGVPEPFPDAGNEVNYLCQCPGIPSYVTLNVKALRETPKEIVNRSMINRVRTTIKLRTRLRMNRPLCS*
Ga0137379_1002651743300012209Vadose Zone SoilMKGNIVAVCISGKQEAWSIPRIPFYCHTEGFKEFPHYPPVKTRERIQYLAMRRNMGHKRALELNPNAEHFLSIDSYYLTSTNEIRHLLKEYSDFDAECILGATNWFPDYSRLPVKLRYWDTWATPEMKDKKYDYYPRHDGLPEGWERVRGCGGFTLYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPEITSYVTLNVKAIRETPKEIVKRSMINRVRTTIKLRTRLRMKRSPLLVGLATVHTPETKPGLEGRGSLSPCQFAS*
Ga0137379_1003682743300012209Vadose Zone SoilMKQNIAAVCISGKEESWNVDGVPFYCHTEGYLEFPHYPPVKTRERVQYLSSRRNAANKRLLELYPDTEHFLSIDTYYLGQVNLIKQLIIEYTVHIADCVLGATNWFLDCSRYPSRVRYWDTWATPEMKGTAHNYYPIHEKIPRGWEQVGGCGGFTLYPRWLWEKRGYGIPEPFPDSGNEVNYLCNYPGIPTYVTFNVKAHRETPEELLKRSFVERLQTTVGLRSRLGLRR*
Ga0137379_1005755223300012209Vadose Zone SoilMVKNNITAVCISGIQESWRLTEIPFLCYTAGYEYFPHYPPVKTKERIMYLSQRRNNANRAALELYPDTQHFLSIDTYYLNYINEIHQLLREYSDYDGDCILGASNWYLDYSKIPAKIRYWDQWATPEMMHRRYDYYPRTAGLPEGWERVRGCGGFTLYPRWVWEKRGYGIPEPFPEAGNEVNYLCECPGIFSYVTLNVKALRHTPPEVMNRSLLARIRTTVGLRTRLGLKTTTKA*
Ga0137379_1013375623300012209Vadose Zone SoilVKADLKKNITAVCISGVQETWNVPEIPFYCYTEGYKYFPHYPPQATRERILYLSLRRNAAHRATLANYPETEHFLCIDSYYLGYVSEIRYLVKEYSGYGPDCILGATNWFRDYGRIPLRTKYWDGWATPEMVRRRYDYYPRHEGLPEGWEKVRGCGGFALYPRWVWEKRGYGIPEPFPAAGNEVNYLCQCPGISSYVTFNVKVLRQTPPEVENRSLVNRIRTTIGLRTRLGLKRVSSK*
Ga0137378_1004836343300012210Vadose Zone SoilMTRGAEKMKENITAVCISGKEEAWKVPEIPFYCHTAGFKEFPHYPPVKTKERVQYLSTRRNEANMRAVELNPNTEHFLNIDSYYLNQTTEIRKIVEEYGRYDPDCVLGATNWFLDYSKVPSKVRYWDTWATPEMKGKSYDYQPRNEGMPEGWERVRGCGGFTLYPRWLWEKRGYGIPEPFPEAGNEVNYLCDYPGIPTFVTFNVKAHRETPEELLNRSLVKRLRTTVGLRSRLGLRRSEPRRLESR*
Ga0137378_1018234723300012210Vadose Zone SoilMKENIVAVCISGKQEAWGIPEIPFYCHAEGFKEFPHYPPVKTRERIQYLAMRRNMAHKQALERNPHAEHFLSIDSYYLTGIKEIWHLLKEYAEYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKNYDYNPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIVNRSIINRVRT
Ga0137378_1050630823300012210Vadose Zone SoilMKQKITAVCISGVEETWNVSEVPFYCHTEGHRDFPHYPPVRTKQRVQYLSTRRNAANKRSLEVHPDTEHFLSIDSYYLNQVEEIRKLVGEYVNYKPNCVLGASNWFRDYSKIPSKVRYWDTWATPEMKGKRHDYYPRHEGIPEGWERVSGCGGFTLYPRWLWEKRRYGIPEPFHESGNEVNYLCQYPGIRSYVTLNVKAHRETPMELRNRPFANRLRTTVGLRSRLGLRR*
Ga0137378_1061768813300012210Vadose Zone SoilMKRNIVAVCISGKQEPWNIPEIPFYCHTEGFQEFPHYPPLKTRERIQYLAVRRNTAHKRALELSPNAEHFLSIDSYYLTSIDEIRRLLKEYSDYGEECILGATNWFEDYSRVPLKLRYWDTWATPEMRDKKYDYYPKHEGLPQGWEKVRGCGGFTVYPKWVWERRGYGVPDPFPEAGNEVNYLCQCPGISSYVTLNVKALRETPKEIVNRSMINRVRTTVKLGSRLRLKRTPSS*
Ga0137378_1063361223300012210Vadose Zone SoilMKENIVAVCISGKQEAWGIPEIPFYCRAEGFKEFPRYPPVKTRERIQYLAMRRNMAHKQALELNPNAEHFLSIDSYYLTGINEIWHLLKEYADYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKKYDYYPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIV
Ga0137387_1000304143300012349Vadose Zone SoilMKGNIVAVCISGKQEAWSIPKIPFYCHTEGFKEFPHYPPVKTRERIQYLAMRRNMGHKRALELNPNAEHFLSIDSYYLTSTNEIRHLLKEYSDFDAECILGATNWFPDYSRVPVKLRYWDTWATPEMKDKKYDYYPRHDGLPEGWERVRGCGGFTLYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPEITSYVTLNVKAIRETPKEIVKRSMINRVRTTIKLRTRLRMKRSPLLVGLATVHTPETKPGLAGRGSLSPCQFAS*
Ga0137371_1001623013300012356Vadose Zone SoilMKENIVAVCISGKEEAWSIPEIPFYCHAEGYKDFPHYPPVKTRERIRYLATRRNAAHKRTLELHPNTEHFLSIDSYYLSYAHEIRYLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWTTPEMRDKRFDYYPRHEGLPEGWESVRGCGGFAAYPRWIWERRGYGVPEPFPEDGNEINYLCQCPGITSYVSLNVKVLRETPKEIRDRTLINRLRTTLKVGTRLGVRRPLISRANETHGLGPEPFQEDILAEPTIRKFNVT*
Ga0137371_1017978013300012356Vadose Zone SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLGYVHEIRHLLQEYANYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFTVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLISRASEMHRLGPEPIRENVLAAPTI*
Ga0137384_1090946313300012357Vadose Zone SoilETWNVSEVPFYCHTEGHRDFPHYPPVRTKQRVQYLSTRRNAANKRLLEVHPDTEHFLSIDSYYLNQVEEIRKLVGEYVNYKPDCVLGASNWFRDYSKIPSKVRYWDTWATPEMKGKHHDYYPRHEGIPEGWERVSGCGGFTLYPRWLWEKRRYGIPEPFPESGNEVNYLCQYPGIRSYVTLNVKAHRETPMELRNRPFANRLRTTVGLRSRLGLRR*
Ga0137385_1000942343300012359Vadose Zone SoilMKGNIVAVCISGKQEAWSIPRIPFYCHTEGFKEFPHYPPVKTRERIQYLAMRRNMGHKRALELNPNAEHFLSIDSYYLTSTNEIRHLLKEYSDFDAECILGATNWFPDYSRVPVKLRYWDTWATPEMKDKKYDYYPRHDGLPEGWERVRGCGGFTLYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPEITSYVTLNVKAIRETPKEIVKRSMINRVRTTIKLRTRLRMKRSPLLVGLATVHTPETKPGLEGRGSLSPCQFAS*
Ga0137385_1003712213300012359Vadose Zone SoilMKENIVAVCISGKQEAWGIPEIPFYCHAEGFKEFPHYPPVKTRERIQYLAMRRNMAHKQALERNPHAEHFLSIDSYYLTGIKEIWHLLKEYAEYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKNYDYNPRHEGLPQGWEKVRGCGGFTVYPKCVWERQGYGVPEPFPEAGNEVNYLCLCPG
Ga0137361_1181055713300012362Vadose Zone SoilEIPFYCHTAGFKESPHYPPVMTRERVQYLSTRRNEANRRALELNPNTEHFLSIDSYYLNQTTEIRKLVKEYSRYGPDCVLGASNWFLDYSKFPSKVRYWDTWATPEMKGKSYHYRPKNEGMPEGWERVRGCGGFTLYPRWLWEKRGYGMPEPFPEAGNEVNYLCNYPGIMTYVTLN
Ga0137396_1042312423300012918Vadose Zone SoilVKTRQRVQYLAARRNTANKRLLELHPDTEHILSIDSYYLIQVDEIRKLLNEYASYVADCVLGASNWFMDYSRLPRKMRYWDTWATPEMKGKPYDYYPRHEGMPAGWEQVMGCGGFSLYPRWLWEKRGYGIPEPFPKSGNEVNYLCQYSGIQSYVTLNVKAHRETPEELINRSLVNRLRTTIGLRSRLGLRRR*
Ga0137396_1085316713300012918Vadose Zone SoilMKRNIVAVCVSGKEEVWSIPEIPFYCHTQGFKDFPHYPPVKTRERIQYLAARRNAAHKRTLELNPNTEHFLSIDSYYLTNVDGILHLLKEYSDYDAECILGATNWFPDYSRVPLKLRYWDTWGTPEMKDKRYDYYPIHQGLPEGWETVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCQCPGI
Ga0137416_1068978823300012927Vadose Zone SoilFYCHTEGFKEFPHYPPVKTSERIQYLAIRRNMAHKRALELNPKAEHFLSIDSYYLTKINEIQHLLKEYSEYEADCILGATNWFPDYSRVPLKLRYWDTWATPEMKDRKYDYYPRHEGLPQGWEVVRGCGGFTAYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGILSYVTLNVKALRETPKEIVNRSMINRVRTTIKLRTRLRMKRSPLLMGLITVHASETKPGLERRGSLSPACSRLDDLN*
Ga0137416_1095146813300012927Vadose Zone SoilMKGNIVAVCISGKQEAWSISEIPFYCHTEGFKDFPHYPPVKTRERVHYLATRRNMAHRRALELNPNTEHFLSIDSYYLTSVHEIRHLLEEYSDYDVECILGATNWFPDYSRVPLKLRYWDIWATPEMKGRKYDYYPRHEGLPEGWESVRGCGGFTVYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIPSYVTLNVKALRETPKEIVKRSMINRLQTTLKV
Ga0134077_1003369123300012972Grasslands SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNGFSDYSRIPLKLRYWDTWATPEMKEKRFDYYPRHEGLPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLISRASEMHLLGPEPIREDILD*
Ga0134087_1009747123300012977Grasslands SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSHYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWESVRGCGGFAVYPRWVWERRGYGMPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLISRASEMHLLGPEPIREDILD*
Ga0134085_1017722313300015359Grasslands SoilWNIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAKRRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSNYNAECVLGATNWFSDYSRFPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPKWVWERRGYGVPEPFPEAGNEVNYLCQCRGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLINRASEMHRLGPEPIRENVLAAPTI*
Ga0134112_1012927213300017656Grasslands SoilMKENIVAVCISGKQEAWSIPAIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPKHEGLPEGWESVRGCGGFAVYPRWVWERRGYGMPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLISRASEMHLLGPEPIREDILD
Ga0134074_110943223300017657Grasslands SoilMKENVVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAGHKRVLELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPKWVWERRGYGVPEPFPEAGNEVNYLCQCPQISSYVSL
Ga0134083_1005095423300017659Grasslands SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPKHEGLPEGWESVRGCGGFAVYPRWVWERRGYGMPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLINRASEMHRLGPEPI
Ga0066655_1000858143300018431Grasslands SoilVKAHITAVCISGKEESWNVAEIPFYCHTEGFKEFPHYPPVKTRERVQYLSMRRNDANRRALELNPDTEHFLSIDSYYLNQVGEIRGLVKEYNAYTADCVLGATNWFLDYSRFPSKLRYWDSWATPEMLGKSYNYYPRNKGMPEGWEKVRGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNYLCQYPGIPSFVTSNVKAHRDTPVELLSRSLVSRIRTTVGLRSRLGLRG
Ga0066655_1027200713300018431Grasslands SoilMKENIVAVCISGKQEAWNIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAKRRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPKHEGLPEGWESVRGCGGFAVYPRWVWERRGYGMPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLR
Ga0066667_1014381023300018433Grasslands SoilMKQNIAAVCISGREENWSIGEVPFYCHTEGYKEFAHYPPVKTWERVKYLANRRNTANKKLLKLHPDTEHFLSIDSYYLDQVHEIRLLIKEYANYNMDCVLGATNWFLDSSKYPSRVRYWDTWATPEMKRKSYHYYPRHEQIPAGWERVAGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCNYPGIPSYVTFNVKARRETPKELLDRSFIKKLRTTVGLRSRLGLRRS
Ga0066667_1021907913300018433Grasslands SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLRVGTRLGMRRPLTSRASEMHGFGPEPIREDALDEPTIRKFNVT
Ga0066669_1002146123300018482Grasslands SoilMNENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPQISSYVSLNVRVFRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLIGEMHRLGPEPMREDSLD
Ga0207646_1122663113300025922Corn, Switchgrass And Miscanthus RhizosphereAWSTLEIPFYCYTEGFKDFLHYPPVKTRERIQYLATRRNMAHERALELNPNAEHFLSIDSYYLTCTIEIRRLLKEYSDYNAECILGATNWFPDYSRVPLKLRYWDTWATPEMKDRKYDYYPKHEGLPEGWESVGGCGGFTLYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVTLNVKALRETPKEIVNRPLINRLRTTLGLRTKLGLKKSLSS
Ga0209234_100813323300026295Grasslands SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRNRSLINRLRTTLRVGTRLGMRRPLISRASEMHGFGPEPIREDALDEPTIRKFDVT
Ga0209235_100041783300026296Grasslands SoilMKENITAVCISGKEEAWNVPEIPFYCHTAGFNKFPHYPPLKTRERVQYLSTRRNEANRRALEPNPTTEHFLSIDSYYLNQTTEIRKLIKEYSYYDDDCVLGATNWFLDYSKFPSKVRYWDIWATPEMKGKSYDYQPKNEGMPEGWERVRGCGGFTLYPRWLWERRGYGIPEPFPEAGNEVNYLCNYPGISTYVTFNVKAHRETPEELLKRSFARRLRTTVGLRSRLGLRQLEHKGHESN
Ga0209235_100406023300026296Grasslands SoilMKGNIVAVCISGKQEAWSISEIPFYCHTEGFKDFPHYPPVKTRERVHYLATRRNMAHRRALELNPNTEHFLSIDSYYLTSVHEIRHLLEEYSDCDVECILGATNWFPDYSRVPLKLRYWDIWATPEMKGRKYDYYPRHEGLPEGWESVRGCGGFTVYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIPSYVTLNVKALRETPKEIVNRSMINRLRTTLKVRTRLGLKRPLSSYGSEKRGDSR
Ga0209235_100647053300026296Grasslands SoilMKGNIVAVCISGKQEAWAIPEIPFYCHTEGFKEFPHYPPVKTSERIQYLAIRRNMAHKRALELNPKAEHFLSIDSYYLTKINEIQHLLKEYSEYEADCILGATNWFPDYSRVPLKLRYWDTWATPEMKDRKYDYYPRHEGLPQGWEVVRGCGGFTAYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGILSYVTLNVKALRETPKEIVNRSMINRVRTTIKLRTRLRMKRSPLLMGLITVHASETKPGPERRGSLSPACSRLDDLN
Ga0209235_103462423300026296Grasslands SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNTTHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSHYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKSFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNHLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLISRASEMHRLGPEPIRENVLAAPTI
Ga0209237_100767313300026297Grasslands SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNTTHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSHYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKSFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNHLCQCPGISSYVSLNVRVLRETPK
Ga0209237_102860633300026297Grasslands SoilMKGNIVAVCISGKQEAWSISEIPFYCHTEGFKDFPHYPPVKTRERVHYLATRRNMAHRRALELNPNTEHFLSIDSYYLTSVHEIRHLLEEYSDYDVECILGATNWFPDYSRVPLKLRYWDIWATPEMKGRKYDYYPRHEGLPEGWESVRGCGGFTVYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIPSYVTLNVKALRETPKEIVNRSMINRLRTTLKVRTRLGLKRPLSSYGSEKRGDSR
Ga0209236_100738773300026298Grasslands SoilMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPKHERLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRETPSEIANRSLIKRLRTTIKVRTRLGMKRPLRSLSQGT
Ga0209238_1000321103300026301Grasslands SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRYLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWERVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRNRSLINRLRTTLRVGTRLGMRRPLISRASEMHGFGPEPIREDALDEPTIRKFNVT
Ga0209055_113851813300026309SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLRVGTRLGMRRPLTSRASEMHGFG
Ga0209761_103369633300026313Grasslands SoilMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPKHEGLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRGTPSEIANRSLIKRLRTTIKVRTRLGMKRPLRSLSQGT
Ga0209761_114841113300026313Grasslands SoilMKQNIAAVCISGREENWSVGKVPFYCHTEGYKEFAHYPPVKTWERVKYLADRRNTANKKLLKLHPDTEHFLSIDSYYLDQVHEIRLLIKEYANYNMDCVLGATNWFLDGSKYPSRVRYWDTWATPEMRRKSYHYYPRHEQIPAGWERVAGCGGFTLYPRWLWQKRGYGIPEPFPDAGNEVNYLCNYPGIPSYVTFNVKARRETPKELLDRSFIKRLRTTVGLRSRLGLRRF
Ga0209268_109942113300026314SoilMNENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPQISSYVSLNVRVFRETPKEIRDRSLINRLRTTLKVGTRLGVR
Ga0209470_100104793300026324SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPKHEGLPEGWESVRGCGGFAVYPRWVWERRGYGMPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLINRASEMHRLGPEPIREDILD
Ga0209473_108407413300026330SoilMKQNIAAVCISGREENWSIGEVPFYCHTEGYKEFAHYPPVKTWERVKYLANRRNTANKKLLKLHPDTEHFLSIDSYYLDQVHEIRLLIKEYANYNMDCVLGATNWFLDSSKYPSRVRYWDTWATPEMKRKSYHYYPRHEQIPAGWERVAGCGGFTLYPRWLWEKRGYGIPEPFPDAGNEVNYLCNYPGIPSYVTFNV
Ga0209267_100361413300026331SoilIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYDAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLRVGTRLGMRRPLTSRASEMHGFGPEPIREDALDEPTIRKFNVT
Ga0209057_111112313300026342SoilTAFGETAQMKRSIVAVCISGKQEDWSIPEIPFYCLTEGFKDFPHYPPVKTRERIQYLAVRRNMAHRRALELNPDAEHFLSIDSYYLTSMNEIRHLVKEYSDYEAECILGATNWFPDYSRVPLKVRYWDTWATPEMKDRKYDYYPKHEGLPEGWESVRGCGGFTLYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIQSYVTLNVKALRETPSEIANRSLIKRLRTTIKVRTRLGMKRPLRSLSQGT
Ga0209057_113333813300026342SoilEGYRDFPHYPPVKTRERIGYLAKRRNAAHKRALELHPNTEHFLSIDSYYLSYVREIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKEKRFDYYPRHEGLPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPGAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGISRPLISRASEMHDLGPEPIRQDILAEPTIRKFNATQLLD
Ga0209690_107929513300026524SoilMKTNITAVCISGRKEPWNVAEIPFYCHTDGFKEFPHYPPVKTRERVQYLSKRRNDANRTALKLNPATEHFLSIDSYYLNQVGEIRKLVKEYRAYNSDCVLGATNWFLDYSKFPSKVRYWDAWATPEMLGKSYNYRPRNKGMPEGWEKVRGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNYLCQYPGIPCFVTSNVKAHRDTPVELLNRSLVNRIRTTVGLRSRLGLRG
Ga0209806_1000242373300026529SoilMKESITAVCISGKEEAWSVPEIPFYCHTAGFKESPHYPPVMTRERVQYLSTRRNEANRRALELNPNTEHFLSIDSYYLNQTTEIRKLVKEYGYYDADCVLGATNWFLDHSKLPSKMRFWDTWATPEMRKKSYNYQPRNEGIPEGWERVRGCGGFTLYPRWLWEKRGYGIPEPFPKAGNEVNYLCNYPWIPTYVTFNVKAHRETPEELLNRSFARRLRTTIGLRSRLWHRRSEPRRLESR
Ga0209160_113621013300026532SoilMRLVHGKGVERMKESITAVCISGKEEAWSVPEIPFYCHTAGFKESPHYPPVMTRERVQYLSTRRNEANRRALELNPNTEHFLSIDSYYLNQTTEIRKLVKEYGYYDADCVLGATNWFLDHSKLPSKMRFWDTWATPEMRKKSYNYQPRNEGIPEGWERVRGCGGFTLYPRWLWEKRGYGIPEPFPKAGNEVNYLCNYPWIPTYVTFNVKAHRETPEELLNRSFARRLRTTIGLRSRLWHRRSEPRRLESR
Ga0209058_100166783300026536SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIQHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPKHEGLPEGWESVRGCGGFAVYPRWVWERRGYGMPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGVRRPLISRASEMHLLGPEPIREDILD
Ga0209058_101997353300026536SoilCHTEGYRDFPHYPPVKTRERIGYLAKRRNAAHKRALELHPNTEHFLSIDSYYLSYVREIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKEKRFDYYPRHEGLPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPGAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGISRPLISRASEMHDLGPEPIRQDILAEPTIRKFNATQLLD
Ga0209376_103848413300026540SoilMKENIVAVCISGKQEAWNIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAKRRNAAHKRALELHPNTEHFLSIDSYYLSYVREIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKEKRFDYYPRHEGLPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPGAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLKVGTRLGISRPLISRASEMHDLGPEPIRQDILAEPTIRKFNATQLLD
Ga0209376_125242913300026540SoilMKENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIRYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRHLLQEYSNYNAECALGATNWFSDFSRIPLKLRYWDTWATPEMKDRGFDYYPRHEGLPEGWENVRGCGGFAVYPRWVWERRGYGVPEPFPDAGNEVNYLCQCPGISSYVSLNVRVLRETP
Ga0209161_1000188473300026548SoilMKENIVAVCISGKEEAWSVPEIPFYCHAEGYKDFPHYPPVKTRERIRYLAARRNAAHKRALELHPNTEHFLSIDSYYLSYAHEIRSLLKEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMRDKRFDYYPRHEGMPEGWESVRGCGGFAVYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLRVGTRLGMRRPLTSRASEMHGFGPEPIREDALDEPTIRKFNVT
Ga0209161_1002664633300026548SoilMKENIVAVCISGKQEAWGIPEIPFYCRAEGFKEFPRYPPVKTRERIQYLAMRRNMAHKQALELNPNAEHFLSIDSYYLTGINEIWHLLKEYVDYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKKYDYYPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIVNRSMINRVRTTVKLRTRLRMKWNPLLAGTMITHDVRNETESTETKITLSPAVSPLTSLV
Ga0209161_1012664113300026548SoilMNENIVAVCISGKQEAWSIPEIPFYCHTEGYRDFPHYPPVKTRERIGYLAERRNAAHKRALELHPNTEHFLSIDSYYLSYVHEIRHLLQEYSNYNAECVLGATNWFSDYSRIPLKLRYWDTWATPEMKDKRFDYYPRHEGLPEGWENVRGCGGFAVYPRWAWERRGYGVPEPFPEAGNEVNYLCQCPGISSYVSLNVRVLRETPKEIRDRSLINRLRTTLK
Ga0209577_1013728923300026552SoilMTSSVEKMKANITAVCISGKKEPWNVAEIPFYCHTDGFKEFPHYPPVKTRERVQYLSNRRNDANRTALKLNPDTEHFLSIDSYYLNQVGEIRKLVKEYSAYNSDCVLGATNWFLDYSRFPSKIRYWDSWATPEMLGKSYNYRPRNKGMPYGWEKVRGCGGFTLYPRWLWEKRGYGVPEPFPDAGNEVNYLCQYPGIPCFVTSNVKAHRDTPVELLNRSLVNRIRTTVGLRSRLGLRG
Ga0209180_1000576163300027846Vadose Zone SoilMKGNIVAVCISGKQEAWAIPEIPFYCHTEGFKEFPHYPPVKTSERIQYLAIRRNMAHKRALELNPKAEHFLSIDRYYLTKINEIQHLLKEYSEYEADCILGATNWFPDYSRVPLKLRYWDTWATPEMKDRKYDYYPRHEGLPQGWEVVRGCGGFTAYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGILSYVTLNVKALRETPKEIVNRSMINRVRTTIKLRTRLRMKRSPLLMGLITVHASETKPGPERRGSLSPACSRLDDLN
Ga0209701_1052482113300027862Vadose Zone SoilYCHTEGFKQFPHYPPVKTRERVQYLSKRRNDANSRALELNPDTQHVLSIDSYYLNQVGEIRQLVKEYSAYNADCVLGATNWFLDYSKFPSKVRYWDSWATPEMLGKPYNYYPSNEGMPEGWEKVRGCGGFTLYPRWLWESRGYGIPEPFPDAGNEVNYLCHYPGIPSFVTFNVKAHRDTPAELLSKSLVSRIRTTVGLRSRLGLRG
Ga0209590_1001681453300027882Vadose Zone SoilMKRNITAVCISGKKETWSFDGIPFYCHTDGYKEFPNYPPAKTRERVQYLSNRRNAANRKLLELHSETEHFLSIDSYYLDQITEIRQLLREYTDFNADCVLGATNWFLDYSKYPVRVRYWDTWATPEMRGKAYDYYPRREQIPEGWERVRGCGGFTLYPRWLWEKRGYGMPEPFPGAGNEVNYLCNFPGIPSYVTFNVKAHRETPKELLDRSLVNRIRTTIGLRSRLGFRP
Ga0209590_1008645423300027882Vadose Zone SoilMKQNIIAVCISGKQEAWGIPEIPFYCHAEGFKEFPHYPAMKTRERIQYLAVRRNMAHKQALELNPNAEHFLSIDSYYLTRINEIWHLLKEYADYNGECILGATNWFADYSRVPLKLRYWDTWATPEMKDKKYDYYPRHEGLPQGWEKVRGCGGFTVYPRWVWERQGYGVPEPFPEAGNEVNYLCLCPGIPSYVTLNVKALRETPKEIVNRSMINRVRTTVKLRTRLRMKWNPLLAGTMTTLDVRNETESTETKITLSPAVSPLTFLV
Ga0209590_1018779723300027882Vadose Zone SoilMKQNVAAVCISGKRENWNLGGVPFYCHIEGYKAFPHYPPVKTRERVQYLSSRRNTANKRLLELHPDTEHFLTIDSYYLHQVNEIRQLIREYTNHIADCVLGATNWFLDRSKYPSRVRYWDTWATPEMKGKAYDYYPRHKQIPEGWEQVRGCGGFALYPRWLWEKRGYGVPEPFPDAGNEVNYLCNFPGILAYVTFNVKAHRETPSELLNRSFVKRLRTTVGLRSRLGLRR
Ga0209590_1020834813300027882Vadose Zone SoilMKQNIVAVCISGKQEPWSISEIPFYCHTEGFKEFPHYPPVKTRERIQYLAMRRNTANKQALELSPKAEHFLSIDSYYLAQVDEIRHLLKEYSDFDAECILGATNWFPDHSRVPLKLRYWDTWATPEMKDRKYDYYPRHEGLPEGWENVRGCGGFTLYPRWVWERRGYGVPEPFPDAGNEVNYLCQCPGIP
Ga0137415_1014556613300028536Vadose Zone SoilMTRSNGKMKTKIMAICISGKEETWNVAEIPFYCHTEGFKEFPHYPPVKTRERVQYLSMRRNNANRRALELNPDTEHFLSIDSYYLDQVSEIRRLVKEYSAYNADCVLGATNWFLDYSKFPSNVRYWDTWATPEMLGKSYNYYPRNKGMPEGWEKVRGCGGFALYPRWLWEKRGYGVPEPFPDAGNEVNYLCQYPGIPSFVTFNVKAHRDTPVELLSRSLVSRIRTTVGLRSRLGLRCLEHEPLVSGHSRPTAHLDRFGNGFLLDFRRLPSSLLITFIHLCSSYTFISIV
Ga0137415_1029257223300028536Vadose Zone SoilFYCHTEGFKEFPHYPPVKTSERIQYLAIRRNMAHKRALELNPKAEHFLSIDSYYLTKINEIQHLLKEYSEYEADCILGATNWFPDYSRVPLKLRYWDTWATPEMKDRKYDYYPRHEGLPQGWEVVRGCGGFTAYPRWVWERRGYGVPEPFPEAGNEVNYLCQCPGILSYVTLNVKALRETPKEIVNRSMINRVRTTIKLRTRLRMKRSPLLMGLITVHASETKPGLERRGSLSPACSRLDDLN
Ga0137415_1078437513300028536Vadose Zone SoilMKGNIVAVCISGKQEAWSISEIPFYCHTEGFKDFPHYPPVKTRERVHYLATRRNMAHRRALELNPNTEHFLSIDSYYLTSVHEIRHLLEEYSDYDVECILGATNWFPDYSRVPLKLRYWDIWATPEMKGRKYDYYPRHEGLPEGWESVRGCGGFTVYPRWVWEQRGYGVPEPFPEAGNEVNYLCQCPGIPSYVTLNVKALRETPKEIVNRSM


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.