NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F105816

Metagenome / Metatranscriptome Family F105816

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105816
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 242 residues
Representative Sequence MKRTVYIVWGVGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGGADALRMPTLVNAEAIYTSGAKPV
Number of Associated Samples 92
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 20.00 %
% of genes near scaffold ends (potentially truncated) 5.00 %
% of genes from short scaffolds (< 2000 bps) 4.00 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (97.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(18.000 % of family members)
Environment Ontology (ENVO) Unclassified
(43.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(37.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: Yes Secondary Structure distribution: α-helix: 34.68%    β-sheet: 22.18%    Coil/Unstructured: 43.15%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF02566OsmC 11.00
PF08442ATP-grasp_2 1.00
PF03129HGTP_anticodon 1.00
PF00587tRNA-synt_2b 1.00
PF09994DUF2235 1.00
PF02868Peptidase_M4_C 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG1764Organic hydroperoxide reductase OsmC/OhrADefense mechanisms [V] 11.00
COG1765Uncharacterized OsmC-related proteinGeneral function prediction only [R] 11.00
COG0458Carbamoylphosphate synthase large subunitAmino acid transport and metabolism [E] 2.00
COG0026Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)Nucleotide transport and metabolism [F] 1.00
COG0045Succinyl-CoA synthetase, beta subunitEnergy production and conversion [C] 1.00
COG0124Histidyl-tRNA synthetaseTranslation, ribosomal structure and biogenesis [J] 1.00
COG0151Phosphoribosylamine-glycine ligaseNucleotide transport and metabolism [F] 1.00
COG0423Glycyl-tRNA synthetase, class IITranslation, ribosomal structure and biogenesis [J] 1.00
COG0441Threonyl-tRNA synthetaseTranslation, ribosomal structure and biogenesis [J] 1.00
COG0442Prolyl-tRNA synthetaseTranslation, ribosomal structure and biogenesis [J] 1.00
COG1042Acyl-CoA synthetase (NDP forming)Energy production and conversion [C] 1.00
COG3227Zn-dependent metalloprotease (Neutral protease B)Posttranslational modification, protein turnover, chaperones [O] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A97.00 %
All OrganismsrootAll Organisms3.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2170459005|F1BAP7Q02GP3WGNot Available536Open in IMG/M
3300005179|Ga0066684_10923475Not Available568Open in IMG/M
3300021078|Ga0210381_10088306All Organisms → cellular organisms → Bacteria991Open in IMG/M
3300031715|Ga0307476_10624630All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium799Open in IMG/M
3300031947|Ga0310909_10015104All Organisms → cellular organisms → Bacteria → Proteobacteria5419Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil18.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil16.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil14.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil6.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil4.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.00%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment2.00%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment2.00%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere2.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil2.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil2.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere2.00%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.00%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.00%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil1.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere1.00%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.00%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere1.00%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil1.00%
SoilEnvironmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil1.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere1.00%
Switchgrass RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere1.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.00%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere1.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere1.00%
AgaveHost-Associated → Plants → Phylloplane → Unclassified → Unclassified → Agave1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
2124908045Soil microbial communities from Great Prairies - Kansas assembly 1 01_01_2011EnvironmentalOpen in IMG/M
2170459005Grass soil microbial communities from Rothamsted Park, UK - July 2009 direct MP BIO1O1 lysis 0-21cmEnvironmentalOpen in IMG/M
2228664021Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
2228664022Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000033Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002128Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5EnvironmentalOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005290Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Rhizosphere Soil Replicate 1: eDNA_1Host-AssociatedOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005365Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3H metaGEnvironmentalOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005548Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaGHost-AssociatedOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005564Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaGHost-AssociatedOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006172Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010395Agave microbial communities from Guanajuato, Mexico - Or.Ma.eHost-AssociatedOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011003Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t9i015EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012985Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MGEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300014325Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaGHost-AssociatedOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b1EnvironmentalOpen in IMG/M
3300019362Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2)EnvironmentalOpen in IMG/M
3300019867Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m1EnvironmentalOpen in IMG/M
3300019872Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a1EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2EnvironmentalOpen in IMG/M
3300019997Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3m2EnvironmentalOpen in IMG/M
3300020001Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a2EnvironmentalOpen in IMG/M
3300020005Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3m2EnvironmentalOpen in IMG/M
3300020010Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1s2EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300020062Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a1EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021510Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_coexEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300030844Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA11 EcM (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030916Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA12 EcM (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030969Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA12 Emin (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_034877302088090014SoilVKRSVYIVWGLGLAATVFAWAFILYKEKAVSSLQTNTPPNRVRSYGIGRYVTDIRVGWCNNDGHIIESNPFTDSLTLTVFLQNFDGWLLAKARGDQRLLPEDVKPEDLKNYLSLVELRKSSPLSAEQDATLQTLQIKINHWMLQERSNLRLMIAGQVFETIPPFDATAPPTYGSGEFEGETYNRVVFHLAAPSDPKEL
KansclcFeb2_096640102124908045SoilYIVWGVGLAVILVVWGFIWLKESRRALDQVNTPPNRVRSYGTQKYVTDIRVGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKSYLSLVDLRKSSPLSADQEMALQPLQVKVNHWMLQERSSLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPM
E41_044753102170459005Grass SoilAKHLSSDLVNTPPNRVRSYGTGRYVTDIRIGWCNNAGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTSYLSLVDLKKSSPLSAEQNAAFQALQTKVNHWILHERSNLRLMIAGQVFETIPPFDVSASPTYGSGEFEGETYNRIVFHLAAPKDPIELRKMARH
ICCgaii200_060538522228664021SoilMKRTVCLVWGVGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAAD
INPgaii200_115973812228664022SoilVKRSVYIVWGLGLAATVFAWAFILYKEKAVSSLQTNTPPNRVRSYGIGRYVTDIRVGWCNNDGHIIESNPFTDSLTLTVFLQNFDGWLLAKARGDQRLLPEDVKPEDLKNYLSLVELRKSSPLSAEQDATLQTLQIKINHWMLQERSNLRLMIAGQVFETIPPFDATAPPTYGSGEFEGETYNRVVFHLAAPSDPKELEKWRAXIRAAGPTCDAQVSLARPIAGAADA
ICChiseqgaiiDRAFT_067642213300000033SoilTVYSVWGVGLAVILVVWGLIWMKETRRSSDQLKIPPNRVRSYGTEKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYSSLVDLRKSSRLSAEQEMALQPLQVKVNHWMLXERSNLRXMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVFHLAAPSDAKELEKWRAIIRAVGPTCDAQISLARPIVGTADALRMPTLVNAEATYSSVAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAA
ICChiseqgaiiDRAFT_067821413300000033SoilMKRTVYLVWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIEXNXFTDSLTVTVFLQNFDGWXLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAMYSSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQSDQDAPWSLSRV
JGI1027J12803_10625590113300000955SoilMKRTVYSVWGVGLAVILVVWGLIWMKETRRSSDQLKIPPNRVRSYGTEKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYSSLVDLRKSSRLSAEQEMALQPLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVFHLAAPSDAKELEKWRAIIRAVGPTCDAQISLARPIVGTADALRMPTL
JGI24036J26619_1001781813300002128Corn, Switchgrass And Miscanthus RhizosphereMKRTVCLVWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNVFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAMYSSGAKPVYRSLPMVP
Ga0062592_10017315523300004480SoilMGRYVTDIRVGWCNNDGHIVESNPFTDSLTLTVFLQNFDGWLLAHARGEERLLPEDLKSEELKNYLSLVEKRKSSSVSAEEQATLQTLQIKINHWMLHERSNLRLMIAGQVFETIPPFDATAPPTYGSGDFEGETYNRVVFHLEAPSDPKELEKWRAIIRTAGPTCDAQVSLARPIAGTSDALRMPTLVNADAIYTAGAKPVYRSLPLVPPFRQGAAITAVIFTICAVFAAALGTSALRDTRGAGLVPDRDAPWSLSRVVFAWWLTICVGCFA
Ga0066677_1005160113300005171SoilMKRTVYIVWGTGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDPRLLPEDLKPEELKNYLSLVDLRKSSRLSPEQEMALQSMQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALR
Ga0066679_1051767213300005176SoilGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDPRLLPEDLKPEELKNYLSLVDLRKSSRLSPEQEMALQSMQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFT
Ga0066684_1092347513300005179SoilWGTGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDPRLLPEDLKPEELKNYLSLVDLRKSSRLSPEQEMALQSMQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKE
Ga0066678_1004280733300005181SoilMKRTVYIVWGVGLAVILVVWGFIWMKETRRSSDQLKTPPNRVRSYGTEKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYSSLVDLRKSSRLSAEQEMALQPLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVFHLAAPVDPKELEKWRAIIRAVGPTCDAQISLARPIVGTADALRMPTLVNAEATYSSAAKPVYRSLPMVPPMRQGAAIT
Ga0066675_1042906113300005187SoilMKRTVYIVWGAGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDPRLLPEDLKPEELKNYLSLVDLRKSSRLSPEQEMALQSMQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTC
Ga0065712_1020733313300005290Miscanthus RhizosphereMKRTVYIVWGVGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTITVFLQNFDGWLLSKARVDERLLPEDLKPEELKNYLALAASRKSSSLSPDQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAII
Ga0065705_1036308913300005294Switchgrass RhizosphereLVVPYEAYRLLVWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNVFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIV
Ga0065707_1050037613300005295Switchgrass RhizosphereMKRTVCLVWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNVFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAMYSS
Ga0066388_10439783313300005332Tropical Forest SoilIDTKDATLHALGQPRPYVMKRTVYIVWGVGLAVTLMIWGFIWMKESRRTADQLKASPNRVRSYGTEKYVTDIRVGWCNNDGRIIESNPFTDSLTITVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLALVDLRKTSPLSAEQQAALQSLQVKVNHWMLQERLNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDPKELEKWRAIIRAVGPTCDAQISLAR
Ga0070688_10065983213300005365Switchgrass RhizosphereMKRTVYIVWGAGLAVMLVVWGFIWLKESRRALDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKASSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTS
Ga0070711_10112086713300005439Corn, Switchgrass And Miscanthus RhizosphereVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKLEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVF
Ga0070697_10089104113300005536Corn, Switchgrass And Miscanthus RhizosphereGWCNNDGHIIESNLFTDSLTITVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLVELRKSSRLTPEQDMTLQPLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEGIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQPDQDAPWSLSRVVFAWWLTICVGCF
Ga0070665_10119687313300005548Switchgrass RhizosphereMKRTVCLVWGVGLAVILIVWGFIWLKESRRALDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKLEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVN
Ga0070704_10044291913300005549Corn, Switchgrass And Miscanthus RhizosphereMKRTVYIVWGAGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKLEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGGADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGA
Ga0066699_1027124013300005561SoilMKRTVYIVWGTGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYVSLADLRKSSSLSPEQEMALQSMQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQPDQDAPWSLSRVVFAWW
Ga0070664_10084453513300005564Corn RhizosphereMKRTVCLVWGVGLAVILIVWGFIWLKESRRALDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKLEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVG
Ga0066705_1045814213300005569SoilIIESNLFTDSLTVTVFLQNFDGWLLSKARVDPRLLPEDLKPEELKNYLSLVDLRKSSRLSPEQEMALQSMQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQPDQDAPWSLSRVVFAWWLTICVGCFAYLWALMGEHR
Ga0066903_10176606513300005764Tropical Forest SoilMKRTVYIVWGVGVAVILVVWGFIWLKESRRSSGQANTPPNRVRSHGTQQYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLGKAGVDQRLLPEDLKPEELKNYLTLVDLKKSSRLSAEQDMALQPLQVKINHWMLHERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVFHLAAPTDPKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTVCAVFAAALGTSALRDARRPGLPPDQNAPWSLSRVVFAWWLTICV
Ga0070717_1090922713300006028Corn, Switchgrass And Miscanthus RhizosphereVRSYGTEKYVTDIRIGWCNNDGHIIESNLFTDSLTITVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLVELRKSSRLTPEQDMTLQPLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEGIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQPDQDAPWSLSRVVF
Ga0075018_1030350013300006172WatershedsVKRYVYTVWGVGLLVVLIVWWVIWSNAKHLSSDLLNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTNYLSLIELKKSSPLSAEQNAAFQTLQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTVDAQISLARPIT
Ga0066659_1025558213300006797SoilMKETRRSSDQLKTPPNRVRSYGTEKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLVDLRKSSRLSPEQEMALQSMQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQPDQDAPWSLSRVVFAWW
Ga0066710_10274423513300009012Grasslands SoilGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDPQLLPEDLKPEELKNYLSLVDLRKSSRLSPEQEMALQSMQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSG
Ga0099829_1127775813300009038Vadose Zone SoilYIVWGVGLAVILVVWVLIWSKESRRSSDQVNTPPNRVRSHGTEKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELRDYLSLAELRKSSRLSAEQDMTLQPLQVKINHWLLHERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVFHLAAPTDAKELEKWRAIIR
Ga0066709_10138305813300009137Grasslands SoilMKRTVYIVWGAGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDPRLLPEDLKPEELKNYLSLVDLRKSSRLSPEQEMALQSMQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDTKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEATYTSGAKPVYRSLPMVPPMRQG
Ga0105249_1010328443300009553Switchgrass RhizosphereMKRTVCLVWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTITVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAMYSSGA
Ga0126384_1100938413300010046Tropical Forest SoilIVWGVGVAVILVVWGFIWLKESRRSSGQANTPPNRVRSHGTQQYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLGKAGVDQRLLPEDLKPEELKNYLTLVDLKKSSRLSAEQDMALQPLQVKINHWMLHERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVFHLAAPTDPKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPM
Ga0126376_1019702933300010359Tropical Forest SoilMKRTVYIVWGVGLAVILVVWGFIWMKESRRSSDQLKSPPNRVRSYGQEKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKTYSSLVDLRKSSRLSAEQEMALQPLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGDFEGETYNRVVFHLAAPADPKELEKWRAIIRAVGPTCDAQISLARPIVGTADALRMPTLVNAEATYSAAVKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQPDQ
Ga0126372_1117810613300010360Tropical Forest SoilKSPPNRVRSYGQEKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKTYSSLVDLRKSSRLSAEQEMALQPLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGDFEGETYNRVVFHLAAPADPKELEKWRAIIRAVGSTCDAQISLARPIVGTADALRMPTLVNAEATYSAAVKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQPDQN
Ga0126379_1140652313300010366Tropical Forest SoilMKRTVYIVWGVGLAVTLMIWGFIWMKESRRTADQLKASPNRVRSYGTEKYVTDIRVGWCNNDGRIIESNPFTDSLTITVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLALVDLRKTSPLSAEQQAALQSLQVKVNHWMLQERLNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDPKELEKWRAIIRAVGPTCDAQISLARPIPGTADALRMPTLVNAEATYSSAVKPVYRSLPMVPPMRQGAA
Ga0126379_1188418613300010366Tropical Forest SoilMKRIVYIVWGVGLAVILVVWGFIWFNESRRASDQLKSPPNRVRSYGTEKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKTYSSLVDLRKSSRLSAEQEMALQPLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGDFEGETYNRVVFHLAAPADPKELEKWRAIIRAVGPTCDAQISLARPI
Ga0134125_1012231513300010371Terrestrial SoilMKRTVYIVWGAGLAVMLVVWGFIWLKESRRALDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKLEELKKYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQPDQDAPWSLSRVVFA*
Ga0126381_10053203033300010376Tropical Forest SoilMKRTVYIVWGVGLAVILVVWGFIWMKESRRSSDQLKSPPNRVRSYGQEKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKTYSSLVDLRKSSRLSAEQEMALQPLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGDFEGETYNRVVFHLAAPADPKELEKWRAIIRAVGPTCDAQISLARPIVGTADALRMPTLVNAEATYSAAVKPVYRSLPMVPPMRQGAAITAVVFTICAVFAA
Ga0058701_1025392623300010395AgaveMKRTVYIVWGVGLGVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRVGWCNNDGQIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLTDLRKSANLSPEQETVLQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAK
Ga0134124_1153935913300010397Terrestrial SoilGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKLEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFA
Ga0134121_1028108933300010401Terrestrial SoilMKRTVYIVWGAGLAVMLVVWGFIWLKESRRALDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKLEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMR
Ga0134123_1116844013300010403Terrestrial SoilSRRALDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKLEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGL
Ga0138514_10005765513300011003SoilTLFCSRELGNSLMKRSVYIVWSLGLAATVLAWTFILYKEKAGSSRQANTPPNRARSYGIGRYVTDIRVGWCNNDGHIIESNPFTDSLTLTVFLQNFDGWLLAQARGDQRLLPEDLKSEELQNYLSLVELRKSSRLSAEQAASLQTLQIKINHWMLHERSNLRLMIAGQVFETIPPFDATAPPTYGSGDFEGETYNRVVFHLEAPSDPKELEKWRAIIRAAGPTCDAQVSLARPIAGASDALRMPTLVNAEAISTAGAKPVYRSL
Ga0137383_1104790213300012199Vadose Zone SoilWVIWSNAKHLSSDLLNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNPFTDSLTITVFLQNFDGWLLAHSRVDPRLLPEDLKPEELTNYLALVELRKSSPLSAEQNAAFQTLQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTVDAQI
Ga0137363_1013938213300012202Vadose Zone SoilMIVICWFLWLTGEDISLKDSNVSPNRVRSYGTLKYVTDIRLGWCNNDGQIIEHNFFTDSLSLTVFLQNFDGWLLTQAHLNQRLLPDDLKPDDLANYLSLVDVKKSSHLTAEQDKTLQSLQGRVNHWILQERSNLRLMIGGQVFETIPPFDATAPPTYGTGEFEGETYNRVVFYLAAPKDPKELEKWRAIIRAAG
Ga0137363_1073628513300012202Vadose Zone SoilDQLKTPPNRVRSYGTEKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKAPVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGPGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQPDQDAPWSLSRVVFAW
Ga0137363_1084453513300012202Vadose Zone SoilVSDIRIGWCNNEGKIIESNRLTDALTLTVFLQNFDGWLLDQGRTEPRILPKDLSQEELRNYVALVELKKTSPLSQDQQVALHSLQTKVNHWILQERSNLRLTIAGQVFESIPPFDATAPPTYGSGEFDGETYNRVVFYLAAPKDPKELEKWRAIIRAAGTTCEAQISVALPITGATDALRMPTLVNAEAIYHLGASPVYRSLPMVPRIRQGAAAVAVIFTITVVLATALGTSVLRNAPPANLAPNQKASWSISRVVLA
Ga0137362_1127511113300012205Vadose Zone SoilCWFLRLTGEDISLKDSNVPPNRVRSYGTLKYVTDIRLGWCNNDGQIIEHNFFTDSLSLTVFLQNFDGWLLTQAHLNQRLLPDDLKPDDLTNYLSLVDTKKSSHLTAEQDKTLQSLQGRVNHWILQERSHLRLMIGGQVFETIPPFDATAPPTYGTGEFEGETYNRVVFYLAAPKDPKELEKWRAIIRAAGPIFEAQISVARPVVGG
Ga0137387_1052674113300012349Vadose Zone SoilVKRYVYTVWGVGLLVVLIVWWVIWSNAKHLSSDLLNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTNYLSLIELKKSSPLSAEQNAAFQTLQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTVDAQISLAR
Ga0137360_1065497113300012361Vadose Zone SoilMIVICWFLRLTGEDISLKDSNVPPNRVRSYGTLKYVTDIRLGWCNNDGQIIEHNFFTDSLSLTVFLQNFDGWLLTQARLNQRLLPDDLKPDDLTNYLSLVDVKKSSHLTAEQDKTLQSLQGRVNHWILQERSNLRLMIGGQVFETIPPFDATAPPTYGTGEFEGETYNRVVFYLAAPKDPKELEKWRAIIRAAGPIFEAQISVARPVVGGADAIRMPTLVNA
Ga0137360_1101159313300012361Vadose Zone SoilMKRTVYIVWGVGLAVILVVWGFIWSKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYSSLVDLRKSSRLSAEQEMALQPLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDPKELEK
Ga0137361_1082027013300012362Vadose Zone SoilTKLMKIAPARIVSVIGVVLILVICGVIYAREKDLYVSEGPPNRAKSYGTGKFVSDIRIGWCNNEGKIIESNRLTDALTLTVFLQNFDGWLLDQGRTEPRILPRDLSQEELRNYVALVEPKKTSPLSQDQQVALHSLQSKVNHWILQERSNLRLTIAGQVFESIPPFDATAPPTYGSGEFDGETYNRVVFYLAAPKDPKELEKWRAIIRAAGTTCEAQISVALPITGATDALRMPTLVNAEAIYSLGASPVYRSLPMVPRIRQGAAAVAVIFTITLVLATAL
Ga0137361_1117380713300012362Vadose Zone SoilVWGVGLAVILVVWGFIWSKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLALADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDPKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVN
Ga0137373_1022476613300012532Vadose Zone SoilMKITPARIISFIGIALILLICAVIYFREKDLYLSTGPPNRAKSYGTGKFVSDIRIGWCNNDGKIIEGNRFTDALALTVFLQNFDGWLLDQGRTEPRILPKDLSQEELRNYLALVELKKASSLSQDQQAAFQSLQIKVNHWILQERSNLRLMIAGHVFESIPPFDATAPPTYGSGEFAGETYNRVVFYLAPPKDPKELENWREIIRAAGATCEAQISLALPITGATDALRMPTLVNADAIYSLGASPVYRPLPMVSRIRHGAAA
Ga0137398_1019042923300012683Vadose Zone SoilVKRYVYTVWGVGLLVVLIVWWVIWSNAKHLSSDLLNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTNYLSLIELKKSSPLSAEQNAAFQTLQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTVDAQISLARPITGATDALRMPTLVNADAIYSAGAKPV
Ga0137413_1063945213300012924Vadose Zone SoilPSRFYAAANFDNSFVKRYVYTVWGVGLLVVLIVWWVIWSNAKHLSSDLLNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTNYLALVELKKSSPLSAEQKGALQTLQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTVDAQISLARPITGATDALRMPTLVNADAIYSAGAKPVYRSL
Ga0137413_1078218213300012924Vadose Zone SoilGLAVILVVWGFIWMKETRRSSDQLKTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTITVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSRLSPEQEMAFQSLQVKLNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIIGAADALRMPTLVNSEAIYTSGAKPVFRSLPM
Ga0164299_1037255513300012958SoilVNRYVYTVWGVGLLVVLIVWWVIWSNAKHLSSDLVNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNLFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTSYLSLVDLKKSSPLSAEQNAALQALQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTVDAQISLARPIPGATDALRMPTLVNADAIYSAGAKPVYRSLPLVPPF
Ga0164302_1042244613300012961SoilVNRYVYTVWGVGLLVVLIVWWVIWSNAKHLSSDLVNTLPNRVRSYGSGRYVTDIRIGWCNNEGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTSYLSLVDLKKSSPLPAEQNAALQALQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTVDAQISLARPIPGATDALRMPTLVNADAIYSAGAKPVYRSLPLVPPFRQGAATTAVVFTIC
Ga0164308_1180780813300012985SoilLIVWWVIWSNAKHLSSDLVNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNAFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKSEELTSYLSLVDLKKSSPLSAEQNAALQALQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAII
Ga0157374_1141936613300013296Miscanthus RhizosphereMKRTVYIVWGAGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIV
Ga0157375_1163355113300013308Miscanthus RhizosphereMKRTVCLVWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNVFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAK
Ga0163163_1069691923300014325Switchgrass RhizosphereMKRTVYIVWGAGLAVMLAVWGFIWLKESRRALDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKLEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFA
Ga0137412_1084046413300015242Vadose Zone SoilSDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTITVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSRLSPEQEMAFQSLQVKLNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIIGAADALRMPTLVNSEAIYTSGAKPVFRSLP
Ga0132256_10034254513300015372Arabidopsis RhizosphereMKRTVYIVWGAGLAVMLVVWGFIWLKESRRALDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLGDLRKSASLSPEQEMVLQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGGADALRMPT
Ga0132257_10199886713300015373Arabidopsis RhizosphereMKRTVYIVWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLGDLRKSASLSPEQEMVLQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGATDALRM
Ga0184605_1040976513300018027Groundwater SedimentYIVWGVGLAVILVVWGFIWSKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSRLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAII
Ga0184625_1034207313300018081Groundwater SedimentMKRTVYIVWGVGLAVILVVWGCIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLTDLRKSSRLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGGADALRMPTLVNAEAIYTSGAKPVFRSLP
Ga0173479_1037142013300019362SoilWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNVFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLEPEELNNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSAEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTL
Ga0193704_106380713300019867SoilILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYVSLTDLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAMYSSGAKPVY
Ga0193754_101479213300019872SoilIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYVSLTDLRKSASLSPEQEMVLQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGGADALRMPTLVNAEAIYTSGAKPVFRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQPNQDAPWSLSRVVFAWWLTICVGCFAYLWALMGEHRN
Ga0193707_100219413300019881SoilMKRTVYIVWGVGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAIIAVVFTI
Ga0193725_108677913300019883SoilMKRTVYIVWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYVSLTDLRKSASLSPEQEMVLQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGGADAL
Ga0193711_103093413300019997SoilLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKLSSLSPEQEMSLQSLQVKVNHWMLQERLNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGGADALRMPTL
Ga0193731_1000704133300020001SoilMKRTVYIVWGVGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPM
Ga0193697_106052913300020005SoilMKRTVYIVWGVGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPDELKNYLSLADLRKLSSLSPEQEMSLQSLQVKVNHWMLQERLNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAII
Ga0193749_101238413300020010SoilMKRTVYIVWGVGLAVILIVWGFIWSKESRRSSDQVNTPPNRVRSYGTEKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLRPEELKNYLSLVDLRKSSRLSPEQEMALQPLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGA
Ga0193733_101678413300020022SoilMKRTVYIVWGVGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRSLPMVPPMRQGAAIIAVVF
Ga0193733_101839813300020022SoilMKRTVYIVWGVGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYVSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPSDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVYRGLPMVPPMRQGAAIIAVVFTICAVFAAALGTSALRDARGAGLQSDQDAPW
Ga0193724_100927113300020062SoilMKRTVYIVWGVGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYVSLTDLRKSASLSPEQEMVLQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGGADALRMPTLVNAEAIYTSGAKPVFRSLPMVPPMRQGAAITAVVFTICAVF
Ga0210381_1008830613300021078Groundwater SedimentMKRTVYIVWGVGLAIILVVWGFIWLKESRRASDQVNTAPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTD
Ga0222621_104759913300021510Groundwater SedimentMKRTVYIVWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGRIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYVSLTDLRKSSSLSPEQEMALQSLQVKVNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNA
Ga0222622_1038989213300022756Groundwater SedimentMKRTVYLVWGVGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYVSLTDLRKSSSLSPEQEMALQSLQVKVNHWMLQERLNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAMYSSGAKPVYRSLPMVPPMRQGAAITAVVFTICAVFAAALGTSALRDARGAGLQSDQDAPWSLSRVVFA
Ga0179589_1032397713300024288Vadose Zone SoilPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTITVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSRLSPEQEMAFQSLQVKLNHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTLVNAEAIYTSGAKPVFRSLPMVPPMRQGAAITAVV
Ga0207663_1043766013300025916Corn, Switchgrass And Miscanthus RhizosphereLAVRPVTLYGAANFDNSFVNRYAYTVWGVGLLVVLIVWWVIWSNAKHLSSDLVNTLPNRVRSYGSGRYVTDIRIGWCNNEGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTSYLSLVDLKKSSPLSAEQNAAFQALQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTCDAQISLARPIPGATDALRMPTLVNADAIYSAGAKPVYRSLPLVPPFR
Ga0207664_1065128213300025929Agricultural SoilMKRTVYIVWGVGLAVILVVWGLIWSKESRRASDQVNTPPNRVRSYGTEKYVTDIRIGWCNNDGHIIESNLFTDSLTITVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLVELRKSSRLTPEQDMTLQPLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVG
Ga0209686_105924213300026315SoilMKRTVYIVWGTGLAVILIVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDPRLLPEDLKPEELKNYLSLVDLRKSSRLSPEQEMALQSMQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGAADALRMPTL
Ga0209488_1086543813300027903Vadose Zone SoilNAKHLSSDLLNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRADPRLLPEDLKPEELTNYLALVELKKSSPLSAEQNGALQTLQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTVDAQISLARPITGATDALRMPTLVN
Ga0307308_1002458653300028884SoilMKRTVYIVWGVGLAVILVVWGFIWLKESRRASDQVNTPPNRVRSYGTQKYVTDIRIGWCNNDGHIIESNLFTDSLTVTVFLQNFDGWLLSKARVDQRLLPEDLKPEELKNYLSLADLRKSSSLSPEQEMALQSLQVKINHWMLQERSNLRLMIAGQVFETIPPFDATSPPTYGSGEFEGETYNRVVYHLAAPTDAKELEKWRAIIRAVGPTCDAQISLARPIVGGADALRMPTLVNAEAIYTSGAKPV
Ga0075377_1154809113300030844SoilVNRYVYTVWGVGLLVVLIVWWVIWSNAKHLSSDLVNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTSYLSLVDLKKSSPLSAEQNAALQALQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAII
Ga0075386_1210338913300030916SoilRYVTDIRIGWCNNEGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRADPRLLPEDLKPEELTSYLSLLDLKKSSPLSAEQNAAFQVLQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTVDAQISLARPIPGATDALRMPTLVNADAIYSAGAKPVYRSLPLVPPFRQGAAIMAVVFTICAVLAAALGTSALRD
Ga0075394_1094899713300030969SoilDLVNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNSFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTSYLSLVDLKKSSPLSAEQNAALQVLQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTVDAQISLARP
Ga0170824_12079801313300031231Forest SoilEFLTVGRQPITLFCSRELGNSLMKRSVYIVWSLGLAATVLAWTFILYKEKAGSSRQANTPPNRARSYGIGRYVTDIRVGWCNNDGHIIESNPFTDSLTVTVFLQNFDGWLLAQARGDQRLLPEDLKSEELQNYLSLVELKKSSRLSAEQAASLQTLQIKINHWMLHERSNLRLMIAGQVFETIPPFDATAPPTYGSGDFEGETYNRVVFHLEAPSDPKELEKWRAIIR
Ga0307476_1062463023300031715Hardwood Forest SoilMKRPIVIIWCIGILSMIVICWVLWRTGEDMSLKDTNVPPNRVRSYGTLKYVTDIRLGWCNNDGQIIEHNFFTDSLSLTVFLQNFDGWLLTQAHLNQRLLPDDLKQDDLTNYLSLVDIKKAAHLSPDQDKTLQSLQGKVNHWILQERSNLRLMIAGQVFETIPPFDATAPPTYGTGEFEGETYNRVV
Ga0310909_1001510463300031947SoilMVALIAVGLIWWRHRDLYLAGDKGPPNRARSYGSGRYVSDIRLGWCNNDGKVIAHNFFTDSLSVTVFLQNFDGWLLAQGHNEPRLLPSDLNSEELKRYDSLRDLQHASTKLAQEQELQLQALQSKINHWIIQERSNLRLMIAGHVFATIPPFDATAPPTYGTGEFDGETYNRVVFHLEAPKDPDELETWRSIIRAVGTDCGAQISVARPIAGRTDALRMPSLVNADAIYGV
Ga0307471_10026598143300032180Hardwood Forest SoilVKRYVYTVWGVGLLVVLIVWWVIWSNAKHLSSDLLNTLPNRVRSYGTGRYVTDIRIGWCNNEGHIIDTNPFTDSLTITIFLQNFDGWLLAHSRVDPRLLPEDLKPEELTNYLSLIELKKSSPLSAGQNAAFQTLQTKVNHWILHERSNLRLMIAGQVFETIPPFDATASPTYGSGEFEGETYNRIVFHLAAPKDPIELEKWRAIIRAAGPTCDAQISLARPITGATDALRMPTLVNADAI


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.