NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F104840

Metagenome / Metatranscriptome Family F104840

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104840
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 124 residues
Representative Sequence MCGQPIWGIAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Number of Associated Samples 90
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 1.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(25.000 % of family members)
Environment Ontology (ENVO) Unclassified
(56.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(67.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 69.92%    β-sheet: 0.00%    Coil/Unstructured: 30.08%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF01144CoA_trans 39.00
PF00378ECH_1 27.00
PF01799Fer2_2 5.00
PF00171Aldedh 2.00
PF05138PaaA_PaaC 2.00
PF02738MoCoBD_1 2.00
PF00581Rhodanese 1.00
PF00111Fer2 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG1788Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunitLipid transport and metabolism [I] 39.00
COG2057Acyl-CoA:acetate/3-ketoacid CoA transferase, beta subunitLipid transport and metabolism [I] 39.00
COG4670Acyl CoA:acetate/3-ketoacid CoA transferaseLipid transport and metabolism [I] 39.00
COG0014Gamma-glutamyl phosphate reductaseAmino acid transport and metabolism [E] 2.00
COG1012Acyl-CoA reductase or other NAD-dependent aldehyde dehydrogenaseLipid transport and metabolism [I] 2.00
COG33961,2-phenylacetyl-CoA epoxidase, catalytic subunitSecondary metabolites biosynthesis, transport and catabolism [Q] 2.00
COG4230Delta 1-pyrroline-5-carboxylate dehydrogenaseAmino acid transport and metabolism [E] 2.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.00 %
All OrganismsrootAll Organisms1.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300006028|Ga0070717_10138301All Organisms → cellular organisms → Bacteria2099Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil25.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil16.00%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil15.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil15.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere10.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil6.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil5.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil2.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil2.00%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil2.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300004633Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBioEnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010108Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012384Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012389Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300021402Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-OEnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026305Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 (SPAdes)EnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026314Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026343Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes)EnvironmentalOpen in IMG/M
3300026527Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1005811623300002558Grasslands SoilVETQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
JGI25385J37094_1013721723300002558Grasslands SoilVETQITTKPQRIKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLGLVIWRMAPVGAVREVRLVSAWLWVIAALVSVV
JGI25384J37096_1006284023300002561Grasslands SoilMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
JGI25382J43887_1006361023300002908Grasslands SoilVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
JGI25390J43892_1017338913300002911Grasslands SoilVETQITTKPQRIKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLGLVIWRMAPVGAVREVRLVSAWLWVI
JGI25389J43894_106753023300002916Grasslands SoilWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0066395_1073209713300004633Tropical Forest SoilSYLFYLSWNHVRREEFDWPHDGWSILTYGIWVVLMSGLLNETRCWRERIFFGLVLTNFVLGFALVIWGAAPNEVVREVRIVSVVLWSLAAAASVIVVFSGGTPPAAAKQNGNV*
Ga0066673_1030640323300005175SoilKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSRRSTQ*
Ga0066688_1066683613300005178SoilMKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAVLVSVVITVSPGGSTRSIQ*
Ga0066684_1000153323300005179SoilVETQITSKRPRIRLAHLMCGQPIWGIAGVLSCSSLAYLSWGHVRREEFDWPHDSWSIVTYAVWILLMRGLLSETRCWRERIFFALVLTNFVLGFVLAIWNTVPNSAVREVRIISAALWALAAAVSLIVTFSSGSSTTATKKAGNV*
Ga0066671_1008186123300005184SoilVETQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVAAWLWAIAAFFSVVITVSSGGSTRSIQ*
Ga0066675_1018665223300005187SoilMCGQPIWGIAGVLSCSSLAYLSWGHVRREEFDWPHDSWSIVTYAVWILLMRGLLSETRCWRERIFFALVLTNFVLGFVLAIWNTVPNSAVREVRIISAALWALAAAVSLIVTFSSGSSTTATKKAGNV*
Ga0066388_10508024913300005332Tropical Forest SoilQPIWGIAGVLSCSYLAYLSWSHVHREEFDWPHDGWSILTYAIWVLLMGGLLSETRCWRERIFFGLVLTNFALGFALVIWSAAPNNAVRDLRITSAVLWALAAAISLVVTFSSGTPPAAAEKASNV*
Ga0070714_10211354813300005435Agricultural SoilVETQITTKPPRAKPVHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSLGLALVIWRMAPVGAVREVRLVSAWLWAIAAFVSVVITVSSGGSTRSIQ*
Ga0066686_1090910623300005446SoilKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0070706_10005965333300005467Corn, Switchgrass And Miscanthus RhizosphereMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSLGLALVIWRMAPVGAVREVRLVSAWLWAIAAFVSVVITVSSGGSTRSIQ*
Ga0070707_10037133723300005468Corn, Switchgrass And Miscanthus RhizosphereVETQITTKPRRIKPTHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSLGLALVIWRMAPVGAVREVRLVSAWLWAIAAFVSVVITVSSGGSTRSIQ*
Ga0070707_10074616623300005468Corn, Switchgrass And Miscanthus RhizosphereMCGQPIWGIAGVLSCSYLAYLSWGHVHREEFDWPHDGWSILTYAIWVLLMGGLLSETRCWRERIFFGLVLTNFALGFALVIWSAAPNNAVREVRMISAALWALAAAVSLLVTFSSGTPSAARKKASNV*
Ga0070698_10008649643300005471Corn, Switchgrass And Miscanthus RhizosphereMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALISVVITVSPGGSTRSIQ*
Ga0070697_10020664513300005536Corn, Switchgrass And Miscanthus RhizosphereSYLAYLSWSHVHQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLGNFSTGLALVIWRMAPVGAVREVRLVSAWLWAIAALVGVVITVSPGGSTRSIQ*
Ga0070697_10023229323300005536Corn, Switchgrass And Miscanthus RhizosphereMCGQPIWGFAGVLSCSYLAYLSWSHVHQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALISVVITVSPGGSTRSIQ*
Ga0066695_1068927423300005553SoilVETQITTKPRRIKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSLGLALVIWRMAPVGAVREVRLVSAWLWAIAAFVSVVITVSSGGSTRSIQ*
Ga0066707_1060012823300005556SoilVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAVLVSVVITVSPGGSTRSIQ*
Ga0066693_1002259823300005566SoilVETQITKPRRIKPAHLMCGQPIWGLAGVLSCAYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSRRSTQ*
Ga0066706_1006125943300005598SoilMCGQPIWGIAGVLSCSSLAYLSWGHVRREEFDWPHDSWSIVTYAVWILLMRGLLSETRCWRERIFFALVLTNFVLGFVLAIWNTVPNSAVREVRIISAALWALAAAV
Ga0070717_1008414543300006028Corn, Switchgrass And Miscanthus RhizosphereVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALISVVITVSLGGSTRSIQ*
Ga0070717_1013830113300006028Corn, Switchgrass And Miscanthus RhizosphereMCGQPIWGIAGALSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSLGLVLVIWRMAPVGAVREVRVVSAWLWAIAA
Ga0066652_10053722023300006046SoilVETQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSRRSTQ*
Ga0066665_1075750423300006796SoilVETQITTKPRRMKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAVLVSVVITVSPGGSTRSIQ*
Ga0075426_1148450613300006903Populus RhizosphereVETQITTKRPRTKAAHLMCGQPIWGIAGVLSCSYLAYLSWGHVHREEFDWPHDGWSILTYAIWVLLMGGLLSETRCWRERIFFALVLTNFALGFALVIWSAAPNNAVREVRMISAALWALAAAVSLLVTF
Ga0099829_1148959923300009038Vadose Zone SoilMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0126373_1051919523300010048Tropical Forest SoilMCGQLLWGIAGALGCSYLFYLSWNHVRREEFDWPHDGWSILTYGIWVVLMSGLLNETRCWRERIFFGLVLTNFVLGFALVIWGAAPNEVVREVRIVSVVLWSLAAAASVIVVFS
Ga0127474_103251613300010108Grasslands SoilMQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWVHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVITVSSGGSTRSIQ*
Ga0134109_1008314623300010320Grasslands SoilMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSRRSTQ*
Ga0134084_1000253643300010322Grasslands SoilMQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSRRSTQ*
Ga0134065_1001688113300010326Grasslands SoilMCGQPIWGLAGVLSCAYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0134111_1000457123300010329Grasslands SoilMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVITVSPSGSRRSTQ*
Ga0134071_1039793123300010336Grasslands SoilMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLTLVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSV
Ga0134071_1056941413300010336Grasslands SoilQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0126372_1002531523300010360Tropical Forest SoilMCGQLIWGIAGVLGCSYLFYLSWSHVRREEFDWPHDGWSILTYGIWVVLMSGLLNETRCWRERIFFGLVLTNFVLGFALVIWGAAPNDVVREVRIVSVVLWSLAAAASVVVVFSGGTPPEAAKKNGNV*
Ga0126378_1056893813300010361Tropical Forest SoilMCGQLIWGIAGALGCSYLFYLSWSHVRREEFDWPHDGWSILTYGIWVVLMSGLLNETRCWRERIFFGLVLTNFVLGFALVIWGAAPNDVVREVRIVSVVLWSLAAAASVVVVFSGGTPPEAAKKNGNV*
Ga0134066_1005481423300010364Grasslands SoilMCGQPIWGIAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0126379_1018231733300010366Tropical Forest SoilMCGQLLWGIAGALGCSYLFYLSWNHVRREEFDWPHDGWSILTYGIWVVLMSGLLNETRCWRERIFFGLVLTNFVLGFALVIWGAAPNEVVREVRIVSVVLWSLAAAASVIVVFSGGTPPAAAKKNGNV*
Ga0126381_10506928123300010376Tropical Forest SoilAQVWSHVRREEFDWPHDGWSILTYGIWVVLMSGLLNETRCWRERIFFGLVLTNFVLGFALVIWGAAPNDVVREVRIVSVVLWSLAAAASVVVVFSGGTPPAAAKKNGNV*
Ga0126383_1224589223300010398Tropical Forest SoilREEFDWPHDGWSILTYGIWVVLMSGLLNETRCWRERIFFGLVLTNFVLGFALVIWGAAPNDVVREVRIVSVVLWSLAAAASVVVVFSGGTPPAAAKKNGNV*
Ga0137389_1010535333300012096Vadose Zone SoilMCGQPIWGIAGVLGCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0137383_1068706523300012199Vadose Zone SoilMQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0137363_1022798423300012202Vadose Zone SoilMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETGCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0137379_1068287423300012209Vadose Zone SoilCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSLGLALVIWRMAPVGAVREVRLVSAWLWAIAAFVSVVITVSSGGSTRSIQ*
Ga0137379_1094840313300012209Vadose Zone SoilMKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAVLVSVVITVSPGGS
Ga0137370_1018165223300012285Vadose Zone SoilGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0137366_1051234323300012354Vadose Zone SoilMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTSSVQ*
Ga0137366_1090321723300012354Vadose Zone SoilMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSLGLALVIWRMAPVGAVREVRLVSAWLWVIAALVSVVITVSPGGSMRSIQ*
Ga0137384_1099194213300012357Vadose Zone SoilMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLGLVIWRMAPVGAVREVRLVSAWLWAIAAFVSVVITVSSG
Ga0137385_1015716433300012359Vadose Zone SoilMQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVITVSSGGSTRSIQ*
Ga0134036_117814013300012384Grasslands SoilGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVITVSSGGSTRSIQ*
Ga0134040_129146013300012389Grasslands SoilMCGQPIWGLAGVLSCSYLAYLSLGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVITVSSGGSTRSIQ*
Ga0137359_1085828713300012923Vadose Zone SoilVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTSSVQ*
Ga0137404_1184046223300012929Vadose Zone SoilMKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ*
Ga0134087_1015093823300012977Grasslands SoilMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVE*
Ga0134081_1003067933300014150Grasslands SoilMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVITVSSGGSTRSIQ*
Ga0134075_1005317123300014154Grasslands SoilMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAITALVSVVITVSSGGSTRSIQ*
Ga0134078_1003304623300014157Grasslands SoilMCGQPIWGLAGVLSCAYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSRRSTQ*
Ga0134069_107459423300017654Grasslands SoilVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVE
Ga0066655_1087233423300018431Grasslands SoilQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTSSVQ
Ga0066667_1191650413300018433Grasslands SoilAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAVLVSVVITVSPGGSTRSIQ
Ga0066669_1125862223300018482Grasslands SoilMCGQPIWGIAGVLSCSSLAYLSWGHVRREEFDWPHDSWSIVTYAVWILLMRGLLSETRCWRERIFFALVLTNFVLGFVLAIWNTVPNSAVREVRIISAALWALAAAVSLIVTFSSGSSTTATKKAGNV
Ga0210385_1131730923300021402SoilMETHVTPTPQARRVRSAHLMCGMPLWGITGTLSCSYLAYLSYGHVRRAEFEWTHDGWSIATYGVWVLLMVGLLGESRCWRERVFFGLVMANFVLGLALAVWQAAPVFAVREVRVISAALW
Ga0207684_1015301723300025910Corn, Switchgrass And Miscanthus RhizosphereVETQITTKPRRIKPTHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSLGLALVIWRMAPVGAVREVRLVSAWLWAIAAFVSVVITVSSGGSTRSIQ
Ga0207664_1199132523300025929Agricultural SoilIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSLGLALVIWRMAPVGAVREVRLVSAWLWAIAAFVSVVITVSSGGSTRSIQ
Ga0207665_1051436123300025939Corn, Switchgrass And Miscanthus RhizosphereMCGQPIWGFAGVLSCSYLAYLSWSHVHQGDFVWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVHEVRLVSAWLWAIAALVGVVITVSPGGSTRSIQ
Ga0209350_101263823300026277Grasslands SoilVETQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209234_100242553300026295Grasslands SoilVETQITTKPQRIKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLGLVIWRMAPVGAVREVRLVSAWLWVIAALVSVVITVSPGGSMRSIQ
Ga0209235_105037023300026296Grasslands SoilVETQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVITVSPSGSRRSTQ
Ga0209235_108761423300026296Grasslands SoilVETQITTKPQRIKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209237_118927923300026297Grasslands SoilVETQITTKPQRIKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETGCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209238_105035223300026301Grasslands SoilVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209688_102820623300026305SoilVETQITKPRRIKPAHLMCGQPIWGLAGVLSCAYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSRRSTQ
Ga0209468_102453633300026306SoilVETQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSRRSTQ
Ga0209239_101849233300026310Grasslands SoilVETQITTKPQRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209268_100152763300026314SoilVETQITKPRRIKPAHLMCGQQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSRRSTQ
Ga0209686_110711923300026315SoilVEMQITTKPRRMKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAVLVSVVITVSPGGSTRSIQ
Ga0209471_123511413300026318SoilVETQITKPRRIKPAHLMCGQPIWGLAGVLSCAYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209801_101952833300026326SoilMQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209375_102851213300026329SoilWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSRRSTQ
Ga0209159_102687413300026343SoilVETQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVI
Ga0209059_100719853300026527SoilMCGQPIWGLAGVLSCAYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209157_105840933300026537SoilQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209805_100793743300026542SoilVETQITTKPRRIKPAHLMCGQPIWGLAGVLSCSYLAYLSWGHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVITVSSGGSTRSIQ
Ga0209156_1018800723300026547SoilVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRTAPVGAVREVRLVSAWLWAIAALV
Ga0209474_1031986623300026550SoilVETQITSKRPRIRLAHLMCGQPIWGIAGVLSCSSLAYLSWGHVRREEFDWPHDSWSIVTYAVWILLMRGLLSETRCWRERIFFALVLTNFVLGFVLAIWNTVPNSAVREVRIISAALWALAAAVSLIVTFSSGSSTTATKKAGNV
Ga0209577_1021261413300026552SoilWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209689_111308613300027748SoilQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0209590_1040789023300027882Vadose Zone SoilLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETGCWRERAFFALVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAALVSVVIAVSPSGSTPSVQ
Ga0137415_1117645913300028536Vadose Zone SoilVETQITTKPQRIKPAHLMCGQPIWGIAGVLSCSYLAYLSWSHVRQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLGLVIWRMAPVGAVREVRLVSAWL
Ga0307473_1065198423300031820Hardwood Forest SoilVETQITTKRQRIKPAHLMCGQPIWGFAGVLSCSYLAYLSWSHVHQGDFEWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVHEVRLVSAWLWAIAALVGVVITVSPGGSTRSIQ
Ga0307473_1089929923300031820Hardwood Forest SoilMETHVTPTPQARRVRSAHLMCGMPLWGITGTLSCSYLAYLSYGHVRRAEFEWTHDGWSIATYGVWVLLMVGLLGESRCWRERVFFGLVMANFVLGLALAVWQAAPVDAVREVRVISAALWAAAAAV
Ga0307479_1009433823300031962Hardwood Forest SoilMCGMPLWGITGTLSCSYLAYLSYGHVRRAEFEWTHDGWSIATYGVWVLLMVGLLGESRCWRERVFFGLVMANFVLGLALAVWQAAPVFAVREVRVISAALWAAAAAVSLVITFSSGQDRTVEKQGRIESR
Ga0307472_10002937723300032205Hardwood Forest SoilMCGQPIWGFAGVLSCSYLAYLSWSHVHQGDFEWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFALVLANFSMGLALVIWRMAPVGAVHEVRLVSAWLWAIAALVGVVITVSPGGSTRSIQ
Ga0307472_10234790113300032205Hardwood Forest SoilAYLSWSHVHQGDFDWAHDAWSIVTYAVWVLLMLGLLTETRCWRERAFFVLVLANFSMGLALVIWRMAPVGAVREVRLVSAWLWAIAAVVGVVITVSPGGSTRSIQ


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.