NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F103524

Metagenome Family F103524

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103524
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 135 residues
Representative Sequence HVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK
Number of Associated Samples 89
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction Yes
3D model pTM-score0.39

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(21.782 % of family members)
Environment Ontology (ENVO) Unclassified
(43.564 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(58.416 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 52.20%    β-sheet: 1.26%    Coil/Unstructured: 46.54%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.39
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF01977UbiD 51.49
PF02776TPP_enzyme_N 10.89
PF02515CoA_transf_3 2.97
PF02518HATPase_c 1.98
PF01070FMN_dh 0.99
PF01791DeoC 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG00433-polyprenyl-4-hydroxybenzoate decarboxylaseCoenzyme transport and metabolism [H] 51.49
COG1804Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferasesLipid transport and metabolism [I] 2.97
COG0069Glutamate synthase domain 2Amino acid transport and metabolism [E] 0.99
COG1304FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomeraseEnergy production and conversion [C] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil21.78%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil14.85%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil11.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil8.91%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil8.91%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.93%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere3.96%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere3.96%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere2.97%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil2.97%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.97%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.98%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.98%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere1.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.99%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.99%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.99%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000891Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005328Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaGHost-AssociatedOpen in IMG/M
3300005330Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaGEnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005338Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2Host-AssociatedOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005530Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005564Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaGHost-AssociatedOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006196Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1Host-AssociatedOpen in IMG/M
3300006881Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2Host-AssociatedOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012896Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S118-311C-2EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018066Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300025907Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025921Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025949Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026944Soil microbial communities from Kellog Biological Station, Michigan, USA - Nitrogen cycling UWRJ-G01K2-12 (SPAdes)EnvironmentalOpen in IMG/M
3300027181Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027480Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027681Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027873Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028718Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194EnvironmentalOpen in IMG/M
3300028771Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369EnvironmentalOpen in IMG/M
3300028793Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159EnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI10214J12806_1167769123300000891SoilGAIDAGVSQPPDSIAVEKAGFHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
JGI10216J12902_11201313513300000956SoilLLSGAIQGGVSQPPDSLTLEAKGFHVLYDLASQKLPSANTSVAVRRSYIASSKDVVQRYVDSIVQGTKKLKSDKAFGVSVLKKYFQSTDDAAMGVTYDFYAVNVAPSQPFAKPEMYVDAQATLGASDAKVKAFDISKLLDSNFVQSAIDRGLDK*
JGI25385J37094_1018463423300002558Grasslands SoilRRSYIASSKDVVQRYVDSLVLGIKKLKADKAFGVSVLKKYFASTDEAAMSVTYDFYALSVAPTQPFAKPDMYADAQTTLGASDPKVKAFDVSKLLDSTFVQSAVDRGLDK*
JGI25384J37096_1024034013300002561Grasslands SoilGVSQPPDSLALEAKGFHVLYDLASQKLPSANTSVAVRRSYISSSKDVVQRYVDSIVQGTKKLKSDKAFGVSVLKKYFQSTDDAAMGATYDFYALTVAPSQPFAKPEMYVDAQATLGAGDAKVKAFDISKLLDSTFVQSAIDRGLDK*
JGI25382J43887_1015706423300002908Grasslands SoilKDVSILAVGSAQNRTAALLAGSIQGGVSQPPDSIALEAKGFHVLYDLASQKLPSANTSVAVRRSYIASSKDVVQRYVDSLVLGIKKLKADKAFGVSVLKKYFASTDEAAMSVTYDFYALSVAPTQPFAKLDMYADAQTTLGASDPKVKAFDVSKLLDSTFVQSAVDRGLDK*
Ga0066674_1003681033300005166SoilPSANTSVVVRRSYIASNKDVVQRYVDSLVLGIKRLKADKAFGITVLKKYFQSTDDQAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN
Ga0066683_1034607423300005172SoilAVRRSYISQNKAVVQRYIDSIVQGIKKLKSDKAFGVSVLKKYFSSTDEASMSATYDFYALSVAPTQPFAKPEMYADGQAVLGANDAKVKAFDVTKMLDSTFVQSAVDRGLDK*
Ga0066680_1019324823300005174SoilPDSIALEAKGFHVLYDLASQKLPSANTSVAVRRSYIASSKDVVQRYVDSLVQGTKKLKADKAFGVSVLKKYFASTDEAAMSVTYDFYALSVAPTQPFAKLDMYADAQTTLGASDPKVKAFDVSKLLDSTFVQSAVDRGLDK*
Ga0066688_1076543113300005178SoilQKLPSANTSVVVKRDYLNANRGVVQRYVDALFLGIKKVKSDRAFGVQVLKKYFQSTDDKAMGATYDFYALTVTPSQPVPRPEQFADAQATLGATNAKVKDYDVSKMLDQSFTKSAIDRGLDK*
Ga0066676_1018819513300005186SoilLSQPPDSLALEAKGFHVLYDLASQKLPSANTSVAVRRSYIASSRDVVQRYVDSLVQGIKKLKSDKPFGVSVLKKYFSSTDEAAMSVTYDFYALSVAPTQPYAKADMYADAQTTLGANDAKVKAFDVTKLLDSTFVQSAVDRGLDK*
Ga0066676_1090673223300005186SoilANTSVVVTRSFLNANKAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN*
Ga0066675_1144355913300005187SoilAGSIQAGVSQPPDSIALEDKGFHVLYDLASQKLPSANTSVVVTRSFLNANKAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN*
Ga0065705_1037028313300005294Switchgrass RhizosphereEKAGFHVLYDLAGQKLPSANTSVVVTRAFMTANKAVVQRYVDSLVMGIKKMKAERQFGIDTLKKYFKSTDDAAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0065705_1066342023300005294Switchgrass RhizosphereQPPDSIAVEKAGFHVLYDLASQKLPSANTSVVVTRSFLNANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQTILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0065705_1095723723300005294Switchgrass RhizosphereANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDTSFVQSAVDRGLDKK*
Ga0070676_1040984613300005328Miscanthus RhizosphereIDAGVSQPPDSIAVEKAGFHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0070690_10047553613300005330Switchgrass RhizosphereSGAIDAGVSQPPDSLAVEKAGFHVLYDLAGQKLPSANTSVVVTRAFMTANKAVVQRYVDSLVQGIKKMKAERQFGIDTLKKYFKSTDDAAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0070680_10027524713300005336Corn RhizosphereAGFHVLYDLAGQKLPSANTSVVVTRSFMTANKAVIQRYVDSLVQGIKKMKADRQFGIDTLKKYFKSTDDAAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0068868_10164452213300005338Miscanthus RhizosphereDSIAVEKAGFHVLYDLASQKLPSANTSVVVTRSFLTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0070692_1004522813300005345Corn, Switchgrass And Miscanthus RhizosphereAVEKAGFHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0070692_1052928323300005345Corn, Switchgrass And Miscanthus RhizosphereQRTAAMLSGAIDAGVSQPPDSLAVEKAGFHVLYDLASQKLPSANTSVVVTRTFLTANKAVIQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0070708_10037338713300005445Corn, Switchgrass And Miscanthus RhizosphereDSLAVEKAGFHVLYDLASQKLPSANTSVVVTRTFLTANKAVIQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDTTFVQSAVDRGLDKK*
Ga0066687_1035974223300005454SoilDLASQHLPSANTSVVVRRSYITSNKDVGQRYVDSLVLGIKRLKADKAFGITVLKKYFQSTDDQAMGATYDFYAQLVTATQPFAKPEMFADSQATLGATNAAVKSFDISKMLDTSFVQSAVDRGLDK*
Ga0070707_10074683213300005468Corn, Switchgrass And Miscanthus RhizosphereDAGVSQPPDSLAVEKAGFHVLYDLASQKLPSANTSVVVTRTFLTANKAVIQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0070699_10002931113300005518Corn, Switchgrass And Miscanthus RhizosphereDLASQKLPSANTSVVVTRSFLNANKAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGITYDFYAQLVTATQPFPKPEMFADAQTILGAKSDKVKSYDVTKMLDTSFVQSAVDRGLDRN*
Ga0070699_10114485213300005518Corn, Switchgrass And Miscanthus RhizosphereAVEEKGFHVLYDLASQKLPSANTSVVVTRAFLSANRAVVQRYVDSLVLGIKKLKADRAFGIQTLKKYFKSTDDKAMAATYDFYAVLVTATQPFPRPEMFADAQSILGAKNDKVKNYNVNNMLDVSLVQSAVDRGLDKQ*
Ga0070679_10069998413300005530Corn RhizosphereVLYDLASQKLPSANTSVVVTRSFLTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0070697_10175722423300005536Corn, Switchgrass And Miscanthus RhizosphereTSVVVKRDYLNANKSVVQRYVDALFLGIKKVKSDRAFGVQVLKKYFQSTDDKAMGATYDFYALTVTPSQPVTRPEQFADAQATLGATNAKVKDYDVNKMLDTSFTKSAIDRGLDK*
Ga0066697_1059980523300005540SoilFHVLYDLASQKLPSANTAVAVRRSYISQNKAVVQRYIDSIVQGIKKLKSDKAFGVSVLKKYFSSTDEASMSATYDFYALSVAPTQPFAKPEMYADGQAVLGANDAKVKAFDVTKMLDSTFVQSAVDRGLDK*
Ga0066692_1080617213300005555SoilVQRYADSLVLGIKKLKADKAFGVSVLKKYFASTDEAAMSVTYDFYALSVAPTQPFAKPDMYADAQTTLGASDPKVKAFDVSKLLDSTFVQSAVDRGLDK*
Ga0066707_1027248413300005556SoilLPSANTSVVVTRSFLNANKAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN*
Ga0066704_1104947913300005557SoilSSKDVVQRYVDSIVQGTKKLKSDKAFGVSVLKKYFQSTDDAAMGATYDFYALTVAPSQPFAKPEMYVDAQATLGAGDAKVKALDISKLLDSTFVQSAIDRGLDK*
Ga0070664_10068095623300005564Corn RhizosphereSLAVEKAGFHVLYDLAGQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDPAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0066652_10047999613300006046SoilEDKGFHVLYDLASQKLPSANTSVVVTRSFLNANKAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN*
Ga0075422_1032775913300006196Populus RhizosphereGQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDPAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0068865_10157759123300006881Miscanthus RhizosphereVEKAGFHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0099829_1049455913300009038Vadose Zone SoilQKLPSANTSVVVTRSFLNANKAVVQRYVDSLIQGIKKLKADRPFGIDVLKKYFKSTDDKAMGVTYDFYAQLVTTTQPFPKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDRN*
Ga0099830_1084927923300009088Vadose Zone SoilSQKLPSANTSVVVTRAFMIANKAVVQRYVDSLVLGIKKMKGDRDFGIQTLKKYFKSTDDKAMAATYDFYAVLVSATQPFARPEMFADAQATLGAKNDKVKNYDVSKMLDTSFVQSAVDRGLDKK*
Ga0075423_1092415613300009162Populus RhizosphereYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVQGIRKMKADRQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDTSFVQSAVDRGLDKK*
Ga0134109_1035287013300010320Grasslands SoilQNKDVVQRYVDSIVLGIKKLKSDKAFGISVLKKYFNSTDDAKMGVTYDFYALSVAPVQPFAKLEMYTDSQVTLGATNAAVKSYDLSKMLDSTFVQSSIDRNLDKN*
Ga0134111_1021608613300010329Grasslands SoilKLPSANTSVVVTRTFLTANKAVVQRYVDSLVQGIKKLKADRQFGIDVLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQTILGAKSDKVKSYDVTRMLDTSFVQSAVDRGLDKN*
Ga0134080_1041416023300010333Grasslands SoilNKAVVQRYIDSIVQGIKKLKSDKAFGVSVLKKYFNSTDEASMSATYDFYALSVAPTQPFAKPEMYADGQAVLGANDAKVKAFDVTKMLDSTFVQSAVDRGLDK*
Ga0134071_1052480623300010336Grasslands SoilSANTSVAVRRSYITSSKDVVQRYVDSIVQGIKKLKSDKAFGVGVLKKYFNSTDDAAMGVTYDFYALNVAPSQPFAKPEMYVDAQATLGASDAKVKGFDISKLLDSTFVQSAIDRGLDK*
Ga0134071_1076038323300010336Grasslands SoilKGFHVLYDLASQKLPSANTSVVVRRDYLNANRGVVQRYVDALFLGIKKVKSDRAFGVQVLKKYFQSIDDKAMGATYDFYALTVTPSQPVPRPEQSADAQATLGATNVKVKDYDVSKMLDQSFTKSAIDRGLDK*
Ga0134062_1065675613300010337Grasslands SoilLPSENTSVVVTRSFLNANKAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN*
Ga0137364_1050342413300012198Vadose Zone SoilAALLSGAIQGGVSQPPDSLTLEAKGFHVLYDLASQKLPSANTSVAVRRSYIASSKDVVQRYIDSIVQGIKKLKSDKAFGVSVLKKYFNSTDEAAMTATYDFYALTVAPSQPFAKPEMYVDAQATLGAGDAKVKAFDISKLLDSTFVQSAIDRGLDK*
Ga0137382_1007559313300012200Vadose Zone SoilNTSVVVTRSFLNANKAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN*
Ga0137399_1053920023300012203Vadose Zone SoilSVVVTRTFLNASKPVVQRYVDSLVQGIRKMKADRQFGIDVLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0137374_1029029723300012204Vadose Zone SoilQPPDSLAVEKAGFHVLYDLASQKLPSANTSVVVTRAFMTANKAVVQRYVDSLVQGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDTSFVQSAVDRGLDKK*
Ga0137368_1015669213300012358Vadose Zone SoilMLSGAIDAGVSQPPDSLAVEKAGFHVLYDLASQKLPSANTSVVVTRAFMTANKAVVQRYVDSLVQGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDTSFVQSAVDRGLDKK*
Ga0137361_1077732323300012362Vadose Zone SoilAGVSQPPDSLAVEKAGFHVLYDLASQKLPSANTSVVVTRSFLTANKAVVQRYVDSIVQGIRKLKAERQFGIDVLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN*
Ga0137397_1041072223300012685Vadose Zone SoilLPSANTSVVVTRSFLTANKAVVQRYVDSLVMGIKKMKADRQFGIDVLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDTSFVQSAVDRGLDKK*
Ga0157303_1025475913300012896SoilVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK*
Ga0134110_1036887233300012975Grasslands SoilVVVTRSFLTANKAVVQRYVDSIVQGIKKLKADRQFGIDVLKKYFKSTDDVAMAATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN*
Ga0134110_1040251913300012975Grasslands SoilVRRSYISSSKDVVQRYVDSIVQGTKKLKSDKAFGISVLKKYFNSTDDAKMGVTYDFYALSVAPVQPFAKLEMYTDSQVTLGATNAAVKSYDLSKMLDSTFVQSSIDRNLDKN*
Ga0134076_1036143713300012976Grasslands SoilRRSYIASSRDVVQRYVDSLVQGTKKLKSDKPFGVSVLKKYFSSTDEAAMSVTYDFYALSVAPTQPFAKADMYADAQTTLGANDAKVKAFDVTKLLDSTFVQSAVDRGLDK*
Ga0134078_1006872113300014157Grasslands SoilPVGSAANRTAALLSGAIQGGVSQPPDSLALEAKDFHVLYDLASQKLPSANTSVAVRRSYIAQNKDVVQRYVDSIVLGIKKLKSDKAFGISVLKKYFNSTDDAKMGVTYDFYALSVAPVQPFAKLEMYTDSQVTLGATNAAVKSYDLSKMLDSTFVQSSIDRNLDKN*
Ga0134078_1055912513300014157Grasslands SoilKLPSANTSVVVTRSFLNANKAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN*
Ga0134089_1031559123300015358Grasslands SoilLYDLASQKLPSANTSVAVRRSYIASSRDVVQRYVDSLVQGIKKLKSDKPFGVSVLKKYFSSTDEAAMSVTYDFYALSVAPTQPYAKADMYADAQTTLGANDAKVKAFDVTKLLDSTFVQSAVDRGLDK*
Ga0134085_1031419123300015359Grasslands SoilSQPPDSLTLEAKGFHVLYDLASQKLPSANTAVAVRRSYISQNKAVVQRYIDSIVQGIKKLKSDKAFGVSVLKKYFNSTDEASMSATYDFYALSVAPTQPFAKPEMYADGQAVLGANDAKVKAFDVTKMLDSTFVQSAVDRGLDK*
Ga0134069_134691713300017654Grasslands SoilLTKAGFKIAVDLSKQKVPATDNTMVTTRTYASANRAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN
Ga0134083_1055221113300017659Grasslands SoilIIAVGSSQQRTAALLAGAIQGGVSQPPDSIALEAKGFHVLHDLASQKLPSANTSVVVKRDYLNANRGVVQRYVDALFLGIKKVKSDRAFGVQVLKKYFQSTDDKAMGATYDFYALTVTPSQPVARPEQFADAQATLGATNAKVKDYDVSKMLDQSFTKSAIDRGLDK
Ga0184619_1011002313300018061Groundwater SedimentYDLASQKLPSANTSVAVRRSYIASSKDVVQRYIDSIVQGIKKLKSDKAFGVSVLKKYFNSTDEAAMTATYDFYALTVTPSQPFAKPEMYVDAQATLGAGDAKVKAFDISKLLDSTFVQSAIDRGLDK
Ga0184617_109438213300018066Groundwater SedimentGVSQPPDSLAVEKAGFHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQIVTATQPFPKPEMFADAQTILGAKSDKVKSYDVSKMLDSSFVQSAVDRGLDKK
Ga0066667_1003581513300018433Grasslands SoilNTSVAVRRSYIAQNKDVVQRYVDSIVLGIKKLKSDKAFGISVLKKYFNSTDDAKMGVTYDFYALSVAPVQPFAKLEMYTDSQVTLGATNAAVKSYDLSKMLDSTFVQSSIDRNLDKN
Ga0066662_1015652823300018468Grasslands SoilANTSVAVRRSYISSSKDVVQRYVDSIVQGTKKLKSDKAFGVSVLKKYFQSTDDAAMGATYDFYALTVAPSQPFAKPEMYVDAQATLGAGDAKVKAFDISKLLDSTFVQSAIDRGLDK
Ga0210381_1015342013300021078Groundwater SedimentVEKAGFHVLYDLAGQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDTSFVQSAVDRGLDKK
Ga0210382_1024767413300021080Groundwater SedimentIDAGVSQPPDSLAVEKAGFHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDTSFVQSAVDRGLDKN
Ga0207645_1038649723300025907Miscanthus RhizosphereHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK
Ga0207660_1167772913300025917Corn RhizosphereHVLYDLAGQKLPSANTSVVVTRSFMTANKAVIQRYVDSLVQGIKKMKADRQFGIDTLKKYFKSTDDAAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK
Ga0207652_1067468613300025921Corn RhizosphereIAVEKAGFHVLYDLASQKLPSANTSVVVTRSFLTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK
Ga0207704_1143335123300025938Miscanthus RhizosphereAVEKAGFHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKI
Ga0207667_1045755523300025949Corn RhizosphereAVGSASQRLAAMLSGAIDAGVSQPPDSIAVEKAGFHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK
Ga0207648_1012996823300026089Miscanthus RhizosphereSFVAVGSAAQRTAAMLSGAIDAGVSQPPDSIAVEKAGFHVLYDLASQKLPSANTSVVVTRSFLTANKAVVQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK
Ga0209235_106916613300026296Grasslands SoilPPDSLALEAKGFHVLYDLASQKLPSANTSVAVRRSYVASSKDVVQRYIDSIVQGTRKLKSDKAFGVSVLKKYFQSTDDAAMGATYDFYALTVAPSQPFAKPEMYVDAQATLGASDAKVKAFDISRLLDSTFVQSAIDRGLDK
Ga0209235_112271923300026296Grasslands SoilLASQKLPSANTSVAVRRSYIASSKDVVQRYVDSLVLGIKKLKADKAFGVSVLKKYFASTDEAAMSVTYDFYALSVAPTQPFAKPDMYADAQTTLGASDPKVKAFDVSKLLDSTFVQSAVDRGLDK
Ga0209237_108160023300026297Grasslands SoilTAALLGGSIQGGVSQPPDSLTLEAKGFHVLYDLASQKLPSANTSVAVRRSYITSSKDVVQRYVDSIVQGIKKLKSDKAFGVGVLKKYFNSTDDAAMGVTYDFYALNVAPSQPFAKPEMYVDAQATLGASDAKVKAFDISKLLDSTFVQSAIDRGLDK
Ga0209237_108497623300026297Grasslands SoilPPDSLALEAKGFHVLYDLASQKLPSANTSVAVRRSYISSSKDVVQRYVDSIVQGTKKLKSDKAFGVSVLKKYFQSTDDAAMGATYDFYALTVAPSQPFAKPEMYVDAQATLGAGDAKVKAFDISKLLDSTFVQSAIDRGLDK
Ga0209377_131963313300026334SoilPDKDVSIVPVGSAANRTAALLSGAIQGGVSQPPDSLALEAKGFHVLYDLASQKLPSANTSVAVRRSYISSSKDVVQRYVDSIVQGTKKLKSDKAFGVSVLKKYFQSTDDAAMGATYDFYALTVAPSQPFAKPEMYVDAQATLGAGDAKVKALDISKLLDSTFVQSAIDRGLD
Ga0209806_133586413300026529SoilQATVANPPETTSLEKAGFHSVLDLAGLKLPASLQGTITRRDSAKSKPEIVQRYVDSIVLGIKKLKSDKAFGISVLKKYFNSTDDAKMGVTYDFYALSVAPVQPFAKLEMYTDSQVTLGATNAAVKSYDLSKMLDSTFVQSSIDRNLDKN
Ga0209058_134506523300026536SoilASQKLPSANTSVVVKRDYLNANRGVVQRYVDALFLGIKKVKFDRAFGVQVLKKYFQSTDDKAMGATYDFYALTVTPSQPVPRPEQFADAQATLGATNAKVKDYDVSKMLDQSFTKSAIDRGLDK
Ga0209056_1036237623300026538SoilVGSAAQRTAALLAGSIQAGVSQPPDSIALEDKGFHVLYDLASQKLPSANTSVVVTRSFLNANKAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN
Ga0209376_116770323300026540SoilSLALEAKGFHVLYDLASQKLPSANTSVAVRRSYISSSKDVVQRYVDSIVQGTKKLKSDKAFGVSVLKKYFQSTDDAAMGATYDFYALTVAPSQPFAKPEMYVDAQATLGAGDAKVKALDISKLLDSTFVQSAIDRGLDK
Ga0209161_1001627653300026548SoilVVTRSFLNANKAVVQRYVDSLVQGIKKLKADRPFGIDVLKKYFKSTDDKAMGATYDFYAQLVTATQPFAKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDKN
Ga0209474_1022750113300026550SoilGSIQGRVSQPPDSLDLEAKGFHVLYDLASQKLPSANTSVAVRRSYIAQNKDVVQRYVDSIVLGIKKLKSDKAFGISVLKKYFNSTDDAKMGVTYDFYALSVAPVQPFAKLEMYTDSQVTLGATNAAVKSYDLSKMLDSTFVQSSIDRNLDKN
Ga0207570_100197013300026944SoilYDLASQKLPSANTSVVVTRTFLTANKAVIQRYVDSLVMGIKKMKADRQFGIDTLKKYFKSTDDVAMGATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK
Ga0208997_100132933300027181Forest SoilPPNLTKLKDAGFHSLYDLAGQKFAAANTTIVAQRAWVNANKSVMQRYVDSIVQGIKKLKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFAKPEMFADAQTILGVKSDKVKNYDVTKMLDTTFVQSAIDRGLDKK
Ga0208993_103750013300027480Forest SoilIDAGVSQPPDSLAVEAKGFHVLYDLASQKLPSANTSVVVTRTFLNANRAVVQRYVDSLVMGIRRMKSDRQFGIDVLKKYFKSTDDTAMAATYDFYAQLVTSTQPFPRPEMFADAQTILGAKSDKVKNYDVTKMLDTTFVQSAIDRGLDKK
Ga0208991_109027113300027681Forest SoilPPDSIALEEKGFHVLYDLASQKLPSANTSVVVTRKFMTENKAVVQRYVDALVLGIKKMKADRDFGIATLKKYFKSTDDTAMAATYDFYAQLVTSTQPFPRPEMFADAQTILGAKSDKVKNYDVTKMLDTTFVQSAIDRGLDKK
Ga0209180_1042061113300027846Vadose Zone SoilQKLPSANTSVVVTRSFLNANKAVVQRYVDSLIQGIKKLKADRPFGIDVLKKYFKSTDDKAMGVTYDFYAQLVTTTQPFPKPEMFADAQTILGAKSDKVKAYDVTKMLDTSFVQSAVDRGLDRN
Ga0209814_1012269423300027873Populus RhizosphereSYIAQNKGVVQRYIDSIVQGIKKLKADKAFGVSVLKKYFSSTDEAAMSATYDFYALSVTPTQPFAKPEMYADGQAVLGANDAKVKAFDVTKMLDSTFVQSAVDRGLDK
Ga0209590_1049396713300027882Vadose Zone SoilALEAKGGFHVLYDLASQKLPSANTSVAVRRSYITSSKDVVQRYVDSIVQGIKKLKSDKAFGVSVLKKYFNSTDEAALAATYDFYALTVTPSQPFARPEMYVDAQALLGANDAKVKAFDIGKLLI
Ga0137415_1135001313300028536Vadose Zone SoilGFHVLYDLASQKLPSANTSVAVRRSYVASSKDVVQRYIDSVVQGTRKLKSDKAFGVSVLKKYFQSTDDAAMGATYDFYALTVAPSQPFAKPEMYVDAQATLGASGAKVKAFDISRLLDSTFLQSAIDRGLDK
Ga0307307_1027516523300028718SoilPPDSLAVEKAGFHVLYDLAGQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVQGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK
Ga0307320_1010002323300028771SoilVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQTILGAKSDKVKSYDVSKMLDSSFVQSAVDRGLDKK
Ga0307299_1018203823300028793SoilSQKLPSANTSVVVTRAFLTANKAVVQRYVDSLVQGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDSSFVQSAVDRGLDKK
Ga0307292_1024138423300028811SoilVVVTRSFMTANKAVVQRYVDSLVQGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK
Ga0307292_1034707623300028811SoilMLSGAIDAGVSQPPDSLAVEKAGFHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVQGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKSYDVSKMLDTSFVQSAVDRGLDKK
Ga0307312_1052703313300028828SoilQRTAAMLSGAIDAGVSQPPDSLAVEKAGFHVLYDLASQKLPSANTSVVVTRSFMTANKAVVQRYVDSLVMGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDTSFVQSAVDRGLDKN
Ga0307312_1099468913300028828SoilAKGFHVLYDLASQKLPSANTSVAVRRSYIASSKDLVQRYIDSIVQGIKKLKSDKAFGVSVLKKYFNSTDEAAMTATYDFYALTVTPSQPFAKPEMYVDAQATLGAGDAKVKAFDISKLLDSTFVQSAIDRGLDK
Ga0307308_1027566623300028884SoilKLPSANTSVVVTRSFMAANKAVVQRYVDSLVMGIKKMKAERQFGIDTLKKYFKSTDDVAMAATYDFYAQLVTATQPFPKPEMFADAQSILGAKSDKVKNYDVSKMLDTSFVQSAVDRGLDKN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.