NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F102666

Metagenome Family F102666

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102666
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 151 residues
Representative Sequence VLAGGVVAAMILGIGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVISNTVQVNAL
Number of Associated Samples 62
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 75.68 %
% of genes near scaffold ends (potentially truncated) 6.93 %
% of genes from short scaffolds (< 2000 bps) 16.83 %
Associated GOLD sequencing projects 53
AlphaFold2 3D model prediction Yes
3D model pTM-score0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (63.366 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(36.634 % of family members)
Environment Ontology (ENVO) Unclassified
(50.495 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(56.436 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 1.67%    β-sheet: 27.22%    Coil/Unstructured: 71.11%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.41
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF13185GAF_2 8.91
PF02518HATPase_c 7.92
PF13492GAF_3 3.96
PF01590GAF 2.97
PF00512HisKA 2.97
PF01960ArgJ 1.98
PF12840HTH_20 1.98
PF08327AHSA1 1.98
PF00202Aminotran_3 0.99
PF01494FAD_binding_3 0.99
PF13474SnoaL_3 0.99
PF13602ADH_zinc_N_2 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG06542-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductasesEnergy production and conversion [C] 1.98
COG1364Glutamate N-acetyltransferase (ornithine transacetylase)Amino acid transport and metabolism [E] 1.98
COG0578Glycerol-3-phosphate dehydrogenaseEnergy production and conversion [C] 0.99
COG0644Dehydrogenase (flavoprotein)Energy production and conversion [C] 0.99
COG0665Glycine/D-amino acid oxidase (deaminating)Amino acid transport and metabolism [E] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A63.37 %
All OrganismsrootAll Organisms36.63 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002560|JGI25383J37093_10052924All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1305Open in IMG/M
3300002561|JGI25384J37096_10008032All Organisms → cellular organisms → Bacteria3945Open in IMG/M
3300002908|JGI25382J43887_10168318All Organisms → cellular organisms → Bacteria1091Open in IMG/M
3300005172|Ga0066683_10094239All Organisms → cellular organisms → Bacteria → Acidobacteria1812Open in IMG/M
3300005174|Ga0066680_10477089All Organisms → cellular organisms → Bacteria → Acidobacteria786Open in IMG/M
3300005180|Ga0066685_10040879All Organisms → cellular organisms → Bacteria → Acidobacteria2938Open in IMG/M
3300005332|Ga0066388_100089701All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3491Open in IMG/M
3300005450|Ga0066682_10071946All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_2_68_142136Open in IMG/M
3300005536|Ga0070697_100001169All Organisms → cellular organisms → Bacteria19920Open in IMG/M
3300005536|Ga0070697_100089454All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_2_68_142544Open in IMG/M
3300006034|Ga0066656_10025267All Organisms → cellular organisms → Bacteria3239Open in IMG/M
3300006903|Ga0075426_10301882All Organisms → cellular organisms → Bacteria → Proteobacteria1171Open in IMG/M
3300007255|Ga0099791_10004499All Organisms → cellular organisms → Bacteria → Acidobacteria5721Open in IMG/M
3300007255|Ga0099791_10035808All Organisms → cellular organisms → Bacteria → Acidobacteria2192Open in IMG/M
3300009012|Ga0066710_100025546All Organisms → cellular organisms → Bacteria6714Open in IMG/M
3300009012|Ga0066710_100093562All Organisms → cellular organisms → Bacteria → Acidobacteria3985Open in IMG/M
3300009012|Ga0066710_100745479All Organisms → cellular organisms → Bacteria → Acidobacteria1497Open in IMG/M
3300009137|Ga0066709_100129314All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3177Open in IMG/M
3300012199|Ga0137383_10017275All Organisms → cellular organisms → Bacteria → Acidobacteria4951Open in IMG/M
3300012203|Ga0137399_10203236All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1609Open in IMG/M
3300012206|Ga0137380_10442924All Organisms → cellular organisms → Bacteria → Acidobacteria1149Open in IMG/M
3300012207|Ga0137381_10076958All Organisms → cellular organisms → Bacteria → Acidobacteria2798Open in IMG/M
3300012209|Ga0137379_10018296All Organisms → cellular organisms → Bacteria6718Open in IMG/M
3300012351|Ga0137386_10251792All Organisms → cellular organisms → Bacteria → Acidobacteria1270Open in IMG/M
3300012362|Ga0137361_10354768All Organisms → cellular organisms → Bacteria → Acidobacteria1346Open in IMG/M
3300012685|Ga0137397_10000301All Organisms → cellular organisms → Bacteria33733Open in IMG/M
3300012929|Ga0137404_10093458All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_2_68_142411Open in IMG/M
3300012930|Ga0137407_10164290All Organisms → cellular organisms → Bacteria → Acidobacteria1972Open in IMG/M
3300012930|Ga0137407_10189394All Organisms → cellular organisms → Bacteria → Acidobacteria1840Open in IMG/M
3300017654|Ga0134069_1086137All Organisms → cellular organisms → Bacteria → Acidobacteria1015Open in IMG/M
3300026296|Ga0209235_1042917All Organisms → cellular organisms → Bacteria → Acidobacteria2237Open in IMG/M
3300026297|Ga0209237_1122912All Organisms → cellular organisms → Bacteria → Acidobacteria1086Open in IMG/M
3300026540|Ga0209376_1047914All Organisms → cellular organisms → Bacteria → Acidobacteria2487Open in IMG/M
3300027655|Ga0209388_1024127All Organisms → cellular organisms → Bacteria → Acidobacteria1718Open in IMG/M
3300031720|Ga0307469_10246517All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1434Open in IMG/M
3300031820|Ga0307473_10003330All Organisms → cellular organisms → Bacteria → Acidobacteria4830Open in IMG/M
3300032180|Ga0307471_101071757All Organisms → cellular organisms → Bacteria → Acidobacteria972Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil36.63%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil23.76%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil18.81%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil7.92%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.96%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil2.97%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.98%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.98%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.99%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25383J37093_1005292433300002560Grasslands SoilMRGNLWSRLGLVAGVTAATFLSGVPAWSKGWSKGPKDGAPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREVATQADLADSRSGQDDRLQPARHEPDPVTAEPSCATGTLVHTPDGYDYQHEMRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL*
JGI25384J37096_1000803253300002561Grasslands SoilMRGDRRFWSVLAGVVATAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFANPRAGADDPMTGRTDRNSAPAGPPEPLCSPGTLMRTQDGFNYTHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL*
JGI25384J37096_1002982713300002561Grasslands SoilGDRRFWSVLAGVVATAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFANPRAGADDPMTGRTDRNSAPAGPPEPLCSPGTLMRTQDGFNYTHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL*
JGI25382J43887_1016831833300002908Grasslands SoilSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFANPRAGADDPMTGRTDRNSAPAGPPEPLCSPGTLMRTQDGFNYTHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL*
JGI25382J43887_1032717013300002908Grasslands SoilMRGDRRFWSVLAGVVATAMIAGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVMQADFANPRTGADDPMTARTDRTNAPAGPPEPLCSTGTLVRTQDGFNYTHEMRFERPGTFRVRLTMVDATGHRAISNTVQVNAL*
JGI25382J43887_1033439523300002908Grasslands SoilAWSKGWSKGPKESEPALWIAADRVVGFVPFTVALYGKVRGAAELNRFELCRELAVQADLANSRNGEDDRMQPARHEPETGPAEPSCAPGTLVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQAGHRLISNTVQVNAL*
Ga0066683_1008715123300005172SoilMRGDRRFWSVLAGVVVTAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFASPRAGADDPMTGRTDRNNAPAEPQEPLCSTGTFMRTQDGFNYAHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL*
Ga0066683_1009423923300005172SoilMRGNLWSRLGLAAGVTAVTFLSGVPAWSKGWSKGPKDGVPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREVAAQADLADSRSGQDERMQPARHEPDPAAAEPSCATGTLVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL*
Ga0066683_1060205623300005172SoilGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPDRLELCREPAMQTDVAGSRGGADDPMAERPDRNTAPAARQDPVCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVMSNTVQVNAL*
Ga0066680_1047708913300005174SoilVGFVPFTVALYGKVRGAAELNRFELCRELAVQADLANSRNGEDDRMQPARHEPETGPAEPSCAPGTLVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQAGHRLISNTVQVNAL*
Ga0066685_1004087923300005180SoilMRGNLWSRLGLAAGVTAVTFLSGVPAWSKGWSKGPKDGVPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREVATQVDLADSRSGQDERMQPARHEPDPAAAEPSCATGTLVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL*
Ga0066685_1074022113300005180SoilMRGDRRFWSVLAGVVATAIIVGSGPAWSKGSRSKDGTSALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFANPRAGADDPMTGRTDRNSAPAGPPEPLCSPGTLMRTQDGFNYTHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL*
Ga0066685_1098916913300005180SoilMRGTWRSRSVLAGVVTGVMLFGGGPAWSKGLRAKDGASALWIAADRVVGFVPLTVTLYGKVPGSAEPARLELCRDVVIQADFASPRAGADDPMTGRTDRNNAPAEPQEPLCSTGTFMRTQDGFNYAHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL*
Ga0066676_1010729943300005186SoilSALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFASPRAGADDPMTGRTDRNNAPAEPQEPLCSTGTFMRTQDGFNYAHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL*
Ga0066676_1058795813300005186SoilMRGNLWSRLGLAAGVTAVTFLSGVPAWSKGWSKGPKDGVPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREVATQADLADSRSGQDDRLQPARHEPDPVTAEPSCATGTLVRTPDGYDYQHEIRFDRPGTYRVRLSMVDQGGHRVIS
Ga0066676_1097982513300005186SoilALAAGLAAAMLCGSVPAWSKGWSKGAKDTAPALWIAADRVIGFVPFTVSLYGKVRSVTEPSRLELCREVAAQADMTGARDGEDDRMQPARHEQPDGAPTEPSCATGTIVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQTGHRVVSNTVQVNAL*
Ga0066388_10008970133300005332Tropical Forest SoilMREGVKGWCGLAVAGVAALSLMTGPAWSKGSRGKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPNRLELCREMAMQADVGGSRGGSDDPFAARSDRGAAPSGPQEPICATGTLVRTHDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVMSNTVQVNAL*
Ga0066686_1085523413300005446SoilYGKVRSVTEPSRLELCREVAAQADMTGARDGEDDRMQPARHEQPDGAPTEPSCATGTIVRTPDGYDYQHEMRFDRPGTYRIRLSMVDQTGHRVVSNTLQVNAL*
Ga0066682_1007194623300005450SoilMRERRMCWSLLAGGVVAAMILSTGPTWAKGSRFKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPDRLELCREPAMQTDVAGSRGGADDPMAERPDRNTAPAARQDPVCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVMSNTVQVNAL*
Ga0066682_1025959213300005450SoilPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREIAAQADLADSRSGQDERMQPARHEPDPAAAEPSCATGTLVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL*
Ga0066682_1079062323300005450SoilGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVISNTVQVNAL*
Ga0066681_1070392313300005451SoilMRKRRRCRSVLAGGVVAAMILGIGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVISNTVQVNAL*
Ga0070697_100001169223300005536Corn, Switchgrass And Miscanthus RhizosphereMRENQWSRSALAAALAAAMLCGSVPAWSKGWSKGAKDGAPALWIAADRVVGFVPFTVSLYGKVRSVTEPGRLELCREVAAQTDMAGARDGEDDRMQPARHEQPDGAPAEPSCAAGTVVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQTGHRVVSNTVQVNAL*
Ga0070697_10008945443300005536Corn, Switchgrass And Miscanthus RhizosphereMRERRMCWSLLAGGVVAAMILGTGPAWAKGSRSKDGPPALWIAADRVVGFVPFTVSLYGKVLGSGEPDRLELCRELAMQTDVAGSHGGADDPMAERPDRNTAPAAHQDPVCASGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVMSNTVQVNAL*
Ga0066704_1018949913300005557SoilMRERRRCRSVLAGGVVAAVILGIGPAWSKGTRAKDGPPGLWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAIQADAAGSRGGADDPMAERPDRNTGPAAQQDPLCAAGTLVRTRDGFDYSHEMRFDRPGTYRVRLS
Ga0066698_1048526813300005558SoilMRENQRSRLALAAGLAAAMLCGSVPAWSKGWSKGAKDTAPALWIAADRVIGFVPFTVSLYGKVRSVTEPSRLELCREVAVQADMTGGRDPEDERMQPARHEPDGAAAEPPCATGTVVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQTGHRVVSNTVQVNAL*
Ga0066691_1020346423300005586SoilMRGNVWSRILLAAGVIGATLLGGGPAWSKGWSKGPKDGAPALWIAADRVVGFVPFTVALYGKVRSAAEPNRLELCREIATQADLADSRSGLDDRMQPARHEPETGSTEPVCATGTLVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQAGHRVISNTVQVNAL*
Ga0066905_10086487623300005713Tropical Forest SoilMRVSRRAWAVPLGIVAAAVILGSGPAWSKGSRAKDGKDGTSALWIAADRVVGFVPFTVTLYGKVPGGAEPSRLELCRDAPLQVDPANPRAGADDPMMGRAERANPSAAPLEPLCATGRLVRTQTGYDYTHEMRFDQPGSYRVRLNMVDATGHRVISNTVQVNAF*
Ga0066905_10185831413300005713Tropical Forest SoilMRVILRAWAVPLGIVAAAVILGSGPAWSKGSRSKDGKDGTSGLWIAADRVVGFVPFTVTLYGKVPGGAEPSRLELCRDVPLQVDPAGPRAGADDPMMGRTERANPSLAPLEPLCATGRLVRTQNGFDYTHEMRFDQPGSYRVRLNMVDA
Ga0066656_1002526723300006034SoilMRGTWRSRSVLAGVVTGVMLFGGGPAWSKGLRAKDGASALWIAADRVVGFVPLTVTLYGKVPGSAEPGRLELCRDVAMQPDFAGPGSRAGAEDPMMGRTERNTATSAPPEPVCSSGTLVRTQDGFDYRHEMRFDRPGTYRVRLSMVDAAGHRVISNTVQVNAL*
Ga0066656_1045753613300006034SoilMRGNLWSRLGLAAGVTAVTFLSGVPAWSKGWSKGPKDGVPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREVATQVDLADSRSGQDERMQPARHEPDPAAAEPSCATGTLVRTPDGYDYQHEIRFDRPGTYRVRLSMVDQGGHR
Ga0075426_1030188233300006903Populus RhizosphereMREGLKSWLGLGLAAAVAMSAGAGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSTEPSRLELCRELAMQTDVGGSRGVADDPFAGRSDRGAGPSGSQEPVCATGTLVRTRDGFDYTHEMRFDRPGTYRVRLSMVDAGGHRVMSNTVQVNAL*
Ga0099791_1000449943300007255Vadose Zone SoilVLAGGVVAAMILGIGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVTGSRGGGDDPMAERPDRGTGPAAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVMSNTVQVNAL*
Ga0099791_1003580823300007255Vadose Zone SoilVAVGFVAAATILGGGSAWSKGSRAKDAKDGVPAMWIAADRVVGFVPLTVTLYGKVPGTVEPSRLELCRDVATQTDFANPRGAAEDPMTPRADRNNNAPAGPPEPLCSSGKLVRTQDGFDYTHEMRFDRAGTYRVRLTMVDADGHRVISNTVQVNAL*
Ga0099793_1045666713300007258Vadose Zone SoilMILGIGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDESGHRVMSNTVQVNAL*
Ga0066710_10002554633300009012Grasslands SoilMRGTWRSRSVLAGVVTGVMLFGGGPAWSKGLRAKDGASALWIAADRVVGFVPLTVTLYGKVPGSAEPGRLELCRDVAMQPDFAGPGSRAGAEDPMMGRTERNTATSAPPEPVCSSGTLVRTQDGFDYRHEMRFDRPGTYRVRLSMVDAAGHRVISNTVQVNAL
Ga0066710_10009356223300009012Grasslands SoilMRESRMCWSLLAGGVVAAMILSTGPAWAKGSRSKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPDRLELCRELAMQTDVAGSRGGADDPMSERPDRNTAPAARQDPVCATGTLVPTRDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVMSNTVQVNAL
Ga0066710_10042983343300009012Grasslands SoilMRGDRRFWSVLAGVVVTAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFASPRPGADDPMTGRTDRNNAPAEPQEPLCSTGTFMRTQDGFNYAHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL
Ga0066710_10074547913300009012Grasslands SoilMRERRRCRSVLAGGVVAAMILGIGPVWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSDEPARLELCREQAMQADAAAGSRGGIDDPMAERPDRNTGPAAHQDPLCASGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVISNTVQVNAL
Ga0066710_10140462513300009012Grasslands SoilMRGNLWSRLGLAAGVTAVTFLSGVPAWSKGWSKGPKDGVPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREIAAQADLADSRSGQDERMQPARHEPDPAAAEPSCATGTLVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL
Ga0099829_1077382213300009038Vadose Zone SoilVLAGGVVAAVILGIGPAWSKGSRAKDGPPGLWIAADRVVGFVPFTVPLYGKVLGSAEPARLELCRELAIQADAAGSRGGADDPMAERPDRNTGPAAHQDPLCAAGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMV
Ga0066709_10012931423300009137Grasslands SoilVLAGGVVAAMILGIGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVISNTVQVNAL*
Ga0066709_10031960923300009137Grasslands SoilVLAGGVVVAMALGTGAAWSKGTRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGGVEPNRLELCREQAAQTDVAGSRGGADAPIADRPDRNASPAGLQDSMCATGTLVRTRDGFDYSHEMRFDQPGTYRVRLSMVDAAGHRVISNTVQVNAL*
Ga0066709_10075771623300009137Grasslands SoilMRGDRRFWSVLAGVVATAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFASPRAGADDPMTGRTDRNNAPAEPQEPLCSTGTFMRTQDGFNYAHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL*
Ga0126382_1125855913300010047Tropical Forest SoilAVILGSGPAWSKGSRAKDGKDGTSALWIAADRVVGFVPFTVTLYGKVPGSAEPSRLELCRDAPLQVDPANPRAGADDPMMGRAERANPSAAPLEPLCATGRLVRTQTGYDYTHEMRFDQPGSYRVRLNMVDATGHRVISNTVQVNAF*
Ga0134088_1014176323300010304Grasslands SoilMRENQRSRLALAAGLAAAMLCGSVPAWSKGWSKGAKDTAPALWIAADRVIGFVPFTVSLYGKVRSVTEPSRLELCREVAVQADMTGGRDPEDERMQPARHEPDGAAADPPCATGTVVRTPDGYDYQHEMRFDRPGTYR
Ga0134088_1070146713300010304Grasslands SoilKGWSKGPKDGVPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREVATQVDLADSRSGQDERMQPARHEPDPAAAEPSCATGTLVRTPDGYDYQHEIRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL*
Ga0126376_1117908023300010359Tropical Forest SoilMREGLKGWGTLALAGVAALSMSAGPAWSKGSRVKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPNRLELCRELAMQADVGGSRGGSDDPFGARSDRSAGAAGPQEPICATGTLVRTHDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVMSNTVQVNAL*
Ga0134121_1281121023300010401Terrestrial SoilSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGNTEPARLELCRELAMQADVAGARGGADDPMGERGDRSAALQGAQEPVCATGTLVRTRDGFDYTHEMRFDRPGTYRVRLSMVDAGGHRVMSNTVQVNAL*
Ga0137383_1001727533300012199Vadose Zone SoilMRGNLWSRLALGAGVIGTTLLGGVPAWSKGWSKGPRDGAPALWIAADRVVGFVPFTVALYGKVRSATEPGRLELCREPAAQADLADSQDALNDRAQPARREPDNGPAEPSCAAGTLVRTPDGYDYRHEMRFDRPGTYRVRLSMVDQSGHRVISNTVQVNAL*
Ga0137383_1108773813300012199Vadose Zone SoilVLAGGVVAAMILGIGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVISNTVQV
Ga0137365_1044931013300012201Vadose Zone SoilMLATSGSRSVLVGVVVAATILGSGPVWSKGSRAKDSGAPALWIAADRVVGFVPFTVSLYGKVLASVEPGRLELCREVAMPADLSGGHGAGDDPMAARTDRSVPPAAPPEPVCSPGTLVRTQDGYDYAHEMRFDRPGTYRVRLSMVDA
Ga0137365_1123569813300012201Vadose Zone SoilVLAGGVVAAVILGIGPAWSKGLRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHPDPLCDTGTLVRTRDGFDYSHDMRFDRPGTYRVRLSMVDATGHRVMSNT
Ga0137363_1014177123300012202Vadose Zone SoilVLAGGVVAAMILGIGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADAAGSRGGMDDPMAERLDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVMSNTVQVNAL*
Ga0137399_1012157023300012203Vadose Zone SoilMAAGVVVAATILGGGTAWSKGSRAKDVKEGVQAMWIAADRVVGFVPLPVTLYGKVPGTVEPSRLELCRDVATQTDFANPRGAAEDPMTSRSERSNNAPVGPPEPLCSTGKLVRTQDGFDYTHEMRFDRAGTYRVRLTMVDADGHRVISNTVQVNAL*
Ga0137399_1020323623300012203Vadose Zone SoilMLGNLWSRSTLVAGVVAATFCWNVPAWSKGWLKGPKDGAPALWIAADRVVGFVPFTVALYGKVRSAVEPNRLELCRELAIQVDFANSRNGDEGGMQPARHEPDTGPAEPSCAPGTLVRTPDGYDYKHEIRFDQPGTYRVRLSMVDQTGRRMISNTVQVNAL*
Ga0137362_1137217113300012205Vadose Zone SoilVLAGGVVAAMILGIGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVMSNTVQVNAL*
Ga0137380_1007168013300012206Vadose Zone SoilVLAGGVVAAMILGIGPGWSKGLRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADAAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVMSNTVQVNAL*
Ga0137380_1044292413300012206Vadose Zone SoilLWIAADRVVGFVPFTVSLYGKVLGSAEPDRLELCRELAMQTDVAGSHGGADDPMAERPDRSTAPAARQDPVCASGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVMSNTVQVNAL*
Ga0137381_1002715513300012207Vadose Zone SoilVLAGGVVAAMILGIGPAWSKGSCAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADAAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVMSNTVQVNAL*
Ga0137381_1007695813300012207Vadose Zone SoilVWSRLLLAAGLLGTTLLGSGPVWSKGWSKGPKDGAPALWIAADRVVGFVPLTVALYGKVRSATEPSRLELCREVAVQADLADSQDTQDDRMQPARREPDNGPPEASCAAGTLVRTPDGYDYRHEMRFDRPGTYRVRLSMVDQSGHRVISNTVQVNAL*
Ga0137379_10018296123300012209Vadose Zone SoilMRGNLWSRLALGAGVIGTTLLGGAPAWSKGWSKGPRDGAPALWIAADRVVGFVPFTVALYGKVRSATEPGRLELCREPAAQADLADSQDTQDDRMQPARREPDNGPPEASCAAGTLVRTPDGYDYRHEMRFDRPGTYRVRLSMVDQSGHRVISNTVQVNAL*
Ga0137387_1004535333300012349Vadose Zone SoilVLAGGVVAAMILGIGPAWSKGSLAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADAAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVMSNTVQVNALYRIPP
Ga0137386_1025179233300012351Vadose Zone SoilVGFVPLTVALYGKVRSATEPSRLELCREVAVQADLADSRNGPDDRMQPAPHEAETGSTEPACAPGTLVRTPDGYDYQHEIRFDRPGTYRVRLSMVDQAGHRVISNTVQVNAL*
Ga0137386_1051157413300012351Vadose Zone SoilMCWSLLAGGVVAAMILSTGPAWAKGSRSKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPDRLELCRELAMQTDVAGSRGGTDDPMSERPDRNTAPAARQDPVCATGTLVPTRDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVMSNTVQVNAL*
Ga0137386_1086069213300012351Vadose Zone SoilVLAGGVVAAMILGIGPAWSKGSCAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPIAERPDRNTGPPAHQDPLCDTGTLVRTRDGFDYSHEMRFDQPGTYRVRLSMVDASGHRVMSNTVQVNAL*
Ga0137361_1035476823300012362Vadose Zone SoilMCWSLLAGGVVAAMILSTGPAWAKGSRSKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPDRLELCREPAMQTDVAGSRGGADDPMSERPDRNTAPAARQDPVCATGTLVPTRDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVMSNTVQVNAL*
Ga0137361_1096056913300012362Vadose Zone SoilVLAGGVVAAMILGIGPVWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGLPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDATGHRVMSNTVQVNAL*
Ga0137397_10000301183300012685Vadose Zone SoilVLAGGVVAAMILGIGPVWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMADASGHRVMSNTVQVNAL*
Ga0137397_1038450713300012685Vadose Zone SoilMAAGVVVAATILGGGTAWSKGSRAKDAKEGVPAMWIAADRVVGFVPLTVTLYGKVPGTVEPSRLELCRDVATQTDFANPRGAAEDPMTSRSERSNNAPVGPPEPLCSTGKLVRTQDGFDYTHEMRFDRAGTYRVRLTMVDADGHR
Ga0137394_1030361013300012922Vadose Zone SoilMAAGVVVAATILGGGTAWSKGSRAKDAKEGVPAMWIAADRVVGFVPLTVTLYGKVPGTVEPSRLELCRDVATQTDFANPRGAAEDPMTSRSERSNNAPVGPPEPLCSTGKLVRTQDGFDYTHEMRFDRAGTYRVRLTMVDADGHRVISNTVQ
Ga0137419_1009217733300012925Vadose Zone SoilMAAGVVVAATILGGGTAWSKGSRAKDVKEGVQAMWIAADRVVGFVPLTVTLYGKVPGTVEPSRLELCRDVATQTDFANPRGAAEDPMTSRSERSNNAPVGPPEPLCSTGKLVRTQDGFDYTHEMRFDRAGTYRVRLTMVDADGHRVISNTVQVNAL*
Ga0137419_1027168813300012925Vadose Zone SoilVLAGGVVAAMILGIGPAWSKGLRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVMSNTVQVNAL*
Ga0137416_1024237813300012927Vadose Zone SoilVLAGGVVAAMILGTGTAWSKGTLAKDAPPALWIAADRVVGFVPFTVSLYGKVLGSVEPNRLELCREQAAQTDVAGSRGGAEDPMADRPDRSAAPAGPQDAICATGTLVRTREGFDYSHEMRFDRPGTYRLRLSMVDSTGHRVMSNTVQVNA
Ga0137404_1009345823300012929Vadose Zone SoilVLAGGVVAAMILGIGPVWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMVDASGHRVMSNTVQVNAL*
Ga0137407_1011168543300012930Vadose Zone SoilVLAGGVVAAMILGIGPVWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLDLCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMADASGHRVMSNTVQVNAL*
Ga0137407_1016429043300012930Vadose Zone SoilMAAGVVVAATILGGGAAWSKGSRAKDAKEGVPAMWIAADHVVGFVPLTVTLYGKVPGTVDPSRLELCRDAATQTDFANPRGAAEDQMTSRSERSNNAPVGPPEPLCSTGKLVRTQDGFDYTHEMRFDRAGTYRVRLTMVDADGHRVISNTVQVNAL*
Ga0137407_1018939443300012930Vadose Zone SoilMLFGGGPAWSKGSRAKDGASALWIAADRVVGFVPLTVTLYGKVPGSAEPGRLELCRDVAMQPDFAGPGSRAGAEDPMMGRTERNTAPTAPSEPVCSTGTLVRTQDGFDYRHEMRYDRPGTYRVRLSMVDAAGHRVISNTVQVNAL*
Ga0134075_1003223013300014154Grasslands SoilMRGDRRFWSVLAGVVVTAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFASPRPGADDPMTGRTDRNNAPAGPPEPLCSPGTLMRTQDGFNYTHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL*
Ga0134075_1026190323300014154Grasslands SoilVTAVTFLSGVPAWSKGWSKGPKDGVPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREIAAQADLADSRSGQDERRQPARHEPDPAAAEPSCATGTLVHTPDGYDYQHEMRFDRPGTYRVRLS
Ga0137418_1114255513300015241Vadose Zone SoilVLAGGVVAAMILGIGPAWSKGLRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTY
Ga0137409_10004512213300015245Vadose Zone SoilVLAGGVVAAMILGIGPAWSKGSRAKDGPPALWIAADRVVGFVPFTVSLYGKVLGSAEPARLELCRELAMQADVAGSRGGMDDPMAERPDRNTGPPAHQDPLCATGTLVRTRDGFDYSHEMRFDRPGTYRVRLSMADASGHRVMSNTVQVNAL*
Ga0134089_1004490413300015358Grasslands SoilIYITKSWLHIGNWRRWYVSRIKQGGGPMRGDRRFWSVLAGVVATAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFANPRAGADDPMTGRTDRNSAPAGPPEPLCSPGTLMRTQDGFNYTHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL*
Ga0134069_108613713300017654Grasslands SoilTFLSGVPAWSKCWSKGPKEGAPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREIAAQADLADSRSGQDERMQPARHEPDPAAAEPSCATGTLVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL
Ga0134083_1004403533300017659Grasslands SoilMRGDRRFWSVLAGVVATAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFASPRPGADDPMTGRTDRNNAPAGPPEPLCSPGTLMRTQDGFNYTHEMRFDRPGTYRVRLTMVDATGHRAISNTVQVNAL
Ga0134083_1026951123300017659Grasslands SoilMRENQRSRLALAAGLAAAMLCGSVPAWSKGWSKGAKDTAPALWIAADRVIGFVPFTVSLYGKVRSVTEPSRLELCREVAAQADMTGGRDPEDERMQPARHEPDGAAAEPPCATGTVVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQTGHRVVSNTVQVNAL
Ga0209235_104291733300026296Grasslands SoilMRGNLWSRLGLVAGVTAATFLSGVPAWSKGWSKGPKDGAPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREVATQADLADSRSGQDDRLQPARHEPDPVTAEPSCATGTLVHTPDGYDYQHEMRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL
Ga0209235_107328523300026296Grasslands SoilMRGDRRFWSVLAGVVATAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFANPRAGADDPMTGRIDRNSAPAGPPEPLCSPGTLMRTQDGFNYTHEMRFDRPGTYR
Ga0209237_106537533300026297Grasslands SoilMRGNLWSRLGLVAGVTAATFLSGVPAWSKGWSKGPKDGAPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREVATQADLADSRSGQDDRLQPARHEPDGYDYQHEMRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL
Ga0209237_112291213300026297Grasslands SoilAWSKGWSKGPKESEPALWIAADRVVGFVPFTVALYGKVRGAAELNRFELCRELAVQADLANSRNGEDDRMQPARHEPETGPAEPSCAPGTLVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQAGHRLISNAVQVNAL
Ga0209761_101560223300026313Grasslands SoilMRGDRRFWSVLAGVVATAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFANPRAGADDPMTGRTDRNSAPAGPPEPLCSPGTLMRTQDGFNYTHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL
Ga0209470_101462673300026324SoilMRGDRRFWSVLAGVVVTAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFASPRAGADDPMTGRTDRNNAPAEPQEPLCSTGTFMRTQDGFNYAHEMRFDRPGTYRVRLTMVDATGHRAISNAVQVNAL
Ga0209058_101406723300026536SoilMRGDRRFWSVLAGVVATAIIVGSGPAWSKGSRSKDGASALWIAADRVVGFVPFTVSLYGKVPGSAEPARLELCRDVVIQADFANPRAGADDPMTGRTDRNSAPAGPPEPLCSPGTLMRTQDGFNYTHEMRFDRPGTYRVRLTMVDATGHRAISNTVQVNAL
Ga0209157_110462733300026537SoilMRGNLWSRLGLAAGVTAVTFLSGVPAWSKGWSKGPKDGVPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREVAAQADLADSRSGQDERMQPARHEPDPAAAEPSCATGTLVRTPDGYDYQHEMRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL
Ga0209376_104791423300026540SoilMRGNLWSRLGLAAGVTAVTFLSGVPAWSKGWSKGPKDGVPALWIAADRVVGFVPFTVALYGKVRSATEPSRLELCREVATQVDLADSRSGQDERMQPARHEPDPAAAEPSCATGTLVRTPDGYDYQHEIRFDRPGTYRVRLSMVDQGGHRVISNTVQVNAL
Ga0209388_102412723300027655Vadose Zone SoilMRGIRGSWTVAVGFVAAATILGGGSAWSKGSRAKDAKDGVPAMWIAADRVVGFVPLTVTLYGKVPGTVEPSRLELCRDVATQTDFANPRGAAEDPMTPRADRNNNAPAGPPEPLCSSGKLVRTQDGFDYTHEMRFDRAGTYRVRLTMVDADGHRVISNTVQVNAL
Ga0137415_1004350113300028536Vadose Zone SoilMRKRRRCWSVLAGGVVAAMILGTGTAWSKGTLAKDAPPALWIAADRVVGFVPFTVSLYGKVLGSVEPNRLELCREQAAQTDVAGSRGGAEDPMADRPDRSAAPAGPQDAICATGTLVRTREGFDYSHEMRFDRPGTYRLRLSMVDSTGHRVMSNTVQVNA
Ga0307469_1024651713300031720Hardwood Forest SoilIAADRVVGFVPFTVSLYGKVTGNIEPGRLELCRDLAIPPDLATARPGADEPVTARSERNGAPAGLPEPSCSTGTLMRTQDGFDYRQEMRFDRPGTYRVRLTMVDATGHRAISNVVQVNAL
Ga0307473_1000333063300031820Hardwood Forest SoilMRVSRRAWAVPLGIVAAAVILGSGPAWSKGSRAKDGKDGTSALWIAADRVVGFVPFTVTLYGKVPGSTEPSRLEMCRDAPLQVDPANPRAGADDPMMGRTERTNPSAAPLEPLCATGRLVRTQTGFDYTHEMRFDQPGSYRVRLNMVDAMGHHVISNTVQVNAF
Ga0307471_10107175713300032180Hardwood Forest SoilVLAGVVTGVMLFGGGPAWSKGSRSKDSASALWIAADRVVGFVPFTVTLYGKVPGSAEPGRLELCRDGAMQTDFAGPASRGGAEDPMTARNERNTVPPTAPPEPACSTGTLVRTQDGFDYKHEMRFDRPGTYRVRLSMVDATGHRVISNTVQVNAL
Ga0307471_10360202623300032180Hardwood Forest SoilKGSRAKDARDGVPAMWIAADRVVGFAPLTVTLYGKVPGSVEPSRLELCRDAATQADFSNPRGASEDPMTSRADRGNNAPAGPPEPLCSTGKLVRTQDGFDYTHEMRFDRAGTYRVRLTMVDADGHRVISNTVQVNAL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.