NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F106045

Metagenome Family F106045

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F106045
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 194 residues
Representative Sequence FFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRDVFLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILGMGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR
Number of Associated Samples 68
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 5.00 %
% of genes from short scaffolds (< 2000 bps) 4.00 %
Associated GOLD sequencing projects 64
AlphaFold2 3D model prediction Yes
3D model pTM-score0.35

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (95.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(53.000 % of family members)
Environment Ontology (ENVO) Unclassified
(51.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(56.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: Yes Secondary Structure distribution: α-helix: 71.43%    β-sheet: 0.00%    Coil/Unstructured: 28.57%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.35
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF05343Peptidase_M42 14.00
PF13692Glyco_trans_1_4 7.00
PF02350Epimerase_2 4.00
PF16861Carbam_trans_C 3.00
PF01370Epimerase 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG1362Aspartyl aminopeptidaseAmino acid transport and metabolism [E] 14.00
COG1363Putative aminopeptidase FrvXCarbohydrate transport and metabolism [G] 14.00
COG2195Di- or tripeptidaseAmino acid transport and metabolism [E] 14.00
COG0381UDP-N-acetylglucosamine 2-epimeraseCell wall/membrane/envelope biogenesis [M] 4.00
COG0707UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferaseCell wall/membrane/envelope biogenesis [M] 4.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A95.00 %
All OrganismsrootAll Organisms5.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300012929|Ga0137404_12259370All Organisms → cellular organisms → Bacteria → Acidobacteria509Open in IMG/M
3300020583|Ga0210401_10093325All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2828Open in IMG/M
3300021180|Ga0210396_11407765All Organisms → cellular organisms → Bacteria → Acidobacteria576Open in IMG/M
3300031720|Ga0307469_10310688All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1303Open in IMG/M
3300031754|Ga0307475_10990699All Organisms → cellular organisms → Bacteria → Acidobacteria661Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil53.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil11.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil11.00%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil8.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil3.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.00%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds2.00%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil2.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300002906Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cmEnvironmentalOpen in IMG/M
3300002910Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cmEnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006162Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027667Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027727Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1037193213300001593Forest SoilFSGAKGGFSSELFFPGLVAYIILILMAPAYNSFAYESAGVQTYFTAPLQFRNVFLGKNFVQVALVATELTLCIAAFCYRVGMPSAPIFAATLAAIVFTLIGQLSLANWSSLSFPRKLVFGQVHGQRQSGMAVLVAFGAQILFFGISSLVLGLGRWTGDRWLPTKAFVLLAAAAVAGYVASLDALTSYAEKNKEKLIEALCR*
JGI12053J15887_1023602513300001661Forest SoilRAEWDADGLSLLSPQVAAIIRKEVRYLLRNGFAALLLFLPPVLVFTLISQSSLLLFMGSKGISPELFFPGLVAYIVLILMTPAYNSFAYENAGVQTYFTAPLRFRDVFLGKNFVQVCLIVTELTLCIAAFCYRVGSPSAPIFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSRMAVLIAFAAQILLFAISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEILIEALCR*
JGI25614J43888_1016357813300002906Grasslands SoilVRKEIRYLLRNGFAALLLLVPPILVFALISQAMLFRLSQAKSGISPELFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRDIFLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILGMGRWTGDRWLP
JGI25615J43890_101364723300002910Grasslands SoilFFPGLVAYIILILMAPAYNSFAYENTGIQTYFTAPLQFRDVFLGKNFVQVSSIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILGMGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0062595_10058397523300004479SoilIILILMAPAYNSFAYESSGVQTYFTSPLPFRNVFLGKNFVQVSLIAAELTLCIAAFSYRLGLPSLPVLIATLVAIIFTVVGQLSIANWSSLSFPRKLAFGQVYGQRQSGMAVLVAFSAQILLFGISLLVLMLGRWTGDRWLPAKAFMLLAGAAVGGYMAALDALTSYAEKKKEKLIEALCR*
Ga0066675_1072273013300005187SoilRFRDLFLGKNFVQACLILAELVLCMAAFSLRVGLPSPPAFVATLVAIIFTVVGQLSIANWSSLSFPRKLAFGQMHGQRQSGMAVLVAFGAQIVLFGISSVILMLGRWTGERWLPAEAFTLLAVAAVAGYVASLDALTVFAEKKKEILIEALCR*
Ga0070709_1066193513300005434Corn, Switchgrass And Miscanthus RhizosphereETARSEAREDVLALFSPQVAAVIRKEFHYLLRNGFAAMLLLLPPVLVFALISQASPLRFMTGKGVSPELFFPGLMGYIILVLMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALIGAELFLCIVAFSYRMGLPSAPVFVATLAAILFTVIGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFAAQILLFGISTLILALGTWTGDPWLPAKAFVLLAAAAIGGYVASLDPLTSYAEKKKEALIEALCR*
Ga0066661_1085891113300005554SoilFREVFLGKNFVQVCLIAIELTLCIAAFCYRVGTPSAPIFVATLAAIVFTVVGQLSIANWSSLSFPRKLTFGQIHGQRQSRMAVLIAFSAQILLFAISSLVLGVGRWTGDQWLPAKAFTLLAVAAMGGYVASLDPLTSYAEKKKEKLIDALCR*
Ga0066702_1090409913300005575SoilTLIHFAGTREGIPGDAFFPGLMAYIILVLMAPAYNSFAYESTGVQTYFTAPVGFRAVLLGKNFVQGCLILAELFLCIGAFSFRVGLPSPPVFVATLAAIIFTVIGQLSIANWSSLSFPRKLTFGQMHGQRQSGMAVLVAFGSQIVLFGIGSVILMLGRWTGERWLPAGTFALLSV
Ga0075023_10053009013300006041WatershedsVRYLLRNGFAALLLFLPPILVFALISQATMLSGLRKGIHPDLFFPGLVGYIILILMAPAYNSFAYENAGVLTYFTSPLRFRNVFLGKNFVQVSLIAAELALCIAAFSYRVGLPSLPVFMATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVYGQRQSGMAVLVAFGAQILLFGISLL
Ga0075030_10130603313300006162WatershedsMAPAYNSFAYESTGVQTYFTAPLQFRSVFLGKNFVQIALVVTELALCIAAFCYRVGMPSAPIFAATLAAIVFALIGQLSLANWSSLSFPRKLVFGQVHGQRQSGMAVLVAFGAQILFFGISSLVLGLGRWTGDRWLPAKAFVLLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0099829_1000294613300009038Vadose Zone SoilFLGKNFVQVALVTIELTLCIAAFCYRVGSPSAPIFIATLAAIIFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSSMAVLVAFCAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAVVGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0099829_1017590623300009038Vadose Zone SoilRKEIRYLLRNGFDALLLLLPPILVFALITQVTVFRFSGAKSSISAELSFPGLAAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALVATELTLCIAAFCYRVGWPSAPIFTATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAALGGYVASLDALASYAEKKKETLIEALCR*
Ga0099829_1019977713300009038Vadose Zone SoilVGYIILILMAPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNVVHVSLIAAELALCIAAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTDDRWLPAKAFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR*
Ga0099829_1080124423300009038Vadose Zone SoilALISQATVLSGFKKGIPTEAFFPGLVGYIILILMVPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNFVHVSLIAAELALCIAAFSYRVGLPSMPVFLATMAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQGGMAALVQFGSQILLFGISSVILMLGRWTGDRWLPAQVFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR*
Ga0099830_1032431213300009088Vadose Zone SoilIRFRDILLGKNFVQVCLILTELALCIVAFAYRVGLPSAPTFAATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISTLVLGLGRWTGDRWLPAKAFVLLAAAAVGGYVGSLDALTSYAEKKKEKLIEALCK*
Ga0099830_1061244413300009088Vadose Zone SoilRNGFAALLLFLPPILVFALISQATVLSGFKKGIPTEAFFPGLVGYLILILMAPAYNSFAYENTGVQVYFTAPLRFRAVFLGKNFVHVSLIAAELALCITAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTDDRWLPAKAFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR*
Ga0099828_1107323823300009089Vadose Zone SoilLRFRAVFLGKNFVHVSLIAAELTLCIAAFSYRVGLPSMPVFLATLAAIVFTVVGQLLNDNWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTDDRWLPAKAFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR*
Ga0099828_1134315213300009089Vadose Zone SoilAYNSFAYENTGVQTYFTAPLRFRAVFLGKNFVHVSLIAAELALCIAAFSYRVGLPSMPVFMATMAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQGGMAALVQFGSQILLFGISSLILMLGRWTGDRWLPAQVFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR*
Ga0099796_1057411813300010159Vadose Zone SoilLMAPAYNSLAYENTGIQTYFTAPLQFRNVFLGKNFVQVVLVATELSLCIAAFCYRVGSPSAPTFIATLAAIIFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSSMAVLVAFCAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVAALDASSSYAEKKKE
Ga0126370_1143281513300010358Tropical Forest SoilYFTAPLRFREIFLGKNFVQVCLLTTELALCIAAFSYRVGLPSPPIFVGTLTAIVFTVVGQLSVANWSSLSFPRKLAFGQLHGQRQSGMAVLVGFGVQILLFGIGALVLALGKWTGDRWLPAKAFALLSIAAIGGYMASLNALTNLAEKKKERLIEALCR*
Ga0137392_1093862123300011269Vadose Zone SoilYFTAPLRFRAVFLGKNFVHVSLIAAELALCITAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTDDRWLPAKAFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR*
Ga0137391_1009568333300011270Vadose Zone SoilAPLQFRNVFLGKNFVQVALVTIELTLCIAAFCYRVGSPSAPIFIATLAAIIFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSSMAVLVAFCAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAVVGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137391_1023423123300011270Vadose Zone SoilLRFRAVFLGKNFVHVSLIAAELALCIAAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTGDRWLPAKAFVLLAAAAAAGYIASLDALTAYAEKKKEKLIEALCR*
Ga0137391_1063207813300011270Vadose Zone SoilILVFALISQATVLSGFKKGIPTEAFFPGLVGYLILILMAPAYNSFAYENTGVQTYFTAPLRFRTVFLGKNFVHVSLIAAELVLCIAAFSYRVGLPSMPAFLATMAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTDDRWLPAKAFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR*
Ga0137393_1054724623300011271Vadose Zone SoilENTGVQVYFTAPLRFRAVFLGKNFVHVSLIAAELALCITAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTDDRWLPAKAFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR*
Ga0137393_1061891323300011271Vadose Zone SoilLVFALISQATVLSGFKKGIPTEAFFPGLVGYIILILMAPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNFIHVSLIAAELALCIAAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFSAQILLFGISSLILMLGRWTGDRWLPAKAFVLLAAAAAAGYIASLDALTAYAEKKKEKLIEALCR*
Ga0137389_1036781313300012096Vadose Zone SoilEFRYLVRNGFAALLLFLPPILVFALISQATVLSGFKKGIPTEAFFPGLVGYIILILMAPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNFIHVSLIAAELALCIAAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFSAQILLFGISSLILMLGRWTGDRWLPAKAFVLLAAAAAAGYIASLDALTAYAEKKKEKLIEALCR*
Ga0137389_1045672323300012096Vadose Zone SoilEFRYLVRNGFAALLLFLPPILVFALISQATVLSGFKKGIPTEAFFPGLVGYIILILMAPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNFVHVSLIAAELTLCIAAFSYRVGLPSMPVFMATMAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTGDRWLPAKAFVLLAAAAAAGYIASLDALTAYAEKKKEKLIEALCK*
Ga0137389_1123259313300012096Vadose Zone SoilLGLLSPQVAAVIRKEIRYLLRNGFAALLLLLPPILVFALITQVTVFRFSGAKSSISAELSFPGLAAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALVATELTLCIAAFCYRVGWPSAPIFTATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLMAFGAQILLFGISSLVLGLGRWTGDRWLPAKAF
Ga0137388_1021252313300012189Vadose Zone SoilEFRYLVRNGFAALLLFLPPILVFALISQATVLSGFKKGIPTEAFFPGLVGYIILILMAPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNFVHVSLIAAELTLCIAAFSYRVGLPSMPVFMATMAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQGGMAALVQFGSQILLFGISSLILMLGRWTGDRWLPAQVFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR*
Ga0137388_1068503113300012189Vadose Zone SoilEFRYLVRNGFAALLLFLPPILVFALISQATVLSGFKKGIPTEAFFPGLVGYLILILMAPAYNSFAYENTGVQVYFTAPLRFRAVFLGKNFVHVSLIAAELALCIAAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTDDRWLPAKAFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR*
Ga0137388_1100820023300012189Vadose Zone SoilLILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALVVTELTLCIAAFCYRVGWPPAPIFAATLAAIIFTLVGQLSIANWSSLSFPRRLAFGQLHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137363_1007592013300012202Vadose Zone SoilAPAYNSFAYENAGVQTYFTAPLQFRDVFLGKNFVQVALVATELTLCIAAFCYRVGRPSAPIFTATLAAIIFTLVGQLSIANWSSLSFPRRLAFGQVHGQRQSGMAVLVAFGAQILLFGISSVVLGLGRWTGDRWLPAKAFALLAAAAVAGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137363_1063387823300012202Vadose Zone SoilLQFRSVFLGKNFVQIALVATELALCIAAFCYRVGMPSAPIFAATLAAIIFALIGQLSLANWSSLSFPRKLVFGQVHGQRQSGMAVLVAFGAQILFFGISSLVLGLGRWTGDRWLPAKAFVLLAAAAVGGYVASLDALTSYAEKNKEKLIEALCR*
Ga0137363_1150677913300012202Vadose Zone SoilERAIRNRVRPESGADVLGLLSPQVAAVVRKEIRYLLRNGFAALLLLVPPILVFALISRATLFRYSGEKGGISPELFFPGLVAYIILILMAPAYNSFAYENTGIQTYFTAPLQFRDVFLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQIHG
Ga0137399_1002921913300012203Vadose Zone SoilLPPILVFVLISQASLFRFTGGRGVTPELFFPGLMGYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVFLGKNFVQVCMMAIELTLCIVAFSYRVGLPSPPIFTATLAAIAFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFAISSLVLGLGRWTGDRWLPAKAFTLLAAAAVGGYISSLDALTSYAEKKKEKLIEALCR*
Ga0137362_1037010713300012205Vadose Zone SoilLGKNFVQVVLVATELSLCIAAFCYRVGSPSAPTFIATLSAIIFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFCAQILLFAISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVASLDAWSSYAEKKKEKLIEALCK*
Ga0137362_1082843323300012205Vadose Zone SoilLLLPPILVFALITQATLFRFSGAASGTSPELVFPGLMAYIILILMAPAYNSFAYENTGVQTYFTAPLKFRDVLLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILGMGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137362_1146838513300012205Vadose Zone SoilVFLGKNLVQGCLIVTELTLCIAAFCYRVGPPSASIFLATMAAVVFTVVGQLSIANWSSLSFPRKLTFGRIHGQRQSAMAALVAFSTQILLFAISSFVLELGRWTSDPWLPAKAFTLLAVAAMGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137360_1048626423300012361Vadose Zone SoilRFSGAKSGISPELFFPGLVAYIILILMAPAYNSFAYENAGVQTYFTAPLQFRDVFLGKNFVQVALVATELTLCIAAFCYRVGRPSAPIFTATLAAIIFTLVGQLSIANWSSLSFPRRLAFGQVHGQRQSGMAVLVAFGAQILLFGISSVVLGLGRWTGDRWLPAKAFALLAAAAVAGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137360_1066619323300012361Vadose Zone SoilQATLFRSSEAKSGISPELFFPGLVAYIILILMAPAYNSFAYENTGIQTYFTAPLQFRNVFLGKNFVQVVLVATELSLCIAAFCYRVGSPSAPTFIATLSAIIFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFCAQILLFAISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVASLDAWSSYAEKKKEKLIEALCK*
Ga0137361_1022180613300012362Vadose Zone SoilDVFLGKNFVQVALVATELTLCIAAFCYRVGRPSAPIFTATLAAIIFTLVGQLSIANWSSLSFPRRLAFGQVHGQRQSGMAVLVAFGAQILLFGISSVVLGLGRWTGDRWLPAKAFALLAAAAVAGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137361_1174606813300012362Vadose Zone SoilILMAPAYNSFAYENTGIQTYFTAPLQFRNVFLGKNFVQVVLVATELSLCIAAFCYRVGSPSAPTFIATLAAIIFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFCAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVAALDASSSYAEKKKEKLIEALCR*
Ga0137390_1170800613300012363Vadose Zone SoilPILVFALITQVTVFRFSGAKSSISAELSFPGLAAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALVATELTLCIAAFCYRVGWPSAPIFTATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAA
Ga0137395_1082466513300012917Vadose Zone SoilIPSEAFFPGLVGYIILILMAPAYNSFAYENTGVQTYFTAPVHFRHVFLGKNFVQVSLIGAELTLCIVAFAYRVGLPSMPVFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVYGQRQSGMAVLVAFGAQILLFGISSVVLMLGRWTGDRWLPAKAFVLLAAAAVGGYIAALDALTDYAEKKKETLIEAMCR*
Ga0137396_1000385783300012918Vadose Zone SoilLGLLSPQVAAVIRKEIRYLLRNGFAALLLLLPPILVFALITQATLFRFSGAKSGISPELFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVFLGKNFVQVALVTIELTLCIAAFCYRVGSPSAPIFIATLAAIIFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSSMAVLVAFCAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAVVGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137396_1122773413300012918Vadose Zone SoilNVLLGKNFVQVALVATELTLCIAAFCYRVGVPSAPIFAATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDHWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137394_1047042923300012922Vadose Zone SoilFREVFLGKNFVQVCLIVIELALCIAAFCYRVGTPSAPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSRMAVLVAFAAQILLFAISSLVLGLGRWTGDRWLPAKAFTLLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137359_1099593813300012923Vadose Zone SoilVFALISQASLFHFSGGKGVSPELFFPGLTGYIILILMAPAYNSFAYENTGVQTYFTAPLQFRDVFLGKNFVQVCLVVIELTLCIVAFSYRVGLPSPPVFTATLAAIAFTVIGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDRWLPAKAFTVLAAAAIGGYISSLDALTSYAEKKKEKLIEALCR*
Ga0137359_1132696613300012923Vadose Zone SoilIRYLLRNGFAALLLLVPPILVFALISQAMLFRLSQAKSGISPELFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRDIFLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILGMGRWTGDRWLPAKAFALLA
Ga0137359_1132696713300012923Vadose Zone SoilIRYLLRNGFAALLLLVPPILVFALISQAMLFRLSQAKSGISPELFFPGLVAYIILILMAPAYNSFAYENTGIQTYFTAPLQFRDVFLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILGMGRWTGDRWLPAKAFALLA
Ga0137416_1006527313300012927Vadose Zone SoilLISPQIAAVIRKEFHYLLRNGFAALVLLLPPILVFVLISQASLFRFTGGRGVTPELFFPGLMGYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVFLGKNFVQVCMMAIELTLCIVAFSYRVGLPSPPIFTATLAAIAFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFAISSLVLGLGRWTGDRWLPAKAFTLLAAAAVGGYISSLDALTSYAEKKKEKLIEALCR*
Ga0137416_1046790523300012927Vadose Zone SoilGLLSPQVAAVVRKEIRYLLRNGFAALLLLVPPILVFALISQAMLFRLSQAKSGISPESFFPGLVAYIILILMAPAYNSFAYENTGIQTYFTAPLQFRDVFLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILGMGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137404_1225937013300012929Vadose Zone SoilNFVQVCMMAIELTLCIVAFSYRVGLPSPPIFTATLAAIAFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFAISSLVLGLGRWTGDRWLPAKAFTLLAAAAVGGYISSLDALTSYAEKKKEKLIEALCR*
Ga0137407_1127525713300012930Vadose Zone SoilELFFPALVAYIVLILMTPAYNSFAYENAGVQTYFTAPLRFREIFLGKNFVQVCLIVIELALCIAAFCYRVGTPSAPIFLATLAAIVFTVVGQLSIANWSSLSFPRKLTFGQIHGQRQSRMAVLIAFSAQLLLFAISSLVLGLGRWTGDRWLPAKAFTLLAVAATGGYVASLDALTSYAEKKKEKLIEALCR*
Ga0137403_1120462513300015264Vadose Zone SoilMAPAYNSFAYENTGVQTYFTAPLQFRNVFLGKNFVQVCMMAIELTLCIVAFSYRVGLPSPPIFTATLAAIAFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFAISSLVLGLGRWTGDRWLPAKAFTLLAAAAVGGYISSLDALTSYAEKKKEKLIEALCR*
Ga0066662_1200743713300018468Grasslands SoilVLRNGFAALLLFLPPILVFALISQASLLRFTGSKGVSAELFFPGLMAYIILILMAPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNVVQVSLITIELTLCIAAFCYRVGRPSAPIFVATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGSQILLFAISSLVLALGRWTGDPWLPAKAFALLAAAA
Ga0210401_1009332513300020583SoilNFVQVSLIAAELILCITAFSYRVGLPSLPVLMATLAAIVFTVMGQLSIANWSSLSFPRKLAFGQVYGQRQSGMAVLVAFGAQILLFGISSLVLMLGRWTGDRWLPAKAFVLLAAAAVAGYIAALDALTSYAEKKKEKLIEALCR
Ga0210404_1005931133300021088SoilGISPELFFPGLMAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRDVFLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLVLALGRWTGDRWLPTKAFALLAAAAFGGYIASLDALTSYAEKKKEKLIEALCR
Ga0210406_1111699113300021168SoilAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALVATELTLCIAAFCYRVGVPSPPIFAATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKRKEKLIEALCR
Ga0210400_1057864713300021170SoilLSLLSPRVAAVIRKEVRYLLRNGFAAMLLLLPPGLVFTLISQSSLLQFMGAKGISPELFFPGLLAYIVLILMTPAYNSFAYESAGVQTYFTAPLRFRDVFLGKNFVQVCLIVIEVTLCIAAFCYRVGAPPAPIFLATLAAIVFTVVGQLSIANWSSLCFPRKLEFGRIHGQRQSRMAVLIAFAAQILLFAISSLVMGLGRWTGDPWLPAKAFALLAAAAMGGYFASLDALTSYAEKKKEKLIEALCR
Ga0210396_1140776513300021180SoilLGKNFVQVCLVTIELTLCIVAFSYRVGLPSWPVFIATLAAITFTVIGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQVLLFGISSLVLALGRWTGDRWLPAKAFTLLAAAAIGGYISSLDALTSYAEKKKEKLIEALCR
Ga0210396_1175214813300021180SoilQYSLLHFMGARGVSPELFFPGLVAYIVLILMAPAYNSFAYENAGVQIYFTAPLHFRNVLLGKNFVQVCLIVTELTLCIAAFCYRVGTPSAPIFLATVTAVVFTVIGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLIAFAAQILLFGISSLVLGLGRWTDDRWLP
Ga0210389_1106503713300021404SoilYFTAPLQFRNVFLGKNLVQVALVATELALCIAAFCYRAGMPSAPIFAATLAAIVFALIGQLSLANWSSLSFPRKLVFGQVHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDRWLPAKAFVLLAAAAVGGYVASLDALTSYAEKNKEKLIEALCR
Ga0210402_1182684913300021478SoilSSMSADLFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALVATELTLCIAAFCYRVGVPSSPIFAATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVASLDA
Ga0210410_1101836413300021479SoilLLLVPPILVFALISQATLFRFSQAKNGISPELFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRDVFLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQLHGQRQSGMAVLVAFGAQILLFGISSLILGMGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR
Ga0210409_1155055013300021559SoilPLQFRDVFLGKNFVQVCLVTIELTLCIVAFSYRVGLPSWPVFIATLAAITFTVIGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQVLLFGISSLVLALGRWTGDRWLPAKAFTLLAAAAIGGYISSLDALTSYAEKKKEKLIEALCR
Ga0207699_1066311713300025906Corn, Switchgrass And Miscanthus RhizosphereETARSEAREDVLALFSPQVAAVIRKEFHYLLRNGFAAMLLLLPPVLVFALISQASPLRFMTGKGVSPELFFPGLMGYIILVLMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALIGAELFLCIVAFSYRMGLPSAPVFVATLAAILFTVIGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFAAQILLFGISTLILALGTWTGDPWLPAKAFVLLAAAAIGGYVASLDPLTSYAEKKKEALI
Ga0207700_1053629813300025928Corn, Switchgrass And Miscanthus RhizosphereFAALLLLLPPVLVFVLISQAALLRFTGSKGISPEMFFPGLVAYIILILMAPAYNSFAYESAGVQTYFTAPLRFRDVFLGKNFVQAALVATELTLCIVAFAYRVGLPPAPIFFATLAAIVFTVVGQMSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQIVFFGISSVVLALGRWTGDLWLPAEAFALLAAAAVGGYISSLDSLTTYAEKKKEKLIEALCR
Ga0209648_1028448713300026551Grasslands SoilFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRDVFLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILGMGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR
Ga0209648_1042983513300026551Grasslands SoilLVRNGFAALLLFLPPILVFALISQATVLSGFKKGIPTEAFFPGLVGYIILILMAPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNFIHVSLIAAELALCIAAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFSAQILLFGISSLILMLGRWTGDRWLPAKAFVLLAAAAAAGYIASLDALTAYAEKKKEKLIEALCR
Ga0179587_1031015323300026557Vadose Zone SoilMGYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVFLGKNFVQVCMMAIELTLCIVAFSYRVGLPSPPIFTATLAAIAFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFAISSLVLGLGRWTGDRWLPAKAFTLLAAAAVGGYISSLDALTSYAEKKKEKLIEALCR
Ga0209388_102154713300027655Vadose Zone SoilEIRYLLRNGFAALLLLLPPILVFALITQATLFRFSGAKSGISPELFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVFLGKNFVQVALVTIELTLCIAAFCYRVGSPSAPIFIATLAAIIFTVVGQLSIANWSSLSFPRKLAFGQIHGQRQSSMAVLVAFCAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAVVGGYVASLDALTSYAEKKKEKLIEALCR
Ga0209009_107613613300027667Forest SoilRRESGADALGLLSPQVAAVIRKEIRYLLRNGFAALLLLLPPILVFALITQATLFRFSGAKGGFSSELFFPGLVAYIILILMAPAYNSFAYESAGVQTYFTAPLQFRSVFLGKNFVQVALVATELMLCIAAFCYRVGMPSAPIFAATLAAIVFTLIGQLSLANWSSLSFPRKLVFGQVHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDRWLPAKAFVLLAAAAVGGYVASLDALTSYAEKNKEKLIEALCR
Ga0209118_112642513300027674Forest SoilALGSLSPQVAAVIRKEIRYLLRNGFAALLLFLPPILVFALITQATLLRFSGAQSNMSADLFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALVVTELTLCIAAFCYRVAVPSAPIFAATLAAIIFTLVGQLSIANWSSLSFPRKLVFGQIHGQRQSGMAVLVAFAAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAE
Ga0209011_109015913300027678Forest SoilSQATMLSGLRKGISSDVFFPSLVGYIILILMAPAYNSFAYENTGVQTYFTSPLRFRNVFLGKNFVQVSLIAAELALCILAFSYRVGLPSMPVFMATLAAVAFTVMGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTGDRWLPAKAFVLLAAAAVGGYFASLDALTAYAEKKKEKLIEALCK
Ga0209328_1019178913300027727Forest SoilIRNKARRESEADALGSLSPQVAAVIRKEIRYLLRNGFAALLLFLPPILVFALITQTTVLRFSGAKSSLSADLFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVTLVATELTLCIAAFCYRVGVPSSPIFAATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFGI
Ga0209180_1012418913300027846Vadose Zone SoilRKEIRYLLRNGFDALLLLLPPILVFALITQVTVFRFSGAKSSISAELSFPGLAAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALVATELTLCIAAFCYRVGWPSAPIFTATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAALGGYVASLDALASYAEKKKETLIEALCR
Ga0209590_1007166233300027882Vadose Zone SoilPPILVFALISQATVLSGFKKGIPTEAFFPGLVGYIILVLMAPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNFVHVSLIAAELALCIAAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTDDRWLPAKAFVLLAAAAVGGYIASLDALTVYAEKKKEKLIEALCR
Ga0209590_1053148413300027882Vadose Zone SoilFFPGLVGYIILILMVPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNFVHVSLIGAELALCIAAFSYRVGLPSMPVFLATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQGGMAALVQFGSQILLFGISSLILMLGRWTGDRWLPAQVFVLLAASAVGGYIASLDALTVYAEKKKEKLIEALCR
Ga0209488_1001545543300027903Vadose Zone SoilVGYIILILMAPAYNSFAYENTGVQTYFTAPVHFRHVFLGKNFVQVSLIGAELTLCIVAFAYRVGLPSMPVFTATLAAIVFTVVGQLSIANWSSLSFPRKLSFGQVYGQRQSGMAVLVAFGAQILLFGISSVVLMLGRWTGDRWLPAKAFVLLAAAAVGGYIAALDALTDYAERKKERLIEALCR
Ga0209488_1098706613300027903Vadose Zone SoilSQATVLSGFKKGIPTEAFFPGLVGYIILILMAPAYNSFAYENTGVQTYFTAPLRFRAVFLGKNFVHVSLIAAELTLCIAAFSYRVGLPSMPVFMATMAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTGDRWLPAKAFVLLAAAAAAGYIASLDALTAYAE
Ga0209526_1019557713300028047Forest SoilGFAALLLFLPPILVFTLISQATMLSGLRKGISSDVFFPSLVGYIILILMSPAYNSFAYENTGVQTYFTSPLRFRNVFLGKNFVQVALIAAELALCILAFSYRVGLPSMPVFMATLAAVAFTVMGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLILMLGRWTGDRWLPAKAFVLLAAAAVGGYFASLDALTAYAEKKKEKLIEALCK
Ga0209526_1026426423300028047Forest SoilNGFAALLLLLPPILVFALITQATLFRFSGPASGISPELFFPGLMAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRDVFLGKNFVQVSLIVTELTLCIVAFYYRVGLPSAPIFTATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLVLALGRWTGDRWLPAKAFALLAAAALGGYVASLDALTSYAEKKKEKLIEALCR
Ga0137415_1059660223300028536Vadose Zone SoilLRFREVFLGKNFVQVCLIVIELALCIAAFCYRVGTPSAPIFLATLAAIVFTVVGQLSIANWSSLSFPRKLTFGQIHGQRQSRMAVLIAFSAQLLLFAISSLVLGLGRWTGDRWLPAKAFTLLAVAATGGYVASLDALTSYAEKKKEKLIEALCR
Ga0170834_10099293613300031057Forest SoilELFFPGLVAYIILILMAPAYNSFAYESTGVQTYFTAPLQFRNVFLGKNFVQIALVATELTLCIAAFCYRVGMPSGPIFAATLAAIIFTLIGQLSLANWSSLSFPRKLVFGQVHGQRQSGMAVLVAFGAQILFFGISSLVLGLGRWTGDRWLPAKAFVLLAAAAVGGYVASLDALTSYAEKNKEKLIEALC
Ga0170824_12511691013300031231Forest SoilLITQASLFRFSGTKGGFSSELFFPGLVAYIILILMAPAYNSFAYESTGVQTYFTAPLQFRNVFLGKNFVQIALVATELTLCIAAFCYRVGMPSGPIFAATLAAIIFTLIGQLSLANWSSLSFPRKLVFGQVHGQRQSGMAVLVAFGAQILFFGISSLVLGLGRWTGDRWLPAKAFVLLAAAAVGGYVASLDALTSYAEKNKEKLIEALCR
Ga0307469_1031068813300031720Hardwood Forest SoilNLVQVALVATELALCIAAFCYRVGMPSAPIFAATLAAIVFALIGQLSLANWSSLSFPRKLVFGQVHGQRQSGMAVLVAFGAQILLFGISSLVLGLGQWTGDRWLPAKAFVLLAAAAVGGYVASLDALTSYAEKNKEKLIEALCR
Ga0307477_1003119313300031753Hardwood Forest SoilLLRNGFAALLLLLPPILVFALITQATVFRFSGAKSNISPELFFPSLVAYIILILMAPAYNSFAYENTGVQAYFTAPLQFRNVFLGKNFVQVALVTTELTLCIAAFCYRVGSPSPPIFIATLAAIIFTVVGQLSIANWSSLSFPRRLAFGQIHGQRQSGMAVLVAFCAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR
Ga0307477_1032159913300031753Hardwood Forest SoilRNAEGAATGVDGLSLLSPRVAAVIRKEARYLLRNGFAAMLLLLPPGLVFTLISQSSLVHFMGTKGISPELFFPGLVAYIVLILMTPAYNSFAYESAGVQTYFTAPLRFRDVFLGKNFVQVCLIAIELALCIAAFCYRVGAPPAPIFLATLAAVVFTVVGQLSIANWSSLSFPRKLEFGKIHGQRQSRMAVLIAFAAQILLFAISSLVLGLGRWTGDPWLPAKAFAVLAAAAMGGYFASLDALTSYAEKKKEKLIEALCE
Ga0307477_1046216013300031753Hardwood Forest SoilNTGVQTYFTAPLQFRNVLLGKNFVQVALVVTELSLCIAAFCYRVGMPSAPIFAATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTDDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR
Ga0307475_1079162013300031754Hardwood Forest SoilFTAPLRFRDVFLGKNFVQVCLIAIELALCIAAFCYRVGAPPAPIFLATLAAVVFTVVGQLSIANWSSLSFPRKLEFAKIHGQRQSRMAVLIAFAAQILLFAISSLVLGLGRWTGDAWLPAKAFAVLAAAAMGGYFASLDALTAYAEKKKEKLIEAFCR
Ga0307475_1099069923300031754Hardwood Forest SoilLIVTELTLCIAAFCYRVGTPSAPIFLATVTAVVFTVIGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLIAFAAQILLFGICSLVLGLGRWTDDRWLPAKAFALLAAAAVAGYVASLDALTSYAEKKKEKLVEALCR
Ga0307473_1018476523300031820Hardwood Forest SoilLQFRNVFLGKNFVQVALVATELTLCIAAFCYRVGWPSAPIFTATLAAIIFTLVGQLSIANWSSLSFPRRLAFGQVHGQRQSGMAVLVAFGAQILLFGISSVVLGLGRWTGDRWLPAKAFALLAAAAVAGYVASLDALTSYAEKKKEKLIEALCR
Ga0307479_1214199213300031962Hardwood Forest SoilFRFSGAKSSMSADLFFPGLVAYIILILMAPAYNSFAYESAGVQTYFTAPLQFRNVLLGKNFVQVALVTTELALCIAAFCWRAGWPSAPIFAATLAAIVFTVVGQLSIANWSSLSFPRKLAFGQVHGQRQSGMAVLVAFGAQILLFGISSLVLELGRWTGDRWLPTKAF
Ga0318533_1097671113300032059SoilFRYLFRNVFAASLLLLPLLVVLVIISQARLIRSSSRGITPETLFPGLMAYLILILMAPAYNSFAYESTGIQTYFTAPLRFREVFLGKNFVQVCLLTTALALCIAAFSYRVGLPSPPIFVATLTAMVFTVVGQLSIANWSSLSFPRKLAIGRLHGQRQSGMAVLVGLGVQILLFGIATLVLALGKWTGDRWLPAKAFALLSIAAIGGY
Ga0307471_10011766323300032180Hardwood Forest SoilQTTVLRFSGAKSSMSADLFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVALVATELTLCIAAFCYRVGVPSAPIFAATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR
Ga0307471_10121953013300032180Hardwood Forest SoilPAPESVVGNKARRDSDADALGLLSPQVAAVIRKEIRYLLRNGFAALLLLLPPILVFALITQATVFRFSGAKSNISPELFFPCLVAYIILILMAPAYNSFAYENTGVQAYFTAPLQFRNVFLGKNFVQVALVTTELTLCIAAFCYRVGSPSPPIFIATLAAIIFTVVGQLSIANWSSLSFPRRLAFGQIHGQRQSGMAVLVAFCAQILLFGISSLVLGLGRWTGDRWLPAKAFALLAAAAVGGYVASLDALTSYAEKKKEKLIEALCR
Ga0307471_10219418013300032180Hardwood Forest SoilPQVAAVIRKEIRYLLRNGFAALLLFLPPILVFALITQATVFRFSGATSSTSAELFFPGLVAYIILILMAPAYNSFAYENTGVQTYFTAPLQFRNVLLGKNFVQVTLVVTELTLCIAAFCYRVGWPSAPIFAATLAAIIFTLVGQLSIANWSSLSFPRKLAFGQIHGQRQSGMAVLVAFGAQILLFGISSLVLGLGRWTVDRWLPAKAFALLAAAAVGGYVASLDALTSYAE


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.