NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F099066

Metagenome Family F099066

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F099066
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 164 residues
Representative Sequence MVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Number of Associated Samples 93
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 83.33 %
% of genes near scaffold ends (potentially truncated) 3.88 %
% of genes from short scaffolds (< 2000 bps) 8.74 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (92.233 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(29.126 % of family members)
Environment Ontology (ENVO) Unclassified
(52.427 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(54.369 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 10.83%    β-sheet: 38.22%    Coil/Unstructured: 50.96%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 103 Family Scaffolds
PF02954HTH_8 38.83
PF08281Sigma70_r4_2 6.80
PF13185GAF_2 6.80
PF04542Sigma70_r2 2.91
PF09335SNARE_assoc 1.94
PF07690MFS_1 0.97
PF00496SBP_bac_5 0.97
PF00296Bac_luciferase 0.97
PF12680SnoaL_2 0.97
PF00723Glyco_hydro_15 0.97

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 103 Family Scaffolds
COG0568DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)Transcription [K] 2.91
COG1191DNA-directed RNA polymerase specialized sigma subunitTranscription [K] 2.91
COG1595DNA-directed RNA polymerase specialized sigma subunit, sigma24 familyTranscription [K] 2.91
COG4941Predicted RNA polymerase sigma factor, contains C-terminal TPR domainTranscription [K] 2.91
COG0398Uncharacterized membrane protein YdjX, related to fungal oxalate transporter, TVP38/TMEM64 familyFunction unknown [S] 1.94
COG0586Membrane integrity protein DedA, putative transporter, DedA/Tvp38 familyCell wall/membrane/envelope biogenesis [M] 1.94
COG1238Uncharacterized membrane protein YqaA, VTT domainFunction unknown [S] 1.94
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 0.97
COG3387Glucoamylase (glucan-1,4-alpha-glucosidase), GH15 familyCarbohydrate transport and metabolism [G] 0.97


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A92.23 %
All OrganismsrootAll Organisms7.77 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005330|Ga0070690_101698709Not Available513Open in IMG/M
3300005355|Ga0070671_101184093Not Available672Open in IMG/M
3300005440|Ga0070705_100485936All Organisms → cellular organisms → Bacteria934Open in IMG/M
3300005467|Ga0070706_101425667Not Available634Open in IMG/M
3300006854|Ga0075425_100013292All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria8853Open in IMG/M
3300006871|Ga0075434_100176472All Organisms → cellular organisms → Bacteria2156Open in IMG/M
3300006903|Ga0075426_10405470All Organisms → cellular organisms → Bacteria1006Open in IMG/M
3300007076|Ga0075435_100423696All Organisms → cellular organisms → Bacteria1146Open in IMG/M
3300009100|Ga0075418_10564590Not Available1224Open in IMG/M
3300010401|Ga0134121_10417125All Organisms → cellular organisms → Bacteria1217Open in IMG/M
3300012582|Ga0137358_10323428All Organisms → cellular organisms → Bacteria1046Open in IMG/M
3300015052|Ga0137411_1120234All Organisms → cellular organisms → Bacteria3809Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil29.13%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil29.13%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil14.56%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere11.65%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil9.71%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.94%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.97%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.97%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.97%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.97%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005330Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaGEnvironmentalOpen in IMG/M
3300005344Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-3 metaGHost-AssociatedOpen in IMG/M
3300005355Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaGHost-AssociatedOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026307Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026314Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10546458613300000364SoilMGKRGIAFVASVALIALSGCATHQRTDGVARSTESDVRPSALPTDLVGTWSGSFAPVAADAGGKNAVGSVTLVIKDDGTYAATDRRGVSTRNYTGVVVANGRSITLRNSSGGWVSLRRRGDVLYGVVHDLSGYTLQIAVQKDSGALVSPPTAQSGRE*
JGI25383J37093_1011695313300002560Grasslands SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
JGI25382J37095_1013942013300002562Grasslands SoilTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYXPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQGGQ*
JGI25382J43887_1016445823300002908Grasslands SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVXPSALPSDLVGTWHVAYXPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
JGI25382J43887_1029653913300002908Grasslands SoilILFVASVALIALSGCATHSQSYGVAQSTESDVHPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGALASPPSEPSGQ*
JGI25390J43892_1002717423300002911Grasslands SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGALASPPSEPSGQ*
Ga0066674_1002079433300005166SoilMVKRGILFVASAALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066679_1023791513300005176SoilVTRPLSDDSRCARLSPWRATSREADRSSLSTGLVRVVPLFRLQEAPAKALPEGAINMVKRGILFVASVALIALSGCATHSKSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTGIDRRKGSTQTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQ
Ga0066679_1075271613300005176SoilTSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSDCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAHAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSVMRRGDMLYGVALDQVIGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066690_1022302933300005177SoilTGAWRASASLPESFDSGQIVRSRPVTRPLSDDSRCARLSPRRATSREADRSSLSTGLVRVVPLFRLQEAPAKALPEGAINMVKRGILFVASVALIALSGCATHSKSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTGIDRRKGSTQTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066685_1113496313300005180SoilALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066678_1056782913300005181SoilAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0070690_10169870913300005330Switchgrass RhizosphereMSKRGIVLGSSILLIALSGCATHQRTDVVARSTEADVRPSALPADLVGTWSGSVMPVAPSAGGSNAVGSVTITIKDDGTYTATDRRGASTRNYSGVVVANGRSITFRNSSGRSVSLRRRGDALYGVAQDMSG
Ga0070661_10142363213300005344Corn RhizosphereTDVVARSTEADLRPSALPADLVGTWSGSVMPVAPSAGGSNTVGSVTITIKDDGTYTATDRRGASTRNYSGVVVANGRSITFRNSSGRSVSLRRRGDALYGVAQDMSGYALQISVQKDSGALAGSPSAENGR*
Ga0070671_10118409323300005355Switchgrass RhizosphereTHQRTDVVARSTEADVRPSALPADLVGTWSGSVMPVAPSAGGSNAVGSVTITIKDDGTYTATDRRGASTRNYSGVVVANGRSITFRNSSGRSVSLRRRGDALYGVAQDMSGYALQISVQKDSGALAGSPSAENGR*
Ga0070705_10048593613300005440Corn, Switchgrass And Miscanthus RhizosphereMSKRGIVLGSSILLIALSGCATHQRTDVVARSTEADVRPSALPADLVGTWSGSVMPVAPSAGGSNAVGSVTITIKDDGTYTATDRRGASTRNYSGVVVANGRSITFRNSSGRSVSLRRRGDALYGVAQDMSGYALQISVQKDSGALAGSPSAENGR*
Ga0070706_10142566713300005467Corn, Switchgrass And Miscanthus RhizosphereATHQRTDGVARLTESDARPSPLPADLVGTWTGYFVPVAAGAGGEGAVGNVTLTIKDDGTYTAIERRRASTRNHSGVVAANGGTITLRNSSGQWVSLRHRGDALYGLTHDLSGYTLQFSAQKDSGTLAGSPSAPSGGE*
Ga0066697_1001543723300005540SoilMVKRGILFVASAALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066661_1024430313300005554SoilMVKRGILFVASVALIALSGCATHSKSYGVAQSTESDVRPSALPSDLIGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTGIDRRKGSTQTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066661_1064057813300005554SoilTSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSDCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAHAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVIGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066700_1054822913300005559SoilMVKRGISFIASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLAGTWHVLYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGALASPPSEPSGQ*
Ga0066699_1004374133300005561SoilVTRPLSDDSRCARLSPWRATSREADRSSLSTGLVRVVPLFRLQEAPAKALPEGAINMVKRGILFVASVALIALSGCATHSKSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTGIDRRKGSTQTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066703_1021279633300005568SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYVPVGSDAGGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSVMRRGDMLYGVALDQVIGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066694_1002749713300005574SoilGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066691_1047369113300005586SoilNMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYVPVGSDAGGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066706_1078324713300005598SoilAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066652_10001759753300006046SoilMVKRGILFVASAALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGALASPPSEPSGQ*
Ga0075417_1025450923300006049Populus RhizosphereVPFFLLQEAPTKALPEEEKINMGKRGIAFVASVALIALSGCATHQRTDGVARSTESDVRPSALPTDLVGTWSGSFAPVAADAGGKNAVGSVTLVIKDDGTYAATDRRGASTRNYTGVVVANGRSITLRNSSGGWVSLRRRGDVLYGVVHDLSGYTIQIAVQKDSGALVSPPTAQSGRE*
Ga0066653_1006698833300006791SoilSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASAALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066659_1050524823300006797SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVHPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGALASPPSEPSGQ*
Ga0066659_1096517513300006797SoilALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0075428_10082762023300006844Populus RhizosphereMGKRGIAFVASVALIALSGCATHQRTDGVARSTESDVRPSALPTDLVGTWSGSFAPVAADAGGKNAVGSVTLVIKDDGTYTATDRRGVSTRNYTGVVVANGRSITLRNSSGGWVSLRRRGDVLYGVVHDRSGYTLQIAVQKDSGALVSPPTAQSGRE*
Ga0075433_1018410023300006852Populus RhizosphereVPFFLLQEAPTKALPEEEKINMGKRGIAFVASVALIALSGCATHQRTDGVARSTESDVRPSALPTDLVGTWSGSFAPVAADAGGKNAVGSVTLVIKDDGTYTATDRRGVSTRNYTGVVVANGRSITLRNSSGGWVSLRRRGDVLYGVVHDLSGYTIQIAVQKDSGALVSPPTAQSGRE*
Ga0075425_10001329233300006854Populus RhizosphereMSKRGIVLGAGILLIALSGCATHQRTDVVARSTEADVRPSALPADLVGTWSGSVMPVAPSAGGSNAVGSVTITIKDDGTYTATDRRGASTRNYSGVVVAKGRSITFRNSSGRSVSLRRRGDALYGVAQDMSGYALQISVQKDSGALAGSPPAENGR*
Ga0075434_10017647223300006871Populus RhizosphereMSKRGIVLGAGILLIALSGCATHQRTDVVARSTEADVRPSALPADLVGTWNGSVMPVAPSAGGSNAVGSVTITIKDDGTYTATDRRGASTRNYSGVVVAKGRSITFRNSSGRSVSLRRRGDALYGVAQDMSGYALQISVQKDSGALAGSPPAENGR*
Ga0075429_10065473123300006880Populus RhizosphereVPFFLLQEAPTKALPEEEKINMGKRGIAFVASVALIALSGCATHQRTDGVARSTESDVRPSALPTDLVGTWSGSFVPVAADAGGKNAVGSVTLVIKDDGTYTATDRRGVSTRNYTGVVVANGRSITLRNSSGGWVSLRRRGDVLYGVVHDLSGYTIQIAVQKDSGALVSPPTAQSGRE*
Ga0075426_1012306323300006903Populus RhizosphereMGKRGIAFVASVALIALSGCATHQRTDGVARSTESDVRPSALPTDLVGTWSGSFAPVAADAGGKNAVGSVTLVIKDDGTYTATDRRGVSTRNYTGVVVANGRSITLRNSSGGWVSLRRRGDVLYGVVHDLSGYTIQIAVQKDSGALVSPPTAQSGRE*
Ga0075426_1040547013300006903Populus RhizosphereMSKRGIVLGAGILLIALSGCATHQRTDVVARSTEADVRPSALPADLVGTWNGSVMPVAPSAGGSNAVGSVTITIKDDGTYTATDRRGASTRNYSGVVVAKGRSITFRNSSGRSVSLRRRGDALYGVAQDMSGYALQISVQKDSGALAGSP
Ga0075435_10042369623300007076Populus RhizosphereMGQRGIVFVASIALIALSGCATHQRTDGVARLTESDARPSPLPADLVGTWTGYFVPVAAGAGGEGAVGNVTLTIKDDGTYTAIERRRASTRNHSGVVAANGGTVTLRNSSGQWVSLKHRGDALYGLTHDLSGYTLQFSAQKDSGTLAGSPSAPSGGE*
Ga0099791_1015060423300007255Vadose Zone SoilLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVHPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGALASPPSEPSGQ*
Ga0099793_1004900813300007258Vadose Zone SoilLPEEAINMVKRGILFVASVALIAFSGCATHSQRYGVAQSTESDVRPSALPSDLVGTWHVLYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0099794_1000794243300007265Vadose Zone SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVHPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0075418_1056459023300009100Populus RhizosphereMGKRGIAFVASVALIALSGCATHQRTDGVARSTESDVRPSALPTDLVGTWSGSFAPVAADAGGKNAVGSVTLVIKDDGTYTATDRRGGSTRNYTGVVVANGRSITLRNSSGGWVSLRRRGDVLYGVVHDRSGYTLQIAVQKDSGALVSPPTAQSGRE*
Ga0114129_1048130233300009147Populus RhizosphereVARSTESDVRPSALPTDLVGTWSGSFAPVAADAGGKNAVGSVTLVIKDDGTYTATDRRGVSTRNYTGVVVANGRSITLRNSSGGWVSLRRRGDVLYGVVHDLSGYTIQIAVQKDSGALVSPPTAQSGRE*
Ga0134082_1002277313300010303Grasslands SoilLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPRDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGV
Ga0134088_1004615913300010304Grasslands SoilLFRLQEAPAKALPEEAINMVKRGILFVASAALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0134086_1002494423300010323Grasslands SoilLFRLQEAPAKALPEEAINMVKRGILFVAGVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0134111_1000404723300010329Grasslands SoilLFRLQEAPAKALPEEAINMVKRGILFVASAALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0134080_1001773623300010333Grasslands SoilLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0134063_1002162223300010335Grasslands SoilLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0134062_1008197713300010337Grasslands SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0134121_1041712523300010401Terrestrial SoilMSKRGIVLGSSILLIALSGCATHQRTDVVARSTEADVRPSALPAELVGTWSGSVMPVAPSAGGSNAVGSVTITIKDDGTYTATDRRGASTRNYSGVVVANGRSITFRNSSGRSVSLRRRGDALYGVAQDMSGYALQISVQKDSGALAGSPSAENGR*
Ga0137389_1077266323300012096Vadose Zone SoilASVALIALSGCATHSQSYGVAQSAESDVRPSALPSDLVGTWHGADVPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGGVVANGRTLTLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0137363_1019683113300012202Vadose Zone SoilLIALSGCATHSKSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTGIDRRKGSTQTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0137362_1023373633300012205Vadose Zone SoilLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSKSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTGIDRRKGSTQTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQV
Ga0137380_1010781723300012206Vadose Zone SoilMVKRGISFIASVALIALSGCATHSQSYGVAQSTESDLRPSALPSDLVGTWHVLYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVAPDQVSGHRIQISAEKDTGVLASPPSEPSGQ*
Ga0137381_1017954523300012207Vadose Zone SoilVPFFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPRDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDNGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVPANPPSAQSGQ*
Ga0137376_1010683923300012208Vadose Zone SoilMVKRGILFVASVALIALSGCAMHSQSYGVAQSTESDVHPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGALASPPSEPIDQ*
Ga0137378_1127187213300012210Vadose Zone SoilRATSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0137360_1160604313300012361Vadose Zone SoilMVKRGISFIASVALIALSGCATHSQSYGVAQSTESDVRPSVLPADLVGTWSGSFFPVGSDAGGSNAFGNVTVVIKDDGTYTVTERRKGSTRILSGVVVANGRTITLQSSTGQWISLRRRGDRLYGMSPDQTSGFRVQIFVEKESGALASPPSAPSG
Ga0137361_1080634513300012362Vadose Zone SoilTSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSKSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTGIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0137361_1176370313300012362Vadose Zone SoilPDDSRFARLQPRRATSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGATRTFSGVVVANGPTITLRTSTGSWESLMRRGEMLYCVAV
Ga0137358_1025341613300012582Vadose Zone SoilVTRPLSGDSRCARLSPWRATSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSKSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTRIDRRKGSTQTFSASSWRTVARSHCEPRRAGGSRSCAEATCCTAWLSIR*
Ga0137358_1032342813300012582Vadose Zone SoilMRKRGIVLGSSIVLIALSGCATHQRTDVVARSTESDVRPSPLPTDLVGTWSGSFVPIGAGAGGDNAVGSVILTIKDDGTYTATERRKASTWSYSGVVVANGHTITLRSSSGTWVSLRRRGDALYGVAHDRAGYTLQVSVEKDSGALAGPPSAQSRRE*
Ga0137358_1105594913300012582Vadose Zone SoilPRRATSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTRHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVS
Ga0137396_1064386923300012918Vadose Zone SoilLFRLQEAPAKALPEEAINMVKRGILFVASVALIAFSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAHAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGALASPPSEPSGQ*
Ga0137396_1126454713300012918Vadose Zone SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVHPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVA
Ga0137394_1116935123300012922Vadose Zone SoilLSGCATHSQSYGVAQSTESDVRPSVLPADLVGTWSGSFFPVGSDAGGSNAFGNVTVVIKDDGTYTVTERRKGSTRILSGAVVASGRTVTLQSSTGRWISLRRRGDRLYGMSPDETSGFRVQISVEKESGALASPPSTQSGRE*
Ga0137419_1007893013300012925Vadose Zone SoilMVKRGISFIASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVLYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVAPDQVSGHRIQISAEKDTGVLASPPSEQSGQ*
Ga0137404_1013102423300012929Vadose Zone SoilMVKRGISFIASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVLYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRIMISVEKDTGALASPPSEQSGQ*
Ga0137407_1042609223300012930Vadose Zone SoilVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSVLPADLVGTWSGSFFPVGSDAGGSNAFGNVTVVIKDDGTYTVTERRKGSTRILSGVVVANGRTITLQSSTGQWISLRRRGDRLYGMSPDQTSGFRVQIFVEKESGALASPPSAPSGRE*
Ga0137410_1089845613300012944Vadose Zone SoilMVKRGISFIASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRIMISVEKDTGALASPPSEPSGQ*
Ga0134110_1040238523300012975Grasslands SoilATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITSQTSTGRWVSLMRRGAMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0134076_1007688423300012976Grasslands SoilLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILIPVEKDTGVLANPPSAQSGQ*
Ga0134087_1004968813300012977Grasslands SoilLLRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0137411_112023443300015052Vadose Zone SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRIMISVEKDTGALASPPSEQSGQ*
Ga0137418_1001791693300015241Vadose Zone SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAHAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ*
Ga0066655_1000913223300018431Grasslands SoilVPLFRLQEAPAKALPEEAINMVKRGILFVASAALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0066667_1074681113300018433Grasslands SoilTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0066662_1092284413300018468Grasslands SoilTSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0179594_1000355213300020170Vadose Zone SoilEADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYVPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0179594_1011804813300020170Vadose Zone SoilPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSVLPADLVGTWSGSFFPVGSDAGGSNAFGNVTVVIKDDGTYTVTERRKGSTRILSGVVVANGRTITLQSSTGQWISLRRRGDRLYGMSPDQTSGFRVQIFVEKESGALASPPSAPSGRE
Ga0209350_103139433300026277Grasslands SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVHPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGALASPPSEPSGQ
Ga0209237_100421493300026297Grasslands SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYVPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209238_110176613300026301Grasslands SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209469_103393333300026307SoilMVKRGILFVASAALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209055_102752533300026309SoilMVKRGILFVASVALIALSGCATHSKSYGVAQSTESDVRPSALPSDLIGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTGIDRRKGSTQTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209239_103772733300026310Grasslands SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVHPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209761_110303833300026313Grasslands SoilRVTSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209268_105206313300026314SoilALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209471_102497533300026318SoilMVKRGILFVASVALIALSDCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAHAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSVMRRGDMLYGVALDQVIGYRILISVEKDTGVLANPPSAQSGQ
Ga0209131_115929713300026320Grasslands SoilQSTESDVRPSALPSDLVGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTGIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRIMISVEKDTGALASPPSEPSGQ
Ga0209375_100863823300026329SoilLPELSDAGHLVRSRPAPRPLPDDSRFARLQPRRATSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVERGILFVASAALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209804_114879023300026335SoilLPESFDSGQIVRSRPVTRPLSDDSRCARLSPRRATSREADRSSLSTGLVRVVPLFRLQEAPAKALPEGAINMVKRGILFVASVALIALSGCATHSKSYGVAQSTESDVRPSALPSDLIGTWHVAYAPVGSDAGGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRT
Ga0209057_102616023300026342SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLQTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209806_106023313300026529SoilNMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYVPVGSDAGGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209157_100117243300026537SoilMVKRGILFVASAALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209376_109682433300026540SoilALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209648_1001289443300026551Grasslands SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAHAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209388_123068913300027655Vadose Zone SoilRFARLQPRRATSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLTRRGDM
Ga0209588_115356213300027671Vadose Zone SoilLQPRRVTSREADRSSLSTGLVRVVPLFRLQEAPAKALPEEAINMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVHPSALPSDLVGTWHVAYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGGMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ
Ga0209382_1222108913300027909Populus RhizosphereMGKRGIAFVASVALIALSGCATHQRTDGVARSTESDVRPSALPTDLVGTWSGSFAPVAADAGGKNAVGSVTLVIKDDGTYTATDRRGVSTRNYTGVVVANGRSITLRNSSGGWVSLRRRGDVLYGVVHDLSGYTLQIAVQKDSGALVSPPTAQSGRE
Ga0137415_1000669193300028536Vadose Zone SoilMVKRGILFVASVALIALSGCATHSQSYGVAQSTESDVRPSALPSDLVGTWHVLYAPVGSDASGGNAFGSATLVIKDDGTYTAIDRRKGSTRTFSGVVVANGRTITLRTSTGRWVSLMRRGDMLYGVALDQVSGYRILISVEKDTGVLANPPSAQSGQ


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.