NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F069908

Metagenome Family F069908

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F069908
Family Type Metagenome
Number of Sequences 123
Average Sequence Length 111 residues
Representative Sequence MTSRTGSSQARLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLGTIILAAIGLLIVAGGAGYVVSKRLGVPMSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASY
Number of Associated Samples 88
Number of Associated Scaffolds 123

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 66.67 %
% of genes near scaffold ends (potentially truncated) 7.32 %
% of genes from short scaffolds (< 2000 bps) 1.63 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction Yes
3D model pTM-score0.40

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (92.683 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(44.715 % of family members)
Environment Ontology (ENVO) Unclassified
(47.154 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.407 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 66.43%    β-sheet: 0.00%    Coil/Unstructured: 33.57%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.40
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 123 Family Scaffolds
PF04185Phosphoesterase 14.63
PF00005ABC_tran 14.63
PF01243Putative_PNPOx 8.13
PF01402RHH_1 0.81
PF07883Cupin_2 0.81
PF00801PKD 0.81
PF03176MMPL 0.81
PF12679ABC2_membrane_2 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 123 Family Scaffolds
COG3511Phospholipase CCell wall/membrane/envelope biogenesis [M] 14.63
COG1033Predicted exporter protein, RND superfamilyGeneral function prediction only [R] 0.81
COG2409Predicted lipid transporter YdfJ, MMPL/SSD domain, RND superfamilyGeneral function prediction only [R] 0.81


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A92.68 %
All OrganismsrootAll Organisms7.32 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002908|JGI25382J43887_10112316All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1428Open in IMG/M
3300002916|JGI25389J43894_1000237All Organisms → cellular organisms → Archaea8602Open in IMG/M
3300005174|Ga0066680_10010841All Organisms → cellular organisms → Archaea4677Open in IMG/M
3300009038|Ga0099829_10003200All Organisms → cellular organisms → Archaea9570Open in IMG/M
3300010303|Ga0134082_10017269All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon2635Open in IMG/M
3300010304|Ga0134088_10013931All Organisms → cellular organisms → Archaea3472Open in IMG/M
3300012361|Ga0137360_10011147All Organisms → cellular organisms → Archaea5732Open in IMG/M
3300026326|Ga0209801_1028850All Organisms → cellular organisms → Archaea2642Open in IMG/M
3300027862|Ga0209701_10137469All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1501Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil44.72%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil30.89%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.13%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil8.13%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.44%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.63%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.63%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.81%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.81%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.81%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002915Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017934Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_3EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027725Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25382J43887_1004015233300002908Grasslands SoilMTSRTGSSQARLVGIFLLVLIISLAEFAALILDYQNNRYLRQYVSDNLGTIIVAAIGLFIVAGGAGYVVSKRLGIPTSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSST
JGI25382J43887_1011231623300002908Grasslands SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAAIGLLFVAGGAGYVVSKRLGIPRSSGPRVPTFARIGSFVGRRYKLIIVFWILL
JGI25386J43895_1012684713300002912Grasslands SoilMTIRTGSSQAQLVGIFVLVLIVSLAEFSALILDYQNNRYLRQYVSDNLGTIIVFAIGLLIVAGGAGYVVSRRLGVPKTTAPSVPTFARIGSFVGRRYKIIIVF
JGI25387J43893_104685713300002915Grasslands SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAAIGLLFVAGGAGYIVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIV
JGI25389J43894_1000237143300002916Grasslands SoilMTSRTSSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILATIGLLFVAGGAGYVVSKRLGIPKSPGPR
Ga0066672_1089909113300005167SoilMTSRTSSSQARLIGIFVLVLIISLAEFAALLLDYQNNRYLRQYISDNLGTIILAAIGLLIVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKL
Ga0066680_1001084153300005174SoilMSMMSWVRDAKDDFSGARWTGIFVLVIILSLAELSALILDYQNSRYLRQYVSDNLGTIIVAAIGLLIVAGGAGYVVSRRLGVPKTQGSRVQTFARIGSFVGRRYKLIIVFW
Ga0066688_1102127013300005178SoilMTTRTGSSQAQLVGIFVLVLIVSLAEFSALILDYQNNRYLRQYVSDNLGTIIVFAIGLLIVAGGAGYVVSRRLGVPKTTAPSVPTFARIGSFV
Ga0066685_1006709813300005180SoilMISGTNNSQARWIGIFVLMLIISLAGFTALILDYQNNRYLRQYVSDNLVTTIVAAIGLLIVAVGAGYVVSRRLGVPKTPGLRVPTFARIGSVVGRRYKLIIVFWILLFA
Ga0066685_1076073313300005180SoilMTSRTGSSQTRLVGIFVLVLIISLAEFAALILDYQNNLYLRQYVSDNLVTIIVAAIGLLIVAGGAGYAVSRKLGVPRIQGPRIPTFARIGSFVGRRHRLIIAFWILLFAASFPLSQQLSQVTTSSTSGGQSSMSPSALAQNLMAKEFPHPQSNASAIILLQ
Ga0066676_1029044323300005186SoilMSETGPSQARWIAIFVLVLILALAELAALILDYQNNRYLRQYVSDNLGMIVAGSIGLLMIAGIAGYAVSRKLGIPKTPGPRVPTFARIGAFVGRRYKL
Ga0070708_10066954513300005445Corn, Switchgrass And Miscanthus RhizosphereMRGPSISQARWTGIFVLVLIISLAELTALILDYQNNRYLRQYVSDNLGTIIVAAIGLLIVAVGAGYVVTRRLRVPKTPGPRVPTFARIGSFIGRRYRLIIVFWILLFAASFPLSQQLSQVTTSSTSGGQSNTSQSA
Ga0070708_10126201123300005445Corn, Switchgrass And Miscanthus RhizosphereMMKGTSISQARWIGIFVLVLIISLAELTALILDYQNNRYLRQYVSDNLGTIIVAAIGLLIVAGGAGYVVSRRLGVPKTPGPRLPTFARIGSFVGRRSKLIIVFWILLFAVSFPLSQQLSQVTTSS
Ga0066686_1092254513300005446SoilMRKSDSSQARSIGILVLVLIISLAELSALILDYQNNRYLRQYVSDNLGMIVVASIGLFIVAGTAAYGVSRKLAVPKTQGSRVPTFARIGAFVGRRYKLIIVFWILLFAASFPLS
Ga0066689_1024524823300005447SoilLVGIFVLVLILSLAEFAALLLDYQNNPYLRQYVSDNLGTIILAAIGLLIVAGGGGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFW
Ga0070707_10134546613300005468Corn, Switchgrass And Miscanthus RhizosphereMRGTSISQARWIGIFVLVLIISLAELTALILDYQNNRYLRQYVSDNLGTIIVAATGLLIVAVGAGYAVSRRLGVAKTPGPRVPTFARIGSFIGRRYRLIIVFWILLFAASFPLSQQLSQVTTSSTSGG
Ga0066697_1080148313300005540SoilMTSRTSSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAATGLLFVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSRQLSQVTTSNTSGGQSGTSQSALAQNLMAQ
Ga0066695_1074412613300005553SoilMTSRIGSSQARLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLGTIIVAGIGLLIIAVGAGYIVFRRLGVSKTTGLHVPTFARISSFVGRRYKLIILFWILLFAASFPLSRQLPQVTTSNTSGGQSSSSLSALAQNLMTQEFPRPQSNASAIILLQGNNVTD
Ga0066704_1049485623300005557SoilLVGIFVLVLIISLAGFAALLLDYQNNRYLRQYVSDNLPTIIVAGIGLLIIAGGAGYIVFRRFGVSKTTGPRAPTFARISSFV
Ga0066703_1023588623300005568SoilMTSKTGSSQARLVGIFVLVLIISLAGFAALLLDYQNNRYLRQYVSDNLGLIVVATIGLVIVAGGAGYVVSRRLGAPKTPGPRVPTFARIGSFVGKRYKLIILFWILLFAASFPLSQQ
Ga0066703_1023870323300005568SoilMRGTSISQARWIGIFVLVLIISLAELTALILDYQNNRYLREYVSDNLGMIVAATIGLVIVAGGAGYVVSRRLGVPKIPGPRVPTFARIGSFVGKRYKLIILFWILLFAASFPLSQQ
Ga0066694_1000574813300005574SoilMTSRTSSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILATIGLLFVAGGAGYLVFKRLGIPKSPGPRVPTFARIGSSVGRRYKLIIVFWILLF
Ga0066691_1034067513300005586SoilMTSRTSSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGAIILATIGLLFVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWIL
Ga0066656_1062075013300006034SoilMSETGPSQARWIAIFVLVLIMALAELAALILDYQNNMYLRQYVSDNLGMIVAGSIGLFIIAGIAGYAVSRKLGIPKIPGPRVPTFARIGAFVGRRYKLIIFFWILLFASSFPLSQQLAQV
Ga0066665_1035314113300006796SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAAIGLLIVAGGAGYVVSRRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSRQLSQVTTSNTSG
Ga0066659_1101182913300006797SoilMTSRTSSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGAIILATIGLLFVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILLFTASFP
Ga0079221_1147419013300006804Agricultural SoilMSKTGTSQARWVGIFVLVLTLFLAGLGALILDYQNNSYLRQYVSDNLGIVVAGSIGILIVAGIAGYLVSRRLGVTKTPGPITPTFARIGSFVGRRYKLIIVFWILLFA
Ga0099793_1011606813300007258Vadose Zone SoilMSKVDPSRASWTGILVLVLTISLAELAALILDYQNNRYLRLYVSDNLSTIILAAIGLLIVAGGAGYVVSKRLGVQKTPGPRVPAFARIGSFVGRRYKPIIVFWILLFAAS
Ga0066710_10306114013300009012Grasslands SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAAIGLLFVAGGAGYLVSKRFGIPKSPGPRVPTFARIGSSVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSSTSGGQSSS
Ga0099829_1000320013300009038Vadose Zone SoilMMSGTGHSQARWIGIFVLVLVISLAELTALILDYQNNRYLRLYVSDNLSMIIVAAIGLLIVAGGAGHVVSRRLGVQKTPGPLVPVFARIGSFVGRRYK
Ga0099829_1018350833300009038Vadose Zone SoilMMSGTSHSQAGWIGIFVLVLIISLAGLTALIFDYQNNRYLRLYVSDNLSTIVVAAIGLLIVAGGAGYVVSRRLGVQKTPGP
Ga0099829_1027570513300009038Vadose Zone SoilMSGTYPSRARWIGIFVLVLIISLAGLTALILDYQNNRYLRLYVSDNLTSIIVASIGLLIVAGGAGFVVSRRLGVQKTLGPRVPAFARIGSFVGRRYKLIIVFWILLFAASFPLSQLLS
Ga0099829_1079812713300009038Vadose Zone SoilMSGSDSSQARLIGIFVLVLIVSLAELAALILDYQNNRYLRQYVSDNLGTIVAAAIGLAVVASVAGYVLSRKLGAQKTPRPKVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSSTSGGQSSTSQSALAQNLMAQEFPHP
Ga0099830_1017917333300009088Vadose Zone SoilMMSGTSHSQARWIGIFVLVLIISLAGLTALIFDYQNNRYLRLYVSDNLSTIVVAAIGLLIVAGGAGYVVSRRLGVQKTPGPRVPAFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSST
Ga0099828_1152345513300009089Vadose Zone SoilMSGSGSSQARLIGIFVLVLIVSLAELAALILDYQNNRYLRQYVSDNLGTIVAAAIGLAVVASVAGYVLSRKLGAQKTPRPKVPTFARIGSFVGRRYKLII
Ga0099827_1005059843300009090Vadose Zone SoilMTSRTGSSQARLVGILVLVLIISLAELAALILDYQNNRYLRQYVSGNLGTIIVAAISLLIVAGGAGYVVSRRLGVPKTPGPRLPTFARIGSFVGRRY
Ga0099827_1008594533300009090Vadose Zone SoilMMSKTGSSQARLIGIFILVLILSLAELAALILDYQNNRYLRQYVSDNLGMIVAGSIGFLIIAGLTGYAVSRKLGVSKTPGS
Ga0134082_1001726913300010303Grasslands SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILATIGLLFVAGGAGYLVFKRLGIPKSPGPRVPTFARIGSSVGRRYKLIIVFWILLFAASFPFS
Ga0134088_1001393153300010304Grasslands SoilMTSRTGSSQTRLVGIFVLVLIISLAEFAALILDYQNNLYLRQYVSDNLVTIIVAAIGLLIVAGGAGYAVSRKLGVPRIQGPRIPTFA
Ga0134088_1009377923300010304Grasslands SoilMTTRTGSSQAQLVGIFVLVLIVSLAEFSALILDYQNNRYLRQYVSDNLGTIIVFAIGLLIVAGGAGYVVSRRLGVPKTTAPSVPTFARIGSFVGRRYKIIIVFWILLFAASFPLSEQLS
Ga0134088_1064821013300010304Grasslands SoilMLIISLAGFTALILDYQNNRYLRQYVSDNLVTTIVAAIGLLIVAVGAGYVVSRILGVPKTPGPLVPTFARIGS
Ga0134067_1033806113300010321Grasslands SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYISDNLGTIILATIGLLFVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILLF
Ga0134063_1057234913300010335Grasslands SoilMTSRTSSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLVTIIVAAIGLLIVAGGAGYAVSRKLGVPRIQGPRIPTFARIGSFVGRRHRLIIAFWILLFAASFPLSQQLSQ
Ga0134063_1075974123300010335Grasslands SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILATIGLLFVAGGAGYLVFKRLGIPKSPGPRVPTFARIGSSVGRR
Ga0137392_1144420213300011269Vadose Zone SoilMMSGTSHSQARWIGIFVLVLIISLAGLTALIFDYQNNRYLRLYVSDNLSTIVVAAIGLLIVAGGAGFVVFRRLGVQKTPGPRVP
Ga0137391_1043728713300011270Vadose Zone SoilMMSGTSHSQARWIGIFVLVLIMSLAGLTALIFDYQNNRYLRLYVSDNLSTIVVAAIGLLIVAGGAGYVVSRRLGVQKTPGPRVPAFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSSTSGGQSSTSQ
Ga0137391_1048422923300011270Vadose Zone SoilMKSGTSSAQSRWIGIFVLVLIISLAELTALILDYQNNRYLRQYVSDNLTTIVAASIGLLIVAGAAGYAVSRKLGVPRPPGPRTPTFAKIGAFV
Ga0137391_1056427923300011270Vadose Zone SoilMMSKTGSSQARLIGIFVLVLILSLAELAALILDYQNNRYLRQYVSGNLGTIIVAAISLLIVAGGAGYVVSRRLGVPKTPGPRLPTFARIGSF
Ga0137391_1127839313300011270Vadose Zone SoilMSGTDTSRARWIGILVLVLTISLAELAALILDYQNNRYLRLYVSDNLGTIVVAAIGLLIVAGGAGYVASKKLGVPKSPGPHVPTFARIASFVRR
Ga0137393_1042057413300011271Vadose Zone SoilMKSGTSSAQSRWIGIFVLVLIISLAELTALILDYQNNRYLRQYVSDNLTTIVAASIGLLIVAGAAGYAVSKLGVPRPPGPRTPTFAKIGAFVGRRYKLIIVFWILLFAASFS
Ga0137393_1053455423300011271Vadose Zone SoilMSKIDPSRARWTGILVLVLIISLAELAALILDYQNNRYLRLYVSDNLSTIVVAAIGLLIVAGGAGYVVSRRLGVQKTPGPRV
Ga0137389_1033389423300012096Vadose Zone SoilMKSGTSSAHSRWIGIFVLVLIISLAELTALILDYQNNRYLRQYVSDNLTTIVAASIGLLIVAGAAGYAVSKLGVPRPPGPRTPTFAKIGAFVGRR
Ga0137389_1057459913300012096Vadose Zone SoilMSGTYPSRARWIGIFVLVLIISLAGLTALILDYQNNRYLRLYVSDNLSTIIVAAIGLLIVAGGAGYIVSRRLGAQKTPRLHVPAFARIGSFVGRRYKLIIVF
Ga0137389_1091449613300012096Vadose Zone SoilMSKVDPSRARWTGILVLVLIISLAELAALILDYQNNRYLRLYVSDNLSTIIVAAIGLLIVAGGAGYIVSRRLGVQRTPGPRVPAF
Ga0137389_1091971323300012096Vadose Zone SoilMMSGTSHSQARWIGIFVLVLIISLAGLTALIFDYQNNRYLRLYVSDNLSTIVVAAIGLLIVAGGAGYVVSRRLGVQKTPGPRVPAFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQ
Ga0137389_1180931413300012096Vadose Zone SoilMSKVDSSRARWIGILVLVLTISLAELAALILDYRSNRYLRLYVSDNLSTILLTAIGLLIVAGGAGYVVSRRLGVQKTPGPRVPAFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSSTSGRQSSTSQSALAQNLMSQEFPHPQS
Ga0137388_1015899133300012189Vadose Zone SoilMNKTGSSQARWIGIFVLVLILSLAELAALILDYQNNQYLRQYVSDNLDMIVAGSIGLLIIAGVAGYAVSRKLGARKTPGPRVPTFARIGAFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSNTSGGQSSTSQSAQAQNLIGQEFPHPKSNA
Ga0137388_1050822123300012189Vadose Zone SoilMMSGTSHSQARWIGIFVLVLIISLAGLTALIFDYQNNRYLRLYVSDNLSTIVVAAIGLLIVAGGAGYVVSRRLGVQKTPGPRVPAFARIGSFVGRRYKLIIVFWILLFAASFPLS
Ga0137364_1049312523300012198Vadose Zone SoilMTSRTGSSQTRLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLVTIIVAAIGLLIVSGGAGYAVSRKLGVPRIQGHRIPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQ
Ga0137365_1049177723300012201Vadose Zone SoilMTSRTGSSQARLVGIFVLVLIISLAGFAALLLDYQNNRYLRQYVSDNLATIIVAGIGLLIIAGGAGYIVFRRLGVSKTTGPRAPTFARISSFVGRRYKLIIIFWILLFAASFPLSQQLSQ
Ga0137365_1134904313300012201Vadose Zone SoilLVGIFVLVLIISLAEFAALILDYQNNRYLRKYVSDNLVTIILAAIGLLIVAGGAGHAVSRKVGVPRIQGPRIPTFARIGSF
Ga0137365_1136285313300012201Vadose Zone SoilMTSRTGSSQTRLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLATIIVAAIGLLIVAGGAGYAVSRKLGVPRIQGPRIPTFARIGSFVGRRYKLIIVFWIL
Ga0137363_1087135813300012202Vadose Zone SoilMSGTNVSQARWIGIFVLVLMISLAGLTALILDYQNNRYLRLYVSDNLSTIIVAAIGLLIVAGGAGYVVSRRLGIQKTPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSNTSGGQSST
Ga0137362_1016914733300012205Vadose Zone SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAAIGLLFVAGGAGYIVSKRLGIPKSPGPRVPTFARIGSF
Ga0137378_1076991613300012210Vadose Zone SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAAIGLLFVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSSTSGGQSGTSQSALAQNLMAQEFPHPQSNVSAII
Ga0137377_1090550013300012211Vadose Zone SoilLVGIFVLVLIISLAEFAALILDYQNNRYLRKYVSDNLVTIILAAIGLLIVAGGAGHAVSRKVGVPRIQGPR
Ga0137386_1131483713300012351Vadose Zone SoilMTSRTGSSQARLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLGIIIVVAIGLLIVAGGAEYVVSKRLGVPVSSGPRVPTFARIGSFVGRRYKLIIVFWI
Ga0137367_1073676613300012353Vadose Zone SoilLVGIFVLVLIISLAEFAALILDYQNNRYLRKYVSDNLVTIILAAIGLLIVAGGAGHAVSRKVGVPRIKGPRIPTFARIGSFVGRRYTLIIVFWILLFAASFPLSQQLSQVT
Ga0137371_1025767023300012356Vadose Zone SoilMTSRTGSSQTRLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLVTIIVAAIGLLIVAGGAGYAVSRKLGVPRIQGPRIPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTS
Ga0137371_1126530013300012356Vadose Zone SoilMTSRTGSSQARLVGIFVLVLIISLAGFAALLLDYQNNRYLRQYVSDNLATIIVAGIGLLIIAGGAGYIVFRRLGVSKTTGPRAPTFARISSFV
Ga0137384_1030899613300012357Vadose Zone SoilMTSRTGSSQARLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLGTIILAAIGLLIVAGGAGYVVSKRLGVPMSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSNTSGGQSS
Ga0137360_10011147113300012361Vadose Zone SoilMTSRTGSSQAGLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILVAIGLLFVAGGAGYVVSKRLGIPRSSGPRVP
Ga0137390_1043176223300012363Vadose Zone SoilMSKVDPSRARWTGILVLVLIISLAELAALILDYQNNRYLRLYVSDNLSTIVVAAIGLLIVAGGAGYVVSRRLGVQKTPGPRVPAFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLS
Ga0137390_1051540723300012363Vadose Zone SoilMSGTSISQARWIGIFVLVLIISLAGLTALILDYQNNRYLRLYVSDNLSTIIVAAIGLLIVAGGAGYVVSRRLGIQKTPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLS
Ga0137390_1182773213300012363Vadose Zone SoilMNKTGSSQARWIGIFVLVLILSLAELAALILDYQNNQYLRQYVSDNLDMIVAGSIGLLIIAGVAGYAVSRKIGVRKTPGPSVPTFARIGSFVGRRYKLIIVFWIL
Ga0137390_1185827813300012363Vadose Zone SoilMSGTDTSRARWIGILVLVLTISLAELAALILDYQNNRYLRLYVSDNLSTIIVAAIGLLIVAGGAGYIVSRRLGAQKTPRLHVPAFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSSTS
Ga0137396_1032904813300012918Vadose Zone SoilMMSKTGSSQARLIGIFVLVLILSLAELAALILDYQNNRYLRQYVSDNLGMIVAGSIGLLIIAGVAGYAVSRKLGVSKTPGSRVPTFARIGAFVGRRYRLIIVFWILLFAASFPLSQQLSQVTTSSTSGGQ
Ga0137396_1094458613300012918Vadose Zone SoilMMSGTGHSQARWIGIFILVLVISLAELTALILDYQNNRYLRLYVSDNLSTVIVASIGLLIVAGGAGYVVSRRLGAQKTPGSRIPAFARIGSFVGRRYKLIVVF
Ga0137416_1086714013300012927Vadose Zone SoilMSGSGSSQARLIGIFVLVLIISLAELAALILDYQNNRYLRQYVSDNLGTIVAAAIGFVVVASVAGYVVSRKLGAQKTPRPKVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSSTSGGQSSTSQSAL
Ga0137416_1103797513300012927Vadose Zone SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLGTIIVAAIGLLIVAGGAGYVVSKRLGVPMSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSSTSGGQSGTSQSALAQNLMAQEFPHPQ
Ga0137407_1224551213300012930Vadose Zone SoilMTSGTNVSQARWIGIFVLMLITSLAGLTALILDYQNNRYLRQYVSDNLGTTIVAAISLLTVSAAAGYVVSRRLGVPKTPGLRVPTFARIGSVVGRRYK
Ga0137410_1183182113300012944Vadose Zone SoilMMSGTSHSQARWIGIFILVLIISLAELTALILDYQNNRYLRLYVSDNLSTIIVAAIGLFIVAGGAGYVVSRRLGVQKTPGPRVPAFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSRTSGGQSSSSQSALAQNL
Ga0134077_1006788613300012972Grasslands SoilMTSGTGSSQARLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLGTIILAAIGLLIVAGGAGYVVSKRLGVPMSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSNTSGGQSSSSQSALAQN
Ga0134075_1023767423300014154Grasslands SoilMSETGPSQARWIAIFVLVLILALAELAALILDYQNNRYLRQYVSDNLGMIVAGSIGLLMIAGIAGYAVSRKLGIPKTPGPRVPT
Ga0134112_1007697723300017656Grasslands SoilMISGTNNSQARWIGIFVLMLIISLAGFTALILDYQNNRYLRQYVSDNLVTTIVAAIGLLIVAVGAGYVVSRRLGVPKTPGPRVPTFARIGS
Ga0187803_1035527113300017934Freshwater SedimentMSKTDPPRGRWVGIFVLVLILALAELAALILDYQNNIYLRQYVSDNLGAIVGAAIALLIVAVGAGYAVSRKLGVAKTPGPRIPTFAKIGIFVRRRYKLIILFWILIFVVSFPLSQELSQVVTSSTSGGQSSTSQSALAQNLMAQEFPHPQSNTSAIILVQSNDV
Ga0066662_1017668313300018468Grasslands SoilMRKSDSSQARSIGILVLVLIISLAELSALILDYQNNRYLRQYVSDNLGMIVVASIGLFIVAGTAAYGVSRKLAVPKTQGSRVPTFARIGAFVGRRYKLIIVFWILLFAASFPLSQQ
Ga0137417_105739213300024330Vadose Zone SoilMSRTNISQARWIGIFVLVLIISLAGLTALILDYQNNRYLRLYVSDNLGTIVVAAIGLFIVAGGAGYVVSKRLGVQKTPGPRVPAFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSSTSGF
Ga0209236_116101723300026298Grasslands SoilMTIRTGSSQAQLVGIFVLVLIVSLAEFSALILDYQNNRYLRQYVSDNLGTIIVFAIGLLIVAGGAGYVVSRRLGVPKTTAPSVPTFARIGSFVGRRYKI
Ga0209055_108483433300026309SoilMTSRTSSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGAIILATIGLLFVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILLFTASFPL
Ga0209239_129000713300026310Grasslands SoilMISGTNNSQARWIGIFVLMLIISLAGFTALILDYQNNRYLRQYVSDNLVTTIVAAIGLLIVAVGAGYVVSRRLGVPKTPGPRVPTFARIGSFVGRRYKLIIVFWILLFA
Ga0209761_115146023300026313Grasslands SoilMTSRNGSSQARLVGIFVLVLIISLAEFSALLLDYQNNRYLRQYVSDNLGTIILAAIGLLFVAGGAGYVVSKRLGIPRSSGPRVPTFARIGSFVGRRYKLIIVFWILL
Ga0209686_104964343300026315SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAATGLLFVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILLF
Ga0209154_124620613300026317SoilLIGIFVLVLIISLAEFAALLLDYQNNRYLRQYISDNLGTIILAAIGLLIVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIAFWILL
Ga0209471_115989013300026318SoilMTSKTGSSQARLVGIFVLVLIISLAGFAALLLDYQNNRYLRQYVSDNLGLIVVATIGLVIVAGGAGYVVSRRLGAPKTPGPRVPTFARIGSFVGKRYKLIILFW
Ga0209152_1024160923300026325SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILATIGLLFVAGGAGYVVSKRLGIPKSPGPRVPT
Ga0209801_102885013300026326SoilMTSRTGSSQARLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLGTIILAAIGLLIVAGGAGYVVSKRLGVPMSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASY
Ga0209802_122499913300026328SoilMKKTDSSHARWIGIFVLILVISLAELAALILDYQNNRYLRQYVSDNLGMIVAGPIGLLIIAGIAGYAVSRKLGVSKTPGSRVPTFARIGSFVG
Ga0209803_136975113300026332SoilMTSRTGSSQARLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLGTIILAAIGLLIVAGGAGYVVSKRLGVPMSPGPRVPTFARIGSFVGRRYKLIIV
Ga0209804_105859313300026335SoilMTSRTGSSQARLVGIFVLVLIISLAGFAALLLDYQNNRYLRQYVSDNLGLIVVATIGLVIVAGGAGYVVSRRLGAPKTPGPRVPTFARIGSFVGKRYKLIICGKFSVISATLASHYFKH
Ga0257177_108058813300026480SoilMSKVDPSRARWTGILVLVLTISLAELAALILDYQNNRYLRLYVSDNLSTIIVAAIGLLIVAGGAGYVVSRRLGVQKTPGPRVPAFARIGSFVGRRYKLIIVFWVLLFAASFPLSQQLSQVTTSSTSGGQSSASQSALAQNLMSQ
Ga0209806_110484923300026529SoilMTSKTGSSQARLVGIFVLVLIISLAGFAALLLDYQNNRYLRQYVSDNLGLIVVATIGLVIVAGGAGYVVSRRLGAPKTPGPRVPTFARIGSFVGKRYKLIILFWILLFAA
Ga0209806_110564213300026529SoilMRGTSISQARWIGIFVLVLIISLAELTALILDYQNNRYLREYVSDNLGMIVAATIGLVIVAGGAGYVVSRRLGVPKIPGPRVPTFARIGSFVGKRYKLIILFWILLFAA
Ga0209160_104512833300026532SoilMTIRTGSSQAQLVGIFVLVLIVSLAEFSALILDYQNNRYLRQYVSDNLGTIIVFAIGLLIVAGGAGYVVSRRLGVPKTTAPSVPTFARIGSFVGRRYKIIIIFWILLFAASFPLSQQLSHVTS
Ga0209160_131679813300026532SoilMRKSDSSQARSIGILVLVLIISLAELSALILDYQNNRYLRQYVSDNLGMIVVASIGLFIVAGTAAYGVSRKLAVPKTQGSRVPTFARIGAFVGRRYKPIIV
Ga0209058_114761123300026536SoilMSETGPSQARWIAIFVLVLILALAELAALILDYQNNRYLRQYVSDNLGMIVAGSIGLLMIAGIAGYAVSRKLGIPKTPGPRVPTFARIGAFVGRRYKLIIAFWILLFASSFPLSQQLAQVTTSSTSGGQSSTSQSAL
Ga0209056_10019712123300026538SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAATGLLFVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILLFTASFPLSQQLSQVTTSNTSGGQSGTSQSALAQNLMAQEFPHPQSNASAII
Ga0209056_1056181013300026538SoilLVGIFVLVLVISLAEFAALLLDYQNNRYLRQYVSDNLGAIILATIGLLFFAGGAGYVVSKRLGIPKSPGPRVPKFARIGSFVGRRYKLIIVFWILLFA
Ga0209161_1009699913300026548SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAAIGLLIVAGGAGYVVSMRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILL
Ga0209474_1029999913300026550SoilMTSRTSSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILATIGLLFVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRY
Ga0209577_1042625613300026552SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALLLDYQNNRYLRQYVSDNLGTIILAAIGLLIVAGGAGYVVSRRLGIPKSPGPRVPMFARIGSFVGRRYKLIIVFWILLFAASFPLSRQLSQVT
Ga0209076_107985723300027643Vadose Zone SoilMSKVDPSRASWTGILVLVLTISLAELAALILDYQNNRYLRLYVSDNLSTIILAAIGLLIVAGGAGYVVSKRLGVQKTPGPRVPAFARIGSFV
Ga0209588_105921513300027671Vadose Zone SoilMSGTYPSRARWIGIFVLVLIISLAGLTALILDYQNNRYLRLYVSDNLTSIIVASIGLLIVAGGAGFVVSRRLGVQKTPGPRVPAFARIGSFV
Ga0209178_127498913300027725Agricultural SoilMSKTGTSQARWVGIFVLVLTLFLAGLGALILDYQNNSYLRQYVSDNLGIVVAGSIGILIVAGIAGYLVSRRLGVTKTPGPITPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQLSQVTTSSTSGGQSS
Ga0209689_142521913300027748SoilLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLGTIILAAIGLLFVAGGAGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPL
Ga0209689_143467713300027748SoilMTSRTSSSQARLVGIFVLVLILSLAEFAALLLDYQNNPYLRQYVSDNLGTIILAAIGLLIVAGGGGYVVSKRLGIPKSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPL
Ga0209180_1005601313300027846Vadose Zone SoilLIGIFVLVLIVSLAELAALILDYQNNRYLRQYVSDNLGTIVAAAIGLAVVASVAGYVLSRKLGAQKTPRPKVPTFARIGSFVGRRYKLIIVFWILLFAASFPLSQQ
Ga0209180_1049286323300027846Vadose Zone SoilMSGTYPSRARWIGIFVLVLIISLAGLTALILDYQNNRYLRLYVSDNLSTIVVAAIGLLIVAGGAGYVVSRRLGVQKTPGPRVPAFARIGSFVGKRYKLIIV
Ga0209180_1069937813300027846Vadose Zone SoilMSGTNVSQARWIGIFVLVLIISLAGLTALILDYQNNRYLRLYVSDNLSTIIVAAIGLLIVAGGAGYVVSRRLGIQKTPGPRVPTFARIGSFVGRR
Ga0209701_1013746913300027862Vadose Zone SoilMSGTNISQARWIGIFVLVLIISLAGLAALILDYQNNRYLRLYVSDNLSTIIVAAVGLLIVAGGAGFVVSRRLGVQKTPGPRVPAFARIGSFVGRRYKLIIVFWIL
Ga0137415_1105443213300028536Vadose Zone SoilMTSRNGSSQARLVGIFVLVLIISLAEFAALILDYQNNRYLRQYVSDNLGTIIVAAIGLLIVAGGAGYVVSKRLGVPMSPGPRVPTFARIGSFVGRRYKLIIVFWILLFAASFPLS
Ga0307479_1048038923300031962Hardwood Forest SoilMSKTGTSQTRWVGIFVLVLTLSLAGLGALILDYQNNSYLRQYVSDNLGIVVAGSIGILIVAGIAGYLVSRRPGVTKTPGPSTPTFARIGSFVGRRYKLIIVFWILLFAA
Ga0307479_1169304613300031962Hardwood Forest SoilMSEADPSRSRWIGIFVLVLIISLAELAALILDYQNNPYLRQYVSDNLAAIVGAGIALLIVALGAGYAVSRKVGVAKTPGPRIPTFAKIGNFVRRRYKLIIIFWI


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.