NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F103525

Metagenome Family F103525

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103525
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 100 residues
Representative Sequence VSQAERLRRLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLLGLEAGAPSQRVFSPDEGLPGRTISTGKSYLV
Number of Associated Samples 93
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 75.00 %
% of genes near scaffold ends (potentially truncated) 3.96 %
% of genes from short scaffolds (< 2000 bps) 3.96 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction Yes
3D model pTM-score0.34

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (96.040 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(19.802 % of family members)
Environment Ontology (ENVO) Unclassified
(28.713 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(38.614 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 31.67%    β-sheet: 13.33%    Coil/Unstructured: 55.00%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.34
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF00578AhpC-TSA 44.55
PF16868NMT1_3 9.90
PF13231PMT_2 1.98
PF13492GAF_3 1.98
PF13185GAF_2 1.98
PF03328HpcH_HpaI 0.99
PF05378Hydant_A_N 0.99
PF01370Epimerase 0.99
PF01590GAF 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG0145N-methylhydantoinase A/oxoprolinase/acetone carboxylase, beta subunitAmino acid transport and metabolism [E] 1.98
COG0469Pyruvate kinaseCarbohydrate transport and metabolism [G] 0.99
COG2301Citrate lyase beta subunitCarbohydrate transport and metabolism [G] 0.99
COG38362-keto-3-deoxy-L-rhamnonate aldolase RhmACarbohydrate transport and metabolism [G] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A96.04 %
All OrganismsrootAll Organisms3.96 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000033|ICChiseqgaiiDRAFT_c0749885All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium707Open in IMG/M
3300009137|Ga0066709_101047721All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1197Open in IMG/M
3300012203|Ga0137399_10244224All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1471Open in IMG/M
3300013297|Ga0157378_10577515All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1133Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil19.80%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil15.84%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand12.87%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil6.93%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil6.93%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere5.94%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.95%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.96%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.96%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere1.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.98%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere1.98%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.98%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.99%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.99%
Unplanted SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Unplanted Soil0.99%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil0.99%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.99%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks0.99%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.99%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.99%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000033Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002886Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cmEnvironmentalOpen in IMG/M
3300004281Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 30 MoBioEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005353Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaGHost-AssociatedOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005544Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005615Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaGEnvironmentalOpen in IMG/M
3300006196Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1Host-AssociatedOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006894Agricultural soil microbial communities from Utah to study Nitrogen management - NC ControlEnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009795Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50EnvironmentalOpen in IMG/M
3300009803Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50EnvironmentalOpen in IMG/M
3300009807Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_0_10EnvironmentalOpen in IMG/M
3300009813Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20EnvironmentalOpen in IMG/M
3300009818Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40EnvironmentalOpen in IMG/M
3300009837Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30EnvironmentalOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012499Unplanted soil (control) microbial communities from North Carolina - M.Soil.2.yng.030610EnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014969Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaGHost-AssociatedOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300025918Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300027273Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027384Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027577Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027873Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027907Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027947Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027950Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300031640Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f23EnvironmentalOpen in IMG/M
3300031682Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f22EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300031995Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-2Host-AssociatedOpen in IMG/M
3300032179Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D2EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
ICChiseqgaiiDRAFT_074988513300000033SoilVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAMPHHYLAVCLQDSEKSGYLVHSXXSLDEGAPAQRVFSLD
INPhiseqgaiiFebDRAFT_10572974833300000364SoilVTQIERLTRLSEILAELFRSPIPSHFFQTLADQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLEDGVMSRRVFSVDEGLPGRAISTAKAYVVEDLTAEDGAPDLEGVLAAAGLRAALVVPIGSGLE
JGI1027J12803_10245323913300000955SoilMSQAERLKRLSEILIELFRSPIPTHFFQTLGDHAGSAVPHDYLAVCLADPEKGSYLVHTLAGLDAGAVSLRGFSLYEGLPGRAM
JGI1027J12803_10506889013300000955SoilVTQIERLTRLSEILAELFRSPIPSHFFQTLADQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLEDGAMGRRVFSVDKGLPGRAIST
JGI25382J37095_1023143023300002562Grasslands SoilMSQVERLTRLSEILVELFRSPVPTHFFQTLGDQAFAAVPNDYLAVCLLDPEKGGYVTHSLAALEGCQVSQRVFSGNESLPGWTISTGKARLVADLAVEDGVHDLEGVLIAAGMRAVMVAPIRRGSDVLGALLFAGRPPIT
JGI25612J43240_105350123300002886Grasslands SoilVSQAERLTRASEILIELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDPEKSGYLVHSLASVDEASPAQRVFSLDEGLPGRAMSTGKPYLVDELIAEDGVDDLEGVLAA
Ga0066397_1017457713300004281Tropical Forest SoilVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLRDSEKSGYLVHSLASLDEAAPAQRVFSLDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLAGAGLRAVLVVPIRRGLEVLGALLFAG
Ga0066672_1027759913300005167SoilMSQTERLTRLSEILIELFRSPIPTHFFQTLGDHASRVVPHDYLAVCLPDPEKGSYLVHTLAGLDAGAVSLRGFSLYEGL
Ga0066690_1011481533300005177SoilMSQVERLTRLSEILVELFRSPVPTHFFQTLGDQAFAAVPSDYLAVCLLDPEKGGYVTHSLAALEGCQVSQRVFSGNEGLPGWTISTGKARLVADLAVEDGVHDLEGVLIAAGMRAVMV
Ga0066684_1073904813300005179SoilVTRIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLEDGAMSRRVFSLD
Ga0066678_1082292223300005181SoilMSQAERLKRLSEILIELFRSPIPTHFFQTLGDHAGSAVPHDYLAVCLADPEKGSYLVHTLAGLDAGAVSLRGFSLYEGLPGRAMTTGQPQRIDDLTLVRD
Ga0065705_1047965523300005294Switchgrass RhizosphereVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYRVHSLASLDEAAPAQRVFSLDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLASAGLQAVLVVPIRRGLEILGAL
Ga0065707_1047063213300005295Switchgrass RhizosphereVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEAAPAQRVFSLDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLAAAGLRAVLVVPIRRGLEVL
Ga0070669_10030876633300005353Switchgrass RhizosphereVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEAAPAQRVFALDEGLPGRSMSTGKPYLIDELMAEDGVDD
Ga0070705_10122899713300005440Corn, Switchgrass And Miscanthus RhizosphereVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEAAPAQRVFALDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLASAGLRAVLVVPIRRGLEILGALLFAGRPPIRYG
Ga0070708_10213733913300005445Corn, Switchgrass And Miscanthus RhizosphereVTQIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQIDLEEGAMSRRVFSPDQGLPGRAISTGKAYIVEDLATEDGAPDLEGVLAA
Ga0070686_10146104423300005544Switchgrass RhizosphereMSQAERLKRLSEILIELFRSPVPTHFFQTLSDHAGSAVPHDYLAVCLADPEKGSYLVHTLAGLDAGAVSLRGFSLYEGLPGRAMTTGQPQRIDDLALVRDGVH
Ga0070696_10101689823300005546Corn, Switchgrass And Miscanthus RhizosphereVSQAERLSRASEILVELFRSPVPTHFFQTLGDRSGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEAVPTQRVFALDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLAS
Ga0066701_1029256513300005552SoilVSQVERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPERGGYLVHSQVDLEEGAMSRRVFSPDEGLPGRAISTGKAYLVEDLAAEDGAPDLEGVLAAAGL
Ga0066704_1005163533300005557SoilVTQIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPERGGYLVHSQVDLEEGAMSRRVFSPDEGLPGRAISTGKAYLVA
Ga0066698_1086040223300005558SoilMSQVERLTRLSEILVELFRSPVPTHFFQTLGDQAFAAVPNDYLAVCLQDPEKGGYVTHSLAALEGCQVSQRVFSGNEGLPGWTISTGKARLVADLAVEDGVHDLEGVLIAAGMRAVMVAPIRRG
Ga0070702_10164710023300005615Corn, Switchgrass And Miscanthus RhizosphereVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEAAPAQRVFALDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLASAGLRAVLVVPIRRGLEIL
Ga0075422_1029138623300006196Populus RhizosphereVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEAAPVQRVFSLDEGLPGRSMSTGKPYLIDELMAE
Ga0066659_1083799723300006797SoilVSQVERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLKDPEKGGYRVHSQVDLEEGAMSRRVFSPDEGLPGRAISTGKAYLVEDLAAEAGAPDLEGVL
Ga0075421_10046336913300006845Populus RhizosphereVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEAVPTQRVFALDEGLPGRSMSTGKPYLI
Ga0075425_10050866713300006854Populus RhizosphereMSQAERLKRLSELLIELFRSPIPTHFFQTLGDHAGTAMPHDYLAVCLADPEKGSYVVHTLAGLDAGAVSMRGFSLYEG
Ga0079215_1031859423300006894Agricultural SoilMSQAERLVRLNEILVELFRSPIPTHFFQTLADQASAAVPHDYLAVCLEDAEKGGYLVHSLAAVEGGVVAPRVFSPHEGLPGRVMREG
Ga0099791_1013501533300007255Vadose Zone SoilVNQAERLTRLSEILIELFRSPVPTHFFQTLGDRAGTAVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPGRTI
Ga0066709_10104772123300009137Grasslands SoilMSQVERLTRLSEILVELFRSPVPTHFFQTLGDQAFAAVPSDYLAVCLLDPEKGGYVTHSLAALEGCQVSQRVFSGNE
Ga0105243_1043127813300009148Miscanthus RhizosphereMSQAERLKRLSETLIELFRSPIPTHFFQTLGDHAGSAVPHEYLAVCLADPEKGSYLVHTLAGLDVGAVSLRGFSLYEG
Ga0111538_1178506023300009156Populus RhizosphereVSQAERLTRASEILVELFRSPVPTHFFQTLGDRAVSAVPSHYLAVCLQDSEKSGYLVHSLGTLDDGAPAQRVFSLDEGLPGRAMSTGKPYLIDELMAEDGVDDLEGVLAS
Ga0105059_102832723300009795Groundwater SandMSVGIGASQAERLARASEILVELFRSPVPTHFFQTLGDQASIAVPSDYLAVCLQDRENGGYLVHSLTSLDDGAVGQRVFLPHEGLP
Ga0105065_108103723300009803Groundwater SandVSQAERLTRLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLLRLEACAPSQRVFSPDEGLPGRTISTGKAYLVDDLAAQDGVHDLEGVLAAAG
Ga0105061_100474413300009807Groundwater SandVSQAERLRRLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPGRTISTG
Ga0105057_101054313300009813Groundwater SandMSVGTGVAQAERLARVSEILVELFRSPVPTHFFQTLGDQASMAVPSDYLAVCLQDRENGGYLVHSLASLDDGEVGQRVFLPHEGLPGQAMSTGRACLVTDLGAAEQ
Ga0105072_102527923300009818Groundwater SandVSQAERLRRLSELLVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPGRTISTGKSYLVDDLAAEDGVHDLEGVLVAAGLRAVLVVPIRR
Ga0105058_114885423300009837Groundwater SandVSQAERLRRLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPGRTISTGKSYLVDDLAAEDGV
Ga0126380_1111350823300010043Tropical Forest SoilVSQAERLTRVSEILVELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDSEKSGYLVHSLGSLDEGAPAQRVFSLDEGLPGRAMSTGKPYLIDELMAE
Ga0126382_1044473313300010047Tropical Forest SoilVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEGAPAQRVFSLDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLAGAGLRAVLVVPIRRG
Ga0126370_1075300523300010358Tropical Forest SoilVSQAERLTRASEILVELFRSSVPTHFFQTLGDRAGEAVPSHYLAVCLQDSEKSGYLVHSLGSLDDGAPAQRVFSLDEGLPGRAMSTGKPYLIDELMADDGVDDLEGGLAAAG
Ga0126376_1226440723300010359Tropical Forest SoilMSQAERLKRLSELIIELFRSPIPTHFFQTLGDHAGSAVPHDYLAVCLADPEKGSYVVHTLAGLDAGAVSMRGFSLYEGLPGRAMTTGQAQRIDDL
Ga0126372_1257948623300010360Tropical Forest SoilVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEGAPAQRVFSLDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLAGAGLRAVLVVPIRRGLEVLGALLFA
Ga0126377_1088332513300010362Tropical Forest SoilVSQAERLTRASEILVELFRSPVPTHFFQTLGDRAGAAVPSHYLAVCLQDSEKSGYLVHSLGSLDNGAPAQRVFSLDEGLPGRAMSTGKPYLIDELMADDGVDDLEGV
Ga0137388_1001766213300012189Vadose Zone SoilMSHAERLTRLSEILVELFRSPVPTHFFQTLADQAFAAVPNDYLAVCLQDPEKGGYLVHSLSGLEDRAVSQRVFSPDEGLPGRVIGLGKAQ
Ga0137363_1042671323300012202Vadose Zone SoilVTQIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLEDGAMRRRVFSLDEGLPGRAISTAKAYAIEDLTAEDGAPDLEGVLAAA
Ga0137399_1006824333300012203Vadose Zone SoilVSQAERLTRASEILIELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDPEKSGYLVHSLASVDEASPAQRVFSLDEGLPGRAMSTGKPSSSENTR*
Ga0137399_1010642013300012203Vadose Zone SoilVTRIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLDEGRISRRVFAPDEGLPGRAISTGKAYLVED
Ga0137399_1024422433300012203Vadose Zone SoilVSQAARLIRLSEILVELFRSPVPTHFFQTLADRTGAVMPSDYLAVCLQDLEKGGYLVHSLASLEAGSPSQRVFSPD
Ga0137362_1072235023300012205Vadose Zone SoilVTQIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQIDLEEGAMSRRVFSPDQGLPGRAISTGKAYIVEDLAT
Ga0137376_1090618613300012208Vadose Zone SoilVTRIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLEDGAMSRRVFSLDEGLPGR
Ga0137377_1019858713300012211Vadose Zone SoilMSHAERLTRLSEILVELFRSPVPTHFFQTLADQAFAVVPNDYLAVCLQDPEKGGYLVHSLAGLEEPAVSQRVFSPDEGLPGLAISTGKAQLRADLAADDGVDDLDGVLAGAGMRAVVFQDIGSVRVVDVPEPALED
Ga0137377_1055017913300012211Vadose Zone SoilMSQAERLKRLSEILIELFRSPIPTHFFQTLGDHAGSAVPHDYLAVCLADPEKGSYLVHTLAGLDAGAVSLRGFSLYEGLPGRAVTTGQPQRI
Ga0137367_1046725523300012353Vadose Zone SoilVSQAGRLRRLSEILVELFRSPVPTHFFQTLGDRAGAVVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPGRTISTGKSYL
Ga0137369_1033887413300012355Vadose Zone SoilVSQAERLRRLSEILVELFRSPVPTHFFQTLGDRAGTVVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPGRTISTGKSYLVDDLAA
Ga0137375_1024273713300012360Vadose Zone SoilVSQAGRLRRLSEILVELFRSPVPTHFFQTLGDRAGAVVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPG
Ga0157350_106012823300012499Unplanted SoilVNHAERLTRLNELLVELVRSPLPTHFFQTLADHAAGAVPHDYLAVCLVDPEKGGYLVHSLVGLDADAVSSRPFSPYEGLPGRVITT
Ga0137397_1073487523300012685Vadose Zone SoilLSKIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLDEGRISRRVFAPDEGLPGRAISTGKAYLVEDLTAEDGLPDVEGVLA
Ga0137396_1046479223300012918Vadose Zone SoilVSKIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLDEGRISRRVFAPDEGLPGRAISTGKAYLVEDLTAEDGVPDME
Ga0137359_1011518233300012923Vadose Zone SoilVTRIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLEDGAMRRRVFSLDEGLPGRAISTAKAYAIE
Ga0137419_1018397233300012925Vadose Zone SoilVSQAERLTRASEILIELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDPEKSGYLVHSLASVDEASPAPRVFSLDEGLPGRAM
Ga0126375_1111422913300012948Tropical Forest SoilVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEAAPAQRVFSLDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLAGAGLR
Ga0157378_1057751513300013297Miscanthus RhizosphereMSQAERLKRLSEILIELFRSPIPTHFFQTLGDHAGSAVPHDYLGVCLADPEKDSYLVHTLAGLDAGAVSLRGFSLYE
Ga0134081_1022671013300014150Grasslands SoilMSQTERLTRLSEILIELFRSPIPTHFFQTLGDHAGRVVPHDYLAVCLPDPEKGSYLVHTLAGLDAGAVSRRGFSLYEGVPGRAITTGQA
Ga0157376_1160170723300014969Miscanthus RhizosphereMSQAERLKRLSELLIELFRSPIPTHFFQTLGDHAGSAVPHDYLAVCLADPEKGSYVVHTLAGLDAGAVSLRGFSLYEGLPGRAMTTGQ
Ga0137405_108807213300015053Vadose Zone SoilVSQAERLTRASEILIELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDPEKSGYLVHSLASVDEASPAQRVFSLDEGLPGRAMSTGKPYLVDELIAEDAHR
Ga0137403_1009353743300015264Vadose Zone SoilLSKIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLDEGRISRRVFAPD
Ga0134089_1020652323300015358Grasslands SoilVSQAERLTRASEILIELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDPEKSGYLVHSLASVDEASPAQRVFSLDEGLPGRAMSTGKPYLVDELIAEDGVDDLEGVLVAAGLRAVLVVPIRRGLEM
Ga0134112_1003015533300017656Grasslands SoilVSQAERLRRLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLASLEGGALIQRVFSPDEGLPGRTISTG
Ga0134083_1006952013300017659Grasslands SoilVSQAERLTRLSEILIELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDPEKSGYLVHSLASVDEASPAQRVLSLDEGLPGRAMSTGKPYLVDELIAEDGVD
Ga0184632_1019590313300018075Groundwater SedimentVSQAERLRRLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLLGLEAGAPSQRVFSPDEGLPGRTISTGKSYLV
Ga0066662_1260689713300018468Grasslands SoilMSQAERLKRLSEILIELFRSPIPTHFFQTLGDHAGSAVPHDYLAVCLADPEKGSYLVHTLAGLDAGAVSLRGFSLYEGLPGRA
Ga0187894_1030783323300019360Microbial Mat On RocksVSQAERLNRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASVDEAAPAQRVYALDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLAGAGLRAVLVVPIRRGL
Ga0222623_1000912543300022694Groundwater SedimentVSQAERLTRLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYRVHSLLSLEAGAPSQRVFSPDEGLPGRTISTGKSYLVDDLAAEDGVHDLEGVLAAAGLRAVLV
Ga0207662_1063539523300025918Switchgrass RhizosphereMSQAERLKRLSEILIELFRSPIPTHFFQTLSDHAGSAVPHDYLAVCLADPEKGSYLVHTLAGLDAGAVSLRGFSLYEGLPGRAMTTGQPQRIDHLALVR
Ga0209438_109325923300026285Grasslands SoilMSQAERLKRLSEILIELFRSPIPTHFFQTLGDHAGSAVPHDYLAVCLADPEKGSYLVHTLAGLDAGAAVSLRGFSLYEGLPGRAMTTGQPQRIDDLTLV
Ga0209237_126726813300026297Grasslands SoilVSQAERLTRLSEILIELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLHDPEQGGYLVHSLASLEAGAPSQRVFSPDEGLPGRTISTGKSYLVDDLAAEDGVHDLEGVLAAAGL
Ga0209761_112425213300026313Grasslands SoilVNQAERLTRLSEILIELFRSPVPTHFFQTLGDRAGTAVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPGRTISTGRSYLVDDLAAEDGVHDLEGVLAAAGLRAVLVVPIRRGLE
Ga0209158_105888713300026333SoilVTRIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLEDGAMSRRVFSLDEGLPGRTISTSKAYVVED
Ga0209157_122759113300026537SoilVSQAERLTRASEILIELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDPEKSGYLVHSLASVDEASPAQRVFSLDEGLPGRAMSTGKPYLVDELMAEDGVDDLEGVLAAAGLRAVLVVPIR
Ga0209161_1057606023300026548SoilVTRIERLTRLSEILAELFRSPIPSHFFQTLGDQAWAAVPCDYLAVCLQDPEKGGYLVHSQVDLEDGAMSRRVFSLDEGLPGRTISTSKAYVVEDLAAEDGAPDLEG
Ga0209474_1057181913300026550SoilMSQTERLTRLSEILIELFRSPIPTHFFQTLGDHAGRVVPHDYLAVCLPDPEKGSYLVHTLAGLDAGAVSLRGFSLYEG
Ga0209886_102925123300027273Groundwater SandMSVGTGVAQAERLARVSEILVELFRSPVPTHFFQTLGDQASMAVPSDYLAVCLQDRENGGYLVHSLAGLDDGAVGQRVFRPHEGLPGRAMSTGRAYLVADLGSAEEAVRDLEGVLAASGLRAAL
Ga0209854_101503733300027384Groundwater SandMSVGTGVAQAERLARVSEILVELFRSPVPTHFFQTLGDQASMAVPSDYLAVCLQDRENGGYLVHSLAGLDDGAVGQRVFR
Ga0209854_102838223300027384Groundwater SandVNVETVVSHAERLARVSEILVELFRSPVPTHFFQTLGDQASMAVPFDYLAVCLRDQENDGYLVHSLASLDDGAVGQRVFSPREGLPGVVMSTGRAHLVADLGAAQGGVGDL
Ga0209854_105111923300027384Groundwater SandVSQAERLRRLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPGRTIS
Ga0209874_102447413300027577Groundwater SandVSQAERLRRLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPGRTISTGKSYLVDDLAAEDGVQDL
Ga0209388_110588613300027655Vadose Zone SoilVSQAERLTRASEILIELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDPEKSGYLVHSLASVDEASPAQRVFSLDEGLPG
Ga0209814_1005648913300027873Populus RhizosphereVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEAAPAQRVFALDEGLPG
Ga0207428_1107661613300027907Populus RhizosphereVSQAERLSRASEILVELFRSPVPTHFFQTLGDRAGEAVPHHYLAVCLQDSEKSGYLVHSLASLDEAAPAQRVFSLDEGLPGRSMSTGKPYLIDELMAEDGVDDLEGVLAG
Ga0209868_102109813300027947Groundwater SandVNVETVVSHAERLARVSEILVELFRSPVPTHFFQTLGDQASMAVPFDYLAVCLRDQENDGYLVHSLASLDDGAVGQRVFSPREGLPGVVMSTG
Ga0209885_101343513300027950Groundwater SandVSQAERLRRLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLLSLEAGAPSQRVFSPDEGLPGRTISTGRSYLVDDLAAEDGVHD
Ga0318555_1038953123300031640SoilMSQPDRLKRLSEILIEMFRSPIPTHFFQTLGDYAGSVVPHDYLAVCLADAEKGSYLVHSLAGLAPGAVSPRGFSFYEGLPGRAITTGQAQRIDD
Ga0318560_1025056223300031682SoilMSQPDRLKRLSEILIEMFRSPIPTHFFQTLGDYAGSVVPHDYLAVCLADAEKGSYLVHSLAGLAPGAVSPRGFSFYEGLPGRAI
Ga0307469_1025389723300031720Hardwood Forest SoilMSQVERLTRLSEILVELFRSPVPTHFFQTLGDQAFAAVPNDYLAVCLLDPEKGGYVTHSLAALEGCQVSQRVFSGNEGLP
Ga0307469_1114655023300031720Hardwood Forest SoilVSQAERLTRASEILVELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDSEKSGYLVHSLASLDEAAPTQRVFALDEGLPGRAMSTGKPYLIDELIAEDGVDDLEGVLAAAGLRAVLVVPIRRGL
Ga0214473_1160869713300031949SoilMSHAERLTRLNEILVELFRSPIPTHFFQTLADQAGAAVPHDYLAICLEDADKGGYLVHSLAAVEGGAPAPRVFSPHEGLPGRVMREGRACALE
Ga0307409_10197620513300031995RhizosphereMSQAERLVRLNEILVELFRSPIPTHFFQTLADQASAAVPHDYLAVCLEDAEKGGYLVHSLAAVEGGVVAPRVFSPH
Ga0310889_1013745113300032179SoilVNHAERLTRLNELLVELVRSPLPTHFFQTLADHAAGAVPHDYLAVCLVDPEKGGYLVHSLVGLDADAVSSRPFSPYEGLPGRVITTGHAHRIEDL
Ga0307471_10094309723300032180Hardwood Forest SoilVSQAERLARLSEILVELFRSPVPTHFFQTLGDRAGAAVPSDYLAVCLQDPEKGGYLVHSLASLEGGAPIQRVFSLDEGLPGRTISTGKSYLVDDLA
Ga0307471_10205166823300032180Hardwood Forest SoilVSQAERLTRASEILVELFRSPVPTHFFQTLGDRAGEAVPSHYLAVCLQDSEKSGYLVHSLGSLDEGAPAQRVFSLDEGLPGRAMSTGKPYLVDELIAEDGVDDLEGVLAAAGLRAVLVVPIRRGLEVL
Ga0307472_10140778813300032205Hardwood Forest SoilMSQAERLKRLSETLTELFRSPIPTHFFQTLGDHAGSAVPHEYLAVCLADPEKGSYLVHTLAGLDVGAVSLRGFSLYEGLPGRAMTTGQPQRIDDLTLV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.