NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F103782

Metagenome Family F103782

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103782
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 115 residues
Representative Sequence PAIPPPEQFQTTANSRLEQLRELCEAHGAKLIILVPPTPSSEDAVRQMTIASQKAGVDTLVPIDPTALSTKYYQPDEVHLNSEGAVLFTTALAEHLPQTIARESMASPD
Number of Associated Samples 81
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 11.11 %
% of genes near scaffold ends (potentially truncated) 5.94 %
% of genes from short scaffolds (< 2000 bps) 5.94 %
Associated GOLD sequencing projects 75
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (91.089 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(43.564 % of family members)
Environment Ontology (ENVO) Unclassified
(38.614 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(48.515 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 37.96%    β-sheet: 4.38%    Coil/Unstructured: 57.66%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF07977FabA 7.92
PF13432TPR_16 2.97
PF03062MBOAT 2.97
PF00589Phage_integrase 2.97
PF07238PilZ 1.98
PF13424TPR_12 1.98
PF13489Methyltransf_23 1.98
PF01966HD 0.99
PF07730HisKA_3 0.99
PF13620CarboxypepD_reg 0.99
PF00171Aldedh 0.99
PF01757Acyl_transf_3 0.99
PF00593TonB_dep_Rec 0.99
PF13683rve_3 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG07643-hydroxymyristoyl/3-hydroxydecanoyl-(acyl carrier protein) dehydrataseLipid transport and metabolism [I] 7.92
COG4706Predicted 3-hydroxylacyl-ACP dehydratase, HotDog domainLipid transport and metabolism [I] 7.92
COG0014Gamma-glutamyl phosphate reductaseAmino acid transport and metabolism [E] 0.99
COG1012Acyl-CoA reductase or other NAD-dependent aldehyde dehydrogenaseLipid transport and metabolism [I] 0.99
COG3850Signal transduction histidine kinase NarQ, nitrate/nitrite-specificSignal transduction mechanisms [T] 0.99
COG3851Signal transduction histidine kinase UhpB, glucose-6-phosphate specificSignal transduction mechanisms [T] 0.99
COG4230Delta 1-pyrroline-5-carboxylate dehydrogenaseAmino acid transport and metabolism [E] 0.99
COG4564Signal transduction histidine kinaseSignal transduction mechanisms [T] 0.99
COG4585Signal transduction histidine kinase ComPSignal transduction mechanisms [T] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A91.09 %
All OrganismsrootAll Organisms8.91 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005174|Ga0066680_10031391All Organisms → cellular organisms → Bacteria3009Open in IMG/M
3300005586|Ga0066691_10147163All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1353Open in IMG/M
3300007258|Ga0099793_10066025All Organisms → cellular organisms → Bacteria → Acidobacteria1626Open in IMG/M
3300010304|Ga0134088_10002290All Organisms → cellular organisms → Bacteria7656Open in IMG/M
3300012202|Ga0137363_11791728All Organisms → cellular organisms → Bacteria → Acidobacteria507Open in IMG/M
3300012206|Ga0137380_10034126All Organisms → cellular organisms → Bacteria4695Open in IMG/M
3300026296|Ga0209235_1075961All Organisms → cellular organisms → Bacteria → Acidobacteria1515Open in IMG/M
3300026315|Ga0209686_1206085All Organisms → cellular organisms → Bacteria → Acidobacteria534Open in IMG/M
3300026999|Ga0207949_1028552All Organisms → cellular organisms → Bacteria → Acidobacteria515Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil43.56%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil13.86%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil9.90%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil6.93%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.96%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil3.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.97%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.97%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.97%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.98%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil1.98%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere1.98%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.99%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018006Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300024286Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK28EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026496Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-AEnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026999Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF044 (SPAdes)EnvironmentalOpen in IMG/M
3300027548Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027567Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027855Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0066672_1080381913300005167SoilYRELVLLLKPQPSIPRPEQFETIADSRLLRLRELCESQGAKLILLVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPAALSVRYYQSDELHLNHEGAKLFTSALATFLPRTVDHEQVASPN
Ga0066680_1003139123300005174SoilLKPQPAVPPLHQFQTTANSRLEQLSELCQAHGAKLIILVPPTPSSEDAVRQMTIAAQNTGVDTLVPIDPTALSAKYYQPDELHLNPEGAALFTSALATFLPQTIARKSMALPDD*
Ga0066688_1035980513300005178SoilSIPPPEQFQTIANSRLERLRELCESHGAKLVILVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPAALSVKYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQAASPN*
Ga0066685_1024354933300005180SoilLLLKPQPSIPPPEQFQTIANSRLERLRELCESHGAKLVILVPPTPSSVDAVRQMTMAARRARVDTLVPIDPAALSVKYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQAASPN*
Ga0070705_10121977113300005440Corn, Switchgrass And Miscanthus RhizosphereILCHVVPRCKELFLLLKRQPPIPTPEEFQTIASSRLQRLRVLCEAHGAKLIILVPPTPSSEDAVRQMTIASHKAGVDALVPIDPDTLSARYYQPDELHLNSEGAQLFTSALAAFLPRTVDREQRASPN*
Ga0070681_1034322013300005458Corn RhizosphereVSLLKPQPTIPPPQQFQATASFRVAQLRDLCEKHGAKLIILVPPTPSSEAAVKQMTIASQNAGVDTLVPIDPTALSAKYYQPDELHLNSEGAELFTSALATFLPKTVDHGPVASPN*
Ga0066697_1063117313300005540SoilLLLKPQPSIPPPEQFQTIANSRLERLRELCESHGAKLVILVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPAALSVKYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQAASPN*
Ga0066692_1038016323300005555SoilELVLLLKPQPAVPPLHQFQTTSNSRLEQLSELCQAHGAKLIILVPPTPSSEDAVRQMTIAAQNTGVDTLVPIDPTALSAKYYQPDELHLNPEGAALFTSALATFLPQTIARKSMALPDD*
Ga0066704_1016453023300005557SoilLLLKPQPAVPPLHQFQTTANSRLEQLSELCQAHGAKLIILVPPTPSSEDAVRQMTIAAQNTGVDTLVPIDPTALSAKYYQPDELHLNPEGAALFTSALATFLPQTIARKSMALPDD*
Ga0066694_1003596253300005574SoilPHYRELVLLLKPQPSIPRPEQFETIADSRLLRLRELCESQGAKLILLVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPAALSVRYYQSDELHLNHEGAKLFTSALATFLPRTVDHEQVASPN*
Ga0066691_1014716323300005586SoilLCQAHGAKLIILVPPTPSSEDAVRQMTIAAQNTGVDTLVPIDPTALSAKYYQPDELHLNPEGAALFTSALATFLPQTIARKSMALPDD*
Ga0070762_1079132013300005602SoilAIPPAPQFQAIANPRLEQLRALCEAHGAKLILLVPPTPSSEDAVRQMTIASRNAGVDTLVPIDPTALSAKYYEPDELHLNPEGAAIFTAALAEFLTQTVVRKSMASPD*
Ga0066652_10001155343300006046SoilHYRELVLLLKPQPSIPPPEQFQTIANSRLERLRELCESHGAKLVILVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPAALSVKYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQAASPN*
Ga0070716_10036785623300006173Corn, Switchgrass And Miscanthus RhizosphereLVLLLKPQPVIPPAPQFQILAHSRLVRLEELCKANGAKLIILVPPTPSSEDAVRQMTSTSQEAGVDTLVPIDPASLSARYYQPDELHLNSEGAALFTVALAEFLPQTIFHEPLPSHN*
Ga0079222_1074545533300006755Agricultural SoilFETIANSRLERLRELCESHGAKLIILVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPAVLSARYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQVASPN*
Ga0066659_1129472613300006797SoilIFRHAIPHYRELVLLLKPQPSIPPPEQFQTIANSRLERLREVCESHGAKLVILVPPTPSSVDAVRQMTMAAQRARVDALVPIDPAALSVKYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQAASPN*
Ga0079220_1142583423300006806Agricultural SoilVPHYTELVLLLKPQPSIPSREQFQTIANSRLERLRELCESHGAKLIILVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPATLSARYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQVASPN*
Ga0099793_1006602513300007258Vadose Zone SoilPQQFQTIANSRLEQLRELCHAHGAKLIFLVPPTPSSKDGVRELIIASQKAGVDALVPIDPAALSAKYYEADELHLNSEGAALFTSALATFLPQTIAREPLVSHDD*
Ga0099793_1020597913300007258Vadose Zone SoilPVPPAREFQTMATTRLERLRELCEAHGAKLIILVPPTPSSEDAIHQMTIAAQRTRVDTLVPIDPAALSARYYQPDELHLNSEGAQLFTSALAAFLPRTVDREQTASPN*
Ga0099794_1002191313300007265Vadose Zone SoilPAIPPPEQFQTTANSRLEQLRELCEAHGAKLIILVPPTPSSEDAVRQMTIASQKAGVDTLVPIDPTALSTKYYQPDEVHLNSEGAVLFTTALAEHLPQTIARESMASPD*
Ga0099794_1013320433300007265Vadose Zone SoilANSRLEQLRELCQAHGAKLIILVPPTPSSEDGVRELTIASQKAGVDTLVPIDPTALSAKYYEADELHLNSEGAALFTSALATFLPQTIARKSMVSHD*
Ga0099795_1003042833300007788Vadose Zone SoilMLLKPLPAIPPTGEFQLLANSRLTQLDELCKANGSKLIILVPPTPSSEDAVRQMTTASQKAGVDTLVPIDPATLSVKYYKSDELHLNSEGAELFTLALAEFLPQTIFHEPLASRD*
Ga0099829_1162259613300009038Vadose Zone SoilLCALLIAALEISSDYLLKRQRPIPPATEFQNIANSRLERQLDLCEAHGAKVIVLVPPTPSSENTVRQMTIAAQRARVDTLVPIDQAALSARYYQPDEVHLNSAGAQLFTSALATFLPQTVDQEPAASPN*
Ga0099828_1098351823300009089Vadose Zone SoilEQLRELCQAHGAKLIILVPPTPSSEDGVRELTIASQKAGVDTLVPIDPTALSAKYYEADELHLNSEGAALFTSALATFLPQTIARKSMVSQD*
Ga0099792_1113582513300009143Vadose Zone SoilLAIPPAQQFQTMANERLERLRVLCDRYGAKLILLVPPTPSSADAVRQMTSVSAKAGVDALVPIDPISLPTRYYQADELHLNAEGEKLFTTALAAFLPETVGHKLFPATQ*
Ga0134082_1017156613300010303Grasslands SoilHAIPHYRELVLLLKPQPSIPPPEQFQTIANSRLERLRELCESHGAKLVILVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPAALSVKYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQAASPN*
Ga0134088_1000229043300010304Grasslands SoilLEQLSELCQAHGAKLIILVPPTPSSEDAVRQMTIAAQNTGVDTLVPIDPTALSAKYYQPDELHLNPEGAALFTSALATFLPQTIARKSMALPDD*
Ga0150983_1591515923300011120Forest SoilLERLRELCEAHGAKLIILLPPTPSSEDAVLQMTLASRKAGVDTLVPIDPTALSAKYYQSDELHLNSEGAQLFTSALATFLPKTLDHEPLASPN*
Ga0137391_1033175233300011270Vadose Zone SoilLLLLLKPQPAIPSPHQFQATANYRLEQLRELCQAHGAKLIILVPPTPSSEDGVRELTIASQKAGVDTLVPIDPTALSAKYYEADELHLNSEGAALFTSALATFLPQTIARKSMVSHDD*
Ga0137393_1029265213300011271Vadose Zone SoilQATANYRLEQLRELCQAHGAKLIILVPPTPSSEDGVRELTIASQKAGVDTLVPIDPTALSAKYYEADELHLNSEGAALFTSALATFLPQTIARKSMVSQD*
Ga0137393_1122384413300011271Vadose Zone SoilRSVIRTQILCRVVPHCKELFSLLKRQPPVPPAAEFQTMATTRLKRLRELCEAHGAKLIILVPPTPSSEDAVHQMTIAAQRIRVDTLVPIDPAALSARYYQPDELHLNSEGAQLFTSALAAFLPRTVDREQTASPN*
Ga0137388_1049781013300012189Vadose Zone SoilLLKPQPAIPSPHQFQATANYRLEQLRELCQAHGAKLIILVPPTPSSEDGVRELTIASQKAGVDTLVPIDPTALSAKYYEADELHLNSEGAALFTSALATFLPQTIARKSMVSQD*
Ga0137388_1083690313300012189Vadose Zone SoilPHYTELVSLLKPRPAIPPPHQFQTTANSRLEQLRELCQAHGTKLIILVPPTPSSEDGVRELIIASQKAGVDTLVPIDPTALSAKYYEPDELHLNPEGAALFTSALATFLPQTIARESMASPDD*
Ga0137363_1008261013300012202Vadose Zone SoilSHAVPHYKELVLLLKPQSAIPPPPQFQTTANYRLEQLRELCHAHGAKLIILVPPTPSSDDVVQQMTIASQKAGVDALVPIDPTALPAKYYQPDEVHLNSEGAALFTTALAEHLPQTIAREPMASPD*
Ga0137363_1075137023300012202Vadose Zone SoilWDTRSVIRTQILCHLLPHCKELFSLLKRQPPVPPAAEFQSMAITRLERLRELCEAHGAKLIILVPPTPSSEDAVHQMTIAAQRTRVDTLVPIDPAALSAKYYQSDELHLNSEGAQLFTSALAAFLPRTVDREQTASPN*
Ga0137363_1179172813300012202Vadose Zone SoilHSSVFWDTRAVLRTQTLRHAVPHYENLVLLLKPQPAFPPPQQFQTIANSRLEQLRELCHAHGAKLIFLVPPTPSSEDGVRELIIASQKAGVDALVPIDPTALSAKHYEADELHLNPEGAALFTSALATFLPQTIARESAVSREN*
Ga0137399_1002366713300012203Vadose Zone SoilHAHGAKLIFLVPPTPSSEDGVRELIIASQKAGVDALVPIDPAALSAKYYEADELHLNSEGAALFTSALATFLPQTIAREPLVSHDD*
Ga0137399_1055885913300012203Vadose Zone SoilQFQTIANSRLEQLRELCQAHGAKVIILVPPTPFSEDGVREFIMASQKAGVDTLVPIDPTALSAKYYEPDELHLNPEGAALFTSALATFLPQTIARESTVSHNN*
Ga0137399_1081442313300012203Vadose Zone SoilPPIPPPEQFQTIANSRLVRLRELCESHGAKLIILVPPTPSSEYAVRQMTIAAQRVRVDTLVPIDPAALSVRYYQSDELHLNPEGAKLFTIALATFLPRTVDHEQVASPN*
Ga0137362_1067898513300012205Vadose Zone SoilQILRHAVPHYRELVLLLKPQPSNPSPQQFETIADNRLERLRELCESHGAKLIILVPPTPSSVEAVRQMTVAAQRARVDTLVPIDPAILSARYYQSDELHLNPDGAKLFTAALATFLPRTVDHEAVASPN*
Ga0137380_1003412663300012206Vadose Zone SoilLEQLRELCQALGAKLIILVPPTPSSEDAVRQMTIAAQNTGVDTLVPIDPTALSAKYYQPDELHLNPEGAALFTSALATFLPQTIARKSMALPDD*
Ga0137360_1040378333300012361Vadose Zone SoilRTQILRHAVPHYRELVLLLKPQPSNPSPQQFETIADNRLERLRELCKSHGAKLIILVPPTPSSVEAVRQMTVAAERARVDTLVPIDPAILSARYYQSDELHLNPDGAKLFTAALATFLPRTVDHEAVASPN*
Ga0137360_1077333423300012361Vadose Zone SoilIPPPEQFQTTANSRLKQLRELCEAHNAKLIILVPPTPSSEDAVRQMTIASQRARVDALVPIDPTALSAKYYQPDELHLNSAGAALFTTALAEYLPQTIVRESMASPD*
Ga0137360_1132458713300012361Vadose Zone SoilKQILRHAIPHYKELVLLLKPQPAIPPPEQFQTIANQRLLRLRELCESHGAKLILLVPPTPSSEDAVHQMTMAAQRARVDTLVPIDPAALSVKYYQSDELHLNPEGARLFTSALANFLPRTMDHERAASPN*
Ga0137390_1093546713300012363Vadose Zone SoilILRHAVPHYRELLLLLKPQPAIPSPHQFQATANYRLEQLRELCQAHGAKLIILLPPTPSSEDGVRELIIASQKAGVDTLVPIDPRALSAKYYQPDELHLNSEGAALFTTALAEYLPQTIVRESMASPD*
Ga0137398_1008058943300012683Vadose Zone SoilRLRELCESHGAKLIILVPPTPSSVEAVRQMTVAAERARVDTLVPIDPAILSARYYQSDELHLNPDGAKLFTAALATFLPRTVDHEAVASPN*
Ga0137397_1088532613300012685Vadose Zone SoilGCKELFRLLKRQPSIPPATEFQTIASFRLERLRELCEAHGAKLIILVPPTPSSEDAVREMTIAAQRARVDTLVPIDPAALSTRYYQPDELHLNSEGAQLFTSALATFLPQTVDHEPVASPN*
Ga0137395_1078768113300012917Vadose Zone SoilPHYADLVLLLKPQPAIPPPRQFQATANSRLEQLRELCHAHGAKLIFLVPPTPSSEDGVRELIIASQKAGVDALVPIDPTALSAKYYEADELHLNPEGAALFTSALATFLPQTIARESAVSREN*
Ga0137396_1061498413300012918Vadose Zone SoilLRAQILRHAVPHYQELVLLLKPQPPIPPPEQFQTIANSRLVRLRELCESHGAKLIILVPPTPSSEYAVRHMTIAAQRVRVDTLVPIDPAALSVRYYQSDELHLNPEGAKLFTIALATFLPRTVDHEQVASPN*
Ga0137396_1113471713300012918Vadose Zone SoilAMTRLERLRELCEAHGVKLIILVPPTPSSEDAVHQMTIAAQRTRVDTLVPIDPAALSARYYQRDELHLNSEGAQLFTSALAAFLPRTVDREQRASPN*
Ga0137394_1005726533300012922Vadose Zone SoilIRTQILCRAVPHCKELFLLLKRQPPVPPAAEFQTTAMTRLVRLRELCEAHGAKLIILVPPTPSSEDAVHQMTIAAERTRVDTLVPIDPAALSARYYQPDELHLNSEGAELFTSALAAFLPRTVDREQRASPN*
Ga0137419_1141912823300012925Vadose Zone SoilMAITRLERLRALCEAHGAKLIILVPPTPSSEDAVRQMTMAAQRTQVDTLVPIDPAALSARYYQRDELHLNSEGAQLFTSALAAFLPRTVDREPTASPN*
Ga0137416_1069820713300012927Vadose Zone SoilTVPHYRELVLLLKPQSSILPPNQFQTTANSRLEQLRELCQAHGAKLIILLPPTPSSQDGVRELIIASQKAGVDTLVPIDPTALSAKYYKPDELHLNSEGAQLFTLALATFLPETVVRKSVASPD*
Ga0137407_1129747913300012930Vadose Zone SoilELFSLLKRQPPVPPAAEFQTMAITRLERLRELCEAHGAKLIILVPPTPSSEDAVHQMTIAAQRTRVDTLVPIDPAALSARYYQSDDLHLNSEGAQLFTSALAAFLPRTVDREHTASPN*
Ga0137407_1139944623300012930Vadose Zone SoilLEQLRELCQTHGAKLIILVPPTPSSGDAVRQMTIASQKAGVDTLVPIDPTALSAKYYKPDELHLNSEGTQLFTLAFATFLPETVVRKSVASPD*
Ga0137407_1241111313300012930Vadose Zone SoilLAHSSVFWNTRSVIRSQILYHVIPNSKELFLLLKRQPPIPPAPEFQSIADSRLRRLRELCEAHGARLIILVPPTPSSDDEVHQMTVAAQNAGVDALVPIPPAALSTRYYQPDELHLNSEGAELFTSALATFLPYTLNRDSVATPN*
Ga0126369_1038322413300012971Tropical Forest SoilYSRLQNVRTLCEEYGARLILLIPPTFSSEDAVREMTSACAKAGVESLVPIDPTALSVKYYQPDELHLNSEGAELFTSALATSLPASVSHYADSPRN*
Ga0134075_1007637813300014154Grasslands SoilLRTQIFRHAIPHYRELVLLLKPQPSIPPPEQFQTIANSRLERLRELCESHGAKLVILVPPTPSSVDAVRQMTMAARRARVDTLVPIDPAALSVKYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQAASPN*
Ga0137420_111867013300015054Vadose Zone SoilEFSVVGRLRELCEAHGAKLIILVPPTPSSEDAVRQMTLASRTAGVDTLVPIDPTALSVKYYQSDELHLNSEGAQLFTSALATFLPKTLDHEPLASPN*
Ga0137420_141644913300015054Vadose Zone SoilCLSSAPEPQPAFPPPQQFQTIANSRLEQLRELCHAHGAKLIFLVPPTPSSEDGVRELIIASQKAGVDALVPIDPAALSAKYYEADELHLNSEGAALFTSALATFLPQTIAREPLVSHDD*
Ga0134083_1054089013300017659Grasslands SoilPLHQFQTTANSRLEQLSELCQAHGAKLIILVPPTPSSEDAVRQMTIAAQNTGVDTLVPIDPTALSAKYYQPDELHLNPEGAALFTSALATFLPQTIARKSMALPDD
Ga0187804_1027606513300018006Freshwater SedimentDLCEAYGAKLILLIPPTLSSEDAVRQMTIASKKAGVNTLVPIDPAGLSAKYYQPDELHLNSEGAALFTTALAEFLPQTIVRESMASRD
Ga0179592_1007883233300020199Vadose Zone SoilPQPAIPPPQEFQTTANSRLKQLRELCRAHGAKLIILVPPTPSSDDAVQQMTIASQKAGVDTLVPIDSTALSAKYYQPDELHLNPEGAALFTTALAEYLPQTIVRKSMASPD
Ga0210403_1052727933300020580SoilANSRLENLRALCERHGAKLIILLPPTLSSEAAVKQMTIASQNAGVDTLVPIDPTALSAKYYQPDELHLNSEGAQLFTSALATFLPKTLDSEPVASPN
Ga0210403_1140728213300020580SoilNSRLELLRELCEAHGAKLIILFPPTPSSEDALLQMTIASRKAGVDTLVPIDPAALSAKYYQSDELHLNSEGAQLFTSALATFLPKTLDHEPLASPN
Ga0179596_1045229323300021086Vadose Zone SoilILYHVIPHSKDLFLLLKRQPPIPPAPEFQAIADSRLRRLRELCEAHGARLIILVPPTPSSDDEVHQMTLAAQNAGVDALVPIPPAALSNRYYQPDELHLNAEGAELFTSALATFLPHTLDRDPVAAPN
Ga0210406_1055358113300021168SoilVPHYKELLLLLKPQPAIPPSPQFQSMAIYRLERLRELCEAHGAKLIILVPPTPSSEDAIRQVTIASQKAGVDTLVPIDPARFPGKYYQPDELHLNSEGAALFTSALAEFLPETIVHKSMASPD
Ga0210405_1004138643300021171SoilSLLRTQVLRHTVPHYQELVSLLRPQSAIPPGPQFEAIANSRLEQLRALCEAHGAKLILLVPPTPSSEEAVRQITIASQKADVDALVPIDPAALPDKYYLSDELHLNSEGAALFTAALAEFLPQVTIRESMTSPD
Ga0210405_1046977723300021171SoilRFEATANSRLEQLRALCEAHGAKLILLVPPTPSSEDAVRQMTIASQKADVDALVPIDPAALPDKYYLSDELHLNSEGAAIFTTALAEFLPQVTIREPMTSPD
Ga0210394_1122642313300021420SoilSVLRTQILSHAVPHYRELVSLLKPQPAIPPALQFQTIANSRLENLRALCERHGAKLIILLPPTLSSEAAVKQMTIASQNAGVDTLVPIDPTALSAKYYQPDELHLNSEGAQLFTSALATFLPKTLDSEPVASPN
Ga0210394_1156328123300021420SoilKELFLLLKRQPPIPPATEFQTIANSRLRRLRELCEAHGAKLIILVPPTPSSEDAIREMTAAAQRARVDTLVPIDPATLPARYYQPDELHLNSQGAQLFTSALATFLPQTVDHEPVDSPN
Ga0210409_1127386113300021559SoilPHYRELVLLLKPQPAIPPFPQFQSMAMSRLERLRELCEAHGAKLIILVPPTPSSEDAIRQVTIASQKAGVDTLVPIDPTRFPGKYYQPDELHLNSEGAALFTSALAEFLPETIVHKSMASPN
Ga0247687_102902513300024286SoilVLRTQILSHTVPHYRELVSLLKPQPTIPPPQQFQATASFRVAQLRDLCEKHGAKLIILVPPTPSSEAAVKQMTIASQNAGVDTLVPIDPTALSAKYYQPDELHLNSEGAELFTSALATFLPKTVDHGPVASPN
Ga0137417_102995733300024330Vadose Zone SoilRQFQTTANSRLGRLRELCEAHGAKLIILVPPTPSSEDAVRQMTLASRTAGVDTLVPIDPTALSVKYYQSDELHLNSEGAQLFTSALATFLPKTLDHEPLASPN
Ga0207699_1037253713300025906Corn, Switchgrass And Miscanthus RhizosphereTQILSHTVPHYRELVSLLKPQPTIPPPQQFQATASFRVAQLRDLCEKHGAKLIILVPPTPSSEAAVKQMTIASQNAGVDTLVPIDPTALSAKYYQPDELHLNSEGAELFTSALATFLPKTVDHGPVASPN
Ga0207707_1029485523300025912Corn RhizosphereVSLLKPQPTIPPPQQFQATASFRVAQLRDLCEKHGAKLIILVPPTPSSEAAVKQMTIASQNAGVDTLVPIDPTALSAKYYQPDELHLNSEGAELFTSALATFLPKTVDHGPVASPN
Ga0209235_107596123300026296Grasslands SoilILRHAVPHYTELVLLLKPQPAVPPLHQFQTTANSRLEQLRELCQAHGAKLIILVPPTPSSEDAVRQMTIAAQNTGVDTLVPIDPTALSAKYYQPDELHLNPEGAALFTSALATFLPQTIARKSMALPDD
Ga0209236_104545733300026298Grasslands SoilLLLKPQPAVPPLHQFQTTANSRLEQLSELCQAHGAKLIILVPPTPSSEDAVRQMTIAAQNTGVDTLVPIDPTALSAKYYQPDELHLNPEGAALFTSALATFLPQTIARKSMALPDD
Ga0209686_120608513300026315SoilLLAHSSVFWDTRTVLRTQIFRHAIPHYRELVLLLKPQPSIPPPEQFQTIANSRLERLRELCESHGAKLVILVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPAALSVKYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQAASPN
Ga0209375_127713813300026329SoilYRELVLLLKPQPSIPPPEQFQTIANSRLERLRELCESHGAKLVILVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPAALSVKYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQAASPN
Ga0209377_126275513300026334SoilELVLLLKPQPAVPPLHQFQTTSNSRLEQLSELCQAHGAKLIILVPPTPSSEDAVRQMTIAAQNTGVDTLVPIDPTALSAKYYQPDELHLNPEGAALFTSALATFLPQTIARKSMALPDD
Ga0257157_108410313300026496SoilPIHPAAEFQGIADSRLRRLRELCEAHGARLIILVPPTPSSDDEVLQMTLAAQNAGVDALVPIPPAALSTRYYQPDELHLNSEGAKLFTSALATFLPHTLNRDSAATPN
Ga0209806_104348613300026529SoilEQFQTIANSRLERLRELCESHGAKLVILVPPTPSSVDAVRQMTMAAQRARVDTLVPIDPAALSVKYYQSDELHLNPEGAKLFTTALATFLPRSVDHEQAASPN
Ga0209648_1036731013300026551Grasslands SoilGELCQAHGAKLIILLPPTPSSEDGVRELIIASQKAGVDTLVPIDPRALSAKYYQPDELHLNSEGAALFTTALAEYLPQTIVRESMASPD
Ga0207949_102855213300026999Forest SoilSVFWDTRTVLRTQILRHAVPHYTELVSLLKPQPAIPPPHQFQTTANSRLERLRELCEAHGAKLIILLPPTPSSEDAVLQMTLASRKAGVDTLVPIDPTALSAKYYQSDELHLNSEGAQLFTSALATFLPKTLDHEPLASPN
Ga0209523_108159513300027548Forest SoilATPPAADLERLANERVQRLREICEAHRARLILLVPPTPSSAEADRLMTLAARRLGVETLVPLDPSSLSERYYQTDELHLNSEGAALFTSAVATALPKTVAQESTLSP
Ga0209115_112442223300027567Forest SoilPAPQFQAIANPRLEQLRALCEAHGAKLIILVPPTPSSEDAVHQLTIASQNAGVDTLVPIDPTALPDKYYESDQLHLNPEGAALFTSALAEFLPQTILRESMTSPD
Ga0209588_124445413300027671Vadose Zone SoilNSRLEQLRELCQAHGAKLIILVPPTPSSEDGVRELTIASQKAGVDTLVPIDPTALSAKYYEADELHLNSEGAALFTSALATFLPQTIARKSMVSHD
Ga0209693_1058207113300027855SoilFQAIANRRLGQLHSLCEAHGAKLIILVPPTLSSEGAVHQMTIASQNAGVDTLVPIDPTVLPAKYYLSDEVHLNPEGAALFTTALAEFLPQTVVRESMASPD
Ga0209701_1043931113300027862Vadose Zone SoilPAAEFQTMAITRLERLRELCEAHGVKLIILVPPTPSSEDAVHQMTIAAQRTRVDTLVPIDPAALSARYYQRDELHLNSEGAQLFTSALAAFLPRTVDREQRASPN
Ga0209488_1095749613300027903Vadose Zone SoilAVPHYENLVLLLKPQPAFPPPQQFQTIANSRLEQLRELCHAHGAKLIFLVPPTPSSEDGVRELIIASQKAGVDALVPIDPTALSAKYYEADELHLNPEGAALFTSALATFLPQTIARESAVSREN
Ga0310686_10365352223300031708SoilPPAWFQDVASSRLEDLRSLCDEHGAKLVLLVPPTPSSADSVHQMTLASQRAGVDSLIPIDPTTLSTKYYLSDELHLNAQGAALFTTALAEFLPQTIFRDTAASPD
Ga0310686_11380180323300031708SoilDLVTLLKPQHAIPLAPQFQAIANPRLQQLRTLCEAHGAKLIILIPPTPSSEGAVHQMTVASQKAGVDTLVPIDPTVLPAKYYLSDEVHLNPEGAALFTTALAEFLPQTIVHEPTASPD
Ga0307476_1001016953300031715Hardwood Forest SoilLLMKPQPAIPPPQQFQTIASSRLKKLRELCEAHGAKLILLVPPTLSSQEAVRQMAIASQKAGVDALVPIDPTALSARYYQPDELHLNSQGAALFTAALAEFLPQTVVREQD
Ga0307469_1173031713300031720Hardwood Forest SoilRSRLEMLRGLCEKHGAKLIILVPPTPSSEAAVKQMTIASQNAGVDTLVPIDPTALSAKYYQPDELHLNSEGAQLFTSALATFLPKTVGHELVASPN
Ga0307475_1066694223300031754Hardwood Forest SoilRTVLRTQILRHAVPHYTELVSLLKPQPAIPPPHQFQTTANSRLERLRELCEAHGAKLIILLPPTPSSEDAVLQMTLASRKAGVDTLVPIDPTALSAKYYQSDELHLNSEGAQLFTSALATFLPKTLDHEPLASPN
Ga0307475_1076134213300031754Hardwood Forest SoilSRLEQLRDLCQAHGAKLIILVPPTPSSEDGVREFVMASQKAGVDTLVPIDPTALSAKYYEPDELHLNPEGAALFTSALATFLPQTIARESMVSHDN
Ga0307475_1090657023300031754Hardwood Forest SoilIPPDPQFQATAQDRLKHLRELCEEHGAKLIILVPPTPSSEDALRQMTLASQKAGVDTLVPIDPKALSVRYYQSDELHLNSEGARLFTSALATFLPQTLDHEPVASTN
Ga0307478_1094886113300031823Hardwood Forest SoilVFWDTRGVIRTQILRHTVPHYQDLVSLLKPQSSIPPAPQFQSIASSRLEQLRKLCEAHGAKLIILIPPTLSSEDAVQQMTTASQRVGVDALVPIDPGALSPKYYLSDELHLNSEGAALFTTALAEFLPQTVVRESTASPD
Ga0307479_1005565813300031962Hardwood Forest SoilFWDTRTVLRTQILRHVIPHYKELVLLLKPQPAIPPPEQFQTIANQRLLCLRELCESHGAKIILLVPPTPSSEDAIHQMTMAAQRARVDTLVPIDPAALSVKYYQSDELHLNPEGARLFTSALANFLPRTMDHERAASPN
Ga0310810_1025677123300033412SoilVLRTQILSHTVPHYRELVSLLKPQPTIPPPQQFQATASFRVAQLRDLCEKHGAKLIILVPPTPSSEAAVKQMTIASQNAGVGTLVPIDPTALSAKYYQPDELHLNSEGAELFTSALATFLPKTVDHGPVASPN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.