NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F086544

Overview

Basic Information
Family ID F086544
Family Type Metagenome
Number of Sequences 110
Average Sequence Length 107 residues
Representative Sequence MIQTIAIILKFRESEAGRFEEMFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRQYILHVEVPGMAEHEEFDDHPEFLEFLPKARALQPEKPLVYFGTTLFKVGG
Number of Associated Samples 64
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 80.00 %
% of genes near scaffold ends (potentially truncated) 6.36 %
% of genes from short scaffolds (< 2000 bps) 21.82 %
Associated GOLD sequencing projects 63
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.71

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
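The percentages in the quality assessment can be turned back into approximate gene counts. The sketch below assumes each percentage is taken over all 110 family sequences; the page does not state the denominator, so this is an assumption, and the helper name `members_from_percent` is purely illustrative.

```python
def members_from_percent(percent: float, total: int = 110) -> int:
    """Convert a reported percentage back to a whole number of family members.

    `total` defaults to 110, the family size reported under Basic Information.
    Using it as the denominator for every percentage is an assumption.
    """
    return round(percent / 100 * total)


# Percentages copied from the Quality Assessment block.
reported = {
    "valid RBS motifs": 80.00,
    "near scaffold ends": 6.36,
    "from short scaffolds": 21.82,
}

# Map each label to its implied number of genes.
counts = {label: members_from_percent(pct) for label, pct in reported.items()}

for label, count in counts.items():
    print(f"{label}: about {count} of 110 genes")
```

Under that assumption, 6.36 % corresponds to 7 genes near scaffold ends and 21.82 % to 24 genes on short scaffolds.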
Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Unclassified (88.182 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(40.000 % of family members)
Environment Ontology (ENVO) Unclassified
(55.455 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(42.727 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 31.16%, β-sheet: 18.84%, Coil/Unstructured: 50.00%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.71
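The four pLDDT bands in the legend above can be applied to per-residue scores from a downloaded model. This is a minimal sketch. The function name `plddt_band` is hypothetical, and assigning fractional scores (for example 50.4) to the lower band is an assumption, since the legend only lists integer ranges.

```python
def plddt_band(score: float) -> str:
    """Return the legend band ("0-50", "51-70", "71-90", "91-100") for a pLDDT score.

    Band edges follow the legend on this page. Treating each upper bound as
    inclusive for fractional scores is an assumption.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must lie in [0, 100], got {score}")
    if score <= 50:
        return "0-50"
    if score <= 70:
        return "51-70"
    if score <= 90:
        return "71-90"
    return "91-100"


# Example per-residue scores (made-up values, for illustration only).
example_scores = [34.2, 66.0, 88.5, 95.1]
bands = [plddt_band(s) for s in example_scores]
print(bands)
```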

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.58.4.11: Dimeric alpha+beta barrel | d1y0ha_ | 1y0h | 0.8
d.58.4.0: Dimeric alpha+beta barrel | d2bbea_ | 2bbe | 0.8
d.58.4.0: Dimeric alpha+beta barrel | d3qmqa1 | 3qmq | 0.79
d.58.4.0: Dimeric alpha+beta barrel | d4dpoa1 | 4dpo | 0.79
d.58.4.11: Dimeric alpha+beta barrel | d2pd1a1 | 2pd1 | 0.75



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 110 Family Scaffolds
PF00849 | PseudoU_synth_2 | 10.91
PF01479 | S4 | 3.64
PF00892 | EamA | 2.73
PF01909 | NTP_transf_2 | 2.73
PF00296 | Bac_luciferase | 2.73
PF13427 | DUF4111 | 2.73
PF00440 | TetR_N | 1.82
PF12705 | PDDEXK_1 | 1.82
PF01243 | Putative_PNPOx | 1.82
PF04191 | PEMT | 1.82
PF13464 | DUF4115 | 1.82
PF13683 | rve_3 | 1.82
PF01252 | Peptidase_A8 | 1.82
PF12840 | HTH_20 | 1.82
PF07690 | MFS_1 | 1.82
PF00497 | SBP_bac_3 | 0.91
PF08264 | Anticodon_1 | 0.91
PF00583 | Acetyltransf_1 | 0.91
PF12680 | SnoaL_2 | 0.91
PF03167 | UDG | 0.91
PF05015 | HigB-like_toxin | 0.91
PF04226 | Transgly_assoc | 0.91
PF14486 | DUF4432 | 0.91
PF14579 | HHH_6 | 0.91
PF08327 | AHSA1 | 0.91
PF00872 | Transposase_mut | 0.91
PF08031 | BBE | 0.91
PF01619 | Pro_dh | 0.91
PF02811 | PHP | 0.91
PF13413 | HTH_25 | 0.91
PF14499 | DUF4437 | 0.91
PF13964 | Kelch_6 | 0.91
PF02678 | Pirin | 0.91
PF11716 | MDMPI_N | 0.91
PF08241 | Methyltransf_11 | 0.91
PF01872 | RibD_C | 0.91

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 110 Family Scaffolds
COG0564 | Pseudouridine synthase RluA, 23S rRNA- or tRNA-specific | Translation, ribosomal structure and biogenesis [J] | 10.91
COG1187 | Pseudouridylate synthase RsuA, specific for 16S rRNA U516 and 23S rRNA U2605 | Translation, ribosomal structure and biogenesis [J] | 10.91
COG0597 | Lipoprotein signal peptidase | Cell wall/membrane/envelope biogenesis [M] | 3.64
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 2.73
COG0262 | Dihydrofolate reductase | Coenzyme transport and metabolism [H] | 0.91
COG0277 | FAD/FMN-containing lactate dehydrogenase/glycolate oxidase | Energy production and conversion [C] | 0.91
COG0506 | Proline dehydrogenase | Amino acid transport and metabolism [E] | 0.91
COG0692 | Uracil-DNA glycosylase | Replication, recombination and repair [L] | 0.91
COG1573 | Uracil-DNA glycosylase | Replication, recombination and repair [L] | 0.91
COG1741 | Redox-sensitive bicupin YhaK, pirin superfamily | General function prediction only [R] | 0.91
COG1985 | Pyrimidine reductase, riboflavin biosynthesis | Coenzyme transport and metabolism [H] | 0.91
COG2261 | Uncharacterized membrane protein YeaQ/YmgE, transglycosylase-associated protein family | General function prediction only [R] | 0.91
COG3328 | Transposase (or an inactivated derivative) | Mobilome: prophages, transposons [X] | 0.91
COG3549 | Plasmid maintenance system killer protein | Defense mechanisms [V] | 0.91
COG3663 | G:T/U-mismatch repair DNA glycosylase | Replication, recombination and repair [L] | 0.91



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 88.18 %
All Organisms | root | All Organisms | 11.82 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300002120|C687J26616_10014984 | All Organisms → cellular organisms → Bacteria | 2990
3300002120|C687J26616_10253403 | Not Available | 539
3300002123|C687J26634_10061270 | All Organisms → cellular organisms → Bacteria | 1396
3300002123|C687J26634_10101934 | Not Available | 1025
3300002124|C687J26631_10056049 | All Organisms → cellular organisms → Bacteria | 1370
3300002503|C687J35164_10019656 | All Organisms → cellular organisms → Bacteria | 2256
3300002503|C687J35164_10152330 | Not Available | 668
3300010391|Ga0136847_11744835 | All Organisms → cellular organisms → Bacteria | 2618
3300012211|Ga0137377_11109504 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 721
3300018031|Ga0184634_10280985 | Not Available | 764
3300018031|Ga0184634_10379139 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Rhodothermaeota → Rhodothermia → Rhodothermales → unclassified Rhodothermales → Rhodothermales bacterium | 648
3300018074|Ga0184640_10062863 | All Organisms → cellular organisms → Bacteria | 1557
3300018082|Ga0184639_10491558 | Not Available | 621
(restricted) 3300024521|Ga0255056_10205199 | Not Available | 866
3300025002|Ga0209001_1079034 | Not Available | 520
3300025313|Ga0209431_10036706 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 3798
3300025313|Ga0209431_10185684 | All Organisms → cellular organisms → Bacteria | 1645
3300025313|Ga0209431_10523568 | Not Available | 899
3300025314|Ga0209323_10517339 | Not Available | 692
3300025322|Ga0209641_10254718 | All Organisms → cellular organisms → Bacteria | 1304
3300025326|Ga0209342_10300050 | Not Available | 1396
3300025327|Ga0209751_11261670 | Not Available | 535
3300031949|Ga0214473_10047061 | All Organisms → cellular organisms → Bacteria | 5053
3300031949|Ga0214473_10132648 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2903
3300031949|Ga0214473_10997389 | Not Available | 884
3300031949|Ga0214473_11754348 | Not Available | 615
3300031949|Ga0214473_12095503 | Not Available | 548
3300034089|Ga0373913_0053284 | Not Available | 1037
3300034419|Ga0373914_0141350 | Not Available | 612
3300034692|Ga0373917_0013206 | Not Available | 1029

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
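The scaffold identifiers listed above appear to combine a sample Taxon OID and a scaffold name, joined by `|`, with an optional `(restricted)` prefix. The sketch below splits an identifier on that reading. The layout is inferred from the entries on this page rather than from a documented format, and `ScaffoldRef` and `parse_scaffold_id` are hypothetical names.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric sample identifier before the "|"
    scaffold: str     # scaffold name after the "|"
    restricted: bool  # True if the entry carried a "(restricted)" prefix


def parse_scaffold_id(raw: str) -> ScaffoldRef:
    """Split an identifier such as '3300002120|C687J26616_10014984'.

    The '<taxon OID>|<scaffold>' layout is an assumption inferred from the
    Associated Scaffolds entries on this page.
    """
    text = raw.strip()
    marker = "(restricted)"
    restricted = text.startswith(marker)
    if restricted:
        text = text[len(marker):].strip()
    taxon_oid, sep, scaffold = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldRef(taxon_oid, scaffold, restricted)


ref = parse_scaffold_id("(restricted) 3300024521|Ga0255056_10205199")
print(ref.taxon_oid, ref.scaffold, ref.restricted)
```

Keeping the restricted flag alongside the parsed identifier makes it easy to skip entries that fall under the JGI data usage policy before any downstream processing.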




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 40.00%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 11.82%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 10.91%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 10.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.55%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 4.55%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 4.55%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 3.64%
Sediment Slurry | Engineered → Bioremediation → Metal → Unclassified → Unclassified → Sediment Slurry | 2.73%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.82%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 0.91%
Seawater | Environmental → Aquatic → Marine → Gulf → Unclassified → Seawater | 0.91%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.91%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.91%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.91%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.91%



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000550 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly | Environmental
3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental
3300002120 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 | Environmental
3300002123 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_3 | Environmental
3300002124 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_3 | Environmental
3300002503 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 | Environmental
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental
3300010391 | Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406 | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012358 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental
3300018031 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1 | Environmental
3300018074 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2 | Environmental
3300018082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300024521 (restricted) | Seawater microbial communities from Amundsen Gulf, Northwest Territories, Canada - Cases_109_1 | Environmental
3300025002 | Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 2 (SPAdes) | Environmental
3300025313 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes) | Environmental
3300025314 | Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 2 | Environmental
3300025322 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1 (SPAdes) | Environmental
3300025326 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2 (SPAdes) | Environmental
3300025327 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_1 (SPAdes) | Environmental
3300026296 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes) | Environmental
3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300028878 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117 | Environmental
3300028884 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195 | Environmental
3300031949 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197 | Environmental
3300034089 | Uranium-contaminated sediment microbial communities from bioreactor in Oak Ridge, Tennessee, United States - B5A4.2 | Engineered
3300034419 | Uranium-contaminated sediment microbial communities from bioreactor in Oak Ridge, Tennessee, United States - B5A4.3 | Engineered
3300034692 | Uranium-contaminated sediment microbial communities from bioreactor in Oak Ridge, Tennessee, United States - A1A0.3 | Engineered


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
F24TB_103364262 | 3300000550 | Soil | MTQTIAIILRFREDQADDFERMFEAEIMPLWHEFERDGKFLAASLTPVEGGSEAKTGIRDYILHVEVPGMGEHEEFDSHPRFLDFLPRARALQPEEPLVWFGTTQFKVP*
JGI10216J12902_1082367402 | 3300000956 | Soil | MSQTLAIILKFREDRAGEFEEMFRAEILPLWEEFLAQDKFIEASLTPIEGGIPAPEGQRHYILHVEVGGMADHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTLFKVGD*
JGI10216J12902_1103700503 | 3300000956 | Soil | MSQTLAIILRFREDRAGEFEEMFRAEILPLWEEFLGQGKFVEASLTPIEGGVPTPEGQRHYILHVEVLGMAEHEEFDQHPRFLEFLPRAKALQPEEPLVWFGTTLFKVGS*
C687J26616_100149844 | 3300002120 | Soil | MSQTNAIILRFREDETQRFEQLFEAEILPMWVQFKAQGKFLGASLTPVEGGSEVKEGVRDYILHVEVPSMAEHQEFDSSAQFXAFLDKAKPMQPEEPKVWLGTTRFQV*
C687J26616_102534031 | 3300002120 | Soil | IILRFREDEAQRFEELFETEILPMWQEFKAQGKFLAASLTPVEDGSEMKEGVRDYILHVEVPSMAEHKEFDSSARFLAFLAKAKPMQPEEPKVWLGTTRFQV*
C687J26634_100612701 | 3300002123 | Soil | MSQTNAIILRFREDETQRFEQLFEAEILPMWVQFKAQGKFLGASLTPVEGGSEVKEGVRDYILHVEVPSMAEHQEFDSSAQFLAFLDKAKPMQPEEPKVWLGTTRFQV*
C687J26634_101019341 | 3300002123 | Soil | MSQTNAIILTFREDKAQRFEKLFEAEILPMWQQFKAQGKFLAASLTPVEGGSEVKEGVRDYILHVEVPSMAQHEEFDSSPGFLDFLAKAKPMQP
C687J26631_100560492 | 3300002124 | Soil | MSQTNAIILRFREDETRRFEQLFEAEILPMWVQFKAQGKFLGASLTPVEGGSEVKEGVRDYILHVEVPSMAEHQEFDSSAQFLAFLDKAKPMQPEEPKVWLGTTRFQV*
C687J35164_100196561 | 3300002503 | Soil | MSQTNAIILRFREDETXRFEQLFEAEILPMWVQFKAQGKFLGASLTPVEGGSEVKEGVRDYILHVEVPSMAEHQEFDSSAQFXAFLDKAKPMQPEEPKVWLGTTRFQV*
C687J35164_101523301 | 3300002503 | Soil | MSQTNAIILRFREDEAQRFEELFETEILPMWQEFKAQGKFLAASLTPVEDGSEMKEGVRDYILHVEVPSMAEHKEFDSSARFLAFLAKAKPMQPEEPKVWLGTTRFQV*
JGI25385J37094_101908792 | 3300002558 | Grasslands Soil | MTQTIAIILKFRESEAGRFEEMFEAEVLPLWNQFLPQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHEEFDDHPEFVEFLPKARALQPEKPLVYF
Ga0066674_101187861 | 3300005166 | Soil | MTQTIAIILKFRESEAGRFEEMFEAEVLPLWKQFLAQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHKEFDDHPEFVEFLPKARALQPEKPLVYFGTTLFKVGG*
Ga0066678_109282971 | 3300005181 | Soil | MTQTIAIILKFRESEAGRFEEMFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHEEFDDHPEFVEFLPKGRALQPEKPLVYFGTTLFKVGG*
Ga0066686_111365772 | 3300005446 | Soil | PLWEEFAGQGKFLEASLTPVEGGGEEREGIRQYILHVEVPGMAEHEEFDDHPEFLEFLPKARALQPEKPLVYFGTTLFKVGG*
Ga0066689_100724552 | 3300005447 | Soil | MTQTIAIILKFRESEAGRFEEMFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHEEFDDHPEFVEFLPKARALQPEKPLVYFGTTLFKVGG*
Ga0070697_1014050521 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | VSQTIAIILKFGNADAEEFERMFEAEVLPLWNEFLAQGKFIGASLTPIEGGDPRPQDQRHYILHVEVPGMAEHEEFDEHPTFLDFLPRARALQPDEPNVWFGPTLFKVGR*
Ga0066697_101008942 | 3300005540 | Soil | MIQTIAIILKFRESEAGRFEEMFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRQYILHVEVPGMAEHEEFDDHPEFLEFLPKARALQPEKPLVYFGTTLFKVGG*
Ga0066695_104823772 | 3300005553 | Soil | MTQTIAIILRFGEADAEAFERIFEKEVMPLWREFARNGKFLSASLTPVEGGGQTQEGIRDYILHVEVPGMAEHEEFDSHPTFLDFLPRAKALQPEEPLVWFGTTRYRVP*
Ga0066698_100252733 | 3300005558 | Soil | MTQTIAIILKFRESEAGRFEALFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRQYILHVEVPGMAEHEEFDDHPEFLEFLPKARALQPEKPLVYFGTTLFKVGG*
Ga0066698_102801231 | 3300005558 | Soil | VSQTIAIILKFRDDMAGRFEEMFEAEILPLWNAFLAEGKFIEASLTPVQGGDRRPEGERHYILHVEVPGMAEHEEFDEHPRFLDFLPRAKALQPAEPNVWFGTTLFKVGG*
Ga0066705_106960201 | 3300005569 | Soil | VTQTIAIILKFRESEAPRFEEMFESEVLQLWNQFLADGKFIGASLTPIEGGDRRPEGERHYILHVEVPGMAEHEEFDEHPKFLDFLPRARELQPSEPNVWFG
Ga0081455_1000127414 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MSQTIAIILKFREDQAGEFEEMFRAEVLPLWEEFLAAGRFIGASLTPVEGGIKGPEGQGHYILHVEVPGMREHEQFDQHPRFLEFLPRARALQPEEPLVWFGTTLFKVGG*
Ga0066710_1000019997 | 3300009012 | Grasslands Soil | VSQTIAIILKFRDDMAGRFEEMFEAEILPLWNAFLAEGKFIEASLTPVQGGDRRPEGERHYILHVEVPGMAEHEEFDEHPRFLDFLPRAKALQPAEPNVWFGTTLFKVGG
Ga0066710_1000800423 | 3300009012 | Grasslands Soil | VTQTIAIILKFRESEAPRFEEMFESEVLQLWNQFLADGKFIGASLTPIEGGDRRPEGERHYILHVEVPGMAEHEEFDEHPKFLDFLPRARELQPSEPNVWFGTTLFKIGG
Ga0066710_1001850882 | 3300009012 | Grasslands Soil | MTQTIAIILRFGEADAEAFERMFEDEVMPLWHEFARDGKFLSASLTPVEGGGQTQEGIRHYILHVEVPGMAEHEAFDSHPRFLEFLPRAKALQPEEPLVWFGTTRYRVP
Ga0066710_1004962241 | 3300009012 | Grasslands Soil | GGPVSQTIAIILKFRDAEAERFEEMFQAEILPLWNEFLAQGKFIEASLTPIEGGDRRPEGERHYILHVEVPGMAEHEEFDEHPRFLDFLPRAKALQPAEPNVWFGTTLFKVGS
Ga0066710_1010150941 | 3300009012 | Grasslands Soil | MTQTIAIILKFRESEAGRFEEMFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHEEFDDHPEFVEFLPKARALQPEKPLVYFGTTLFKVGG
Ga0066710_1033554932 | 3300009012 | Grasslands Soil | MTQTIAIILKFRESEGGRFEAMFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHEEFDDHPEFLEFLPKARALQPEKPLVYFGTTLFKVGG
Ga0099827_100760605 | 3300009090 | Vadose Zone Soil | MTQTIAIILKFRESEAPRFEEMFRAEVMPLWEEFATKGKFIEASLSPVEGGGEEREGIRQYILHVEVPGMAEHEEFDEHPKFLEFLPKAHTLQPEKPLVFFGTTLFRVGA*
Ga0066709_1002043474 | 3300009137 | Grasslands Soil | MTQTIAIILKFRESEAGRFEEMFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHEEFDDHPEFVGFLPKARALQPEKPLVYFGTTLFKVGG*
Ga0066709_1005008003 | 3300009137 | Grasslands Soil | MTQTIAIILRFGEADAEAFERMFEDEVMPLWHEFARDGKFLSASLTPVEGGGQTQEGIRHYILHVEVPGMAEHEAFDSHPRFLEFLPRAKALQPEEPLVWFGTTRYRVP*
Ga0066709_1014593973 | 3300009137 | Grasslands Soil | MFESEVLQLWNQFLADGKFIGASLTPIEGGDRRPEGERHYILHVEVPGMAEHEEFDEHPKFLDFLPRARELQPSGPNVWFRTTLFKIGG*
Ga0114129_102309174 | 3300009147 | Populus Rhizosphere | MSQTIAIILKFRESEAGRFEEIFEAEILPLWHRFLADGKFIGASLTPIEGGDPRPEGERHYILHVEVPGMAEHEEFDEHPTFLEFLPRAKALQPKPPNVWFGNTRFKVGG*
Ga0134080_102181432 | 3300010333 | Grasslands Soil | MTQTIAIILKFRESEAGRFEEMFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHKEFDDHPEFVEFLPKARALQPEKPLVYFGTTLFKVGG*
Ga0136847_117448352 | 3300010391 | Freshwater Sediment | MSQTNAIILRFRKDEAQRFEELFEAEILPMWKQFKAQGKFLAASLTPVEGGSEVKEGVRDYILHVEVPSMAEHEEFDSSARFSAFLAKAKPMQPEEPKVWLGTTRFQV*
Ga0137364_114421971 | 3300012198 | Vadose Zone Soil | GGAMTQTIAIILRFGKADAEAFERMFEDEVMPLWQEFARNGKFLSASLTPVEGGGQTQEGIRDYILHVEVPGMAEHEEFDSHPTFLDFLPRAKALQPEEPRVWFGTTRYRVP*
Ga0137365_100044587 | 3300012201 | Vadose Zone Soil | MSQTLAIILKFREDRTGEFEEMFRAEILPLWEEFLAQDKFIEASLTPIEGGIPAPEGQRHYILHVEVGGMADHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTLFKVGD*
Ga0137365_101605382 | 3300012201 | Vadose Zone Soil | MTQTLAIILKFREDHAGQFEEMFRAEILPLWEEFLAHGKFIEASLTPIEGGIPAPDGQRHYILHVEVPGMTEHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTLLKVGG*
Ga0137365_105388212 | 3300012201 | Vadose Zone Soil | MSQTIAIVLRFREDQARQFEEMFQTEVLPLWEKFLAAGKFIEASLTPVEGGIQRPEGQRHYILHVEVPGMREHEEFDQHPRFLEFLPRARALQPEQPLVWFGTTRFKVGG*
Ga0137374_100369304 | 3300012204 | Vadose Zone Soil | MTQTIAIILKFRDEQAGQFEEMFRAEVMPLWEEFSAQGKFIDASLSPVEGGGEEREGIRQYILHVEVPGMAEHEEFDEHPRFLEFLPKARALQPAKPLVYFGTTLFKVSG*
Ga0137374_100385002 | 3300012204 | Vadose Zone Soil | MSQTLAIILKFREDRTGEFEEMLRAEILPLWEEFLAQDKFIEASLTPIEGGIPAPEGQRHYILHVEVGGMADHEEFDERPRFLEFLPRAKALQPEEPLVWFGTTLFKVGD*
Ga0137374_100408025 | 3300012204 | Vadose Zone Soil | MSQTIAIILRFREDQAGEFEEMFRAEVLPLWEEFLAEGKFLKASLTPVEGGIKGPEGQRHYILHVEVPGMREHEQFDQHPRFLEFLPRARALQPQEPLVWFGTTLFNVGG*
Ga0137374_101153541 | 3300012204 | Vadose Zone Soil | MSQTIAIILRFREDRAEEFEELFRAEVLPLWEEFLAAGRFIRASLTPVEGGIKGPEGQRHYILHVEVPGMREHEQFDQHPRFLEFLPRARALQPQEPLVWFGTTLFKVGG*
Ga0137374_101326664 | 3300012204 | Vadose Zone Soil | MFRAEILPLWEEFLGQGKFVGASLTPIEGGVPTPERQRHYILHVEVPGMAEHEEFDEHPGFLEFLPRAKALQPEEPLVWFGTTLFKVGG*
Ga0137374_106179752 | 3300012204 | Vadose Zone Soil | VTQTIAIILKFRESEADEFERMFEAEIMPLWNQFLAQGKFIGASLTPIEGGDRRPEGERHYILHVEVPGMAEHEEFDEHPKFLDFLPRAKALQPVGPNVWFGTTLFKVGV*
Ga0137380_100086783 | 3300012206 | Vadose Zone Soil | MTQTLAIILKFREDQAGQFEEMFRAEILPLWEEFLAQGKFIEASLTPIEGGIPAPDGQRHYILHVEVPGMTEHEEFDEHPRFLEFLPRAKALQPETPLVWFGTTLLKVGGRRLAADGADGHGPGT*
Ga0137380_100716731 | 3300012206 | Vadose Zone Soil | MTQTLAIILKFREDHAGQFEEMFRAEVLPLWEEFLAQGKFIEASLTPIEGGIPAPDGQRRYILHVEVPGMAEHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTLLKVGG*
Ga0137380_112373312 | 3300012206 | Vadose Zone Soil | VSQTIAIILRFREDEAESFEAMFEAEVMPLWHQFARDGKFLAASLTPVEGGGETKEGTRAYILHVEVPGMAEHEEFDSHPRFLDFLPKAKALQPEEPVVWFGTTRFQVP*
Ga0137376_103283532 | 3300012208 | Vadose Zone Soil | MTQTIAIILKFRESEAGRFEEMFEAEVLLLWNQFLAQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHEEFDDHPEFVEFLPKGRALQPEKPLVYFGTTLFKVGG*
Ga0137379_101980141 | 3300012209 | Vadose Zone Soil | MTQTLAIILKFREDHAGQFEEMFRAEILPLWEEFLAQGKFIEASLTPVEGGIPAPDGQRHYILHVEVPGMTEHGEFDEHPRFLEFLHRAKALQPEEPLVWFGTTLLKAGG*
Ga0137378_112339362 | 3300012210 | Vadose Zone Soil | MSQTLAIILKFREDRTGEFEEMFRAEILPLWEEFLAQDKLIEASLTPIEGGIPAPEGQRHYILHVEVGGMADHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTLFKVGD*
Ga0137377_111095041 | 3300012211 | Vadose Zone Soil | MDDGRPTTDSAKRETRETTDDSEEHMSQTIAIILRFRDEQARQFEDLFKAEVYPLWQQFKAQGKFITASLTPVQDGSEIKEGVRDYILHVEVPGMAEHNEFDSLPRFLKFLEKARPMQPE
Ga0137372_102885602 | 3300012350 | Vadose Zone Soil | MSQTLAIVLRFREEQVADFEQMFRAEILPLWEEFLAQGKFIGASLTPIEGGIPTPEGQRHYILHVEVPGMREHEEFDEHPRFLDFLPRAKALQPEEPLVWFGTTMFKVGG*
Ga0137372_104356092 | 3300012350 | Vadose Zone Soil | MSQTIAIILKFREDKAGEFEEMFRAEILPLWDEFLAEGKFIQASLTPVEGGVPGPEGQRHYILHVEVPGMSEHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTMFKVGG*
Ga0137372_107095622 | 3300012350 | Vadose Zone Soil | MSQTLAILLRFREDGAGEFEEMFRAEILPLWEEFLAQGKFIEASLTPIEGGIPTPEGQRHYILHVEVPGMREHEEFDEHPQFLDFLPRAKALQPEEPLVWFGTTLFKVG*
Ga0137372_109997761 | 3300012350 | Vadose Zone Soil | VSQTIAIILRFREDEAATFESMFEAEIMPLWDQFMREGKFLAASLTPVEGGGEAKEGIRDYILHVEVPGMAEHHEFDSHPKFLDFLPRAQSLQPEVPLVWFGTTRFQVP*
Ga0137386_101436522 | 3300012351 | Vadose Zone Soil | KFREDHAGQFEEMFRAEILPLWEEFLAHGKFIEASLTPIEGGIPAPDGQRHYILHVEVPGMTEHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTLLKVGG*
Ga0137386_107002431 | 3300012351 | Vadose Zone Soil | MTQTLAIILKFREDNAGQFEEMFRAEILPLWEEFLAQGKFIEASLTPVEGGIPAPDGQRHYILHVEVPGMTEHGEFDEHPRFLEFLHRAKALQPEEPLVWFGTTLLKVGG*
Ga0137386_107894261 | 3300012351 | Vadose Zone Soil | LKFREDQAGQFEEMFRAEILPLWEEFLAQGKFIEASLTPIEGGIPAPDGQRHYILHVEVPGMTEHEEFDEHPRFLEFLPRAKALQPETPLVWFGTTLLKVGGRRLAADGADGHGPGT*
Ga0137367_100177966 | 3300012353 | Vadose Zone Soil | MSQTLAIILKFREDRTGEFEEMLRAEILPLWEEFLAQDKFIEASLTPIEGGIPAPEGQRHYILHVEVGGMADHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTLFKVGD*
Ga0137367_100318333 | 3300012353 | Vadose Zone Soil | MSQTLAIVLRFREEQAADFEQMFRAEILPLWEEFLAQGKFIGASLTPIEGGIPTPEGQRHYILHVEVPGMREHEEFDEHPRFLDFLPRAKALQPEEPLVWFGTTMFKVGG*
Ga0137367_105677561 | 3300012353 | Vadose Zone Soil | VTQTIAIILKFRESEADEFERMFEAEIMPLWNQFLAQGKFIGASLTPIEGGDRRPEGERHYILHVEVPGMAEHEEFDEHPRFLDFLPRAKDLQPAGPNVWFGTTLFKVGG*
Ga0137367_106266511 | 3300012353 | Vadose Zone Soil | MFRAEILPLWEEFLGQGKFVGASLTPIEGGVPTPERQRHYILHVEVPGMAEHEEFDEQPGFLVFLPRAKALQPEKPPLSFATTLFKVGGSSLTRR
Ga0137367_110275891 | 3300012353 | Vadose Zone Soil | MTQTIAIILKFRDEQAGQFEEMFRAEVMPLWEEFSAQGKFIDASLSPVEGGGEEREGIRQYILHVEVPGMAEHEEFDEHPRFLEFLPKARALQPAKPLVYFGTTLFKVGG*
Ga0137366_100743422 | 3300012354 | Vadose Zone Soil | MTQTIAIILRFGEADAEAFERMFEGEVMPLWREFARTGKFLSASLTPVEGGGQRQEGIRDYILHVEVPGMAEHEEFDSNPEFLAFLPRAKALQPRPPLVWFGTTLHQVP*
Ga0137369_1000095626 | 3300012355 | Vadose Zone Soil | MTQTIAIILKFRDEQAGQFEEMFRAEVMPLWEEFSAQGKFIDASLSPVEGGGEEREGIRQYILHVEVPGMAEHEEFDEHPRFLEFLPKARALQPEKPLVYFGTTLFKVGG*
Ga0137369_100389982 | 3300012355 | Vadose Zone Soil | MSQTLAIILKFREDRTGEFEEMFRAEILPLWEEFLAQDKFIEASLTPIEGGIPAPEGQRHYILHVEVGGMTDHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTLFKVGD*
Ga0137371_103007391 | 3300012356 | Vadose Zone Soil | MTQTIAIILRFGEADAEAFERMFEDEVMPLWQEFARNGKFLSASLTPVEGGGQTQEGIRHYILHVEVPGMAEHEAFDSHPKFLDFLPRAKALQPEQPLVWFGTTRYRVP*
Ga0137368_100420242 | 3300012358 | Vadose Zone Soil | MTQTIAIILKFRDEQAGQFEEMFRAEVMPLWEEFSAQGKFIAASLSPVEGGGEEREGIRQYILHVEVPGMAEHEEFDEHPRFLEFLPKARALQPEKPLVYFGTTLFKVGG*
Ga0137368_102456452 | 3300012358 | Vadose Zone Soil | MSQTLAIILKFHEDRAGEFEEMFRAEVLPLWEEFLAQGKFFEASLTPIEGGVPIPKGQRHYILHIEVPGMREHEEFDEHPRFLEFLPRAKALQPEEPFVWFGTTLFKVGSEV*
Ga0137368_104866862 | 3300012358 | Vadose Zone Soil | MTQTIAIILKFRESEADEFERMFEAEIMPLWNQFMAQGRFIGASLTPIEGGDLRPEGERHYILHVEVPGMAEHEEFDEHPKFLDFLPRAKALQ
Ga0137385_100258817 | 3300012359 | Vadose Zone Soil | MAGGGRMTQTLAIILKFREDHTGQFEEMFRAEVLPLWEEFLAQGKFIEASLTPIEGGIPAPDGQRRYILHVEVPGMAEHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTLLKVGG*
Ga0137385_104108831 | 3300012359 | Vadose Zone Soil | MTQTLAIILKFREDNAGQFEEMFRAEILPLWEEFLAQGKFIESSRTPVEGGIPAPHGQRHYILHVEVPGMTEHGEFDEHPRFLEFLHRAKALQPEEPLVWFGTTLLKAGG*
Ga0137375_101116954 | 3300012360 | Vadose Zone Soil | MSQTLAILLRFREDGAGEFEEMFRAEILPLWEEFLAQGKFIEASLTPIEGGIPTPEGQRHYILHVEVPGMREHEEFDEHPQFLDFLPRAKALQPEEPLVWFGTTLF
Ga0137373_101276613 | 3300012532 | Vadose Zone Soil | MTQTIAIILKFRDEQAGQFEEMFRAEVMPLWEEFSAQGKFIDASLSPVEGGGEEREGIRQYILHVEVPGMAEHEEFDEHPRFLEFLPKARALQPEKPLVYFGTTLFKVSG*
Ga0137373_102854373 | 3300012532 | Vadose Zone Soil | VTQTIAIILKFRESEADEFERMFEAEIMPLWNQFLAQGKFIGASLTPIEGGDRRPEGERHYILHVEVPGMAEHEEFDEHPRFLDFLPRAKDLQPAGPNVWFGTTLFKVGV*
Ga0137373_106544382 | 3300012532 | Vadose Zone Soil | MSQTLAIVLRFREEQAGVFEEMFRAEIIPLWEEFLAQGKFIEASLTPIEGGVPAPEGQRHYILHVEVPGMREHEEFDEHPKFLDFLPRAKALQPEEPLVWFGTTMFKVGG*
Ga0137373_107651852 | 3300012532 | Vadose Zone Soil | MTQTIAIILKFRESEADEFERMFEAEIMPLWNQFMAQGRFIGASLTPIEGGDLRPEGERHYILHVEVPGMAEHEEFDEHPKFLGFLPRAKALQPVGPNVWFGT
Ga0134087_107504792 | 3300012977 | Grasslands Soil | MTQTIAIILKFRESEAGRFEEMFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHEEFDDHPRFLEFLPRATALQPEKPL
Ga0134078_106174811 | 3300014157 | Grasslands Soil | EMFQAEITTLWEEFAGQGKFLEASLTPVEGGGEEREGIRQYILHVEVPGMAEHKEFDDHPEFVEFLPKARALQPEKPLVYFGTTRFKVGG*
Ga0134089_101484102 | 3300015358 | Grasslands Soil | VSQTIAIILRFREDEAGSFESMFESEIMPLWHQFARDGKFLAASLTPVEGGGEMKEGTRGYILHVEVPGMAEHEEFDSHPRFLDFLPKAKALQPEEPLVWFGTTRFQVP*
Ga0134089_105553962 | 3300015358 | Grasslands Soil | KFRESEAGRFAALFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRQYILHVEVPGMAEHEEFDDHPEFLEFLPKARALQPEKPLVYFGTTLFKVGG*
Ga0184634_102809851 | 3300018031 | Groundwater Sediment | MSQTNAIILRFREDEAQRFEELFEAEILPMWVQFKAQGKFLGASLTPVEGGSEVKEGVRDYILHVEVPSMAEHQEFDSSARFLAFLEKAKPMQPEEPKVWLGTTRFQV
Ga0184634_103791391 | 3300018031 | Groundwater Sediment | MSQTNAIILRFRERDAERFEKLFEAEILPLRKQFQAQGKFIAASLTPVVDGSEIKKDVRDYILHVEVPSMAEHEEFDSSAAFLSFLAKAKLMQPE
Ga0184640_100628633 | 3300018074 | Groundwater Sediment | MSQTNAIILRFREVDAERFEKLFEAEILPLWGQFKAQGKFIAASLTPVDDGSEIKKGVKDYILHVEVPSMVEHEEFDSSAPFLTFLAKAKLMQPEEPKIWLGTTRFQV
Ga0184639_104915582 | 3300018082 | Groundwater Sediment | MSQTNAIILRFREGDAERFEKLFEAEILPLWGQFKAQGKFISASLTPVDDGSEIKKGVKDYILHVEVPSMAEHEEFDSSAPFLAFLKKAKMMQPEEPKIWLGTTRFQV
Ga0066667_101154222 | 3300018433 | Grasslands Soil | MFEAEVLPLWNQFLAQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHEEFDDHPEFVEFLPKGRALQPEKPLVYFGTTLFKVGG
(restricted) Ga0255056_102051992 | 3300024521 | Seawater | MSQTNAIIVQFREDDTDHFEKLFEEEILPMWKELKAQGKFLSASLTPVEDGSEVKKGVRDYILHVEVPSMAEHEEFDSSAPFLAFLGKVKPLQPEEPKVWLGTTRFQV
Ga0209001_10790341 | 3300025002 | Soil | MSQTNAIILRFREDETQRFEQLFEAEILPMWVQFKAQGKFLGASLTPVEGGSEVKEGVRDYILHVEVPSMAEHQEFDSSAQFLAFLDKAKPMQPEEPKVWLGTTRFQV
Ga0209431_100367062 | 3300025313 | Soil | MSQTNAIILTFREDKAQRFEKLFEAEILPMWQQFKAQGKFLAASLTPVEGGSEVKEGVRDYILHVEVPSMAQHEEFDSSPGFLDFLAKAKPMQPDDPKVWLGTTRFQV
Ga0209431_101856841 | 3300025313 | Soil | MSQTNAIILRFREDEAQRFEELFETEILPMWQEFKAQGKFLAASLTPVEDGSEMKEGVRDYILHVEVPSMAEHKEFDSSARFLAFLAKAKPMQPEEPKVWLGTTRFQV
Ga0209431_105235681 | 3300025313 | Soil | MSQTNAIILRFREDEAQRFEELFEAEVLPMWEQFKAQGKFLSASLTPVEGGSEMKEGVRDYILHVEVPSMAEHEEFDSTAPFLAFLAKAKPMQPEDPKVWLGTTRFQV
Ga0209323_105173391 | 3300025314 | Soil | MSQTNAIILRFRQEEAQGFEEMFELEILPMWQEFKAQGKFLAASLTPVEGGSEKKEGMRDYILHVEVPSMAEHEEFDSSARFLAFLAKAKPMQPEDPKVWLGTTRFQV
Ga0209641_102547182 | 3300025322 | Soil | MSQTNAIILRVRQEEAQGFEEMFELEILPMWQEFKAQGKFLAASLTPVEGGSETKEGMRDYILHVEVPSMAEHEEFDSSARFLAFLAKAKPMQPEDPKVWLGTTRFQV
Ga0209342_103000503 | 3300025326 | Soil | MSQTNAIILRFREDEAQRFEELFEAEVLPMWERFKKEGKFLAASLTPVEGGSERKEGARDYILHVEVPGMEQHEEFDSSAAFKAFLAKATPMQPEEPKVWFGTTRFQV
Ga0209751_112616701 | 3300025327 | Soil | ILRFRQEEAQGFEEMFEAEVLPMWQEFKAQGKFLAASLTPVEGGSETKEGMRDYILHVEVPSMADHEEFDSSARFLAFLAKAKPMQPEEPKVWLGTTQFQV
Ga0209235_122086413300026296Grasslands SoilMTQTIAIILKFRESEAGRFEEMFEAEVLPLWNQFLPQGKFIEASLSPVEGGDEEREGIRRYILHVEVPGMAEHEEFDDHPEFVEFLPKARALQPEKPLVYFGTTLFKVGG
Ga0307305_1039692023300028807SoilVSQTIAIILRFREDEADSFESMFEEEIMPLWHQFARDGKFLAASLTPVEGGGEMKEGTRAYILHVEVPGMAEHEEFDSHPRFLDFLPKAKALQPQEPLVWFGTTR
Ga0307312_1011396213300028828SoilVSQTIAIILRFREDEADSFESMFEEEIMPLWHQFARDGKFLAASLTPVEGGGEMKEGTRAYILHVEVPGMAEHEEFDSHPRFLDFLPKAKALQPEEPLVWFGT
Ga0307278_1010896713300028878SoilAGAFEDMFRAEILPLWEAFLAEGKFIGASLTPIEGGMPTAVGQRHYILHVEVPGMAEHEEFDEHPRFLEFLPRAKALQPEEPLVWFGTTSFKVGG
Ga0307278_1015354923300028878SoilMTQTIAIILRFSEGQAEDFERMFEAEVMPLWQEFARDGKFLSASLTPVEGGSETKDGLRDYILHVEVPRMAEHHEFDSHPKFLDFLPRAKALQPEEPLVWFGTTRYSVP
Ga0307308_1027095123300028884SoilVSQTIAIILRFREDEADSFESMFEEEIMPLWHQFARDGKFLAASLTPVEGGGEMKEGTRAYILHVEVPGMAEHEEFDSHPRFLDFLPKAKALQPQEPLVWFGTTRFHVP
Ga0214473_1004706143300031949SoilMSQTNAIILRFRQEEAQGFEEMFELEILPMWQEFKAQGKFLAASLTPVEGGSETEEGMRDYILHVEVPSMAEHEEFDSSARFLAFLAKAKPMQPEDPKVWLGTTRFQV
Ga0214473_1013264823300031949SoilMSQTNAIILRFRQEEAQGFEEMFELEILPMWQEFKAQGKFLAASLTPVEGGSETKEGMRDYILHVEVPSMAEHEEFDSSARFLAFLAKAKPMQPEDPKVWLGTTRFQV
Ga0214473_1099738923300031949SoilMSQTNAIILTFREDEAQRFEKLFEAEILPMWQQFKAQGKFLAASLTPVEGGSEVKEGVRDYILHVEVPSMAQHEEFDSSPGFLDFLAKAKPMQPDDPKVWLGTTRFQV
Ga0214473_1175434813300031949SoilMSQTNAIILRFREDEAQRFEELFEAEILPMWVQFKAQGKFLGASLTPVEGGSEEKEGVRDYILHVEVPSMAEHQEFDSSARFLAFLEKAKPLQPEEPKVWLGTTRFQV
Ga0214473_1209550313300031949SoilAQMSQTNAIILTFRKDEAQRFEKLFEAEILPMWQQFKAQGKFLAASLTPVEGGSEVKEGVRDYILHVEVPSMAQHEEFDSSPGFLDFLAKAKPMQPDDPKVWLGTTRFQV
Ga0373913_0053284_507_8333300034089Sediment SlurryMSQTVAIILRFRKDEVQRFEQLFEAEILPMWEQYKAQGKFLSASLTPVEDGSEVKEGVRDYILHVEVPSMAEHQEFDSSAPFLAFLQKAKPMQPEEPKVWLGNTLFQV
Ga0373914_0141350_58_3843300034419Sediment SlurryMSQTVAIILRFRKDEVQRFEQLFEAEILPMWEQYKAQGKFLSASLTPVEDGSELKEGVRDYILHVEVPSMAEHQEFDSSAPFLAFLQKAKPMQPEEPKVWLGNTLFQV
Ga0373917_0013206_14_3403300034692Sediment SlurryMSQTVAIILRFRKDEVRRFEQLFEAEILPMWEQYKAQGKFLSASLTPVEDGSELKEGIRDYILHVEVPSMAEHQEFDSSAPFLAFLQKAKPMQPEEPKVWLGNTLFQV




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice