NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F058317

Metagenome / Metatranscriptome Family F058317

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F058317
Family Type Metagenome / Metatranscriptome
Number of Sequences 135
Average Sequence Length 208 residues
Representative Sequence AQLLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNEVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVK
Number of Associated Samples 112
Number of Associated Scaffolds 135

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 1.50 %
% of genes near scaffold ends (potentially truncated) 93.33 %
% of genes from short scaffolds (< 2000 bps) 93.33 %
Associated GOLD sequencing projects 102
AlphaFold2 3D model prediction Yes
3D model pTM-score0.66

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (98.519 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(28.889 % of family members)
Environment Ontology (ENVO) Unclassified
(28.148 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(54.074 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 4.56%    β-sheet: 48.13%    Coil/Unstructured: 47.30%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.66
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 135 Family Scaffolds
PF08308PEGA 1.48
PF08541ACP_syn_III_C 1.48
PF01571GCV_T 1.48
PF08545ACP_syn_III 1.48
PF02604PhdYeFM_antitox 0.74
PF12844HTH_19 0.74
PF07238PilZ 0.74
PF02954HTH_8 0.74
PF00977His_biosynth 0.74
PF02517Rce1-like 0.74
PF00990GGDEF 0.74
PF13590DUF4136 0.74
PF01850PIN 0.74
PF02518HATPase_c 0.74

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 135 Family Scaffolds
COG1266Membrane protease YdiL, CAAX protease familyPosttranslational modification, protein turnover, chaperones [O] 0.74
COG2161Antitoxin component YafN of the YafNO toxin-antitoxin module, PHD/YefM familyDefense mechanisms [V] 0.74
COG4118Antitoxin component of toxin-antitoxin stability system, DNA-binding transcriptional repressorDefense mechanisms [V] 0.74
COG4449Predicted protease, Abi (CAAX) familyGeneral function prediction only [R] 0.74


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms98.52 %
UnclassifiedrootN/A1.48 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002245|JGIcombinedJ26739_101209440All Organisms → cellular organisms → Bacteria645Open in IMG/M
3300002908|JGI25382J43887_10339545All Organisms → cellular organisms → Bacteria640Open in IMG/M
3300004101|Ga0058896_1136270All Organisms → cellular organisms → Bacteria637Open in IMG/M
3300004152|Ga0062386_100841792All Organisms → cellular organisms → Bacteria757Open in IMG/M
3300004633|Ga0066395_10348075All Organisms → cellular organisms → Bacteria823Open in IMG/M
3300005166|Ga0066674_10352254All Organisms → cellular organisms → Bacteria691Open in IMG/M
3300005167|Ga0066672_10207211All Organisms → cellular organisms → Bacteria → Acidobacteria1251Open in IMG/M
3300005177|Ga0066690_10451064All Organisms → cellular organisms → Bacteria870Open in IMG/M
3300005179|Ga0066684_10430778All Organisms → cellular organisms → Bacteria887Open in IMG/M
3300005186|Ga0066676_11039758All Organisms → cellular organisms → Bacteria543Open in IMG/M
3300005445|Ga0070708_101530337All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Nocardiaceae → Nocardia → Nocardia jiangxiensis621Open in IMG/M
3300005541|Ga0070733_11213700All Organisms → cellular organisms → Bacteria505Open in IMG/M
3300005545|Ga0070695_101063082All Organisms → cellular organisms → Bacteria661Open in IMG/M
3300005556|Ga0066707_10940482All Organisms → cellular organisms → Bacteria529Open in IMG/M
3300005557|Ga0066704_10608882All Organisms → cellular organisms → Bacteria701Open in IMG/M
3300005557|Ga0066704_10748800All Organisms → cellular organisms → Bacteria611Open in IMG/M
3300005568|Ga0066703_10184503All Organisms → cellular organisms → Bacteria → Acidobacteria1262Open in IMG/M
3300005568|Ga0066703_10799322All Organisms → cellular organisms → Bacteria539Open in IMG/M
3300005764|Ga0066903_102367186All Organisms → cellular organisms → Bacteria1027Open in IMG/M
3300005993|Ga0080027_10322475All Organisms → cellular organisms → Bacteria616Open in IMG/M
3300005995|Ga0066790_10331632All Organisms → cellular organisms → Bacteria649Open in IMG/M
3300006176|Ga0070765_100270303All Organisms → cellular organisms → Bacteria1564Open in IMG/M
3300006176|Ga0070765_101403284All Organisms → cellular organisms → Bacteria658Open in IMG/M
3300006796|Ga0066665_10337802All Organisms → cellular organisms → Bacteria1224Open in IMG/M
3300006797|Ga0066659_10202681All Organisms → cellular organisms → Bacteria1453Open in IMG/M
3300006806|Ga0079220_10115064All Organisms → cellular organisms → Bacteria1423Open in IMG/M
3300006854|Ga0075425_102219025All Organisms → cellular organisms → Bacteria611Open in IMG/M
3300006903|Ga0075426_10105104All Organisms → cellular organisms → Bacteria2026Open in IMG/M
3300006954|Ga0079219_10796462All Organisms → cellular organisms → Bacteria741Open in IMG/M
3300007258|Ga0099793_10069273All Organisms → cellular organisms → Bacteria1591Open in IMG/M
3300007265|Ga0099794_10123920All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → unclassified Terriglobales → Acidobacteriales bacterium 59-551302Open in IMG/M
3300009012|Ga0066710_101627078All Organisms → cellular organisms → Bacteria → Acidobacteria988Open in IMG/M
3300009012|Ga0066710_103727580All Organisms → cellular organisms → Bacteria573Open in IMG/M
3300009038|Ga0099829_10058945All Organisms → cellular organisms → Bacteria → Acidobacteria2872Open in IMG/M
3300009038|Ga0099829_10155237All Organisms → cellular organisms → Bacteria1829Open in IMG/M
3300009038|Ga0099829_10704811All Organisms → cellular organisms → Bacteria839Open in IMG/M
3300009088|Ga0099830_10046272All Organisms → cellular organisms → Bacteria3045Open in IMG/M
3300009088|Ga0099830_10385958All Organisms → cellular organisms → Bacteria → Acidobacteria1131Open in IMG/M
3300009090|Ga0099827_11591233All Organisms → cellular organisms → Bacteria569Open in IMG/M
3300009143|Ga0099792_10435482All Organisms → cellular organisms → Bacteria809Open in IMG/M
3300009792|Ga0126374_11048918All Organisms → cellular organisms → Bacteria643Open in IMG/M
3300010341|Ga0074045_10007108All Organisms → cellular organisms → Bacteria9576Open in IMG/M
3300010360|Ga0126372_11333270All Organisms → cellular organisms → Bacteria748Open in IMG/M
3300010376|Ga0126381_101258810All Organisms → cellular organisms → Bacteria1069Open in IMG/M
3300010396|Ga0134126_12444323All Organisms → cellular organisms → Bacteria568Open in IMG/M
3300010398|Ga0126383_11950284All Organisms → cellular organisms → Bacteria675Open in IMG/M
3300011120|Ga0150983_13093738All Organisms → cellular organisms → Bacteria1360Open in IMG/M
3300011120|Ga0150983_16322720All Organisms → cellular organisms → Bacteria645Open in IMG/M
3300011269|Ga0137392_11610488All Organisms → cellular organisms → Bacteria507Open in IMG/M
3300011270|Ga0137391_11312021All Organisms → cellular organisms → Bacteria571Open in IMG/M
3300011271|Ga0137393_10147040All Organisms → cellular organisms → Bacteria1964Open in IMG/M
3300011271|Ga0137393_10336521All Organisms → cellular organisms → Bacteria → Acidobacteria1288Open in IMG/M
3300012096|Ga0137389_11208656All Organisms → cellular organisms → Bacteria647Open in IMG/M
3300012189|Ga0137388_11062094All Organisms → cellular organisms → Bacteria746Open in IMG/M
3300012189|Ga0137388_11990318All Organisms → cellular organisms → Bacteria509Open in IMG/M
3300012201|Ga0137365_10378002All Organisms → cellular organisms → Bacteria1045Open in IMG/M
3300012201|Ga0137365_11320901All Organisms → cellular organisms → Bacteria511Open in IMG/M
3300012202|Ga0137363_10755370All Organisms → cellular organisms → Bacteria824Open in IMG/M
3300012203|Ga0137399_10203338All Organisms → cellular organisms → Bacteria1608Open in IMG/M
3300012205|Ga0137362_10011714All Organisms → cellular organisms → Bacteria → Proteobacteria6447Open in IMG/M
3300012207|Ga0137381_10273080All Organisms → cellular organisms → Bacteria1470Open in IMG/M
3300012211|Ga0137377_11613847All Organisms → cellular organisms → Bacteria572Open in IMG/M
3300012211|Ga0137377_11805184All Organisms → cellular organisms → Bacteria531Open in IMG/M
3300012351|Ga0137386_10429293All Organisms → cellular organisms → Bacteria952Open in IMG/M
3300012351|Ga0137386_11012307All Organisms → cellular organisms → Bacteria591Open in IMG/M
3300012357|Ga0137384_10989314All Organisms → cellular organisms → Bacteria676Open in IMG/M
3300012361|Ga0137360_10854728All Organisms → cellular organisms → Bacteria784Open in IMG/M
3300012923|Ga0137359_10937739All Organisms → cellular organisms → Bacteria745Open in IMG/M
3300012927|Ga0137416_10219766All Organisms → cellular organisms → Bacteria1536Open in IMG/M
3300012927|Ga0137416_10965306All Organisms → cellular organisms → Bacteria760Open in IMG/M
3300012948|Ga0126375_10993447All Organisms → cellular organisms → Bacteria683Open in IMG/M
3300012972|Ga0134077_10227811All Organisms → cellular organisms → Bacteria766Open in IMG/M
3300014165|Ga0181523_10794506All Organisms → cellular organisms → Bacteria516Open in IMG/M
3300014199|Ga0181535_10322628All Organisms → cellular organisms → Bacteria917Open in IMG/M
3300015241|Ga0137418_10016530All Organisms → cellular organisms → Bacteria6820Open in IMG/M
3300017823|Ga0187818_10214924All Organisms → cellular organisms → Bacteria839Open in IMG/M
3300017934|Ga0187803_10158120All Organisms → cellular organisms → Bacteria892Open in IMG/M
3300017972|Ga0187781_10530369All Organisms → cellular organisms → Bacteria846Open in IMG/M
3300017995|Ga0187816_10325529All Organisms → cellular organisms → Bacteria677Open in IMG/M
3300017995|Ga0187816_10353630All Organisms → cellular organisms → Bacteria649Open in IMG/M
3300018006|Ga0187804_10514453All Organisms → cellular organisms → Bacteria539Open in IMG/M
3300018012|Ga0187810_10160161All Organisms → cellular organisms → Bacteria906Open in IMG/M
3300018034|Ga0187863_10773566All Organisms → cellular organisms → Bacteria544Open in IMG/M
3300018040|Ga0187862_10162818All Organisms → cellular organisms → Bacteria1494Open in IMG/M
3300018482|Ga0066669_10114480All Organisms → cellular organisms → Bacteria → Acidobacteria1906Open in IMG/M
3300019788|Ga0182028_1035721All Organisms → cellular organisms → Bacteria1205Open in IMG/M
3300019788|Ga0182028_1074814All Organisms → cellular organisms → Bacteria1206Open in IMG/M
3300019788|Ga0182028_1121679All Organisms → cellular organisms → Bacteria827Open in IMG/M
3300019788|Ga0182028_1263067All Organisms → cellular organisms → Bacteria1212Open in IMG/M
3300020580|Ga0210403_11331330All Organisms → cellular organisms → Bacteria547Open in IMG/M
3300020581|Ga0210399_11226713All Organisms → cellular organisms → Bacteria594Open in IMG/M
3300020581|Ga0210399_11345353All Organisms → cellular organisms → Bacteria560Open in IMG/M
3300020583|Ga0210401_10606551All Organisms → cellular organisms → Bacteria956Open in IMG/M
3300021088|Ga0210404_10177624All Organisms → cellular organisms → Bacteria1131Open in IMG/M
3300021170|Ga0210400_10799359All Organisms → cellular organisms → Bacteria773Open in IMG/M
3300021170|Ga0210400_11015237All Organisms → cellular organisms → Bacteria674Open in IMG/M
3300021420|Ga0210394_10713998All Organisms → cellular organisms → Bacteria878Open in IMG/M
3300021432|Ga0210384_11482064All Organisms → cellular organisms → Bacteria584Open in IMG/M
3300021478|Ga0210402_10796249All Organisms → cellular organisms → Bacteria870Open in IMG/M
3300021479|Ga0210410_10556437All Organisms → cellular organisms → Bacteria1021Open in IMG/M
3300021560|Ga0126371_11582495All Organisms → cellular organisms → Bacteria782Open in IMG/M
3300026301|Ga0209238_1067530All Organisms → cellular organisms → Bacteria → Acidobacteria1261Open in IMG/M
3300026320|Ga0209131_1163339All Organisms → cellular organisms → Bacteria1106Open in IMG/M
3300026329|Ga0209375_1295873All Organisms → cellular organisms → Bacteria529Open in IMG/M
3300026335|Ga0209804_1250116All Organisms → cellular organisms → Bacteria663Open in IMG/M
3300026342|Ga0209057_1135926All Organisms → cellular organisms → Bacteria861Open in IMG/M
3300026474|Ga0247846_1088067All Organisms → cellular organisms → Bacteria545Open in IMG/M
3300026537|Ga0209157_1093272All Organisms → cellular organisms → Bacteria → Acidobacteria1447Open in IMG/M
3300026548|Ga0209161_10315912All Organisms → cellular organisms → Bacteria732Open in IMG/M
3300026551|Ga0209648_10052232All Organisms → cellular organisms → Bacteria → Acidobacteria3496Open in IMG/M
3300026551|Ga0209648_10280608All Organisms → cellular organisms → Bacteria1209Open in IMG/M
3300027502|Ga0209622_1100587All Organisms → cellular organisms → Bacteria530Open in IMG/M
3300027775|Ga0209177_10246059All Organisms → cellular organisms → Bacteria658Open in IMG/M
3300027825|Ga0209039_10064266All Organisms → cellular organisms → Bacteria1638Open in IMG/M
3300027846|Ga0209180_10279426All Organisms → cellular organisms → Bacteria959Open in IMG/M
3300027862|Ga0209701_10133906All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1525Open in IMG/M
3300027862|Ga0209701_10579007All Organisms → cellular organisms → Bacteria599Open in IMG/M
3300027867|Ga0209167_10709583All Organisms → cellular organisms → Bacteria549Open in IMG/M
3300027875|Ga0209283_10628609All Organisms → cellular organisms → Bacteria678Open in IMG/M
3300027882|Ga0209590_10381665All Organisms → cellular organisms → Bacteria910Open in IMG/M
3300028047|Ga0209526_10199226All Organisms → cellular organisms → Bacteria1387Open in IMG/M
3300028536|Ga0137415_10445069All Organisms → cellular organisms → Bacteria → Acidobacteria1101Open in IMG/M
3300028906|Ga0308309_10823537All Organisms → cellular organisms → Bacteria806Open in IMG/M
3300029636|Ga0222749_10339909All Organisms → cellular organisms → Bacteria784Open in IMG/M
3300030596|Ga0210278_1104415All Organisms → cellular organisms → Bacteria642Open in IMG/M
3300031231|Ga0170824_126451351All Organisms → cellular organisms → Bacteria511Open in IMG/M
3300031474|Ga0170818_106403176All Organisms → cellular organisms → Bacteria511Open in IMG/M
3300031962|Ga0307479_10269786All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1681Open in IMG/M
3300031962|Ga0307479_11415347All Organisms → cellular organisms → Bacteria654Open in IMG/M
3300032205|Ga0307472_101468577All Organisms → cellular organisms → Bacteria664Open in IMG/M
3300032828|Ga0335080_11191564All Organisms → cellular organisms → Bacteria766Open in IMG/M
3300033806|Ga0314865_029460All Organisms → cellular organisms → Bacteria1394Open in IMG/M
3300033808|Ga0314867_038630All Organisms → cellular organisms → Bacteria1122Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil28.89%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil12.59%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil9.63%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.93%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil5.19%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment4.44%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil4.44%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.22%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.22%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil2.22%
FenEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Fen2.96%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Peatland1.48%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil1.48%
BogEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog1.48%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.48%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.48%
PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland1.48%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.48%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.48%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.48%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.74%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil0.74%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.74%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.74%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil0.74%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Soil0.74%
Prmafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Prmafrost Soil0.74%
SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Soil0.74%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300004101Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF228 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004152Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3EnvironmentalOpen in IMG/M
3300004633Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBioEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005993Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 1 DNA2013-046EnvironmentalOpen in IMG/M
3300005995Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-050EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010341Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM2EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014165Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin05_30_metaGEnvironmentalOpen in IMG/M
3300014199Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin17_30_metaGEnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017823Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3EnvironmentalOpen in IMG/M
3300017934Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_3EnvironmentalOpen in IMG/M
3300017972Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP02_20_MGEnvironmentalOpen in IMG/M
3300017995Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_1EnvironmentalOpen in IMG/M
3300018006Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4EnvironmentalOpen in IMG/M
3300018012Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_5EnvironmentalOpen in IMG/M
3300018034Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_11_10EnvironmentalOpen in IMG/M
3300018040Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_10_150EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019788Permafrost microbial communities from Stordalen Mire, Sweden - 712E1D metaG (PacBio error correction)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026474Peat soil microbial communities from Stordalen Mire, Sweden - P.F.S.T0EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027034Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027502Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027825Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027867Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030596Metatranscriptome of forest soil microbial communities from Boreal Montmorency Forest, Quebec, Canada - FO135-VCO085SO (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031474Fir Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032828Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4EnvironmentalOpen in IMG/M
3300033806Tropical peat soil microbial communities from peatlands in Loreto, Peru - MAQ_50_20EnvironmentalOpen in IMG/M
3300033808Tropical peat soil microbial communities from peatlands in Loreto, Peru - MAQ_100_20EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_10120944013300002245Forest SoilYRKYIPTSASVLQQRIEEPPEVKMLRSKAQVLADSMRNFIAVQTIVWGAGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHQAPDVVVDERRMKVFQYWAGAEDGVCSWTDINDFVAFSLSHDFTVACYGEVWTDENTNILRISEHYELTGKWKNFQGVVTYGWLQREGEPRLIPLTISTQ
JGI25382J43887_1033954513300002908Grasslands SoilAQLLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNEVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVK
Ga0058896_113627013300004101Forest SoilEPSALAKYEIRVIDGVQRFRSYPDGKEELQNVPSPRLNDSIRPGGEWSELPEMVGTELRLRIHQAADVVINDRRMKVFQYWANIEDSVCKFETISDFVFYEVSKVDIVACYGEVWTDEDTNILRISEHYELPGKWKHYQGVVTYGWLQRKDETPRLIPLTIYTQVEHRKAYWCRGQFTDYQVFDSRVRIIPKQDRAQPDVAARN*
Ga0062386_10084179213300004152Bog Forest SoilSAYEVQVLDGYQRFRKYPDGKKHYEDLPLPPLNTVIGTGGEWAELPKMVGTELGLRIQQAPDVVVNGRRMKVFQYRADVEDGVCTFRSIFDFELFVINKDVTVPCYGEVWTDEEANIVRMSLHLENRGWWKHYQSVVTYGWLQRRGEIPRLIPLTISTQAERGKRVYWCRGQFVNYQVFRSQVVKMVTK*
Ga0066395_1034807513300004633Tropical Forest SoilPASVSIQYRPDEPAEIKALRKKAQRQADGMRNLVAVQTFAWGSGTNDVPLAVAEYEVKVLDGFQRFREYPDGKKELQDVPFPTVDTVVVPGGEWSELPQMVGTALHLKIHQAADSVVNGRSIRVFQYLAETEDDICMFKSVLDLLFGDRSKTVSAPCYGEVWTDDGFNILRISLHLELPGKWRDYESVVTYGWLQRTDEAPQLVPLTISTQAKFKKKLYWCRGLFTNYHIFTSRAQLIAYEYVKGSHP*
Ga0066674_1035225413300005166SoilYQRFREYPDGKKEFQDVPLPPLKHVVGTGGEWSELPNMVGTELGLTVHQAADVVVNDRRMKVFQYRADIEDGVCSFKSTSDFVFMEINKVFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQRFDSEVRIIANGVQSLPPR*
Ga0066672_1020721113300005167SoilFAPSPDHNSKFVTTHADGLQHRPEEPYAVRVLRKKAQLLADSMRNFIAVQSFSWGSGDKEEPAAEGAYEVRVLDGYQRFREYPDGKKEFQDVPLPPLRHVVGTGGEWSELPNMVGTELGLTVHQAADVVVNDRRMKVFQYRADIEDGVCSFKSTSDFVFLEINKVFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQRFDSEVRIIANGANR*
Ga0066690_1045106413300005177SoilYPKFAPSPDHNSKFVTTHADGLQHRPEEPYAVRVLRKKAQLLADSMRNFIAVQSFSWGSGDKEAPAAEGAYEVRVLGGYQRFREYPDGKKEFQDVPLPPLRHVVGTGGEWSELPNMVGTELGLIVHQAADVVVNDRRMKVFQYRADIEDGVCNFKSIRDFVFLEINKVFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQRFDSEVRIIANGANR*
Ga0066684_1043077813300005179SoilLADSMRNFIAVQSFSWGSGDKEEPAAEGAYEVRVLDGYQRFREYPDGKKEFQDVPLPPLRHVVGTGGEWSELPNMVGTELGLIVHQAADVVVNDRRMKVFQYRADIEDGVCNFKSIRDFVFLEINKIFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQRFDSEVRIIANDVQRLSPR*
Ga0066676_1103975813300005186SoilVQSFAWGSGDKEPAAQAEYEVRVIDGLQRFREYPDGKKELQDVPFPPVNEVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSILDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNK
Ga0070708_10153033713300005445Corn, Switchgrass And Miscanthus RhizosphereDVVQHRPEEPPEVKELRNKSQLLADSMRNFIAVQAFAWGSGNKAPSAVSAYEVRVVDGFQRFREYPDGKKELQNVPFPPLNNVIVPGGEWSELPQMVGTELRLKIHQAADVVVNERRVKVFQYQADPEDGVCRWKSNFDFGFFEVNKIVNVSCYGEVWTDEDSNILRMSEHYELPGKWKDYQGVVTYGWLQRTDETPRLIPLTIST
Ga0070733_1121370013300005541Surface SoilSFAWGSGNKAPAAEAAYEVRVLDGYQRFREYPDGKKELEDAPFPSLNNAVSPGSEWSELPQMVGTDLRLKIRQAPDVTVNEQRLKVFQYRADVEDGVCQFKSISDFVFFATSKIFTIACFGEVWTDEETNILRMSRHFKLYGRWKDDGTVVTYGWLRQADEAPKLIP
Ga0070695_10106308213300005545Corn, Switchgrass And Miscanthus RhizosphereQRFRKYPDGKKELQDVPAPPLSTSLVPGGEWSELPAMVGTELGLKIQQAVSVVINKRRIKVFQYRADSEDGLCRFKSVLDYMFFAANKIVTVGCYGEVWTDEDTNILRMSEHYELPGRWKNYQSVVTYGWLHRKDEAPKLVPLTISTQAEFKKKFYWCRGQFTDYRVFDSKVRILSGEPRYGDQAKAEALKKKVK*
Ga0066707_1094048213300005556SoilEEPAAEGAYEVRVLDGYQRFRQYPDGKKEFQDVPLPPLNHVVGTGGEWSELPNMVGTELGLTVHQAADVVVNDRRMKVFQYRADIEDGVCSFKSTSDFVFMEINKVFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWC
Ga0066704_1060888213300005557SoilEEPYEVSVLRKKAQLLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGLQRFREYPDGKKELQDVPFPPVNEVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVKIVANDYVQ
Ga0066704_1074880013300005557SoilADSMRNFIAVQSFSWGSGDKEEPAAEGAYEVRVLDGYQRFRQYPDGKKEFQDVPLPPLNHVVGTGGEWSELPNMVGTELGLTVHQAADVVVNDRRMKVFQYRADIEDGVCSFKSTSDFVFLEINKVFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWFQKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQ
Ga0066703_1018450333300005568SoilFREYPDGKKEFQDVPLPPLRHVVGTGGEWSELPNMVGTELGLIVHQAADVVVNDRRMKVFQYRADIEDGVCNFKSIRDFVFLEINKIFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQRFDSEVRVIANGMQSLPPR*
Ga0066703_1079932213300005568SoilMRNFIAVQTFAWGSGDKEPVAEATYEVRIVDGYQRFREYPDGRKEFRDVPFPSLSTMIVPGGEWSELPEMVGTRLRLKIHQAPDVVVNERWMKVFQYRADVEDDVCTFKSVLDYVFFEVSKIVAVSCYGEVWTDEDTNILRMSEHYELPGKWKDYQAVVTYGWLQRADQMSQLIPLTIS
Ga0066903_10236718613300005764Tropical Forest SoilVAAYEVKVLEGFQRFREYPDGKKELQDVPFPPVDTVVVPGGEWSELPQMVGTALHLKIHQAADSVVNGRSIRVFQYLAETEDDICMFKSVLDLLFGDRSKTVSAPCYGEVWTDEGFNILRISLHLELPGKWRDYESVVTYGWLQRTDEAPQLVPLTISTQAKFKKKLYWCRGLFTNYHIFTSRAQLIASEYVQGSHP*
Ga0080027_1032247513300005993Prmafrost SoilPVAEAAYEVRVLDGNQRFRSYPEGKKELVDVPYPDLNRVMVPGGEWSELPEMVGTELHLKIRQASDVVVNERRMKVFQYWAAVEDGVCRWTDVTDLLFLTINKDFGVDCYGEVWTDEDTNILRISEHYELPGKWKDFQGVVTYGWLQRAGEPWLIPLTVSTQAEFHRKVYWCRGQFMDYRMFGARARIVENRSAMSPPRLSQPVE
Ga0066790_1033163213300005995SoilEEPPEVKMLRSKAQVLADSMRNFIAVQTFAWGRGENEPSAEAAYEVQVLDGYQRFRAYPNGKKELRDIPFPPLNPVMRPGGEWSELPGMVGTELRLTIRQAPDVVVEDRRMKVFQYWAGVEDAVCSWTDITDLIFLTLDHTHTVVCYGEVWTDKDTNILRISEHYELPGKWKSFQGVVTYGRLQRAGEPRLIPLSFSTQAEFHKKIYWCRGQFMDY
Ga0070765_10027030313300006176SoilLAQSIFPCTQGISAVSADLYRKYIPASASVLQQRIEEPPEVKMLRSKAQVLADSMRNFIAVQTIVWGAGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHQAPDVVVDERRMKVFQYWAGAEDGVCSWTDINDFVAFSLSHDFTVACYGEVWTDENTNILRISEHYELTGKWKNFQGVVTYGWLQREGEPRLIPLTISTQAESHKKVYWCRGQFMNYQVFSSRVRILTDN*
Ga0070765_10140328413300006176SoilTKVEPHLAQSIFPCTQGISAVSADLYRKYVPTSVNALQQRMEESPEVKMLRSKAQVLADSMRNFIAVQTIVWGAGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHEAPDVVVDQRRMKVFQYWAGAEDAVCSWTDINDFIAFSLSHDFTVACYGEVWTDENTNIMRISEHYELTGKWKNFQGV
Ga0066665_1033780233300006796SoilRRADGLQHRPEEPYEVSVLRKKAQLLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNEVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSILDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVKIVANDYVQSVPP*
Ga0066659_1020268113300006797SoilSMRNFIAVQSFSWGSGDKEEPAAEGAYEVRVLDGYQRFREYPDGKKEFQDVPLPPLRHVVGTGGEWSELPNMVGTELGLTVHQAADVVVNDRRMKVFQYRADIEDGVCNFKSIRDFVFLEINKVFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQRFDSEVRIIANGANR*
Ga0079220_1011506423300006806Agricultural SoilRAQTLADSMRNFVAVQTYSWGSRNNSPVAMAEYEVQVLEGFQRFREYPDGKKELQSVPFPPVSTMVVPGGEWSELPQMVGSELHLRIHQAADTVVNGRHVKVFQYAADVEDGVCVFKSVRDYGFFETSKVVTIPCHGEVWADEYLDILRISQHLELPGKWQDYQSVVNYGWIRLVDNTPRLVPLTISTQAEFKNKAYWCRGLFTDYRMFSSRTQIMSAANYNVQSLPP*
Ga0075425_10221902513300006854Populus RhizospherePTTKEQVISPGLADFYPKFQWPTASGSLQHRPEETLEIKTLRSRAQTLADSMRNFVAVQTYSWGSRNNPPVAMAEYEVQVLEGFQRFREYPDGKKELQSVPFPPVNTMVVPGGEWSELPQMVGSELHLKIHQAADTVVNGRHVKVFQYAADVEDGVCVFKSVRDYGFFETSKVVTIPCHGEVWADEYLDILRISQHLELPGKW
Ga0075426_1010510413300006903Populus RhizosphereWPTASGSLQHRPEETLEIKTLRSRAQTLADSMRNFVAVQTYSWGSRNNPPVAMAEYEVQVLEGFQRFREYPDGKKELQSVPFPPVSTMVVPGGEWSELPQMVGSELHLKIHQAADTVVNGRHVKVFQYAADVEDGVCVFKSVRDYGFFETSKVVTIPCHGEVWADEKLDILRISQHLELPGKWQDYHSVVNYGWIRLADNTPRLVPLTISTQAEFKNKAYWCRGLFTDYRMFSSRTQIMSAANYNVQSLPP*
Ga0079219_1079646213300006954Agricultural SoilISPGLADFYPKFQWPTASGSLQHRPEETLEIKTLRSRAQTLADSMRNFVAVQTYSWGSRNNSPVAMAEYEVQVLEGFQRFREYPDGKKELQSVPFPPVSTMVVPGGEWSELPQMVGSELHLRIHQAADTVVNGRHVKVFQYAADVEDGVCVFKSVRDYGFFETSKVVTIPCHGEVWADEYLDILRISQHLELPGKWQDYQSVVNYGWIRLVDNTPRLVPLTISTQAEFKNKAYWCRGLFTDYRMFS
Ga0099793_1006927313300007258Vadose Zone SoilDVKLLRRKAQLLADSMRNFIAVQTFVWGSGNKEPAAASAYEVQILDGNQRFREYPDGKKELQNVVLPPLNTVMAPGGEWSELPEMVGTKLGLKIHQGADVVANERRMKVFQYWADPEDEVCKWRTVVGFGFFSINRDVTVACYGEVWTDEETNILRMSEHYELSGKWREFQGVVTYGWLQRPDETPRLIPLTISTQAEFNKKVYWCRGQFVNYKIFGSRVKILAN*
Ga0099794_1012392023300007265Vadose Zone SoilQHRPEEPPEVKVLRIKAQLLADSMRNFIAVQSYEWGSGDKEPAAEAEYEVRVIDDYQHYREYPEGKKELEGVPFPPLNDVIRSGGEWSELPEMVGTELRLKIHQAADVVVNEQRMKVFQYWADIEDGVCRFQSISDFGFFVVNKIDIVACYGEVWTDTDTNILRMSEHYELPGKWKHYQGVVTYGWLRRKDETPRLVPLTIYTQAEHNKKVYWCRGLFTDYQVFDSRVRIIANLPGPN*
Ga0066710_10162707823300009012Grasslands SoilGYQRFREYPDGKKEFQDVPLPPLRHVVGTGGEWSELPNMVGTELGLIVHQAADVVVNDRRMKVFQYRADIEDGVCNFKSIRDFVFLEINKIFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQRFDSEVRVIANGMQSLPPR
Ga0066710_10372758013300009012Grasslands SoilLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPLVNEVVNPGGEWSELSQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFGFFAVNKIVSVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKK
Ga0099829_1005894533300009038Vadose Zone SoilMRNFIAVQSFAWGSGDKEPAAQAQYEVRVIDGVQRFREYPDGKRELQDVPFPPVNAVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDLGFFAVNKIVTVPCYGEVWTDEDISIFRISEHLDLSGKWRDYQSIVTYGWLRRTDEAPRLIPLTISTQAEHNKKVYWCRGAFTNYRIFSSRTKIVANDYVQSVPP*
Ga0099829_1015523743300009038Vadose Zone SoilDHNPKFIPVHADGLQHRPEEPVEVKLLREKAQLLADSIRNFIAVQSFEWGSGDKEPAALAAYEVRVIDGYQRFREYPDGKKDFQDLPLPSLHHIVGTGGEWSELPWMVGTALGLRIQQAADVFVNKRRIKVFQYQADAEDGVCRFAIISDFVFFEGSRTFTVGCYGEVWTDEDTNILRISEHYELPGRWKHYQGVVTYGWLQKDETPRLIPLTIYTQAERNKKVYWCRGQFTDYQVFDSRVKVIANDYVQSLPP*
Ga0099829_1070481123300009038Vadose Zone SoilMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGLQRFREYPVGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVKIVANDYVQSVPP*
Ga0099830_1004627263300009088Vadose Zone SoilEVRVIDGYQRFREYPEGKEEFQDLPLPSLNNVVGTGGEWSELPEMVGTKLALKIRQAADVVVNERRVKVFQYRADIEDGVCIFKSILDLGFFAVSKIHTVACYGEVWTDEDTNILRMSEHYELPGKWKNYQGVVTYGWLQRKDEPPRVIPLTIYTQAERNGRVYWCRGQFTDYQIFSSRVKIIAN*
Ga0099830_1038595823300009088Vadose Zone SoilAVQSFAWGSGDKEPAAQAEYEVRVIDGLQRFREYPVGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVKIVANDYVQSVPP*
Ga0099827_1159123313300009090Vadose Zone SoilPEEPYEVKVLRKKAQLLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDE
Ga0099792_1043548213300009143Vadose Zone SoilGLQHRPEEPYEVSVLRKKAQLLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNAVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFAFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLWRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRTKIVAND*
Ga0126374_1104891813300009792Tropical Forest SoilEVPLAVAEYEVRVLDGFQRFREYPDGKKELQDVPFPTVDTVVVPGGEWSELPQMVGTALHLKIHQAADSVVNGRSIRVFQYLAESEDDICMFKSVLDLLFGDRSKTLSAPCYGEVWTDDGFNILRISLHLELPGKWRDYESVVTYGWLQRTDEAPQLVPLTISTQAKFKKKLYWCRGLFTNYHIFTSRAQLIAYEYVQGSHP*
Ga0074045_10007108123300010341Bog Forest SoilFAWGSGDKAPAAETAYEVRVLDGYQRFREYPDGKKELRDVPFPALNTVMVPGGEWSELPEMVGTELRLKIHRADDVLVNGQRMKVFQYWADREDAVCRWKSNLDLGFFAISKIATVACYGEVWTDEDTNILRMSEHYELPGKWKDYRAVVTYGWLRKATETPRLIPLTISSQAQYNKKVYWCRGQFTDYQVFSSRVKMAAN*
Ga0126372_1133327013300010360Tropical Forest SoilMRNFIAVQTFAWGAEDKESAAVAAYEVRIVDGYQRFREYPDGNQEFRDVPFPPLNTMIVPGGEWSELPEIVGTRLGLRIHQAPDAVVNERRMKVFQYRADMEDDLCKFKSVQDYVLGERSKTVTVACYGEVWTDEDTNIVRMSEHYELPGKWKDYQAV
Ga0126381_10125881013300010376Tropical Forest SoilIKVLREKAQRQADGMRNLVAVQTLAWGSGTNNVPVALAAYEVKVLDGFQRFREYPDGKKELQDVPFPTVDTVVVPGGEWSELPQMVGTALHLKIHQAADSVVNGRSIRVFQYLAESEDDICMFKSVLDLLFGDRSKTVSAPCYGEVWTDEGFNILRISLHLELPGKWRDYESVVTYGWLQRTDEAPQLVPLTISTQAKFKKKLYWCRGLFTNYHIFTSRAQLIAYEYVQGSHP*
Ga0134126_1244432313300010396Terrestrial SoilRNFIAVQSFAWGSGAEAPAAESAYEVQVIDGYQRFREYPDGKKELQQVRFPALTSVMAPGDEWSELPKMVGTELRLRIHEADDVVLKKRRIRVFQYEADREDSVCQFRSISDAVFFSVSKLLTVACYGEVWTDEDMNILRMSEHYELPGKWKDYQGVVTYGWLKRNEGSPRLIPLTISTQAEHDKRIYW
Ga0126383_1195028413300010398Tropical Forest SoilLEGFQRFREYPDGKKELQSVPFPAVNTVVVPGGEWSELPQMVGAQLNLKIHQAADKDLNGRQVKVFQYSADVEDGACVFKSIRDYGFFEASKTVTIPCHGEVWTDENMDILRISQHLELPGKWQDYRSVVNYGRVRLTDNTSRLVPLTISTQAEFKNKTYWCRGLFTNYHLFNSQAKIVTAANYNIQSLPGTK*
Ga0150983_1309373823300011120Forest SoilQVLDGYQRFRKYPDGKQELQEVPLPSLKNAITPRGEWSELPEMVGTKLRLKIQQSAEAVVNGRRIKVFQYRADPEDALCTFESILDFEFFAVSQIANVGCYGEVWTDENTNILRMSEHFEISGRWRDYQVVVTYGWLRRTDEAAKLVPLTISTQANLNRRVYWCRGQFTDYQVVASRVKILAGKLNP*
Ga0150983_1632272013300011120Forest SoilKEPSALAKYEIRVIDGVQRFRSYPDGKEELQNVPSPRLNDSIRPGGEWSELPEMVGTELRLRIHQAADVVINDRRMKVFQYWANIEDSVCKFETISDFVFYEVSKVDIVACYGEVWTDEDTNILRISEHYELPGKWKHYQGVVTYGWLQRKDETPRLIPLTIYTQVEHRKAYWCRGQFTDYQVFDSRVRIIPKQDRAQPDVAARN*
Ga0137392_1161048813300011269Vadose Zone SoilEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENN
Ga0137391_1131202113300011270Vadose Zone SoilVQSFAWGSGDKEPAAQAEYQVQVIDGVQRFREYPDGKKELQDVPFPPLNTVMVPGGEWSELSEMVGTELRLKIHQAADVVVNERRMKVFQYRADLEDGLCRFKSVSDFVFFAVNKIVTVGCYGEVWTDEDTNILRMSEHYELPGKWKDYQGVVTYGWLQRKNETPRLIPLTIYTQAEINRKVYWCRGVFT
Ga0137393_1014704033300011271Vadose Zone SoilLYPKFVPGLADGLQYRPEEPYEVSVLRKKAQLLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNEVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFAFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEAPRLIPLTISTQAEHNKKVYWCRGQFTDYQIFSSRVKIVAN*
Ga0137393_1033652123300011271Vadose Zone SoilMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGLQRFREYPVGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRQTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVKIVANDYVQSAPP*
Ga0137389_1120865613300012096Vadose Zone SoilAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRQTDEVHRLITLTISTQAENNKKVHWCRGVFTNYRIFSSRTKIVEND*
Ga0137388_1106209413300012189Vadose Zone SoilMRNFVAVQTFAWGSRDNVPVAVAEYEVQVLDGFQRFREYPDGKKELQDVPFPPVNTVVNPGGEWSELPQMVGTTLRLKIHQAADAVVNGRRIKVFQYVADAEDGVCVFKSVRDFGFFEASKVVTVACYGEVWTDENFNILRMSQHLELPGKWRNYHSVATYGWHRRTDEAPRRAPLSISTQAEFNKK
Ga0137388_1199031813300012189Vadose Zone SoilFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPR
Ga0137365_1037800223300012201Vadose Zone SoilVQRFREYPDGKKELQDVPFPPLNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDRRMKVFQYQADPEDGVCIFKSVLDFGFFAVNKIAAVACYGEVWTDEDTNILRMSEHYELPGKWKHYQGVVTYGWLQRMDEIPRLIPLTISTQAEHNKKIYWCRGRFTDYQIFSSRVKIVAN*
Ga0137365_1132090113300012201Vadose Zone SoilPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPLVNEVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLEISGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKV
Ga0137363_1075537013300012202Vadose Zone SoilEYEVRVLDGYQRFRAYPAGKKELKDIPFPPLNTAMVPGGEWSELPEMVGTELRLNIHQADDVVVNGQRIKVFQYWADPEDNVCRWKSISDFGFFAIGKIVTIACHGEVWTDEDINILRMSEHYDLPGKWRAYQSVVTYGWLHRMDDIPRLIPLTISTQAEYNKKVYWCRGQFGNYQMFSSQAKIAAK*
Ga0137399_1020333813300012203Vadose Zone SoilKPPAIESEYEVRVLDGYQRFREYPDGKKELKDIPFPPLNTAMVPGGEWSELPEMVGTELRLNIHQADDVVVKGQRIKVFQYWADPEDNVCRWKSVSDFGFFAINKIVTIACHGEVWTDEDINILRMSEHYDLPGKWRAYQSVVTYGWLHRMDDIPRLIPLTISTQAEYNKKVYWCRGQFGNYQMFSSRAKIAAK*
Ga0137362_1001171413300012205Vadose Zone SoilVRVIDGVQRFREYPDGKKELQDVPFPPLNNVIVPGGEWSELPQMVGTELQLKIRQAADVVVNDRRMKVFQYQADPEDGVCIFKSVLDFGFFAVNKIAAVACYGEVWTDEDTNILRMSEHYELPGKWKHYQGVVTYGWLQRMDEPPRLIPLTISTQAEHNKKIYWCRGRFTDYQIFSSRVKIVAN*
Ga0137381_1027308023300012207Vadose Zone SoilLADLYPKFVEPPVADSLQHRSDEPAEVKALREKAQLLADSMRNFVAVQTLAWGGRDNVPVAVAAYEVQVLDGYQRFREYPDGKKELPDGHLPFPPLSMSVVPGSEWSELPNMVGTALRLKIHQFPDTVINERRIKIFQYRAESEDGICSFISVLNFGFFAVNKTVTVSCYGEVWTDEDTNILRISEYLGLPSKWGNYQALVTYGWLRRTDEAPRLIPLTFISRQVEKNGKSYWCRGLFTNYRIFDSRVKIIANDHVQSLPR*
Ga0137377_1161384713300012211Vadose Zone SoilDSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGLQRFREYPDGKKELQDVPFPPVNEVVNPGGEWSDLPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLEISGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKV
Ga0137377_1180518413300012211Vadose Zone SoilMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNAVVNPGGEWSELPQMVGSALRLKIHQAPDTVVNERVFQYRAESEDGACIFKSVLDFGLFAVNKIVTVPCYGEVWTDEDMNIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTIST
Ga0137386_1042929313300012351Vadose Zone SoilGIISPALADLYPKFVEPPVTDSLQHRSDEPAEVKALREKAQLLADSMRNFVAVQTLAWGGRDNVPVAVAAYEAQVLDGYQRFREYPDGKKELPDGHLPFPPLSMSVVPGSEWSELPNMVGTALRLKIHQFRDTVINERRIKILQYRAESEDGIRSFISVLNFGFFAVNKTVTVSCYGEVWTDEDTNILRISEYLGLPSKWGNYQALVTYGWLRRTDEAPRLIPLTFISRQVEKNGKSYWCRGLFTNYRIFDSRVKIIANDHVQSLPR*
Ga0137386_1101230713300012351Vadose Zone SoilRPEEPYEVSVLRKKAQLLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNAVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIP
Ga0137384_1098931413300012357Vadose Zone SoilVLRKKAQLLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGLQRFREYPDGKKELQDVPFPPVNEVVNPGGEWSDLPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVKIVANDYV
Ga0137360_1085472813300012361Vadose Zone SoilTFAWGSGNKPPAIESEYEVRVLDGYQRFRAYPDGKKELKDIPFPPLNTAMVPGAEWSELPEMVGTELRLNIHQADDVVVNGQRIKVFQYWADPEDNVCRWKSVSDFGFFAINKIVTIACHGEVWTDEDINILRMSEHYDLPGKWRAYQSVVTYGWLHRMDDIPRLIPLTISTQAEYNKKVYWCRGQFGNYQMFSSQAKIAAK*
Ga0137359_1093773913300012923Vadose Zone SoilMRDFIAVQTFAWGSGNNPPAIESEYEVRVLDGYQRFREYPDGKKELKDIPFPPLNTAMVPGGEWSELPEMVGTELRLNIHQADDVVVKGQRIKVFQYWADPEDNVCRWKSVSDFGFFAINKIVTIACHGEVWTDEDINILRMSENYDLPGKWRAYQSVVTYGWLHRMDDIPRLIPLTISTQAEYNKKVYWCRGQFGNYQMFSSQAKIAAK*
Ga0137416_1021976613300012927Vadose Zone SoilPTIESEYEVRVLDGYQRFRAYPDGKKDLKDIPFPPLNTAMVPGGEWSELPEMVGTELRLNIHQADDVVVNGQRIKVFQYWADPEDNVCRWKSVSDFGFFAINKIVTIACHGEVWTDEDINILRMSEHYDLPGKWRAYQSVVTYGWLHRMDDIPRLIPLTISTQAEYNKKVYWCRGQFGNYQMFSSQAKIAAK*
Ga0137416_1096530613300012927Vadose Zone SoilEEPYEVRVLRKKAQLLADSMRNFIAVQSFAWGSGDREEPAAQEAYEVRVLDGYQRFREYPDGKKEFQDVPLPDLKHVVGTGGEWSELPSMVGTELGLIVHQAADVVVNDRRMKVFQYRADIEDGVCIFKSILDLGFFAVSKIHTVACYGEVWTDEDTNILRMSEHYELPGKWKNYQGVVTYGWLQRKDEPPRVIPLTIYTQAERNGRVYWCRGQFTDYQIFSSRVKIIAN*
Ga0126375_1099344713300012948Tropical Forest SoilFAWGSGTNDVPLAVAEYEVKVLDGFQRFREYPDGKKELQDVPFPTVDTVVVPGGEWSELPQMVGTALHLKIHQAADSVVNGRSIRVFQYLAESEDDICMFKSVLDLLFGDRSKTVSAPCYGEVWTDEGFNILRISLHLELPGKWRDYESVVTYGWLQRTDEAPQLVPLTISTQAKFKKKLYWCRGLFTNYHIFTSRAQLIAYEYVQGSHP*
Ga0134077_1022781113300012972Grasslands SoilVPVAVAAYEVQVLDGYQRFRDGKKEFQDVPFPPVNTVVNPGGEWSELLQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVKIVANDYVQRVPP*
Ga0181523_1079450613300014165BogQTFARGSGDRAPVAESAYEVRVLDGYQRFRKYPDGKKELQNAPLPPLNRVMGTGGEWSELPQMVGTELRLEIHQAPDVIVNKRRMKVYQYWAGTEDGVCRFKSIFDFVFFAINKVATIPCYGEVWTDEDINILRMSEHYQLPGRWKHYQAVVTYGWLQRTGEGPRRIPLTI
Ga0181535_1032262813300014199BogLFPFTKGISPVSADLYPKFVPPPPSGVLEQRPEEPAEVTVLRSKAQLLADSMRNFIAVQTFAWGSGDKVPTMESAYEVRVLDGFQRFREYPDGKKELKDVPFPPLNTVMVPGGEWSELPELVGTDLRLKIHQAADAEVNGQRMKVFQYWAGTEDGVCRWKSNLDLGFFAISKIATVACYGEVWTDEDTNILRMSEHYELPGKWRDYRAVVTYGWLRKATEAPRLIPLTISSQAQYNKKVYWCRGQFTNYQVFSSAVKITADRI*
Ga0137418_1001653013300015241Vadose Zone SoilLAQSIFPFSKAISPVSADIYSKFVASQVEGLQQRPEESAEVKGLRNKSQLLADSMRDFIAVQTFAWGSGDKPPAIESEYEVRVLDGYQRFRQYPDGKKELKDIPFPPLNTAMVPGGEWSELPEMVGTELRLNIHQADDVVVKGQRIKVFQYWADPEDNVCRWKSVSDFGFFAINKIVTIACHGEVWTDEDINILRMSEHYDLPGKWRAYQSVVTYGWLHRMDDIPRLIPLTISTQAEYNKKVYWCRGQFGNYQMFSSQAKIAAK*
Ga0187818_1021492413300017823Freshwater SedimentYPKFVPSTTDVLQHRPDEPAEVKVLRSKAQLLADSMRNFIAVQTFAWGSGDKMPAAVSAYEVRVLDGFQRFRKYPDGKKELQNVPFPPLNNAIVTGGEWSELPAMVGTELRLKIHQADDMVVNERRMKVFQYRADVEDGVCTWKSIFDFGLFEVNKTVTVSCYGEVWTDEDTNILRMSEHYELPGKWKDFNAVVTYGWLHRTDETPPRLIPLTISAQAEYNKKVYWCRGVFMNYQIFGSRVKIVSN
Ga0187803_1015812013300017934Freshwater SedimentAVQTFAWGSGNKAPVVESAYEVQVLDGYQRFRKYPDGKKELQDVPLPPLNTVMTSGGEWSELPQMVGTKPRLKIHQAPDAVVNDRRMKVFQYRADAEDDVCRFRSIFDFVFFKINKDVTLACYGEVWTDEEANILRISEHFEHLGWWKNYQAVVTYGWLQRSDDTPRLIPLTISAQAERGKKVYWCRGRFINYRTFTIQVKMTPK
Ga0187781_1053036913300017972Tropical PeatlandRNFIAVQTFAWGSGDKAPAAEAAYEVRVLNGYQRFREYPDGKKELQDVRLPPLNTVMAPGGEWSELPQMVGTKPRLKIQQAPDVVVDGRRMKVFQYRADVEDGLCTFRSIFDFDFFVISKDATVACYGEVWTDEETNILRISLHLEKYGWWKHYQAVVSYGWLRLKDGTPWLVPLTISAQAERGKRVYWCRGRFVNYQVFSSRVKMAAN
Ga0187816_1032552913300017995Freshwater SedimentTKEISPVSADFYPKFVPPSPTGVIQHRPKEPAEVNLLREKAQLLADSIRNFIAVETFAWGSGNKEPAAVAEYEVRVLDGYQRFRAYPDGKKELQDVPLPPLNTAMSSGGEWSELPQMVGTELRLKIHQAADVVVNGRRIKVFQYRADPEDGVCRFKSIVDFGFFEVNKIATVACYGEVWTDEQTDIVRISRHYELPGKWKDYQAVVTYGWLERTDETPRLVPLTI
Ga0187816_1035363013300017995Freshwater SedimentFQRFRKYPDGKKELQNVPFPPLNNAIVTGGEWSELPAMVGTELRLKIHQADDMVVNERRMKVFQYRADVEDGVCTWKSIFDFGFFEVNKTVTVSCYGEVWTDEDTNILRMSEHYELPGKWKDFNAVVTYGWLHRTDETPPRLIPLTISAQAEYNKKVYWCRGLFMNYQIFGSRVKIIAN
Ga0187804_1051445313300018006Freshwater SedimentGFQRFREYPDGRKELQNLPLPPLNNVIATGGEWSELPEMVGTRLGLKIHQAPDGVVNDRRMKVFQYQADAEDGVCRFESTLDFMLFAVSKIYTVACYGEVWTDEDSNILRMSEHLELPGRWKNYQGVVTYGWLHRAGESPRLIPLTISTQAELNKKVYWCRGRFMNYQVFSSRVKMGAD
Ga0187810_1016016113300018012Freshwater SedimentPSTTDVLQHRPDEPAEVKVLRSKAQLLADSMRNFIAVQTFAWGSGDKMPAAVSAYEVRVLDGFQRFRKYPDGKKELQNVPFPPLNNAIVTGGEWSELPKMVGTELRLKIHQADDMVVNERRMKVFQYRADVEDGVCTWKSIFDFGFFEVNKTVTVSCYGEVWTDEDTNILRMSEHYELPGKWKDFNAVVTYGWLHRTDETPPRLIPLTISAQAEYNKKVYWCRGLFMNYQIFGSRVKIMW
Ga0187863_1077356613300018034PeatlandGSGNNPPSAESAYEVRVLDGFQRFREYPNGTKEFQDVPFPRLNTALVPGGEWSQLPEMVGTELRLKIRQATDVVLRDKRIKVFQYRAEAEDNICKWKSSRDFGFFAVNKIVSVACHGEVWTDESGNILRMSERYELPGKWKEYQTVVTYGWLRRSDDSPRLIPLTITTQAEYNKKLYWCH
Ga0187862_1016281833300018040PeatlandEVRVLDGYQRFRKYPDGKKELQNAPLPPLNRVMGTGGEWSELPQMVGTELRLEIHQAPDVIVNKRRMKVYQYWAGTEDGVCRFKSIFDFVFFAINKVATIPCYGEVWTDEDINILRMSEHYQLPGRWKHYQAVVTYGWLQRTGEGPRRIPLTISSQAEFDKRIYWCRGSFTNYQVFSSQVKMAGN
Ga0066669_1011448013300018482Grasslands SoilKKAQLLADSMRNFIAVQSFSWGSGDKEEPAAEGAYEVRVLDGYQRFREYPDGKKEFQDVPLPPLSHVVGTGGEWSELPNMVGTELGLTVHQAADVVVNDRRMKVFQYRADIEDGVCSFKSTSDFVFMEINKVFTVACYGEVWTDEDTNILRISEHYELPGKWRNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQRFDSEVRIIANGANR
Ga0182028_103572113300019788FenLRSKAQLLADSMRNFIAVQTFAWRSGDKSPAAVSAYEVRVLDGYQRFRKYPNGKKELQDVPLPPLNTVMASGGEWSELPQMVGTELRLKIHQAADVVVNERRMKVFQYRADPEDGLCRWKSIFDFVFFAVNKIVTVACYGEVWTDEDTNILRMSEHYELPGKWKDFQGVVTYGWLQRATETPRLVPLTISTQAEYNKKVYWCRGRFMDYQVFSSRVKMAAN
Ga0182028_107481413300019788FenAKLNFWPIACATLSRCRLLPGDRETSRPLLCRRMKSYEVRVLDGYQRFRKYPNGKKELQDVPLPPLNTVMASGGEWSELPQMVGTELRLKIHQAADVVVNERRMKVFQYRADPEDGLCRWKSIFDFVFFAVNKIVTVACYGEVWTDEDTNILRMSEHYELPGKWKDFQGVVTYGWLQRATETPRLVPLTISTQAEYNKKVYWCRGRFMDYQVFSSRVKMAAN
Ga0182028_112167913300019788FenFAWGSGDKSPAAVSAYEVRVLDGYQRFRKYPNGKKELQDVPLPPLNTVMASGGEWSELPQMVGTELRLKIHQAADVVVNERRMKVFQYRADPEDGLCRWKSIFDFVFFAVNKIVTVACYGEVWTDEDTNILRMSEHYELPGKWKDFQGVVTYGWLQRATETPRLVPLTISTQAEYNKKVYWCRGRFMDYQVFSSRVKMAAN
Ga0182028_126306713300019788FenVLRSKAQLLADSMRNFIAVQTFAWGSGDKSPAAVSAYEVRVLDGYQRFRKYPNGKKELQDVPLPPLNTVMASGGEWSELPQMVGTELRLKIHQAADVVVNERRMKVFQYRADPEDGLCRWKSIFDFVFFAVNKIVTVACYGEVWTDEDTNILRMSEHYELPGKWKDFQGVVTYGWLQRATETPRLVPLTISTQAEYNKKVYWCRGRFMDYQVFSSRVKMAAN
Ga0210403_1133133013300020580SoilGSGDKEPSAEASYEVRVIDGVQRFRNYPDGKEELQNVPFPRLNDSIRPGGEWSELPKMVGTELRLRIHQAADVVVNDRRMKVFQYWANIEDGVCRFESIFDFAFFSVSKVDIVDCYGEVWTDEDTNILRMSEHYELPGKWKHYQGVVTYGWLQQKDETPQLIPLTIYTQTEHRKVYWCRG
Ga0210399_1122671313300020581SoilEPPEVKILRSKAQVLADSMRNFIAVQTIVWGAGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHQAPDVVVDERRMKVFQYWAGAEDGVCSWTDINDFVAFSLSHDFTVACYGEVWTDENTNILRISEHYELTGKWKNFQGVVTYGWLQREGEPRLIPLTISTQ
Ga0210399_1134535313300020581SoilEIQVLGGYQRFREYPDGNKELKNIPFPPLNTAMVPGGEWSELPGMVGTELHLNIHQADDTVVNGQRIRVFQYWADPEDDVCRWKSVLDFGFFPINKIVTVACYGEVWTDEDSNILRISEHYELPGRWKDFHGVVTYGWLQRKNEIPLLVPLTIATQAEYKKKVYWCRGHFTDYQKFTSQTKMVAK
Ga0210401_1060655113300020583SoilADSMRNFIAVQTIVWGAGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHQAPDVVVDERRMKVFQYWAGAEDGVCSWTDINDFVAFSLSHDFTVACYGEVWTDENTNILRISEHYELTGKWKNFQGVVTYGWLQREGEPRLIPLTISTQAESHKKVYWCRGQFMNYQVFSSRVRMLTDN
Ga0210404_1017762423300021088SoilGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHQAPDVVVDERRMKVFQYWAGAEDGVCSWTDINDFVAFSLSHDFTVACYGEVWTDENTNILRISEHYELTGKWKNFQGVVTYGWLQREGEPRLIPLTISTQAESHKKVYWCRGQFMNYQVFSSRVRMLTDN
Ga0210400_1079935923300021170SoilSFAWGSGDNEPSAEAQFEVQVIDGYQRFREYPEGKKPFQDVPLPSLNNVIGPGGEWSELPEMVGTKLGLKIRQAADVVFNERRMKVFQYRADIEDGVCIFKSILDLVFFEVNKTVTVACYGEVWTDEDTNILRMSEHYELPGKWKNYQGVVTYGWRQLGDETLRLIPLTIYTQAEHNKKVYWCRGQFTDYQIFGSRVKIIAND
Ga0210400_1101523713300021170SoilKFEPHMAQSLFSSSQGISEVSADIYPKYVPPLANSLEHRTEEPPDVRMLRSKAQALTDSIRDFIAVETFVWGVADKDPSADAAYEVRVADGYQQFRAYPDGKKELSDVPLPSLNTLIRPGGEWAELPEMIGTQLRLKIRQVPDFVDKGWRMKIFQYWAGSEDDVCKWRDIADFVLFQLNKDFSVACYGEVWTDEDTNILRMSEHYELVGKWKEYSGVVTYGWRQ
Ga0210394_1071399813300021420SoilPHLAQSIFPCTQGISAVSADLYRKYIPASASVLQQRIEEPPEVKMLRSKAQVLADSMRNFIAVQTIVWGAGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHQAPDVVVDERRMKVFQYWAGAEDGVCSWTDINDFVAFSLSHDFTVACYGEVWTDENTNILRISEHYELTGKWKNFQGVVTYGWLQREGEPRLIPLTISTQAESHKKVYWCRGQFMNYQVFSSRVRMLTDN
Ga0210384_1148206413300021432SoilLLADSIRNFIAVQTFEWGSGDKEPSAEASYEVRVIDGVQKFRSYPDGKEELQNVPFPRLNDSIRPGGEWSELPAMVGTELRLRIHQAADLVVNDRRMKVFQYWANIEDGVCKFESISDFVFYEVSKVDIVDCYGEVWTDEDTNILRMSEHYELPGKWKHFQGVVTYGWLQQKDETPRLIPLTIYTQVEHKKAYW
Ga0210402_1079624913300021478SoilQRFRGYPNGKKDLKNIPFPPLNTAMVPGGEWSELPEMVGNELHLNIHQADDSFVNGRRIKVFQYWADPEDAVCKWKSVLDFGFFAVNKIATVSCYGEVWTDEDTNILRMSEHYELLGKWKDFQGVVTYGWLQRKDEIPLLVPLTIATQAEYNKKVYWCRGHFTDYQKFTSQTKIRPGDSALILKGTL
Ga0210410_1055643713300021479SoilKAQVLGDSMRNFIAVQTIVWGAGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHQAPDVVVDERRMKVFQYWAGAEDGVCSWTDINDFVAFSLSHDFTVACYGEVWTDENTNILRISEHYELTGKWKNFQGVVTYGWLQREGEPRLIPLTISTQAESHKKVYWCRGQFMNYQVFSSRVRMLTDN
Ga0126371_1158249513300021560Tropical Forest SoilANGMRNLVAVQTFAWGSGTNEVPLAVAEYEVRVLDGFQRFREYPDGKKELQDVPFPTVDTVVVPGGEWSELPQMVGTALHLKIHQAADSVVNGRPIKVFQYLAESEDDICMFKSVLDLLFGDRSKTVSAPCYGEVWTDEGFNILRISLHLELPGKWRDYESVVTYGWLQRTDEAPQLVPLTISTQAKFKKKLYWCRGLSTNYHIFTSRAQLIASEYVQGSHP
Ga0179589_1051656913300024288Vadose Zone SoilPEESAEVKGLRNKSQLLADSMRDFIAVQTFAWGSGNKPPAIESEYEVRVLDGYQRFRAYPDGKKELKDIPFPPLNTAMVPGAEWSELPEMVGTELRLNIHQADDVVVNGQRIKVFQYWADPEDNVCRWKSISDFGFFAIGKIVTIACHGEVWTDEDINILRMSEHYDLPGKWRAYQAVVTYGWL
Ga0209238_106753013300026301Grasslands SoilRPVPADFYPKFAPSPDHNSKFVTTHADGLQHRPEEPYAVRVLRKKAQLLADSMRNFIAVQSFSWGSGDKEAPAAEGAYEVRVLGGYQRFREYPDGKKEFQDVPLPPLRHVVGTGGEWSELPNMVGTELGLIVHQAADVVVNDRRMKVFQYRADIEDGVCNFKSIRDFVFLEINKIFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQRFDSEVRVIANGMQSLPPR
Ga0209131_116333923300026320Grasslands SoilHRPAELADVKLLRRKAQLLADSMRNFIAVQTFVWGSGNKEPAAASAYEVQILDGNQRFREYPDGKKELQNVVLPPLNTVMAPGGEWSELPEMVGTKLGLKIHQGADVVANERRMKVFQYWADPEDEVCKWRTVVGFGFFSINRDVTVACYGEVWTDEETNILRMSEHYELSGKWREFQGVVTYGWLQRPDETPRLIPLTISTQAEFNKKVYWCRGQFVNYKIFGSRVKILAN
Ga0209375_129587313300026329SoilGYQRFREYPDGKKEFQDVPLPPLRHVVGTGGEWSELPNMVGTELGLIVHQAADVVVNDRRMKVFQYRADIEDGVCNFKSIRDFVFLEINKIFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERKGRVYWCRGQFTNYQRFDSEVRV
Ga0209804_125011613300026335SoilSKFVTTHADGLQHRPEEPYAVRVLRKKAQLLADSMRNFIAVQSFSWGSGDKEAPAAEGAYEVRVLGGYQRFREYPDGKKEFQDVPLPPLNHVVGTGGEWSELPNMVGTELGLTVHQAADVVVNDRRMKVFQYRADIEDGVCSFKSTSDFVFLEINKVFTVACYGEVWTDEDTNILRISEHYELPGKWKNYQGVVTYGWLPKKDEPPRLIPLTIYTQAERK
Ga0209057_113592613300026342SoilVEVQTYSWGSRNNPPVAMAEYEVQVLEGFQRFREYPDGKKELQNVPFPPVNTMVVPGGEWSELPQMVGTDLHLKIHQARDSVVNGRQIKVFQYAVNAEDGVCIFKSVRDYGFFETSKVFTIPCYGEVWTDADVNILRISQHLELPGKWHNYLSVVNYGLLQFADNTPRLVPMTISTQAEFKDKIYWCRGLFTNYGMFSSRTKIMSATNYNIQSLPP
Ga0247846_108806713300026474SoilTFAWGSGDKSPAAVSAYEVRVLDGYQRFRKYPNGKKELQDVPLPPLNTVMASGGEWSELPQMVGTELRLKIHQAADVVVNERRMKVFQYRADPEDGLCRWKSIFDFVFFAVNKIVTVACYGEVWTDEDTNILRMSEHYELPGKWKDFQGVVTYGWLQRATETPRLVPLTISTQAEYNKKV
Ga0209157_109327223300026537SoilFIAVQSFAWGSGDKEPAAQAEYEVRVIDGLQRFREYPDGKKELQDVPFPPVNEVVNPGGEWSELPQMVGTALRLKIHQAADTVVNERRIKVFQYRAESEDGVCIFKSILDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVKIVANDYVQRVPP
Ga0209161_1031591213300026548SoilHVDGLQHRPDEPIEVKVLRKKAQLLADSMRNFIAVQSFSWGSGDKEEPAAEGAYEVRVLDGYQRFREYPDGKKEFQDVPLPPLRHVVGTGGEWSELPNMVGTELGLIVHQAADVVVNDRRMKVFQYRADIEDGVCNFKSIRHFVFLEINKVFTVACYGEVWTDEDTNILRMSEHYELPGKWKDYQAVVTYGWLQRADQMSQLIPLTISTQAGYNKKIYWCRGQFTDYRMFGTRVRIIGN
Ga0209648_1005223213300026551Grasslands SoilVKVLRKKAQLLADSMKNFIAVQTFEWGSGDKEPSAQAAYEVRVIDGDQRFREYPDGKSELEDVPFPRLNRAIRPGDEWSELPEMVGTELGLRIHQAADVVVNDQRMKVFQYWADIEDGVCRFQVISDLLFFEVNRIDNVACHGEVWTDKDTNILRMSEHLDLPGKWQAYQSVVTYGWLQLNETPRLIPLTIYTQAERNKKVYWCRGQFTDYRIFDSRVRIVAN
Ga0209648_1028060813300026551Grasslands SoilFEWGSGDKEPAALAAYEVRVIDGYQRFREYPDGKKDFQDLPLPSLHHIVGTGGEWSELPWMVGTALGLRIQQAADVFVNKRRIKVFQYQADAEDGVCRFAIISDFVFFEGSRTFTVGCYGEVWTDEDTNILRISEHYELPGRWKHYQGVVTYGWLQKDETPRLIPLTIYTQAERNKKVYWCRGQFTDYQVFDSRVKVIANDYVQSLPP
Ga0209730_103031013300027034Forest SoilSPDIYPKFVPPHASTLQQRPAEPAAVKALRSKSQLLADSLRNFMAVQTFAWGSRDSVPAAESAYEIQVLDGYQRFREYPGGTKQLKDVPFPPLNTVMVPGGEWSELPEMVGTELHLNIHQADEVVLDGRRIKVFQYWAKPEDSVCRWKSVQDFGFFAVGKIDTVACYGEVWTDEDTNILRMSEHYELPGKWKDYQAVVTYGW
Ga0209622_110058713300027502Forest SoilQTFAWGTGEKEPAALAAYEVQVRDGYQRFREYPDGHKELKDVPFPPLNDSVVPGGEWSELPQMVGTVLGLRIYQAPDVIVNERRIKVFQYQAEIEDGVCTFNSSYDFGYFAVNKIVTVACHGEIWTDEDTNILRISEHLELLGRWHEYQAIMTYGWLHRADGPPLLIPVTISTQAQ
Ga0209177_1024605913300027775Agricultural SoilISPGLADFYPKFQWPTASGSLQHRPEETLEIKTLRSRAQTLADSMRNFVAVQTYSWGSRNNSPVAMAEYEVQVLEGFQRFREYPDGKKELQSVPFPPVSTMVVPGGEWSELPQMVGSELHLRIHQAADTVVNGRHVKVFQYAADVEDGVCVFKSVRDYGFFETSKVVTIPCHGEVWADEYLDILRISQHLELPGKWQDYQSVVNYGWIRLVDNTPRLVP
Ga0209039_1006426623300027825Bog Forest SoilDGYQRFRKYPDGKKELQDVPLPPLNTVMAPGGEWSELPQMVGTKPRLKIQQAPEVVVDGRRMKVFQYRADVEDDLCWFRSIFDFVFFKINKDASLACYGEVWTDEETNILRISEHFEQLGWWKNYQAVVTYGWLQQRGETPRLIPLTIATQAERGKRVYWCRGQFKDYQVFSSRVKIVAHRD
Ga0209180_1027942613300027846Vadose Zone SoilRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGVQRFREYPDGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVKIVANDYVQSVPP
Ga0209701_1013390633300027862Vadose Zone SoilYEVRVIDGVQRFREYPDGKKELQDVPFPPLNTVMVPGGEWSELSEMVGTELRLKIHQAADVVVNERRMKVFQYRADLEDGLCRFKSVSDFVFFAVNKIVTVGCYGEVWTDEDTNILRMSEHYELPGKWKDYQGVVTYGWLQRKNETPRLIPLTIYTQAEINRKVYWCRGVFTNYRIFSSRVKIGGNDYVQSLPP
Ga0209701_1057900713300027862Vadose Zone SoilIAVQSFAWGSGDKEPAAQAEYEVRVIDGLQRFREYPVGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSS
Ga0209167_1070958313300027867Surface SoilAEAAYEVRVLDGYQRFREYPDGKKELEDAPFPSLNNAVSPGSEWSELPQMVGTDLRLKIRQAPDVTVNEQRLKVFQYRADVEDGVCQFKSISDFVFFATSKIFTIACFGEVWTDEETNILRMSRHFKLYGRWKDDGTVVTYGWLRQADEAPKLIPLTIWTQAEVKKQTYWCRGKFTDYHEFR
Ga0209283_1062860913300027875Vadose Zone SoilPKFVPGLGDGLQHRPEEPYEVKVLRKKAQLLADSMRNFIAVQSFAWGSGDKEPAAQAEYEVRVIDGLQRFREYPVGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWC
Ga0209590_1038166513300027882Vadose Zone SoilYEVRVIDGVQRFREYPDGKKELQDVPFPPVNNVIVPGGEWSELPQMVGTELRLKIRQAADVVVNDHRIKVFQYQADPEDGVCIFKSVLDFGFFAVNKIVTVPCYGEVWTDEDINIFRISEHLELSGKWRDYQSIVTYGWLRRTDEVPRLIPLTISTQAENNKKVHWCRGVFTNYRIFSSRVKIVANDYVQSVPP
Ga0209526_1019922623300028047Forest SoilLYRKYIPTSASVLQQRIEEPPEVKMLRSKAQVLADSMRNFIAVQTIVWGAGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHQAPDVVVDERRMKVFQYWAGAEDGVCSWTDINDFVAFSLSHDFTVACYGEVWTDENTNILRISEHYELTGKWKNFQGVVTYGWLQREGEPRLIPLTISTQAESHKKVYWCRGQFMNYQVFSSRVRILTDN
Ga0137415_1044506923300028536Vadose Zone SoilEEPYEVRVLRKKAQLLADSMRNFIAVQSFAWGSGDREEPAAQEAYEVRVLDGYQRFREYPDGKKEFQDVPLPDLKHVVGTGGEWSELPSMVGTELGLIVHQAADVVVNDRRMKVFQYRADIEDGVCIFKSILDLGFFAVSKIHTVACYGEVWTDEDTNILRMSEHYELPGKWKNYQGVVTYGWLQRKDEPPRVIPLTIYTQAERNGRVYWCRGQFTDYQIFSSRVKIIAN
Ga0308309_1082353713300028906SoilTKVEPHLAQSIFPCTQGISAVSADLYRKYVPTSVNALQQRMEESPEVKMLRSKAQVLADSMRNFIAVQTIVWGAGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHEAPDVVVDQRRMKVFQYWAGAEDAVCSWTDINDFIAFSLSHDFTVACYGEVWTDENTNIMRISEHYELTGKWKNFQGVVTYGWLQREGEPRLIPLTISTQAESHNKVYWCRGQFMNYQVFSSRVRILA
Ga0222749_1033990913300029636SoilVQTIVWGAGDKDPTAQAAYEVRVLDGYQRFRSYPDGKKELRDIPFPSLNPVMRPGGEWSELPGMVGTALRLKIHQAPDVVVDERRMKVFQYWAGAEDGVCSWTDINDFVAFSLSHDFTVACYGEVWTDENTNILRISEHYELTGKWKNFQGVVTYGWLQREGEPRLIPLTISTQAESHKKVYWCRGQFMNYQVFSSRVRMLTDN
Ga0210278_110441513300030596SoilLYPKFLPSHPANVLRQRPEEPAEVTLLRRKAQSLADSIRNFVAVQSFAWGKGDNDPSAVAAYEVQVLGGYQKFREYPDGKKEFADVPFPRLNTSLTPSGEWSELPAMVGTELDLKIQQAADVMVNERRIKVFQYRADAEDGVCKFRSMLDLVFFVSNKIFNVACYGEVWTDEDTNILRMSRHVETQGGWWKDYQTVVTYGWLRGEDQAPMRIP
Ga0170824_12645135113300031231Forest SoilPAAVSAYEVRVLYGQQRFRAYPDGKKELQDVPFPPLNTGMVPGGEWSELPGMVGTELRLKIHQAADVLVNERRMKVFQYWAGPEDGICRWKSNFDFGFFVVNKIVTVACYGEVWTDENANILRMSEHYELPGKWKDYQAVVTYGWLQRANESPRLIPLSFSTQAHFNKKL
Ga0170818_10640317613300031474Forest SoilPAAVSAYEVRVLYGQQRFRAYPDGKKELQDVPFPPLNTGMVPGGEWSELPGMVGTELRLKIHQAADVLVNERRMKVFQYWAGPEDVICRWKSNFDFGFFVVNKIVTVACYGEVWTDENANILRMSEHYELPGKWKDYQAVVTYGWLQRANESPRLIPLSFSTQAHFNKKL
Ga0307479_1026978633300031962Hardwood Forest SoilDGYQRFREYPGGTKQLKDVPFPPLNTVMVPGGEWSELPGMVGTELHLNIHQADDVVLDGRRIKVFQYWAKPEDSVCRWKSVQDFGFFAVGKIATVACYGEVWTDEDTNILRMSEHYELPGKWKDYQAVVTYGWLRRKGEIPLLIPVTIATQAEYNKKTHWCRGQFTDYREFSSQARMVAK
Ga0307479_1141534713300031962Hardwood Forest SoilWGSGDKEPAAVAAYEVQVLDGSQRFREYPDGKKEYQDVPFPPLGTLISTGGEWSELPWMVGTDLRLKVHQTADVVVNDRRMRVFQYQADAEDGVCTFKSVADFGFFTITKVGTVACYGEVWTDEDTNILRISEHLEYFKWWKDYRSVVTYGWLKRAGEPAWLVPLTIFTEASNKNRIYWCRGNFTDYRVFSVRARLLAN
Ga0307472_10146857713300032205Hardwood Forest SoilLLADSMRNFIAVRTFAWGSGDKRPAAVSAYEVRVLYGQQRFRAYPDGKKELQDVPFPPLNAGVVPGGEWSELPGMVGTELRLKIHQAADILVNERRMKVFQYWAGPEDGICRWKSNFDFGFFVVNKIVTVACYGEVWTDENANILRMSEHYELPGKWKDYQAVVTYGWLQRANESPRLIPLSFSTQAHFNKKLYGCRGRFTDYKVFSRHRPGQASLRAEWK
Ga0335080_1119156423300032828SoilQRFRAYPDGTKELQDVPFPPLNTVIVPGGEWSELPGMVGTELRLKIHQAADVMVNEKRVKVFQYRADPEDAVCRWKSNFDFGFFAINKVVTVGCYGEVWTDENTNILRMSEHYELPGRWKDYQAVVTYGWLKGSSEPARLIPLTISTQAQFDKKAYWCRGRFTDYQTFTSRVKMAAN
Ga0314865_029460_2_6103300033806PeatlandTVAWGSGNKAPAAEAAYEVRVLDGNQRFREYPNGVKEFQNLPFPRLDNAVNTGVEWAELPQMVGTKPRMRVHQAPDVVVDGRRMKVFQYRADVEDGVCNFRSIFDFDFFAIKKDVTVACYGEVWTDQETNILRISQHLENYGWWKHYQSVVTYGWLRMKDGTSRLVPLTIYTQAERGKRVYWCRGRFVNYQVFGSRVKMAAK
Ga0314867_038630_25_6933300033808PeatlandVLRSKAQLLADSIRNFIAVQTVAWGSGNKAPAAEAAYEVRVLDGNQRFREYPNGVKEFQNLPFPRLDNAVNTGVEWAELPQMVGTKPRMRVHQAPDVVVDGRRMKVFQYRADVEDGVCNFRSIFDFDFFAIKKDVTVACYGEVWTDQETNILRISQHLENYGWWKHYQSVVTYGWLRMKDGTSRLVPLTIYTQAERGKRVYWCRGRFVNYQVFGSRVKMAAK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.