NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F045233

Overview

Basic Information
Family ID: F045233
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 153
Average Sequence Length: 163 residues
Representative Sequence: MDLRCPKCNSNSLKKVSLAYQEGTYHIDTRSRMRGLLFAGGGPAILVGRTTTRGTQQSALSKRLWPPTKWSYLKLVGWSGVVTLIALVLYVQHVMSSPVPASSLPVKLYVIFAPVVLLLLVGIVWRHNHSTYERKYARWNESFICERCGTVSQQALR
Number of Associated Samples: 111
Number of Associated Scaffolds: 153

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 69.74 %
% of genes near scaffold ends (potentially truncated): 37.91 %
% of genes from short scaffolds (< 2000 bps): 82.35 %
Associated GOLD sequencing projects: 105
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.39

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model logo (Skylign)

Most Common Taxonomy
Group: Bacteria (99.346 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (42.484 % of family members)
Environment Ontology (ENVO): Unclassified (43.791 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (43.137 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 51.89%, β-sheet: 5.41%, Coil/Unstructured: 42.70%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.39

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
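
The confidence figures above can be turned into a small check. The following is a minimal sketch, assuming only the values shown on this page: the pTM cutoff of 0.7 and the four pLDDT bins (0-50, 51-70, 71-90, 91-100). The function names are hypothetical, and the handling of fractional scores between bin edges (for example 50.5) is an assumption, since the page only lists integer bounds.

```python
PTM_CUTOFF = 0.7  # threshold quoted on the page: pTM < 0.7 is "low confidence"


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff


def plddt_bin(score: float) -> str:
    """Map a per-residue pLDDT score to one of the page's four bins.

    Assumption: fractional scores between integer bin edges fall into
    the lower bin (e.g. 50.5 -> "0-50").
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score < 51:
        return "0-50"
    if score < 71:
        return "51-70"
    if score < 91:
        return "71-90"
    return "91-100"


# The family's reported pTM-score is 0.39, so the model is flagged.
print(is_low_confidence(0.39))
print(plddt_bin(63.2))
```

With the reported pTM-score of 0.39 the first call flags the model as low confidence, which matches the notice above.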



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 153 Family Scaffolds
PF01381 | HTH_3 | 16.34
PF12844 | HTH_19 | 4.58
PF08401 | ArdcN | 3.27
PF01904 | DUF72 | 1.96
PF05406 | WGR | 1.31
PF13589 | HATPase_c_3 | 0.65
PF04255 | DUF433 | 0.65
PF13091 | PLDc_2 | 0.65
PF01680 | SOR_SNZ | 0.65
PF12770 | CHAT | 0.65
PF13560 | HTH_31 | 0.65
PF08281 | Sigma70_r4_2 | 0.65
PF04956 | TrbC | 0.65
PF00078 | RVT_1 | 0.65
PF00136 | DNA_pol_B | 0.65
PF00692 | dUTPase | 0.65
PF03176 | MMPL | 0.65
PF13361 | UvrD_C | 0.65
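
Because the frequencies are percentages of the 153 family scaffolds, they can be converted back into approximate scaffold counts. The sketch below assumes the percentages were rounded to two decimals from whole-number counts; `percent_to_count` is a hypothetical helper, not part of NMPFamsDB.

```python
FAMILY_SCAFFOLDS = 153  # denominator stated in the table header


def percent_to_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a rounded percentage back to the nearest whole scaffold count."""
    return round(percent / 100 * total)


# A few rows copied from the Pfam table above.
frequencies = {
    "HTH_3": 16.34,
    "HTH_19": 4.58,
    "ArdcN": 3.27,
    "DUF72": 1.96,
    "WGR": 1.31,
    "CHAT": 0.65,
}

counts = {name: percent_to_count(pct) for name, pct in frequencies.items()}
print(counts)
# {'HTH_3': 25, 'HTH_19': 7, 'ArdcN': 5, 'DUF72': 3, 'WGR': 2, 'CHAT': 1}
```

Read this way, every 0.65 entry corresponds to a single scaffold, so most of the listed domains were seen next to only one family member.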

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 153 Family Scaffolds
COG4227 | Antirestriction protein ArdC | Replication, recombination and repair [L] | 3.27
COG1801 | Sugar isomerase-related protein YecE, UPF0759/DUF72 family | General function prediction only [R] | 1.96
COG3831 | WGR domain, predicted DNA-binding domain in MolR | Transcription [K] | 1.31
COG0214 | Pyridoxal 5'-phosphate synthase subunit PdxS | Coenzyme transport and metabolism [H] | 0.65
COG0417 | DNA polymerase B elongation subunit | Replication, recombination and repair [L] | 0.65
COG0717 | dCTP deaminase | Nucleotide transport and metabolism [F] | 0.65
COG0756 | dUTP pyrophosphatase (dUTPase) | Defense mechanisms [V] | 0.65
COG1033 | Predicted exporter protein, RND superfamily | General function prediction only [R] | 0.65
COG2409 | Predicted lipid transporter YdfJ, MMPL/SSD domain, RND superfamily | General function prediction only [R] | 0.65
COG2442 | Predicted antitoxin component of a toxin-antitoxin system, DUF433 family | Defense mechanisms [V] | 0.65
COG3838 | Type IV secretory pathway, VirB2 component (pilin) | Intracellular trafficking, secretion, and vesicular transport [U] | 0.65



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 99.35 %
Unclassified | root | N/A | 0.65 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300001154|JGI12636J13339_1014426 | All Organisms → cellular organisms → Bacteria | 1173
3300001593|JGI12635J15846_10315262 | All Organisms → cellular organisms → Bacteria | 972
3300004092|Ga0062389_101943725 | All Organisms → cellular organisms → Bacteria | 766
3300004631|Ga0058899_12128320 | All Organisms → cellular organisms → Bacteria | 596
3300005187|Ga0066675_10266991 | All Organisms → cellular organisms → Bacteria | 1228
3300005450|Ga0066682_10786467 | All Organisms → cellular organisms → Bacteria | 576
3300005537|Ga0070730_10007049 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 9540
3300005540|Ga0066697_10098190 | All Organisms → cellular organisms → Bacteria | 1705
3300005546|Ga0070696_100606157 | All Organisms → cellular organisms → Bacteria | 883
3300005556|Ga0066707_10290540 | All Organisms → cellular organisms → Bacteria | 1070
3300005559|Ga0066700_10700615 | All Organisms → cellular organisms → Bacteria | 695
3300005598|Ga0066706_10926459 | All Organisms → cellular organisms → Bacteria | 677
3300005602|Ga0070762_10166083 | All Organisms → cellular organisms → Bacteria | 1331
3300005602|Ga0070762_10248105 | All Organisms → cellular organisms → Bacteria | 1106
3300006050|Ga0075028_100853954 | All Organisms → cellular organisms → Bacteria | 558
3300006176|Ga0070765_100094165 | All Organisms → cellular organisms → Bacteria | 2574
3300006176|Ga0070765_100375246 | All Organisms → cellular organisms → Bacteria | 1325
3300006176|Ga0070765_100938882 | All Organisms → cellular organisms → Bacteria | 818
3300006796|Ga0066665_10767874 | All Organisms → cellular organisms → Bacteria | 762
3300006804|Ga0079221_10473747 | All Organisms → cellular organisms → Bacteria | 803
3300006806|Ga0079220_10757269 | All Organisms → cellular organisms → Bacteria | 726
3300007255|Ga0099791_10324767 | All Organisms → cellular organisms → Bacteria | 735
3300007265|Ga0099794_10160611 | All Organisms → cellular organisms → Bacteria | 1143
3300009038|Ga0099829_10105845 | All Organisms → cellular organisms → Bacteria | 2194
3300009038|Ga0099829_10995683 | All Organisms → cellular organisms → Bacteria | 695
3300009088|Ga0099830_10492309 | All Organisms → cellular organisms → Bacteria | 999
3300009088|Ga0099830_10602508 | All Organisms → cellular organisms → Bacteria | 901
3300009088|Ga0099830_10982547 | All Organisms → cellular organisms → Bacteria | 699
3300009088|Ga0099830_11616142 | All Organisms → cellular organisms → Bacteria | 540
3300009088|Ga0099830_11625769 | All Organisms → cellular organisms → Bacteria | 539
3300009090|Ga0099827_11528851 | All Organisms → cellular organisms → Bacteria | 581
3300009137|Ga0066709_101000636 | All Organisms → cellular organisms → Bacteria | 1224
3300010337|Ga0134062_10749338 | All Organisms → cellular organisms → Bacteria | 518
3300010361|Ga0126378_10779474 | All Organisms → cellular organisms → Bacteria | 1066
3300010373|Ga0134128_10019432 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 8059
3300010376|Ga0126381_102067923 | All Organisms → cellular organisms → Bacteria | 820
3300010401|Ga0134121_10758549 | All Organisms → cellular organisms → Bacteria | 927
3300010858|Ga0126345_1106372 | All Organisms → cellular organisms → Bacteria | 522
3300011120|Ga0150983_12202361 | All Organisms → cellular organisms → Bacteria | 786
3300011269|Ga0137392_10380401 | All Organisms → cellular organisms → Bacteria | 1171
3300011270|Ga0137391_10421386 | All Organisms → cellular organisms → Bacteria | 1139
3300011271|Ga0137393_10458679 | All Organisms → cellular organisms → Bacteria | 1093
3300011271|Ga0137393_10967611 | All Organisms → cellular organisms → Bacteria | 725
3300012096|Ga0137389_10626113 | All Organisms → cellular organisms → Bacteria | 925
3300012198|Ga0137364_10006262 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 6468
3300012199|Ga0137383_10203133 | All Organisms → cellular organisms → Bacteria | 1452
3300012199|Ga0137383_10785029 | All Organisms → cellular organisms → Bacteria | 695
3300012200|Ga0137382_10112822 | All Organisms → cellular organisms → Bacteria | 1807
3300012202|Ga0137363_10000740 | All Organisms → cellular organisms → Bacteria | 18172
3300012202|Ga0137363_10110579 | All Organisms → cellular organisms → Bacteria | 2104
3300012202|Ga0137363_10119128 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 2034
3300012202|Ga0137363_10478974 | All Organisms → cellular organisms → Bacteria | 1044
3300012202|Ga0137363_10949468 | All Organisms → cellular organisms → Bacteria | 730
3300012202|Ga0137363_11456582 | All Organisms → cellular organisms → Bacteria | 575
3300012203|Ga0137399_10233151 | All Organisms → cellular organisms → Bacteria | 1505
3300012203|Ga0137399_10367301 | All Organisms → cellular organisms → Bacteria | 1197
3300012205|Ga0137362_10312322 | All Organisms → cellular organisms → Bacteria | 1361
3300012205|Ga0137362_10595743 | All Organisms → cellular organisms → Bacteria | 953
3300012205|Ga0137362_10659411 | All Organisms → cellular organisms → Bacteria | 900
3300012208|Ga0137376_10949970 | All Organisms → cellular organisms → Bacteria | 737
3300012211|Ga0137377_11746023 | All Organisms → cellular organisms → Bacteria | 543
3300012351|Ga0137386_10070972 | All Organisms → cellular organisms → Bacteria | 2430
3300012351|Ga0137386_10643558 | All Organisms → cellular organisms → Bacteria | 763
3300012359|Ga0137385_10023711 | All Organisms → cellular organisms → Bacteria | 5495
3300012361|Ga0137360_10170658 | All Organisms → cellular organisms → Bacteria | 1741
3300012361|Ga0137360_10412594 | All Organisms → cellular organisms → Bacteria | 1140
3300012361|Ga0137360_11164706 | All Organisms → cellular organisms → Bacteria | 666
3300012362|Ga0137361_10240034 | All Organisms → cellular organisms → Bacteria | 1649
3300012362|Ga0137361_10364304 | All Organisms → cellular organisms → Bacteria | 1327
3300012362|Ga0137361_10426348 | All Organisms → cellular organisms → Bacteria | 1219
3300012362|Ga0137361_11171965 | All Organisms → cellular organisms → Bacteria | 690
3300012363|Ga0137390_10517148 | All Organisms → cellular organisms → Bacteria | 1166
3300012363|Ga0137390_11237934 | All Organisms → cellular organisms → Bacteria | 693
3300012582|Ga0137358_10120442 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1783
3300012685|Ga0137397_10002099 | All Organisms → cellular organisms → Bacteria | 13969
3300012685|Ga0137397_10005843 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 8516
3300012685|Ga0137397_11293508 | All Organisms → cellular organisms → Bacteria | 519
3300012917|Ga0137395_10601805 | All Organisms → cellular organisms → Bacteria | 794
3300012918|Ga0137396_10167299 | All Organisms → cellular organisms → Bacteria | 1607
3300012922|Ga0137394_10464720 | All Organisms → cellular organisms → Bacteria | 1076
3300012923|Ga0137359_10617356 | All Organisms → cellular organisms → Bacteria | 950
3300012923|Ga0137359_10789422 | All Organisms → cellular organisms → Bacteria | 823
3300012925|Ga0137419_10529998 | All Organisms → cellular organisms → Bacteria | 938
3300012925|Ga0137419_10577063 | All Organisms → cellular organisms → Bacteria | 901
3300012927|Ga0137416_10437245 | All Organisms → cellular organisms → Bacteria | 1116
3300012930|Ga0137407_11924012 | All Organisms → cellular organisms → Bacteria | 564
3300012944|Ga0137410_10428444 | All Organisms → cellular organisms → Bacteria | 1071
3300014201|Ga0181537_10494154 | All Organisms → cellular organisms → Bacteria | 837
3300015241|Ga0137418_11248672 | All Organisms → cellular organisms → Bacteria | 520
3300017933|Ga0187801_10460373 | All Organisms → cellular organisms → Bacteria | 535
3300018468|Ga0066662_10998773 | All Organisms → cellular organisms → Bacteria | 829
3300020022|Ga0193733_1075998 | All Organisms → cellular organisms → Bacteria | 943
3300020199|Ga0179592_10292557 | All Organisms → cellular organisms → Bacteria | 725
3300020579|Ga0210407_10545228 | All Organisms → cellular organisms → Bacteria | 905
3300020579|Ga0210407_10624596 | All Organisms → cellular organisms → Bacteria | 839
3300020580|Ga0210403_10362521 | All Organisms → cellular organisms → Bacteria | 1186
3300020580|Ga0210403_10947730 | All Organisms → cellular organisms → Bacteria | 676
3300020581|Ga0210399_10254040 | All Organisms → cellular organisms → Bacteria | 1466
3300020581|Ga0210399_11431752 | All Organisms → cellular organisms → Bacteria | 539
3300020583|Ga0210401_10600851 | All Organisms → cellular organisms → Bacteria | 962
3300020583|Ga0210401_10805059 | All Organisms → cellular organisms → Bacteria | 799
3300021046|Ga0215015_10806164 | All Organisms → cellular organisms → Bacteria | 682
3300021170|Ga0210400_11153337 | All Organisms → cellular organisms → Bacteria | 626
3300021178|Ga0210408_10462889 | All Organisms → cellular organisms → Bacteria | 1009
3300021180|Ga0210396_10000122 | All Organisms → cellular organisms → Bacteria | 135106
3300021401|Ga0210393_10739197 | All Organisms → cellular organisms → Bacteria | 802
3300021404|Ga0210389_10030156 | All Organisms → cellular organisms → Bacteria | 4169
3300021405|Ga0210387_10033514 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4060
3300021407|Ga0210383_10534177 | All Organisms → cellular organisms → Bacteria | 1012
3300021420|Ga0210394_10086307 | All Organisms → cellular organisms → Bacteria | 2709
3300021432|Ga0210384_10214809 | All Organisms → cellular organisms → Bacteria | 1731
3300021476|Ga0187846_10274317 | All Organisms → cellular organisms → Bacteria | 699
3300021477|Ga0210398_10905166 | All Organisms → cellular organisms → Bacteria | 707
3300021479|Ga0210410_10000014 | All Organisms → cellular organisms → Bacteria | 320811
3300021479|Ga0210410_10637470 | All Organisms → cellular organisms → Bacteria | 944
3300021559|Ga0210409_11036530 | All Organisms → cellular organisms → Bacteria | 695
3300022523|Ga0242663_1106636 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfuromonadales → Geobacteraceae → Trichlorobacter → Trichlorobacter thiogenes | 564
3300022721|Ga0242666_1027861 | All Organisms → cellular organisms → Bacteria | 1100
3300022722|Ga0242657_1151605 | All Organisms → cellular organisms → Bacteria | 611
3300022726|Ga0242654_10256920 | All Organisms → cellular organisms → Bacteria | 628
3300026499|Ga0257181_1056950 | All Organisms → cellular organisms → Bacteria | 656
3300026514|Ga0257168_1063040 | All Organisms → cellular organisms → Bacteria | 817
3300026515|Ga0257158_1090851 | All Organisms → cellular organisms → Bacteria | 597
3300027174|Ga0207948_1008586 | All Organisms → cellular organisms → Bacteria | 1171
3300027635|Ga0209625_1079048 | All Organisms → cellular organisms → Bacteria | 737
3300027671|Ga0209588_1001275 | All Organisms → cellular organisms → Bacteria | 6530
3300027671|Ga0209588_1255435 | All Organisms → cellular organisms → Bacteria | 534
3300027674|Ga0209118_1000980 | All Organisms → cellular organisms → Bacteria | 14067
3300027674|Ga0209118_1017186 | All Organisms → cellular organisms → Bacteria | 2352
3300027678|Ga0209011_1010599 | All Organisms → cellular organisms → Bacteria | 3082
3300027857|Ga0209166_10029501 | All Organisms → cellular organisms → Bacteria | 3343
3300027875|Ga0209283_10395607 | All Organisms → cellular organisms → Bacteria | 901
3300027884|Ga0209275_10097438 | All Organisms → cellular organisms → Bacteria | 1500
3300027884|Ga0209275_10300263 | All Organisms → cellular organisms → Bacteria | 891
3300028536|Ga0137415_11392701 | All Organisms → cellular organisms → Bacteria | 522
3300030730|Ga0307482_1047949 | All Organisms → cellular organisms → Bacteria | 1029
3300030730|Ga0307482_1240784 | All Organisms → cellular organisms → Bacteria | 565
3300030878|Ga0265770_1030614 | All Organisms → cellular organisms → Bacteria | 899
3300031090|Ga0265760_10056216 | All Organisms → cellular organisms → Bacteria | 1190
3300031128|Ga0170823_12147162 | All Organisms → cellular organisms → Bacteria | 705
3300031231|Ga0170824_127414821 | All Organisms → cellular organisms → Bacteria | 957
3300031715|Ga0307476_10013352 | All Organisms → cellular organisms → Bacteria | 5187
3300031718|Ga0307474_10002283 | All Organisms → cellular organisms → Bacteria | 13794
3300031753|Ga0307477_10003646 | All Organisms → cellular organisms → Bacteria | 11391
3300031753|Ga0307477_10326272 | All Organisms → cellular organisms → Bacteria | 1058
3300031753|Ga0307477_10375693 | All Organisms → cellular organisms → Bacteria | 975
3300031753|Ga0307477_10414109 | All Organisms → cellular organisms → Bacteria | 922
3300031754|Ga0307475_11498220 | All Organisms → cellular organisms → Bacteria | 518
3300031823|Ga0307478_11344186 | All Organisms → cellular organisms → Bacteria | 593
3300032892|Ga0335081_10504518 | All Organisms → cellular organisms → Bacteria | 1521
3300032898|Ga0335072_11559022 | All Organisms → cellular organisms → Bacteria | 560
3300033402|Ga0326728_10003280 | All Organisms → cellular organisms → Bacteria | 50809
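
Each scaffold identifier above has two parts separated by `|`. The sketch below splits an identifier and flags short scaffolds using the 2000 bp threshold quoted in the Quality Assessment. It assumes the part before `|` is the Taxon OID used in the Associated Samples table; that reading is inferred from the matching numbers on this page, not from NMPFamsDB documentation. The `Scaffold` type and `parse_scaffold` helper are hypothetical names.

```python
from typing import NamedTuple

SHORT_SCAFFOLD_BP = 2000  # threshold quoted in the Quality Assessment


class Scaffold(NamedTuple):
    taxon_oid: str    # assumed to match the Taxon OID of the source sample
    scaffold_id: str  # scaffold name within that sample
    length: int       # scaffold length in bp


def parse_scaffold(identifier: str, length: int) -> Scaffold:
    """Split 'taxon_oid|scaffold_id' on the first '|' only."""
    taxon_oid, scaffold_id = identifier.split("|", 1)
    return Scaffold(taxon_oid, scaffold_id, length)


# Four rows copied from the table above.
rows = [
    ("3300001154|JGI12636J13339_1014426", 1173),
    ("3300005537|Ga0070730_10007049", 9540),
    ("3300021479|Ga0210410_10000014", 320811),
    ("3300031754|Ga0307475_11498220", 518),
]

scaffolds = [parse_scaffold(identifier, length) for identifier, length in rows]
short = [s for s in scaffolds if s.length < SHORT_SCAFFOLD_BP]

print(scaffolds[0].taxon_oid)
print(f"{len(short)} of {len(scaffolds)} sampled scaffolds are shorter than {SHORT_SCAFFOLD_BP} bp")
```

Grouping parsed scaffolds by `taxon_oid` would then give the number of family members contributed by each sample.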




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 42.48%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 18.30%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.54%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 5.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 4.58%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 4.58%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.61%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.31%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.31%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.31%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.31%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.31%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.31%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.31%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.65%
Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 0.65%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.65%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.65%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.65%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.65%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.65%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.65%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 0.65%
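
The habitat paths above are hierarchical, so the distribution can be summarised at a coarser level by summing percentages that share a path component. The sketch below is one way to do that, assuming each path is a string of levels joined by `→`. The `aggregate_by_level` helper is hypothetical, and only four rows are included for illustration, so the totals do not cover the whole family.

```python
from collections import defaultdict


def aggregate_by_level(rows, level):
    """Sum percentages of habitat paths that share the component at `level`.

    `rows` is an iterable of (path, percent) pairs; `level` is a 0-based
    index into the path (0 = "Environmental", 1 = "Terrestrial"/"Aquatic").
    """
    totals = defaultdict(float)
    for path, percent in rows:
        parts = [part.strip() for part in path.split("→")]
        totals[parts[level]] += percent
    return dict(totals)


# Four rows copied from the habitat table above (not the full table).
rows = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 42.48),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 18.30),
    ("Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog", 0.65),
    ("Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil", 0.65),
]

by_category = aggregate_by_level(rows, level=1)
for category, percent in by_category.items():
    print(f"{category}: {percent:.2f}%")
```

Passing `level=2` instead would group the same rows by ecosystem type (Soil, Freshwater, Peat).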




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001154 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 | Environmental
3300001593 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004631 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome) | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300010858 | Boreal forest soil eukaryotic communities from Alaska, USA - C3-2 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300014201 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin23_10_metaG | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300017933 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300020022 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1s2 | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021476 | Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2) | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022523 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-O (Metagenome Metatranscriptome) (v2) | Environmental
3300022721 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-O (Metagenome Metatranscriptome) (v2) | Environmental
3300022722Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-12-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022726Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300027174Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF040 (SPAdes)EnvironmentalOpen in IMG/M
3300027635Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300030730Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030878Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZE1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031090Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZI1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031128Oak Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300032892Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5EnvironmentalOpen in IMG/M
3300032898Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.1EnvironmentalOpen in IMG/M
3300033402Lab enriched peat soil microbial communities from McLean, Ithaca, NY, United States - MB31MNEnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12636J13339_101442623300001154Forest SoilMDLRCPKCNSNNLKKVSLAYQEGTYLIDTRSRMGGLLFAGGGPDIVVGRTATRGTHQSALSKRLKPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSPVPASSLPVKLYVIFAPAVLLLLVGIAWRHNHSTYEQKYARWNESFICERCGTVSQQALR*
JGI12635J15846_1031526223300001593Forest SoilMFRHEMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFXPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0062389_10194372513300004092Bog Forest SoilMDLRCPKCNSNNLRMVSLAYQEGTYRVDTRSRIRGLLLAGGGPDILVGRATTRGSQQSALSKRLSPPSKWSYMKLILWSGVATLIAIVIYVQHVMSSPVPASSLPAKLYVLFAPFIFLFLVAIIWRHNHLTYQQKYAQWNESFICERCGTVSQQALH*
Ga0058899_1212832013300004631Forest SoilPHRSGAARGDKMDLRCPKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFASGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVVYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSQQALR*
Ga0066675_1026699123300005187SoilMDLRCPKCNSNSLKKVSLAHQEGTYHIDTRSRMRGLLFAGGGPDILVGRTTTRGTEQSALSKRLCPPTKWSYVKLVLWSGVITLIALVIYVQHVMSSPVPASSLPVKLYVIFAPVVLLLLVGIVWRHNHSTYEQKYARWNESFICERCGTVSQQALR*
Ga0066682_1078646713300005450SoilRRATKVESPGWMRIACGIGQKELKPDRSGVARSEEMDLRCPKCNSNNLKKVSLAYQEGTYHINTRGRMRGLLFAGGGPDILVGRTTTGGSQQSALSKRLSPPSKWSYVKLVLWSGVVTFIALVLYVQHVMSSSVPASSLPVRLYLIFAPVVLLLLVGIVWRHNGSTYEQKYAQWNESFICGRCGTVSKQAV
Ga0070730_1000704933300005537Surface SoilMDLRCLKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTFIALLLYVQHVMASPVPASSLPVKLYVVFAPVVLSLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSQQALR*
Ga0066697_1009819023300005540SoilMDLRCPKCNSSNLKKVSLAYQEGTYHIAARSKMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSTYERKYARWNESFICERCGTVSQQALR*
Ga0070696_10060615723300005546Corn, Switchgrass And Miscanthus RhizospherePHRSGAARGDKMDLRCPKCNSNNLKTVSLAYQEGTYHIDARSRMRGLLFAAGGPGILVGRTSTRGSQQSALSKRLSPPPKWSYLKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYLVFAPLVLLLLVGIVWRHNHSAYEQKYARWNESYICELCGTVSQQALR*
Ga0066707_1029054023300005556SoilMDLRCPKCNSNNLKKVSLAYQEGTYLIDTRSRMRGLLFAGGGPGIVVGRTATRGSQQSALSKRLKPPSKWSYLKLVLWSGVVTLIALVIYVRHVMSSPVPASSLPVKLYVIFAPVVLLLLVGIVWRHNHSTYEQEYARWNESFICERCGTVSQQALR*
Ga0066700_1070061523300005559SoilGDKMDLRCPKCNSNNLKNVSLAYQEGTYHINTRGRMRGLLFAGGGPDILVGRTTTRGTQQSALSKRLCPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSPVPASSLPVKLYAIFAPVVLLLFVGIVWRHNHSTYERKYARWNESFICERCGTVSQQALR*
Ga0066706_1092645923300005598SoilMDLRCPKCNSNDLKKVSLAYQEGLYRTNARTRLSAAVIGGNGPDLVVGRATTKATQQSALSKQLSPPVRWSYVKLVLWSGVVTLIALVLYVQHVMASPVPASSLPVRFYVVFAPVVLLLLVGIVWRHNHSTYQQKYALWNDSFLCERCGTIS*
Ga0070762_1016608323300005602SoilMDLRCPKCNSSNLKKVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTCGTQQSALSKRLCPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSRVPASSLPVKLYVIFAPVLLLLLVGIAWRHNHSTYDQKYAQWNESFICERCGTVSQQGLS*
Ga0070762_1024810513300005602SoilRGDQMDLRCSKCHSNNLKKVSLAYQEGAYRMDSRSRIRGLLFAGGGPDILIARATTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYAVFAPLVLFLLVGIVWRHNHSAYELKYARWNESFICERCGTVSQQALR*
Ga0075028_10085395413300006050WatershedsAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWLGVVTLIALILYVQHVMASPVPAPSLPVKLYVVFAPLVLFLLVGIIWRHNHSAYEQKYARWNESYICERCGTVSQQAPR*
Ga0070765_10009416523300006176SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAVGGQGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0070765_10037524623300006176SoilMDLRCSKCHSNNLKKVSLAYQEGAYRMDSRSRIRGLLFAGGGPDILIARATTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYAVFAPLVLFLLVGIVWRHNHSAYELKYARWNESFICERCGTVSQQALR*
Ga0070765_10093888223300006176SoilKMDLRCPKCNSNNLKKVSLAYQEGTYHINTRGRMRGLLFAGGGPDILVGRTTTGGSQQSALAKHLSPPSKWSYVKLVLWSGVVTFIALILYVQHVMSSSVPASSLPVRLYLIFAPVVLLLLVGIVWRHNSSTYKQKYAQWNESFICGRCGTVSKQAVC*
Ga0066665_1076787423300006796SoilPGWMRIACGIGQKELKPDRSGVARSEEMDLRCPKCNSNNLKKVSLAYEEGTYHINTRGRMRGLLFAGGGPDILVGRTTTRGSQQSGLSKRLCPPTKWSYVKLVLWSGAVTLIALIIYVQHVMGSPVPASSLPAKLYVLFAPVVFLLLGAIIWRHNHSTYQQKYAQWNESFICGRCGTVSKQAVC*
Ga0079221_1047374713300006804Agricultural SoilPHRSGAARGDKMDLRCPKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFASGGPGILVGGTSTRGSQQSALSKRLSPPTRWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVAFAPLVLSLLVGIVWRHNHSAYEQKYARWNESFICERCGTVSQQAFR*
Ga0079220_1075726913300006806Agricultural SoilMDLRCPKCNSNSLKKVSIVYQEGTYNIDTRSRMRGLLFAGGGPDILVGRTTTRGTQQSVLSKRLCPPTKWSYVKLILWPGVVTLIALIIYVQHVMSSPVPASSLPVKLYVIFAPVVLLLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSPSASPPSNSTAPA*
Ga0099791_1032476723300007255Vadose Zone SoilMDLRCPKCNSNSLKKVSLAHQEGTYHIDTRSRMRGLLFAGGGPDILVGRTTTRGTQQSALSKRLCPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSPMPASSLPVKLYVIFAPVVLLLLVGIVWRHNHSTYEQKYARWNESFICERCGTVSQQ
Ga0099794_1016061123300007265Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKNVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0099829_1010584523300009038Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQSASE*
Ga0099829_1099568313300009038Vadose Zone SoilMDLRCPKCDSNNLKRVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTHGSQQSALSKRLSPPTKWSYLKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSAYKQKYARWNESYICERCGTVFEPF*
Ga0099830_1049230923300009088Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQESVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVMTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0099830_1060250823300009088Vadose Zone SoilMDLRCPKCNSTDLKKVSLAYQEGTYHIDTRSRIRGLLFAGGGPDVLVGRATTRGSQQSALSKRLSPPSKWSYMKLILWSGVVTLIALVIYIQHVMSSPVPVSSLPAKLYVLFAPVVFLLLVAIIWRHNHSTYQQKYAQWNESFICERCGTVSQQALR*
Ga0099830_1098254713300009088Vadose Zone SoilMDLRCPKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSRQALR*
Ga0099830_1161614213300009088Vadose Zone SoilARGDKMDLRCPNCNGTDLKKVSLAYQEGIYHIDTRSRIRGLLFAGGGPDILVGRTTTRGTQQSALSKRLCPPTKWSYVKLVLWSGVVTLITLVIYVQHVMSSPVPASSLPVKLYAVFAPIVLLLFIGTVWRHNHSIYQRKYAQWNESFICERCGTVSQQALD*
Ga0099830_1162576913300009088Vadose Zone SoilHRSGAARGDKMDLRCPKCNSNNLKNVSLAYQEGTYHTHTRSRIRGLLFAGGGPDILVGRTTTRGSQQSALSKRLSPPSKWSYVKLVLWSAVVTLIALIIYVQHVMGSPVPASSLPAKLYVLFAPFIFVFLVAIIWRHNHLTYQQKYAQWNESFICERCGTVSLQSLG*
Ga0099827_1152885123300009090Vadose Zone SoilMQGVLVGGNGPNIMVGRATTNGILQTQLSRRLSPPKKRSYLKLVVWTGVVTLIALVIYVQHVMSSPVPASSLPVKLYVLFAPAVLLLLVGIVWRHNHSTYQQKYAHWNESFICERCGTVSQQALH*
Ga0066709_10100063613300009137Grasslands SoilMDLRCPKCNSNNLKSVFSAYQEGTCDINARSRMRGLLFAAGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVIYVQHVMSSRVPASSLPVKLYVIFAPVVLLLLVGIVWRHNHST
Ga0134062_1074933813300010337Grasslands SoilMDLRCPKCNSNNLKSVFSAYQEGTCDINARSRMRGLLFAAGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVIYIQHVMSSPVPASSLPVKLYVIFAPVVLLLLVGIVWRHNHST
Ga0126378_1077947423300010361Tropical Forest SoilMDLRCPKCNSTDLKKVSLAYQEGTYQINTRTGIRGLLFAGGGPGVLVGGATTRGSQQSALSKRLSPPSKWSYAKLVLWSGVVTLIALFLYVQHVMSSTPPVSSLPVRLYAVFAPVVLLLLVGIVWRHNHSTYRQKYAHWDKSFICERCGTVSQQTTL*
Ga0134128_1001943253300010373Terrestrial SoilVSLAYQEGAYHIDTRSRIRGLLFAGGGPDVLVGRATTRGSHQSALSKSLSPPTKWSYVKLVLWSGVVTFVALVIYVQHVMSSPVPASSLPVKLYVIFASVVLLLLAGAVWRHNHSTYRQKQAQWNESFICRRCGTVGQQSLS*
Ga0126381_10206792313300010376Tropical Forest SoilMDLRCPKCNSTNLKKVSLAYQEGTYQTTSRTGIRGLLFAGGGPGVLVGGATTRGSQQSALSKRLSPPSKWSYAKLVLWSVVVTLIVLVLYAQHVMSSPPPASSLPVKLYAVFAPVVLLLLMSIVWRHNHSTYQQKYALWNESFICERCGTISQQIVH*
Ga0134121_1075854913300010401Terrestrial SoilLRSAGFNGGSFGLKKKGLIATQKWGCKGDKMDLRCPKCNSTDLKNVSLAYQEGAYHIDTRSRIRGLLFAGGGPDVLVGRATTRGSHQSALSKSLSPPTKWSYVKLVLWSGVVTFVALVIYVQHVMSSPVPASSLPVKLYVIFASVVLLLLAGAVWRHNHSTYRQKQAQWNESFICRRCGTVGQQSLS*
Ga0126345_110637213300010858Boreal Forest SoilLILTKLWRTVARRDAWPHRSGAARGDKMDLRCPKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFAAGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYLKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSAYEQKYARWSESYICE
Ga0150983_1220236113300011120Forest SoilRHEMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137392_1038040123300011269Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGIRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSAPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137391_1042138613300011270Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVL
Ga0137393_1045867923300011271Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGWGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137393_1096761113300011271Vadose Zone SoilMDLRCPECNSNNLKKVSLAYQEGTYHIAARSKMRGLLFAGGGPGILVGRLTTRGSQQSALSKRLCPPSKWSYLKLVLWSGVVTLIALVVYVQHVMSSPVPASSLPVKLYVVFAPVVLLLLLGIVWRHNHSTYQQKYTQWNESFICERCGTVSQQSLG*
Ga0137389_1062611313300012096Vadose Zone SoilMDLRCAKCNSNNLKKVSLAYQEGTYHIDTRSRMRGLLFAGGGPGIVVGRTATRGSQQSALSKRLKPPSKWSYLKLVLWSGVVTLIALVVYVQHVMSSPMPASSLPVKLYVVFAPVVLLLLLGIVWRHNHSTYEQKYTLWNESFICERCGTVSQQSLG*
Ga0137364_1000626223300012198Vadose Zone SoilMVEPQATKTGLPGWMRIAVRIGQKGLKPHGSGSARGEKMDLRCPKCNGNSLKKVSLAYQEGTHHMNSRTRIRGLLFASGGPDVLVGGATTRGSQQSALSKRLSPPSKWSYVKLVLWSGVVTFIALILYAQHVMSSSVPASSLPVRLYLIFAPVVLLLLVGIVWRHNGSTYEQKYAQWNESFICGRCGTVFEPF*
Ga0137383_1020313323300012199Vadose Zone SoilMHRHRVVGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137383_1078502913300012199Vadose Zone SoilMDLRCPKCNSNNLRRVSLAYQVGTYHIDARSRMRGLLFAGGGPGILVGRTSTHGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSTYEQKYARWNESYICERCGTVSQQALD*
Ga0137382_1011282223300012200Vadose Zone SoilMVEPQATKAGLPGWMRIAVRIGQKGLKPHGSGSARGEKMDLRCPKCNGNSLKKVSLAYQEGTHHMNSRTRIRGLLFASGGPDVLVGGATTRGSQQSALSKRLSPPSKWSYVKLVLWSGVVTFIALILYAQHVMSSSVPASSLPVRLYLIFAPVVLLLLVGIVWRHNGSTYEQKYAQWNESFICGRCGTVFEPF*
Ga0137363_10000740163300012202Vadose Zone SoilMDLRCAKCNSTDLKKISLAYQEGTYHIDTRSRLRGLLFADGGPDVLVGRTTTHGSHQSALSKRLCPPTKWSYVKLVLWSGVLTLIALVIYVQHVMSSPAPASSLPVKLYVILAPVVLLLLVGTVWRHNHSTYQQKYAQWNESFICQRCGTVGQQSLG*
Ga0137363_1011057923300012202Vadose Zone SoilMDLRCPKCNSNSVKKVSLAYQEGTYHIDTRSRMRGLLFAGGGPGIVVGRTAARGSQQSALSKRLSPPSKWSYMKLILWSGVTTLIALVIYVQHVMSSPVPASSLPAKLYVLFAPFIFLFLVAIIWRHNHLTYQQKYAQWNESFICERCGTVSQQALG*
Ga0137363_1011912813300012202Vadose Zone SoilAVGLGSIRKDSSPHRSGAARGDKMDLRCPKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTHGSQQSALSKRLSPPTNWSYLKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSQQALR
Ga0137363_1047897423300012202Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137363_1094946813300012202Vadose Zone SoilLVRKDSSHTEVGLQGVTKMDLCCPKCNSNNLKNVSLAYQEGTYHIDTRSKMRSLVFAGGGPGILVGRTTTRGTQQSALSKRLCPPSKWSFVKLVLWSGVVTLIALVVYVQHIMSSAAPASSLPVKLYVVFAPVVFLILFAIVWRHNHTTYQQKYAQWNESFICERCGTVSLQSLG*
Ga0137363_1145658213300012202Vadose Zone SoilMTIALDQIRKDFRPHRSRAAWGDKMDLRCPKCNSNNLKNVSLAYQEGTYHIDTRSRMRGLLFAGGGPDILVGRTTTRGSQQSALSKRLSPPSKWSYVKLVLWSAVVTLIALISYVQHVMSSPVPASSLPVKLYVVFAPVVLLLLLGIVWRHNH
Ga0137399_1023315123300012203Vadose Zone SoilMFRHEMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137399_1036730123300012203Vadose Zone SoilMVEPQATKTGLPGSMRIAVRIGQKGLKPHRSGAARGDNMDLRCPKCNSTDLKKVSLVHQEGTYHIDMRSRMRGLLFAGGGPDILVGRTTTRGSQQSALSKRLSPPSKWSYVKLVLWSGFVTLIALVLYVQHVMSSPVPASSLPVKLYVVFAPVVLLLLLGIVWRHNHSTYQQKYTQWNESFICERCGTVSQQSLG*
Ga0137362_1031232213300012205Vadose Zone SoilMDLRCLNCNSTDLKKVSLAHQEGTYHIDARSKIRGLLFAGGGPGILVGRLTTRGSQQSALSKRLSPPSKWSYVKLVMWSGAVTLIALVIYVQHVMSSPVPASSLPVKLYAVFAPIVLLLLIGTVWRHNHSIYQQKYAEWNESFVCKRCGTVSQQALH*
Ga0137362_1059574323300012205Vadose Zone SoilMDLRCPKCNSTDLKKVSLVHQEGTYHVDMRSRMRGLLFAGGGPDILVGRTTTRGSQQSALSKRLSPPSKWSYVKLVLWSGFVTLIALVLYVQHVMSSPVPASSLPVKLYVVFAPVVLLLLLGIVWRHNHSTYQQKYTQWNESFICERCGTVSQQSLG*
Ga0137362_1065941113300012205Vadose Zone SoilMDLRCPKCNSDSVKKVSLAYQEGTYHIDTWSRMRGLLFAGGGPGIVVGRTAARGSQQSALSKRLSPPSKWSYMKLILWSGVTTLIALVIYVQHVMSSPVPASSLPAKLYVLFALFIFLFLVAIIWRHNHLTYQQKYAQWNESFICERCGTVSQQALG*
Ga0137376_1094997013300012208Vadose Zone SoilMDLRCPKCGSTALKKVSLAYQEGLFQVNTRTRMLGFLFASGGPDVMVGRATTRGSQQSALSKRLSPPTRWSYVKPIGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSAYKQKYARWNESYICERCGTVSQQALP*
Ga0137377_1174602313300012211Vadose Zone SoilCNSTDLRKVSLAYQEGTYHIDTRSRIRGLLFAGGGPDILVGRTTTRGTQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQYVMASPVPASSLPVKLYVVFAPLVLFLLVGIIWRHNHSAYEQKYARWNESYICERCGTVFEPF*
Ga0137386_1007097213300012351Vadose Zone SoilMDLRCPKCNSSNLKKVSLVHQEGTYHIAARSKIRGLLFAGGGPGILVGRLTTRGSQQSALSKHLSPPSKWSYVKLVLWSGVVTLIALVVYVQHVMSSPVPASSLPVKLYVVFAPIVLLLLLGIVWRHNHSAYEQKYARWNESYICERCETVFEPF*
Ga0137386_1064355813300012351Vadose Zone SoilMAQAIRQDASPQSSGTARGDKMVLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPAKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137385_1002371133300012359Vadose Zone SoilMDLRCPKCNSSDLKKVSLAYQEGTYHIDTRSRIRGLLFAGGGPDVLVGRTTTHGSHQSALSKRLCPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSPVPASSLPVKLYVIFAPVVLLLLVGTVWRHNHSTYQQKYAQWNESFICQRCGTVGRQSLG*
Ga0137360_1017065823300012361Vadose Zone SoilMHRPQSRGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALAKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137360_1041259433300012361Vadose Zone SoilVSLAYQEGTYHIDTRSRMRGLLFAGGGPGIVVGRTAARGSQQSALSKRLSPPSKWSYMKLILWSGVTTLIALIIYVQHVMSSPVPASSLPAKLYVLFALFIFLFLVAIIWRHNHLTYQQKYAQWNESFICERCGTVSQQALG*
Ga0137360_1116470613300012361Vadose Zone SoilMDLRCPKCNSSNLKKVSLAHQEGTYHIDARSKIRGLLFAGGGPGILVGRLTTRGSQQSALSKRLSPPSKWSYVKLVMWSGAVTLIVLVIYVQHVMSSPVPASSLPVKLYAVFAPIVLLLLIGTVWRHNHSTYQQKYAQWNGSFICERCGTVSQQALR*
Ga0137361_1024003433300012362Vadose Zone SoilMFRHEMAQAIRKDASPQSSRTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137361_1036430423300012362Vadose Zone SoilMDLRCPKCNSNSLKKVSLAYQEGTYHIDTRSRMRGLLFAGGGPAILVGRTTTRGTQQSALSKRLWPPTKWSYLKLVGWSGVVTLIALVLYVQHVMSSPVPASSLPVKLYVIFAPVVLLLLVGIVWRHNHSTYERKYARWNESFICERCGTVSQQALR*
Ga0137361_1042634813300012362Vadose Zone SoilMDLRCPKCNSSNLKKVSLAHQEGTYHIAARSKMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTRWSYVKPIGWSGVVTLIALVLYIQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIIWRHNHSAYEQKYARWNESYICERCGTVSQQALR*
Ga0137361_1117196523300012362Vadose Zone SoilYQEGTYRIGTRSRIRGLLFASGGPDILVGRATTRGSQQSALSKRLSPPSKWSYMKLILWSGVTTLIALIIYVQHVMSSPVPASSLPAKLYVLFAPFIFLFLVAIIWRHNHLTYQQKYAQWNESFICERCGTVSQQALG*
Ga0137390_1051714823300012363Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDSRTGIRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137390_1123793413300012363Vadose Zone SoilLAYQEGIYHIDTRSRIRGLLFAGGGPDVLVGRTTTRGSQQSALSKHLSPPSKRSYVKLVLWSGVVTLIALVVYVQHVMSSPMPASSLPVKLYVVFAPVVLLLLLGIVWRHNHSTYEQKYALWNESFICERCGTVSQQSLG*
Ga0137390_1154792013300012363Vadose Zone SoilKQIASCFYTWPRKWLDQGMVEPQATKAGLPGWIRIAVRIGQKGFKPHGSEAARGDKMDLCCPKCNSTDLKKVSLAYQEGTYHIDTRSRIRGLLFAGGGPDVLVGRATTRGSQQSALSKRLSPPSKWSYMKLVLWSGVVTLIALVIYVQHVMSSPVPASSLPVKLYVAFAPFVFLLLVRIIWRHNHSIYRQNYAQWDESFI
Ga0137358_1012044223300012582Vadose Zone SoilMDLRCPKCNSNNLERVSLAYQEGTYHIHTRSRIRGLLFAGGGPDILVGRTTTRGSRQSALSNHLCPPSKWSYVKLVLWTGVLTLIAVVIYVQHVMSSPVPASSLPAKLYVVFAPVVLLLLVGIVWRHNHSAYLRRSTEWDRSFICERCGCIAQKELT*
Ga0137397_10002099133300012685Vadose Zone SoilMELRCPKCNSSDLKKASLAYKEGIYHIATRSRIRGLLFAGGGPNVLVGRTTTRGSEQSALSKRLSPPSKWSYVKLVLWSGVLTFIALVLYVQHVMSSSVPASSLPVKVYVISAPVVLFLLVGIVWRHNHLTYEQRYAQWNESFICQRCGTVSQQAFR*
Ga0137397_1000584363300012685Vadose Zone SoilMDLRCPKCSSNNLKKVSLAYQEGTYQIDTRSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSPVPASSLPVKLYVIFGSVVLLLLVGIVWRHNHSTYERKYARWNESFICERCGTVSQQALR*
Ga0137397_1129350813300012685Vadose Zone SoilKTVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVAWSGVVTLIALVIYVQHVMASPVPASSLPVKLYVVFAPLVLSLLVGIVWRHNHSAYEQKYARWNESFICERCGTVFEQF*
Ga0137395_1060180523300012917Vadose Zone SoilMDLRCPKCNSNNLKNVSLAYQEGTYHIDTRSRMRGLLFAGGGPDILVGRTTTRGSQQSALSKRLSPPSKWSYVKLVLWSGFVTLIALVLYVQHVMSSPVPASSLPVKLYVVFAPVVLLLLLGIVWRHNHSTYQQKYTQW
Ga0137396_1016729933300012918Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKGLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137394_1046472023300012922Vadose Zone SoilMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD*
Ga0137359_1061735623300012923Vadose Zone SoilMDLRCPKCNSTDLKKVSLVHQEGTYHVDMRSRMRGLLFAGGGPDILVGRTTTRGSQQSALSKRLSPPSKWSYVKLVLWSGFVTLIALVLYVQHVMSSPVPASSLPVKLYVVFAPVVLLLLLGIVWRHNHSTYQQKYTQWNESFICERCGTVSQQ
Ga0137359_1078942223300012923Vadose Zone SoilMDLSCPKCNSNNLKRVSLAYQEGTYDINARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIELVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIIWRHNHSAYE*
Ga0137419_1052999823300012925Vadose Zone SoilCPKCNSTDLKKVSLVHQEGTYHIDMRSRMRGLLFAGGGPDILVGRTTTRGSQQSALSKRLSPPSKWSYVKLVLWSGFVTLIALVLYVQHVMSSPVPASSLPVKLYVVFAPVVLLLLLGIVWRHNHSTYQQKYTQWNESFICERCGTVSQQSLG*
Ga0137419_1057706323300012925Vadose Zone SoilMDFRCPKCNSTDLKKVSLAYQEGIYHIDTRSRIRGLLFAGGGPDVLVGRTTTRGSQQSALSKRLSPPTRWSYVKPVGWSGVVTLIALVLYVQHVMAGPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSAYEQEYARWNESYICERCGTVFEPF*
Ga0137416_1043724523300012927Vadose Zone SoilRSGAARGDKMDLRCPKCNSTDLKKVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYIQHVMASPVPASSLPVKLYVVFAPLVLSLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSQQALD*
Ga0137407_1192401213300012930Vadose Zone SoilPGWMRIGAQNWLERTQATQKWAARSDKMDLRCPKCNSNSLKKVSLAHQEGTYHIDTRSRMRGLLFAGGGPDILVGRTTTRGTQQSALSKRLCPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSPGPASSLPAKLYMVFAPVVFLLLVAIIWRHNHSTYQQKYAQWNESFICKRCGTVSQQALG*
Ga0137410_1042844423300012944Vadose Zone SoilMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYQRRYALWNTSIICERCGTVSQQVLD*
Ga0181537_1049415413300014201BogMDLRCPKCNSNNLKKVSLAYQEGTYRIDTRSRIRGLLFAGGGPNILVGRATTRGSQQSAFSKRLSPPSKWSYVKLVAWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVLAPLILSLLVGIVWRHNHSAYEQKYARWNESYV
Ga0137418_1124867213300015241Vadose Zone SoilRGDKMDLCCPKCNSTDLKKVSLAYEEGTYHINTRGRMRGLLFAGGGPDILVGRTTTGGSQQSALSKRLSPPSKWSYVKLVLCSGVVTFIALVLYVQHVMSSSVPASSLPVRLYLIFAPVVLLLLVGIVWRHNGSTYEQKYAQWNESFICGRCGTVSNQAVC*
Ga0187801_1046037313300017933Freshwater SedimentWGDSRKGYRGVVRKDVKPHRSGAARGDKMDLRCPKCNSNNLKRVSLAYQEGTYRIDARSRMRGLLFAGGGPGILLGRTSTRGSQQSALAKRLSPPSKWSYVKLVGWSGVVTLIALVIYVQHVMSSPLPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSTYEQKYARWNESYICERCGT
Ga0066662_1099877323300018468Grasslands SoilMDLRCPKCNSTDLKKVSLVHQEGTYHIDMRSRMRGLLFAGGGPDILVGRTTTHGSQQSALSKRLSPPSKWSYVKLVMWSGAVTLIALVIYVQHVMSSPVPASSLPVKLYAVFAPVVLLILIGTVWRHNHSIYQQKYAKWNESFICERCGTVSQQALH
Ga0193733_107599823300020022SoilMDLRCPKCNSDNLKRVSLAYQEGTYHIDARSRMRGLLFASGGPGILVGRTSTRGSHQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSAYEQKYARWSESYICERCGTVSQQAVR
Ga0179592_1029255713300020199Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD
Ga0210407_1054522823300020579SoilMDLRCPKCNSNNLKGVSLAYQEGTYHIDARSRMRGLLFAAGGPGILVGRTSTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYEQKYAWWNESFICERCGTVSQQALR
Ga0210407_1062459623300020579SoilMDLRCPKCNGNSLKKVSLAYQEGTYHIDTRSRMLGLLFAGGGPGILVGRTSTRGSQQSAFSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVRLYVVFAPLVLFLLVGIVLRHNLSAYEQKYARWNESYICERCGTVSQQALR
Ga0210403_1036252123300020580SoilMDLRCPKCNSNNLKKVSLAYQEGTYHISTRGRMRGLLFAGGGPDILVGRTTTRGTQQSALSKRLCPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSPVPASSLPVKLYVIFAPVLLLLLVGIVWRHNHSTYDRKYARWNESFICERCGTVSQQALS
Ga0210403_1094773013300020580SoilVFRHEMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD
Ga0210399_1025404023300020581SoilMFRHEMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD
Ga0210399_1143175213300020581SoilMIATPKWAARGEKMDLRCPKCNSADLKKVSLAYQEGIYHIDTRSRIRGLLFAGGGPDILVGRTTTGGSQQSALSKRLSPPSKWSYVKLVLWSGVVTFIALILYVQHVMSSSVPASSLPVRLYLIFAPVVLLLLIGIVWRHNRSTYERKYAQWNASFICGRCGTVSKQAVC
Ga0210401_1060085123300020583SoilGWMCIASGIGQKGLNPHRSGVGRGEEMDLRCPKCNSNNLKKVSLAYQEGTYHINTRGRMRGLLFAGGGPDILVGRTTTGGSQQSALAKRIGPPSKWSYVKLVLWSGVVTFIALILYVQHVMSSSVPASSLPVRLYLIFAPVVLLLLAGIVWRHNGSTYDEKYAQWNESFICGRCGTVSKQAVC
Ga0210401_1080505913300020583SoilMFRHEMAQAIRKDASPQSSGTARGDKVDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFVGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD
Ga0215015_1080616413300021046SoilMDLRCPKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVIYVQHVMSSPVPASSLPVKLYVIFAPLVLFLLVGIVWRHNHSAYEQKYARWNELSLIHISEPTRPLYISY
Ga0210400_1115333723300021170SoilMFRHEMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLR
Ga0210408_1046288923300021178SoilEGTQHVDMRSRVRGLLFVGGRPGVFVGGATTRGPQQSALSKRLCPPSKWSYVKLVLWSGVVTLIALIIYVQHVMSSPVPASSLPVKLYVAFAPVVFLLLVGTTWRHNHSTYRENYAQWNESFICERCGTVSRQILR
Ga0210396_10000122153300021180SoilMDLRCPKCKSNNLKRVSLAYQEGTHRIDTRSGIRGLLFAGGGPDILIGRATTHGSQQSVLSKRLSPPSKWSYMKLILWSGVATLIALVIYVQHVMSSTVPASSLPAKLYVLFAPFIFLFLVAIIWRHNHLTYQQKYAQWNESFICERRGTVSQQALH
Ga0210393_1073919723300021401SoilMDLRCPKCNSNNLKGVSLAYQEGTYHIDARSRMRGLLFAAGGPGILVGRTSTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD
Ga0210389_1003015623300021404SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNIKKVSLAYQEGVYHIDTRTGIRGLLFAGGGQGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLVAVFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYEQRYARWNESFICERCGTVSQQALR
Ga0210387_1003351423300021405SoilMDLRCSKCHSNNLKKVSLAYQEGAYRMDSRSRIRGLLFAGGGPDILIARATTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYAVFAPLVLFLLVGIVWRHNHSAYELKYARWNESFICERCGTVSQQALR
Ga0210383_1053417713300021407SoilSEEMDLRCPKCNSNNLKKVSLAYQEGTYHINTRGRMRGLRFAGGGPDILVGRTTTGGSQQSALSKRLSPPSKWSYVKLVLWSGVATFIALILYVQHVMSSSVPASSLPVRLYLIFAPVVLLLLVGIVWRHNGSTYEQKYAQWNESFVCGRCGTVSKQALC
Ga0210394_1008630743300021420SoilMDLRCPKCNSNNLKKVSLAYQEGTYHVNTRGRMRGLLFAGGGPDILVGRTTTSGSQQSALSRRLSPPSKWSYVKLVLWSGVVTFIALILYVQHVMSSSVPASSLPVRLYLIFAPVVLLLLVGIVWRHNGSTYQQKYAQWNESFICGRCGTVSKQAVC
Ga0210384_1021480923300021432SoilMHRHRVVGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD
Ga0187846_1027431713300021476BiofilmMGLRCPNCTSTDVKKVSLAYEEGICHLNTRARILGLLLTDGGPNVLVGTATTRGSQQSSPSKRLCPPAKWSYVKPVLWLGVVTLITLVIYAQHVLSSPVPASSLPAKLYVAFAPVVLLLLVGIIWRHNHSTYRQDVAQWNESFVCGQCGAVSKQVLR
Ga0210398_1090516613300021477SoilDQMDLRCSKCHSNNLKKVSLAYQEGAYRMDSRSRIRGLLFAGGGPDILIARATTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYAVFAPLVLFLLVGIVWRHNHSAYELKYARWNESFICERCGTVSQQALR
Ga0210410_100000141213300021479SoilMDLRCPKCNSNNLKRVSLAYQEGTYQIDARSRMRGLLFAAGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYLKVVGWSGVVTLIALVLYVQHVMASPVPASSFPVKLYVVFAPLVLFVLLGIVWRHNHSAYEQKYARWNESFICERCGTVSQQALR
Ga0210410_1063747023300021479SoilRMHRHRVVGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLSPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD
Ga0210409_1103653023300021559SoilRSGAARGDKMDLRCPKCNSNNLKKVSLAYQEGTYHIDARSRMRGLLFAGGWPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFATLVLFLLVGIVWRHNHSAYEQKYARWNESYICERCGTVFEPF
Ga0242663_110663613300022523SoilMDLRFPKCKSNNLKRVSLAYQEGTHRIDTRSGIRGLLFAGGGPDILIGRATTHGSQQSVLSKRLSPPSKWSYMKLILWSGVATLIALVIYVQHVMSSTVPASSLPAKLYVLFAPFIFLFLVAIIWRHNHLTYQQK
Ga0242666_102786123300022721SoilMAQNGPAQEAGERRATKVESPGWMRIPGGIGQKGLKPHRSGVARSEEMDLRCPKCNSNNLKKVSLAYEEGTYHINTRGRMRGLLFAGGGPDILDGRTTTGGSQQSALSKRLSPPSKWSYVKLVLWSGVVTFIALILYVQHVMSSSVPASSLPVRLYLIFAPVVLLLLVGIVWRHNGSTYGQKYTQWNESFICGRCGTVSKQAVC
Ga0242657_115160513300022722SoilPAQEAGERRATKVESPGWMRIPCRIGQKGLKPHRSGVARSEEMDLRCPKCNSNNLKKVSLAYQEGTYHVNTRGRMRGLLFAGGGPDILVGRTTTSGSQQSALSRRLSPPSKWSYVKLVLWSGVVTFIALILYVQHVMSSSVPASSLPVRLYLIFAPVVLLLLVGIVWRHNGSTYEQKYAQWNESFVCGRCGTVSKQALC
Ga0242654_1025692013300022726SoilMFRHEMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGIRGLLFAGGGPGLLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCG
Ga0257181_105695013300026499SoilMDLRCPKCNSNSLKKVSLAYQEGTYHIDTRSRMRGLLFAGGGPAILVGRTTTRGTQQSALSKRLWPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSPVPASSLPVKLYVIFAPVDLLLLVGIVWRHNHSTYERKYARWNESFICERCGTV
Ga0257168_106304023300026514SoilMDLRCPKCNSNNLKGVSLAYQEGTYHIDARSRMRGLLFAAGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYLKLVGWSGVVTLIALVLYVQHVMASPVPTSSLPVKLYVVFAPLVLFLLVGIVWRHNHSAHEKKYARWNESYICERCGTVSQQALR
Ga0257158_109085113300026515SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKGLCPPSKWSYAKLVLWSGVMTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQV
Ga0207948_100858623300027174Forest SoilMPIALEQIRKDFRPHRSGAARGDKMDLRCPKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFAAGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYLKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSQQAL
Ga0209625_107904813300027635Forest SoilMDLRCPKCNSCNLKSVSFAYQEGTYDINARSRMRGLLFAGGGPDILVGRTTTRGTQQSALSKRLCPPTKWSYVKLVLWSGFVTLIALVTYVQHVMSSSVPASSLPVTLYVIFAPVVLLLLVGTVWRHNHSTYQQKYDQWNESF
Ga0209588_1001275103300027671Vadose Zone SoilMDLRCPKCNSNNLKKASLAYQEGTYHIDARSRIRGLLFAGGGPDILVARATTRGSQQSALSKRLSPPSKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVMLYVVFAPLVLSLLVGIVWRHNHSAYEQKYARWNESYVCERCGTVSQQALR
Ga0209588_125543513300027671Vadose Zone SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKNVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTS
Ga0209118_1000980143300027674Forest SoilMDLRCPKCNSNNLKKVSLAYQEGTYLIDTRSRMGGLLFAGGGPDIVVGRTATRGTHQSALSKRLKPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSPVPASSLPVKLYVIFAPAVLLLLVGIAWRHNHSTYEQKYARWNESFICERCGTVSQQALR
Ga0209118_101718623300027674Forest SoilMFRHEMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD
Ga0209011_101059933300027678Forest SoilMDLRCPKCDSNNLKRASLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLSLLVGIVWRHNHSAYEQKYARWNESYICERCGTVFEPF
Ga0209166_1002950133300027857Surface SoilMDLRCLKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTFIALLLYVQHVMASPVPASSLPVKLYVVFAPVVLSLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSQQALR
Ga0209283_1039560723300027875Vadose Zone SoilMDLRCPKCNSNNLKKVSLAYQEGTYHIDTRSRMRGLLFAGGGPGIVVGRTATRGSQQSALSKRLKPPSKWSYLKLVLWSGVVTLIGLVVYVQHVMSSPMPASSLPVKLYVVFAPVVLLLLLGIVWRHNHSTYEQKYALWNESFICE
Ga0209275_1009743823300027884SoilMDLRCPKCNSSNLKKVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTCGTQQSALSKRLCPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSRVPASSLPVKLYVIFAPVLLLLLVGIAWRHNHSTYDQKYAQWNESFICERCGTVSQQGLS
Ga0209275_1030026313300027884SoilRGDQMDLRCSKCHSNNLKKVSLAYQEGAYRMDSRSRIRGLLFAGGGPDILIARATTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYAVFAPLVLFLLVGIVWRHNHSAYELKYARWNESFICERCGTVSQQALR
Ga0137415_1139270113300028536Vadose Zone SoilDLRCPKCNSNNLKSVSFASQEGTYDINARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVLFLLVGIVWRHNHSAYEKKYARWNESYICERCGTVSQQALR
Ga0307482_104794913300030730Hardwood Forest SoilMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD
Ga0307482_124078413300030730Hardwood Forest SoilMDLRCPKCNSNNLKKVSLAHQQGSCHIDTRSKMRGLVFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYLKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVMFLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSQQAL
Ga0265770_103061413300030878SoilMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKGLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSVPVKLYAVFAPVVFLLLIGIVWRHNHSTYRLRYALWNTSIICERCGTVSQQVLD
Ga0265760_1005621623300031090SoilMDLRCPKCNSNNLKKVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKHLTPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKIYVVFAPLVLFLLVGIVWRHNHSAYEQKYARWNESYICERCGSVSKQALRSKP
Ga0170823_1214716213300031128Forest SoilMFRHEMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFVGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQSAAE
Ga0170824_12741482123300031231Forest SoilMFRHEMAQAIRKDASPQSSGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKGLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLD
Ga0307476_1001335223300031715Hardwood Forest SoilMFRHEMAQAQAIRKDASPQSRGTARGDKMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLG
Ga0307474_1000228363300031718Hardwood Forest SoilMFRHEMAQAQAIRKDASPQSRGTARGDEMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGVRGLLFAGGGPGVLVGRATTRGSQQSALSKRLCPPSKWSYAKLVLWSGVVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFGPVVFLLLIGIVWRHNHSTYRRRYALWNTSIICERCGTVSQQVLG
Ga0307477_1000364653300031753Hardwood Forest SoilMDLRCPKCNSNNLKKVSLAHQQGSYHIDTRSKMRGLVFAGGGPGILVGGTSTRGSQQSALSERLSPPAKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPTSSLPVKLYAVFAPVVLLLVLGIVWRHNHSTYHEKYAQWNESFICERCGTVSQQALH
Ga0307477_1032627223300031753Hardwood Forest SoilVEPPATKVGSPGWVRIAVGVGQKGLKPHRCGAARGDKMDLRCPKCNSSNLKKVSLAYQEGTYHIDTRSRVRGLLFAGGGPDILVGRTTTRGTQQSALSKRLCPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSRVPASSLPVKLYVIFAPVLLLLLVGIVWRHNHSTYDQKYAQWNESFICERCGTVSQQGLS
Ga0307477_1037569313300031753Hardwood Forest SoilMDLRCPKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFAGGGPGILVGRTSTRGSQQSALSKRLSPPTKWSYVKLVGWSGVVTLIALVLYVQHVMASPVPASSLPVKLYVVFAPLVMFLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSQQALH
Ga0307477_1041410923300031753Hardwood Forest SoilMDLRCPKCNSNSLKKVSLAYQEGTYHIDTRSRMRGLLFAGGGPAILVGRTNTRGTQQSGLSKLLWPPTKWSYVKLVQWSGVVTLIALVIYVQHVISSPVPASSLPVKLYVIFAPAVLLLLVGIVWRHNHSTYERKYARWNESFICERCGTV
Ga0307475_1149822013300031754Hardwood Forest SoilMDLRCPKCNSNNLKKVSLAYQEGTYRIDTRSRIRGLLFAGGGPDILVGRTTTRGTQQSALSKRLCPPTKWSYVKLVLWSGVVTLIALVIYVQHVMSSPVPASSLPVKLYVIFAPVLLLLLVGIVWRHNHSTYDRKY
Ga0307478_1134418623300031823Hardwood Forest SoilMDLRCPNCESTNLKKVSLAYQEGVYHIDTRTGIRGLLFAGGGPGLLVGRATTRGSQQSALSKSLCPPSKWSYAKLVLWSGAVTLLALFLYAQHVMSSPPPVSSLPVKLYAVFAPVVFLLLIGIVWRHNHSTYEQKYVQWNESFICERCGTVSQQALR
Ga0335081_1050451823300032892SoilMDLRCPKCNSTELKKVSLAYQEGTYHIDTRSRIRGLLFAGGGADILVGRTTTRGTQQSALSKRLCPPTKWSYVKLVVWSGVVTLIALVIYVQHVMSSPAPASSLPVKLYVIFAPVVLLLLVGTVWRHNHSTYVQKYALWNESFICQRCGTVGQQALG
Ga0335072_1155902213300032898SoilLRCPKCNSNNLKRVSLAYQEGTYHIDARSRMRGLLFAAGGPGILVGRTNTRGSQQSALSKRLSPPTKWSYLKLVGSSGVVTLIALVLYVQHVMASPVPASSLPVKLYGVFAPLVLCLLVGIVWRHNHSAYEQKYARWNESYICERCGTVSQQSLG
Ga0326728_1000328043300033402Peat SoilMDLRCPKCNSNNLKKVSLACQEGTYHIDARSRMRGLLFAGGGLGIFVGRTTTRGTQQSAFSKRLNPPSKWSYLNLVLWSGAVTLIALVLYVQHVMSSPAPASSLPVKLYAVFTPVLFLLLLRVVWRHNHATYHEKYAQWNKSFVCERCGSVSQQSGG




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice