NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F085496



Overview

Basic Information
Family ID F085496
Family Type Metagenome
Number of Sequences 111
Average Sequence Length 216 residues
Representative Sequence MRRNRHLRLSLFTVPELSIGRAPLTPQNSQFDSRRLKYLEGMVAYLNGLFENLRKSYSKSEALSRLEKFLAQLPHSELVKIDTSGEPRVTEIPSARDRIVFSEDRLRINFLDGLHRRTESPGIPAGRYSPQVSHIISKLSQGVSEKALSRILEQCDVNLSPVIEGLRSLELIEEIDPSAQIVSQSLLEGKKDRLTWLGHACILFQTARSSVCVDPFLRPHIKWTEKDLKSSFSDTFGDR
Number of Associated Samples 81
Number of Associated Scaffolds 111

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 33.94 %
% of genes near scaffold ends (potentially truncated) 94.59 %
% of genes from short scaffolds (< 2000 bps) 88.29 %
Associated GOLD sequencing projects 70
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.46


Most Common Taxonomy
Group Bacteria (98.198 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(33.333 % of family members)
Environment Ontology (ENVO) Unclassified
(61.261 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(67.568 % of family members)
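
The percentages above are fractions of the 111 family sequences, so each can be converted back to a whole number of members. The following is a minimal sketch of that arithmetic; the function name `members_from_percent` and the dictionary labels are illustrative and not part of NMPFamsDB.

```python
def members_from_percent(percent: float, family_size: int = 111) -> int:
    """Convert a reported percentage of family members to a whole count."""
    # Reported values are rounded to three decimals, so round to the nearest integer.
    return round(percent * family_size / 100)


# Percentages as reported on this page (labels are shorthand, not database fields).
reported = {
    "Taxonomy: Bacteria": 98.198,
    "GOLD: Grasslands soil": 33.333,
    "ENVO: Unclassified": 61.261,
    "EMPO: Soil (non-saline)": 67.568,
}

for label, pct in reported.items():
    print(f"{label}: {members_from_percent(pct)} of 111 members")
```

Under this reading, 98.198 % corresponds to 109 of the 111 members and 33.333 % to 37 of them.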




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 32.96%, β-sheet: 12.36%, Coil/Unstructured: 54.68%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.46

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
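
The confidence rule stated above (pTM < 0.7 means a low confidence model) and the four pLDDT bins from the viewer legend can be sketched as two small helpers. This is only an illustration under stated assumptions: the function names are hypothetical, and the handling of scores that fall between the integer bin edges (for example 50.5) is an assumption, since the page does not specify it.

```python
PTM_THRESHOLD = 0.7  # threshold quoted on this page for a low confidence model


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the threshold."""
    return ptm < threshold


def plddt_bin(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to one of the four legend bins.

    Assumption: scores between the integer bin edges go to the higher bin.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score <= 50:
        return "0-50"
    if score <= 70:
        return "51-70"
    if score <= 90:
        return "71-90"
    return "91-100"


print(is_low_confidence(0.46))  # pTM-score reported for family F085496
print(plddt_bin(65.0))
```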



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 111 Family Scaffolds
PF00589 | Phage_integrase | 3.60
PF02371 | Transposase_20 | 3.60
PF01725 | Ham1p_like | 2.70
PF00528 | BPD_transp_1 | 2.70
PF06187 | DUF993 | 1.80
PF16576 | HlyD_D23 | 1.80
PF01266 | DAO | 0.90
PF14384 | BrnA_antitoxin | 0.90
PF13183 | Fer4_8 | 0.90
PF00378 | ECH_1 | 0.90
PF12706 | Lactamase_B_2 | 0.90

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 111 Family Scaffolds
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 3.60
COG0127 | Inosine/xanthosine triphosphate pyrophosphatase, all-alpha NTP-PPase family | Nucleotide transport and metabolism [F] | 2.70



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 98.20 %
Unclassified | root | N/A | 1.80 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300005171|Ga0066677_10420656 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Sinobacteraceae → Nevskia → Nevskia ramosa | 764
3300005175|Ga0066673_10771705 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 551
3300005177|Ga0066690_10651904 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → Streptomyces tsukubensis | 701
3300005178|Ga0066688_10526102 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Sinobacteraceae → Nevskia → Nevskia ramosa | 760
3300005180|Ga0066685_10353847 | All Organisms → cellular organisms → Bacteria | 1019
3300005180|Ga0066685_10401811 | All Organisms → cellular organisms → Bacteria | 952
3300005181|Ga0066678_10090426 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1834
3300005181|Ga0066678_10397767 | All Organisms → cellular organisms → Bacteria | 913
3300005187|Ga0066675_10401427 | All Organisms → cellular organisms → Bacteria | 1012
3300005218|Ga0068996_10069437 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → unclassified Sphingomonadaceae → Sphingomonas-like bacterium B12 | 735
3300005332|Ga0066388_101970141 | All Organisms → cellular organisms → Bacteria | 1046
3300005332|Ga0066388_103297343 | All Organisms → cellular organisms → Bacteria | 825
3300005446|Ga0066686_10107149 | All Organisms → cellular organisms → Bacteria | 1807
3300005446|Ga0066686_11047168 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Clostridium | 527
3300005447|Ga0066689_10090718 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1734
3300005450|Ga0066682_10055799 | All Organisms → cellular organisms → Bacteria | 2407
3300005450|Ga0066682_10212573 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1241
3300005450|Ga0066682_10463296 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 805
3300005471|Ga0070698_100147913 | All Organisms → cellular organisms → Bacteria | 2297
3300005535|Ga0070684_100275622 | All Organisms → cellular organisms → Bacteria | 1540
3300005546|Ga0070696_101476146 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Xanthomonadaceae → Lysobacter → Lysobacter antibioticus | 581
3300005553|Ga0066695_10097814 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1796
3300005556|Ga0066707_10081798 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1949
3300005557|Ga0066704_10220588 | All Organisms → cellular organisms → Bacteria | 1285
3300005557|Ga0066704_10473238 | All Organisms → cellular organisms → Bacteria | 827
3300005558|Ga0066698_10310195 | All Organisms → cellular organisms → Bacteria | 1092
3300005558|Ga0066698_10569366 | All Organisms → cellular organisms → Bacteria | 767
3300005558|Ga0066698_10748174 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Bacillaceae → Bacillus → unclassified Bacillus (in: Bacteria) → Bacillus sp. BT1B_CT2 | 639
3300005559|Ga0066700_10110054 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1818
3300005574|Ga0066694_10161676 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1063
3300005586|Ga0066691_10808538 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Pandoraea → Pandoraea pnomenusa | 552
3300005598|Ga0066706_10339955 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1188
3300006031|Ga0066651_10199411 | All Organisms → cellular organisms → Bacteria | 1059
3300006046|Ga0066652_100152975 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1937
3300006791|Ga0066653_10154046 | All Organisms → cellular organisms → Bacteria | 1122
3300006791|Ga0066653_10262215 | All Organisms → cellular organisms → Bacteria | 879
3300006791|Ga0066653_10545874 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Sinobacteraceae → Nevskia → Nevskia ramosa | 586
3300007255|Ga0099791_10244959 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 849
3300009012|Ga0066710_100917043 | All Organisms → cellular organisms → Bacteria | 1349
3300009012|Ga0066710_103041461 | All Organisms → cellular organisms → Bacteria | 651
3300009012|Ga0066710_103201218 | All Organisms → cellular organisms → Bacteria | 629
3300009012|Ga0066710_104216448 | All Organisms → cellular organisms → Bacteria | 537
3300009137|Ga0066709_100314758 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 2133
3300009137|Ga0066709_100357238 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 2010
3300009137|Ga0066709_103175491 | All Organisms → cellular organisms → Bacteria | 600
3300009795|Ga0105059_1033446 | All Organisms → cellular organisms → Bacteria | 624
3300009812|Ga0105067_1067249 | All Organisms → cellular organisms → Bacteria | 598
3300010046|Ga0126384_10877141 | All Organisms → cellular organisms → Bacteria | 809
3300010047|Ga0126382_10106445 | All Organisms → cellular organisms → Bacteria | 1825
3300010301|Ga0134070_10226959 | All Organisms → cellular organisms → Bacteria | 692
3300010303|Ga0134082_10102434 | All Organisms → cellular organisms → Bacteria | 1134
3300010320|Ga0134109_10164191 | All Organisms → cellular organisms → Bacteria | 805
3300010320|Ga0134109_10195144 | All Organisms → cellular organisms → Bacteria | 745
3300010329|Ga0134111_10460235 | All Organisms → cellular organisms → Bacteria | 553
3300010333|Ga0134080_10247254 | All Organisms → cellular organisms → Bacteria | 786
3300010336|Ga0134071_10121230 | All Organisms → cellular organisms → Bacteria | 1256
3300010336|Ga0134071_10139894 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1172
3300010337|Ga0134062_10388226 | All Organisms → cellular organisms → Bacteria | 680
3300010400|Ga0134122_10595987 | All Organisms → cellular organisms → Bacteria | 1020
3300012199|Ga0137383_10285237 | All Organisms → cellular organisms → Bacteria | 1209
3300012199|Ga0137383_11030041 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 599
3300012202|Ga0137363_11292982 | All Organisms → cellular organisms → Bacteria | 618
3300012206|Ga0137380_10497584 | All Organisms → cellular organisms → Bacteria | 1074
3300012356|Ga0137371_10325645 | All Organisms → cellular organisms → Bacteria | 1195
3300012356|Ga0137371_11082179 | All Organisms → cellular organisms → Bacteria | 604
3300012357|Ga0137384_10492025 | All Organisms → cellular organisms → Bacteria | 1007
3300012582|Ga0137358_11012081 | All Organisms → cellular organisms → Bacteria | 537
3300012685|Ga0137397_11183335 | All Organisms → cellular organisms → Bacteria | 552
3300012922|Ga0137394_10019676 | All Organisms → cellular organisms → Bacteria | 5396
3300012922|Ga0137394_11348842 | All Organisms → cellular organisms → Bacteria | 575
3300012929|Ga0137404_10016094 | All Organisms → cellular organisms → Bacteria | 5301
3300012929|Ga0137404_10110324 | All Organisms → cellular organisms → Bacteria | 2233
3300012929|Ga0137404_10448938 | All Organisms → cellular organisms → Bacteria | 1144
3300012930|Ga0137407_10378457 | All Organisms → cellular organisms → Bacteria | 1308
3300012975|Ga0134110_10302111 | All Organisms → cellular organisms → Bacteria | 691
3300012976|Ga0134076_10097714 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1161
3300014154|Ga0134075_10037304 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1988
3300014154|Ga0134075_10223465 | All Organisms → cellular organisms → Bacteria | 812
3300014157|Ga0134078_10064310 | All Organisms → cellular organisms → Bacteria | 1302
3300015053|Ga0137405_1369002 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1710
3300015264|Ga0137403_10006124 | All Organisms → cellular organisms → Bacteria | 13814
3300015358|Ga0134089_10020536 | All Organisms → cellular organisms → Bacteria | 2253
3300015358|Ga0134089_10070405 | All Organisms → cellular organisms → Bacteria | 1305
3300015358|Ga0134089_10233483 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 748
3300015374|Ga0132255_104437978 | All Organisms → cellular organisms → Bacteria | 595
3300017657|Ga0134074_1160092 | All Organisms → cellular organisms → Bacteria | 789
3300017657|Ga0134074_1316912 | All Organisms → cellular organisms → Bacteria | 570
3300017659|Ga0134083_10211164 | All Organisms → cellular organisms → Bacteria | 803
3300018071|Ga0184618_10488710 | All Organisms → cellular organisms → Bacteria | 516
3300018079|Ga0184627_10271593 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 891
3300018084|Ga0184629_10065235 | All Organisms → cellular organisms → Bacteria | 1704
3300018431|Ga0066655_10446558 | All Organisms → cellular organisms → Bacteria | 854
3300018433|Ga0066667_10052127 | All Organisms → cellular organisms → Bacteria | 2477
3300018433|Ga0066667_10427994 | All Organisms → cellular organisms → Bacteria | 1076
3300021080|Ga0210382_10222661 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 822
3300025985|Ga0210117_1056811 | All Organisms → cellular organisms → Bacteria | 659
3300026306|Ga0209468_1185595 | All Organisms → cellular organisms → Bacteria | 526
3300026329|Ga0209375_1027395 | All Organisms → cellular organisms → Bacteria | 3136
3300026329|Ga0209375_1227304 | All Organisms → cellular organisms → Bacteria | 656
3300026332|Ga0209803_1216717 | All Organisms → cellular organisms → Bacteria | 683
3300026334|Ga0209377_1094082 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1255
3300026528|Ga0209378_1053203 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1952
3300026536|Ga0209058_1169371 | All Organisms → cellular organisms → Bacteria | 984
3300027068|Ga0209898_1055461 | All Organisms → cellular organisms → Bacteria | 520
3300027655|Ga0209388_1088911 | All Organisms → cellular organisms → Bacteria | 887
3300027961|Ga0209853_1120505 | All Organisms → cellular organisms → Bacteria | 653
3300031740|Ga0307468_100715621 | All Organisms → cellular organisms → Bacteria | 839
3300032180|Ga0307471_103153139 | All Organisms → cellular organisms → Bacteria | 585
3300032256|Ga0315271_10875671 | All Organisms → cellular organisms → Bacteria | 774
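
Each scaffold identifier in the listing above joins a numeric IMG/M taxon OID (the same number used in the Associated Samples listing) and a scaffold name with a `|` character. A minimal sketch of splitting such an identifier follows; the `ScaffoldRef` type and `parse_scaffold` function are hypothetical helpers, not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # sample identifier, e.g. "3300005171"
    scaffold_id: str  # scaffold name within that sample, e.g. "Ga0066677_10420656"


def parse_scaffold(identifier: str) -> ScaffoldRef:
    """Split a 'taxon_oid|scaffold_id' string into its two parts."""
    taxon_oid, sep, scaffold_id = identifier.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"not a 'taxon_oid|scaffold_id' identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold("3300005171|Ga0066677_10420656")
print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the taxon OID as a string avoids any accidental reformatting of the identifier when it is later matched against sample records.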




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 33.33%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 18.92%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 17.12%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 9.01%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 3.60%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 2.70%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.80%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.80%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.80%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.80%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.80%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.80%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.90%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.90%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.90%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.90%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.90%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005218 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqC_D2 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005535 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.2-3L metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005574 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006031 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100 | Environmental
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009795 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 | Environmental
3300009812 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental
3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental
3300010320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015 | Environmental
3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental
3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental
3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300015053 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental
3300018084 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental
3300025985 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqC_D2 (SPAdes) | Environmental
3300026306 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes) | Environmental
3300026329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes) | Environmental
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026528 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes) | Environmental
3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental
3300027068 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 (SPAdes) | Environmental
3300027655 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes) | Environmental
3300027961 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes) | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032256 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_top | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0066677_104206561 | 3300005171 | Soil | QLRLSLFTVPDLSIGRSPLTPQNSQIDPRRVRYLEGLIAYLSSVFEDLKKRYTRSDALSRLDKLLAQVPYSELVKINANGGPLVSEIPSARDRIVFNQERVKITFLDGLHRRTKSPAIPAGRYAPQVSRIVSMLSHGVSEEALARILRAGDVGLGSVIKSLRDLRVIEEVDPSVPIVPHGLVEGTKDRLTWLGHACLLFQTSRASVCVDPFLRPHIKWSEKDLKACFSDSFGERLFFEPYGPHLTQLSPAQLPP
Ga0066673_107717051 | 3300005175 | Soil | PELSIGRAPLTPQNSQFDPRRLKYVEGMVTYLNDLFEELQKGYSQSEALSRLEHVLAQLPYSELVKIDTAGEPRVTEIPNARDRIVFNEDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIVSKLSHGLSEKALSRILAQCDVNLSPVVEALRNLELIEEVDPSAQIVSQDLLEGNEDRLTWL
Ga0066690_106519041 | 3300005177 | Soil | KHTHQLRLSLFTVPDLSIGRSPLAPQNSQVDPRRVQYLEGLIAYLGSLFENLEKRYPRAEALSRLEKVLAQVPYGELVTINAYGGPLVSEIPSARDRIVFNQERLKITLLDGLHRRTGSLAIPAGRYAPEVSRIVSKLSRGISEEALARMLRDCNVGLSSVIKGLHDLHLIEDIDPSVPIVPQGLVEGSKDRLTWLGHACVLFQTPRSSVCVDPFLRPHIKWSERDVQSCFSD
Ga0066688_105261021 | 3300005178 | Soil | VKDSRQLRLSLFTVPDLSIGRSPLTPQNSQIDPRRVRYLEGLIAYLSSVFEDLQKRYARSDALSRLEKLLAQVPYSELVKINANGGPLVSEIPSARDRIVFNQERLKITFLDGLHRRTESPAIPAGRYAPRVSRIVSMLSHGVSEEALARILRAGDVGLGSVIKSLRDLRVIEEVEPSVPIVPHGLLEGTKDRLTWLGHACLLFQTSRASVCVDPFLRPHIKWSE
Ga0066685_103538472 | 3300005180 | Soil | MKRNRHLRLSLFTVPELSIGRAPLTPQNSQFDPRRLKYLEGMVTYLNDLFEELQKGYSQSEALSRLEHLLAQLPYSELVKIDTAGEPRVTEIPNARDRIVFNKDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSQGVSEKALSRILGQCDVNLSPVVEALRNLELIEEVDPSTQIVSQNLLEGNEDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTERDLQSSFSDTFGDRFFFEPYGPQLTQLSPAQLPPLDAVFVTHQD
Ga0066685_104018112 | 3300005180 | Soil | MKGDKQLRLSLFTVPDLSIGRAPLTPQNSQFDSRRVGYLEGMVAYLQSLFENLRKRHSRSEALSRLERLLVQLPYSELVKINPEGTPLVSAIPSARDRIVFDQERLRITFLDGLHRRSESPAIPVGRHSPEVSHIISKLSRGISEKALARILRECDVDLSSAIEGLRDLQLIEEIDPSVPIVPRSLVAGNQDRLTWLGHAGILFQTSRSSVCVDPFLRPHVKWTEEEVKSCFSDSFGDRRFFEPYGPRLT
Ga0066678_100904261 | 3300005181 | Soil | MVDYLNGLLENLRKSHSQSEALFHLEKFLAQLPCSELVKIDTSGEPRVTEIPSARDRIVFTKDRLRINFLDGLHRRTESPGIPAGRYSPEVSHIISKLSQGVSEKALSRLLEQCDVDLSPVIDGLRNLELIEEIDPSAQIVSQSLLEGKKDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTE
Ga0066678_103977671 | 3300005181 | Soil | MKRNRHLRLSLFTVPELSIGRAPLTPQNSQFDPRRLKYLEGMVAYLNGLFEDLRKDCSQSEALSRLEKLLTPLPHSELVKIDASGEPRVTEIPSARDRIAFNKDRLRINLLDGLHRRTESPGIPAGRYSPQVSHIVSKLSQGISEQALSRILGQCDVNLSPAIEGLRNLELIEEIDPSEQIVPQSLLEGKNDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTTEDLTSS
Ga0066675_104014272 | 3300005187 | Soil | MRRNRHLRLSLFTVPELSIGRAPLTPQNSQFDSRRLKYLEGMVAYLNGLFENLRKSYSKSEALSRLEKFLAQLPHSELVKIDTSGEPRVTEIPSARDRIVFSEDRLRINFLDGLHRRTESPGIPAGRYSPQVSHIISKLSQGVSEKALSRILEQCDVNLSPVIEGLRSLELIEEIDPSAQIVCQSLLEGKKDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTERDLKSSFSDTFGDRFFFEPYGPQLTQLSPAQLPPLD
Ga0068996_100694371 | 3300005218 | Natural And Restored Wetlands | SNRHLRLSLFTVSEVSIGRAPLTPQNSQFDPRRVQYVEGMVSYLTGLFESLRKSHSRPEAISRLEKLLKQLPYSELMKIDPSGEPCVAEIPSARDRIGFNKDRLRINFLDGFHRRSESPGIPAGRHTPVVSHIISKLSHGLSEKELSRILKKCAANLSPAIESLRNLELIEEIDPSMPIVPQRLLEGEKDRLTWLGHACVLFQTSRSSVCVDPFLRPHIKWTDMDLASSFSDAFGDRFFFEPYG
Ga0066388_1019701411 | 3300005332 | Tropical Forest Soil | MKRKRHLRLSLFTVPELSVGRAPLTPQNSQSDPKRVKYLEGMVVYLNDLFDDLRKNYSEAEALSRLGKILAQLPYSELVQIDPSGNPKVTEIPGARDRIGFNKDRLRINFLDGLHRRAESPGIPAGQYTPEFSHIISRLSQGLSEDALSRILAKCDVDLSPVIEGLRKLQLIEEVDHADQIVLQSLLEGKKDRLTWLGHAGILFQTSRTSICVDP
Ga0066388_1032973431 | 3300005332 | Tropical Forest Soil | MKTSRSLRLSLFTVPELSIGRAPLTPENSAFDPRRLQYLEGMVAYLNGEFESLRKVCSHSEALSSLEKLLAPVPHSELVKIHTSGHPHATEIPSARDRIAFAHDRLRINFLDGLHRRTESPGIPAGRHSPQISHIISKLAQGVSEQALSRILGQCDVNLSPVIEGLRSLELIEEVDRSAQVVPPSLLDGKQDRLTWLGHACMLFQTARSAICVDPFLRP
Ga0066686_101071491 | 3300005446 | Soil | MERNRHLRLSLFTVPDLSIGRAPLTPQNSQVDPKRLEYLEGMVAYLNSLSKDLRKRHSRSEALSRLDKVLLQLPYSELVKINTSGKPLVTEIPGARDRIVFNQERLRITFLDGLHRRPESPTIPAGRYSPEVSQIVSKLSRGISEESLAHILGECEVDLSPAFKSLRDLQLIEEIDPSAQIVPQSLLEGKKDRLTWLGHACVLFQTSRSS
Ga0066686_110471681 | 3300005446 | Soil | RHSREEVRSRLEKVLPQLPYSELVKIATNGKLAVTEIPSARDRIVFNRDRLRIAFLDGLHRRPESPTIHAGRFSPAVSHVVSTLSRGVPEESLARILGKCGVGLAPAIKSLRDLQLIDESDPSAQIVPQSLLDGKKDRLTWLGHACMLFQTSRSSVCVDPFLRPHIKWTEKDLKS
Ga0066689_1009071813300005447SoilMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICVDPFL
Ga0066682_1005579953300005450SoilVKHTHQLRLSLFTVPDLSIGRSPLAPQNSQVDPRRVQYLEGLIAYLGSLFENLEKRYPRAEALSRLEKVLAQVPYGELVTINADGGPLVAEIPSARDRIVFNQERLKITLLDGLHRRTGSLAIPAGRYAPEVSRIVSKLSRGISEEALARMLRDCNVGLSSVIKGLHDLQLIEDIDPSVPIVPQGLVEGSKDRLTWLGH
Ga0066682_1021257313300005450SoilMVAYLNGLFENLRKSYSRSETLSRLEKFLAQLPYSELVKIDTSGEPSVSEIPSARDRIAFNKDRLRINFLDGLHRRSESPGIPAGRHTPVVSQIISKLSHGLSEKELSRILGKCKANLSPAIEGLRIRQLIEGIDPSVQIVSQGLLEGKKDRLTWLGHACVLFQTSRSSVCVDPFLRPHIKWTEKDLGSSCSDTFGDRFSLNHTALSWPSCHQHNSRPWMLSL*
Ga0066682_1046329623300005450SoilMRRNRHLRLSLFTVPELSIGRAPLTPQNSQFDSRRLKYLEGMVAYLNGLFENLRKSYSQSEALSRLEKFLAQLPHSELVKIDTSGEPRVTEIPSARDRIVFSEDRLRINFLDGLHRRTESPGIPAGRYSPQVSHIISKLSQGVSEKALSRILEQCDVNLSPVIEGLRSLELIEEIDPSAQIVSQSLLEGKKDRLTWLGHAC
Ga0070698_10014791333300005471Corn, Switchgrass And Miscanthus RhizosphereMKRNRHLRLSLFTAPELSIGRAPLTPQNSQFDPRRLKYLEGMVAYLNGLFEDLRKDCSQSEALSRLEKLLTPLPHSELVKIDASGEPRVTEIPSARDRIAFNKDRLRINLLDGLHRRTESPGIPAGRYSPQVSHIVSKLSQGISEQALSRILGQCDVNLSPAIEGLRNLELIEEIDPSEQIVPQSLLEGKNDRLTWLGHACVLFQTSRSSVCVDPFLRPHIKWTTED*
Ga0070684_10027562213300005535Corn RhizosphereMKSKRHLRLSLFTVPELSIGRAPLTPQNSQSDPKRLKYLEGMVVYLNGLFEDLRKNYSEAEALSRLGKILAQLPYSELVQIDPSGGAKVTEIPSARDRIGFNKDRLRINFLDGLHRRAESPGIPAGQYTPECSHIISKLSQGLSEDGLSRILAKCDVDLSPVIEGLRKLQLIEEVDQADQIVPQSLLEGKKDRLTWLGHAGILFQTSRTSICVDPYLRPHIKW
Ga0070696_10147614613300005546Corn, Switchgrass And Miscanthus RhizosphereQNSQFDSRRLEYLEGMVAYLNGVFENLRQRHPEPEALSRLEKVSLRLPYSRLVKINTGGEPPLVTEAPGARDRIVFNHEHLKITFLDGLHRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRSSICVDPFLR
Ga0066695_1009781413300005553SoilMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICVDPFLRP
Ga0066692_1092755413300005555SoilPELSIGRAPLTPQNSQFDPRRLKYLEGMVAYLNGLFEDLRKDCSQSEALSRLEKLLTPLPHSELVKIDASGEPRVTEIPSARDRIAFNKDRLRINLLDGLHRRTESPGIPAGRYSPQVSHIVSKLSQGISEQALSRILGQCDVNLSPAIEGLRNLELIEEIDPSEQIVPQSLLEGKN
Ga0066707_1008179813300005556SoilMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICVDPFLRPHVKWNEKDLKSSFSDSFGDSLFFEPYGPELIQLSPAQLPPLDAVF
Ga0066704_1022058823300005557SoilMKGNRHLRLSLFTVPELSIGRAPLTPQNSQFDPRRLKYLEGMVTYLNDRFEELQKGYSQSEALSRLEHLLARLPYSELVKIDTAGEPRVTEIPNARDRIVFNKDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSQGVSEKALSRILGQCDVNLSPVVEALRNLELIEEVDPSAQIVSQNLLEGNEDRLTWLGHACVLFQTSRSSVCVDPFLRPHIKWTE
Ga0066704_1047323813300005557SoilMKTNKQLRLSLFTVPDLSIGRAPLTPQNSEFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVAETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICV
Ga0066698_1031019523300005558SoilMERNRHLRLSLFTVPDLSIGRAPLTPQNSQVDPKRLEYLEGMVAYLNSLSKDLRKRHSRSEALSRLDKVLLQLPYSELVKINTSGKPLVTEIPGARDRIVFNQERLRITFLDGLHRRPESPTIPAGRYSPEVSHIVSKLSRGISEESLAHILGECEVDLSPAFKSLRDLQLIEEIDPSAQIVPQSLLEGKKDRLTWLGHACVLFQTSRSSVCVDPYLRPHIKWTEKELKSSFSNSFGDRLFFEPYGPQLTQLSPAQLPPLDAVF
Ga0066698_1056936613300005558SoilMRRNRHLRLSLFTVPELSIGRAPLTPQNSQFDSRRLKYLEGMVAYLNGLFENLRKSYSKSEALSRLEKFLAQLPHSELVKIDTSGEPRVTEIPSARDRIVFSEDRLRINFLDGLHRRTESPGIPAGRYSPQVSHIISKLSQGVSEKALSRILEQCDVNLSPVIEGLRSLELIEEIDPSAQIVSQSLLEGKKDRLTWLGHACILFQTARSSVCVDPFLRPHIKWTEKDLKSSFSDTFGDR
Ga0066698_1074817413300005558SoilGRAPLTPQNSQFDSRRVGYLEGMVAYLQSLFENLRKRHSRSEALSRLERLLVQLPYSELVKINPEGTPLVSAIPSARDRIVFDQERLRITFLDGLHRRSESPAIPPGPYSPEVSHIISKLSHGISETALARILRACDVDLSPVIESLRDLHLIEEIDPSVSIVPRGLVEGSQDRLTWLGHAGLLFQTSRSSVCVDPFLRPHIKWREGELKSC
Ga0066700_1011005413300005559SoilMKINRQLRLSLFTVPEVSIGRAPLTPQNSQFDPRRLQYLEGMVAYHNGLFENLRKNYSRPEALSRFEKLLAQLPYSELVKTDTSGEPSVAEIPSARDRIAFNKDRLRINFLDGLHRRSESPGIPAGRHTPVVSQIISKLSQGLSEKELSRILGKCEANLSPAIEGLRSRQLIEEIDPSVQIVSQGLFEGGKRIASLGWDTPAFSFKPHDHRFALIPFFDRI*
Ga0066694_1016167623300005574SoilMKRDRHLRLSLFTVPELSIGRAPLTPQNSQFDPRRLKYVEGMVTYLNDLFEELQKGYSQSEALSRLEHVLAQLPYSELVKIDTAGEPRVTEIPNARDRIVFNEDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIVSKLSHGLSEKALSRILAQCDVNLSPVVEALRNLELIEEVDPSAQIVSQNLLEGNEDRLTWLGHACIL
Ga0066691_1080853813300005586SoilDSRRLKYLEGMVAYLNGLFENLRKSYSQSDALSRLEKFLAQLPHSELVKIDTSGEPRVTEIPSARDRIVFSEDRLRINFLDGLHRRTESPGIPAGRYSPQVSHIISKLSQGISEKALSRILGQCDVNLSPVVEALRNLELIEEVDQSAQIVSQSLLEGNEDRLTWLGHACILFQTSRSSVCVD
Ga0066706_1033995533300005598SoilMVAYLNGLFENLRKSYSKSEALSRLEKFLAQLPHSELVKIDTSGEPRVTEIPSARDRIVFSEDRLRINFLDGLHRRTESPGIPAGRYSPQVSHIISKLSQGVSEKALSRILEQCDVNLSPVIEGLRSLELIEEIDPSAQIVSQSLLEGKKDRLTWLGHACILFQ
Ga0066651_1019941113300006031SoilMNGDKQLRLSLFAVPDLSIGRAPLTPQNCQFDARRVQYLNGMVAYLESVFTDLRRHHSRSDALSTLQKVLARLPYSELVQIDHDGNPVVSAIPGARDRIVFDHDRLRVAILDGLNRRSESPMIPVGRDSPELAHVISKLSRGISAKDLARALRTGTVDISPAITALRDLHLVEEVDPSVSTVPQALSAGHGDRLTWLGHAALLFQTSRASICVDPFLRPHIKWTEEEKKTCFSDSFADSQLFEPYGPGLTQMSPAQL
Ga0066652_10015297513300006046SoilVKDSRQLRLSLFTVPDLSIGRSPLTPQNSQIDPRRVRYLEGLIAYLSSVFEDLQKRYTRSDALSRLDKLLAQVPYSELVKINANGGPLVSEIPSARDRIVFNQERVKITFLDGLHRRRESPAIPAGRYAPQVSRIVSMLSHGVSEEALARILRAGDVGLGSVIKSLRDLRVIEEVDPSVPIVPHGLVEGTKDRLTWLGHACLLFQTSRASVCVDPFLRPHIKWSEKDLKACFSDSFGERLFFEPYGPHLTQLSPAQLPPLDAVFV
Ga0066653_1015404613300006791SoilMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICVDPFLRPHVKWTEKDLKSSFSDS
Ga0066653_1026221513300006791SoilVKRTHQRRLSLFTVPDLSIGRSPLAPQNSQVDPRRVQYLEGLIAYLGSLFENLEKRYPRAEALSRLEKVLAQVPYGELVTINADGGPLVSEIPSARDRIVFDQERLKITLLDGLHRRTESLAIPAGRYAPEVSRIVSKLSRGISEDALARLLRDCNVGLSSVIKGLHDLHIIEDIDPSVPIVPQGLVEGSKDRLTWLGHACVLFQTPRSSVCVDPFLRPHIKWSEQDVQSCFSDSFGESVFFEPYGAQLTQLSPAQLPPLDAVFVTHQ
Ga0066653_1054587413300006791SoilYLEGLIADLSSVFEDLQNRYTRSDALSRLDKLLAQVPYSELVKINANGGPLVSEIPSARDRIVFNQERVKITFLDGLHRRRESPAIPAGRYAPQVSRIVSMLSHGVSEEALARILRAGDVGLGSVIKSLRDLRVIEEVDPSVPIVPHGLVEGTKDRLTWLGHACLLFQTSRASVCVDPFLRPHIKWSEKDLKAC
Ga0099791_1024495913300007255Vadose Zone SoilLKYLEGMVAYLNDLFEELQKGYSQSEALSRLENLLAQLPYSELVKIDTAGEPRVTEIPNARDRIVFNKDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSQGVSEKALSRILGQCDVNLSPVVEALRNLELIEEVDQSAQIVSQSLLEGNEDRLTWLGHACILFQTSRSS
Ga0066710_10091704323300009012Grasslands SoilMCRRSEKANKQLRLSLFTVPDLSIGLAPLTPQNSQFDRRRLDYLEGMVASLNSLFENFRKRYSESEALFRLEKFLMQLPYSELVKIDASEKSVVTEIPSARDRIGFNQKRLKITLLDGLHRRAETPAIPAGRYSPAVSHIISRLSCGISEGALSQTLRQSDVGLAPVIKSLRDLEFIEEIDPSTQIVPQSLLEGKTDRLTWLGHACVLFQTSRSSVCVDPFLRPHIKWTEN
Ga0066710_10304146113300009012Grasslands SoilLSKDLRKRHSRSEALSRLDKVLLQLPYSELVKINTSGKPLVTEIPGARDRIVFNQERLRITFLDGLHRRPESPTIPAGRYSPEVSQIVSKLSRGISEESLAHILGECEVDLSPAFKSLRDLQLIEEIDPSAQIVPQSLLEGKKDRLTWLGHACVLFQTSRSSICVDPYLRPHIKWTEKELKSSFSNSFGDRLFFEPYGPQLTQLSPAQLPPLDAVF
Ga0066710_10320121813300009012Grasslands SoilMERNKHLRLSLFTVPDLSIGRAPLTPQNSQVDPKRLEYLEGMVAYLNSLSKDLRKRHSRSEALSRLAKVLPQLPYSELVKINTSGKPLVTEIPGARDRIVFNQERLRITFLDGLHRRPESPTIPAGRYSPEVAHIVSKLSRGISEELLARILGECEVDLSPACKSLRDLQFIEEIDPSAQIVPQSLLEGIKDRHTWLGHAC
Ga0066710_10421644813300009012Grasslands SoilLSKDLRKRHSRSEALSRLDKVLLQLPYSELVKINTSGKPLVTEIPGARDRIVFNQERLRITFLDGLHRRHESPTIPAGRYSPEVSRIVSKLSRGMSEESLARILGECEVDLSPAFKSLRDLQLIEEIDPSAPIVPQSLLEGKKDRLTWLGHACVLFQTSRSSVCVDPYLRPHIKWTEK
Ga0066709_10031475833300009137Grasslands SoilMVAYLNGLFENLRKSYSRSETLSRLEKFLAQLPYSELVKIDTSGEPSVSEIPSARDRIAFNKDRLRINFLDGLHRRSESPGIQAGQHTPVVSQIISKLSHGLSEKELSRILGKCKANLSPAIEGLRSRQLIEGIDPSVQIVSQGLLEGKKDRLTWLGHACVLFQTSRSSVCVNPFLRPHIKWTEKDLGSSCSDTFGDRFSLNHTALSWPSCHQH
Ga0066709_10035723823300009137Grasslands SoilVPELSIGRAPLTPQNSQVDPKRVAYLEGMVAYLNGLFEDLRKRHSRTEARSRLDRVLSQLPYSELVKIEASGGPAVTEIPGARDRIAFNQERLQISLLDGLHRRTKIPAIPAGRHSPDVSHVISKLSQGVSEQTLARILGGCRVGLSQVIQGLRDLQFIEESDPSAPMVPQQLLEGKTDRLTWLGHAGILLQTARSSVCVDPFLRPHIKWTEKDLKSSFSHSFGERFFFEPYGPHLSQLSPAQLPPLD
Ga0066709_10317549113300009137Grasslands SoilGRAPLTPQNSQFDSRRVGYLEGMVAYVESLFANLRKRHSRSEALSRIQKLLVQVPYSELVTINAEGTPLVTEIPSARDRIVFDQERLRITFLDGLHRRSESPAIPAGRHSPEVSHIISKLSRGISEKALARILRERAVDLSSAIEGLRDLQLIDEIDPSVPIVPRSLSAGHQDRLTWLGHAGILFQSSRSSVCVDPFLR
Ga0105059_103344613300009795Groundwater SandLRLSLFAVPDLSIGRAPLTPQNSQFDARRAEYLDGMVAYLESVFEELRKSHSRSEALSRLETLLARLPYSELVKIDAEGTPYITTIPGARDRLVFDHDRLRIILLDGLNRRSESPVIPVGRHSAEMAHIISKLSRGIFEKRLAAILKEGTVDLAPAIASLRDLHLIEEVDPTVSMIPPGLLAGHHDRLTWLGHAGLLFQTAHASICVD
Ga0105067_106724913300009812Groundwater SandSQFDARRAEYLDGMVAYLESVFEELRKSHSRSEALSRLETLLARLPYSELVKIDAEGTPYITTIPGARDRLVFDHDRLRIILLDGLNRRSESPVIPVGRHSAEMAHIISKLSRGIFEKRLAAILKEGTVDLAPAIASLRDLHLIEEVDPTVSMIPPGLLAGHHDRLTWLGHAGLLFQTAHASICVDPFLRPHIKWSQEE
Ga0126384_1087714113300010046Tropical Forest SoilMVTYLNDVFEDAEARPRLENLLAQLPYSELVKIDTAGESCVTEIPNARDRIAFNKDRLRINFLDGLHRRAESPGIRAGRHSPQVSHVISKLSQGVSEQALSRILRHCNVDLSPVIEALRDLELIEEVDPSAPIVPQSLLAGKKDRLTWLGHACILFQTSRASVCVDP
Ga0126382_1010644533300010047Tropical Forest SoilMVVYLNDLFDDLRKNYSEAEALSRLGKILAQLPYSELVQIDPSGNPKVTEIPGARDRIGFNKDRLRINFLDGLHRRAESPGIPAGQYTPEFSHIISRLSQGLSEDALSRILAKCDVDLSSVIKGLRNLQLIEEVDQSDQIVPQSLLEGKKDRLTWLGHAGILFQT
Ga0134070_1022695913300010301Grasslands SoilAPLTPQNSQLDPRRLKYLEGMVTYLNDRFEELQKGYSQSEALSRLEHVLAQLPYSELVKIDTAGEPRVTEIPNARDRIVFNKDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSQGVSEKALSRILGQCDVNLSPVVEALRNLELIEEVDPSAQIVSQNLLEGNEDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTERDLQSSFSDTFGDRFFFEPYGPQLTQ
Ga0134082_1010243413300010303Grasslands SoilVPELSIGRAPLTPQNSQFDSRRLKYLEGMVAYLNGLFENLRKSYSKSEALSRLEKFLAQLPHSELVKIDTSGEPRVTEIPSARDRIVFTKDRLRINFLDGLHRRTESPGIPAGRYSPEVSHIISKLSQGVSEKALSRILEQCDVNLSPVIEGLRSLELIEEIDPSAQIVSQSLLEGKKDRLTWLGHACILFQTARSSVCVDPFLRPHIKWTERDLQSSFSDTFGDRFFFEPYGPQLTQLSPAQLPPLDA
Ga0134109_1016419113300010320Grasslands SoilMVDYLNGLLENLRKSHSQSEALFHLEKFLAQLPCSELVKIDTSEEPRVTEIPSARDRIVFTKDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSQGVSEKALSRILVQCDVNLSPVVEALRNLELIEEVDPSAQIVSQDLLEGNEDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTERDLQSSFSDTFGDRFFFEPYGPQLTQLSPAQ
Ga0134109_1019514413300010320Grasslands SoilSKTGSTTMNQLRLSLFTVPDLSIGRAPLTPQNSQFDSRRVAYLDGMVAYLESLFDNLRGRYSRSEALSRFERSLAQLPYAELVKINLNGGPIVSEIPTARDRIVFNQDRLKITFLDGLHRRMESPAIPAGRYSPEVSRVVSALSHGVSETALERVLSACDVGLSPVIKSLRDLQFIDDVDPSAPIVPHGLSEGNRDRLTWLGHACVLFQTARASVCVDPFLRPHITWTNEELTSCFSDSFGERLFFE
Ga0134111_1046023513300010329Grasslands SoilLQSLFENLRKCHSRSEALSRLERLLVQLPYSELVKINPEGTPLVSAIPSARDRIVFDQERLRITFLDGLHRRSESPAIPVGRHSPEVSHIISKLSRGISEKALARILRDRDVDLSSAIEGLRDLQLIEEIDPSVPIVPRSLLAGNQDRLTWLGHAGILFQTSRSSVCVDPFLRPHVKWTEEEV
Ga0134080_1023551713300010333Grasslands SoilLSLFTVPDLSIGRSPLTPQNSQIDPRRVRYLEGLIAYLSSVFEDLQKRYTRSDALSRLEKLLAQVPYSELVKINANGGPLVSEIPSARDRIVFNQERVKITFLDGLHRRTESPAIPAGRYAPQVSRIVSMLSHGVSEEALARILRAGDVGLGSVIKSLRDLQVIEEVDPSVPIVPHGLVEGTKDR
Ga0134080_1024725413300010333Grasslands SoilMKGDKQLRLSLFTVPDLSIGRAPLTPQNSQFDSRRVGYLEGMVAYLQSLFENLRKRHSRSEALSRLERLLVQLPYSELVKINPEGTPLVSAIPSARDRIVFDQERLRITFLDGLHRRSESPAIPVGRHSPEVSHIISKLSRGISEKALARILRERDVDLSSAIEGLRDLQLIEEIDPSVPIVPRSLVAGNQDRLTWLGHAGILFQTSRSSVCVDPFLRPHVKWTEEEVKSCFS
Ga0134071_1012123013300010336Grasslands SoilMERNKHLRLSLFTVPDLSIGRAPLTPQNSQVDPKRIEYLEGMVAYLNSLSKDLRKRHSRSEALSRLDKVLLQLPYSELVKINTSGKPLVTEIPGARDRIVFNQERLRITFLDGLHRRHESPTIPAGRYSPEVSRIISRLSQGVSEAALSRILVKGEVDLSPVIKSLRDLQLIDEVDSSAQIVPQSLLEGKWDRVTWLGHACILFQTARSSVCVDPFLRPHIKWTEKDLKTSFSDSYGD
Ga0134071_1013989423300010336Grasslands SoilLKYLEGMVTYLNDLFEELQKGYSQSEALSRLEHLLAQLPYSELVKIDTAGEPRVTEIPNARDRIVFNKDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSRGVSEKALSRILGQCDVNLSPVVEALRNLELIEEVDPSAPLVSESLLEGNEDRLTWLGHAGILFQT
Ga0134062_1038822613300010337Grasslands SoilVPDLSIGRSPLAPQNSQVDPRRVQYLEGLIAYLGSLFENLEKRYPRAEALSRLEKVLAQVPYGELVTINADGGPLVAEIPSARDRIVFDQERLKITLLDGLHRRTESLAIPAGRYAPEVSRIVSKLSRGISEEALARMLRDCNVGLSSVIKGLHDLQLIEDIDPSVPIVPQGLVEGRHDRLTWLGHACVLFQTPRSSVCVDPFLRPHIKWSEKDVQSCFSDSFGDR
Ga0134122_1059598723300010400Terrestrial SoilVITRDRQLRLSLFTVPDLSIGRAPLTPENSQSDPRRVAYLEGMVAYLNGLLESLRQRHAPSEALSRLETLRARLPHSELVTIDGGAQPRVTEDPGARDRIGFNPDRLKVLFLEGLHRGAESPALPAGRCSPQVSHVVSKLSRGVSEEKLSRIIAQGGVDLWAAIEGLRDLDLVEEVDSSVPIVPASLSEGGKDRLTWLGHAGVLFQTARSSVCVDPFLRPHVKWTEQELRSCFSD
Ga0137383_1028523713300012199Vadose Zone SoilMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHDHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEGLSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICVDPFLRPHIKWSEQDVQSCFSDSFGESVFFEPYGAQLTQL
Ga0137383_1103004113300012199Vadose Zone SoilSLFTGPELSIGRAPLTPQNSQFDPRRLKYLEGMVTYLNDRFEELQKGYSQSEALSRLEHLLAQLPYSELVKIDTAGEPRVTEIPNARDRIVFNKDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSQGVSEKALSRILGQCDVNLSPVVEALRNLELIEEVDPSAQIVSQNLLEGNEDRLTWLGHACILFQTSR
Ga0137363_1129298213300012202Vadose Zone SoilGMVAYLNGLFENLRKSYSQSDALSRLEKFLAQLPHSELVKIDTRAEPRVTEIPSARDRIVFTKDRLRINFLDGLHRRTESPGIPAGRYSPEVSHIISKLSQGVSEKALSRLLEQCDVDLSPVIDGLRNLELIEEIDPSAQIVCQSLLGGKKDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTERDLKSSFSDTFGDRFFFEPYG
Ga0137380_1049758423300012206Vadose Zone SoilMKGDKQLRLSLFTVPDLSIGRAPLTPQNSQFDSRRVGYLEGMVAYLQSLFESLRKRHSRSEALSRLERLLVQLPYSELVKINPEGPLVTAIPSARDRIVFDQERLRITFLDGLHRRSESPAIPAGRHSPEVSHIISKLSRGISEKALARILRERDVDLSSAIEGLRDLQLIEEIDPSVPIVPRSLLAGNQDRLTWLGHAGILFQTSRSSVCVDPFLRPHVKWTEEEV
Ga0137371_1032564513300012356Vadose Zone SoilMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICVDPFLRPHVKWNEKDLKSSFSDSFGDSLFFEPYGPEL
Ga0137371_1108217913300012356Vadose Zone SoilSPLTPQNSQIDPRRVRYLEGLIAYLSSVFEDLQKRYTRSDALSRLDKLLAQVPYSELVKINANGGPLVSEIPSARDRIVFNQERVKITFLDGLHRRRESPAIPAGRYAPQVSRIVSMLSHGVSEEALARILRAGDVGLGSVIKSLRDLRVIEEVDPSVPIVPHGLVEGTKDRLTWLGHACLLFQTSRASVCVDPFLRPHIK
Ga0137384_1049202513300012357Vadose Zone SoilMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICVDPFLRPHIKWTEKDLKSSFSYWFGDSLFFEPYGPELTQRSPAQLQP
Ga0137358_1101208113300012582Vadose Zone SoilLLENLRKSYSQSEALSHLEKFLAQLPCSELVKIDTSGEPRVTEIPSARDRIVFTKDRLRINFLDGLHRRTESPGIPAGRYSPEVSHIISKLSKGVSEKALSRLLEQCDVDLSPVIEGLRNLELIEEIDPSAQIVSQSLLEGKKDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTEK
Ga0137397_1118333513300012685Vadose Zone SoilFDPRRLKYLEGMVTYVNDLFEELQKGYSQSEALSRLENLLAQLPYAELVKIDSAGEPRVTEIANARDRIVFNKDHLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSQGVSEKALSRILGQCDVDLSPVVEALRNLELIEEVDPSAQLVSETLLEGNEDRLTWLGHACILFQTSRSSVCV
Ga0137394_1001967613300012922Vadose Zone SoilMKGDKQLRLSLFTVPDLSIGRAPLTPQNSQFDSRRVGYLEGMVAYLQSLFEHLRKRHSRSEALSRLERLLVQLPYSELVKINPEGTPLVAEIPSARDRIVFDQERLRITFLDGLHRRSESPAIPAGRHSPEVSHIISKLSRGISEKALARILREGTVDLSSAIEGLRDLQLIEEIDPSVPIVPRSLLAGNQDRLTWLGHAGILFQTSRASVCVDPFLRPHVKWTQEEMKSCFSDSFGDRR
Ga0137394_1134884213300012922Vadose Zone SoilEHLRKRHSRSEALSRLERLLVQLPYSELVKINPEATPLVTAIPSARDRIVFDQERLRITFLDGLHRRSESPAIPAGRHAPEVSHIISKLSRGISEKALARILRERDVDLSSAIEGLRNLQLIEEIDPSVPIVPRSLLAGNQDRLTWLGHAVILFQTSRASVCVDPFLRPHVKWTQEEMKSCFSDSFGDRRF
Ga0137404_1001609413300012929Vadose Zone SoilMVDYLNGLLENLRKSHSQSEALFHLEKFLAQLPCSELVKIDTSGEPRVTEIPSARDRIVFTKDRLRINFLDGLHRRTESPGIPAGRYSPEVSHIISKLSKGVSEKALSRILEQCDVNLSSVIEGLRNLELIEEIDPSAQIVSQSLLEGKKDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTEKDLKSSFSDTFGDRFFFEPYGPQLTQLSPAQLPPLDA
Ga0137404_1011032413300012929Vadose Zone SoilMVAYLNSLFENLRKSYSQSEALSHLEKILAQLPHSELVKIDTSGEPRVTETPSARDRIVFTMDRVRINFLDGLHRRIESPGIPAGRYSPEVSHIISKLSQGVSEKALSRILEQCDVNFSPVIEGLRNLELIEEVDPSVQIVSQSLLEGKKDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTEKDLKSSFSDTFGDRFFFEPYGPQLTQLSPAQLPPLDA
Ga0137404_1044893813300012929Vadose Zone SoilMKGDKQLRLSLFTVPDLSIGRAPLTPQNSQFDSRRVGYLEGMVAYLQSLFEHLRKRHSRSEALSRLERLLVQLPYSELVKINPEATPLVTAIPSARDRIVFDQERLRITFLDGLHRRSESPAIPAGRHAPEVSHIISKLSRGISEKALARILRERDVDLSSAIEGLRNLQLIEEIDPSVPIVPRSLLAGNQDRLTWLGHAGILFQTSRSSVCVDPFLRPHVKWTQEEVKSCFSDSFGDRRFFEPYGPRLTQLSPAQLPPLDA
Ga0137407_1037845733300012930Vadose Zone SoilMVAYLNGLFENLRKSYSQLEALSRLEKFLAQLPLSELVKIDTSGEPRVTEIPSARDRIVFIKDRLRINFLDGLHRRTESPGIPAGRYSPEVSHIISKLSKGVSEKALSRILEQCDVNLSSVIEGLRNLELIEEIDPSAQIVSQSLLEGKKDRLTWLGHASILFQTSRSSVCVD
Ga0134110_1030211113300012975Grasslands SoilDPRRVRYLEGLIAYLSSVFEDLQKRYTRSDALSRLDKLLAQVPYSELVKINANGGPLVSEIPSARDRIVFNQERVKITFLDGLHRRRESPAIPAGRYAPQVSRIVSMLSHGVSEEALARILRAGDVGLGSVIKSLRDLRVIEEVDPSVPIVPHGLVEGTKDRLTWLGHACLLFQTSRASVCVDPFLRPHIKWSEKDLKACFSDSFGERLFFEPYGPHLTQLSPAQLPPL
Ga0134076_1009771413300012976Grasslands SoilVPELSIGRAPLTPQNSQFDSRRLKYLEGMVAYLNGLFENLRKSYSQSDALSRLEKFLAQLPHSELVKIDTRAEPRVTEIPSARDRIVFSEDRLRINFLDGLHRRTESPGIPAGRYSPQVSHIISKLSQGVSEKALSRILGQCDVNLSPVVAALRNLELIEEVDPSAQIVSQNLLEGNEDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTEKDLKSSFSDTFGDRFFFEPYGPQLTQ
Ga0134075_1003730433300014154Grasslands SoilMVDYLNGLLENLRKSHSQSEALFHLEKFLAQLPCSELVKIDTSGEPRVTEIPSARDRIVFTKDRLRINFLDGLHRRTESPGIPAGRYSPEVSHVISKLSQGVSEKALSRLLEQCDVDLSPVIEGLRNLELIEEIDPSAQIVCQSLLEGKKDRLTWLGHACILFQTSRSSVCVDPFLRPHIKW
Ga0134075_1022346513300014154Grasslands SoilMERNKHLRLSLFTVPDLSIGRAPLTPQNSQVDPKRLEYLEGMVAYLNSLSQDLRKRHSRSEALSRLDKVLLQLPYSELVKINTSGKPLVTEIPGARDRIVFNQERLRITFLDGLHRRPESPTIPAGRYSPEVSQIVSKLSRGISEESLAHILGECEVDLSPAFKSLRDLQLIEEIDPSAQIVPQSLLEGKKDRLTWLGHACVLFQTSRSSVCVDPYLRPHIKWTEKELKSSFSNSFGDRLFFEPYGPQLTQLSPAQLPPLDAVFVT
Ga0134078_1006431013300014157Grasslands SoilMNGDKQLRLSLFAVPDLSIGRAPLTTQNCQFDARRVQYLDGMVAYLESVFTDLRRHHSRSDALSTLQRVLARLPYSELVQIDHDGNPVVSAIPGARDRIVFDHDRLRVAILDGLNRRSESPMIPVGGDSPELAHVISKLSRGISAKDLARALRTGTVDISPAITALRDLHLVEEVDPSVSTVPQPLSAGRGDRLTWLGHAALLFQTSRASICVDPFLRPHIKWTEEEKKTCFSDSFADSRLFE
Ga0137405_136900233300015053Vadose Zone SoilLEALSRLEKFLAQLPLSELVKIDTSGEPRVTEIPSARDRIVFIKDRLRINFLDGLHRRTESPGIPAGRYSPEVSHIISKLSKGVSEKALSRILEQCDVNLSSVIEGLRNLELIEEIDPSAQIVSQSLLEGKKDRLTWLDTPVFSFKPLDHRFVLIHFFDRISSGQKRI*
Ga0137403_1000612413300015264Vadose Zone SoilMKGDKQLRLSLFTVPDLSIGRAPLTPQNSQFDSRRVGYLEGMVAYLQSLFEHLRKRHSRSEALSRLERLLVQLPYSELVKINPEATPLVTAIPSARDRIVFDQERLRITFLDGLHRRSESPAIPAGRHAPEVSHIISKLSRGISEKALARILRERDVDLSSAIEGLRNLQLIEEIDPSVPIVPRSLLAGNQDRLTWLGHAGILFQTSRSSVCVDPFLRPHVKWTQEEVKSCFSDSFGDRRFFEPYGPRLTQLSPAQ
Ga0134089_1002053613300015358Grasslands SoilMVDYLNGLLENLRKSHSQSEALFHLEKFLAQLPCSELVKIDTSGEPRVTEIPSARDRIVFTKDRLRINFLDGLHRRTESPGIPAGRYSPEVSHIISKLSQGVSEKALSRLLEQCDVDLSPVIEGLRNLELIEEIDPSAQIVSQSLLEGKKDRLTWL
Ga0134089_1007040523300015358Grasslands SoilMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRA
Ga0134089_1023348323300015358Grasslands SoilMVTYLNDLFEELQKGYSQSEALSRLEHLLARLPYSELVTIDTAGEPRVTEIPNARDRIVFNKDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSRGVSEKALSRILGQCDVNLSPVVEALRNLELIEEVDPSTQIVSQNLLEGNEDRLTWLGHAC
Ga0132255_10443797813300015374Arabidopsis RhizosphereKYLDGMVTHLNDLFEGLQKGYSQSEALSRLENVLAQLPYSELVRIDTAREPRVTEIPNARDRIVFNKDRLRINFLDGLHRGAKSPGITAGRYSPQVSHIISKLSQGVSEKALSRILGQCDVDLSPAIEALRNLELIEEVDRSAQLVSQSLLEGKKDRLTWLGHACTLFQTSRSSVCVDPFLRPHLKWTEKDLQSSFS
Ga0134074_116009213300017657Grasslands SoilMNQLRLSLFTVPDLSIGRAPLTPQNSQFDSRRVAYLDGMVAYLESLFDNLRGRYSRSEALSRFERCLAQLPYAELVKINLNGGPIVREIPTARDRIVFNQDRLKITFLDGLHRRMESPAIPAGRYSPEVSRVVSALSHGVSETALERVLSACDVGLSPVIKSLRDLQFIDDVDPSAPIVPHGLSEGNRDRLTWLGHACVLFQT
Ga0134074_131691213300017657Grasslands SoilKYLEGMVTYLNDRFEELQKGYSQSEALSRLEHLLAQLPYPELVEIDTAGEPRVTEIPNARDRIVFDKDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSQGVSEKALSRILGQCDVNLSPVVEALRHLELIEEVDPSAQIVSQNLLEGNEDRLTWLGHACILFQTSRSSVCVDPFLRPHIKWTE
Ga0134083_1021116413300017659Grasslands SoilMKGDKQLRLSLFTVPDLSIGRAPLTPQNSQFDSRRVGYLEGMVAYLQSLFENLRKRHSRSEALSRLERLLVQLPYSELVKINPEGTPLVSAIPSARDRIVFDQERLRITFLDGLHRRSESPAIPAGRHAPEVSHIISKLSRGISEKALARILRERDVDLSSAIEGLRDLQLIEEIDPSVPIVPRSLSAGNQDRLTWLGHAGILFQTSRSSVCV
Ga0184618_1048871013300018071Groundwater SedimentVAYLDGMVQYLNALATDLRERHSREEARSRLEKVLPQLPYSELVTIATNGKLAVTEIPSARDRIGFNQDRLRIALLDGLHRRPESPTIHAGRYSPEVSHVVSTLSRGVSEESLARILGKCGVGLAPAIKSLRDLQLIDESDPSAQIVPQSLLDGKKDRLTWLGHACILFQT
Ga0184627_1027159313300018079Groundwater SedimentMKSDRHLRLSLFTVPEVSIGRAPLTPQNSQFDPRRVKYVEGMVAYLNGLFENLRKSYSRPEALSRFEKLLAQLPYSELVKIDSSGEPCVTEIPSARDRIAFNKDRLRINFLDGLHRRSESPGIPTGRYTPEISHIISKLAQGVSEESLSRMLKKCDVDLSPVMDGLRNLELIEAIDPAVQIVPQGLLEGEKDRLTWLGHAGVLF
Ga0184629_1006523513300018084Groundwater SedimentMINNRHLRLSLFTVPEVSIGRAPLTPQNSQFDPRRVKYVEGMVAYLNGLFENLRKNYSRPDALSRLEKLLVQLPYSELVKIDSSGEPCVTEIPSARDRIAFNKDRLRINFLDGLHRRSESPGIPAGRYMSEISHIISKLAQGVSEEALSRMLKKCNVDLSPVIEGLRNLELIEAIDPAVQIVPQGLLEGKKDRLTWLGHACVLFQTSRSSVCVDPFLRPHIQWT
Ga0066655_1044655813300018431Grasslands SoilKRAHTSTGRTHRMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICVDPFLRPHVKWTEKDLKSSFSDSFGDSLFFEPYGPELIQLSPAQLPPLDAVFVTHQDID
Ga0066667_1005212743300018433Grasslands SoilMRRNRHLRLSLFTVPELSIGRAPLTPQNSQFDSRRLKYLEGMVAYLNGLFENLRKSYSKSEALSRLEKFLAQLPHSELVKIDTSGEPRVTEIPSARDRIVFSEDRLRINFLDGLHRRTESPGIPAGRYSPEVSHIISKLSQGVSEKALSRILEQCDVNLSPVIEGLRSLELIEEIDPSAQIVSQSLLEGKKDRLTWLGHACILFQTARSSVCVDPFLRPHIKWTEKDLKSSFSDTFGDRFFFEPYGPQLTQLSPAQLPP
Ga0066667_1042799433300018433Grasslands SoilMKHTHQLRLSLFTVPDLSIGRSPLAPQNSQVDPRRVQYLEGLIAYLGSLFENLEKRYPRAEALSRLEKVLAQVPYGELVTINADGGPLVSEIPSARDRIVFNQERLKITLLDGLHRRTGSLAIPAGRYAPEVSRIVSKLSRGISEDALARLLRDCNVGLSSVIKGLHDLHIIEDIDPSVPIVPQGLVEGSKDRLTWLGHACVLFQTPRSSVCVDPFLRPHIKWSEKDVQSCFSDSFGESVFFEPYGAQLTQLSPAQLPPLDAVFVT
Ga0210382_1022266113300021080Groundwater SedimentMKTNKQLRLSLFTVPEVSIGRAPLTPQNSQFDPRRLQYLEGMVAYLNGLFENLRKSYSRSETLSRLEKFLAQLPYSELVKIDTSGEPSVAEIPSARDRIAFNKDRLRINFLDGLHRRSESPGIPAGRHTPVVSHIISKLSHGLSEKELSRILGKCEANLSPAIEGLRSCELIEEIDPSVQIVSQGLLEGEKDRLTWLGHACVLFQTSRSSVCVDPFLRPHIKWTEKDLG
Ga0210117_105681113300025985Natural And Restored WetlandsRLSLFTVSEVSIGRAPLTPQNSQFDPRRVQYVEGMVSYLTGLFESLRKSHSRPEAISRLEKLLKQLPYSELMKIDPSGEPCVAEIPSARDRIGFNKDRLRINFLDGFHRRSESPGIPAGRHTPVVSHIISKLSHGLSEKELSRILKKCAANLSPAIESLRNLELIEEIDPSMPIVPQRLLEGEKDRLTWLGHACVLFQTSRSSVCVDPFLRPHIKWTDM
Ga0209468_118559513300026306SoilIDPRRVRYLEGLIAYLSSVFEDLQKRYTRSDALSRLDKLLAQVPYSELVKINANGGPLVSEIPSARDRIVFNQERVKITFLDGLHRRTESPAIPAGRYAPQVSRIVSMLSHGVSEEALARILRAGDVGLGSVIKSLRDLRVIEEVDPSVPIVPHGLVEGTKDRLTWLGHACLLF
Ga0209375_102739553300026329SoilVKHTHQLRLSLFTVPDLSIGRSPLAPQNSQVDPRRVQYLEGLIAYLGSLFENLEKRYPRAEALSRLEKVLAQVPYGELVTINADGGPLVAEIPSARDRIVFNQERLKITLLDGLHRRTGSLAIPAGRYAPEVSRIVSKLSRGISEEALARMLRDCNVGLSSVIKGLHDLQLIEDIDPSVPIVPQGLVEGSKDRLTWLGHACVLFQTPRSSVCVDPFLRPHIKW
Ga0209375_122730413300026329SoilDAARSRRRTPSKGMKGDKQLRLSLFTVPDLSIGRAPLTPQNSQFDSRRVGYLEGMVAYLQSLFENLRKRHSRSEALSRLERLLVQLPYSELVKINPEGTPLVSAIPSARDRIVFDQERLRITFLDGLHRRSESPAIPVGRHSPEVSHIISKLSRGISEKALARILRERDVDLSSAIEGLRDLQLIEEIDPSVPIVPRSLLAGNQDRLTWLGHAGILFQ
Ga0209803_121671713300026332SoilHASTGRTHRMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICVDPFL
Ga0209377_109408213300026334SoilVSSRAAVCNLSMKRNRHLRLSLFTVPELSIGRAPLTPQNSQFDPRRLKYLEGMVAYLNGLFEDLRKDCSQSEALSRLEKLLTPLPHSELVKIDASGEPRVTEIPSARDRIAFNKDRLRINLLDGLHRRTESPGIPAGRYSPQVSHIVSKLSQGISEQALSRILGQCDVNLSPAIEGLRNLELIEEIDPSEQIVPQSLLEGKNDRLTW
Ga0209378_105320343300026528SoilMKTNKQLRLSLFTVPDLSIGRAPLTPQNSQFDPKRLEYLEGMVAYLNGLFENLRQRHPEPEALARLEKVSSRLPYSRLVKINTGGEPPLVTETPGARDRIAFNHEHLKITFLDGLYRRTESPAIPAGRYSPEVSHVISELSHGVSEEALSGILRECEVDLSPVIKSLRDIQFIEETDPSAQIVPQSLLEGKNDRLTWLGHAGILFQTSRASICVDPFLRPHVKWNEKDL
Ga0209058_116937123300026536SoilMERNRHLRLSLFTVPDLSIGRAPLTPQNSQVDPKRLEYLEGMVAYLNSLSKDLRKRHSRSEALSRLDKVLLQLPYSELVKINTSGKPLVTEIPGARDRIVFNQERLRITFLDGLHRRPESPTIPAGRYSPEVSQIVSKLSRGISEESLAHILGECEVDLSPAFKSLRDLQLIEEIDPSAQIVPQSLLEGKKDRLTWLGHACVLFQTSRSSVCVDPYLRPHIKWTEKELKSSFSNSFGDRLFFEPYGPQLTQLSPAQLP
Ga0209898_105546113300027068Groundwater SandEELRKSHSRSEALSRLETLLARLPYSELVKIDAEGTPYITTIPGARDRLVFDHDRLRIILLDGLNRRSESPVIPVGRHSAEMAHIISKLSRGIFEKRLAAILKEGTVDLAPAIASLRDLHLIEEVDPTVSMIPPGLLAGHHDRLTWLGHAGLLFQTAHASICVDPFLRPHIK
Ga0209388_108891113300027655Vadose Zone SoilMVTYVNDLFQELQKGYSQSEALSRLENLLAQLPYSELVKIDTAGEPRVTEIPNARDRIVFNKDRLRINFLDGLHRGAKSPGIPAGRYSPQVSHIISKLSQGVSEKALSRILGQCDVNLSPVVEALRNLELIEEVDQSAQIVSQSLLEGNEDRLTWLGHACILFQTSRSSVCVDPFL
Ga0209853_112050513300027961Groundwater SandSLFAVPDLSIGRAPLTPQNSQFDARRAEYLDGMVAYLESVFEELRKSHSRSEALSRLETLLARLPYSELVKIDAEGTPYITTIPGARDRLVFDHDRLRIILLDGLNRRSESPVIPVGRHSAEMAHIISKLSRGIFEKRLAAILKEGTVDLAPAIASLRDLHLIEEVDPTVSMIPPGLLAGHHDRLTWLGHAGLLFQTAHASICVDPFLRPHIKWSEE
Ga0307468_10071562113300031740Hardwood Forest SoilMKKNRHLRLSLFTVSEVSIGRAPLTPQNSQFDPRRLKYLEGMVSYLNELIENLQKDYTRPEARYRLGKLLAQLPYSDLVKVDTSGEPSVAETPSARDRIAFNKDRLRINFLDGLHRRAESPGIPAGRHTTEIAHIISRLSQGISEESLSRILKKCDVNLSPVIEGLRNLELIEAIDPSEQIVSQSLLERGKDRLTWLGHACVLLQTSRSSICVDPFLRPHIKWTENDLRSSFSDTFGDRFFFDPYGPQLTQLSPTQLPPL
Ga0307471_10315313913300032180Hardwood Forest SoilKRVAYLEGMVAYLHGLFESLCKQHSRSEALSRLTEALSQLPYSELVKIDASGTPLVTEIPGARDRIAFNQERLQIAFLDGLHRRPESPVIPAGRYSPEVSHIVSTLARGVSEQALSRALGACGVGLSGVIQGLRDLQLVEEIDPSAPIVPQRLSEGKKDRLTWLGHAGMLFQTSRTSICVDPFLRPQMRWTEKE
Ga0315271_1087567123300032256SedimentMINNRHLRLSLFTVPDLSIGRAPLTPQNSQFDPRRVKYVEGMVAYLNGLFGNLQKSYSRPEALSRLEKLLAQLPYSELVKLDTSGEPCVTEIPSARDRIGFNKDRLRINFLDGLHRRPGRPSIPAGRYTPEVSHIISKLSQGVSEEALSRILRRCEVNLSPVMEGLRNLELIEEIDSSAQIVSERLLEGKKDRLTWLGHACILFQTARSSICVDPFL
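
In the listing above, the four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) run together without separators. The following is a minimal sketch for splitting one row back into its fields. It rests on assumptions inferred from the rows shown here, not on any documented NMPFamsDB format: the protein ID starts with `Ga`, the sample taxon ID is the 10 digits immediately before the habitat name, the habitat starts with a capital letter and ends with a lowercase letter, and the sequence is uppercase residues with an optional `*`. The names `ROW`, `FamilyRow`, and `parse_row` are hypothetical.

```python
import re
from typing import NamedTuple, Optional

# Assumed layout: <protein id><10-digit taxon id><habitat><sequence>.
# The greedy \d+ backtracks until exactly 10 digits remain before the habitat;
# the lazy habitat group stops at the last lowercase letter in the row.
ROW = re.compile(r"^(Ga\d+_\d+)(\d{10})([A-Z].*?[a-z])([A-Z*]+)$")


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


def parse_row(line: str) -> Optional[FamilyRow]:
    """Split one fused table row into its four fields, or return None."""
    match = ROW.match(line.strip())
    return FamilyRow(*match.groups()) if match else None


if __name__ == "__main__":
    # Row taken from the listing above, with the sequence truncated for brevity.
    row = parse_row("Ga0066706_1033995533300005598SoilMVAYLNGLFENLRK")
    print(row)
```

Lines that do not fit the assumed layout, such as the header line, return `None`, so they can be skipped when reading the whole listing.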




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice