NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F068644

Metagenome / Metatranscriptome Family F068644

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F068644
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 164 residues
Representative Sequence VKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASISLAAKAMGDLRTDHEFESKMLVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEELVRRS
Number of Associated Samples 94
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 30.65 %
% of genes near scaffold ends (potentially truncated) 91.13 %
% of genes from short scaffolds (< 2000 bps) 88.71 %
Associated GOLD sequencing projects 78
AlphaFold2 3D model prediction Yes
3D model pTM-score0.72

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (72.581 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(36.290 % of family members)
Environment Ontology (ENVO) Unclassified
(63.710 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(66.129 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 58.15%    β-sheet: 0.00%    Coil/Unstructured: 41.85%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.72
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
a.74.1.2: Transcription factor IIB (TFIIB), core domaind1c9ba11c9b0.75059
a.74.1.2: Transcription factor IIB (TFIIB), core domaind1aisb11ais0.74879
a.74.1.1: Cyclind7b5oi17b5o0.74391
a.74.1.1: Cyclind4eojb14eoj0.74254
a.74.1.0: automated matchesd3mi9b13mi90.74143


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 124 Family Scaffolds
PF04075F420H2_quin_red 13.71
PF13561adh_short_C2 8.87
PF00106adh_short 7.26
PF07366SnoaL 4.03
PF01408GFO_IDH_MocA 2.42
PF04343DUF488 1.61
PF13248zf-ribbon_3 0.81
PF00903Glyoxalase 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 124 Family Scaffolds
COG3189Uncharacterized conserved protein YeaO, DUF488 familyFunction unknown [S] 1.61


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A72.58 %
All OrganismsrootAll Organisms27.42 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002558|JGI25385J37094_10070145Not Available1120Open in IMG/M
3300002560|JGI25383J37093_10030212All Organisms → cellular organisms → Bacteria1815Open in IMG/M
3300002561|JGI25384J37096_10064940All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1352Open in IMG/M
3300002561|JGI25384J37096_10154811Not Available727Open in IMG/M
3300002561|JGI25384J37096_10165842Not Available683Open in IMG/M
3300002561|JGI25384J37096_10173284Not Available657Open in IMG/M
3300002562|JGI25382J37095_10046846All Organisms → cellular organisms → Bacteria1671Open in IMG/M
3300002908|JGI25382J43887_10147223Not Available1200Open in IMG/M
3300002912|JGI25386J43895_10128321Not Available628Open in IMG/M
3300002912|JGI25386J43895_10196020Not Available507Open in IMG/M
3300005167|Ga0066672_10635276Not Available690Open in IMG/M
3300005172|Ga0066683_10462445Not Available777Open in IMG/M
3300005174|Ga0066680_10278405Not Available1066Open in IMG/M
3300005174|Ga0066680_10509465Not Available757Open in IMG/M
3300005177|Ga0066690_10681048Not Available682Open in IMG/M
3300005178|Ga0066688_10228799All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon 13_2_20CM_2_52_211186Open in IMG/M
3300005180|Ga0066685_10504468All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon 13_2_20CM_2_52_21837Open in IMG/M
3300005181|Ga0066678_10032439All Organisms → cellular organisms → Bacteria2857Open in IMG/M
3300005181|Ga0066678_10377028All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon 13_2_20CM_2_52_21938Open in IMG/M
3300005186|Ga0066676_10697669Not Available691Open in IMG/M
3300005447|Ga0066689_10244879All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia1101Open in IMG/M
3300005540|Ga0066697_10047648All Organisms → cellular organisms → Bacteria2422Open in IMG/M
3300005552|Ga0066701_10520210Not Available734Open in IMG/M
3300005552|Ga0066701_10620300Not Available656Open in IMG/M
3300005555|Ga0066692_10975466Not Available519Open in IMG/M
3300005556|Ga0066707_10168968Not Available1396Open in IMG/M
3300005557|Ga0066704_10061878All Organisms → cellular organisms → Bacteria2389Open in IMG/M
3300005558|Ga0066698_10065786All Organisms → cellular organisms → Bacteria2326Open in IMG/M
3300005568|Ga0066703_10825239Not Available530Open in IMG/M
3300005575|Ga0066702_10196048Not Available1220Open in IMG/M
3300005586|Ga0066691_10300158Not Available947Open in IMG/M
3300005587|Ga0066654_10809848Not Available532Open in IMG/M
3300005598|Ga0066706_10414423Not Available1072Open in IMG/M
3300006034|Ga0066656_10080214All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1946Open in IMG/M
3300006034|Ga0066656_10302952Not Available1031Open in IMG/M
3300006046|Ga0066652_100901657Not Available843Open in IMG/M
3300006796|Ga0066665_10304097All Organisms → cellular organisms → Archaea1283Open in IMG/M
3300006797|Ga0066659_10233793Not Available1366Open in IMG/M
3300006797|Ga0066659_10537323Not Available942Open in IMG/M
3300006797|Ga0066659_11471932Not Available570Open in IMG/M
3300006800|Ga0066660_11085877Not Available635Open in IMG/M
3300007258|Ga0099793_10144908Not Available1123Open in IMG/M
3300007258|Ga0099793_10441198Not Available643Open in IMG/M
3300009012|Ga0066710_101439493Not Available1066Open in IMG/M
3300009088|Ga0099830_11569657Not Available548Open in IMG/M
3300009089|Ga0099828_11706514Not Available554Open in IMG/M
3300009137|Ga0066709_100855620Not Available1322Open in IMG/M
3300010128|Ga0127486_1068417Not Available504Open in IMG/M
3300010301|Ga0134070_10042437All Organisms → cellular organisms → Bacteria1524Open in IMG/M
3300010303|Ga0134082_10218857Not Available783Open in IMG/M
3300010303|Ga0134082_10297919Not Available675Open in IMG/M
3300010323|Ga0134086_10304078Not Available620Open in IMG/M
3300010325|Ga0134064_10194669Not Available724Open in IMG/M
3300010336|Ga0134071_10267622Not Available853Open in IMG/M
3300010361|Ga0126378_13062745Not Available532Open in IMG/M
3300011270|Ga0137391_11552902Not Available506Open in IMG/M
3300012096|Ga0137389_10140765Not Available1965Open in IMG/M
3300012096|Ga0137389_11151272Not Available664Open in IMG/M
3300012189|Ga0137388_10299275Not Available1475Open in IMG/M
3300012189|Ga0137388_10787866Not Available881Open in IMG/M
3300012198|Ga0137364_10361779All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon 13_2_20CM_2_52_211083Open in IMG/M
3300012201|Ga0137365_10228940All Organisms → cellular organisms → Archaea → TACK group1385Open in IMG/M
3300012201|Ga0137365_11123839Not Available565Open in IMG/M
3300012206|Ga0137380_10502078All Organisms → cellular organisms → Bacteria → Terrabacteria group1068Open in IMG/M
3300012206|Ga0137380_10591608All Organisms → cellular organisms → Bacteria → Terrabacteria group971Open in IMG/M
3300012207|Ga0137381_10074277Not Available2847Open in IMG/M
3300012207|Ga0137381_10466438All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Propionibacteriales → Nocardioidaceae → Marmoricola → Marmoricola endophyticus1102Open in IMG/M
3300012209|Ga0137379_10925045Not Available777Open in IMG/M
3300012210|Ga0137378_10147968All Organisms → cellular organisms → Bacteria2176Open in IMG/M
3300012210|Ga0137378_11593636Not Available562Open in IMG/M
3300012211|Ga0137377_11923309Not Available508Open in IMG/M
3300012349|Ga0137387_10263750All Organisms → cellular organisms → Bacteria → Terrabacteria group1243Open in IMG/M
3300012351|Ga0137386_11284340Not Available508Open in IMG/M
3300012357|Ga0137384_10368031All Organisms → cellular organisms → Archaea1190Open in IMG/M
3300012361|Ga0137360_11695525Not Available537Open in IMG/M
3300012362|Ga0137361_11103368Not Available714Open in IMG/M
3300012363|Ga0137390_10267141All Organisms → cellular organisms → Bacteria1696Open in IMG/M
3300012401|Ga0134055_1333075Not Available548Open in IMG/M
3300012918|Ga0137396_10034820Not Available3360Open in IMG/M
3300012927|Ga0137416_10002790All Organisms → cellular organisms → Archaea9421Open in IMG/M
3300012927|Ga0137416_10131759All Organisms → cellular organisms → Bacteria1930Open in IMG/M
3300012927|Ga0137416_11529696Not Available606Open in IMG/M
3300012927|Ga0137416_11751621Not Available567Open in IMG/M
3300012972|Ga0134077_10470526Not Available553Open in IMG/M
3300012972|Ga0134077_10583231Not Available504Open in IMG/M
3300012977|Ga0134087_10268956Not Available788Open in IMG/M
3300014150|Ga0134081_10256502Not Available613Open in IMG/M
3300017656|Ga0134112_10153148Not Available888Open in IMG/M
3300017659|Ga0134083_10081421All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1258Open in IMG/M
3300017934|Ga0187803_10122087All Organisms → cellular organisms → Archaea1023Open in IMG/M
3300018433|Ga0066667_10764266Not Available817Open in IMG/M
3300018468|Ga0066662_10552014All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia1065Open in IMG/M
3300018468|Ga0066662_10716822Not Available957Open in IMG/M
3300024330|Ga0137417_1239550All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1886Open in IMG/M
3300025922|Ga0207646_10021056Not Available6031Open in IMG/M
3300026297|Ga0209237_1228603Not Available576Open in IMG/M
3300026298|Ga0209236_1087147Not Available1433Open in IMG/M
3300026298|Ga0209236_1271924Not Available550Open in IMG/M
3300026307|Ga0209469_1131501Not Available577Open in IMG/M
3300026313|Ga0209761_1016636All Organisms → cellular organisms → Bacteria4678Open in IMG/M
3300026313|Ga0209761_1321535Not Available529Open in IMG/M
3300026318|Ga0209471_1228737Not Available672Open in IMG/M
3300026326|Ga0209801_1171513Not Available899Open in IMG/M
3300026328|Ga0209802_1161264Not Available931Open in IMG/M
3300026334|Ga0209377_1181162Not Available734Open in IMG/M
3300026499|Ga0257181_1003819Not Available1665Open in IMG/M
3300026499|Ga0257181_1090114Not Available536Open in IMG/M
3300026527|Ga0209059_1147022Not Available862Open in IMG/M
3300026528|Ga0209378_1021272Not Available3609Open in IMG/M
3300026528|Ga0209378_1280485Not Available531Open in IMG/M
3300026529|Ga0209806_1332339Not Available510Open in IMG/M
3300026532|Ga0209160_1019144All Organisms → cellular organisms → Archaea4639Open in IMG/M
3300026532|Ga0209160_1104623All Organisms → cellular organisms → Archaea → TACK group1426Open in IMG/M
3300026540|Ga0209376_1232422Not Available805Open in IMG/M
3300026552|Ga0209577_10098525All Organisms → cellular organisms → Bacteria2374Open in IMG/M
3300027671|Ga0209588_1068958Not Available1142Open in IMG/M
3300027748|Ga0209689_1284661Not Available648Open in IMG/M
3300027875|Ga0209283_10498733Not Available783Open in IMG/M
3300028536|Ga0137415_10013982All Organisms → cellular organisms → Archaea7966Open in IMG/M
3300028536|Ga0137415_11214386Not Available569Open in IMG/M
3300028536|Ga0137415_11413142Not Available518Open in IMG/M
3300031820|Ga0307473_10712464Not Available706Open in IMG/M
3300032180|Ga0307471_103583198Not Available549Open in IMG/M
3300032205|Ga0307472_101783375Not Available611Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil36.29%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil29.84%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil16.13%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil11.29%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.42%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.61%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.81%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.81%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.81%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010128Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaGEnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012401Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017934Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_3EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026307Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026527Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1007014513300002558Grasslands SoilVKAQERFYNLNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKGRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDNILPFPDLVSQLTKRQIESMADGFAERGLISGEEREELIRRSCDYLDAAVERG
JGI25383J37093_1003021243300002560Grasslands SoilVKAQERFYSLGKGVESLSKEEHRLFFDAMQRLGLRNEVKLAAVALYLDFKSRPIGEYNADRKNLQIFLIASVSLAAKAMGDLRTDREFESKMFVRREKLLDAEERILRSFAVQDSIIPLPELVLRPTRRQIESMAEGFAERELVSSDEKE
JGI25384J37096_1006494023300002561Grasslands SoilLEEQTIGCPFVKTQERIYNLSKGVESLTKEEHGLFFDAMHRLGLPNEVKQAAVALYLDFKTRPIGVYNAGRKNLKIFLLASVSIAAKALGDLRTDPEFESKMFVSREKLVDAEERMIRSFGVQDSIIPFPELVLRLTKRQMESMAESFARQELVSDNEKEELVR
JGI25384J37096_1015481113300002561Grasslands SoilVKTQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLMASVSIAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDSILPFPGLVSQLTKRQIESMAEGFAERGLVSSNEREELVRRSYDYLDAAVERGLGPKMSYRG
JGI25384J37096_1016584213300002561Grasslands SoilVKTQERIYSLNKGVDSLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLMASVSIAAKAMGDLRTDHEFESRMFVNREKLADAEERIIKSFGVQDSILPFPDLVSQLTTRQIESMAEGFAERGLVSSNEREELVRRTYDYLD
JGI25384J37096_1017328413300002561Grasslands SoilTGAGLFVGQLSVAGSHVRRICPKCVIRIGRSALAPINAYIFGNTGCCRAFVKAQERMYSLNRGVESLSKEEHGLFFDTVQRLELSNEVKLAAVALYLDFKSRPIGEYNADHRNLEIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLIDAEERIIRSFGVQDSIMPIPDLVSQLTKRHIESAAEGFATRELVSSSEKEELVQCSYEYLDAALEKGL
JGI25382J37095_1004684633300002562Grasslands SoilVKAQERIYSLNKGVESLSKDEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNAEHKNLEIFLIASVSLAAKAMGDLRTDREFESKMFVSKEKLLDAEERIIRSFGVQDNIMPFPDLVLQLARRQIESMVEGFAERGLVSND
JGI25382J43887_1014722323300002908Grasslands SoilVKAQEXFYNLNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKGRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDNILPFPDLVSQLTKRQIESMADGFAERG
JGI25386J43895_1012832113300002912Grasslands SoilVKAQEXFYNLNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKGRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDNILPFPDLVSQLTKRQIESMADGFAERGLVSSDE
JGI25386J43895_1019602013300002912Grasslands SoilCARVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNAEHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSKEKLLDAEERIIKSFGLQDSIMHFSDLVLQLTRRQIESMAESFAQRELVSSDEEAELVRRSYNYLDEAVEK
Ga0066672_1063527613300005167SoilVKAQQRVYNLNKGVESLSKEEHGLFFDAMQRLGLPNEVKLAAVALYLDFKGRPIGEYNSDHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVGREKLADAEERIIKSFGVQDSIILFPDLVSQLTKRQIESMAEGFAERGLVS
Ga0066683_1046244523300005172SoilVERLSIWEGRSYVDAVKAQERFYNLNKGVESLSKEEHGLFFDAIQRLGLPNEVKLAAVALYQDFKGRPIGEYNSDHKNLQIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLADAEERIIKSFGVRDSIIPFPDLVSQLTKRQIESMADGFAERGLVSSNEKE
Ga0066680_1027840513300005174SoilVKAQERFYNLNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKGRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDNILPFPDLVSQLTKRQIESMADGFAERG
Ga0066680_1050946513300005174SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVRQAAVALYLDCKSRPIGEYNADHKNLEIFLMASVSIAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDSILPFPGLVSQLTKRQIESMAEGFAERGLVSSNEREELVRRSYDYLDAAVERGLG
Ga0066690_1068104813300005177SoilMERLSIWEGRSHVDPVKAQQKVYNLNKGVESLSKEEHGLFFDAMQRLGLPNEVKLAAVALYLDFKGRPIGEYNSDHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVGREKLADAEERIIKSFGVQDSIILFPDLVSQLTKRQIESMAEGFAERGLVS
Ga0066688_1022879913300005178SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASISLAAKAMGDLRTDHEFESKMLVSREKLADAEERIIKSFGVQGSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEELVRRSYDYLDAAVEKGLSPKMSYRGR
Ga0066685_1050446813300005180SoilMPQQEICSLTLFHMNAYIFGNTGCRCARVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPLPDL
Ga0066678_1003243913300005181SoilVKAQERIYNLNRGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLDIFLIASISLSAKAMGEMRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEELVRRS
Ga0066678_1037702813300005181SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASISLAAKAMGDLRTDHEFESKMLVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEELVRRS
Ga0066676_1069766913300005186SoilLEEQIVGCPFVKTKERIYNLSRGVESLTKEEHGLFFDAMHRLGLPNEVKQAAVALYLDFKTRPIGVYNAGRKNLKIFLLASVSIAAKVTGDLRTDPEFESKMFVSREKLVDAEERMIRSFGVQHNIIPIPELVLRLTQRQMESMAESFAKQELVTDNEKRGTDPAFL*
Ga0066689_1024487913300005447SoilMNAYIFGNTNCCCPRVKDQQRIYSLNKGVESLSKEEHGLFFDAVQSLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLDIFLIASISLSAKAMGEMRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEELVRRS
Ga0066697_1004764843300005540SoilMNAYIFGNTDCRCARVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLLDAEERIIKSFGVQDSILPFPDLVSQLTKRQI
Ga0066701_1052021013300005552SoilVKAQERIYNLNKGVESLSKEEHGLFFDAMQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLLDAEERIIRSFGVQESILPFPDLVSQLTKRQIESMAEGFAERDLVSSSDKEELVRLS
Ga0066701_1062030013300005552SoilVKTQERIYSLNKGVESLSKEEHGLFFDAVQSLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLDIFLIASISLSAKAMGEMRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEEL
Ga0066692_1097546613300005555SoilLEEQTIGCPFVKTQERIYNLSRGLESLTKEEHGLFFDAMHRLGLPNEVKQAAVALYLDFKTRPIGVYNAGRKNLKIFLLASVSIAAKVTGDLRTDPEFESKMCVSREKLVDAEERMIRSFGVQDSIIPIPELVLRLTKRQMESMGESFAKQELVTDNEKEELVRL
Ga0066707_1016896813300005556SoilDCPCAFVKAQERIYSLNRGVESLSREEHGLFFDAVQRLRLSNEVKQAAVAFYLDFKGRPIGEYNPDHKDLDIFLIASVSLAAKAIGNLRTDHEFESRMFVNREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSNEREELVRRSYDYLDAAVEKGLSPKMSYRVEPRA*
Ga0066704_1006187833300005557SoilMNAYIFGNTNCCCPRVKDQQRIYSLNKGVESLSKEEHGLFFDAVQSLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLDIFLIASISLSAKAMGDMRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEGFAERDLVSSSDKEELVRLSYEYLDAAVERGLGPKMSYRGR
Ga0066698_1006578613300005558SoilVKAQQRVYNLNKGVESLSKEEHGLFFDAMQRLGLPNEVKLAAVALYLDFKGRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDNILPFPDLVSQLTKRQIESMADGFAER
Ga0066703_1082523913300005568SoilCPFVKTQERIYNLSRGLESLTKEEHGLFFDATHRLGLPNEVKQAAVALYLDFKTRPIGTYNSSRKNLKIFLLASVSIAAKVTGDLRTDPEFESKMFVSREKLVDAEERMIRSFGVQDSIIPFPELVLRLTKRQMESMAESFAKQELVTDNEKEELVQLSFDYLDAAVERGLDPMMS
Ga0066702_1019604823300005575SoilLSKEEHGLFFDAMQRLGLPNEVKLAAVALYLDFKGRPIGEYNSDHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVGREKLADAEERIIKSFGVQDSIILFPDLVSQLTKRQIESMAEGFAERGLVS
Ga0066691_1030015823300005586SoilVKAQERIYNLNKGVESLSKEEHGLFFDAMQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLLDAEERIIKSFGVQDSILPFPDLVSQLTKRQMES
Ga0066654_1080984813300005587SoilVKGQERIYNLRRGVESLSKEEHGIFFDAVQRLGLPNEVKQAAVALYLDFKSRPIGEYNAGHKNLEIFLIASVSLAAKAMGELRTDHEFESKMFVSREKLLDAEERIMKSFGVQDSILPFPDLVSRLTKRQIESMAEGFAERGLVSSNEREELVRRSHDYL
Ga0066706_1041442313300005598SoilVKAQERIYNLNKGVESLSKEEHGLFFDAMQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLDIFLIASISLSAKAMGEMRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEE
Ga0066656_1008021413300006034SoilLEEQIVGCPFVKTQERIYNLSRGLESLTKEEHGLFFDAMHRLELPNEVKQAAVALYLDFKTRPIGAYNSSRKNLKIFLIASISIAAKVTGDLRTDPEFESKMFVSREKLVDAEERMIRSFGVQDSIIPFPELVLRLTKRQMESMAESFAKQELVTDNEKEELIQLSYDY
Ga0066656_1030295223300006034SoilVKAQERIYSLNRGVESLSKEEHGLFFDAVQRLGLPNEVKLAAVALYLDFKSRPIGEYNTDHKNLDIFLIASISLAAKAMGDLRTDHEFESRMFVNREKLVDAEERIIRSFGVQESIIPVQDLVLQLTKRHIESMAEGFAERELVSSSEKEELVQRSYEYLDAALEKGL
Ga0066652_10090165723300006046SoilVKAQERFYSLSKGVESLSKEEHGLFFDAVQRLGLPNEVKQAAVALYLDFKSRPIGEYNANHKNLEIFLIATVSLAAKAMGDLRTDHEFESKMFVSKEKLADAEERILRSFGVQDSIMPFPDLVSQLTKRQIESMAQSFAERELVSNNEREELVRRSNEY
Ga0066665_1030409713300006796SoilMNAYIFGNTDCRCARVKAQERIYSLNTGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASISLAAKAMGDLRTDHEFESKMLVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEELVRRSY
Ga0066659_1023379313300006797SoilMEHTTLVATSSTLHVRCGRAIKISSLILALMNAYIFGNTDCPCALVKAQERIYSLNRGVESLSKEEHGLFFDAVQRLGLSNEVKQAAVALYLDFKSRPIGEYNADHKNLAIFLIASVSLAAKTLGDLRTDHEFESRMFVSREKLADAEERITKSFGVQDSIRPFPDLVSQLTKKQIESMADGFAERGLVGSNEREELVRRSYD*
Ga0066659_1053732313300006797SoilVKAQERFYSLGKGVESLSKEEHGLFFDAMQRLGLRNEVKLAAVALYLDFKSRPIGEYNADRKNLQIFLIASVSLAAKAMGDLRTDREFESKMFVRREKLLDAEERILRSFAVQDSIIPLPELVLRPTRRQIESMAEGFAERELVSSDEKEKLVRRSY
Ga0066659_1147193213300006797SoilVKAQERIYNLNRGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLDIFLIASISLSAKAMGEMRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEELVRRSY
Ga0066660_1108587713300006800SoilVKDQQRIYSLNKGVESLSKEEHGLFFDAVQSLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLKIFLVASVSIAAKALGDLRTDHEFESKMFVSREKLVDAEERVFKSFGVQDHIVPFPDLVSQLTKRQIESMAEGFAERGLVDSEE
Ga0099793_1014490823300007258Vadose Zone SoilVKAQERIYSLNRGVESLSKEEHGLFFDAVQRLDLRNEVKLAAVALYLDFKSRPIGEYNADQKNLKIFLVASVSLAAKAMGDQRTDREFESKMFVSGEKLVDAEERIIRSFGVQDSLIPFREIVSRLARRQIESMAESFADRHLVNSNEKKELIRRSCDYLDAAVEKGLSPKTSYR
Ga0099793_1044119813300007258Vadose Zone SoilMKAQTRSYNLGRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVTLYLDFKSRPIGEYNADHKNLDIFLIASVSLAAKAIGDVRTDHEFESRMFVSREKLIDAEERIIRSFGVQDSIIPIPDLVSQLTKRHIESMAEGFAERELVSRVEKEELVQRSYDYLDAAVEKGLN
Ga0066710_10143949313300009012Grasslands SoilTTLVATSSTLHVRCGRAIKISSLILALMNAYIFGNTDCPCALVKAQERIYSLNRGVESLSKEEHGLFFDAVQRLGLSNEVKQAAVALYLDFKSRPIGEYNADHKNLAIFLIASVSLAAKTLGDLRTDHEFESRMFVSREKLADAEERITKSFGVQDSIRPFPDLVSQLTKKQIESMADGFAERGLVGSNEREELVRRSYD
Ga0099830_1156965713300009088Vadose Zone SoilVKATERIYNLSKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNAGHMNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLVDAEERMIRSFSVRDSIIPFPELVLQLTKRQIESMAEGFAERELVSS
Ga0099828_1170651413300009089Vadose Zone SoilNAYIFGNTDGCCASVKSQERIYNLNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEIESKMFVRRERLVDAEERIIKSLGVQESIIPIPDLVSQLTKRHIESMAEGFAEREMVSSGEKMELVQRSYDYLDAAVVKGLNP
Ga0066709_10085562013300009137Grasslands SoilMNAYIFGNTDCPCAFVKAQERIYSLNRGVESLSREEHGLFFDAVQRLRLSNEVKQAAVAFYLDFKGRPIGEYNPDHKDLDIFLIASVSLAAKAIGNLRTDHEFESRMFVNREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSNEREELVRRSYDYLDAAVEKGLSPKMSYRVEPRA*
Ga0127486_106841713300010128Grasslands SoilKGVESLSKEEHGLFFDAMQTLGLRNEVKQAAVALYLDFKSRPIGEYNADRKNLQIFLIASVSLAAKAMGDLRTDREFESKMFVRREKLLDAEERILRSFAVQDSIIPLPELVLRLTRRQIESMAEGFAERELVSSDEKEELVRRCHSHLDATVEKGLAPKTSYRGRAA
Ga0134070_1004243733300010301Grasslands SoilVKAQERFYSLGKGVESLSKEEHGLFFDAMQRLGLRNEVKLAAVALYLDFKSRPIGEYNADRKNLQIFLIASVSLAAKAMGDLRTDREFESKMFVRREKLLDAEERILRSFAVQDSLIPVRELVLRLTRRQIESMAEGFAERELVSSDEKEELVRRCHSHLDATVEKGLAPKTSYRG
Ga0134082_1021885713300010303Grasslands SoilVKAQEGIYSLNKGVESLSKEEHGLFFDAVQSLGLPNEVKQSAVALYLDFKSRPIGEYNADHKNLDIFLIASISLSAKAMGDMRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAER
Ga0134082_1029791913300010303Grasslands SoilVKAQERFYSLGKGVESLSKEEHGLFFDAMQTLGLRNEVKQAAVVLYLDFKSRPIGEYNADRKNLQIFLIASVSLAAKAMGDLRTDREFESKMFVRRENLLDAEERILRSFAVQDSIIPVPELVLRLTRRQIESMAEGFAERELVSSDEKEELVRRSYSHLDAAVEKGLAPKTI*
Ga0134086_1030407813300010323Grasslands SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLADAEERIIKSFGVQDSIVPFPDLVSQLTKRQIESMADGFAERGLVSSNEREELVRRTYDYLDAAVERGLGP
Ga0134064_1019466913300010325Grasslands SoilVKAQERFYSLGKGVESLSKEEHGLFFDAMQRLGLRKEVKLAAVALYLDFKSRPIGEYNADRKNLQIFLIASVSLAAKAMGDLRTDREFESKMFVRREKLLDAEERILRSFAVQDSIIPLPELVLRLTRRQIESMAEGFAERELVSSDEKEELVRRCHSHLD
Ga0134071_1026762213300010336Grasslands SoilLEEQTVGCPFVKTQERIYNLSRGLESLTKEEHGLFFDAMHRLELPNEVKQAAVALYLDFKTRPIGVYNAGRKNLKIFLLASVSIAAKVTGDLRTDPEFESKMFVSREKLVDAEERMIRSFGVQHNIIPIPELVLRLTQRQMESMAESFAKQELVTDNEKEELIQLSYDY
Ga0126378_1306274513300010361Tropical Forest SoilVKAQERILGPSKGVESLSREEHGVFFDAMQRLGLSSQVKEAAVALYLDFKSRPVGEYNADRKNLDIFLVAAISLAAKAVGELRTDREFESKMFVGKEKLIDAEERLLRSFGVQNDVMPLADFVVQLTKRQIESMAEGFAERELISS
Ga0137391_1155290213300011270Vadose Zone SoilQERIYNLNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEIESKMFVRRERLVDAEERIIRSLGVQESIIPIPDLVSQLTKRHIESMAEGFAERELVSSSEKTELVQRSYDYLDAAVVKGLNPK
Ga0137389_1014076543300012096Vadose Zone SoilMDCCCAFVKAQEKIYSLNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLDIFLIASVSLAAKAMGDLRTDHEFESKMFVRKERLVDAEERILRSFGVQESIMPVPDLVLQLTKRHIESMAEGFAERELVSNNE
Ga0137389_1115127213300012096Vadose Zone SoilVKAQERIYRLNRGVESLSKEEHGLFFDAVQRLGLPNEVKLAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLVDAEERIIKSFRVQDSIIPFPDLVSQLTKRQMESMAESFAERGLVSSDEKAELVRLFLNI
Ga0137388_1029927513300012189Vadose Zone SoilVKAQERIYSLNRGVESLSKEEHGLFFDTVQRLGLSNEVKLAAVALYLDFKSRPIGEYNTDHKNLEIFLIASLSLTAKAMRDLRNEHEFESRMFVNREKLVDAEERIIRSFGVQESIMRVPDLVLQLTKRHIESMAEGFAERELVSNNEKMEL
Ga0137388_1078786613300012189Vadose Zone SoilVKTQERIYNPSRGVESLTKEEHGLFFDAMHRLGLPNEVKQAAVALYLDFKTRPIGTYNAGRKNLKIFQLASVSIAAKTLGDLRTEPEFESKMFVSREKLVDAEERMIKSFGVQDSIIPIPELVLRLTQRQMESMAESFAKQELVTDNEKEELIRLSYDYLDLAVERGLDPMMSYRGRA
Ga0137364_1036177923300012198Vadose Zone SoilVKTQERIYSLNKGVESLSREEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLMASVSIAAKAMGDLRTDHEFESRMFVNREKLADAEERIVKSFGVQDSILPFPDLVSQLTKRQIESIAEGFAERGLVSSNEREELVRRSYDYLDAAVEKGLGPKMSYRGRAAGVML
Ga0137365_1022894023300012201Vadose Zone SoilVKGQERIYNLSRGVESLSKEEHGIFFDAVQRLGLPNEVKQAAVALYLDFKSRPIGEYNAGHKNLEIFLIASVSLAAKAMGELRTDHEFESKMFVSREKFVDAEERIIRSFGVQDSILPFAELVLRLAKRQIESMAESFAKRELVSSDEKEELVQRSCNYLDAAVEKGLGPKMSYRG
Ga0137365_1112383913300012201Vadose Zone SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLLDAEERIMKSFGVQDSILPFSDLVSQLTKRQIESMAEGFAERGLVSSNEREELVRRSHDYLDAAVEKGLGPKMSYRGRAA
Ga0137380_1050207813300012206Vadose Zone SoilVRARERILNLREGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKGRPVGEYNAGRKNLEIFLIASVSIAAKILGDLRTDHEFESKMLVGREKLVDAEERIIKSFGVQDSIMPLPELVSQLTRRQLESMAEGFAE
Ga0137380_1059160813300012206Vadose Zone SoilVRARERILNLREGVESLSKEEHGLFFDAIHRLGLPNEVKQAAIALYLDFKSRPVGEYNAGRKNLEIFLVASISLAAKILGHLRTDHEFESKMFVGREKLVDAEERIIKSFGVQDSIMPLPELVSQLTRRQLESMAEGFAE
Ga0137381_1007427713300012207Vadose Zone SoilVKAQERIYSLNTGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALHLDFKSRPIGEYNADHKNLEIFLIASISLAAKAMGDLRTDHEFESKMLVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEELVRRSMTTLTQQLKRV*
Ga0137381_1046643813300012207Vadose Zone SoilMNGYIFGNTDCPCAFVKAQERIYSLNRGVESLSREEHGLFFDAVQRLRLSNEVKQAAVAFYLDFKGRPIGEYNPDHKDLDIFLIASVSLAAKAIGNLRTDHEFESRMFVNREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMADG
Ga0137379_1092504513300012209Vadose Zone SoilVKAQERFYNLNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKGRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDNILPFPDLVSQLTKRQIESMADGFAERGLVSSNEREEL
Ga0137378_1014796833300012210Vadose Zone SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAIGDLRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEGFAERGLVSSNEREELVRRTYDYLDAAVERGLGPKMSY
Ga0137378_1159363613300012210Vadose Zone SoilAYIFGNTDCRCARVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLMASVSLAAKATGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDSIVPFPDLVSQLTKRQIESMADCFAERGLVSSNEREELVRRTYDYLDAAVERGLGPKMSY
Ga0137377_1192330913300012211Vadose Zone SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLLDAEERIMKSFGVQDSILPFSDLVSQLTKRQIESMAEGFAERGLVSSNEREELVRRSHDYL
Ga0137387_1026375013300012349Vadose Zone SoilVRARERILNLREGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKGRPVGEYNAGRKNLEIFLIASVSIAAKILGDLRTDHEFESKMLVGREKLVDAEERIIKSFGVQDSIMPLPELVSQLTRRQLESMAEGFAERGLVSSSEKEELVQRSYECLDAAVERGLSPKMSY
Ga0137386_1128434013300012351Vadose Zone SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLLDAEERIMKSFGVQDSILPFSDLVSQLTKRQIESMAEGFAERGLVSSNEREEL
Ga0137384_1036803113300012357Vadose Zone SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEGFAE
Ga0137360_1169552513300012361Vadose Zone SoilVKAQERFYNLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKGRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDNILTFPDLVSQLTKRQIESMADGFAERGLISGEEREELIRRSCDYLDAAVERGLGPKMSYRGRAA
Ga0137361_1110336813300012362Vadose Zone SoilLYVTPVKAQERIYNLNKGVESLSKEEHGLFFDAVLRLGLPNEVKQAAVALYLDFRGRPIGEYNSDHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLVDAEERIIRSFGVQESIIPVPDLVLQLTKRHIESMAEGFAERELV
Ga0137390_1026714133300012363Vadose Zone SoilVKAQERIYSLNRGVESLTKEEHGLFFDAVQRLGLPNEVKQAAVALYLDFKSRPIGEYNADNKNLEIFLIASISLAAKAVGDLRTDHEFESKMFVSREKLADAEERIIKSFGVQDSIIPVPDLVLQLTKRHIESIAEGFAERELVSNNEKMELV
Ga0134055_133307513300012401Grasslands SoilFYSLGKGVESLSKEEHGLFFDAMQRLGLRNEVKLAAVALYLDFKSRPIGEYNADRKNLQIFLIASVSLAAKAMGDLRTDREFESKMFVRREKLLDAEERILRSFAVQDSIIPLPELVLRLTRRQIESMAEGFAERELVSSDEKEELVRRCHSHLDATVEKGLAPKTSYRGRAAGVMLNAVRD
Ga0137396_1003482053300012918Vadose Zone SoilMKAQTRSYNLGRGVESLSKEEHGLFFDPVQRLGLSNEVKIAAVALYLDFKSRPIGEYNSDHKNLDIFLIASVSLAAKAMGDVRTDHEFESRMFVSREKLIDAEERIIRSFGVQDSIIPIPDLISQLTRRHIESMAEGFAERELVSSGEKWSWSSVPMTILMQRLKMV*
Ga0137416_1000279083300012927Vadose Zone SoilMKAQTRSYNLGRGVESLSKEEHGLFFDPVQRLGLSNEVKIAAVALYLDFKSRPIGEYNSDHKNLDIFLIVSGSLAAKAMGDVRTDHEFESRMFVSREKLIDAEERIIRSFGVQDSIIPIPDLISQLTRRHIESMAEGFAERELVSSGEKWSWSSVPMTILMQRLKMV*
Ga0137416_1013175933300012927Vadose Zone SoilVKAQERIYSLNRGVESLSKEEHGLFFDAVQRLDLRNEVKLAAVALYLDFKSRPIGEYNADQKNLRIFLIASVSLAAKAMGDQRTDREFESKMFVSGEKLVDAEDRIIRSFGVQDSLIPFPEIVSRLARRQIESMAESFADRDLVNSNEKK
Ga0137416_1152969613300012927Vadose Zone SoilMKAQERMHNLNRGVESLSKEEHGLFFDAVQRLGLPNEVKQAAVAFYLDFKSRPIGEYNADHKNLEIFLMASISLAAKVLGDLRTDNEFESKMFVSREKLADAEERIIKSFGVQDSIIPLPEFVMQLTRRQMESMAENFAEHELVSINEKEELVKRSYDYLDAAVEKGLGPKMGY
Ga0137416_1175162113300012927Vadose Zone SoilMKAQTRSYNLGRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLDIFLIASVSLAAKAMGDLRTDHEFEARMFVSREKLIDAEERIIRSFGVQDSIIPIPDLVSQLTKRHIETMAEGFAERELVSRVEKEELVQRSYDY
Ga0134077_1047052613300012972Grasslands SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLLDAEKRIIKSFGVQDSILPFPDLVSQLTKRQIESMAEGFAERGLVSSNEREELVRRSHDYLDAAVEKGL
Ga0134077_1058323113300012972Grasslands SoilGLFFDAVQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIATVSLAAKAMGDLRTDHEFESKMFVSKEKLADAEERILRSFGVQDSIMPFPDLVSQLTRRQIESMAQSFAERELVSNNEREELVRRSNEYLDAAVEKGLTPKMSYRGRAAGVTLKAIRDIG
Ga0134087_1026895613300012977Grasslands SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAIGDLRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEGFAERGLVSSNEREELVRRSHDYLDAAVEKGLGP
Ga0134081_1025650213300014150Grasslands SoilVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLLDAEKRIIKSFGVQHSILPFPDLVSQLTKRQIESMAEGFAERGLVSSNEREELVRRSHDYLDA
Ga0134112_1015314813300017656Grasslands SoilVKAQERFYSLGKGVESLSKEEHGLFFDAMQRLGLRNEVKLAAVALYLDFKSRPIGEYNADRKNLQIFLIASVSLAAKAMGDLRTDREFESKMFVRRENLLDAEERILRSFAVQDSIIPVPELVLRLTRRQIESMAEGFAERELVSSDEKEELVRPFSTAASRWL
Ga0134083_1008142113300017659Grasslands SoilLEEQIVGCPFVKTKERIYNLSRGLESLTKEEHGLFFDAMHRLGLPNEVKQAAVALYLDFKTRPIGVYNAGRKNLKIFQLASVSIAAKTLGDLRTDPEFESKMFVSREKLVDAEERMIRSFGVQDSIIPFPELVLRLTQRQMESMAESFAKQELVTDNQKEELIQLSYDYLDAAVERGLDPMMSYRGRA
Ga0187803_1012208723300017934Freshwater SedimentVKAQQRIYNLSKGVESLSTEEHGRFFDAVQRLGLSNEVKQAAVALYLDFKSRPVGEYNADHKNLDIFLIASVSLAAKAMGELRTDQEFESKLFATSEKLVDAEERILRSFGVQDSAMPFLEFVSQLTRRQIESMAESFAERGLISRSERDELVE
Ga0066667_1076426613300018433Grasslands SoilMEHTTLVATSSTLHVRCGRAIKISSLILALMNAYIFGNTDCPCALVKAQERIYSLNRGVESLSKEEHGLFFDAVQRLGLSNEVKQAAVALYLDFKSRPIGEYNADHKNLAIFLIASVSLAAKTLGDLRTDHEFESRMFVSREKLADAEERITKSFGVQDSIRPFPDLVSQLTKKQIESMADGFAERGLVGSNEREELVRRSYD
Ga0066662_1055201413300018468Grasslands SoilVKDQQRIYSLNKGVESLSKEEHGLFFDAVQSLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLDIFLIASISLSAKAMGDMRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAER
Ga0066662_1071682223300018468Grasslands SoilMERLSIWEGRSHVNPVKAQQRVYNLNKGVESLSKEEHGLFFDAMQRLGLPNEVKLAAVALYLDFKGRPIGEYNSDHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVGREKLADAEERIIKSFGVQDSIILFPDLVSQLTKRQIE
Ga0137417_123955013300024330Vadose Zone SoilLEEKTVGCPFVKTQERIYNLSRGLESLTKEEHGLFFDAMHRLGLPNEVKQAAVALYLDFKTRPIGVYNAGRKNLKIFQLASVSIAAKALGDLRTEPEFESKMFVSREKLVDAEERMIRSLGVQDSIIPIPELVLRLTKRQMESMGESFARQELVTDNEKEELVRLSYDYLDAAVERGLDKRAGSY
Ga0207646_1002105663300025922Corn, Switchgrass And Miscanthus RhizosphereMKAQTRSYNLGRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNSDHKNLEIFLIASISLAAKAVGDLRTDHEFESKMLVGKERLADAEERIIKSFGVQDSIIRFPELLLQLTRRQIESMAETFAERELVSGSEKEELVQRSYKYLDAAIEKGLAPKM
Ga0209237_122860313300026297Grasslands SoilVKAQERFYNLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKGRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLLDAEERIIKSFGVQDSILPFPDLVSQLTKRQMESMAEGFAERDLLSSTDKEELVRLSYEYLDAAVERGLGPKMSYRGRAA
Ga0209236_108714713300026298Grasslands SoilLTKEEHGLFFDAMHRLGLPNEVKQAAVALYLDFKIRPIGTYNASRKNLKIFLLASVSIAAKTLGDLRTDPEFESKMFVSREKLVDAEERMIRSFGVQDSIIPFPELVLRFTKGQMESMAESFAGQELVSDN
Ga0209236_127192413300026298Grasslands SoilLFVGQLSVAGSHVRRICPKCVIRIGRSALAPINAYIFGNTGCCRAFVKAQERMYSLNRGVESLSKEEHGLFFDTVQRLELSNEVKLAAVALYLDFKSRPIGEYNADHRNLEIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLIDAEERIIRSFGVQDSIMPIPDLVSQLTKRHIESAAEG
Ga0209469_113150113300026307SoilVKAQERFYSLSKGVDALSKEEHGLFFDAVQRLGLPNEVKQAAVALYLDFKCRPIGEYNAEHKNLEIFLIATVSLAAKAMGDLRTDHEFESKMFVSKEKLADAEERILRSFGVQDSIMRFPDLVSQLTKRQIESMAQSFAERELVSNNE
Ga0209761_101663613300026313Grasslands SoilVKAQERFYNLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKGRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESMMFVSREKLLDAEERIIKSFGVQDSILPFPDLVSQLTKRQMESMSEGFAERDLLSSTDKEELVRLSYEYLDAAVERGLGPKMSYR
Ga0209761_132153513300026313Grasslands SoilLEEQIVGCPFVKTQERIYNLSRGVESLTKEEHGLFFDAMHRLGLPNEVKQAAVALYLDFKTRPIGVYNAGRKNLKIFLLASVSIAAKVTGDLRTDPEFESKMFVSREKLVDAEERMIRSFGVQDSIIPFPELVLRLTRRQMESMGESFARQELVTDNEKEELVRLSYDYL
Ga0209471_122873713300026318SoilVKAQERIYNLNRGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVTLYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDNEFESRMFVSREKLVDAEERIIRSLGVQDHIVAFPELVSQLTKRQIEIMAEGFSQRDLISGKEKEELV
Ga0209801_117151323300026326SoilVKAQERFYSLGKGVESLSKEEHRLFFDAMQRLGLRNEVKLAAVALYLDFKSRPIGEYNADRKNLQIFLIASVSLAAKAMGDLRTDREFESKMFVRREKLLDAEERILRSFAVQDSIIPLPELVLRPTRRQIESMAEGFAERE
Ga0209802_116126423300026328SoilVKAQERFYNLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKGRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDHEFESRMFVSREKLLDAEERIIKSFGVQDSILPFPDLVSQLTKRQMESMAEGFAERDLLSS
Ga0209377_118116213300026334SoilVKAQERIYNLNKGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLQIFLIASVSLAAKAMGDLRTDREFETRMFVSREKLADAEERIIKSFGVKDSILSFPDLVSQLTKRQMETMAEGFAERDLVSSEEKEELVRLSYDYLDAAVERGLGPKMSYR
Ga0257181_100381933300026499SoilVRRICPKCAIRIGGSALAAINAYIFGNTDCCCAFVKAQEKIYSLNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLDIFLIASVSLAAKAMGDLRTDHEFESKMFVRKERLVDAEERIIRSFGVQESIIPVPDLVLQLTKRHIESMAEGFA
Ga0257181_109011413300026499SoilNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNTDHKNLEIFLIASISLAAKAMGDLRTDYEFESRMFVNREKLVDAEERIIRSFGVQESIIPVPDLVLQLTKRHIESMVEGFAERELVSNNEKMELVQRSYGYLDAALEKGLNPKMSYRGRAAGVMLKAVLD
Ga0209059_114702223300026527SoilVKAQERIYNLNRGVESLSKEEHGLFFDAMQRLGLPNEVKLAAVALYLDFKGRPIGEYNSDHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVGREKLADAEERIIRSFGVQDSIILFPDLVSQLTKRQIESMAEGFAERGLVS
Ga0209378_102127213300026528SoilLHVRCGRAIKISSLILALMNAYIFGNTDCPCAFVKAQERIYSLNRGVESLSREEHGLFFDAVQRLRLSNEVKQAAVAFYLDFKGRPIGEYNPDHKDLDIFLIASVSLAAKAIGNLRTDHEFESRMFVNREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSNEREELVRRSYDYLDAAVEKGLSPKMSYRVEPRA
Ga0209378_128048513300026528SoilVKAQERIYNLNKGVESLSKEEHGLFFDAMQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLDIFLIASISLSAKAMGEMRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMAEVFAERGLVSSDETEELV
Ga0209806_133233913300026529SoilEEQTIGCPFVKTQERIYNLSRGLESLTKEEHGLFFDATHRLGLPNEVKQAAVALYLDFKTRPIGTYNSSRKNLKIFLLASVSIAAKVTGDLRTDPEFESKMFVSREKLVDAEERMIRSFGVQDSIIPFPELVLRLTKRQMESMAESFAKQELVSDNEKEELVRLSFDYL
Ga0209160_101914413300026532SoilVKTQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLMASVSIAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDSILPFPGLVSQLTKRQIESMAEGFAERGLVSSNEREELVRRS
Ga0209160_110462313300026532SoilLTKEEHGLFFDAMHRLGLPNEVKQAAVALYLDFKTRPIGVYNAGRKNLKIFQLASVSIAAKALGDLRTGPEFESKMFVSREKLVDAEERMIRSFGVQDSIIPFPELVLQLTQRQMESMGESFARQELVTDNEKEELVRLSYDYLDAAV
Ga0209376_123242213300026540SoilMVRVLRDQSSSRSGTAMPQQEICSLTLFHMNAYIFGNTGCRCARVKAQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVSREKLLDAEERIMKSFGVQDSILPFPDLVSRLTKRQIESMAEGFAERGLVSSNEREELVRRS
Ga0209577_1009852533300026552SoilMNAYIFGNTNCCCPRVKAQEGIYSLNKGVESLSKEEHGLFFDAVQSLGLPNEVKQSAVALYLDFKSRPIGEYNADHKNLDIFLIASISLSAKAMGDMRTDHEFESKMFVSREKLADAEERIIKSFGVQDSILPFPDLVSQLTKRQIESMADRFAERGLVSSDETEELVRRSYDYLDAAVEKGLGPKM
Ga0209588_106895813300027671Vadose Zone SoilMKAQARSYNLNNGVDSLSKEEHGLFFDAMQRLGLSNDVKLAAVALYLDFKSRPIGEYNAGHKNLEIFLMASVSLAAKAMGDIRTDHEFESRMFVRREKLVDAEERIIKSFGVQEGIIPIPDLVLQLTKRHIESIAEGFAERELVSSSEKMELVRRSYAYLDVALEKGLNPKMSY
Ga0209689_128466113300027748SoilVKTQERIYSLNKGVESLSKEEHGLFFDAMQRLGLPNEVKQAAVALYLDFKSRPIGEYNADHKNLEIFLMASVSIAAKAMGDLRTDHEFESRMFVSREKLADAEERIIKSFGVQDSILPFPGLVSQLTKRQIESMAEGFAERGLVSSNEREELVRRSYDYLDAAVERGLG
Ga0209283_1049873313300027875Vadose Zone SoilVKAQERVYSLNRGVESLTREEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLDIFLIASVSLAAKAIGDLRTDHEFESKMFVSREKLIDAEERIIKSFGVQDSIIPIPDLVSQITKRHIESMAEGFAERELVSSSEKMELVRRSYDYLDAALEKGLNPK
Ga0137415_10013982103300028536Vadose Zone SoilLSKEEHGLFFDPVQRLGLSNEVKIAAVALYLDFKSRPIGEYNSDHKNLDIFLIASVSLAAKAMGDVRTDHEFESRMFVSREKLIDAEERIIRSFGVQDSIIPIPDLISQLTRRHIESMAEGFAERELVSSGEKWSWSSVPMTILMQRLKMV
Ga0137415_1121438613300028536Vadose Zone SoilMKAQTRSYNLGRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHKNLDIFLIASVSLAAKAIGDVRTDHEFESRMFVSREKLIDAEERIIRSFGVQDSIIPIPDLVSQLTKRHIESMAEGFAERELVSRVEKEELVQRSYDYL
Ga0137415_1141314213300028536Vadose Zone SoilAQERIYNLNRGVESLSKEEHGLFFDAVQRLGLPNEVKQAAVAFYLDFKSRPIGEYNADHKNLEIFLMASISLAAKVLGDLRTDNEFESKMFVSREKLADAEERIIKSFGVQDSIIPLPEFVMQLTRRQMESMAENFAEHELVSINEKEELVKRSYDYLDAAVEKGLGPKMGY
Ga0307473_1071246413300031820Hardwood Forest SoilVKASERFHSPSRGIESLSKEEHGLFFDAMHRLGLSNEVKQAAVALYLDFKRRPIGEYNADHKNLDIFLIASISLAAKVIGDLRTDQEFEARMFVIREKLVDAEERIIKSFGVGDSIMPLPQIVFQLTRRQIEIMAESFSEHQLVASGEKEE
Ga0307471_10358319813300032180Hardwood Forest SoilFFGQLSVAGSHVRRKCPKCAIRIGGSSLASINAYIFGNTSSCCVFVKAQERIYSLNRGVESLSKEEHGLFFDAVQRLGLSNEVKLAAVALYLDFKSRPIGEYNADHRNLEIFLIASVSLAAKAMGDLRTDHEFESKMFVRRERLVDAEERIIRSLGVQDSIIPIADLVSQLTKRHIESMAEGF
Ga0307472_10178337513300032205Hardwood Forest SoilMKASERFHSPSRGIESLSKEEHGLFFDAMHRLGLSNEVKQAAVALYLDFKRRPIGEYNADHKNLDIFLIASISLAAKVIGDLRTDQEFETRMFVSREKLVDAEERIIKSFGVGDSIIPLPQIVFQLTRRQIEIMAESFSERQLVISGVKEE


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.