NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F069161


Overview

Basic Information
Family ID F069161
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 138 residues
Representative Sequence WLGLWELQNIKHWSLAEVLLQLGWVVPNYFSCSLVAMPYGEGGLLDMSAFFERQRRIIFSATLALSVMGGLATYADRNNFPGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWAGVGGMFVQNIWFFVFYTLGA
Number of Associated Samples 111
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 1.61 %
% of genes near scaffold ends (potentially truncated) 95.97 %
% of genes from short scaffolds (< 2000 bps) 91.13 %
Associated GOLD sequencing projects 104
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.38
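
The percentages in the Quality Assessment block can be turned back into gene counts. The following is a minimal sketch, assuming each percentage is computed over all 124 family sequences (the page does not state the denominator, so this is an assumption); the function name `percent_to_count` is hypothetical.

```python
# Assumption: every percentage is taken over the 124 sequences
# listed under "Number of Sequences" in Basic Information.
FAMILY_SIZE = 124


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage to the nearest whole number of genes."""
    return round(percent / 100 * total)


# Percentages copied from the Quality Assessment block.
reported = {
    "valid RBS motif": 1.61,
    "near scaffold end": 95.97,
    "short scaffold (< 2000 bp)": 91.13,
}

counts = {label: percent_to_count(p) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE} genes")
```

Under that assumption, the three figures correspond to 2, 119, and 113 genes respectively.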


Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(24.194 % of family members)
Environment Ontology (ENVO) Unclassified
(37.903 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(58.065 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 65.24%, β-sheet: 1.22%, Coil/Unstructured: 33.54%

Predicted 3D Structure

pTM-score: 0.38

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
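
The low-quality rule stated in this note can be written as a one-line check. This is only a sketch of the stated cutoff (pTM < 0.7); the function name `is_low_confidence` is hypothetical and not part of NMPFamsDB.

```python
# Cutoff quoted in the note: models with pTM below 0.7 are low confidence.
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < threshold


family_ptm = 0.38  # pTM-score reported for family F069161
low_quality = is_low_confidence(family_ptm)
print("low confidence" if low_quality else "passes cutoff")
```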



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 124 Family Scaffolds
PF14559 | TPR_19 | 4.03
PF00211 | Guanylate_cyc | 4.03
PF13414 | TPR_11 | 2.42
PF13432 | TPR_16 | 1.61
PF08450 | SGL | 0.81
PF13431 | TPR_17 | 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 124 Family Scaffolds
COG2114 | Adenylate cyclase, class 3 | Signal transduction mechanisms [T] | 4.03
COG3386 | Sugar lactone lactonase YvrE | Carbohydrate transport and metabolism [G] | 0.81
COG3391 | DNA-binding beta-propeller fold protein YncE | General function prediction only [R] | 0.81



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2170459023|GZGNO2B02FVYCS | All Organisms → cellular organisms → Bacteria | 522
3300000955|JGI1027J12803_103131883 | All Organisms → cellular organisms → Bacteria | 508
3300002915|JGI25387J43893_1001563 | All Organisms → cellular organisms → Bacteria | 2701
3300004633|Ga0066395_10399424 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 774
3300004633|Ga0066395_10786777 | All Organisms → cellular organisms → Bacteria | 570
3300005179|Ga0066684_10200747 | All Organisms → cellular organisms → Bacteria | 1289
3300005186|Ga0066676_11001509 | All Organisms → cellular organisms → Bacteria | 555
3300005329|Ga0070683_102284432 | All Organisms → cellular organisms → Bacteria | 519
3300005331|Ga0070670_101003639 | All Organisms → cellular organisms → Bacteria | 759
3300005339|Ga0070660_101218188 | All Organisms → cellular organisms → Bacteria | 638
3300005354|Ga0070675_101663984 | All Organisms → cellular organisms → Bacteria | 589
3300005355|Ga0070671_101186111 | All Organisms → cellular organisms → Bacteria | 672
3300005436|Ga0070713_100862885 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 869
3300005440|Ga0070705_101430638 | All Organisms → cellular organisms → Bacteria | 577
3300005446|Ga0066686_10669631 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 703
3300005451|Ga0066681_10702737 | All Organisms → cellular organisms → Bacteria | 615
3300005471|Ga0070698_100031783 | All Organisms → cellular organisms → Bacteria | 5470
3300005540|Ga0066697_10692208 | All Organisms → cellular organisms → Bacteria | 558
3300005556|Ga0066707_10934595 | All Organisms → cellular organisms → Bacteria | 530
3300005558|Ga0066698_10061619 | All Organisms → cellular organisms → Bacteria | 2396
3300005558|Ga0066698_10865818 | All Organisms → cellular organisms → Bacteria | 580
3300005561|Ga0066699_10261313 | All Organisms → cellular organisms → Bacteria | 1225
3300005566|Ga0066693_10029891 | All Organisms → cellular organisms → Bacteria | 1715
3300005575|Ga0066702_10516393 | All Organisms → cellular organisms → Bacteria | 725
3300005576|Ga0066708_10439000 | All Organisms → cellular organisms → Bacteria | 838
3300005587|Ga0066654_10347443 | All Organisms → cellular organisms → Bacteria | 805
3300005598|Ga0066706_11318219 | All Organisms → cellular organisms → Bacteria | 546
3300005764|Ga0066903_100459972 | All Organisms → cellular organisms → Bacteria | 2132
3300005764|Ga0066903_108545418 | All Organisms → cellular organisms → Bacteria | 522
3300006028|Ga0070717_10821553 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 846
3300006032|Ga0066696_10490145 | All Organisms → cellular organisms → Bacteria | 806
3300006032|Ga0066696_10668080 | All Organisms → cellular organisms → Bacteria | 669
3300006034|Ga0066656_11127084 | All Organisms → cellular organisms → Bacteria | 505
3300006175|Ga0070712_100410414 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1120
3300006794|Ga0066658_10350662 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 790
3300006796|Ga0066665_10511858 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 982
3300006800|Ga0066660_11212896 | All Organisms → cellular organisms → Bacteria | 592
3300006806|Ga0079220_11989669 | All Organisms → cellular organisms → Bacteria | 518
3300006871|Ga0075434_100663815 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1061
3300009012|Ga0066710_102177633 | All Organisms → cellular organisms → Bacteria | 811
3300010048|Ga0126373_12224218 | All Organisms → cellular organisms → Bacteria | 609
3300010159|Ga0099796_10254638 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales | 730
3300010321|Ga0134067_10400569 | All Organisms → cellular organisms → Bacteria | 550
3300010326|Ga0134065_10200760 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 722
3300010326|Ga0134065_10366524 | All Organisms → cellular organisms → Bacteria | 570
3300010333|Ga0134080_10244928 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 789
3300010335|Ga0134063_10403603 | All Organisms → cellular organisms → Bacteria | 671
3300010335|Ga0134063_10498147 | All Organisms → cellular organisms → Bacteria | 609
3300010358|Ga0126370_10915635 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → unclassified Chthoniobacterales → Chthoniobacterales bacterium | 793
3300010362|Ga0126377_12802246 | All Organisms → cellular organisms → Bacteria | 562
3300010373|Ga0134128_12279332 | All Organisms → cellular organisms → Bacteria | 596
3300010376|Ga0126381_100433561 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1839
3300010396|Ga0134126_10610121 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1248
3300010397|Ga0134124_12694443 | All Organisms → cellular organisms → Bacteria | 540
3300012198|Ga0137364_10282010 | All Organisms → cellular organisms → Bacteria | 1231
3300012198|Ga0137364_10443482 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 973
3300012200|Ga0137382_11125945 | All Organisms → cellular organisms → Bacteria | 560
3300012208|Ga0137376_11127274 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 671
3300012285|Ga0137370_10058911 | All Organisms → cellular organisms → Bacteria | 2080
3300012354|Ga0137366_10939026 | All Organisms → cellular organisms → Bacteria | 606
3300012357|Ga0137384_10904922 | All Organisms → cellular organisms → Bacteria | 711
3300012360|Ga0137375_10088706 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 3195
3300012360|Ga0137375_10678162 | All Organisms → cellular organisms → Bacteria | 845
3300012513|Ga0157326_1076553 | All Organisms → cellular organisms → Bacteria | 540
3300012918|Ga0137396_10906352 | All Organisms → cellular organisms → Bacteria | 646
3300012923|Ga0137359_11036242 | All Organisms → cellular organisms → Bacteria | 703
3300012929|Ga0137404_10757139 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 881
3300012929|Ga0137404_11309436 | All Organisms → cellular organisms → Bacteria | 668
3300012958|Ga0164299_11215864 | All Organisms → cellular organisms → Bacteria | 571
3300012971|Ga0126369_12835854 | All Organisms → cellular organisms → Bacteria | 567
3300012975|Ga0134110_10043959 | All Organisms → cellular organisms → Bacteria | 1756
3300013296|Ga0157374_10345668 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1477
3300013308|Ga0157375_11187850 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 895
3300013308|Ga0157375_13398788 | All Organisms → cellular organisms → Bacteria | 530
3300014157|Ga0134078_10446530 | All Organisms → cellular organisms → Bacteria | 590
3300014157|Ga0134078_10630231 | All Organisms → cellular organisms → Bacteria | 517
3300015357|Ga0134072_10326660 | All Organisms → cellular organisms → Bacteria | 581
3300015371|Ga0132258_12803221 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1214
3300015372|Ga0132256_100067024 | All Organisms → cellular organisms → Bacteria | 3390
3300015373|Ga0132257_101212540 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 955
3300015374|Ga0132255_103898693 | All Organisms → cellular organisms → Bacteria | 634
3300016341|Ga0182035_11746697 | All Organisms → cellular organisms → Bacteria | 563
3300016422|Ga0182039_11967976 | All Organisms → cellular organisms → Bacteria | 537
3300017657|Ga0134074_1395942 | All Organisms → cellular organisms → Bacteria | 513
3300017993|Ga0187823_10373317 | All Organisms → cellular organisms → Bacteria | 513
3300018071|Ga0184618_10471087 | All Organisms → cellular organisms → Bacteria | 528
3300018433|Ga0066667_10694871 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 853
3300018433|Ga0066667_10987031 | All Organisms → cellular organisms → Bacteria | 726
3300018482|Ga0066669_10677994 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 908
3300019877|Ga0193722_1009145 | All Organisms → cellular organisms → Bacteria | 2514
3300019887|Ga0193729_1250565 | All Organisms → cellular organisms → Bacteria | 561
3300021418|Ga0193695_1013493 | All Organisms → cellular organisms → Bacteria | 1660
3300021445|Ga0182009_10610914 | All Organisms → cellular organisms → Bacteria | 584
3300025315|Ga0207697_10088558 | All Organisms → cellular organisms → Bacteria | 1310
3300025906|Ga0207699_10648309 | All Organisms → cellular organisms → Bacteria | 771
3300025922|Ga0207646_11148534 | All Organisms → cellular organisms → Bacteria | 682
3300025927|Ga0207687_11628485 | All Organisms → cellular organisms → Bacteria | 554
3300025929|Ga0207664_10420168 | All Organisms → cellular organisms → Bacteria | 1191
3300025930|Ga0207701_11345021 | All Organisms → cellular organisms → Bacteria | 584
3300025961|Ga0207712_11860045 | All Organisms → cellular organisms → Bacteria | 539
3300026023|Ga0207677_11027677 | All Organisms → cellular organisms → Bacteria | 748
3300026088|Ga0207641_11496203 | All Organisms → cellular organisms → Bacteria | 677
3300026304|Ga0209240_1208759 | All Organisms → cellular organisms → Bacteria | 591
3300026305|Ga0209688_1017660 | All Organisms → cellular organisms → Bacteria | 1372
3300026308|Ga0209265_1235755 | All Organisms → cellular organisms → Bacteria | 506
3300026312|Ga0209153_1059305 | All Organisms → cellular organisms → Bacteria | 1371
3300026316|Ga0209155_1263001 | All Organisms → cellular organisms → Bacteria | 534
3300026318|Ga0209471_1278654 | All Organisms → cellular organisms → Bacteria | 559
3300026498|Ga0257156_1097060 | All Organisms → cellular organisms → Bacteria | 613
3300026523|Ga0209808_1213888 | All Organisms → cellular organisms → Bacteria | 628
3300026524|Ga0209690_1036203 | All Organisms → cellular organisms → Bacteria | 2323
3300026536|Ga0209058_1052473 | All Organisms → cellular organisms → Bacteria | 2313
3300026540|Ga0209376_1018893 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Sinobacteraceae → Nevskia | 4658
3300026548|Ga0209161_10433142 | All Organisms → cellular organisms → Bacteria | 577
3300027504|Ga0209114_1115331 | All Organisms → cellular organisms → Bacteria | 503
3300027874|Ga0209465_10534638 | All Organisms → cellular organisms → Bacteria | 584
3300028715|Ga0307313_10230927 | All Organisms → cellular organisms → Bacteria | 575
3300028793|Ga0307299_10356902 | All Organisms → cellular organisms → Bacteria | 548
3300029636|Ga0222749_10273911 | All Organisms → cellular organisms → Bacteria | 865
3300031057|Ga0170834_111889770 | All Organisms → cellular organisms → Bacteria | 997
3300031122|Ga0170822_13234691 | All Organisms → cellular organisms → Bacteria | 568
3300031912|Ga0306921_12547676 | All Organisms → cellular organisms → Bacteria | 530
3300031946|Ga0310910_11054451 | All Organisms → cellular organisms → Bacteria | 634
3300031946|Ga0310910_11178174 | All Organisms → cellular organisms → Bacteria | 595
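
Each scaffold identifier in this table has the form `<number>|<name>`, and the leading number matches a Taxon OID in the Associated Samples table. The sketch below, using five rows copied from the table, splits the identifier on that assumption and counts scaffolds per sample and below the 2000 bp cutoff used in the Quality Assessment. The helper name `split_scaffold_id` is hypothetical.

```python
from collections import Counter

# (scaffold ID, length in bp) pairs copied from the Associated Scaffolds table.
scaffolds = [
    ("2170459023|GZGNO2B02FVYCS", 522),
    ("3300004633|Ga0066395_10399424", 774),
    ("3300004633|Ga0066395_10786777", 570),
    ("3300005471|Ga0070698_100031783", 5470),
    ("3300026540|Ga0209376_1018893", 4658),
]

SHORT_SCAFFOLD_BP = 2000  # cutoff named in the Quality Assessment block


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split 'taxon_oid|scaffold_name' at the first pipe.

    Assumption: the part before the pipe is the sample's Taxon OID.
    """
    taxon_oid, scaffold_name = scaffold_id.split("|", 1)
    return taxon_oid, scaffold_name


# Number of family scaffolds contributed by each sample.
per_sample = Counter(split_scaffold_id(sid)[0] for sid, _ in scaffolds)

# Scaffolds shorter than the cutoff, and their share of this subset.
short = [sid for sid, length in scaffolds if length < SHORT_SCAFFOLD_BP]
short_fraction = len(short) / len(scaffolds)

print(per_sample.most_common())
print(f"{len(short)} of {len(scaffolds)} scaffolds are shorter than {SHORT_SCAFFOLD_BP} bp")
```

Running the same count over all 124 rows would be one way to cross-check the reported share of genes from short scaffolds.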




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 24.19%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 11.29%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 8.87%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.84%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 4.84%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.03%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.03%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.23%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.42%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.42%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 2.42%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 2.42%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.61%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.61%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.81%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.81%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil | 0.81%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.81%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere | 0.81%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.81%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.81%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.81%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.81%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.81%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.81%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Rhizosphere | 0.81%
Corn, Switchgrass And Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.81%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.81%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.81%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.81%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.81%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.81%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.81%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2170459023 | Grass soil microbial communities from Rothamsted Park, UK - FA3 (control condition) | Environmental
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300002915 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm | Environmental
3300004633 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005329 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG | Environmental
3300005331 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaG | Host-Associated
3300005339 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG | Host-Associated
3300005354 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG | Host-Associated
3300005355 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG | Host-Associated
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005587 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental
3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental
3300010326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG | Environmental
3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012513 | Arabidopsis rhizosphere microbial communities from North Carolina - M.Oy.2.old.250510 | Host-Associated
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012958 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental
3300013296 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaG | Host-Associated
3300013308 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaG | Host-Associated
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300016422 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 | Environmental
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental
3300017993 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3 | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019877 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m1 | Environmental
3300019887 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c2 | Environmental
3300021418 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3s2 | Environmental
3300021445 | Bulk soil microbial communities from the field in Mead, Nebraska, USA - 072115-187_1 MetaG | Environmental
3300025315 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes) | Host-Associated
3300025906 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated
3300025929 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes) | Environmental
3300025930 | Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes) | Environmental
3300025961 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes) | Host-Associated
3300026023 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes) | Host-Associated
3300026088 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes) | Host-Associated
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026305 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 (SPAdes) | Environmental
3300026308 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes) | Environmental
3300026312 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 (SPAdes) | Environmental
3300026316 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026498 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-A | Environmental
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027504Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_O3 (SPAdes)EnvironmentalOpen in IMG/M
3300027874Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300028715Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_203EnvironmentalOpen in IMG/M
3300028793Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031122Oak Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031912Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)EnvironmentalOpen in IMG/M
3300031946Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172EnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
FA3_106016602170459023Grass SoilALLAVSMINAALGVIINWLGLWELQNIKHWSLGEVLLQLGWVVPNYFSCSLVAMPYGESGLLDMSAFFERQRRVIFSATLALAVTGGLATYLDRNNFPGWKPNDWIGAELLGLVLAVCAVLAGWAKPRWLQWVGVGRHVRSEHFVFVFYTLGS
JGI1027J12803_10313188313300000955SoilGVIINWLGLWELQNIKHWSLAEVLLQLGWVIPNYFSCSLVAMPYTEGGLIDMSAFFERQRRVIFSATLALSMMGGLAIYLDRNNFQGWKPNDWIGAELLGLALAVCAVLAGWAKPRWLQWVGVGGMFVQNMWFFVFYTLGS*
JGI25387J43893_100156313300002915Grasslands SoilLGVIINWLGLWELQNIKHWSLGEVLLQLGWVVPNYFSCSLVAMPYSESGLLDMPAFFERQRRVIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA*
Ga0066395_1039942423300004633Tropical Forest SoilSMLNATLGVIVNWLGLWELQNLKNWSLPEILLQLGWVIPNYFSCSLVAMPYRETGPLDMRAYFERQRRVIFSATLALWAMFTIVNYVDRHNFEGWKPNDWIGAELFSLPLGICAVLAGWAKPRWLQWTGVAGMLAQNLGFFIFYSLGS*
Ga0066395_1078677723300004633Tropical Forest SoilALGVIINWLGLWELQNLRHWSLGEVLLQLGWVVPNYFSCSLVAMPVSESGPLDMSAFFERQRRIIFSATLALCVMGGLANYVDRNNLEGWKPNDWIGAELLLLGLLLAVCAVLAGWAKPRWLQWVGVAGMFAQNIWFFLFYSLGS*
Ga0066684_1020074723300005179SoilLWELQNIKHWSLGEVLLQLGWVVPNYFSCSLVAMPYSESGLLDMPAFFERQRRVIFSATLALSVMGGLAIYLDRNNFEGWKPNDWIGAELLGLLLAVCAVLAGWTKPRWLQWVGVGGMFVQNICFFVFYTLGA*
Ga0066676_1100150923300005186SoilVLLQLGWVIPNYFSCSLVAMPVSENGPLDMSAFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWIGAELLGLVLAVCAVLAGWAKPRWLQWVGVGGMLVQNIWFFVFYTLGA*
Ga0070683_10228443213300005329Corn RhizosphereSMVNAALGVIINWLGLWELQNIKHWSLIEVLLQLGWVIPNYFSCSLVAMPVSENGPLDMSAFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWVGAELLGLLLAVCAVMAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0070670_10100363913300005331Switchgrass RhizosphereALGVIINWLGLWELQNLKHWSLTEVLLQLGWVIPNYFSCSLVAMSVSESGPLDMPAFFDRQRRIIFSATLALWVMGSLATYADRNNFEGWKPDDWIGAEVAALPLGICAVLAGWARPRWLQWVGVGGMFVQNVWYFLFYTLRV*
Ga0070660_10121818813300005339Corn RhizosphereLWELEHIKHWSLGEVVLLLGWVIPNYFSCSLVAMPYSEGGLLDMPAFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWVGAELLGLLLAVCAVMAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0070675_10166398413300005354Miscanthus RhizosphereWLGLWELQNIKHWSLIEVLLQLGWVIPNYFSCSLVAMPVSENGPLDMSAFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWVGAELLGLLLAVCAVMAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0070671_10118611123300005355Switchgrass RhizosphereLNAALGVIINWLGLWELQNLKHWSLTEVLLQLGWVIPNYFSCSLVAMSVSESGPLDMQAFFDRQRRIIFSATLALWVMGSLATYADRNNFEGWKPDDWIGAEVAALPLGICAVLAGWARPRWLQWVGVGGMFVQNVWYFLFYTLRV*
Ga0070713_10086288523300005436Corn, Switchgrass And Miscanthus RhizosphereQDIKHWSLAEVVLLLGWVIPNYFSCSLVAMPVSESGLLDMSAFFDRQRRVIFSATIALALMSGLATYADRNNVPGWKPNEWIEAELVGLALGVLAVLAGWAKPRWLQWVGVGGMLVQNIWYFVFYTLGS*
Ga0070705_10143063823300005440Corn, Switchgrass And Miscanthus RhizosphereLGLWELQNLKHWSLTEVLLQLGWVFPNYFSCSLVAMSVSESGPLDMQAFFDRQRRIIFSATLALWVMGSLATYADRNNFEGWKPDDWIGAEVAALPLGICAVLAGWARPRWLQWVGVGGMFVQNVWYFLFYTLRV*
Ga0066686_1066963123300005446SoilMLLQLGWVVPTYVSCSLVAMPYSEGGPLDMPAFFERQRRIIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA*
Ga0066681_1070273723300005451SoilIKHWSLAEVLLQLGWVVPNYFSCSLVAMPYSEGGLLDMSAFFERQRRVIFSATLALSVMGGLAIYLDRNNFEGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNIWFFVFYTIGF*
Ga0070698_10003178313300005471Corn, Switchgrass And Miscanthus RhizosphereAMVNAALGVIINWLGLWELQNIKHWSLGEVLLQLGWVIPNYFSCSLVAMPYSESGLLDMPAFFERQRRVIFSATVALSVMGWLATYLDRNNFEGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWLGVGGMFVQNISFFVFYTLGS*
Ga0066697_1069220823300005540SoilNAALGVIINWLGLWELQNIKHWSLAEMLLQLGWVVPNYFSCSLVAMPYSEGGPLDMPAFFERQRRIIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA*
Ga0066707_1093459513300005556SoilIKHWSLTEVLLQLGWVVPNYFSCSLVAMPVSESGPLDMPAFFERQRRIIFSATIALSVMSGLATWFDRNNFAEWKPNDWIGAEAVGLLLAVFAMLAGWARPRWLQWVGVGGMLVQNIWYFVFYTVGA*
Ga0066698_1006161913300005558SoilALLTVSMVNAALGVIINWLGLWELQNIKHWSLGEVLLQLGWVVPNYFSCSLVAMPYSESGLLDMPAFFERQRRVIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA*
Ga0066698_1086581813300005558SoilMINAALGVIINWLGLWQLENIKRWSLGEVVLQLGWVIPNYFSCSLVAMRCSESGLLDMSAFFERQRRVIFSATLALSVMGGLETYLDRNNFEGWKPNDWIAAELVGLSLAVFAMLAGWAKPRWLQWLGVGGMFVQNIWFFVFYTLGS*
Ga0066699_1026131323300005561SoilVSMVNAALGVIINWLGLWELQNIEHWTLGEVMLQLGWVVPNYFSCSLVAMPCSETGLLDMSDFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWFGAELLGLLLAVCAVLAGWAKPRWLQWAGVGGMFVQNVSFFVFYTLGS*
Ga0066693_1002989123300005566SoilVIINWLGLWELQNIKHWSLGEVLLQLGWVVPNYFSCSLVAMPYSESGLLDMSAFFERQRRVIFSATLALAVMSGLATYADRNNILGWKPNEWIEAELVGLALGVLAVLAGWARPRWLQWVGVGGMFVQNIWYFVFYTLGS*
Ga0066702_1051639313300005575SoilLGVIINWLGLWELQNIKHWSLAEMLLQLGWVVPNYFSCSLVAMPYSEGGPLDMPAFFERQRRIIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA*
Ga0066708_1043900023300005576SoilAEVLLQLGWVVPNYFSCSLVAMPYSEDGVLDMSAFFERQRRVIFSATLALAVMSGLATYADRNNILGWKPNEWIEAELVGLALGVLAVLAGWARPRWLQWVGVGGMFVQNIWYFVFYTLGS*
Ga0066654_1034744323300005587SoilWLGLWELQNIKHWSLAEMLLQLGWVVPNYFSCSLVAMPYSEGGPLDMPAFFERQRRIIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA*
Ga0066706_1131821923300005598SoilINWLGLWELEHIKHWSLGEVVLLLGWVIPNYFSCSLVAMPVSESGLLDMSAFFERQRRVIFSATIAFAVMSGLATYADRNNIPGWKPNEWIEAELVGLALGVLAVLAGWAKPRWLQWIGVGGMLVQNIWYFVFYTLGS*
Ga0066903_10045997223300005764Tropical Forest SoilLQLGWVIPNYFSCSLVAMPYSETGPLDMLDYFERQRRVIFSATLALWAMFTIVNYVDRHNFEGWKPNDWIGAELYSLPLGICAVLAGWGKPRWLQWVGVAGMLAQNLGFFIFYSLGS*
Ga0066903_10854541813300005764Tropical Forest SoilSMLNAALGVIINWLGLWDLQHLKQWSLTEVVLQLGWVIPNYFSCSLVAMPYSESGVLDMAAFFERQRRVIFSATLALWAMSSLATYFDRSNFEGWKPNDWIGAELYGLPLGICAVLAGWTKPRWLQWVGVGGLFVQNIAYFVLFTVGS*
Ga0070717_1082155323300006028Corn, Switchgrass And Miscanthus RhizosphereAEVVLLLGWVIPNYFSCSLVAMPVSESGLLDMSAFFDRQRRVIFSATIALALMSGLATYADRNNVPGWKPNEWIEAELVGLALGVLAVLAGWAKPRWLQWVGVGGMLVQNIWYFVFYTLGS*
Ga0066696_1049014513300006032SoilWLGLWELQNIKHWSLAEVLLQLGWVVPNYFSCSLVAMPYGEGGLLDMSAFFERQRRIIFSATLALSVMGGLATYADRNNFPGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWAGVGGMFVQNIWFFVFYTLGA*
Ga0066696_1066808013300006032SoilLTVSMVNAALGVIINWLGLWELQNIKHWSLGEVLLQLGWVVPNYFSCSLVAMPYSESGLLDMPAFFERQRRVIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNICFFVFYTLGA*
Ga0066656_1112708423300006034SoilNIKHWSLAEVLLQLGWVIPNYFSCSLVAMPCSETGVLDMSAFFERQRRVIFSATLALSVMGGLATYADRNNVPGWKPNEWIEAELLGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNICFFVFYTLGA*
Ga0070712_10041041423300006175Corn, Switchgrass And Miscanthus RhizosphereVVLLLGWVIPNYFSCSLVAMPVSESGLLDMSAFFDRQRRVIFSATIALALMSGLATYADRNNVPGWKPNEWIEAELVGLALGVLAVLAGWAKPRWLQWVGVGGMLVQNIWYFVFYTLGS*
Ga0066658_1035066223300006794SoilMVNAALGVIINWLGLWELQNIKHWSVAEVLLQLGWVVPNYFSCSLVAMPYSEDGVLDMSAFFERQRRVIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA*
Ga0066665_1051185823300006796SoilSLAEVLLQLGWVIPNYFSCSLVAMPYSESGLLDMSAFFERQRRVIFSATLALSVMGGLALYLDRNNLEGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0066660_1121289623300006800SoilGLWQLENIKHWSLGEVVLQLGWVVPNYFSCSLVAMPVNESGLLDMSAFFERQRRLIFSATLALSVMGGLETYLDRNSFEGWKPNDWIAAELVGLSLAVFAVLAGWAKPRWLQWLGVGGMFVQNLWYFVFYTLGS*
Ga0079220_1198966913300006806Agricultural SoilVVLQLGWVIPNYFSCSLVAMPYSESGVLDMATFFERQRRVIFSATLTLWAMSSLSTYLGRNNFEGWKPNDWIGFELFGLPLGICAVLAGWSKPLWLQWVGVGGLFVQNIAYFVFNSPGS*
Ga0075434_10066381523300006871Populus RhizosphereGVIINWLGLWELQNLKHWSLTEVLLQLGWVIPNYFSCSLVAMSVSESGPLDMPAFFDRQRRIIFSATLALWVTGSLATYADRNNFEGWKPNDWIGAELAALPLGICAVLAGWAKPRWLQWLGVGGMFVQNVWYFLFYTLSV*
Ga0066710_10217763323300009012Grasslands SoilLTVSMVNAALGVIINWLGLWELQNIKHWSLGEVLLQLGWVVPNYFSCSLVAMPCSETGVLDMSAFFERQRRVIFSATLALSVMGGLATYLDRNNFDGWKPNDWIGAELLGLALGVFAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS
Ga0126373_1222421823300010048Tropical Forest SoilNAALGVIINWLGLWDLQQLKQWSLTEVVLQLGWVIPNYFSCSLVAMPYSESGVLDMAAFFERQRRVIFSATLALWAMSSLSTYLGRNNFEGWKPNDWIGFELFGLPLGICAVLAGWSKPRWLQWIGVGGLFVQNIAYFVFNTPGS*
Ga0099796_1025463823300010159Vadose Zone SoilHWSLGEVVLQLGWVVPNYFSCSLVAMPVSESGLLDMSAFFERQRRVIFSATIALAVMSGLATYADRNNIPGWKPNEWIEAELVGLALGVLAVLAGWAKPRWLQWIGVGGMLVQNIWYFVFYTLGS*
Ga0134067_1040056913300010321Grasslands SoilVNAALGVIINWLGLWELQNIKHWSLGEVLLQLGWVVPNYFSCSLVAMPYSEGGPLDMPAFFERQRRIIFSATLALAVMGGLATYADRNNVPGWKPNEWIEAELLGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNICFFVFYTLGA*
Ga0134065_1020076013300010326Grasslands SoilGVIINWLGLWELQHIKHWSLAEVLLQLGWVIPNYFSCSLVAMPCSETGVLDMSAFFERQRRVIFSATLALSVMGGLATYADRNNVPGWKPNEWIEAELLGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA*
Ga0134065_1036652413300010326Grasslands SoilVLLQLGWVIPNYFSCSLVAMPYSEGGLLDMSAFFDRQRRVIFSATLALSVMGGLAVYLDRNNFEGWKPNDWIGAELLGLLLAICAVLAGWAKPRWLQWAGVGGMFVQNIWFFVFYTLGA*
Ga0134080_1024492813300010333Grasslands SoilSMVNAALGVIINWLGLWELQNIKHWSLAEVLLQLGWVIPNYFSCSLVAMPYSEGGLLDMSAFFERQRRVIFSATLALSVMGGLAVYLDRNNFEGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNIWFFVFYTLGS*
Ga0134063_1040360323300010335Grasslands SoilLAEVLLQLGWVVPNYFSCSLVAMPYGESGLLDMSAFFERQRRIIFSATLALSVMGGLATYADRNNFPGWKPNDWIGAELVGLSLAVFAVLAGWAKPRWLQWVGVGGMLVQNIWFFVFYTIGS*
Ga0134063_1049814713300010335Grasslands SoilSLGEVLLQLGWVVPNYFSCSLVAMPCSETGVLDMSAFFERQRRVIFSATLALSVMGGLATYLDRNNFDGWKPNDWIGAELLGLALGVFAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0126370_1091563523300010358Tropical Forest SoilTVGVLINWIGLWDLRNLKHWSPAEVLLQLGWVIPNYFSCSLVAMPCSETGSLDMSAFFERQRRVIFSATLALSVMAALATYVDRNNFEGWQPNEWISAEMVGLALAVFAVLAGWGKPRWLQWVGVGGMLVQNAWWFVFYTLGS*
Ga0126377_1280224623300010362Tropical Forest SoilVSMVNAPLGVIINWLGLWPLQNLKHWSAVEVLLQLGWVIPNYFSCSLVAMPYSENGLLDMAAFFERQRRVIFSATLALWAMSSLATYLDRTNFEGWKPNDWIGFELYGLPLGICAVLAGWSKPLWLQWVGVGGLFGQNIAYFVLVTVGS*
Ga0134128_1227933223300010373Terrestrial SoilIGLWDLQHLKHWSPAEVLLQLGWVIPNYFSCSLVAMPCSETGSLDMPAFFERQRRVIFSATLALSVMGGLATYLDRHNFEGWQPNEWISAELVGLALGVFAVLAGWAKPRWLQWVGVGGMLVQNAWWFVFYTLGS*
Ga0126381_10043356123300010376Tropical Forest SoilTVVVLQLGWIIPNYFSCSLAAMSYGESGVLDMATFFERQRRVIFSATLALWAMSSLSIYLGRDNFEGWKPNEWIGFELYGLPLGICAVLAGWSKPLWLQWVGVGGLFVQNIAYFVLFSPGS*
Ga0134126_1061012123300010396Terrestrial SoilVSMVNAALGVIINWLGLWDLQNLKHWSLGEVLLQLGWVIPNYFSCSLVAMPVSESGLLDMPGFFGRQRQVIFSATLALSLMGGLATYLDRNNFEGWKPNDWIGAELAALPLGICAVLAGWAKPRWLQWVGVGGMFVQNVWYFLFYTLSV*
Ga0134124_1269444313300010397Terrestrial SoilGVIINWLGLWELQNLKHWSLTEVLLQLGWVIPNYFSCSLVAMSVSESGPLDMPAFFDRQRRIIFSATLALWVTGSLATYADRNNFEGWKPNDWIGAELAALPLGICAVLAGWAKPRWLQWFGVGGMFVQNVWYFLFYTLSV*
Ga0137364_1028201033300012198Vadose Zone SoilALGVIINWLGLWELEHIKHWSLGEVVLQLGWVVPNYFSCSLVAMPYSEGGLLDMPAFFERQRRVIFSATLALSVMGGLATYADRNNFPGWKPTDWIGAELVGLSLAVFAVLAGWAKPRWLQWIGVGWDVRAEHLVFCFLHARFLTRS*
Ga0137364_1044348213300012198Vadose Zone SoilIKHWSLAEVLLQLGWVIPNYFSCSLVAMPYSEGGLLDMSGFFERQRRVIFSATLTLSVMGGLAIYLDRNNFQGWKSNDWIGAELLGLLLGVFAVLAGWSKPRWLQWVGVGGMFVQNIWFFVFYTLGS*
Ga0137382_1112594513300012200Vadose Zone SoilIINWLGLWELEHIKHWSLGEVVLLLGWVIPNYFSCSLVAMPVSESGLLDMSAFFERQRRVIFSATIALAVMSGLATYADRNNIPGWKPNEWIEAELVGLALGVLAVLAGWGRPRWLQWIGVGGMLVQNIWYFVFYTLGSQ*
Ga0137376_1112727423300012208Vadose Zone SoilGLWELQNLKHWSLGEVLLQLGWVIPNYFSCSLVAMPCSETGVLDMPAFFERQRRIIFSATLALSVMGGLATYADRNNFPGWKPNDWIGAELVGLSLAVFAVLAGWARPRWLQWVGVGGMFVQNIWFFVFYTLGSQ*
Ga0137370_1005891113300012285Vadose Zone SoilMPYSEGGLLDMSAFFERQRRVIFSATLALSVMGGLAVYLDRNNFEGWKPNDWIGAELLGLLLAICAVLAGWGKAALAAMGRDWRHVRSEHFVFCFLHTWLLAKSHGQSQLLCRAEAA*
Ga0137366_1093902623300012354Vadose Zone SoilTVSMVNAALGVIINWLGLWELQNIKHWSLAEVLLQLGWVIPNYFSCSLVAMPYSEGGLLDMSAFFDRQRRVIFSATLALSVMGGLAVYLDRNNFEGWKPNDWIGAELLGLLLAICAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0137384_1090492213300012357Vadose Zone SoilMVNAALGVIINWLGLWALQNIKHWSPPEVLLQLGWVIPNYFSCSLVAMPYSESGLLDMSAFFERQRRVIFSATLALSVMSGLATYADRNNVPGWKPNEWIEAELVGLALGVFAVLAGWAKPRWLQWIGVGGMLVQNIWYFVFYTVGA*
Ga0137375_1008870613300012360Vadose Zone SoilNIKHWSLAEVLLQLGWVIPNYFSCSLVAMPYSESGILDMPAFFERQRRVIFSATLALSVMGGLATYVDRNNFQGWKPNDWIGAELLGLSLGVFAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0137375_1067816213300012360Vadose Zone SoilWSLAEVLLQLGWVIPNYFSCSLVAMPYGESELLDMPAFFERQRRVIFSATLALSVMGGLAVYLDRNNFQGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNMFFFVFYTLGS*
Ga0157326_107655323300012513Arabidopsis RhizosphereLTEVLLQLGWIIPNYFSCSLVAMSVSESGPLDMPAFFDRQRRIIFSATLALWVTGSLATYADRNNFEGWKPDDWIGAEVAALPLGICAVLAGWARPRWLQWVGVGGMFVQNVWYFLFYTLRV*
Ga0137396_1090635223300012918Vadose Zone SoilLNAALGVIINWLGLWQLESIKHWTLGEVLLQIGWVIPNYFSCSLVAMPCSETGALDMAAYFERQRRVIFSATLALAVMGGLATYADRNNVPGWKPNEWIEAELVGLVLGVFAVLAGWAKPRWLQWIGVGGMLIQNIWYFVFSTLGS*
Ga0137359_1103624213300012923Vadose Zone SoilIINWLGLWELQNIKHWSLAEVLLQLGWVVPNYFSCSLVAMPYGEGGLLDMSAFFERQRRVIFSATIALAVMSGLATYADRNNIPGWKPNEWIEAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWYFVFYTLGS*
Ga0137404_1075713923300012929Vadose Zone SoilVNAALGVIINWLGLWGLQNIKHWSLAEVLLQLGWVIPNYFSCSLVAMPYSEGGLLDMSAFFERQRRVIFSATLALSVMGGLAVYLDRNNFQGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0137404_1130943613300012929Vadose Zone SoilSMVNAALGVIINWLGLWGLQNIKHWSLAEVLLQLGWVVPNYFSCSLVAMPYSEGGLLDMSAFFERQRRVIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWAKPRWLQWVGVGGMFVQNIWFFVFYTLGS*
Ga0164299_1121586413300012958SoilHWSLAEVVLLLGWVIPNYFSCSLVAMPVSESGLLDMSAFFDRQRRVIFSATIALALMSGLATYADRNNVPGWKPNEWIEAELVGLALGVLAVLAGWAKPRWLQWVGVGGMLVQNILYFVFYTLGS*
Ga0126369_1283585413300012971Tropical Forest SoilIVNWLGLWELQNLKNWSLPEILLQLGWVIPNYFSCSLVAMPYRETGPLDMRAYFERQRRVIFSATLALWAMFTIVNYVDRHNFEGWKPNDWIGAELYTLPLGICAVLAGWAKPRWLQWTGVAGMLAQNLGFFIFYTLGS*
Ga0134110_1004395913300012975Grasslands SoilMVNAALGVIINWLGLWELQNIKHWSLAEVLLQLGWVVPNYFSCSLVAMPYGEGGLLDMPAFFERQRRVIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA*
Ga0157374_1034566813300013296Miscanthus RhizosphereMVNAALGVIINWLGLWELQNIKHWSLIEVLLQLGWVIPNYFSCSLVAMPVSENGPLDMSTFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWVGAELLGLLLAVCAVMAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0157375_1118785013300013308Miscanthus RhizosphereWLGLWELQNLKHWSLTEVLLQLGWVIPNYFSCSLVAMSVSESGPLDMPAFFDRQRRIIFSATLALWVTGSLATYADRNNFEGWKPNDWIGAELAALPLGICAVLAGWAKPRWLQWVGVGGMFVQNVWYFLFYTLRV*
Ga0157375_1339878813300013308Miscanthus RhizosphereQNIKHWSLIEVLLQLGWVIPNYFSCSLVAMPVSENGPLDMSAFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWVGAELLGLLLAVCAVMAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0134078_1044653023300014157Grasslands SoilVIINWLGLWQLENIKHWSLGEVVLQLGWVIPNYFSCSLVAMRCSESGLLDMSAFFERQRRVIFSATLALSVMGGLETYLDRNNFEGWKPNDWIAAELVGLSLAVFAMLAGWAKPRWLQWLGVGGMFVQNIWFFVFYTLGS*
Ga0134078_1063023123300014157Grasslands SoilWELQNIKHWSPPEVLLQLGWVIPNYFSCSLVAMPYSESGLLDMSAFFERQRRIIFSATLALSVMGGLALYLDRNSFEGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGA*
Ga0134072_1032666023300015357Grasslands SoilSMVNAALGVIINWLGLWELQNIKHWSLGEVLLQLGWVVPNYFSCSLVAMPYSESGLLDMPAFFERQRRVIFSATLALSVMGGLATYLDRNHFEGWKPNDWIGAELLGLLLAVCAVLAGWARPRWLQWVGVGGMFVQNISFFVFCTLGS*
Ga0132258_1280322123300015371Arabidopsis RhizosphereWLLSVSMLNAALGVIINWLGLWELQNLKHWSLTEVLLQLGWIIPNYFSCSLVAMSVSEGGPLDMQAFFDRQRRIIFSATLALWVMGSLATYADRNNFEGWKPDDWIGAEVAALPLGICAVLAGWARPRWLQWVGVGGMFVQNVWYFLFYTLRV*
Ga0132256_10006702443300015372Arabidopsis RhizosphereGLWELQNIKQWSLAEVLLQLGWVIPNYFSCSLVAMPYSETGLLDMPAFFERQRRVIFSATLALSIMGGLATYFDRNNIMGWKPNEWIGAELVGLSLAVFAVMAGWGKPRWLQWLGVGGMFVQNIWFFVFYTLGS*
Ga0132257_10121254013300015373Arabidopsis RhizosphereVIINWLSLWQLEKLKRWTLGEVLLQLGWVIPNYFSCSLVAMPYSESGVLDMAAFFERQRRVIFSATVALWVMSSLATYLDRSNFEGWKPNDWIGFELYGLPLGICAMLAGWAKRRWLQWVGVGGLFVQNIAYFVLFSVGS*
Ga0132255_10389869313300015374Arabidopsis RhizosphereHWSLFEVLLQLGWVVPNYFSCSLVAMPYSEGGLLDMSAFFERQRRVIFSATLALSVMGGLATYLDRNDFQGWKPNDWVGAELLGLLLAVCAVMAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS*
Ga0182035_1174669723300016341SoilEVVLQLGWVIPNYFSCSLVAMPYSESGLLDMAAFFERQRRAIFSATLALWAMSSVTTYLDRNNFEGWKPNDWMCFELFGLPLGICAVLASWSKPLWLQWVGVGGLFVQNIAYFVFNTPGC
Ga0182039_1196797623300016422SoilAVSMLNATLGVIVNWLGLWELQNLKNWSLPEILLQLGWVIPNYFSCSLVAMPYRETGPLDMRAYFERQRRVIFSATLALWAMFTIVNYIDRHNFEDWKPNDWIGAELYALPLGVCAVLAGWAKPRWLQWIGVTGMLAQNLGFFVFYRLGS
Ga0134074_139594213300017657Grasslands SoilSLGLWELQNIKHWSVAEVLLQLGWVVPNYFSCSLVAMPYSEDGVLDMSAFFERQRRVIFSATLALAVMSGLATYADRNNILGWKPNEWIEAELVGLALGVLAVLAGWARPRWLQWVGVGGMFVQNIWYFVFYTLGS
Ga0187823_1037331723300017993Freshwater SedimentLWELQNLKHWSLPEILLQLGWVIPNYFSCSLVAMPYRETGTLDMSDYFQRHRRVIFSATLALWAMFTIVNYVDRHNFEGWKPNDWIGAELYTLPLGICAVLAGWGKTRWLQWAGVAGMLAQNLGFFVFYSLGT
Ga0184618_1047108723300018071Groundwater SedimentWTLGEVLLQLGWVIPNYFSCSLVAMPCSETGALDMAAYFERQRRVIFSATLALAVMGGLATYADRNNVPGWKPNEWIEAELVGLALGVFAVLAGWAKPRWLQWVGVGGMFVQNIWFFVFYTLGS
Ga0066667_1069487123300018433Grasslands SoilWSLGEVVLQLGWVIPNYFSCSLVAMRCSESGLLDMSAFFERQRRVIFSATLALSVMGGLETYLDRNNFEGWKPNDWIAAELVGLSLAVFAMLAGWAKPRWLQWLGVGGMFVQNIWFFVFYTLGS
Ga0066667_1098703123300018433Grasslands SoilVLTLTLINATIGVSIKWFGLWVLQNIQHWSLSEILLQHGWVIPNYISCSLVAMPYSEGGLLDMSAFFDRQRRVIFSATLALSVMGGLAVYLDRNNFEGWKPNDWIGAELLGLLLAICAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS
Ga0066669_1067799413300018482Grasslands SoilSALLTVSMVNAALGVIINWLGLWELQNIKHWSLAEVLLQLGWVIPNYFSCSLVAMPYSEGGLLDMSGFFERQRRVIFSTTLTLSVMGGLAIYLDRNNFQGWKSNDWIGAELLGLLLGVFAVLAGWSKPRWLQWVGVGGMFVQNIWFFVFYTLGS
Ga0193722_100914533300019877SoilMQHWSPAEVLLQLGWVVPNYFSCSLVAMPYSEGGLLDMPAFFEQKRRVIFSATLALSVMGGLANYLDRNNFEGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNIWFFVFYTLGS
Ga0193729_125056513300019887SoilMINAALGVIINWLGLWELQNLKHWSLGEVLLQLGWVVPNYFSCSLVAMPVSENGPLDMSAFFGRQRRVIFSATLALSVMGGLATYVDRNNFPGWKPNDWIGAELLGLMLAVCAVLAGWAKPRWLQWVGVGGMFVQKICFFVFYTLGS
Ga0193695_101349313300021418SoilIINWLGLWQLENIKHWTLGEVLLQLGWVIPNYFSCSLVAMPCSETGALDMAAYFERQRRVIFSATLALAVMGGLATYADRNTFPGWKPNEWIEAELVGLALGVFAVLAGWAKPRWLQWVGVGGMFVQNIWFFVFYTLGS
Ga0182009_1061091423300021445SoilVSMLNAALGVIINWLGLWELQNLKHWSLTEVMLQLGWVIPNYFSCSLVAMPRGESGPLDMAAFFQRQRRVIFSATLALWLMGSIATYADRNNFAGWKPDEWIGAEWAALPLGICAVLAGWGKSRWVQWLGVAGLFVQNVCYFLFYTLGS
Ga0207697_1008855823300025315Corn, Switchgrass And Miscanthus RhizosphereINWLGLWELQNIKHWSLIEVLLQLGWVIPNYFSCSLVAMPVSENGPLDMSAFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWVGAELLGLLLAVCAVMAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS
Ga0207699_1064830923300025906Corn, Switchgrass And Miscanthus RhizosphereVIINWLGLWELQNIKHWSLAEVLLQLGWVVPNYFSCSLVAMPVSENGPLDMSAFFGRQRRVIFSATLALAVMGGLATYLDRNSFQGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNICFFVFYTLGS
Ga0207646_1114853413300025922Corn, Switchgrass And Miscanthus RhizosphereALLTVSMVNAALGVIINWLGLWELQNLKHWTLGEVLLQLGWVIPNYFSCSLVAMPYSESGLLDMPAFFERQRRVIFSATLALSVMGGLATYLDRNNFEGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWLGVGGMFVQNISFFVFYTLGS
Ga0207687_1162848513300025927Miscanthus RhizosphereWSLPGILLQLGWVIPNYFSCSLVAMPYSETGPLDMPAYFERQRRVIFSATLALWAMFTIVNYADRHNFEGWKPNDWIGAELFTLPLGICAVLAGWGKPRWLQWVGVAGMLAQNLGFFVFYTLGS
Ga0207664_1042016813300025929Agricultural SoilWELQNLKHWSLPEILLQLGWVIPNYFSCSLVAMPYRETGTLDMSDYFQRQRRVIFSATLALWAMFTIVNYVDRHNFEGWKPNDWIGAELYTLPLGICAVLAGWGKPRWLQWIGVGGMLVQNAWYFVFYTLGS
Ga0207701_1134502113300025930Corn, Switchgrass And Miscanthus RhizosphereLNAALGVIINWLGLWELQNLKHWSLTEVLLQLGWVIPNYFSCSLVAMSVSESGPLDMPAFFDRQRRIIFSATLALWVTGSLATYADRNNFEGWKPNDWIGAELAALPLGICAVLAGWAKPRWLQWVGVGGMFVQNVWYFLFYTLSV
Ga0207712_1186004513300025961Switchgrass RhizosphereSMVNATLGVIINWLGLWELQNIKHWSLVEVLLQLGWVIPNYFSCSLVAMPYSENELLDMSAFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWVGAELLGLLLAVCAVLAGWTKPRWLQWVGVGGMLVQNISFFVFYTLGS
Ga0207677_1102767733300026023Miscanthus RhizosphereMLNAALGVIINWLGLWELQNLKHWSLTEVLLQLGWVIPNYFSCSLVAMSVSESGPLDMPAFFDRQRRIIFSATLALWVMGSLATYADRNNFEGWKPDDWIGAEVAALPLGICAVLAGWARPRWLQWVGVGGMFVQNVWYFLFYTLRV
Ga0207641_1149620323300026088Switchgrass RhizosphereQNLKHWSLTEVLLQLGWVIPNYFSCSLVAMSVSESGPLDMPAFFDRQRRIIFSATLALWVMGSLATYADRNNFEGWKPNDWIGAELAALPLGICAVLAGWARPRWLQWVGVGGMFVQNVWYFLFYTLRV
Ga0209240_120875913300026304Grasslands SoilQNIKHWSLAEVLLQLGWVVPNYFSCSLVAMPVSENGPLNMSAFFGRQRRVIFSATLALAVMGGLATYLDRNNFQGWKPNDWVGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNICFFVFYTLGS
Ga0209688_101766013300026305SoilALGVIINWLGLWQLENIKHWSLGEVVLQLGWVIPNYFSCSLVAMPYSEDGVLDMSAFFERQRRVIFSATLALAVMSGLATYADRNNILGWKPNEWIEAELVGLALGVLAVLAGWARPRWLQWVGVGGMFVQNIWYFVFYTLGS
Ga0209265_123575523300026308SoilWLGLWELQNIKHWSLAEMLLQLGWVVPNYFSCSLVAMPYSEGGPLDMPAFFERQRRIIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA
Ga0209153_105930513300026312SoilHWTLGEVMLQLGWVVPNYFSCSLVAMPCSETGLLDMSDFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWAGVGGMFVQNVSFFVFYTLGS
Ga0209155_126300113300026316SoilQTIKHWSLAEVLLQLGWVVPNYFSCSLVAMPYSEGGLLDMSAFFERQRRVIFSATLALSVMGGLAIYLDRNNFEGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNIWFFCFLHAWLLTASNEQSQLLRRAEARNS
Ga0209471_127865423300026318SoilEVLLQLGWVIPNYFSCSLVAMPCSETGLLDMSDFFERQRRVIFSATLALSVMGGLATYLDRNNFQGWKPNDWIGAELLGLLLAVCAVLAGWAKPRWLQWAGVGGMFVQNVSFFVFYTLGS
Ga0257156_109706013300026498SoilGVIINWLGLWELQNMQHWSPAEVLLQLGWVVPNYFSCSLVAMPYSEGGLLDMPAFFEQKRRVIFSATLALSVMGGLANYLDRNNFEGWKPNDWIGAELLGMLLAICAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS
Ga0209808_121388813300026523SoilWELQNIKHWSVAEVLLQLGWVVPNYFSCSLVAMPYSEDGVLDMSAFFERQRRVIFSATLALAVMSGLATYADRNNILGWKPNEWIEAELVGLALGVLAVLAGWARPRWLQWVGVGGMFVQNIWYFVFYTLGS
Ga0209690_103620313300026524SoilWLGLWELQNIKHWSLAEMLLQLGWVVPNYFSCSLVAMPYSGGGPLDMPAFFERQRRIIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA
Ga0209058_105247323300026536SoilNWLGLWELEHIKHWSLGEVVLQLGWVVPNYFSCSLVAMPYSESGLLDMPAFFERQRRVIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA
Ga0209376_101889313300026540SoilGVIINWLGLWELQNIKHWSLAEMLLQLGWVVPNYFSCSLVAMPYSEGGPLDMPAFFERQRRIIFSATLALAVMGGLATYADRNNVPGWKPNEWIGAELVGLALGVFAVLAGWGRPRWLQWIGVGGMLVQNIWFFVFYTLGA
Ga0209161_1043314223300026548SoilWELEHIKHWSLGEVVLLLGWVIPNYFSCSLVAMPVSESGLLDMSAFFDRQRRVIFSATIALAVMSGLATYADRNNIPGWKPNEWIEAELVGLALGVLAVLAGWAKPRWLQWIGVGGMLVQNIWYFVFYTLGS
Ga0209114_111533123300027504Forest SoilNWLGLWELQNLKHWSLPEILLQLGWVIPNYFSCSLVAMPYRETGPLDMPDYFERQRRVIFSATLALWAMFTIVNYVDRHNFEGWKPNDWIGAELYSLPLGICALLAGWGKPRWVQWIGVAGMLAQNLGFFVFYSLGN
Ga0209465_1053463813300027874Tropical Forest SoilALGVIINWLGLWELQNLRHWSLGEVLLQLGWVVPNYFSCSLVAMPVSESGPLDMSAFFERQRRIIFSATLALCVMGGLANYVDRNNLEGWKPNDWIGAELLLLGLLLAVCAVLAGWAKPRWLQWVGVAGMFAQNIWFFLFYSLGS
Ga0307313_1023092713300028715SoilLGLWQLENIKHWTLGEVLLQLGWVIPNYFSCSLVAMPCSETGALDMAAYFERQRRVIFSATLALAVMGGLATYADRNTFPGWKPNEWIEAELVGLALGVFAVLAGWAKPRWLQWVGVGGMFVQNIWFFVFYTLGS
Ga0307299_1035690223300028793SoilINWLGLWQLENIKHWTLGEVLLQLGWVIPNYFSCSLVAMPCSETGALDMAAYFERQRRVIFSATLALAVMGGLATYADRNTFPGWKPNEWIEAELVGLALGVFAVLAGWAKPRWLQWVGVGGMFVQNIWFFVFYTLGF
Ga0222749_1027391123300029636SoilALLTVSMVNAALGVIINWLGLWELQNIKHWSLGEVLLQLGWVVPNYFSCSLVAMPCSESGPLDMPAFFERQRRVIFSATLALSVMGGLATYLDRNNFEGWKPNEWIGAEVVGLSLAVFAVLAGWGKPRWLQWVGVGGMFLQNIWFFVFYTLGS
Ga0170834_11188977023300031057Forest SoilMINAALGVIINWLGLWELQNIKHWSLAEVLLQLGWVVPNYFSCSLVAMSYSESGLLDMPAFFERQRRVIFSATLALSVMGGLAVYLDRNNFQGWKPNDWVGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS
Ga0170822_1323469113300031122Forest SoilWLGLWELQNIKHWSLAEVLLQLGWVVPNYFSSLVAMSYSESGLLDMPAFFERQRRVIFSATLALSVMGGLAVYLDRNNFQGWKPNDWVGAELLGLLLAVCAVLAGWAKPRWLQWVGVGGMFVQNISFFVFYTLGS
Ga0306921_1254767613300031912SoilWLLSVSMLNAALGVIINWLGLWDLQHLKHWSLTEVVLQLGWVIPNYFSCSLVAMPYSESGVLDMATFFERQRRVIFSATVALWAMSSLSTYLGRNNFEGWKPNDWMCFELFGLPLGICAVLAGWLKPLWLQWVGVGGLFVQNIAYFVLFRPGS
Ga0310910_1105445123300031946SoilALGVIVNWLGLWELQNLKNWSLPEILLQLGWVIPNYFSCSLVAMPYQETGPLDMRTYFERQRRLIFSATLALWAMFTIVNYVDRHNFEGWKPNEWIGAELYTLPLGICAVLAGWAKPRWLQWVGVAGMLAQNLGFFIFYRL
Ga0310910_1117817423300031946SoilWQLRNFKDWSFTEVVLQLGWVIPNYFSCSLVAMPYSESGLLDMAAFFERQRRVIFSATLALWAMSSVTTYLDRNNFEGWKPNDWIGAEVYGLPLGVCAVLAGWSKPRWLQWVGVGGQFVQNIAYFFLFTLGS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"