NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F101777


Overview

Basic Information
Family ID F101777
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 240 residues
Representative Sequence MSSTALISRFLVGALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVIAIEGDEPMRIIRGAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL
Number of Associated Samples 84
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 55.88 %
% of genes near scaffold ends (potentially truncated) 33.33 %
% of genes from short scaffolds (< 2000 bps) 51.96 %
Associated GOLD sequencing projects 72
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.32

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (26.471 % of family members)
Environment Ontology (ENVO) Unclassified (31.373 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (63.725 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification Globular
Signal Peptide Yes
Secondary Structure Distribution α-helix 18.61%, β-sheet 11.31%, Coil/Unstructured 70.07%

Predicted 3D Structure

pTM-score: 0.32

Low Quality Model:

This family has a low-confidence model (pTM-score 0.32, below the 0.7 threshold) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF01784 | NIF3 | 23.53
PF00487 | FA_desaturase | 21.57
PF00486 | Trans_reg_C | 3.92
PF06649 | DUF1161 | 2.94
PF02518 | HATPase_c | 2.94
PF03952 | Enolase_N | 2.94
PF00113 | Enolase_C | 1.96
PF00512 | HisKA | 1.96
PF09685 | DUF4870 | 1.96
PF01925 | TauE | 0.98
PF00072 | Response_reg | 0.98
PF00294 | PfkB | 0.98
PF05222 | AlaDh_PNT_N | 0.98
PF13336 | AcetylCoA_hyd_C | 0.98
PF02826 | 2-Hacid_dh_C | 0.98
PF01694 | Rhomboid | 0.98
PF03447 | NAD_binding_3 | 0.98
PF00324 | AA_permease | 0.98
PF13673 | Acetyltransf_10 | 0.98
PF02887 | PK_C | 0.98
PF13508 | Acetyltransf_7 | 0.98
PF09424 | YqeY | 0.98
PF00141 | peroxidase | 0.98
PF00141peroxidase 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG0327 | Putative GTP cyclohydrolase 1 type 2, NIF3 family | Coenzyme transport and metabolism [H] | 23.53
COG3323 | PII-like insert in the uncharacterized protein YqfO, YbgI/NIF3 family | Function unknown [S] | 23.53
COG1398 | Fatty-acid desaturase | Lipid transport and metabolism [I] | 21.57
COG3239 | Fatty acid desaturase | Lipid transport and metabolism [I] | 21.57
COG0148 | Enolase | Carbohydrate transport and metabolism [G] | 4.90
COG0376 | Catalase (peroxidase I) | Inorganic ion transport and metabolism [P] | 0.98
COG0469 | Pyruvate kinase | Carbohydrate transport and metabolism [G] | 0.98
COG0531 | Serine transporter YbeC, amino acid:H+ symporter family | Amino acid transport and metabolism [E] | 0.98
COG0705 | Membrane-associated serine protease, rhomboid family | Posttranslational modification, protein turnover, chaperones [O] | 0.98
COG0730 | Sulfite exporter TauE/SafE/YfcA and related permeases, UPF0721 family | Inorganic ion transport and metabolism [P] | 0.98
COG0833 | Amino acid permease | Amino acid transport and metabolism [E] | 0.98
COG1113 | L-asparagine transporter or related permease | Amino acid transport and metabolism [E] | 0.98
COG1115 | Na+/alanine symporter | Amino acid transport and metabolism [E] | 0.98



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300004268|Ga0066398_10075866 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 736
3300005163|Ga0066823_10000864 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3351
3300005165|Ga0066869_10007405 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1419
3300005181|Ga0066678_10023655 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3259
3300005186|Ga0066676_10799244 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 639
3300005187|Ga0066675_10215381 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1356
3300005332|Ga0066388_100029216 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 5123
3300005332|Ga0066388_100043500 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4485
3300005332|Ga0066388_100268454 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2348
3300005434|Ga0070709_10356834 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1082
3300005436|Ga0070713_100286463 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1513
3300005447|Ga0066689_10078276 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1849
3300005533|Ga0070734_10000112 | All Organisms → cellular organisms → Bacteria | 226411
3300005533|Ga0070734_10001009 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 46000
3300005537|Ga0070730_10000387 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 52664
3300005537|Ga0070730_10023652 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 4726
3300005537|Ga0070730_10223135 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1251
3300005552|Ga0066701_10153055 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1389
3300005557|Ga0066704_10242606 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1223
3300005559|Ga0066700_10481618 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 868
3300005561|Ga0066699_10625433 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 773
3300005568|Ga0066703_10109772 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1630
3300005598|Ga0066706_10034638 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3264
3300005764|Ga0066903_101918897 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1135
3300006046|Ga0066652_100486096 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1144
3300006173|Ga0070716_100026940 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3082
3300006797|Ga0066659_10501376 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 973
3300006903|Ga0075426_10009176 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 7041
3300006914|Ga0075436_100470239 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 917
3300007788|Ga0099795_10380393 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 637
3300009012|Ga0066710_100592237 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1681
3300009137|Ga0066709_100847032 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1328
3300010043|Ga0126380_10005885 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 5226
3300010043|Ga0126380_10143947 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1515
3300010046|Ga0126384_10014985 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4907
3300010046|Ga0126384_10037550 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3263
3300010358|Ga0126370_10081698 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2175
3300010358|Ga0126370_10411786 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1114
3300010359|Ga0126376_10005932 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium KBS 83 | 7336
3300010359|Ga0126376_10006720 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium KBS 83 | 6952
3300010360|Ga0126372_10267197 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1480
3300010360|Ga0126372_10410625 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1239
3300010362|Ga0126377_10068374 | All Organisms → cellular organisms → Bacteria | 3153
3300010362|Ga0126377_10221871 | All Organisms → cellular organisms → Bacteria | 1824
3300010366|Ga0126379_10079770 | All Organisms → cellular organisms → Bacteria | 2829
3300010398|Ga0126383_10041020 | All Organisms → cellular organisms → Bacteria | 3777
3300010398|Ga0126383_10363526 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1472
3300011271|Ga0137393_10061336 | All Organisms → cellular organisms → Bacteria | 2970
3300012202|Ga0137363_10088600 | All Organisms → cellular organisms → Bacteria | 2324
3300012205|Ga0137362_10029376 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4339
3300012349|Ga0137387_10042839 | All Organisms → cellular organisms → Bacteria | 2986
3300012351|Ga0137386_10332264 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1094
3300012359|Ga0137385_10830166 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 767
3300012361|Ga0137360_10001376 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 14333
3300012683|Ga0137398_10134090 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1594
3300012685|Ga0137397_10009248 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 6860
3300012917|Ga0137395_10255903 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1231
3300012922|Ga0137394_10072015 | All Organisms → cellular organisms → Bacteria | 2881
3300012923|Ga0137359_10112678 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2409
3300012924|Ga0137413_10555844 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 853
3300012925|Ga0137419_11031252 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 683
3300012929|Ga0137404_11349633 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 658
3300012944|Ga0137410_10000712 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 23056
3300012944|Ga0137410_10010545 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 6259
3300012944|Ga0137410_10010783 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 6188
3300012971|Ga0126369_10187185 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1984
3300012971|Ga0126369_10980555 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 932
3300014166|Ga0134079_10064721 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1320
3300015053|Ga0137405_1145777 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 5238
3300015053|Ga0137405_1233846 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 648
3300015241|Ga0137418_10122473 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2317
3300015242|Ga0137412_10020030 | All Organisms → cellular organisms → Bacteria | 5442
3300015245|Ga0137409_10018003 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 6989
3300015264|Ga0137403_10002217 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 23897
3300016294|Ga0182041_11314583 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 662
3300018482|Ga0066669_10200155 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1528
3300021168|Ga0210406_10000006 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 350090
3300021170|Ga0210400_10000009 | All Organisms → cellular organisms → Bacteria | 447643
3300021478|Ga0210402_10003952 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 13701
3300024178|Ga0247694_1000072 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 41927
3300024181|Ga0247693_1000033 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 55039
3300024182|Ga0247669_1000088 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 48791
3300024182|Ga0247669_1027244 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 969
3300024224|Ga0247673_1010349 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1195
3300024290|Ga0247667_1017978 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1380
3300024330|Ga0137417_1491066 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1619
3300026301|Ga0209238_1138396 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 774
3300026310|Ga0209239_1084281 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1369
3300026320|Ga0209131_1138946 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1243
3300026548|Ga0209161_10049107 | All Organisms → cellular organisms → Bacteria | 2771
3300027288|Ga0208525_1004728 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1416
3300027748|Ga0209689_1203934 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 863
3300027826|Ga0209060_10000007 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1505266
3300027826|Ga0209060_10001075 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 33275
3300027857|Ga0209166_10001476 | All Organisms → cellular organisms → Bacteria | 19491
3300028146|Ga0247682_1000027 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 72295
3300028536|Ga0137415_10678171 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 843
3300031231|Ga0170824_101953948 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 631
3300031446|Ga0170820_12008484 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 621
3300032180|Ga0307471_100575019 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1283
3300032770|Ga0335085_10436931 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1509
3300032782|Ga0335082_10113443 | All Organisms → cellular organisms → Bacteria | 2669
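
Each scaffold identifier in the table above has the form `<taxon OID>|<scaffold name>`, and the numeric prefix matches an entry in the Taxon OID column of the Associated Samples table. The following Python sketch splits an identifier into those two parts so that scaffolds can be matched to their samples. It assumes `|` is always the separator and that the taxon OID is purely numeric. `ScaffoldRef` and `split_scaffold_id` are hypothetical names, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Hypothetical container for the two parts of a scaffold identifier."""
    taxon_oid: str
    scaffold: str


def split_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts.

    Assumption: '|' is the only separator and the taxon OID is numeric.
    """
    taxon_oid, sep, scaffold = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold)


# Identifier copied from the first row of the Associated Scaffolds table.
ref = split_scaffold_id("3300004268|Ga0066398_10075866")
print(ref.taxon_oid, ref.scaffold)
```

Grouping rows by `taxon_oid` would then show which samples contribute more than one scaffold to the family, for example 3300005332 and 3300012944, which each contribute three.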




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 26.47%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 16.67%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 13.73%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 9.80%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 7.84%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 5.88%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.90%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.94%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.94%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.96%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.96%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.96%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.98%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.98%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300004268 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBio | Environmental
3300005163 | Soil and rhizosphere microbial communities from Laval, Canada - mgHMB | Environmental
3300005165 | Soil and rhizosphere microbial communities from Laval, Canada - mgHMC | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005533 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300015053 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300024178 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK35 | Environmental
3300024181 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK34 | Environmental
3300024182 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10 | Environmental
3300024224 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK14 | Environmental
3300024290 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK08 | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300026301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes) | Environmental
3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental
3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental
3300027288 | Soil and rhizosphere microbial communities from Laval, Canada - mgHMC (SPAdes) | Environmental
3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental
3300027826 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300028146 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK23 | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental
3300032782 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1 | Environmental


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0066398_1007586613300004268Tropical Forest SoilMFLARIALASILALSFPSNTPWDKPPGQWTAADANKILEESPWAPTKVTIEAKYSQKYTDNLSRIVTDSPANATQNSAIVQNVQISRSAAPVYYVKWMSAKTMRLALEKMHRMRTNVVGTQPPFKVEESSDYVIAIEGDEPMRIIKDAKEDLHDTVFVELDNGFPLDLTRVQYVDGADADPLRTEFHFPRLIEGKPAIDPDSEKVIFHLRA
Ga0066823_1000086433300005163SoilMFLTRIVLASLLALSFPSNTPWDKPADQWSAADTNKILEDSPWAPGKVTIETKYSQKYSDNLTHLASDSPINSQTSPVVPSMQISRGGTPDYYVKWMSAKTMRLALEKMHRMRINVVGTQPPLKVEQSPDYVIAIEGDEPMRIVRDAKEDLHDTVFVELGNGFTLDLASIQYIDGADADPLRTEFHFPRQIEGKPAIDPDSEKIVFHLRATAKREMPNRQNAIAIRVDFHPKDMRAQNIPDL*
Ga0066869_1000740523300005165SoilMFLTRIVLASLLALSFPSNTPWDKPADQWSAADTNKILEDSPWAPGKVTIETKYSQKYSDSLTHLASDSPINSQTSPVVQSMQISRGGTPDYYVKWMSAKTMRLALEKMHRMRINVVGTQPPLKVEQSPDYVIAIEGDEPMRIVRDAKEDLHDTVFVELGNGFTLDLASIQYIDGADADPLRTEFHFPRQIEGKPAIDPDSEKIVFHLRATAKREMPNRQNAIAIRVDFHPKDMRAQNIPDL*
Ga0066678_1002365513300005181SoilMSSVALISRFLIGAFLAVSFPANTPWDKPADQWTAADTNKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVVFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0066676_1079924413300005186SoilGLGIAAGNVMPLTVPMSRFALAALLALSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNTSTVRGVQISKGGTPSYYVKWMSAKTMRLALEKMRRMRTNMVGTPPPLRAEESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDP
Ga0066675_1021538123300005187SoilQNGKYNLPAGLGIAAGNVMPLTVPMSRFALAALLALSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNTSTVRGVQISKGGTPSYYVKWMSAKTMRLALEKMRRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPGSEKVIFHCRATAKKELPDRDNAISIRVDFHPREMRAQNVPDL*
Ga0066388_10002921623300005332Tropical Forest SoilMPQSSFIARFALAAVFTAAFPSNTPWDKPPDQWTAADANKILEDSPWAPSKVTIEAKYTQKHLEPLTGITSDSDINAQNTNRVRGVQISKGGTPAYYVKWMSAKTMRLALEKMHRMRANVTGTLPPLKVEGSPDYVVAIEGDEPMRILRTAKEDLHDTVFLELDNGFTLDLVSVQFLDGAEADPIRTEFHFPRQVDGKPAIELDSERIVFHCRATAKKELPGRENALAIRVDFHPKEMRAQNLPDL*
Ga0066388_10004350073300005332Tropical Forest SoilMLQSSFFAGFALAVIFVGAFPSNTPWDKPPDQWTAADANKILEESPWAPTKVTIETKYSQKYTDSLTRIVTDSAANPIQNSPIVQNVQISRSATPSYYVKWISAKTMRLALEKMHRMRTNVAGVLPPLKAEELPDYVVAIEGNEPMRILRDAKEDLHDTVFIELDNGFTLDLESVQYLDGTDADPIRTEFHFPRMMEGKPAIDPDSEKVVFHLRAAAKKEIPNRNSAIAIRVDFHPKDMRAQNTPDL*
Ga0066388_10026845413300005332Tropical Forest SoilMFLARFILVSILALSIPSSTPWDKPPDQWTAADANKILEDSPWAPTKVTIEAKYLQKYTDSLTRIVTDSSTNSVQNSPTVQSVQISRGAAPAYYVKWMSAKTMRLALEKMHRMRANVSGTLPPLKVEELPDYVIAIEGDEPMRIIKDAKEDLHDTVFVELENGFTLDLAGVQYIEGADADPLRTEFHFPRLVEGKPAFDPDSEKVIFHLRATAKKEMQNRNNAIAIRVDFHPKDMRAQNLPDL
Ga0070709_1035683423300005434Corn, Switchgrass And Miscanthus RhizosphereSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGIQISKGGTPSFYVKWMSAKTMRLALEKMHRMRTNVAGTPPPLKADESPDYVVAIEGDEPMRIIRIAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPIRTEFHFPRTIEGKPAIDPETEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNIPDL*
Ga0070713_10028646323300005436Corn, Switchgrass And Miscanthus RhizosphereSTVPISRFLVGALLAFSFPANAPWDKPADQWTAADANRILEDSPWAPSKVTIEAKFTQKHTEPLTGLISNSDVNLNNSNNIRGIQISKGGTPSFYVKWMSAKTMRLALEKMHRMRTNTAGTAPPLKADESPDYVVAIEGDEPMRIIRIAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPIRTEFHFPRTIEGKPAIDPETEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNIPDL*
Ga0066689_1007827623300005447SoilMSSVALISRFLIGAFLAVSFPANTPWDKPADQWTAADTNKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTIEGKPAIDPDSGKVVFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0070734_10000112553300005533Surface SoilMVLARFALASLLALSFPSNTPWEKPPDQWSAADTNKILEDSPWAPGKVTVETKYSQKYKDNVTQIVSDSPINSQNSPIVQNLQISKGGTPDYYVKWMSAKTMRLALEKMHRMRINVVGTQSPIKVEESPDYVIAIEGDEPMRIIRDAKEDSHDTVFVELDNGFTLDLASVHYIDGPDADPLRTEFHFPRLIEGKPAIDANSEKVVFHLRATAKREMQNRNNAIAIRVDFHPKEMRAQNVPDL*
Ga0070734_1000100993300005533Surface SoilMPQTASISRFAFAAYLTFALPANGPWDKSPDQWTAADTNKILEDSPWAPTKVAIETKYSQKYTDNLTHVVSDSPINSTQNSPIVQNVQISRGATPSFYVKWMSAKTMRLALEKMHRMRLNVAGNQPLIKVEESPDYVVAIEGDEPMRILRDAKEDLHDTVFVELDNGFTLDLASVQFIDGADADPLRTEFHFPRAIEGKAAIDPDSEKVVFHLRATAKKELPNRENSIAIRVDFHPKDMRAQTLPDL*
Ga0070730_10000387353300005537Surface SoilMQTTALFLRFAFAALVAISFPANTPWDKPPDQWTAADANKIFEDSPWAPSKVIIEAKFTQKHTEPPTGLISDSDVNLPNSNSVRGVQLSKGGAPAYYVKWMSAKTMRLALEKMHRLRANVNGTLPPLKAEESPDYVIAIEGDEPMRILRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRLVEDKPAIDPESEKIVFRLRASAKKELPNRENAISIRVDFHPKEMRAQNLPDL*
Ga0070730_1002365223300005537Surface SoilMPQTASISRFAFAAYLTFALPANGPWDKSPDQWTAADTNKILEDSPWAPTKVAIETKYSQKYTDNLTHVVSDSPINSTQNSPIVQNVQISRGATPSFYVKWMSAKTMRLALEKMHRMRLNVAGNQPLIKVEESPDYVVAIEGDEPMRILRDAKEDLHDTVFVELDNGFTLDLASVQFIDGADADPLRTEFHFPRAIEGKPAIDPDSEKVVFHLRATAKKELPNRENSIAIRVDFHPKDMRAQTLPDL*
Ga0070730_1022313513300005537Surface SoilMSSTVPISRFLVGALLAFSFPANAPWDKPADQWTAADANRILEDSPWAPSKVTIEAKFTQKHTEPLTGLISNSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNTAGTAPPLKADESPDYVVAIEGDEPMRIIRLAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPIRTEFHFPRTIEGKPAIDPETEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNIPDL*
Ga0066701_1015305513300005552SoilMSQPILIAHLALAAILALAFPANAPWEKPADQWTAADTNKILEDSPWAPSKVTIETRFTQKHTEPLTGLISESDVNLNNSPNVRGVQISKSGTPAYYVKWMSAKTMRLALEKMHRMRANVAGGAQPPLKAEQSPDYVIAIEGDEPMRILRNAKEDLHDTVFLELGNGFTLDLAGVQFLEGADADPLRTEFHFPRQIEGKPAIDPDSEKVVFHCRATAKKEIPNRDNSISIRVDFHPRDMRAQSLPDL*
Ga0066704_1024260623300005557SoilMSSVALISRFLIGAFLAVSFPANTPWDKPADQWTAADTNKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTIEGKPAIDPDSGKVVFHCRATAKKELPNRNNAISIRVDFHPR
Ga0066700_1048161813300005559SoilGPGIAASNAMSSVALISRFLAGALLAFSFPANTPWDKPADQWTAADTNKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0066699_1062543313300005561SoilGLGIAAGNAMPLTVHISRFALAALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNTSTVRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLRDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPDSEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0066703_1010977223300005568SoilMSSVALISRFLIGAFLAVSFPANTPWDKPADQWTAADTNKILEDSPWAPSKITIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVIAIEGDEPMRIIRGAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0066706_1003463823300005598SoilMSSTALISRFLVGALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQRHTEPLTGLISDSDVNLNNSSNIRGVQISKGGTPSYYVKWMSAKTMRLALGKMHRMRTNMVGTPPPLRADESPDYVIAIEGDEPMRIIRGAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0066903_10191889713300005764Tropical Forest SoilMFLARTAFATLLALSFASNAPWDKPADQWSAADTNKILEDSPWAPGKVTIETKYSQKYTDNLTHLVSDSPVNSQTSSAVPNMQISKGGTPNYYVKWTSAKTMRLALEKMHRMRSNVTGTMPPLKVEESPDYVIAIEGDEPMRILRDAKEDLHDTVFVELDNGFTLDLAKVEYIDGAEADPLRTEFHFPRLIEGKPAIDPNTEKVVFHLRATAKKEMPNRSNAIAIRVDFHPREMRAQN
Ga0066652_10048609613300006046SoilWDKPADQWTAADANKILEDSPWAPTKITIEAKFMQKHTEPLTGLISDSDVNLNNSNNIRGVQLSKGGTPSYYVKWMSAKTMRLALEKMRRMRTNMVGTPPPLRAEESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVVFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0070716_10002694043300006173Corn, Switchgrass And Miscanthus RhizosphereMSSTVPISRFLVGALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGIQISKGGTPSFYVKWMSAKTMRLALEKMHRMRTNVAGTPPPLKADESPDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPETEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNIPDL*
Ga0066659_1050137623300006797SoilMSSVALISRFLIGAFLAVSFPANTPWDKPADQWTAADTNKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRGAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPRE
Ga0075426_1000917643300006903Populus RhizosphereMSSTVPISRFLVGALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGIQISKGGTPSFYVKWMSAKTMRLALEKMHRMRTNVAGTPPPLKADESPDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPETEKVIFRCRATAKKELPNRDNAISIRVDFHPREMRAQNIPDL*
Ga0075436_10047023913300006914Populus RhizospherePSKVTVEAKYSHKYTDNLTRIISESGTNPQQNIPNVQAVQVSRGSVPGYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLKVEESPDYVVAIEGDEPMRVIRNAKEDLHDTVFVELDNGFTLDMASVQYLDGADADPLRTEFHFPRTVEGKPAIDLDSEKVVFHLRATAKKELPNRENAISIRVDFHPKEMRAQNIPDL*
Ga0099795_1038039313300007788Vadose Zone SoilISRFLAGALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRAEESPDYVVAIEGDEPMRIIRGAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDSESEKVIFHCRATAKK
Ga0066710_10059223723300009012Grasslands SoilMPLTVHISRFALAALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNTSTVRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVVFHCRATAKKELPNRNNAISIRVDFHPREMRAQNVPDL
Ga0066709_10084703233300009137Grasslands SoilMSSVALISRFLAGALLAFSFPANTPWDKPADQWTAADTNKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNIAGTPPPLRADESPDYVVAIEGDEPMRIIRGAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVVFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0126380_1000588533300010043Tropical Forest SoilMPQSSFIVRFALAAVFAAAFPSNTPWDKPPDQWTAADANKILEDSPWAPSKVTIEAKYTQKHLEPLTGITSDSDINAQNTNRVRGVQISKGGTPSCYVKWMSAKTMRLALEKMHRMRANVAGTLPPLKVEESPDYVVAIEGDEPMRILRTAKEDLHDTVFLELDNGFTLDLASVQFLDGADADPIRTDFHFPRQVDGKPTIELDSERIVFHCRATAKKELPNRENALAIRVDFHPKEMRAQNQPDL*
Ga0126380_1014394713300010043Tropical Forest SoilEDSPWAPSKITIEAKYTQRYTDPLTGVVNTSGINAQNTNPVRGVEISRGGTPAYYVKWMSAKTMRLALEKIHRLRINVVGEQPPLKVEESPDYVVAIEGDEPMRIVRDAKEDLHDTVFLELDNGFTLDLSSVQFLDSADADPIRTEFHFPRQVEGKPAIDPDSEKVVFHCRAIAKKELPNRQNVIAIRVDFHPKDMRARNLPDL*
Ga0126384_1001498523300010046Tropical Forest SoilMFLARIALASILALSFPSNTPWDKPPGQWTAADANKILEESPWAPTKVTIEAKYSQKYTDNLSRIVTDSPANATQNSAIVQNVQISRSAAPVYYVKWMSAKTMRLALEKMHRMRTNVVGTQPPFKVEESSDYVIAIEGDEPMRIIKDAKEDLHDTVFVELDNGFPLDLTRVQYVDGADADPLRTEFHFPRLIEGKPAIDPDSEKVIFHLRATAKKEKQNRNNAIAIRVDFHPKEMRAQNVPDL*
Ga0126384_1003755023300010046Tropical Forest SoilMFLARITLASILALSFPSNTPWDKPPDQWTATDANKILEESPWAPTKVTIETKYSQKYTDNLSRVVTDSPANATQNSPIVQNVQISRSATPSYYVKWMSAKTMRLALEKMHRMRANVVGTHPPLKVEESPDYVIAIEGDEPMRIVKGAKEDLHDTVFVELDNGFTLDLAGVQYVDGADADPLRTEFHFPRLIEGKPAIDADCEKVIFHLRATAKKEMQNRNNAIAIRVDFHPKEMRAQNLPDL*
Ga0126370_1008169833300010358Tropical Forest SoilMFLARIAFASLLAFSFPVNVPWDKPADQWSAADTNKILEDSPWAPGHVTVESRYSQKYRDNLTHIASDSPINSQNSPTIKNVQVSKGGTPDFYVKWISAKTMRLALEKMHRMRINVTGNMPPLKVEESPDYVIAIEGDEPMRIVRDAKEDLHDTVFVELDNGFTLDLAKVEYIDGADADPLRTEFHFPRLIEGKPAIDPDTEKVVFHLRATAKKEMPNRSNSIAIRVDFHPKDMRAQN
Ga0126370_1041178623300010358Tropical Forest SoilMILTRILLASLFALPFPNSTPWDKPPEQWTASETNKILEDSPWAPTKVTIETKYSQKYTDSLTRIVTDSAANPIQNSPIVQNVQISRSTTPSYYVKWMSARTMRLALEKMHRMRTNVVGTQPPLKVAESPDYVIAIEGDEPMRIIKDAKEDLHDTVFVELDNGFTLDLAGVHYVDGADADPLRTEFHFPRSIEGKPAIDPDGEKVIFHLRATAKKELQNRNNAIAIRVDFHPKEMRAQNLPDL*
Ga0126376_1000593243300010359Tropical Forest SoilMIVAFAFPASGPWDKPADQWTAVETNKILEDSPWAPGKVTVEAKFTQKHTELPTGLVSESPINIQNTNNIPGVQFNKGGTPQYYVKWMSAKTMRLALEKMHRMRTNVGANLPPLKVEESTDYVVAIEGDEPMRVIRHAKEDLHDTVFLELDNGFSLDLTSVQYIDGTDADPIRTEFHFPRTIEGQPAIDPQSEKVVFHLRATATKELPNRENTLGIRVDFHPKEMRAQNQPDL*
Ga0126376_1000672083300010359Tropical Forest SoilFFLAMARHKLGQTTQARADFDRAVRWRRDHPKPPGQWTAADANKILEESPWAPTKVTIEAKYSQKYTDNLSRIVTDSPANATQNSAIVQNVQISRSAAPVYYVKWMSAKTMRLALEKMHRMRTNVVGTQPPFKVEESSDYVIAIEGDEPMRIIKDAKEDLHDTVFVELDNGFPLDLTRVQYVDGADADPLRTEFHFPRLIEGKPAIDPDSEKVIFHLRATAKKEKQNRNNAIAIRVDFHPKEMRAQNVPDL*
Ga0126372_1026719733300010360Tropical Forest SoilMLQSSFFAGFALAVIFVGAFPSNTPWDKPPDQWTAADANKILEESPWAPTKVTIETKYSQKYTDSLTRIVTDSAANPIQNSPIVQNVQISRSATPSYYVKWISAKTMRLALEKMHRMRTDVAGVLPPLKAEELPDYVVAIEGNEPMRILRDAKEDLHDTVFIELDNGFTLDLESVQYLDGTDADPIRTEFHFPRLMEGKPAIDPDSEKVVFHLRAAAKKEIPNRNSAIAIRVDFHPKDMRAQNAPDL*
Ga0126372_1041062513300010360Tropical Forest SoilNKIFEDSPWAPSKVTIETKYSQKYTDSLSRIVTDSAANPIQNSPIVQNVQISRSATPSYYVKWMSAKTMQLALEKMRRMRSNVVGTQPPLKVEELPDYVVAIEGDEPMRILKDAKEDLRDTVFIELDNGFTLDLASVQYLDGTDADPIRTEFHFPRLMEGKPLIDPDSEKVVFHMRATAKKEIPNRNNAIAIRVDFHPKDMRAQNAPDL*
Ga0126377_1006837453300010362Tropical Forest SoilMPILRFAFIAILSAAIPLNSPWDRPPDQWTAADATKILEDSPWAPSKVTMEAKYTQKQLEPLTGMASDSEINTQNTNKVRGVAISRGGTPDYYVKWMSAKTVRLALEKMHRLRANVAGTWPPLEAEESPDYVIAIEGDEPMRIFRNAKEDLQDTVFLELDNGFTLDLASVQFVDGAEADSMRTEFHFPRQIEAKPAINSDSERVVFHCRATAKKEIPGRSNAIAIRVDFHPKDMRAQNRPDL*
Ga0126377_1022187123300010362Tropical Forest SoilMSSSIPIRATAFMIVAFAFPASDPWDKPADQWTAVETNKILEDSPWAPGKVTVEAKFTQKHTELPTGLVSESPINIQNTNNIPGVQFNKGGTPQYYVKWMSAKTMRLALEKMHRMRTNVGANLPPLKVEESTDYVVAIEGDEPMRVIRHAKEDLHDTVFLELDNGFSLDLTSVQYIDGTDADPIRTEFHFPRTIEGQPAIDPQSEKVVFHLRATATKELPNRENTLGIRVDFHPKEMRAQNQPDL*
Ga0126379_1007977023300010366Tropical Forest SoilMFLARIAFASLLAFSFPGNVPWDKPADQWSAADTNKILEDSPWAPGHVTVESRYSQKYKDNLTHIPSDSPINSQNSPTIKNVQISKGGTPDYYVKWISAKTMRLALEKMHRMRINVTGNMPPLKVEESPDYVIAIEGDEPMRIVRDAKEDLHDTVFVELDNGFTLDLAKVEYIDGADADPLRTEFHFPRLIEGKPAIDPDTERVVFHLRATAKKEMANRSNVIAIRVDFHPKDMRAQNSPDL*
Ga0126383_1004102033300010398Tropical Forest SoilMPPPRLILGLALAAILAPALATSNPWDKPPEEWTAAEVSKILEDSPWAPSKITIEAKYTEKHREPLTGMVSDSEINTQNTGRVRGVGISRGGTPDYYVKWMSAKTMRLALERMRETRANVGGRSAPSKVGESPDYVIAIEGDEPMRILRNAKEDLRDTVFLELDNGFALDLENVQFLDGADADPIRTEFHFPRQIEGKPAIDLASERVVFHCRATARKEILGRTNAIALRVEFHPREMRAQNLPDL*
Ga0126383_1036352623300010398Tropical Forest SoilMFLARIAFASLLAFSFPGNVPWDKPADQWSAADTNKILEDSPWAPGHVTVESRYSQKYKDNLTHIPSDSPINSQNSPTIKNVQISKGGTPDYYVKWISAKTMRLALEKMHRMRINVTGNMPPLKVEESPDYVIAIEGDEPMRIVRDAKEDLHDTTFVELDNGFTLDLAKVEYIDGADADPLRTEFHFPRLIEGKPAIDPDTERVVFHLRATAKKEMANRSN
Ga0137393_1006133623300011271Vadose Zone SoilMSSTALISRFLVGALLAFSFPANTPWDKPADQWTAADTNKILEDSPWAPSKIAIEAKFMQKHTEPLTGLISDSDVNVNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFSRTVEGKAAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0137363_1008860033300012202Vadose Zone SoilMSSAALISRFLVAALLAFSFPANMPWDKPADQWTAADTNKILEDSPWAPSKITVEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSSKTMRLALEKMHRMRTNMVGTPPPLRVDESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVESKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNIPDL*
Ga0137362_1002937643300012205Vadose Zone SoilMPQPTFISRFAIAAILALPFPANGPWDKPADQWTAADANKILEDSPWTSSKITIEAKFTQKHTEPLTGLISDSDVNLNNTNNVRGVQLSKGSSTPSYFVKWMSAKTMRLALEKMHRMRSNVAGGAQPPLNAKESPDYVIAIEGDEPMRILQNAKEDLHDTVFLELDNGFTLDLANVQFLDGADADPLRTEFHFPRQVEDKPAIDPDSEKVIFHCRGTAKKEMPGRSNAIAIRVEFHPKEMRVQNLPDL*
Ga0137387_1004283943300012349Vadose Zone SoilMSSSALISRFLVGALLAFSLPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSSNVRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLTSVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNIPDL*
Ga0137386_1033226413300012351Vadose Zone SoilPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSSNVRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLTSVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNIPDL*
Ga0137385_1083016613300012359Vadose Zone SoilMSSSTLISRFLVGALLAFSFPGNTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSSNVRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVIAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISI
Ga0137360_10001376103300012361Vadose Zone SoilMSSAALISRFLVAALLAFSFPANMPWDKPADQWTAADTNKILEDSPWAPSKITVEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSSKTMRLALEKMHRMRTNMVGTPPPLRVDESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVESKPAIDPESEKVIFHCRATAKKELPNRDNAVSIRVDFHPREMRAQNIPDL*
Ga0137398_1013409013300012683Vadose Zone SoilMGIAAGSAMSSTALISRFLVGALLVFSFPANTPWDKPAEQWTAADANKILEDSPWAPSKITIEAKFTQQHTEPLTGLVSTSDVNLQNSNNIRGVQIGKGGTPSYYVKWMSSKTMRLALEKMHRMRTNIVGTPPPLKADESLDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKAAIDPESEKVIFHCRATAKKELPNRENAISIRVDFHPREMRAQNIPDL*
Ga0137397_1000924863300012685Vadose Zone SoilMPQPTLTSCFALAAILAFPFPANAPWDKPADQWTAGDINKILEDSPWAPSKVVIETKYTQRYTDPLTGIVNTSGINAQNTKPVPGVEISRGSTPAYYVKWMSAKTMRLALEKIHRMRWNVTGGAQPPLKVEDSPDYVVAIEGDEPMRILRDAKEDLHDTVFIELDNGFTLDLASVQFLDGADADPMRTEFHFPRQIEGKPAIDPDSERVIFHCRATAKKEMPSRETVIAIRVDFHPKDMRARNLPDL*
Ga0137395_1025590313300012917Vadose Zone SoilMSSAALMSRFLAGAILVFSLPANTPWDKPADQWTAADTNKILEDSPWAPSKIAIEAKFMQKHTEPLTGLISDSDVNVNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDDADADPLRTEFHFPRTVEGKPAIDPESEKMIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0137394_1007201533300012922Vadose Zone SoilMPQPTLTSCFALAAILAFPFPANAPWDKPADQWTAGDINKILEDSPWAPSKVVIETKYTQRYTNPLTGIVNFSGINAQNTNPVPGVEISRGSSTPAYYVKWMSAKTMRLALEKIHRMRWNVTGGAQPPLKVEDSPDYVVAIEGDEPMRILRDAKEDLHDTVFIELDNGFTLDLASVQFLDGADADPMRTEFHFPRQIEGKPAIDPDSERVIFHCRATAKKEMPSRETVIAIRVDFHPKDMRARNLPDL*
Ga0137359_1011267823300012923Vadose Zone SoilMSSTALISRFLVGALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVIAIEGDEPMRIIRGAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0137413_1055584413300012924Vadose Zone SoilMFSTALISRFLAGALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVIAIEGDEPMRIIRNAKEDLHDTVFLELDNGLTVDLATVQFLDGADADPLRTEFHFPRTVEGKPAIDSESEKVVFHCRATAKKELPNRDNAISIRVDFHPREMRA
Ga0137419_1103125213300012925Vadose Zone SoilTAPISRFLVGALFAFSFPANTPWDKPADQWTAADTNKILEDSPWAPSKITIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNIVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIR
Ga0137404_1134963313300012929Vadose Zone SoilALFAFSFPANTPWDKPADQWTAADTNKILEDSPWAPSKGTIEAKFTQKHTEPLTGLISDSAVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMQLALEKMHRMRTNMVGTPPPLRAEESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRAVEGKPAIDSESEKVIFHCRATAKKELPNRDNAISIRV
Ga0137410_10000712203300012944Vadose Zone SoilMSSAALISRFLVGALLAFSFPANTPWDKPADQWTAADTNKILEDSPWAPSKITIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQIGKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLKADESPDYVIAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPDSEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPAL*
Ga0137410_1001054543300012944Vadose Zone SoilMFLARIAFASILALSFPSNTPWDKPANQWSAADTNKILEDSPWAPGKVTIETKYSQKYSDNLTHLVSDSPINSQTSPAVPSMQISKGGTPDYYVKWMSAKTMRLALEKMHRMRINVAGIQPPLKVDESPDYVIAIEGDEPMRIIRDAKEDLHDTIFVELGNGFTLDLASVQCIDGADADPLRTEFHFPRLIEGKPAIDPDTEKVIFHLRATAKREMPNRQNAIAIRVDFRPKEMRAQNVPDL*
Ga0137410_1001078353300012944Vadose Zone SoilMPQPTLTSCFALAAILAFPFPANAPWDKPADQWTAGDINKILEDSPWAPSKVVIETKYTQRYTDPLTGIVNTSGINAQNTNPVPGVEISRGSGTPAYYVKWMSAKTMRLALEKIHRMRWNVTGGAQPPLKVEDSPDYVVAIEGDEPMRILRDAKEDLHDTVFIELDNGFTLDLASVQFLDGADADPMRTEFHFPRQIEGKPAIDPDSERVIFHCRATAKKEMPSRETVIAIRVDFHPKDMRARNLPDL*
Ga0126369_1018718523300012971Tropical Forest SoilMFLARTAFATLFALSFASNSPWDKPADQWSAADTNRILEDSPWAPGKVTIETKYSQRYTDSLTHLVSDSPVNSQTSSAVPNMQISKGGTPNYYIKWMSAKTMRLALEKMHRMRSNVTGTMPPLKVEESPDYVIAIEGDEPMRVLRDAKEDLHDTVFVELDNGFTLDLAKVEYIDGAEADPLRTEFHFPRLIEGKPAIDPNTEKVVFHLRATAKKEMPNRSNAIAIRVDFHPREMRAQNVPDL*
Ga0126369_1098055513300012971Tropical Forest SoilMFLARIAFASLLAFSFPGNVPWDKPADQWSAADTNKILEDSPWAPGHVTVESRYSQKYKDNLTHIPSDSPINSQNSPTIKNVQISKGGTPDYYVKWISAKTMRLALEKMHRMRINVTGNMPPLKVEESPDYVIAIEGDEPMRIVRDAKEDLHDTVFVELDNGFTLDLAKVEYIDGADADPLRTEFHFPRLIEGKPAIDPDTEKVVFHLRATAKKEMPNRTNAIAIRVDFHPKDMRVQNAPDL*
Ga0134079_1006472113300014166Grasslands SoilMNWKTQNGKYNLPAGLGIAAGNVMPLTVPMSRFALAALLALSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNTSTVRGVQISKGGTPSYYVKWMSAKTMRLALEKMRRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLRDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPDSEKVVFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0137405_114577743300015053Vadose Zone SoilMPQPTFISRFAIAAILALPFPANGPWDKPADQWTAADANKILEDSPWTSSKITIEAKFTQKHAEPLTGLISDSDVNLNNTNNVRGVQLSKGSSTPSYFVKWMSAKTMRLALEKMHRMRSNVAGGAQPPLNAKESPDYVIAIEGDEPMRILQNAKEDLHDTVFLELDNGFTLDLANVQFLDGADADPLRTEFHFPRQVEDKPAIDPDSEKVIFHCRGTAKKEMPGRSNAIAIRVEFHPKEMRVQNLPDL*
Ga0137405_123384613300015053Vadose Zone SoilPISRFLVGALFAFSFPANTPWDKPADQWTAADTNKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDSGFTVDLASVQFLDGADADPLRTEFHFPRAVEGKPAIDPESEKVIFHCRATAKKEL
Ga0137418_1012247333300015241Vadose Zone SoilMFSTAPISRFLVGALFAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNIVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRAVEGKPAIDSESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0137412_1002003053300015242Vadose Zone SoilMFSTALISRFLVGALFAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVIAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDSESEKVVFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0137409_1001800373300015245Vadose Zone SoilMPQPTLTSCFALAAILAFPFPANSPWDKPADQWTAGDINKILEDSPWAPSKVVIETKYTQRYTDPLTGIVNTSGINAQNTNPVPGVEISRGSGTPAYYVKWMSAKTMRLALEKIHRMRWNVTGGAQPPLKVEDSPDYVVAIEGDEPMRILRDAKEDLHDTVFIELDNGFTLDLASVQFLDGADADPMRTEFHFPRQIEGKPAIDPDSERVIFHCRATAKKEMPSRETVIAIRVDFHPKDMRARNLPDL*
Ga0137403_10002217123300015264Vadose Zone SoilMFSTALISRFLAGALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVIAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRAVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL*
Ga0182041_1131458313300016294SoilMFLARIAFASLLAFSFPGNVPWDKPADQWSAADTNKILEDSPWAPGHVTVESRYSQKYGDNLTHIASDSPINSQNSPTIKNVQVSKGGTPDYYVKWTSAKTMRLALEKMHRMRINVTGNMPPLKVEESPDYVIAIEGDEPMRIVRDAKEDLHDTVFVELDNGFTLDLAKVEYIDGADADPLRTEFHFPRLIEGKPSIDPDTEKVVFHL
Ga0066669_1020015513300018482Grasslands SoilMRRTFHLSRLALDSLLCISFPPHTPWDKPDDQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNTSTVRGVQISKGGTPSYYVKWMSAKTMRLALEKMRRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLRDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPGSEKVIFHCRATAKKELPNRDNAISILVDFHPREMRAQNVPDL
Ga0210406_100000061383300021168SoilMPQPTLTSCFALAAILAFPFPANAPWDKPADQWTAGDINKILEDSPWAPSKVVIETKYTQRYTDPLTGIVNTSGINAQNTNPVPGVEISRGSTPAYYVKWMSAKTMRLALEKIHRMRWNVTGGAQPPLKVEDSPDYVVAIEGDEPMRILRDAKEDLHDTVFIELDNGFTLDLASVQFLDGADADPMRTEFHFPRQIEGKPAIDPDSERVIFHCRATAKKEMPSRETVIAIRVDFHPKDMRARNLPDL
Ga0210400_100000091573300021170SoilMFLARIALASIFALSFPNNAPWDKPASQWSAADANKIFEDSPWAPSKVTIETKYSQKYSDNLTHVVSESPINSQNSPTVQSMQISKGGTPNYYVKWMSAKTMRLALEKMHRMRINVTGTQPPLKVEESPDYVIAIEGDEPMRVLRDAKEDLHDTVFIELDSGFTLDLATVQFLDGADADPIRTEFHFPRLIEGRPAINPDSEKVVFHCRASAKKEMPNRENAIAIRVDFHPRDMRAQNLPDL
Ga0210402_10003952113300021478SoilMPQPTFISHFALAAILALPFPVNAPWDKPADQWTAADTNKILEDSPWAPTKVTIEAKYSQKYTDNLTHIVSDSGINSQNSPNVQSVQVSRGSTPSYYVKWMSAKTMRLALEKMHRMRANVTGALPPLKAEESADYVIAIEGDEPMRILRDAKEDLHDTVFIEMDNGFTLDLASVQYIDGADADPLRTEFHFPRQIEVKPVIDPDSEKVIFHCRANAKKEMPGRENFIAIRVDFHPKDMRVRNLPDL
Ga0247694_1000072243300024178SoilMPRTALMSSFVLASLLAFSFPANGPWDKPANQWTAADANKILEDSPWAPSKITIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNLVGTPPPLNAEESPDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDSGSEKVVFHLRATAKKELPDRENAISIRVDFHPKEMRAQNVPDL
Ga0247693_1000033243300024181SoilMSSFVLASLLAFSFPANGPWDKPANQWTAADANKILEDSPWAPSKITIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNLVGTPPPLKAEESPDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDSGSEKVVFHLRATAKKELPDRENAISIRVDFHPKEMRAQNVPDL
Ga0247669_1000088103300024182SoilMPRTALISSFVLASLLAFAFPANGPWDKPANQWTAADANKILEDSPWAPSKITIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNLVGTPPPLKAEESPDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDSGSEKVVFHLRATVKKELPDRENAISIRVDFHPKEMRAQNVPDL
Ga0247669_102724413300024182SoilIAARRPMPQSTLISRFALAAFLALPVPANGPWDKPVEQWTAADANKILEDSPWAPSKITIEAKFTQQHTEPLTGLVSSSDVNLQNSNNVRGIQLSKGGTSSYYVKWMSAKTMRLALEKMHRMRANISGTSPPLKVEESPDYVIAIEGDEPMRIVHNAKEDLHDTVFLELDNGFTVDLDSVQYLDDADADPIRTEFHFSRTIEGKPAIDFDSEKVVFHLRATAKKELPNRENAISIRVDFHPKEMRAQSVPDL
Ga0247673_101034913300024224SoilLAFSFPANGPWDKPANQWTAADANKILEDSPWAPSKITIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNLVGTPPPLNAEESPDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDSGSEKVVFHLRATAKKELPDRENAISIRVDFHPKEMRAQNVPDL
Ga0247667_101797813300024290SoilMPRTALISSFVLASLLAFAFPANGPWDKPANQWTAADANKILEDSPWAPSKITVEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNLVGTPPPLKAEESPDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDSGSEKVVFHLRATAKKELPDRENAISIRVDFHPKEMRAQNVPDL
Ga0137417_149106613300024330Vadose Zone SoilMSSSALISRFLVGALFAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVIAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVVFHCRATAKKELLNRDNAISIRVDFHPREMRAQNVPDL
Ga0209238_113839613300026301Grasslands SoilALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPTKITIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQLSKGGTLSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLATVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVVFHCRATAKKELSNRDNAISIRVDFHPREMRAQNVPDL
Ga0209239_108428123300026310Grasslands SoilKIQNGKCNLPAGLGIAAGNAMSSSTLISRFLVGALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKITIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNIVGTPPPLRADESPDYVIAIEGDEPMRIIRGAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL
Ga0209131_113894613300026320Grasslands SoilMGIAAGSAMSSTALISRFLVGALLVFSFPANTPWDKPAEQWTAADANKILEDSPWAPSKITIEAKFTQQHTEPLTGLVSTSDVNLQNSNNIRGVQIGKGGTPSYYVKWMSSKTMRLALEKMHRMRTNIVGTPPPLKADESLDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKAAIDPESEKVIFHCRATAKKELPNRENAISIRVDFHPREMRAQNIPDL
Ga0209161_1004910723300026548SoilMSSTALISRFLVGALLAFSFPANTPWDKPADQWTAADANKILEDSPWAPSKVTIEAKFTQRHTEPLTGLISDSDVNLNNSSNIRGVQISKGGTPSYYVNWMSAKTMRLALGKMHRMRTNMVGTPPPLRADESPDYVIAIEGDEPMRIIRGAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL
Ga0208525_100472823300027288SoilMFLTRIVLASLLALSFPSNTPWDKPADQWSAADTNKILEDSPWAPGKVTIETKYSQKYSDSLTHLASDSPINSQTSPVVQSMQISRGGTPDYYVKWMSAKTMRLALEKMHRMRINVVGTQPPLKVEQSPDYVIAIEGDEPMRIVRDAKEDLHDTVFVELGNGFTLDLASIQYIDGADADPLRTEFHFPRQIEGKPAIDPDSEKIVFHLRATAKREMPNRQNAIAIRVDFHPKDMRAQNIPDL
Ga0209689_120393413300027748SoilGPGIAASNAMSSVALISRFLAGALLAFSFPANTPWDKPADQWTAADTNKILEDSPWAPSKVTIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNMVGTPPPLRADESPDYVVAIEGDEPMRIIRSAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDPESEKVIFHCRATAKKELPNRDNAISIRVDFHPREMRAQNVPDL
Ga0209060_100000074973300027826Surface SoilMVLARFALASLLALSFPSNTPWEKPPDQWSAADTNKILEDSPWAPGKVTVETKYSQKYKDNVTQIVSDSPINSQNSPIVQNLQISKGGTPDYYVKWMSAKTMRLALEKMHRMRINVVGTQSPIKVEESPDYVIAIEGDEPMRIIRDAKEDSHDTVFVELDNGFTLDLASVHYIDGPDADPLRTEFHFPRLIEGKPAIDANSEKVVFHLRATAKREMQNRNNAIAIRVDFHPKEMRAQNVPDL
Ga0209060_1000107583300027826Surface SoilMPQTASISRFAFAAYLTFALPANGPWDKSPDQWTAADTNKILEDSPWAPTKVAIETKYSQKYTDNLTHVVSDSPINSTQNSPIVQNVQISRGATPSFYVKWMSAKTMRLALEKMHRMRLNVAGNQPLIKVEESPDYVVAIEGDEPMRILRDAKEDLHDTVFVELDNGFTLDLASVQFIDGADADPLRTEFHFPRAIEGKAAIDPDSEKVVFHLRATAKKELPNRENSIAIRVDFHPKDMRAQTLPDL
Ga0209166_1000147673300027857Surface SoilMQTTALFLRFAFAALVAISFPANTPWDKPPDQWTAADANKIFEDSPWAPSKVIIEAKFTQKHTEPPTGLISDSDVNLPNSNSVRGVQLSKGGAPAYYVKWMSAKTMRLALEKMHRLRANVNGTLPPLKAEESPDYVIAIEGDEPMRILRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRLVEDKPAIDPESEKIVFRLRASAKKELPNRENAISIRVDFHPKEMRAQNLPDL
Ga0247682_1000027243300028146SoilMSSFVLASLLAFSFPANGPWDKPANQWTAADANKILEDSPWAPSKITIEAKFTQKHTEPLTGLISDSDVNLNNSNNIRGVQISKGGTPSYYVKWMSAKTMRLALEKMHRMRTNLVGTPPPLNAEESPDYVVAIEGDEPMRIIRNAKEDLHDTVFLELDNGFTVDLASVQFLDGADADPLRTEFHFPRTVEGKPAIDSGSEKVVFHLRATAKKELPDRENAISIRVDFHPKEMRAQNVPDL
Ga0137415_1067817123300028536Vadose Zone SoilILEDSPWAPSKVAIEAKYTQKHSEPLTGLISDSDVNLNNSNNIRGVQISRGGTPSYYVKWMSAKTMRLALEKMHRMRTNVTGTPPPLKADESPDYVVAIEGDEPMRILRSAKEDLHDTVFLELDNGFTLDLASVQFLDGADADPIRTEFHFPRQVQGKPAIDPDSEKIVFHLRGTAKKELPNRENAISIRVDFHPKDMRAQNLPDL
Ga0170824_10195394813300031231Forest SoilFTSCFALAAILAFPFPANAPWDKPADQWTAGDINKILEDSPWAPSKVVIETKYTQRYTDPLTGIVNTSGINAQNTNPVPGVEISRGSTPAYYVKWMSAKTMRLALEKIHRMRWNVTGGAQPPLKVEDSPDYVVAIEGDEPMRILRDAKEDLHDTVFIELDNGFTLDLASVQFLDGADADPMRTEFHFPRQIEGQPAIDPDSGRVVFHCRA
Ga0170820_1200848413300031446Forest SoilFTSCFALAAILAFPFPANAPWDKPADQWTAGDINKILEDSPWAPSKVVIETKYTQRYTDPLTGIVNTSGINAQNTNPVPGVEISRGSTPAYYVKWMSAKTMRLALEKIHRMRWNVTGGAQPPLKVEDSPDYVVAIEGDEPMRILRDAKEDLHDTVFIELDNGFTLDLASVQFLDGADADPMRTEFHFPRQIEGQPAIDPDSGRVIF
Ga0307471_10057501923300032180Hardwood Forest SoilMVQPTLIARFALSAILAFPFPANGPWDKPADQWTAVDANKIFEDSPWTSSKITIEAKYTQKHLEPLTGIASDSEINTQNTNKVRGVQISKGGTPDYYVKWMSAKTMRLALEKMHRMRANVAGGTQPPLKAEESPDYVIAIEGDEPMRILRDAKEDLHDTVFLELDNGFTLDLANVQFLDGADADPLRTEFHFPRQVEGKPGIDPDSEKVIFHCRGTAKKELPGRSNAIAIRVEFHPKEMRAQNLPDL
Ga0335085_1043693123300032770SoilMILTRIVFASLLALSFPSNTPWDKPADQWSAADTNKILEDSPWAPGKVTIETKYSQKYSDNLTHLASDSPINSQTSPVVPSMQISRGGTPDYYVKWMSAKTMRLALEKMHRMRINVVGTQPPLKIEESPDYVIAIEGDEPMRIVRGAKEDLHDTVFVELGNGFTLDLASIQYIDGADADPLRTEFHFPRQIEGKPAIDPDSEKVVFHLRATARKEMPNRQNAIAIRVDFHPKDMRAQNIPDL
Ga0335082_1011344333300032782SoilMFLTRIALASLLALSFPSNTPWDKPADQWSAADTNKILEDSPWAPGKVTIETKYSQKYSDNLTHLASDSPINSQTSPVVPSMQISRGGTPDYYVKWMSAKTMRLALEKMHRMRINVVGTQPPLKVEESPDYVIAIEGDEPMRIVRDAKEDLHDTVFVELGNGFTLDLASIRYIDGADADPLRTEFHFPRQIEGKPAIDPDSGKIVFHLRATAKREMPNRQNAIAIRVDFHPKDMRAQNIPDL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice