NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome Family F030683


Overview

Basic Information
Family ID F030683
Family Type Metagenome
Number of Sequences 184
Average Sequence Length 98 residues
Representative Sequence MKRTVMVTIGILVLLSAQASFAGALGIPLDAFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL
Number of Associated Samples 91
Number of Associated Scaffolds 184

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 84.53 %
% of genes near scaffold ends (potentially truncated) 25.00 %
% of genes from short scaffolds (< 2000 bps) 80.98 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.47


Most Common Taxonomy
Group Unclassified (85.870 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(64.674 % of family members)
Environment Ontology (ENVO) Unclassified
(63.587 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(73.370 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 63.78%, β-sheet: 0.00%, Coil/Unstructured: 36.22%

Predicted 3D Structure

pTM-score: 0.47

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
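
The low-confidence rule stated above (pTM < 0.7) can be sketched as a small check. This is a minimal illustration only: the function name `is_low_confidence_model` and its default argument are hypothetical, and the 0.7 cutoff is taken from the note above rather than from any NMPFamsDB code.

```python
def is_low_confidence_model(ptm_score: float, threshold: float = 0.7) -> bool:
    """Return True when a predicted 3D model counts as low confidence.

    Assumption: the cutoff is the pTM < 0.7 rule quoted on this page.
    """
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM-score is expected to lie between 0 and 1")
    return ptm_score < threshold


# Family F030683 reports a pTM-score of 0.47, so it falls below the cutoff.
f030683_is_low = is_low_confidence_model(0.47)
```

A score exactly at the threshold is treated as not low confidence, because the note uses a strict inequality.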



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 184 Family Scaffolds
PF02534	T4SS-DNA_transf	61.96
PF05101	VirB3	7.07
PF03135	CagE_TrbE_VirB	3.26
PF04610	TrbL	0.54
PF10502	Peptidase_S26	0.54
PF08388	GIIM	0.54

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 184 Family Scaffolds
COG3505	Type IV secretory pathway, VirD4 component, TraG/TraD family ATPase	Intracellular trafficking, secretion, and vesicular transport [U]	61.96
COG3702	Type IV secretory pathway, VirB3 component	Intracellular trafficking, secretion, and vesicular transport [U]	7.07
COG3451	Type IV secretory pathway, VirB4 component	Intracellular trafficking, secretion, and vesicular transport [U]	3.26
COG3704	Type IV secretory pathway, VirB6 component	Intracellular trafficking, secretion, and vesicular transport [U]	0.54
COG3846	Type IV secretory pathway, TrbL components	Intracellular trafficking, secretion, and vesicular transport [U]	0.54



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
Unclassified	root	N/A	85.87 %
All Organisms	root	All Organisms	14.13 %


Associated Scaffolds


Scaffold	Taxonomy	Length (bp)
3300004463|Ga0063356_100056357	Not Available	3997
3300005295|Ga0065707_10573763	Not Available	706
3300005332|Ga0066388_101311232	Not Available	1249
3300005332|Ga0066388_104264029	All Organisms → cellular organisms → Bacteria	729
3300005445|Ga0070708_100892423	All Organisms → cellular organisms → Bacteria	835
3300005447|Ga0066689_10283594	All Organisms → cellular organisms → Bacteria → Proteobacteria	1025
3300005467|Ga0070706_100697158	Not Available	942
3300005467|Ga0070706_100991875	Not Available	775
3300005468|Ga0070707_102038547	Not Available	541
3300005536|Ga0070697_100985826	All Organisms → cellular organisms → Bacteria	749
3300005558|Ga0066698_10094503	All Organisms → cellular organisms → Bacteria	1965
3300005985|Ga0081539_10396299	Not Available	574
3300006058|Ga0075432_10526691	Not Available	530
3300006880|Ga0075429_100008087	Not Available	9142
3300007255|Ga0099791_10014964	Not Available	3292
3300007255|Ga0099791_10105474	Not Available	1300
3300007255|Ga0099791_10175797	Not Available	1005
3300007265|Ga0099794_10272229	Not Available	875
3300009012|Ga0066710_100103096	All Organisms → cellular organisms → Bacteria	3824
3300009012|Ga0066710_100334802	Not Available	2232
3300009012|Ga0066710_100834308	All Organisms → cellular organisms → Bacteria	1415
3300009012|Ga0066710_101440504	Not Available	1066
3300009012|Ga0066710_102881878	Not Available	676
3300009038|Ga0099829_10052609	All Organisms → cellular organisms → Bacteria → Proteobacteria	3023
3300009038|Ga0099829_10077748	Not Available	2531
3300009038|Ga0099829_10142645	Not Available	1905
3300009038|Ga0099829_10244273	Not Available	1462
3300009038|Ga0099829_10247866	Not Available	1452
3300009038|Ga0099829_10826885	Not Available	769
3300009038|Ga0099829_10852081	Not Available	756
3300009038|Ga0099829_10973556	Not Available	704
3300009038|Ga0099829_11280273	Not Available	606
3300009088|Ga0099830_11137230	Not Available	648
3300009088|Ga0099830_11254948	Not Available	615
3300009089|Ga0099828_10091269	Not Available	2619
3300009089|Ga0099828_10397178	Not Available	1243
3300009089|Ga0099828_10702713	Not Available	908
3300009089|Ga0099828_10736639	Not Available	884
3300009089|Ga0099828_10766739	Not Available	865
3300009089|Ga0099828_10888176	Not Available	796
3300009090|Ga0099827_10015500	Not Available	5120
3300009090|Ga0099827_10040196	Not Available	3454
3300009090|Ga0099827_10152826	Not Available	1890
3300009090|Ga0099827_10178480	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium japonicum	1755
3300009090|Ga0099827_10238206	Not Available	1525
3300009090|Ga0099827_10453438	Not Available	1099
3300009090|Ga0099827_10817635	Not Available	806
3300009090|Ga0099827_10916597	Not Available	759
3300009090|Ga0099827_11105809	Not Available	688
3300009090|Ga0099827_11829501	Not Available	529
3300009090|Ga0099827_11958840	Not Available	510
3300009137|Ga0066709_100015594	Not Available	7319
3300009137|Ga0066709_100429627	Not Available	1841
3300009137|Ga0066709_102528723	Not Available	691
3300009162|Ga0075423_10987103	Not Available	894
3300009792|Ga0126374_10392302	Not Available	967
3300009837|Ga0105058_1069138	Not Available	805
3300010029|Ga0105074_1014740	Not Available	1262
3300010043|Ga0126380_10941822	Not Available	722
3300010046|Ga0126384_11444111	Not Available	643
3300010359|Ga0126376_10679713	Not Available	987
3300010359|Ga0126376_11279606	Not Available	752
3300010359|Ga0126376_12340839	Not Available	580
3300010360|Ga0126372_10454437	All Organisms → cellular organisms → Bacteria	1188
3300010361|Ga0126378_10957092	Not Available	961
3300010366|Ga0126379_11171165	Not Available	875
3300010398|Ga0126383_10270109	Not Available	1681
3300010398|Ga0126383_10279487	Not Available	1656
3300010398|Ga0126383_11225926	Not Available	840
3300010398|Ga0126383_12051992	Not Available	659
3300011269|Ga0137392_10701686	Not Available	838
3300011270|Ga0137391_10125208	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium japonicum	2231
3300011270|Ga0137391_10647247	Not Available	882
3300011270|Ga0137391_10777037	Not Available	791
3300011270|Ga0137391_10788535	Not Available	784
3300011271|Ga0137393_10346983	Not Available	1268
3300012096|Ga0137389_10056686	All Organisms → cellular organisms → Bacteria	2982
3300012096|Ga0137389_10121278	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium japonicum	2106
3300012096|Ga0137389_10770897	All Organisms → cellular organisms → Bacteria	826
3300012096|Ga0137389_11114295	Not Available	676
3300012189|Ga0137388_10204482	Not Available	1781
3300012189|Ga0137388_10303628	Not Available	1464
3300012189|Ga0137388_10304462	Not Available	1462
3300012189|Ga0137388_11638202	Not Available	578
3300012199|Ga0137383_10440860	Not Available	954
3300012199|Ga0137383_11027374	Not Available	600
3300012203|Ga0137399_10251174	Not Available	1450
3300012204|Ga0137374_10112199	Not Available	2535
3300012204|Ga0137374_10720464	Not Available	747
3300012205|Ga0137362_11688697	Not Available	520
3300012206|Ga0137380_10143329	Not Available	2179
3300012206|Ga0137380_10688484	Not Available	888
3300012206|Ga0137380_10751658	Not Available	844
3300012206|Ga0137380_11324693	Not Available	605
3300012207|Ga0137381_10093669	All Organisms → cellular organisms → Bacteria	2539
3300012207|Ga0137381_11005769	Not Available	719
3300012209|Ga0137379_10102288	Not Available	2757
3300012209|Ga0137379_10400178	Not Available	1282
3300012209|Ga0137379_10734535	Not Available	892
3300012209|Ga0137379_10758042	Not Available	876
3300012210|Ga0137378_10794457	Not Available	859
3300012210|Ga0137378_10839110	Not Available	832
3300012210|Ga0137378_11070986	Not Available	721
3300012210|Ga0137378_11074628	Not Available	719
3300012211|Ga0137377_10638161	Not Available	1001
3300012211|Ga0137377_11807672	Not Available	530
3300012349|Ga0137387_10114750	Not Available	1893
3300012349|Ga0137387_11010641	Not Available	596
3300012351|Ga0137386_10506217	Not Available	870
3300012351|Ga0137386_10907997	Not Available	631
3300012353|Ga0137367_10083992	Not Available	2353
3300012353|Ga0137367_10270129	Not Available	1221
3300012353|Ga0137367_10571005	Not Available	793
3300012355|Ga0137369_10984000	Not Available	560
3300012356|Ga0137371_10388447	Not Available	1083
3300012357|Ga0137384_10070613	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae	2900
3300012359|Ga0137385_10063091	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium japonicum	3300
3300012359|Ga0137385_11407253	Not Available	561
3300012362|Ga0137361_10251063	Not Available	1612
3300012362|Ga0137361_10251274	Not Available	1611
3300012362|Ga0137361_10316175	Not Available	1430
3300012362|Ga0137361_11379382	Not Available	628
3300012362|Ga0137361_11435220	Not Available	613
3300012362|Ga0137361_11892165	Not Available	514
3300012363|Ga0137390_10026895	Not Available	5430
3300012685|Ga0137397_10331216	All Organisms → cellular organisms → Bacteria	1135
3300012917|Ga0137395_10117470	All Organisms → cellular organisms → Bacteria	1786
3300012918|Ga0137396_10721304	All Organisms → cellular organisms → Bacteria	735
3300012918|Ga0137396_11115251	Not Available	563
3300012923|Ga0137359_10605439	Not Available	961
3300012925|Ga0137419_11356860	Not Available	599
3300012929|Ga0137404_10459993	Not Available	1130
3300012929|Ga0137404_11647959	Not Available	595
3300012944|Ga0137410_10133788	Not Available	1873
3300012944|Ga0137410_11278571	Not Available	634
3300012971|Ga0126369_10556514	All Organisms → cellular organisms → Bacteria → Proteobacteria	1212
3300012971|Ga0126369_12387666	Not Available	615
3300014205|Ga0172380_11052185	Not Available	575
3300016270|Ga0182036_10794618	Not Available	771
3300016371|Ga0182034_10652692	Not Available	891
3300016371|Ga0182034_11687748	Not Available	557
3300018433|Ga0066667_12145739	Not Available	518
3300021086|Ga0179596_10612295	Not Available	552
3300021476|Ga0187846_10392605	Not Available	571
3300025910|Ga0207684_10139857	Not Available	2081
3300025910|Ga0207684_10216328	Not Available	1653
3300025910|Ga0207684_11213611	Not Available	624
3300025922|Ga0207646_10597864	Not Available	990
3300027490|Ga0209899_1047990	Not Available	889
3300027655|Ga0209388_1129924	Not Available	717
3300027846|Ga0209180_10046959	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium japonicum	2374
3300027846|Ga0209180_10107620	Not Available	1592
3300027846|Ga0209180_10499296	Not Available	681
3300027846|Ga0209180_10506426	Not Available	676
3300027846|Ga0209180_10692311	Not Available	555
3300027862|Ga0209701_10203134	Not Available	1181
3300027875|Ga0209283_10038343	Not Available	3014
3300027875|Ga0209283_10734604	Not Available	613
3300027882|Ga0209590_10015342	Not Available	3731
3300027882|Ga0209590_10048322	Not Available	2358
3300027882|Ga0209590_10160333	Not Available	1401
3300027882|Ga0209590_10275307	Not Available	1077
3300027882|Ga0209590_10405899	Not Available	880
3300027882|Ga0209590_10474149	Not Available	809
3300027882|Ga0209590_10669532	Not Available	665
3300027903|Ga0209488_10511544	Not Available	879
3300027903|Ga0209488_11084491	Not Available	548
3300027907|Ga0207428_10039269	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	3843
3300027952|Ga0209889_1107929	Not Available	554
3300027961|Ga0209853_1015502	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	2332
3300027961|Ga0209853_1121915	Not Available	648
3300028536|Ga0137415_11400711	Not Available	521
3300028587|Ga0247828_10026406	Not Available	2334
3300028589|Ga0247818_10045258	Not Available	2831
3300028592|Ga0247822_10318667	Not Available	1193
3300028705|Ga0307276_10147653	Not Available	597
3300031543|Ga0318516_10647211	All Organisms → cellular organisms → Bacteria	602
3300031548|Ga0307408_101653593	Not Available	609
3300031912|Ga0306921_10135150	All Organisms → cellular organisms → Bacteria → Proteobacteria	2880
3300032261|Ga0306920_102470441	Not Available	715
3300032770|Ga0335085_11083168	Not Available	860
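
Each scaffold identifier in the table above has the form `prefix|name`, for example `3300004463|Ga0063356_100056357`. The numeric prefix appears to match a Taxon OID listed under Associated Samples (3300004463 is one of them), so splitting on `|` gives a key for joining scaffolds to samples. A minimal sketch under that assumption follows. The helper name `split_scaffold_id` is hypothetical and is not part of NMPFamsDB or IMG/M.

```python
from typing import Tuple


def split_scaffold_id(scaffold_id: str) -> Tuple[str, str]:
    """Split 'taxon_oid|scaffold_name' into its two parts.

    Assumption: the text before '|' is the sample's Taxon OID.
    """
    taxon_oid, separator, scaffold_name = scaffold_id.partition("|")
    if not separator or not taxon_oid or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


taxon_oid, scaffold_name = split_scaffold_id("3300004463|Ga0063356_100056357")
```

The function raises `ValueError` on identifiers without a `|` rather than guessing where the boundary lies.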




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	64.67%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	8.15%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	5.43%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	4.89%
Groundwater Sand	Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand	3.26%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	2.72%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	2.17%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil	1.63%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	1.09%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil	1.09%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	0.54%
Switchgrass Rhizosphere	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere	0.54%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	0.54%
Soil	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil	0.54%
Biofilm	Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm	0.54%
Tabebuia Heterophylla Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere	0.54%
Arabidopsis Thaliana Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere	0.54%
Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere	0.54%
Landfill Leachate	Engineered → Solid Waste → Landfill → Unclassified → Unclassified → Landfill Leachate	0.54%




Associated Samples

Taxon OID	Sample Name	Habitat Type
3300002562	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm	Environmental
3300004463	Combined assembly of Arabidopsis thaliana microbial communities	Host-Associated
3300005295	Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3	Environmental
3300005332	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)	Environmental
3300005445	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG	Environmental
3300005447	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138	Environmental
3300005467	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG	Environmental
3300005468	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG	Environmental
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005558	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147	Environmental
3300005985	Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S4T2R2	Host-Associated
3300006058	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1	Host-Associated
3300006880	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3	Host-Associated
3300007255	Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1	Environmental
3300007265	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1	Environmental
3300009012	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159	Environmental
3300009038	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG	Environmental
3300009088	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG	Environmental
3300009089	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG	Environmental
3300009090	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG	Environmental
3300009137	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158	Environmental
3300009162	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2	Host-Associated
3300009792	Tropical forest soil microbial communities from Panama - MetaG Plot_12	Environmental
3300009837	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30	Environmental
3300010029	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_10_20	Environmental
3300010043	Tropical forest soil microbial communities from Panama - MetaG Plot_26	Environmental
3300010046	Tropical forest soil microbial communities from Panama - MetaG Plot_36	Environmental
3300010359	Tropical forest soil microbial communities from Panama - MetaG Plot_15	Environmental
3300010360	Tropical forest soil microbial communities from Panama - MetaG Plot_6	Environmental
3300010361	Tropical forest soil microbial communities from Panama - MetaG Plot_23	Environmental
3300010366	Tropical forest soil microbial communities from Panama - MetaG Plot_24	Environmental
3300010398	Tropical forest soil microbial communities from Panama - MetaG Plot_35	Environmental
3300011269	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG	Environmental
3300011270	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG	Environmental
3300011271	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG	Environmental
3300012096	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG	Environmental
3300012189	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG	Environmental
3300012199	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG	Environmental
3300012203	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG	Environmental
3300012204	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG	Environmental
3300012205	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG	Environmental
3300012206	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG	Environmental
3300012207	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG	Environmental
3300012209	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG	Environmental
3300012210	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG	Environmental
3300012211	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG	Environmental
3300012349	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG	Environmental
3300012351	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG	Environmental
3300012353	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG	Environmental
3300012355	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG	Environmental
3300012356	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG	Environmental
3300012357	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG	Environmental
3300012359	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG	Environmental
3300012362	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG	Environmental
3300012363	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG	Environmental
3300012685	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG	Environmental
3300012917	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG	Environmental
3300012918	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG	Environmental
3300012923	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG	Environmental
3300012925	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)	Environmental
3300012929	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)	Environmental
3300012944	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)	Environmental
3300012971	Tropical forest soil microbial communities from Panama - MetaG Plot_1	Environmental
3300014205	Leachate microbial communities from a municipal landfill in Southern Ontario, Canada - Leachate well 162 metaG	Engineered
3300016270	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080	Environmental
3300016371	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172	Environmental
3300018433	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116	Environmental
3300021086	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)	Environmental
3300021476	Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)	Environmental
3300025910	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)	Environmental
3300025922	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)	Environmental
3300027490	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10 (SPAdes)	Environmental
3300027655	Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)	Environmental
3300027846	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)	Environmental
3300027862	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)	Environmental
3300027875	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)	Environmental
3300027882	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)	Environmental
3300027903	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)	Environmental
3300027907	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)	Host-Associated
3300027952	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 (SPAdes)	Environmental
3300027961	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes)	Environmental
3300028536	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)	Environmental
3300028587	Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3	Environmental
3300028589	Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Glucose_Day1	Environmental
3300028592	Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30	Environmental
3300028705	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_115	Environmental
3300031543	Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f20	Environmental
3300031548	Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-3	Host-Associated
3300031912	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)	Environmental
3300032261	Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)	Environmental
3300032770	Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5	Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25382J37095_101986082 | 3300002562 | Grasslands Soil | SCAAGLGIPLDSFLATFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSIGFFTKAGILGGGTVLMGLLGLVGGATL*
Ga0063356_1000563572 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MDRRMILVSGVAVLGGLSVMPVHAGGLGIPLDGFLSQFQTFVIGLGLIMGLVGLAGYVGSLFDNPFSHVLAGSVGFFTKAGFVESPCME*
Ga0065707_105737631 | 3300005295 | Switchgrass Rhizosphere | LLAGVHAASAGGFGIAQLDAFLTTFLTGVTGLGVLVGSVGLVGYVGSLMDNPFSTILSGSIGFFTKAGILGGGTAMLTGLGLVTGGTF*
Ga0066388_1013112321 | 3300005332 | Tropical Forest Soil | VKRMSGLAVIGLLMLSARASYAGALGIPLDSFLSTFQTWVVGLGLIMGLVGLVGYVGQLFDNPFAHILSGSMGFFTKAGLLGGGTVLMGLLGLVGGGTL*
Ga0066388_1042640291 | 3300005332 | Tropical Forest Soil | MTRTVAVVSGSLVLLSAQASSAAGLGIPLDGFLATFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGAR
Ga0070708_1008924232 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MKRTMAVVSGGLVLLSAQASVAAGLGIPLDGFLTTFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGALL*
Ga0066689_102835942 | 3300005447 | Soil | VKRTMIVVAGGLLLVSARMSCAAGLGIPLDSFLATFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSIGFFTKAGILGGGTVLMGLLGLVGGATL*
Ga0070706_1006971581 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | VKRTMIMLACGVLLVSAQRSDAGALGIPLDAFLATFKTWVVGLGLIMGLVGLVGYVGSLFDNPFSHILSGSIGFFTKAGILGGGTVLMGLLGLVGGATL*
Ga0070706_1009918751 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | VDRMKRTVAVVSGGLVLLSAQASFAAGLGIPLDSFLSTFQTWVVGLGLIMGLVGLVGYIGALFDNPFAHILSGSLGFFTKAGLLGGGTVLLGLVGLVGGATL*
Ga0070707_1020385471 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VKRTMIMLACGVLLVSAQRSYAGALGIPLDAFLATFKTWVVGLGLIMGLVGLVGYVGSLFDNPFSHILSGSIGFFTKAGLLG
Ga0070697_1009858262 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | VKRTMIMLACGVLLVSAQRSYAGALGIPLDAFLATFKTWVVGLGLIMGLVGLVGYVGSLFDNPFSHILSGSIGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0066698_100945032 | 3300005558 | Soil | MRRPRTVAMGFGVALLLSARVSWAGALGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVVLMGLLGLVGGATL*
Ga0081539_103962992 | 3300005985 | Tabebuia Heterophylla Rhizosphere | MRQHVGRAALLVLVCASTSHAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVTLMGLLGLVGGATL*
Ga0075432_105266912 | 3300006058 | Populus Rhizosphere | MRRRWRGIVTLATALLLLHSRAHAAAMGIPLDAFLQQFQLFVVGLGLVMGLVGLTGYVGSLFDNPFSNILAGSIGFFSKAGLLGGGTILLGLVGLTGGAVL*
Ga0075429_1000080877 | 3300006880 | Populus Rhizosphere | MQRQWTIVGTLVLSMLLLQGQAHAAAMGIPLDAFLQQFQLFVVGLGLVMGLVGLTGYVGSLFDNPFSNILAGSIGFFSKAGLLGGGTILLGLVGLTGGAVL*
Ga0099791_100149642 | 3300007255 | Vadose Zone Soil | VKRTVVLGISGGLLLIARASWAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLTGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLMGLVGGATL*
Ga0099791_101054742 | 3300007255 | Vadose Zone Soil | VKRTVVLGISGGLLLIARVSWAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLTGYVGSLMDNPFSNILAGSVGFFTKAGLLGGGIVLMGLMGLVGGAML*
Ga0099791_101757972 | 3300007255 | Vadose Zone Soil | MKRTMVVAVMVVVGLGARTASAGALGIPLDAFLAQFQIWVVGLGLVMGLVGLTGFVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATL*
Ga0099794_102722292 | 3300007265 | Vadose Zone Soil | VKRTVVIGISGGLLLIARVSWAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLTGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLLGLVGGATL*
Ga0066710_1001030963 | 3300009012 | Grasslands Soil | VKRTMIVVAGGLLLVSARMSCAAGLGIPLDSFLATFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSIGFFTKAGILGGGTVLMGLLGLVGGATL
Ga0066710_1003348022 | 3300009012 | Grasslands Soil | MKRTGAIGMGGVVLLSARLSSAGALGIPLDAFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGTVLLGLLGLVGGATL
Ga0066710_1008343082 | 3300009012 | Grasslands Soil | MKRQLAFGLGMIVVLSVRVSWAGGLGIPLDGFLQNFQTWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATL
Ga0066710_1014405041 | 3300009012 | Grasslands Soil | MRRPRTVAMGFGVALLLSARVSWAGALGIPLDGFLTQFQTFVVGLGLVMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVVLMGLLG
Ga0066710_1028818782 | 3300009012 | Grasslands Soil | MSRRILLLVVVGVLASTQVSQAAGLGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGVILMGLLRLVGGATL
Ga0099829_100526092 | 3300009038 | Vadose Zone Soil | MVGMKRRMVVAVMVGIGLGARAASAGALGIPLDGFLTQFQTWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGGTVP*
Ga0099829_100777482 | 3300009038 | Vadose Zone Soil | MKRTMVVAVMVGVALGARAASAGALGIPLDGFLTQFQTWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGGTL*
Ga0099829_101426453 | 3300009038 | Vadose Zone Soil | MKRGIAVAVGGVVLFSAQASFAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL*
Ga0099829_102442732 | 3300009038 | Vadose Zone Soil | MIVVAGGLLLMSARASCAGTLGIPLDAFLTTFQTWVVGLGLIMGLVGLVGYVGGLFDNPFAHILSGSIGFFTKAGLLGGGTVLLGLLGLVGGATL*
Ga0099829_102478662 | 3300009038 | Vadose Zone Soil | MKRTVAVVNGGLVLLSAQASFAAGLGIPLDGFLTTFQTWVVGLGLIMGLVGLVGYIGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGATV*
Ga0099829_108268852 | 3300009038 | Vadose Zone Soil | MKRNMLVVVMVGVGLGARAASAGALGIPLDGFLAQFQTWVVGLGLVMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGMLGGGTVLLGLMGLVGGATL*
Ga0099829_108520812 | 3300009038 | Vadose Zone Soil | MKRGIAVAVGVVLFSARASFAGGLGIPLDGFLAQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL*
Ga0099829_109735562 | 3300009038 | Vadose Zone Soil | MKRPVGFTVGSLVLFSAQASCAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0099829_112802731 | 3300009038 | Vadose Zone Soil | MKHGIAVAVGGVVLLSAQASFAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL*
Ga0099830_111372301 | 3300009088 | Vadose Zone Soil | MKRTMVVAVMVGVALGARAASAGALGIPLDGFLTQFQTWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGGTV
Ga0099830_112549481 | 3300009088 | Vadose Zone Soil | MKRKMAVAVILAMVGLGARAASAGALGIPLDAFLTQFQAWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTILLGLMGLA
Ga0099828_100912692 | 3300009089 | Vadose Zone Soil | MKRGIAVAVGGVVLLSAQASCAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL*
Ga0099828_103971783 | 3300009089 | Vadose Zone Soil | MKRKMAVAVILAMVGLGARAASAGALGIPLDAFLTQFQAWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTILLGLMGLAGGATL*
Ga0099828_107027132 | 3300009089 | Vadose Zone Soil | MKHGIAVGVGGVVLLSAQASCAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL*
Ga0099828_107366392 | 3300009089 | Vadose Zone Soil | MKRRIAVMAAVAVLVSVRLSYAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLVGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLLGLVGGATL*
Ga0099828_107667393 | 3300009089 | Vadose Zone Soil | MIIVAGGLLLMSARASCAGTLGIPLDAFLTTFQTWVVGLGLIMGLVGLVGYVGGLFDNPFAHILSGSIGFFTKAGLLGGGTVLLGLLGLVGGATL*
Ga0099828_108881761 | 3300009089 | Vadose Zone Soil | MKRTMVVAVMVGVALGARAASAGALGIPLDGFLTQFQTWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGGTVP*
Ga0099827_100155002 | 3300009090 | Vadose Zone Soil | MKHGIAVAVGGVVLLSAQASCAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL*
Ga0099827_100401962 | 3300009090 | Vadose Zone Soil | MKRTLVLGIGITVLLSAQTSFAAGLGIPLDGFLTTFQTFVIGLGLVMGLVGLIGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0099827_101528262 | 3300009090 | Vadose Zone Soil | MKRTVAIGMGGVVLLSARLSSAGALGIPLDAFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGTVLLGLLGLVGGATL*
Ga0099827_101784802 | 3300009090 | Vadose Zone Soil | MIVVLSVRVSWAGGLGIPLDGFLQNFQTWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATL*
Ga0099827_102382061 | 3300009090 | Vadose Zone Soil | MKRTVVWSSSGLVLCSAQASFAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0099827_104534382 | 3300009090 | Vadose Zone Soil | MMTRTAIVTIGSLILLSAQTSFAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0099827_108176353 | 3300009090 | Vadose Zone Soil | VVLFSARASWAGGLGIPLDGFLAQLQTWVVGLGLIMGLVGLTGYVGSLFDNPFSHILAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL*
Ga0099827_109165972 | 3300009090 | Vadose Zone Soil | MKRTMAIGIGGVVLLSARASFAGALGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVVIMGLLGLVGGATVP*
Ga0099827_110763871 | 3300009090 | Vadose Zone Soil | MVMKRQLACGLGMIVVLSARASWAGALGIPLDGFIQNFQTWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGG
Ga0099827_111058091 | 3300009090 | Vadose Zone Soil | MKRTAGFAVGSLVVFSAQASFAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFYNPFSNILAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0099827_118295011 | 3300009090 | Vadose Zone Soil | REMVGMKRKMAVAVMLMVVFGAKGACAAGLGIPLDGFLSQFQTWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGVTVLLGLMGLVGGATL*
Ga0099827_119588402 | 3300009090 | Vadose Zone Soil | MTRRVMIGIAIGVLLSARASYAGALGIPLDGFLTTFQTFVIGLGLIMGLVGLTGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGVVLMGLLGLVGGATL*
Ga0066709_1000155948 | 3300009137 | Grasslands Soil | MIVVAGGLLLVSARMSCAAGLGIPLDSFLATFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSIGFFTKAGILGGGTVLMGLLGLVGGATL*
Ga0066709_1004296273 | 3300009137 | Grasslands Soil | MSRRILLLVVVGVLASTQVSQAAGLGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGVILMGLLGLVGGATL*
Ga0066709_1025287231 | 3300009137 | Grasslands Soil | MVMKRAIALGVGCVVLFSARASWAGALGIPLDGFLQNFQTWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATL*
Ga0075423_109871032 | 3300009162 | Populus Rhizosphere | MRRTQVVVMAGGALVLLSARTSWAGGLGIPLDGFLTTFQTFVIGLGLIMGLVGLTGYVGSLFYNTFSNILAGSVGLFTKAGLLGGGVTLMGLLGLVGGATL*
Ga0126374_103923022 | 3300009792 | Tropical Forest Soil | MWKRIGWAVGMSVLVAGTSEAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVTLMGLLGLVGGGVL*
Ga0105058_10691382 | 3300009837 | Groundwater Sand | MKRTAMVTIGSLVLFSAQASFAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLARSYLVPLWSNA*
Ga0105074_10147402 | 3300010029 | Groundwater Sand | MKRTLMMAIGGAVLLSARASFAGALGIPLDGFLTTFQTFIIGLGLILGLVGLTGWIGSLFDNPFSNIMAGSVGFFTKAGLLGGG
Ga0126380_109418221 | 3300010043 | Tropical Forest Soil | VLRTEEGGMTRKVQLALGLGMVVLLSARASFAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVTLMGLLGLVGGGVL*
Ga0126384_114441112 | 3300010046 | Tropical Forest Soil | MIMLVCGVLLVSAQRSYAGALGIPLDAFLATFKTWVVGLGLIMGLVGLVGYVGSLFDNPFSHILSGSIGFFTKAGILGGGTVLMGLLGLVGGATL*
Ga0126376_106797132 | 3300010359 | Tropical Forest Soil | MKRTVVVGGGVVLLSAQASYAAGLGIPLDGFLQTFQTWVVGLGLVMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0126376_112796061 | 3300010359 | Tropical Forest Soil | QVSFAAGLGIPLDGFLQTFQTWVVGLGLVMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMRLLGLVGGALL*
Ga0126376_123408392 | 3300010359 | Tropical Forest Soil | ILMLSARASYAGALGIPLDSFLSTFQTWVVGLGLIMGLVGLVGYVGQLFDNPFAHILSGSMGFFTKAGLLGGGTVLMGLLGLVGGGTL*
Ga0126372_104544373 | 3300010360 | Tropical Forest Soil | MKRTMAVVSGGLVLLSAQASVAAGLGIPLVGFLTTFQTWVVGLGLIMGLVGLVGYIGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0126378_109570922 | 3300010361 | Tropical Forest Soil | MKRTVVVGGGVVLLSAQASYAAGLGIPLDGFLQTFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGALL*
Ga0126379_111711652 | 3300010366 | Tropical Forest Soil | MKRTMAVVSGGLVLLSAQASVAVGLGIPLDGFLTTFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLFGGATL*
Ga0126383_102701094 | 3300010398 | Tropical Forest Soil | MLVTSQTRKIAFGLALAIVLYAEGSMAGGLGIPLDGFLTTFQTFVVGIGLIIGLVGLTGYVGSLMDNPFANILAGSVGFFTKAGLLGGGVTLMGLLGVVAGATL*
Ga0126383_102794873 | 3300010398 | Tropical Forest Soil | MKRTMAVVVGGVILVSAQAGFAAGLGIPLDSFLSTFQTWVVGLGLIMGLVGLVGYIGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0126383_112259261 | 3300010398 | Tropical Forest Soil | MRTYLGLALVLVLLRASTSHAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGVTLMGLLGLVGGATL*
Ga0126383_120519922 | 3300010398 | Tropical Forest Soil | SYAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYIGSLFDNPFSNILAGSIGFFTKAGLLGGGVTLMGLLGLVGGATL*
Ga0137392_107016861 | 3300011269 | Vadose Zone Soil | MKRKMAVAVILAMVGLGARAASAGALGIPLDAFLTQFQAWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGGTL*
Ga0137391_101252082 | 3300011270 | Vadose Zone Soil | MKRVIAVAVGVVLFSARASCAGGLGIPLDGFLAQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGML*
Ga0137391_106472471 | 3300011270 | Vadose Zone Soil | VMKRQMVSGMVVVILMSAHASHAAGLGIPLDGFLNQVQTWVIGLGLIMGLIGLTGWVGSMFDNPFSHILAGSVGFFTKAGLLGGGTVILAAMGLVGGATL*
Ga0137391_107770373 | 3300011270 | Vadose Zone Soil | MKRTVAVVNWGLVLLSAQASFAAGLGIPLDGFLTTFQTWVVGLGLIMGLVGLVGYIGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGATV*
Ga0137391_107885351 | 3300011270 | Vadose Zone Soil | MVGMKRTMVVAVMVGVALGARAASAGALGIPLDGFLTQFQTWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGGTVP*
Ga0137393_103469832 | 3300011271 | Vadose Zone Soil | MKRTVVLSISGLVLCSAQASFAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0137389_100566863 | 3300012096 | Vadose Zone Soil | LSAQASFAAGLGIPLDGFLSQFQTWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGGTVP*
Ga0137389_101212782 | 3300012096 | Vadose Zone Soil | MKRVIAVAVGVVLFSARASCAGGLGIPLDGFLAQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL*
Ga0137389_107708972 | 3300012096 | Vadose Zone Soil | MVMKRQMVSGMVVVILMSAHASHAAGLGIPLDGFLNQVQTWVIGLGLIMGLIGLTGWVGSMFDNPFSHILAGSVGFFTKAGLLGGGTVILAAMGLVGGATL*
Ga0137389_111142952 | 3300012096 | Vadose Zone Soil | MVMKRQLACGLGMIVVLSVRVSWAGGLGIPLDGFLQNFQTWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATL*
Ga0137388_102044822 | 3300012189 | Vadose Zone Soil | MKRGIAVAVGGVVLFSAQASFAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGML*
Ga0137388_103036282 | 3300012189 | Vadose Zone Soil | MVVAVMVGIGLGARAASAGALGIPLDGFLTQFQTWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGGTVP*
Ga0137388_103044622 | 3300012189 | Vadose Zone Soil | MKRRIAVMAAVAVLVSVRLSYAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLVGYVGSLMDNPFSNILAGSVGFFSKAGLL
Ga0137388_116382022 | 3300012189 | Vadose Zone Soil | MKRGIAVAVGGVVLLSAQASCAGALGIPLDGFLTQFQTWVVGLGLSMGLVGLVGYVGSLFDNPFCHVLACSIGFFTKAGLLGGGVALMTLL
Ga0137383_104408601 | 3300012199 | Vadose Zone Soil | MMSHKRTIALVMGALVLCSARASLAGALGIPLDGFLTQVQTWVIGLGLIMGLIGLTGWVGSMFDNPFSHILAGSVGFFTKAGLLGGGTVILAAMGLVGGATVP*
Ga0137383_110273742 | 3300012199 | Vadose Zone Soil | MKRQLAFGLGMIVVLSVRVSWAGGLGIPLDGFLQNFQTWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATL*
Ga0137399_102511742 | 3300012203 | Vadose Zone Soil | MKRRIAVMAAVAVLVSGRLSYAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLVGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLLGLVGGATL*
Ga0137374_101121992 | 3300012204 | Vadose Zone Soil | MSRRILLLVVVGVLASTQGSQAAGLGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGVILMGLLGLVGGATL*
Ga0137374_107204642 | 3300012204 | Vadose Zone Soil | MKSMLRILGMVAVMSLVMVPKAHAGGLGIPLDGFLATFQTFVVGLGLIIGLVGLAGYVGSLFDNPFSHILAGSVSFFVKAGLLGGGVTMLGALGLVGGGLLP*
Ga0137362_116886971 | 3300012205 | Vadose Zone Soil | RAREIVMKRQLAWGLGMIVVLSVRVSWAGGLGIPLDGFLQNFQTWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATVP*
Ga0137380_101433293 | 3300012206 | Vadose Zone Soil | MKRTGAIGMGGVVLLSARLSSAGALGIPLDAFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGTVLLGLLGLVGGATL*
Ga0137380_106884842 | 3300012206 | Vadose Zone Soil | MKRGIAVAVGGVVLLSAQASFAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGG
Ga0137380_107516583 | 3300012206 | Vadose Zone Soil | MKRVVFFVVTSAVVLSARLSWAGALGIPLDGFLQTFQTFVLGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0137380_113246931 | 3300012206 | Vadose Zone Soil | MKRTVALGIGAAVLLSARVSAAGALGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVVIRGLLGLVGGATVP*
Ga0137381_100936693 | 3300012207 | Vadose Zone Soil | MVMKRQLAFGLGMIVVLSVRVSWAGGLGIPLDGFLQNFQTWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATL*
Ga0137381_110057692 | 3300012207 | Vadose Zone Soil | MKRMMTVVGGGMVLLSAQASFAAGLGIPLDGFLTTFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGALL*
Ga0137379_101022884 | 3300012209 | Vadose Zone Soil | MIVVAGGLLLMSARVSFAGALGIPLDAFLTTFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSIGFFTKAGILGGGTVLMGLLGLVGGATL*
Ga0137379_104001782 | 3300012209 | Vadose Zone Soil | MKRGIAVAVGGMVLFSAQASFAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL*
Ga0137379_107345351 | 3300012209 | Vadose Zone Soil | MPGRFAAATPTRRQGMKRAIAVGMGYVMLCSARACWAGALGIPLDGFLANFQTWVVGLGLVMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATVP*
Ga0137379_107580422 | 3300012209 | Vadose Zone Soil | MKRVIAVAVGVVLFGARASCAGGLGIPLDGFLAQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGML*
Ga0137378_107944572 | 3300012210 | Vadose Zone Soil | MVGVVMVGVMLGARVASAGALGIPLDGFLGQFQAWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTILLGLMGLVGG
Ga0137378_108391101 | 3300012210 | Vadose Zone Soil | LPGRPGISREMVMKRQMVSGMVLVILISAHASHAAGLGIPLDGFLTQVQTWVIGLGLIMGLIGLTGWVGSMFDNPFSHILAGSVGFFTKAGLLGGGTVILAAMGLVGGATVP*
Ga0137378_110709862 | 3300012210 | Vadose Zone Soil | MKRMMTVVGGGMVLLSAQASFAAGLGIPLDGFLTTFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLM
Ga0137378_110746282 | 3300012210 | Vadose Zone Soil | MTTKQTMVIGLGLMMVLSARASCAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVVLMGLLGLVGGATL*
Ga0137377_106381612 | 3300012211 | Vadose Zone Soil | MTYRVTVGISLLVLLSARASYAGALGIPLDGFLAQFQTFVIGMGLIIGLVGLVGYVGSLMDNPFSNILAGSVGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0137377_118076721 | 3300012211 | Vadose Zone Soil | MMSHKRTIALVMGALVLCSARASLAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVTLMGLLGLVGGATL*
Ga0137387_101147503 | 3300012349 | Vadose Zone Soil | SARASCAGALGIPLDAFLSTFQTWVVGLGLNMGLVGLVGYVGGLFDNPFAHILSGSIGFFTKAGLLGGGTVLLGLLGLVGGATL*
Ga0137387_110106411 | 3300012349 | Vadose Zone Soil | MKRGIAVAVGGMVLFSAQASFAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGATL
Ga0137386_105062172 | 3300012351 | Vadose Zone Soil | MKRAIAVGMGCVVLCSARACWAGALGIPLDGFLAQFQTWVVGFGLIMGLVGLVGYIGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLL
Ga0137386_109079972 | 3300012351 | Vadose Zone Soil | MKRVIAVAVGVVLFGARASWAGALGIPLDGFMQNFQTWVVGLGLIMGLVGLVGYVGGLFDNPFAHILSGSLGFFTKAGILGGGTVLMGLLRLV
Ga0137367_100839923 | 3300012353 | Vadose Zone Soil | MSRRILFLVVVGVLASTQGSQAAGLGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGVILMGLLGLVGGATL*
Ga0137367_102701293 | 3300012353 | Vadose Zone Soil | MKSMLRILGMVAVMSLVMVPKAHAGGLGIPLDGFLATFQTFVVGLGLIIGLVGLAGYVGSLFDNPFSHILAGSVSFFVKAGLLGGGVTMLGALGLVGGGLLQ*
Ga0137367_105710052 | 3300012353 | Vadose Zone Soil | MSSHRYIVSVVVIGSILVSVRGAFAGALGIPLDGFLTTFQTFVVGLGLTMGLVGLVGWIGSLFDNPFSHILAGSVSFFIKAGLLGGGLTLMTMLGLVGGATLP*
Ga0137369_109840001 | 3300012355 | Vadose Zone Soil | VGARQHRELRAVGCVETTTRRDGMKRSVMMWVVVGMMLSARVSCAAGLGIPLDGFLTTFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILAGSIGFFTKAGLLGGGTVLMGLLGLVGGAQL*
Ga0137371_103884472 | 3300012356 | Vadose Zone Soil | MKRTVVLGISVGLLLSARASWAGALGIPLDGFMTQFQTFVVGLGLIIGLVGLTGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLLGLVGGATL*
Ga0137384_100706133 | 3300012357 | Vadose Zone Soil | MVMKRQLAFGLVMIVVLSVGVSCAGGLGIPLDGFLQNFQTWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATL*
Ga0137385_100630914 | 3300012359 | Vadose Zone Soil | MVMKRQMVSGMVLVMLISAHASHAAGLGIPLDGFLTQVQTWVIGLGLIMGLIGLTGWVGSMFDNPFSHILAGSVGFFTKAGLLGGGTVILAAMGLVGGATVP*
Ga0137385_109982411 | 3300012359 | Vadose Zone Soil | MKRTVAIGMGGVVLLSARLSSAGALGIPLDAFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIVFFTKAGL
Ga0137385_114072532 | 3300012359 | Vadose Zone Soil | MTTKQTMVIGLGLMMVLSARASFAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVVLMGLLGLVGGATL*
Ga0137361_102510631 | 3300012362 | Vadose Zone Soil | MKRTVVLGIGISLLLSAHASWAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLTGYVGSLMDNPFSNILAGSVGFFAKAGLLGGGVVLMGLMGLVGGATL*
Ga0137361_102512743 | 3300012362 | Vadose Zone Soil | MSRRTALSVSGSVVGVLLHAQAGYTAGLGIPLDAFLAQFQLFVVGLGLVMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGETILMGLLGLVGGATL*
Ga0137361_103161752 | 3300012362 | Vadose Zone Soil | MKRKVIVAALLVLLSARASFAGALGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGVVLMGLLGLVGGGTL*
Ga0137361_113793822 | 3300012362 | Vadose Zone Soil | MVMKRAIALGVGCVVLFSARASWAGGLGIPLDGFLAQLQTWVVGLGLIMGLVGLTGYVGSLFDNPFSHILAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL*
Ga0137361_114352201 | 3300012362 | Vadose Zone Soil | MVMKRSIVVVVGGVVLLSAHVSQAAGLGIPLDGFLTQVQTWVIGLGLIMGLIGLTGWVGSMFDNPFSHILAGSVGFFTKAGLLGGGTVILAAMGLVGGATL*
Ga0137361_118921652 | 3300012362 | Vadose Zone Soil | VKRTVVIGISGGLLLIARVSWAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLTGYVGSLMDNPFSNILAGSVGFFTKAGLLGGGIVLMGLMGLVGGAML*
Ga0137390_100268957 | 3300012363 | Vadose Zone Soil | ARASCAGGLGIPLDGFLAQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGML*
Ga0137397_103312162 | 3300012685 | Vadose Zone Soil | MKRRIAVMAGVAVLVSVRASSAGALGIPLDGFLSQFQTFVVGLGLIVGLVGLVGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGTVLLGLLGLVGGGVL*
Ga0137395_101174702 | 3300012917 | Vadose Zone Soil | MKRNMLVVVMVGVGLGARAASAGALGIPLDGFLAQFQTWVVGLGLVMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATL*
Ga0137396_107213041 | 3300012918 | Vadose Zone Soil | MKRNMLVVVMVGVGLGARAASAGALGIPLDGFLAQFQTWVVGLGLVMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGL
Ga0137396_111152511 | 3300012918 | Vadose Zone Soil | VKRTVVLGISGGLLLIARVSWAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLTGYVGSLMDNPFSNILAGSVGFFTKAGLLGGASCLWDSWG
Ga0137359_106054393 | 3300012923 | Vadose Zone Soil | VQAHFGDEGQLPNGSLFSKESDMKRRIAVMAAVAVLVSVRLSYAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLVGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLLGLVGGATL*
Ga0137419_113568602 | 3300012925 | Vadose Zone Soil | MKRRIAVMAAVAVLVSVRLSYAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLVGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLPGPVGGATL*
Ga0137404_104599932 | 3300012929 | Vadose Zone Soil | MKRTVMVTIGILVLLSAQASFAGALGIPLDAFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL*
Ga0137404_116479592 | 3300012929 | Vadose Zone Soil | MKRTMVVVVMVGVGLGARAASAGALGIPLDGFLTQFQTWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATVP*
Ga0137410_101337881 | 3300012944 | Vadose Zone Soil | RPVFSKESRVKRTVVLGISGGLLLIARASWAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLTGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLMGLVGGATL*
Ga0137410_112785712 | 3300012944 | Vadose Zone Soil | PNGSLFSKESGMKRRIAVMAAVAVLVSGRLSYAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLVGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLLGLVGGATL*
Ga0126369_105565141 | 3300012971 | Tropical Forest Soil | MTRRQQLAVGLGMVVLVSARASYAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYIGSFFDNPFSNILAGSIGFFTKAGLLGGGVTLMGLLGLVG
Ga0126369_123876662 | 3300012971 | Tropical Forest Soil | MTVLVAGTSEAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVTLMGLLGLVGGGVL*
Ga0172380_110521851 | 3300014205 | Landfill Leachate | MKRAVRLAGLVSGLVLSAQVAQAASLGIALDEFLTQFETFVIGLGMIMGLVGLVGWIGSLFDSPYGGILSGSIRFFMLAGLLGGGTVILGMMGLVNGAVLP*
Ga0182036_107946181 | 3300016270 | Soil | DTRMKRTHQLAVGVGILVLAVARTSWAGALGIPLDGFLTTFQTFVTGLGLIMGIVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGTVLMGLLGLVGGATL
Ga0182034_106526921 | 3300016371 | Soil | SNTEDTRMKRTHQLAVGVGILVLAVARTSWAGALGIPLDGFLTTFQTFVTGLGLIMGIVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGTVLMGLLGLVGGATL
Ga0182034_116877482 | 3300016371 | Soil | MNKRQLAVGVGMVVLVVARAASAGALGIPLDGFLTTFQTFVTGLGLIMGLVGLTGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGVTLMGLLGLVGGATL
Ga0066667_121457392 | 3300018433 | Grasslands Soil | MTRRQQLAVGLGMVVLLSARVSWAGALGIPLDGFLTTFQTFVVGLGLIMGLVGLTGYIGSLFDNPFSNILAGSIGFFTKAGLLGGGVTLMGLLGLVGGATL
Ga0179596_106122952 | 3300021086 | Vadose Zone Soil | MQQTTRRQGMKRGIAVAVGGVVLLRAQASCAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL
Ga0187846_103926052 | 3300021476 | Biofilm | MTKATRQRTITVVSGGLVLLSVQASSAAGLGIPLDGFLQTFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGALL
Ga0207684_101398573 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | VKRTMIMLACGVLLVSAQRSDAGALGIPLDAFLATFKTWVVGLGLIMGLVGLVGYVGSLFDNPFSHILSGSIGFFTKAGILGGGTVLMGLLGLVGGATL
Ga0207684_102163283 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | QASVAAGLGIPLDGFLTTFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGALL
Ga0207684_112136112 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | QASFAAGLGIPLDSFLSTFQTWVVGLGLIMGLVGLVGYIGALFDNPFAHILSGSLGFFTKAGLLGGGTVLLGLVGLVGGATL
Ga0207646_105978642 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MKRTMAVVSGGLVLLSAQASVAAGLGIPLDGFLTTFQTWVVGLGLIMGLVGLVGYVGALFDNPFAHILSGSLGFFTKAGLLGGGTVLMGLLGLVGGALL
Ga0209899_10479901 | 3300027490 | Groundwater Sand | MKRTAMVTIGSLVLFSAQASCAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTV
Ga0209388_11299242 | 3300027655 | Vadose Zone Soil | VKRTVVLGISGGLLLIARASWAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLTGYVGSLMDNPFSNILAGSVGFFSKAGLLGG
Ga0209180_100469594 | 3300027846 | Vadose Zone Soil | MKRVIAVAVGVVLFSARASCAGGLGIPLDGFLAQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGML
Ga0209180_101076202 | 3300027846 | Vadose Zone Soil | MQQTTRRQAMKRGIAVAVGGVVLFSAQASFAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL
Ga0209180_104992962 | 3300027846 | Vadose Zone Soil | MKRGIAVAVGVVLFSARASFAGGLGIPLDGFLAQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL
Ga0209180_105064262 | 3300027846 | Vadose Zone Soil | MKRGIAVAVGGVVLLSAQASFAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL
Ga0209180_106923112 | 3300027846 | Vadose Zone Soil | MKRTMAIGIGGVVLLSARASFAGALGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVVIMGLLGL
Ga0209701_102031341 | 3300027862 | Vadose Zone Soil | MKRKMAVAVILAMVGLGARAASAGALGIPLDAFLTQFQAWVIGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL
Ga0209283_100383433 | 3300027875 | Vadose Zone Soil | MKRGIAVAVGGVVLLSAQASCAGALGIPLDGFLTQFQTWVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGVALMTLLGLVGGGTL
Ga0209283_107346041 | 3300027875 | Vadose Zone Soil | MKRRIAVMAAVAVLVSVRLSYAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLVGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLLGLVGGATL
Ga0209590_100153423 | 3300027882 | Vadose Zone Soil | MKRTMVVAVMVGVALGARAASAGALGIPLDGFLTQFQTWVIGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGGTL
Ga0209590_100483223 | 3300027882 | Vadose Zone Soil | MKRSIVVAVIMGVGLGARMASAGALGIPLDGFLAQFQTWVVGLGLVMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGMLGGGTVLLGLMGLVGGATL
Ga0209590_101603333 | 3300027882 | Vadose Zone Soil | MKRTVAIGMGGVVLLSARLSSAGALGIPLDAFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSHVLAGSIGFFTKAGLLGGGTVLLGLLGLVGGATL
Ga0209590_102753072 | 3300027882 | Vadose Zone Soil | MTRTAIVTIGSLILLSAQTSFAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL
Ga0209590_104058991 | 3300027882 | Vadose Zone Soil | MKRTVVWSSSGLVLCSAQASFAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL
Ga0209590_104741491 | 3300027882 | Vadose Zone Soil | MKRQLACGLGMIVVLSVRVSWAGGLGIPLDGFLQNFQTWVVGLGLIMGLVGLTGYVGAQFDNPFSHILAGSVGFFTRAGLLGGGTVLLGLMGLVGGATL
Ga0209590_106695322 | 3300027882 | Vadose Zone Soil | MKRTMAIGIGGVVLLSARASFAGALGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGVVIMGLLGLVGGA
Ga0209488_105115442 | 3300027903 | Vadose Zone Soil | VKRTVVLGISGGLLLIARASWAGALGIPLDGFLTQFQTFVVGLGLIIGLVGLTGYVGSLMDNPFSNILAGSVGFFSKAGLLGGGVVLMGLMGLVGGATL
Ga0209488_110844911 | 3300027903 | Vadose Zone Soil | MKRTVVGGGGLVLLSAQASVAAGLGIPLDGFLTTFQTWVVGLGLIMGLVGLVGYIGALFDNPFAHILSGSLGFFTKAGLLGGGTVLLGLLGLVGGATL
Ga0207428_100392694 | 3300027907 | Populus Rhizosphere | MRRRWRGIVTLATALLLLHSRAHAAAMGIPLDAFLQQFQLFVVGLGLVMGLVGLTGYVGSLFDNPFSNILAGSIGFFSKAGLLGGGTILLGLVGLTGGAVL
Ga0209889_11079291 | 3300027952 | Groundwater Sand | MKRTLMMAIGGAVLLSARASFAGALGIPLDGFLTTFQTFIIGLGLILGLVGLTGWIGSLFDNPFSNIMAGSVGFFTKAGLLGGGTVLMGLLGLVGGATL
Ga0209853_10155023 | 3300027961 | Groundwater Sand | MKRTAGFAVGSLVVFSAQASFAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTVLMGLLGLVGGATL
Ga0209853_11219152 | 3300027961 | Groundwater Sand | MKRTVGFTVGSLALLSAQTSFAAGLGIPLDGFLTTFQTFVVGLGLIMGLVGLVGYVGSLFYNPFSNILAGSIGFFTKAGLLGG
Ga0137415_114007112 | 3300028536 | Vadose Zone Soil | MKWTVALGLGMAVVLSARFAFAGALGIPLDGFLTQFQTFVVGLGLIMGLVGLTGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGVVLMGLLGLVGGGTL
Ga0247828_100264062 | 3300028587 | Soil | MRRRWTVVGTLVLTMLLLQGRAHAAAMGIPLDGFLQQFQLFVVGLGLVMGLVGLTGYVGSLFDNPFSNILAGSVGFFSKAGLLGGGTILLGLVGLTGGAVV
Ga0247818_100452583 | 3300028589 | Soil | MRRRWTVVGTLVLTMLLLQGRAHAAAMGIPLDGFLQQYQLFVVGLGLVMGLVGLTGYVGSLFDNPFSNILAGSVGFFSKAGLLGGGTILLGLVGLTGGAVV
Ga0247822_103186671 | 3300028592 | Soil | MRRRWTVVGTLVLTMLLLQGRAHAAAMGIPLDGFLQQFQLFVVGLGLVMGLVGLTGYVGSLFDNPFSNILAGSVGFFSKAGLLGGGTILLGLVGLT
Ga0307276_101476532 | 3300028705 | Soil | MKGGQDVGTHRDMVAFYLAALLLAPHLAHAGALGIPLDGFVATFQTFVIGLGLAVGLVGLIGYVGSLMDNPFSNILAGSVGFFTKAGLLGGGTTLMGLLGLVGGATL
Ga0318516_106472112 | 3300031543 | Soil | MGKKQLAVGAGLLVLVVARAASAGALGIPLDGFLTTFQTFVTGLGLIMGLVGLTGYVGSLFDNPFSNILAGSIGFFTKAGLLGGGTALMALLGLVGGATQ
Ga0307408_1016535932 | 3300031548 | Rhizosphere | MKAVSRVVLSAGMVVLVWATRGWCGGLGIPLDGFLATFQTFVIGLGLIMGLVGIAGYVGSLFDNPFSHILAGSVGFFTKAGLLGGGSVLLGILGLVGGAVLP
Ga0306921_101351503 | 3300031912 | Soil | MKRTHQLAVGVGILVLAVARTSWAGALGIPLDGFLTTFQTFVTGLGLIMGIVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGTVLMGLLGLVGGATL
Ga0306920_1024704412 | 3300032261 | Soil | MKRTHQLAVGVGILVLAVARTSWAGALGIPVDGFLTTFQTFVTGLGLIMGIVGLTGYVGSLFDNPFSNILAGSVGFFTKAGLLGGGTVLMGLLGLVGGATL
Ga0335085_110831682 | 3300032770 | Soil | MKRTMMVMACGLLLLSARASMAGALGIPLDSFLATFKTWVVGLGLIMGLVGLVGYVGSLFDNPFSHILSGSIGFFTKAGILGGGTVLMGLLGLVGGGTL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice