NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F046268



Overview

Basic Information
Family ID F046268
Family Type Metagenome / Metatranscriptome
Number of Sequences 151
Average Sequence Length 95 residues
Representative Sequence MGTTRTLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQPSDPTTLPTAFVPLPKKAREL
Number of Associated Samples 101
Number of Associated Scaffolds 151

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 66.23 %
% of genes near scaffold ends (potentially truncated) 27.15 %
% of genes from short scaffolds (< 2000 bps) 80.79 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.48


Most Common Taxonomy
Group Bacteria (92.053 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (29.801 % of family members)
Environment Ontology (ENVO) Unclassified (67.550 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (69.536 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 50.00%, β-sheet: 0.00%, Coil/Unstructured: 50.00%

Predicted 3D Structure

pTM-score: 0.48

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 151 Family Scaffolds
PF00106 | adh_short | 32.45
PF13561 | adh_short_C2 | 25.83
PF07883 | Cupin_2 | 9.93
PF16798 | DUF5069 | 3.31
PF00903 | Glyoxalase | 1.32
PF03446 | NAD_binding_2 | 1.32
PF00248 | Aldo_ket_red | 1.32
PF13686 | DrsE_2 | 1.32
PF14342 | DUF4396 | 1.32
PF14833 | NAD_binding_11 | 0.66
PF07715 | Plug | 0.66
PF02954 | HTH_8 | 0.66
PF13460 | NAD_binding_10 | 0.66
PF02321 | OEP | 0.66
PF13649 | Methyltransf_25 | 0.66
PF00881 | Nitroreductase | 0.66
PF08240 | ADH_N | 0.66
PF00724 | Oxidored_FMN | 0.66
PF01047 | MarR | 0.66
PF01370 | Epimerase | 0.66
PF03358 | FMN_red | 0.66
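
The percentages in the Pfam list above can be turned back into approximate scaffold counts. The sketch below assumes (this is not stated on the page) that each value is simply the number of family scaffolds carrying the neighboring domain divided by 151 and rounded to two decimals. The function name `scaffold_count` is hypothetical and used only for illustration.

```python
# Illustrative sketch: recover approximate scaffold counts from the
# "% Frequency in 151 Family Scaffolds" column.
# Assumption: each percentage is count / 151, rounded to two decimals.

FAMILY_SCAFFOLDS = 151  # total taken from the column header


def scaffold_count(percent, total=FAMILY_SCAFFOLDS):
    """Return the nearest whole number of scaffolds for a percentage."""
    return round(percent / 100 * total)


# A few rows copied from the Pfam list (Pfam ID -> % frequency).
frequencies = {
    "PF00106": 32.45,
    "PF13561": 25.83,
    "PF07883": 9.93,
    "PF16798": 3.31,
    "PF00903": 1.32,
    "PF14833": 0.66,
}

counts = {pfam: scaffold_count(pct) for pfam, pct in frequencies.items()}

for pfam, n in counts.items():
    print(f"{pfam}: ~{n} of {FAMILY_SCAFFOLDS} scaffolds")
```

Under that assumption, the smallest listed value (0.66) corresponds to a single scaffold, so the long tail of the list reflects domains seen next to only one or two family members.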

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 151 Family Scaffolds
COG1538 | Outer membrane protein TolC | Cell wall/membrane/envelope biogenesis [M] | 1.32
COG0042 | tRNA-dihydrouridine synthase | Translation, ribosomal structure and biogenesis [J] | 0.66
COG1902 | 2,4-dienoyl-CoA reductase or related NADH-dependent reductase, Old Yellow Enzyme (OYE) family | Energy production and conversion [C] | 0.66



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 92.05 %
Unclassified | root | N/A | 7.95 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300002558|JGI25385J37094_10073828 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1079
3300002560|JGI25383J37093_10053666 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1294
3300002561|JGI25384J37096_10037730 | All Organisms → cellular organisms → Bacteria | 1879
3300002908|JGI25382J43887_10061888 | All Organisms → cellular organisms → Bacteria | 2026
3300002912|JGI25386J43895_10047186 | All Organisms → cellular organisms → Bacteria | 1246
3300002912|JGI25386J43895_10101149 | All Organisms → cellular organisms → Bacteria | 736
3300005166|Ga0066674_10459992 | All Organisms → cellular organisms → Bacteria | 580
3300005172|Ga0066683_10664671 | All Organisms → cellular organisms → Bacteria | 622
3300005172|Ga0066683_10695539 | All Organisms → cellular organisms → Bacteria | 603
3300005172|Ga0066683_10780944 | All Organisms → cellular organisms → Bacteria | 557
3300005172|Ga0066683_10872163 | All Organisms → cellular organisms → Bacteria | 518
3300005174|Ga0066680_10525844 | Not Available | 743
3300005174|Ga0066680_10697632 | All Organisms → cellular organisms → Bacteria | 623
3300005181|Ga0066678_10216814 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1227
3300005181|Ga0066678_10225864 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira | 1203
3300005186|Ga0066676_10364773 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira | 968
3300005445|Ga0070708_100497666 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1150
3300005445|Ga0070708_101395545 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira | 653
3300005446|Ga0066686_10091382 | All Organisms → cellular organisms → Bacteria | 1945
3300005446|Ga0066686_10322797 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira | 1051
3300005446|Ga0066686_11010289 | All Organisms → cellular organisms → Bacteria | 539
3300005447|Ga0066689_10032040 | All Organisms → cellular organisms → Bacteria | 2689
3300005447|Ga0066689_10268292 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1053
3300005467|Ga0070706_100139944 | All Organisms → cellular organisms → Bacteria | 2259
3300005536|Ga0070697_100073969 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii | 2798
3300005552|Ga0066701_10585298 | All Organisms → cellular organisms → Bacteria | 681
3300005552|Ga0066701_10823014 | All Organisms → cellular organisms → Bacteria | 552
3300005552|Ga0066701_10888279 | All Organisms → cellular organisms → Bacteria | 528
3300005553|Ga0066695_10316233 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira | 983
3300005554|Ga0066661_10074160 | All Organisms → cellular organisms → Bacteria | 1992
3300005555|Ga0066692_10132242 | All Organisms → cellular organisms → Bacteria | 1516
3300005555|Ga0066692_10226578 | All Organisms → cellular organisms → Bacteria | 1176
3300005556|Ga0066707_10047708 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 2469
3300005558|Ga0066698_10057455 | All Organisms → cellular organisms → Bacteria | 2475
3300005558|Ga0066698_10515241 | All Organisms → cellular organisms → Bacteria | 812
3300005598|Ga0066706_10597741 | All Organisms → cellular organisms → Bacteria | 876
3300005598|Ga0066706_10609239 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 867
3300006034|Ga0066656_10640761 | All Organisms → cellular organisms → Bacteria | 685
3300006796|Ga0066665_10750215 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 772
3300006796|Ga0066665_11085252 | All Organisms → cellular organisms → Bacteria | 610
3300009012|Ga0066710_101048621 | Not Available | 1259
3300009012|Ga0066710_102335142 | Not Available | 777
3300009012|Ga0066710_103318672 | All Organisms → cellular organisms → Bacteria | 614
3300009012|Ga0066710_103979825 | All Organisms → cellular organisms → Bacteria | 553
3300009038|Ga0099829_10237756 | All Organisms → cellular organisms → Bacteria | 1483
3300009038|Ga0099829_11614507 | Not Available | 535
3300009088|Ga0099830_10132221 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1901
3300009088|Ga0099830_11575795 | All Organisms → cellular organisms → Bacteria | 547
3300009089|Ga0099828_10045921 | All Organisms → cellular organisms → Bacteria | 3603
3300009089|Ga0099828_11920786 | Not Available | 519
3300009090|Ga0099827_10278528 | All Organisms → cellular organisms → Bacteria | 1411
3300009090|Ga0099827_10644547 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 914
3300009137|Ga0066709_101787228 | Not Available | 864
3300009137|Ga0066709_101818448 | All Organisms → cellular organisms → Bacteria | 854
3300009137|Ga0066709_102184393 | All Organisms → cellular organisms → Bacteria | 763
3300009444|Ga0114945_10003885 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira defluvii | 8374
3300009444|Ga0114945_10033523 | All Organisms → cellular organisms → Bacteria | 2768
3300009444|Ga0114945_10845135 | All Organisms → cellular organisms → Bacteria | 563
3300010043|Ga0126380_10217759 | Not Available | 1293
3300010080|Ga0127448_149644 | All Organisms → cellular organisms → Bacteria | 736
3300010087|Ga0127492_1033142 | All Organisms → cellular organisms → Bacteria | 671
3300010087|Ga0127492_1033339 | All Organisms → cellular organisms → Bacteria | 657
3300010108|Ga0127474_1112294 | All Organisms → cellular organisms → Bacteria | 679
3300010114|Ga0127460_1040651 | Not Available | 1325
3300010124|Ga0127498_1095454 | All Organisms → cellular organisms → Bacteria | 524
3300010127|Ga0127489_1130035 | All Organisms → cellular organisms → Bacteria | 616
3300010133|Ga0127459_1144053 | All Organisms → cellular organisms → Bacteria | 519
3300010142|Ga0127483_1107244 | All Organisms → cellular organisms → Bacteria | 1076
3300010303|Ga0134082_10260346 | All Organisms → cellular organisms → Bacteria | 720
3300010304|Ga0134088_10134161 | All Organisms → cellular organisms → Bacteria | 1175
3300010304|Ga0134088_10341416 | All Organisms → cellular organisms → Bacteria | 726
3300010335|Ga0134063_10208081 | All Organisms → cellular organisms → Bacteria | 922
3300010336|Ga0134071_10115386 | All Organisms → cellular organisms → Bacteria | 1286
3300011271|Ga0137393_10979597 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 720
3300012189|Ga0137388_10069364 | All Organisms → cellular organisms → Bacteria | 2932
3300012189|Ga0137388_11778541 | All Organisms → cellular organisms → Bacteria | 549
3300012201|Ga0137365_10469185 | All Organisms → cellular organisms → Bacteria | 926
3300012206|Ga0137380_10196011 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1833
3300012206|Ga0137380_10627329 | All Organisms → cellular organisms → Bacteria | 938
3300012206|Ga0137380_11150841 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 660
3300012207|Ga0137381_11113717 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 679
3300012207|Ga0137381_11720965 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 517
3300012209|Ga0137379_10383798 | All Organisms → cellular organisms → Bacteria | 1314
3300012285|Ga0137370_10013906 | All Organisms → cellular organisms → Bacteria | 3908
3300012349|Ga0137387_10668173 | All Organisms → cellular organisms → Bacteria | 752
3300012349|Ga0137387_11227780 | All Organisms → cellular organisms → Bacteria | 528
3300012351|Ga0137386_10156362 | All Organisms → cellular organisms → Bacteria | 1631
3300012355|Ga0137369_10103541 | All Organisms → cellular organisms → Bacteria | 2331
3300012356|Ga0137371_10676891 | All Organisms → cellular organisms → Bacteria | 790
3300012359|Ga0137385_10831334 | All Organisms → cellular organisms → Bacteria | 766
3300012363|Ga0137390_11519417 | All Organisms → cellular organisms → Bacteria | 610
3300012379|Ga0134058_1034061 | All Organisms → cellular organisms → Bacteria | 628
3300012379|Ga0134058_1227892 | All Organisms → cellular organisms → Bacteria | 2526
3300012382|Ga0134038_1038615 | All Organisms → cellular organisms → Bacteria | 537
3300012383|Ga0134033_1181941 | All Organisms → cellular organisms → Bacteria | 1441
3300012390|Ga0134054_1260783 | All Organisms → cellular organisms → Bacteria | 747
3300012392|Ga0134043_1126404 | All Organisms → cellular organisms → Bacteria | 941
3300012397|Ga0134056_1175687 | All Organisms → cellular organisms → Bacteria | 644
3300012400|Ga0134048_1191754 | All Organisms → cellular organisms → Bacteria | 754
3300012400|Ga0134048_1285919 | All Organisms → cellular organisms → Bacteria | 960
3300012401|Ga0134055_1257515 | All Organisms → cellular organisms → Bacteria | 1548
3300012402|Ga0134059_1164577 | All Organisms → cellular organisms → Bacteria | 1605
3300012403|Ga0134049_1210644 | Not Available | 1485
3300012406|Ga0134053_1109002 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 810
3300012410|Ga0134060_1392605 | Not Available | 958
3300012410|Ga0134060_1451215 | All Organisms → cellular organisms → Bacteria | 940
3300012972|Ga0134077_10112249 | All Organisms → cellular organisms → Bacteria | 1064
3300012976|Ga0134076_10021568 | All Organisms → cellular organisms → Bacteria | 2287
3300012977|Ga0134087_10611454 | All Organisms → cellular organisms → Bacteria | 566
3300014154|Ga0134075_10108908 | All Organisms → cellular organisms → Bacteria | 1173
3300014154|Ga0134075_10156742 | All Organisms → cellular organisms → Bacteria | 974
3300017656|Ga0134112_10031067 | All Organisms → cellular organisms → Bacteria | 1880
3300017656|Ga0134112_10159337 | All Organisms → cellular organisms → Bacteria | 871
3300017966|Ga0187776_10036372 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira | 2757
3300017966|Ga0187776_11457451 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 524
3300018052|Ga0184638_1002035 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6200
3300018056|Ga0184623_10045171 | All Organisms → cellular organisms → Bacteria | 2004
3300018433|Ga0066667_12316407 | All Organisms → cellular organisms → Bacteria | 503
3300018468|Ga0066662_11511136 | All Organisms → cellular organisms → Bacteria | 699
3300018468|Ga0066662_12278800 | Not Available | 569
3300018468|Ga0066662_12617745 | All Organisms → cellular organisms → Bacteria | 533
3300019360|Ga0187894_10276236 | All Organisms → cellular organisms → Bacteria | 785
3300022563|Ga0212128_10005194 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → Nitrospira japonica | 8570
3300022563|Ga0212128_10057692 | All Organisms → cellular organisms → Bacteria | 2504
3300025324|Ga0209640_10013808 | All Organisms → cellular organisms → Bacteria | 7030
3300025910|Ga0207684_10000507 | All Organisms → cellular organisms → Bacteria | 48732
3300026296|Ga0209235_1047596 | All Organisms → cellular organisms → Bacteria | 2085
3300026298|Ga0209236_1016787 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira | 4180
3300026309|Ga0209055_1038807 | All Organisms → cellular organisms → Bacteria | 2119
3300026310|Ga0209239_1118181 | All Organisms → cellular organisms → Bacteria | 1103
3300026313|Ga0209761_1130924 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira | 1206
3300026324|Ga0209470_1049850 | All Organisms → cellular organisms → Bacteria | 2025
3300026324|Ga0209470_1193519 | All Organisms → cellular organisms → Bacteria | 867
3300026327|Ga0209266_1043063 | All Organisms → cellular organisms → Bacteria | 2284
3300026327|Ga0209266_1162855 | All Organisms → cellular organisms → Bacteria | 882
3300026332|Ga0209803_1171570 | All Organisms → cellular organisms → Bacteria | 822
3300026332|Ga0209803_1311179 | All Organisms → cellular organisms → Bacteria | 541
3300026334|Ga0209377_1059953 | All Organisms → cellular organisms → Bacteria | 1669
3300026524|Ga0209690_1041214 | All Organisms → cellular organisms → Bacteria | 2131
3300026528|Ga0209378_1089107 | Not Available | 1379
3300026528|Ga0209378_1095219 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 1322
3300026537|Ga0209157_1020809 | All Organisms → cellular organisms → Bacteria | 4035
3300026537|Ga0209157_1107744 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira | 1310
3300026537|Ga0209157_1109339 | All Organisms → cellular organisms → Bacteria | 1297
3300026540|Ga0209376_1411732 | All Organisms → cellular organisms → Bacteria | 505
3300027846|Ga0209180_10596016 | All Organisms → cellular organisms → Bacteria | 611
3300027846|Ga0209180_10713736 | All Organisms → cellular organisms → Bacteria | 544
3300027875|Ga0209283_10158191 | All Organisms → cellular organisms → Bacteria | 1503
3300027882|Ga0209590_10000370 | All Organisms → cellular organisms → Bacteria | 14317
3300031576|Ga0247727_11011969 | All Organisms → cellular organisms → Bacteria | 572
3300032180|Ga0307471_102100425 | All Organisms → cellular organisms → Bacteria | 710
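
Each scaffold identifier above has two parts joined by a pipe character. The part before the pipe appears to correspond to the Taxon OID used in the Associated Samples list (for example, 3300002558 occurs in both). The sketch below splits an identifier on that assumption. The helper name `split_scaffold_id` is hypothetical and is not part of any NMPFamsDB or IMG/M tooling.

```python
# Illustrative sketch: split a scaffold identifier of the form
# "<taxon OID>|<scaffold name>" into its two parts.
# Assumption: the text before the first pipe is the IMG/M Taxon OID.


def split_scaffold_id(identifier):
    """Return (taxon_oid, scaffold_name) for an identifier containing '|'."""
    taxon_oid, sep, scaffold_name = identifier.partition("|")
    if not sep or not taxon_oid or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return taxon_oid, scaffold_name


# Identifiers copied from the scaffold list.
examples = [
    "3300002558|JGI25385J37094_10073828",
    "3300005166|Ga0066674_10459992",
]

parsed = [split_scaffold_id(s) for s in examples]
for taxon_oid, scaffold_name in parsed:
    print(taxon_oid, scaffold_name)
```

Grouping scaffolds by the first element of each pair would then give the number of family members contributed by each sample.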




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 29.80%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 23.84%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 19.87%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 13.91%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 3.31%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.31%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.32%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.32%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.66%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.66%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.66%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.66%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.66%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300002560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm | Environmental
3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002912 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010080 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_0_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010087 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010108 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_8_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010114 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010124 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010127 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010133 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010142 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_4_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012379 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012382 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012383 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_4_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012390 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012392 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012397 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012400 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012401 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012402 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012403 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012406 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012410 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300017656 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015 | Environmental
3300017966 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_20_MG | Environmental
3300018052 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2 | Environmental
3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300019360 | White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaG | Environmental
3300022563 | OV2_combined assembly | Environmental
3300025324 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300026296 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes) | Environmental
3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental
3300026309 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes) | Environmental
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25385J37094_100738281 | 3300002558 | Grasslands Soil | TLGLLASTLIGCASLPMREGNALAGYRQELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPXDPSTLPTAFAPLPXKGREL*
JGI25383J37093_100536662 | 3300002560 | Grasslands Soil | MGTTRKLLLTLGLLASTLVGCASLPMREDNALAGYRQELAKLVEAGVLTKEDEEKFYGIASLEMERRAKQRHQPSDPTTLPTAFAPLPKKAREL*
JGI25384J37096_100377302 | 3300002561 | Grasslands Soil | MGTTRTLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEEKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL*
JGI25382J43887_100618881 | 3300002908 | Grasslands Soil | MGTTRKLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPL
JGI25386J43895_100471861 | 3300002912 | Grasslands Soil | MGTTRKLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKK
JGI25386J43895_101011491 | 3300002912 | Grasslands Soil | MHTTRTVLLTLGLLVSTLVGCASIPVQENDALAGYRRELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPXDPSTLPTAFAPLPXKGREL*
Ga0066674_104599921 | 3300005166 | Soil | VNSGSRMQTTRTFLLTLGHLAFTFVGCASLPVREDDTLAAYRLELARLVETGVLTNDDAEKFYGIASLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL*
Ga0066683_106646712 | 3300005172 | Soil | MQTTRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0066683_106955391 | 3300005172 | Soil | RRGAVVDGSEHSCRRRRGDVNSGSRMQTTRTVLLTLGLLASTLFGCAGSLVREDDTLAGYRLELARLVETGVLTKDDAEKFYGIASLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL*
Ga0066683_107809441 | 3300005172 | Soil | MQTTRTFLLTLGLLAFTFVGCASLPVREDDTLAAYRLELARLVETGVLTKDDAEKFYGIACLEMERRAALRSEPLSNQPSDPTTLPT
Ga0066683_108721632 | 3300005172 | Soil | MTKLRTGYLSPFSVTLLTLGLLAFTFVGCASLPVQENDTLAGYRQELAKLVETGMLTKDDAEKFYGIASLEMERRAKQRHQPSDPTALPTAFGPPARKATEL*
Ga0066680_105258442 | 3300005174 | Soil | MHTMRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQRHQPSDPTTLPTAFVPLPKKAREL*
Ga0066680_106976322 | 3300005174 | Soil | MRTSYLSPFSVTLLTLGLLASTLVGCESLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL*
Ga0066678_102168142 | 3300005181 | Soil | VGCASIPVQENDALAGYRQELAKLVEAGVLTKVDEKKFYQIAALEMERRAKQRHQASDPTTLPTAFAPLPKKAREL*
Ga0066678_102258642 | 3300005181 | Soil | MHTMRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0066676_103647732 | 3300005186 | Soil | MQTTRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQHHQASDPTTLPTAFAPLPKKAREL*
Ga0070708_1004976661 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | VWAVLLALGLLASTLVGCAGSLVREDDTLAGYRQELAKLVEAGVLTKEDEEKFYRIASLEVERRANQRHQPSDPTALPTGFGSPLRK
Ga0070708_1013955452 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | VWAVLLALGLLASTLVGCAGSLVREDDTLAGYRQELAQLVEAGVLTKEDEEKFYGIASLEMERRTKQRHQPSDPTALPTGFGSPLRKVSEL*
Ga0066686_100913821 | 3300005446 | Soil | MGTTRKLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFAPLPKKAREL*
Ga0066686_103227972 | 3300005446 | Soil | MHTTRTLLLTLGLLASTLIGCASLPMREDNALAGYRQELAKLVEAGVLTKADEEKFYQIAALEMERRAKQRHQPSDPSTLPTAFAPLPQKGREL*
Ga0066686_110102892 | 3300005446 | Soil | MQTTRTVLLTLGLLASTLFGCAGSLVREDDTLAGYRLELARLVETGVLTKDDAEKFYGIASLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL*
Ga0066689_100320403 | 3300005447 | Soil | MGTTRTVLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL*
Ga0066689_102682922 | 3300005447 | Soil | STLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQRHQPSDPTTLPTAFVPLPKKAREL*
Ga0070706_1001399442 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MQTTRTWLLTLGLLASALVGCAGLTAHEDDTLSGYRRELAELVEAGVLTKEDEEKFYGIASLEMERRTKQRHQPSDPTALPTGFGSPLRKVSEL*
Ga0070697_1000739692 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MRTARTLLLTLGLLVSTLVGCAGSLVREDDTLAGYRQELAKLVEAGVLTKADEEKFYQIASLEMERRAKQGHQASDPTTVPTAFAPLPKKARDL*
Ga0066701_105852982 | 3300005552 | Soil | MGTTRKLLLTLGLLASTLVGCASIPVQENDALAGYRRELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPSDPSTLPTAFAPLPQKGREL*
Ga0066701_108230142 | 3300005552 | Soil | MGTTRKLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL*
Ga0066701_108882792 | 3300005552 | Soil | MHTTRTLLLTLGLLASTLIGCASLPMREDNALAGYRQELAKLVEAGVLTKEDEEKFYGIASLEMERRAKQRHQPSDPTTLPTAFAPLPKKAREL*
Ga0066695_103162332 | 3300005553 | Soil | MQTTRTVLLTLGLLASTLFGCAGSLVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL*
Ga0066661_100741602 | 3300005554 | Soil | MGTTRTWLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVDAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL*
Ga0066692_101322422 | 3300005555 | Soil | MHTTRTVLLTLGLLVSTLVGCASIPVQENDALAGYRRELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPSDPSTLPTAFAPLPQKGREL*
Ga0066692_102265782 | 3300005555 | Soil | MRTGYLSPFSVTLLTLGLLASTLVGCTSLPVPEDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL*
Ga0066707_100477081 | 3300005556 | Soil | MTRTLLLTLGLLAFTFVGCASLPAREDDTLAGYRLELAKLVETGVLTTDDAEKFYGIASLEMERRAALRAGRLRSEPSDPTALPTGSVPLPRTEREL*
Ga0066698_100574552 | 3300005558 | Soil | MGTTRTWLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL*
Ga0066698_105152412 | 3300005558 | Soil | MQTTRKLLLTLGLLVSTLVGCESLPVREDDTLAGYRQELATLVEAGVLTKADEKNFYQIAALEMERRAKQSHQPSDPTTLPTGFVPLARKATEL*
Ga0066706_105977412 | 3300005598 | Soil | MGTTRTWLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVDAGVLTKADEKKFYQIAALEMERRAKQRHQPSDPTTLPTAFVPLPKKAREL*
Ga0066706_106092392 | 3300005598 | Soil | MTRTLLLTLGLLAFTFVGCASLPAREDDTLAGYRLELAKLVETGVLTKNDAEKFYGIASLEMERRAALRAGRLRSEPSDPTALPTGSVPLPRT
Ga0066656_106407612 | 3300006034 | Soil | MQTTRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTGFVPLARKATEL*
Ga0066665_107502151 | 3300006796 | Soil | MRTGYLSPFSVTLLTLGLLASTLVGCTSLPVPEDDTLAGYRQELAKLVEAGVLTKADEEKFYRIASLEMERRAAQRHQPSDPSTLPTAFVPLARKATEL*
Ga0066665_110852522 | 3300006796 | Soil | GCASLPVREDDTLAGYRQELAKLVDAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL*
Ga0066710_1010486211 | 3300009012 | Grasslands Soil | TRTLLLTIGLIASMLVGCAGSLVREDDTLAGYRLELARLVETGVLTKDDAEKFYGIASLEMERRAALRAEPLSNQPSDPTALPTGSVPLPRTAREL
Ga0066710_1023351421 | 3300009012 | Grasslands Soil | MQTTRTFLLTLGLLAFTFVGCASLPAREDDTLSGYRQELARLVETGVLTKDDAEKFYGIACLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKA
Ga0066710_1033186722 | 3300009012 | Grasslands Soil | MHTTRTVLLTLGLLVSTLVGCASIPVQENDALAGYRRELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPLDPSTLPTAFAPLPKKGREL
Ga0066710_1039798251 | 3300009012 | Grasslands Soil | MTRTLLLTLGLLAFTFVGCASLPAREDDTLAGYRLELAKLVETGVLTKNDAEKFYGIASLEMERRAALRVGRLRSEPSDPTALPTGSVPLPRTEREL
Ga0099829_102377562 | 3300009038 | Vadose Zone Soil | MGTTRTLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEEKFYGIASLEMERRAKQRHQPSDPTTLPTAFAPLPKKAREL*
Ga0099829_116145071 | 3300009038 | Vadose Zone Soil | MQLAGNRKSSLPPFSVTLLTLGLLASTLIGCAGSLVREDDTLAGYRQELAELVETGVLTKEDEEKFYRIAGLEVERRSALRTGQLSSQPSDSTALPTAFVPRARKASEL*
Ga0099830_101322212 | 3300009088 | Vadose Zone Soil | MQLAGNRKSSLPPFSVTLLTLGLLASTLIGCAGSLVREDDTLAGYRQELAELVETGVLTKEDEEKFYRIAGLEVERRSALRTGQLSSQPSDSTALPTAFVPLLRKATEL*
Ga0099830_115757952 | 3300009088 | Vadose Zone Soil | MGTTRKLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEEKFYGIASLEMERRAKQRHQPSDPTTLPTAFAPLPKKAREL*
Ga0099828_100459211 | 3300009089 | Vadose Zone Soil | MTKMRTCCLSPFSVTLLALGLLVSTLVGCASLPMREDNPMAGYRQELAKLVEAGVLTKEDEEKFYQIASLEMERRDKQRHQASDP
Ga0099828_119207862 | 3300009089 | Vadose Zone Soil | FAPRLGASGGSRLPSNPSSEWRISEKLASALTELRMQTTRTWLLTLGLLASTLVGCASLPMQENDALAGYRRELVTLVETGVLTKEDEEKFYRIAGLEMKRRANQRHQPSDPTTLPTAFVPLLRKATEL*
Ga0099827_102785282 | 3300009090 | Vadose Zone Soil | MQMTRTVLLTLGLLASMLVGCAGSLVREDDALAGYRQELAKLVEAGVLTKEDEEKFYRIASLEMERRDKQRHQASDPTTLPTAFAPLPQKGREL*
Ga0099827_106445472 | 3300009090 | Vadose Zone Soil | MTKMPTGYFLSPFSVTLLTLGLLASTLIGCASIHVQENDALAGYRQELAKLVETGVLTKDDAEKFYGIASLEMERRAKQRHQPSDPSTLPTVFVPLPKKAREL*
Ga0066709_1017872282 | 3300009137 | Grasslands Soil | MHTTRTLLLTLGLLASTLIGCASLPMREDNALAGYRQELAKLVEAGVLTKEDEEKFYGVASLEMERRAKQRHQPSDPTTLPTAFAPLPKKAREL*
Ga0066709_1018184482 | 3300009137 | Grasslands Soil | MTRTLLLTLGLLASTLIGCAGSLVREDDTLAAYRLELANLVETGVLTNGDAEKFYGIASLEMERRAKQRHQPTDPATLPTAFGPPARRATEL*
Ga0066709_1021843931 | 3300009137 | Grasslands Soil | MTRTLLLTLGLLAFTFVGCASLPAREDDTLAGYRLELAKLVETGVLTKNDAEKFYGIASLEMERRAALRVGRLRSEPSDLTALPTGSVPLPRTAREL*
Ga0114945_100038857 | 3300009444 | Thermal Springs | MRTARTWLFLGLLASTLVGCAGSLVREDDTLTGYRQELTELVETGVLTKDDAEKFYRIAGLEMERRANQRHQPSDTTALLTKRRSP*
Ga0114945_100335233 | 3300009444 | Thermal Springs | MQTTRTWLLTLGLLASTLVGCAGSLVREDDTLAGYRQELTELVETGVLTKDDAEKFYRIAGLEMERRARQRHQPSDPTALPTAFEPLARKASEL*
Ga0114945_108451352 | 3300009444 | Thermal Springs | MHTTRTWLLTLGLLASTLVGCAGSLVREDDTLAGYRRELAKLVETGVLTKEDEEKFYRIAGLEVERRAALRAGRLSSEPSDPTALP
Ga0126380_102177591 | 3300010043 | Tropical Forest Soil | MQTTRTVLLIFGLLAITLFGCASLTAQEDDTLAAYRQELARLVEADVLTKNDAEKFYGIASLEMERRAALRNGQLRNRPSDPTVRPTHFGPPVGTAREI*
Ga0127448_1496442 | 3300010080 | Grasslands Soil | LVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL*
Ga0127492_10331422 | 3300010087 | Grasslands Soil | MTKMRTGYLSPFSVTLLALGFLASTVVGCASIPVQEDDTLAGYRQELARLVEAGVLTKEDEEKFYRIASLEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0127492_10333392 | 3300010087 | Grasslands Soil | MQTTRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKAT
Ga0127474_11122942 | 3300010108 | Grasslands Soil | MQTTRTLLLTLGLLASTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0127460_10406511 | 3300010114 | Grasslands Soil | MQTTRTFLLTLGLLAFTFVGCASLPAREDDTLSGYRQELARLVETGVLTKDDAEKFYGIACLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAR*
Ga0127498_10954541 | 3300010124 | Grasslands Soil | VISEKLASPFTESRMQTTRTLLLTLGLLASTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0127489_11300351 | 3300010127 | Grasslands Soil | MGTTRTWLLTLGLLASTLFGCAGSLVREDDTLAGYRLELARLVETGVLTKDDAEKFYGIASLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL*
Ga0127459_11440532 | 3300010133 | Grasslands Soil | MQTTRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQRHQPSDPSTLPTAFAPLPQKGREL*
Ga0127483_11072442 | 3300010142 | Grasslands Soil | MTKMRTAFLSAFSVMLLTLGLLAFTFVGCASLPVREDDTLAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0134082_102603461 | 3300010303 | Grasslands Soil | MQTTRKLLLTLGLLVSTLVGCESLPVREDDTLAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0134088_101341612 | 3300010304 | Grasslands Soil | MHTTRTVLLTLGLLVSTLVGCASIPVQENDALAGYRRELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPSDPTTLPTAFVPLARKATEL*
Ga0134088_103414162 | 3300010304 | Grasslands Soil | MQTTRTFLLTLGLLAFTFVGCASLPAREDDTLSGYRQELARLVETGVLTKDDAEKFYGIACLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL*
Ga0134063_102080812 | 3300010335 | Grasslands Soil | MGTTRTWLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVDAGVLTKADEKNFYRIASLEMERRAKQSLQPSDPTTLPTGFVPLARKATEL*
Ga0134071_101153861 | 3300010336 | Grasslands Soil | MGTTRKLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTA
Ga0137393_109795972 | 3300011271 | Vadose Zone Soil | MTKMRTGYLSPFSVTLLTLGLLASTLVGCASLPVPEDDTLAGYRQELAKLVEAGVLTKADEEKFYGIASLEMERRAKQRHQPSDPTTLPTAFAPLPKKVREL*
Ga0137388_100693643 | 3300012189 | Vadose Zone Soil | MGTTRTLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQPSDPTTLPTAFVPLPKKAREL*
Ga0137388_117785412 | 3300012189 | Vadose Zone Soil | VWAVLLALGLLASTPVGCAGSLVREDDTLAGYRQELAKLVEAGVLTKEDEEKFYRIASLEMERRAKQRHQPSDPTALPTGFGSPLRKASEL*
Ga0137365_104691852 | 3300012201 | Vadose Zone Soil | MTKMRTGYLSPFSVTLLTLGLLASTFVGCASLPVREDDTLAGYRLELARLVEVGVLTKDDAEKFYGIASLEMERRAKQRHQVSDPTTLPTAFVPLPKKAREL*
Ga0137380_101960113 | 3300012206 | Vadose Zone Soil | MTKMRTGYLSPFSVTLLTLGLLVSTLVGCASLPVPEDDTLSGYRQELARLVETGVLTNDDAEKFYGIASLEMERRAKQRHQVSDPTTLPTAFVPLPKKAREL*
Ga0137380_106273292 | 3300012206 | Vadose Zone Soil | VISEKLASPFTESRMQTTRKLLLTLGLLVSTLVGCASIPVREDDTLAGYRQELAKLVEAGVLTKADEEKFYRIASLEMERRAKQSHQPSDPTTLPTGFVPLARKATEL*
Ga0137380_111508412 | 3300012206 | Vadose Zone Soil | MTKMRTGYLSPFSVTLLTLGLLVSTLVGCARIPVRENDTLAGYRQELARLVETGVLTNDDAEKFYGIASLEMERRAKQRHQVSDPTTLPTAFV
Ga0137381_111137171 | 3300012207 | Vadose Zone Soil | MTKMRTGYLSPFSVTLLTLGLLVSTLVGCARIPVREDDTLSGYRQELARLVETGVLTNDDAEKFYGIASLEMERRAKQRHQVSDPTTLPTAFVPL
Ga0137381_117209651 | 3300012207 | Vadose Zone Soil | MTKMRTGYLSPFSVTLLTLGLLVSTLVGCASIPVRENDTLAGYRQELARLVETGVLTNDDAEKFYGIASLEMERRAKQRHQVSDPTTLPTAFVPL
Ga0137379_103837982 | 3300012209 | Vadose Zone Soil | MTKMRTGYLSPFSVTLLTLGLLVSTLVGCARIPVREDDTLSGYRQELARLVETGVLTNDDAEKFYGIASLEMERRAKQRHQVSDPTTLPTAFVPLPKKAREL*
Ga0137370_100139064 | 3300012285 | Vadose Zone Soil | MTRTLLLTLGLLASTLIGCAGSLVREDDTLAAYRLELANLVQTGVLTNDDAEKFYGIASLEMERRAKQRHQPTDPATLPTAFGPPARRATEL*
Ga0137387_106681732 | 3300012349 | Vadose Zone Soil | VISEKLASPFTESRMQTTRTLLLTLGLLVSTLVGCASLPVPEDDTLAGYRQELARLVETGVLTKEDEKKFYQIAALEMERRAKQRHQASDPTTLPTAFAPLPKKAREL*
Ga0137387_112277801 | 3300012349 | Vadose Zone Soil | MHTTRTVLLTLGLLVSTLVGGASIPVQENDALAGYRRELAKLVEAGVLTKKDEEKFYQIAAMEMERRAKQRHQPSDTSTLPTAFAPLPQKGREL*
Ga0137386_101563622 | 3300012351 | Vadose Zone Soil | MTKMRTGYLSPFSVTLLTLGLLVSTLVGCASIPVRENDTLAGYRQELARLVETGVLTNDDAEKFYGIASLEMERRAKQRHQVSDPTTLPTSFVPLPKKAREL*
Ga0137369_101035413 | 3300012355 | Vadose Zone Soil | MTKMRTGYLSPFSVTLLTLGLFAFTFVGCASLPVREDDTLAGYRQELARLVEVGVLTKDDAEKFYGIASLEMERRAVLRSDPLSNQPSDPTTLPTRFVPLPKRAREL*
Ga0137371_106768911 | 3300012356 | Vadose Zone Soil | LLTLGLLAFTFVGCASLPVREDDTLAAYRLELARLVETGVLTNDDAEKFYGIACLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL*
Ga0137385_108313342 | 3300012359 | Vadose Zone Soil | MTKMRTGYLSPFSVTLLTLGLLVSTLVGCASIPVRENDTLAGYRQELARLVETGVLTNDDAEKFYGIASLEMERRAKQRHQVSDPTTLPTAFVPLPKKAREL*
Ga0137390_115194172 | 3300012363 | Vadose Zone Soil | MGTTRKLLLTLGLLASTLVGCASLPVREDDALAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVP
Ga0134058_10340612 | 3300012379 | Grasslands Soil | LGLLAFTFVGCASLPAREDDTLSGYRQELARLVETGVLTKDDAEKFYGIACLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKRAREL*
Ga0134058_12278923 | 3300012379 | Grasslands Soil | MQTTRTLLLTLGLLASTLVGCASIPVHENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0134038_10386152 | 3300012382 | Grasslands Soil | VISEKLASPFTESRMHTMRTLLLTLGLLVSTLVGCESLPVREDDTLAGYRQELATLVEAGVLTKADEKNFYQIAALEMERRAKQSHQPSDPTTLPTGFVPLARKATEL*
Ga0134033_11819411 | 3300012383 | Grasslands Soil | MQTTRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDP
Ga0134054_12607832 | 3300012390 | Grasslands Soil | MQTTRKLLLTLGLLVSTLVGCASSPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0134043_11264041 | 3300012392 | Grasslands Soil | LVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0134056_11756872 | 3300012397 | Grasslands Soil | MGTTRTWLLTLGLLASTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEEKFYRIPSLEMERRAKQSHQPSDPTTFPTAFVPLFRKATEL*
Ga0134048_11917542 | 3300012400 | Grasslands Soil | MQTTRTFLLTLGLLAFTFVGCASLPVREDDTLAAYRLELARLVETGVLTNDDAEKFYGIASLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL*
Ga0134048_12859192 | 3300012400 | Grasslands Soil | MQTTRTLLLTLGLLVSTLVGCARIPVKENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL*
Ga0134055_12575151 | 3300012401 | Grasslands Soil | LVGCATIPVQENDALAGYRQELAKLVETGVLTKEDEEKFYRIPSLEMERRAKQSHQPSDPTTFPTAFVPLARKATEL*
Ga0134059_11645773 | 3300012402 | Grasslands Soil | MQTTRTFLLTLGLLAFTFVGCASLPVREDDTLAAYRLELARLVETGVLTNDDAEKFYGIACLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL*
Ga0134049_12106443 | 3300012403 | Grasslands Soil | MQTTRTLLLTLGLLASTLVGCAGSLVREDDTLAGYRLELAKLVETGVLTKEDEKKFYQIAALEMERRTKQRHQASDPTTLPTAFVPLPKKAREL*
Ga0134053_11090023 | 3300012406 | Grasslands Soil | MTKLRTAFLYAFSVTLLTLGLLASTLFGCAGSLVREDDTLAGYRLELARLVETGVLTKDDAEKFYGIASLEMERRAALRSEPLSNQPSDPTTLP
Ga0134060_13926051 | 3300012410 | Grasslands Soil | SEKLASPFTESRMHTTRTLLLTLGLLASTLIGCASLPMREDNALAGYRQELARLVETGVLTKDDAEKFYGIASLEMERRAAQRAEQRQGPPLSNPGVQ*
Ga0134060_14512152 | 3300012410 | Grasslands Soil | MQTTRTFLLTLGLLAFTFVGCASLPAREDDTLSGYRQELARLVETGVLTKDDAEKFYGIACLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKQEREL*
Ga0134077_101122491 | 3300012972 | Grasslands Soil | MTKLRTAFLYAFSVTLLTLGLLAFTLVGCAGSLVREDDTLSGYRLELARLVETGVLTKDDEEKFYGIASLEMERRAKQRHQVSDPTTLPTAFVPLPKKAREL*
Ga0134076_100215684 | 3300012976 | Grasslands Soil | STLVGCASIPVQENDALAGYRRELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPSDPSTLPTAFAPLPQKGREL*
Ga0134087_106114542 | 3300012977 | Grasslands Soil | VISEKLASPFTESRMHTMRTLLLTLGLLVSTLVGCASIPVQENDVLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFAPLPKKAREL*
Ga0134075_101089082 | 3300014154 | Grasslands Soil | MTKMRTGFLCAFSVTLLTLGLLASTFVGCASLPVREDDTLAAYRLELARLVETGVLTNDDAEKFYGIACLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL*
Ga0134075_101567422 | 3300014154 | Grasslands Soil | MHTTRTVLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQPSDPSTLPTAFAPLPQKGREL*
Ga0134112_100310672 | 3300017656 | Grasslands Soil | MHTTRTVLLTLGLLVSTLVGCASIPVQENDALAGYRRELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPSDPSTLPTAFAPLPQKGREL
Ga0134112_101593372 | 3300017656 | Grasslands Soil | MQTTRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL
Ga0187776_100363723 | 3300017966 | Tropical Peatland | VRMTRTLLLALGLLASTLVGCAGSLVREDDTLAGYRQELAELVEAGVLTKEDEEKFYRIASLEMERRARQRHQPSEPTMLPTGLGPLPRTAREL
Ga0187776_114574512 | 3300017966 | Tropical Peatland | VRTTRTLLLALGLIFSTLVGCAGHLVREDATLAGYRRELAELVEAGVLTKEDEEKFYRIASLEMERRAKQRNQPSDPPVLPTGFVPLPKKAREL
Ga0184638_10020359 | 3300018052 | Groundwater Sediment | MGTMRTLLLTLGLLASTLVGCASLPVREDDTLTGYRQELAELVETGVLTKKDAEKFYGIASLEIEGRAKQGRQPSDSTVLPTGLVPLPGKAGEL
Ga0184623_100451713 | 3300018056 | Groundwater Sediment | VRAVLLALGLLASTLVGCAGSLVREDDTLAGYRQELAELVETGVLTKKDAEKFYRIASLEMERRAKQRHQPSDPTTLPTGFGPPLRKASEL
Ga0066667_123164072 | 3300018433 | Grasslands Soil | MHTMRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQRHQPSDPTTLPTAFVPLPKKAREL
Ga0066662_115111362 | 3300018468 | Grasslands Soil | MGTTRKLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL
Ga0066662_122788002 | 3300018468 | Grasslands Soil | MHTMRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQRHQPSDP
Ga0066662_126177451 | 3300018468 | Grasslands Soil | TRTLLLTLGLLAFTFVGCAGSLVREDDTLAGYRLELARLVETGVLTKDDAEKFYGIASLEMERRAAQRAEPLSNQPSDPTALPTGSVPLPRTEREL
Ga0187894_102762362 | 3300019360 | Microbial Mat On Rocks | LSLGLLASMLVGCAGSLLWEDEALTGYRRELAELVETGVLTKADKEKFYRIASLEMERRAALRASLLSGEPSDPTALLIKRRNP
Ga0212128_100051947 | 3300022563 | Thermal Springs | MRTARTWLFLGLLASTLVGCAGSLVREDDTLTGYRQELTELVETGVLTKDDAEKFYRIAGLEMERRANQRHQPSDTTALLTKRRSP
Ga0212128_100576923 | 3300022563 | Thermal Springs | MQTTRTWLLTLGLLASTLVGCAGSLVREDDTLAGYRQELTELVETGVLTKDDAEKFYRIAGLEMERRARQRHQPSDPTALPTAFEPLARKASEL
Ga0209640_100138085 | 3300025324 | Soil | MMRGRSKQPAGNRKSSLSHFSVTLLTLGILASTLVGCAGSLVREDDTLAGYRQELAELVETGVLTKNDAEKFYGIASLEMERRAKQHRQPSDPTTLPTGFGPLLRKASEL
Ga0207684_1000050740 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MQTTRTWLLTLGLLASALVGCAGLTAHEDDTLSGYRRELAELVEAGVLTKEDEEKFYGIASLEMERRTKQRHQPSDPTALPTGFGSPLRKVSEL
Ga0209235_10475961 | 3300026296 | Grasslands Soil | MGTTRTLLLTLGLLASTLVGCASLPMREDNALAGYRQELAKLVEAGVLTKEDEEKFYGIASLEMERRAKQRHQPSDPTTLPTAFAPLPKKAREL
Ga0209236_10167878 | 3300026298 | Grasslands Soil | MHTTRTLLLTLGLLASTLIGCASLPMREGNALAGYRQELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPLDPSTLPTAFAPLPKKGREL
Ga0209055_10388072 | 3300026309 | Soil | MGTTRTWLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL
Ga0209239_11181812 | 3300026310 | Grasslands Soil | MTRTLLLTLGLLASTLIGCAGSLVREDDTLAAYRLELANLVETGVLTNGDAEKFYGIASLEMERRAKQRHQPTDPATLPTAFGPPARRATEL
Ga0209761_11309242 | 3300026313 | Grasslands Soil | MHTTRTWLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL
Ga0209470_10498501 | 3300026324 | Soil | MGTTRTLLLTLGLLASTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQHHQASDPTTLPTAFAPLPKKAREL
Ga0209470_11935192 | 3300026324 | Soil | MQTTRTLLLTLGLLVSTLVGCASIPVQENDALAGYRQELAKLVETGVLTKEDEKKFYQIAALEMERRAKQSHQPSDPTTLPTAFVPLARKATEL
Ga0209266_10430633 | 3300026327 | Soil | MQTTRTFLLTLGLLAFTFVGCASLSVREDDTLAAYRLELARLVETGVLTNDDAEKFYGIASLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL
Ga0209266_11628552 | 3300026327 | Soil | PCRRRRGDVNSGSRMQTTRTVLLTLGLLASTLFGCAGSLVREDDTLAGYRLELARLVETGVLTKDDAEKFYGIASLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKRAREL
Ga0209803_11715702 | 3300026332 | Soil | MGTTRTVLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL
Ga0209803_13111791 | 3300026332 | Soil | KGMHTTRTVLLTLGLLVSTLVGCASIPVQENDALAGYRRELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPSDPSTLPTAFAPLPQKGREL
Ga0209377_10599532 | 3300026334 | Soil | MGTTRKLLLTLGLLASTLVGCASIPVQENDALAGYRRELAKLVEAGVLTKKDEEKFYQIAALEMERRAKQRHQPSDPSTLPTAFAPLPQKGREL
Ga0209690_10412142 | 3300026524 | Soil | MHTTRTLLLTLGLLASTLIGCASLPMREDNALAGYRQELAKLVEAGVLTKEDEEKFYGIASLEMERRAKQRHQPSDPTTLPTAFAPLPKKAREL
Ga0209378_10891071 | 3300026528 | Soil | MRTGYLSPFSVTLLTLGLLASTLVGCTSLPVPEDDTLAGYRQELAKLVEAGVLTKADEEKFYRIASLEMERRAAQRHQPSDPSTLPTAFVPLARKATEL
Ga0209378_10952192 | 3300026528 | Soil | MTRTLLLTLGLLAFTFVGCASLPAREDDTLAGYRLELAKLVETGVLTTDDAEKFYGIASLEMERRAALRAGRLRSEPSDPTALPTGSVPLPRTEREL
Ga0209157_10208092 | 3300026537 | Soil | MGTTRKLLLTLGLLASTLIGCASLPMREDNALAGYRQELAKLVEAGVLTKADEEKFYQIAALEMERRAKQRHQPSDPSTLPTAFAPLPQKGREL
Ga0209157_11077442 | 3300026537 | Soil | MQTTRTFLLTLGLLAFTFVGCASLPAREDDTLSGYRQELARLVETGVLTKDDAEKFYGIACLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKKAREL
Ga0209157_11093392 | 3300026537 | Soil | VNSGSRMQTTRTVLLTLGLLASTLFGCAGSLVREDDTLAGYRLELARLVETGVLTKDDAEKFYGIASLEMERRAALRSEPLSNQPSDPTTLPTRFVPLPKRAREL
Ga0209376_14117321 | 3300026540 | Soil | LTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEKKFYQIAALEMERRAKQRHQASDPTTLPTAFVPLPKKAREL
Ga0209180_105960162 | 3300027846 | Vadose Zone Soil | LLALGLLASTLVGCAGSLVREDDTLAGYRQELAELVEAGVLTNDDAKKFYGIASLEMERRAKQRHQPSDPTALPTGFGSPLRKASEL
Ga0209180_107137361 | 3300027846 | Vadose Zone Soil | LAFTFVGCASLPVQEDDTLAGYRLELARLVEVGVLTNDDAEKFYGIASLEMERRAKQRHQPSDPTTLPTAFAPLPKKAREL
Ga0209283_101581912 | 3300027875 | Vadose Zone Soil | MGTTRTLLLTLGLLASTLVGCASLPVREDDTLAGYRQELAKLVEAGVLTKADEEKFYGIASLEMERRAKQRHQPSDPTTLPTAFAPLPKKAREL
Ga0209590_1000037017 | 3300027882 | Vadose Zone Soil | MPTGYFLSPFSVTLLTLGLLASTLIGCASIHVQENDALAGYRQELAKLVETGVLTKDDAEKFYGIASLEMERRAKQRHQPSDPSTLPTVFVPLPKKAREL
Ga0247727_110119691 | 3300031576 | Biofilm | QTTRAWLLSLGLLASTLVGCAGPLVQEDDTLAGYRRELVKLVETGVLTREDEEKFYRIASLEMERRAALRAGQPSSQPSDPTALLIKRRNP
Ga0307471_1021004252 | 3300032180 | Hardwood Forest Soil | MRTTPTVLLTLGLLVSTLVGCAGSLVREDDTLAGYRQELAKLVEAGVLTKADDEKFYRIASLEMERRAKQRHQASDPTTLPTAFAPLPKKARDL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice