NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F044601

Metagenome Family F044601

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F044601
Family Type Metagenome
Number of Sequences 154
Average Sequence Length 79 residues
Representative Sequence MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT
Number of Associated Samples 113
Number of Associated Scaffolds 154

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 81.58 %
% of genes near scaffold ends (potentially truncated) 29.22 %
% of genes from short scaffolds (< 2000 bps) 77.92 %
Associated GOLD sequencing projects 99
AlphaFold2 3D model prediction Yes
3D model pTM-score0.34

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (98.052 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(31.169 % of family members)
Environment Ontology (ENVO) Unclassified
(67.532 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(75.974 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 69.81%    β-sheet: 0.00%    Coil/Unstructured: 30.19%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.34
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 154 Family Scaffolds
PF09828Chrome_Resist 50.65
PF02777Sod_Fe_C 18.83
PF00081Sod_Fe_N 4.55
PF01554MatE 4.55
PF02423OCD_Mu_crystall 4.55
PF03972MmgE_PrpD 1.30
PF00881Nitroreductase 0.65
PF01068DNA_ligase_A_M 0.65
PF04545Sigma70_r4 0.65
PF07485DUF1529 0.65
PF02254TrkA_N 0.65
PF12833HTH_18 0.65

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 154 Family Scaffolds
COG0605Superoxide dismutaseInorganic ion transport and metabolism [P] 23.38
COG2423Ornithine cyclodeaminase/archaeal alanine dehydrogenase, mu-crystallin familyAmino acid transport and metabolism [E] 4.55
COG20792-methylcitrate dehydratase PrpDCarbohydrate transport and metabolism [G] 1.30
COG1423ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) familyReplication, recombination and repair [L] 0.65
COG1793ATP-dependent DNA ligaseReplication, recombination and repair [L] 0.65


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms98.05 %
UnclassifiedrootN/A1.95 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002557|JGI25381J37097_1028230All Organisms → cellular organisms → Bacteria966Open in IMG/M
3300002558|JGI25385J37094_10047294All Organisms → cellular organisms → Bacteria1455Open in IMG/M
3300002558|JGI25385J37094_10077856All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium1037Open in IMG/M
3300002560|JGI25383J37093_10015406All Organisms → cellular organisms → Bacteria2537Open in IMG/M
3300002562|JGI25382J37095_10273288All Organisms → cellular organisms → Bacteria508Open in IMG/M
3300002909|JGI25388J43891_1005153All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2625Open in IMG/M
3300002911|JGI25390J43892_10160732All Organisms → cellular organisms → Bacteria525Open in IMG/M
3300005166|Ga0066674_10039172All Organisms → cellular organisms → Bacteria2115Open in IMG/M
3300005166|Ga0066674_10312421All Organisms → cellular organisms → Bacteria738Open in IMG/M
3300005171|Ga0066677_10287922All Organisms → cellular organisms → Bacteria938Open in IMG/M
3300005174|Ga0066680_10060292All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2251Open in IMG/M
3300005175|Ga0066673_10122449All Organisms → cellular organisms → Bacteria1421Open in IMG/M
3300005177|Ga0066690_10329081All Organisms → cellular organisms → Bacteria1035Open in IMG/M
3300005180|Ga0066685_10237761All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales1254Open in IMG/M
3300005180|Ga0066685_10332209All Organisms → cellular organisms → Bacteria1054Open in IMG/M
3300005180|Ga0066685_10970090All Organisms → cellular organisms → Bacteria563Open in IMG/M
3300005186|Ga0066676_10064860All Organisms → cellular organisms → Bacteria2123Open in IMG/M
3300005186|Ga0066676_10881269All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales602Open in IMG/M
3300005187|Ga0066675_10036396All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2934Open in IMG/M
3300005187|Ga0066675_10092726All Organisms → cellular organisms → Bacteria → Proteobacteria1975Open in IMG/M
3300005187|Ga0066675_10495833All Organisms → cellular organisms → Bacteria912Open in IMG/M
3300005445|Ga0070708_100581526All Organisms → cellular organisms → Bacteria1056Open in IMG/M
3300005445|Ga0070708_101332609All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas → Pseudomonas aeruginosa group → Pseudomonas aeruginosa670Open in IMG/M
3300005446|Ga0066686_10766045All Organisms → cellular organisms → Bacteria645Open in IMG/M
3300005447|Ga0066689_10004914All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria5550Open in IMG/M
3300005447|Ga0066689_10023883All Organisms → cellular organisms → Bacteria3031Open in IMG/M
3300005451|Ga0066681_10008474All Organisms → cellular organisms → Bacteria4923Open in IMG/M
3300005467|Ga0070706_101792951All Organisms → cellular organisms → Bacteria558Open in IMG/M
3300005518|Ga0070699_100374328All Organisms → cellular organisms → Bacteria1285Open in IMG/M
3300005540|Ga0066697_10306586All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales932Open in IMG/M
3300005552|Ga0066701_10714128All Organisms → cellular organisms → Bacteria601Open in IMG/M
3300005553|Ga0066695_10105385All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1731Open in IMG/M
3300005555|Ga0066692_10161522All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1382Open in IMG/M
3300005557|Ga0066704_10210403All Organisms → cellular organisms → Bacteria1317Open in IMG/M
3300005560|Ga0066670_10319853All Organisms → cellular organisms → Bacteria945Open in IMG/M
3300005561|Ga0066699_10654451All Organisms → cellular organisms → Bacteria753Open in IMG/M
3300005574|Ga0066694_10481325All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium579Open in IMG/M
3300005587|Ga0066654_10082316All Organisms → cellular organisms → Bacteria1512Open in IMG/M
3300006031|Ga0066651_10009693All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3865Open in IMG/M
3300006032|Ga0066696_10566501All Organisms → cellular organisms → Bacteria741Open in IMG/M
3300006176|Ga0070765_100885922All Organisms → cellular organisms → Bacteria844Open in IMG/M
3300006791|Ga0066653_10354268All Organisms → cellular organisms → Bacteria744Open in IMG/M
3300006794|Ga0066658_10022318All Organisms → cellular organisms → Bacteria2487Open in IMG/M
3300006796|Ga0066665_10177765All Organisms → cellular organisms → Bacteria → Proteobacteria1636Open in IMG/M
3300006796|Ga0066665_10824663All Organisms → cellular organisms → Bacteria728Open in IMG/M
3300006796|Ga0066665_10866876All Organisms → cellular organisms → Bacteria703Open in IMG/M
3300006796|Ga0066665_11608874All Organisms → cellular organisms → Bacteria511Open in IMG/M
3300006797|Ga0066659_10320151All Organisms → cellular organisms → Bacteria1191Open in IMG/M
3300006797|Ga0066659_11869167All Organisms → cellular organisms → Bacteria509Open in IMG/M
3300006800|Ga0066660_10874683All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria732Open in IMG/M
3300006852|Ga0075433_10474126All Organisms → cellular organisms → Bacteria1103Open in IMG/M
3300006854|Ga0075425_100232538All Organisms → cellular organisms → Bacteria2121Open in IMG/M
3300006904|Ga0075424_101929319All Organisms → cellular organisms → Bacteria624Open in IMG/M
3300007265|Ga0099794_10798526All Organisms → cellular organisms → Bacteria505Open in IMG/M
3300009012|Ga0066710_100479413All Organisms → cellular organisms → Bacteria1871Open in IMG/M
3300009012|Ga0066710_100940721All Organisms → cellular organisms → Bacteria1332Open in IMG/M
3300009137|Ga0066709_100457484All Organisms → cellular organisms → Bacteria1786Open in IMG/M
3300009137|Ga0066709_100975113All Organisms → cellular organisms → Bacteria1240Open in IMG/M
3300009162|Ga0075423_10768605All Organisms → cellular organisms → Bacteria1019Open in IMG/M
3300010320|Ga0134109_10001481All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales5477Open in IMG/M
3300010320|Ga0134109_10487757All Organisms → cellular organisms → Bacteria507Open in IMG/M
3300010321|Ga0134067_10483404All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium510Open in IMG/M
3300010323|Ga0134086_10140238All Organisms → cellular organisms → Bacteria877Open in IMG/M
3300010323|Ga0134086_10364015All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium574Open in IMG/M
3300010325|Ga0134064_10001494All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria5182Open in IMG/M
3300010326|Ga0134065_10135375All Organisms → cellular organisms → Bacteria848Open in IMG/M
3300010333|Ga0134080_10012434All Organisms → cellular organisms → Bacteria3051Open in IMG/M
3300010333|Ga0134080_10030021All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2067Open in IMG/M
3300010333|Ga0134080_10651292All Organisms → cellular organisms → Bacteria518Open in IMG/M
3300010335|Ga0134063_10185234All Organisms → cellular organisms → Bacteria975Open in IMG/M
3300010336|Ga0134071_10604888All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium573Open in IMG/M
3300010337|Ga0134062_10173173All Organisms → cellular organisms → Bacteria971Open in IMG/M
3300010364|Ga0134066_10227767All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium633Open in IMG/M
3300011270|Ga0137391_10043820All Organisms → cellular organisms → Bacteria3798Open in IMG/M
3300012096|Ga0137389_10309376All Organisms → cellular organisms → Bacteria1338Open in IMG/M
3300012189|Ga0137388_11222283Not Available689Open in IMG/M
3300012189|Ga0137388_11660314All Organisms → cellular organisms → Bacteria574Open in IMG/M
3300012198|Ga0137364_10218142All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1402Open in IMG/M
3300012200|Ga0137382_10852417All Organisms → cellular organisms → Bacteria657Open in IMG/M
3300012209|Ga0137379_10841491All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium822Open in IMG/M
3300012356|Ga0137371_10438931All Organisms → cellular organisms → Bacteria1011Open in IMG/M
3300012363|Ga0137390_11252318All Organisms → cellular organisms → Bacteria688Open in IMG/M
3300012582|Ga0137358_10994353All Organisms → cellular organisms → Bacteria543Open in IMG/M
3300012918|Ga0137396_10761225All Organisms → cellular organisms → Bacteria713Open in IMG/M
3300012927|Ga0137416_10303322All Organisms → cellular organisms → Bacteria1324Open in IMG/M
3300012929|Ga0137404_10023087All Organisms → cellular organisms → Bacteria → Proteobacteria4517Open in IMG/M
3300012930|Ga0137407_12298752All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300012975|Ga0134110_10052596All Organisms → cellular organisms → Bacteria1606Open in IMG/M
3300012975|Ga0134110_10095088All Organisms → cellular organisms → Bacteria1201Open in IMG/M
3300012975|Ga0134110_10289448All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria705Open in IMG/M
3300012977|Ga0134087_10107985All Organisms → cellular organisms → Bacteria1173Open in IMG/M
3300014154|Ga0134075_10150522All Organisms → cellular organisms → Bacteria994Open in IMG/M
3300014154|Ga0134075_10503312All Organisms → cellular organisms → Bacteria543Open in IMG/M
3300014166|Ga0134079_10397052All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium640Open in IMG/M
3300015357|Ga0134072_10355236All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium564Open in IMG/M
3300015358|Ga0134089_10249102All Organisms → cellular organisms → Bacteria726Open in IMG/M
3300015358|Ga0134089_10486502All Organisms → cellular organisms → Bacteria539Open in IMG/M
3300017654|Ga0134069_1148916All Organisms → cellular organisms → Bacteria781Open in IMG/M
3300017656|Ga0134112_10110227All Organisms → cellular organisms → Bacteria1039Open in IMG/M
3300017656|Ga0134112_10142982All Organisms → cellular organisms → Bacteria917Open in IMG/M
3300017659|Ga0134083_10029723All Organisms → cellular organisms → Bacteria1984Open in IMG/M
3300018431|Ga0066655_10014947All Organisms → cellular organisms → Bacteria3478Open in IMG/M
3300018431|Ga0066655_10058598All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2015Open in IMG/M
3300018431|Ga0066655_10120316All Organisms → cellular organisms → Bacteria → Proteobacteria1508Open in IMG/M
3300018433|Ga0066667_10065984All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2264Open in IMG/M
3300018433|Ga0066667_10167363All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1574Open in IMG/M
3300018433|Ga0066667_11786818All Organisms → cellular organisms → Bacteria556Open in IMG/M
3300018433|Ga0066667_12010092All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium531Open in IMG/M
3300018468|Ga0066662_10294344All Organisms → cellular organisms → Bacteria1359Open in IMG/M
3300018468|Ga0066662_10611831All Organisms → cellular organisms → Bacteria1022Open in IMG/M
3300018482|Ga0066669_10060568All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2448Open in IMG/M
3300018482|Ga0066669_11220078All Organisms → cellular organisms → Bacteria → Proteobacteria679Open in IMG/M
3300021432|Ga0210384_10017086All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria7098Open in IMG/M
3300025910|Ga0207684_10001883All Organisms → cellular organisms → Bacteria → Proteobacteria21817Open in IMG/M
3300025922|Ga0207646_10167670All Organisms → cellular organisms → Bacteria1983Open in IMG/M
3300025922|Ga0207646_10294356All Organisms → cellular organisms → Bacteria1467Open in IMG/M
3300025922|Ga0207646_11942445All Organisms → cellular organisms → Bacteria501Open in IMG/M
3300026277|Ga0209350_1004105All Organisms → cellular organisms → Bacteria → Proteobacteria5239Open in IMG/M
3300026295|Ga0209234_1058062All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales1471Open in IMG/M
3300026295|Ga0209234_1107301All Organisms → cellular organisms → Bacteria1048Open in IMG/M
3300026296|Ga0209235_1047748All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2081Open in IMG/M
3300026297|Ga0209237_1091971All Organisms → cellular organisms → Bacteria → Proteobacteria1356Open in IMG/M
3300026300|Ga0209027_1124350All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium897Open in IMG/M
3300026306|Ga0209468_1000537All Organisms → cellular organisms → Bacteria → Proteobacteria17967Open in IMG/M
3300026313|Ga0209761_1215959All Organisms → cellular organisms → Bacteria813Open in IMG/M
3300026325|Ga0209152_10036915All Organisms → cellular organisms → Bacteria1686Open in IMG/M
3300026332|Ga0209803_1026749All Organisms → cellular organisms → Bacteria → Proteobacteria2780Open in IMG/M
3300026333|Ga0209158_1071768All Organisms → cellular organisms → Bacteria1363Open in IMG/M
3300026334|Ga0209377_1115099All Organisms → cellular organisms → Bacteria1090Open in IMG/M
3300026351|Ga0257170_1037078All Organisms → cellular organisms → Bacteria666Open in IMG/M
3300026359|Ga0257163_1032662All Organisms → cellular organisms → Bacteria820Open in IMG/M
3300026514|Ga0257168_1047675All Organisms → cellular organisms → Bacteria937Open in IMG/M
3300026514|Ga0257168_1056474All Organisms → cellular organisms → Bacteria862Open in IMG/M
3300026523|Ga0209808_1238234All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium582Open in IMG/M
3300026524|Ga0209690_1038363All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2236Open in IMG/M
3300026538|Ga0209056_10255724All Organisms → cellular organisms → Bacteria1241Open in IMG/M
3300026540|Ga0209376_1117232All Organisms → cellular organisms → Bacteria → Proteobacteria1330Open in IMG/M
3300026547|Ga0209156_10032308All Organisms → cellular organisms → Bacteria → Proteobacteria2959Open in IMG/M
3300026548|Ga0209161_10394209All Organisms → cellular organisms → Bacteria609Open in IMG/M
3300027862|Ga0209701_10557872All Organisms → cellular organisms → Bacteria614Open in IMG/M
3300028047|Ga0209526_10041353All Organisms → cellular organisms → Bacteria → Proteobacteria3243Open in IMG/M
3300028047|Ga0209526_10106730All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1970Open in IMG/M
3300028536|Ga0137415_10179647All Organisms → cellular organisms → Bacteria1942Open in IMG/M
3300028536|Ga0137415_10180564All Organisms → cellular organisms → Bacteria1936Open in IMG/M
3300028828|Ga0307312_10163423All Organisms → cellular organisms → Bacteria1418Open in IMG/M
3300028906|Ga0308309_10053720All Organisms → cellular organisms → Bacteria2919Open in IMG/M
(restricted) 3300031150|Ga0255311_1108857All Organisms → cellular organisms → Bacteria603Open in IMG/M
3300031720|Ga0307469_10247672All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales1431Open in IMG/M
3300031820|Ga0307473_10210311All Organisms → cellular organisms → Bacteria1162Open in IMG/M
3300032180|Ga0307471_101034826All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria988Open in IMG/M
3300032180|Ga0307471_101791635All Organisms → cellular organisms → Bacteria766Open in IMG/M
3300032180|Ga0307471_103683334All Organisms → cellular organisms → Bacteria542Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil31.17%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil18.83%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil18.18%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil11.69%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.19%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil3.25%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.60%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.30%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.30%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil1.30%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.65%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.65%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cmEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002909Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaGEnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015357Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026351Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-BEnvironmentalOpen in IMG/M
3300026359Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-AEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI1027J12803_10412000533300000955SoilMEENWYAVEQQVRDRISEARAAARIRTLTRKVAPTARRPNSVGITISRLASRVSTRAMQLSLGLSRALANVRAVTKATSRERTPTMHQEETPSQRQPFKPGA
JGI25381J37097_102823023300002557Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK*
JGI25385J37094_1004729413300002558Grasslands SoilMEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPKARRPNFVGITIIRLANWVLARAMQLALELSRALANVQAATKRT*
JGI25385J37094_1007785623300002558Grasslands SoilMEENWYAVEQQIRDRLTDARASARIRTLTQKLAPTARRQYSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
JGI25383J37093_1001540643300002560Grasslands SoilMEENWYAVEQQIRDRLTEARAAARIWTLTQKLALTARRPNSVGITIIRLANWVLARAMQLPLELARALANVQAATKRI*
JGI25382J37095_1027328823300002562Grasslands SoilMEENWYAVEQQIRDRLTEARAAARIRSLTQKLAPTARRPNSVGITVIRLANWVLARAMLLPLELSRALANVQAATKRT*
JGI25388J43891_100515323300002909Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNYVGITIIRLANWVLARAMRLPLELSRALAKVQAATK*
JGI25390J43892_1016073223300002911Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNXVGITIIRLANWVLARAMXLPLELSRALAKVQAATK*
Ga0066674_1003917223300005166SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066674_1031242123300005166SoilNWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0066677_1028792213300005171SoilMEGDWYTVEQQIRDRLTEARAAAQIRTLTQKLAPRARRPNSVGITIIRLANWVLARAMQLPLELS
Ga0066680_1006029223300005174SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQYSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066673_1012244913300005175SoilMEENWYPVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0066690_1032908123300005177SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK*
Ga0066685_1023776123300005180SoilMEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRQANWVLARAMQLALELSRALANVQAATKRT*
Ga0066685_1033220923300005180SoilMDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRS*
Ga0066685_1097009013300005180SoilNWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066676_1006486033300005186SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0066676_1088126923300005186SoilMEENWYAVEQQVRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRQANWVLARAMQLALELSRALANVQAATKRT*
Ga0066675_1003639623300005187SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMRLPLELSRALAKVQAATK*
Ga0066675_1009272613300005187SoilMEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENGALARAMQLFLGLSRALANVRAVTK*
Ga0066675_1049583323300005187SoilMDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITIIRLASWVLARAMQLPLVLSRALANVQAATKRS*
Ga0070708_10058152623300005445Corn, Switchgrass And Miscanthus RhizosphereMEENWYAVEQQVRDRLTEARAAARIRTLTQRLAPAARRPHAVRVAFTRLTSWVWARTIGLPTALSHALANVRTATKRCQSTSALLAGKESRPWR*
Ga0070708_10133260913300005445Corn, Switchgrass And Miscanthus RhizosphereMEENMYALEQQVRDRLTEARTAARTRALVQQLAPTARRPYSVGIAFSDLASRVLARAREWPLELSRALANVRAATKRG*
Ga0066686_1076604513300005446SoilKPMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLADWVLARAMRLPLELSRALAKVQAATK*
Ga0066689_1000491463300005447SoilMEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLCLGLSRALANVRAVTK*
Ga0066689_1002388323300005447SoilMEENWYAVEQQIRDRLTDARAAARIRSLTQKLAPTARRQYSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRI*
Ga0066681_1000847413300005451SoilNWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRVTSWVLARAMRLPPELSRALAKVQAATK*
Ga0070706_10179295123300005467Corn, Switchgrass And Miscanthus RhizosphereMEENWYAVEQQVRDRLTEARAAARIRTLTQRLAPAARRPHAVRVAFTRLTSWVWARTIGLPTALSHALANVRTATKRCQSTSALLAGKESRP*
Ga0070699_10037432823300005518Corn, Switchgrass And Miscanthus RhizosphereMEENWYAVEQQVRDRLTEARAAARIRTLTQGLAPAARRPHAVRVAFTRLTSWVWARTIGLPTALSRALANVRTATKRCQSTSALLAGKESRP*
Ga0066697_1030658623300005540SoilMEENWYAVEQQIRDRLTDARAAARIRTMTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066701_1071412823300005552SoilMEENWYAVEQQIRDRLTDARAAARIRSLTQKLAPTARRQYSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066695_1010538523300005553SoilNWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK*
Ga0066692_1016152223300005555SoilMEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLFLGLSRALANVRAVTK*
Ga0066704_1021040323300005557SoilMEENWYAAEQQVRDRLSEARAAARIRTLTRGLAPVARRPHAIRVAFTHLASWVWARTIELPAALSHALANARTATKRCQSTNALLVGKEPRP*
Ga0066670_1031985323300005560SoilMDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITITRLASWVLARAMQLPLEISRALANVQAATKRS*
Ga0066699_1065445123300005561SoilMDEDWYTVEQQIRDRLTEARTAAQIRALTEELAPTARRPTSVGIIRLASWVLARAMQLPLELSRALARVRAAMERRASAARERTPH
Ga0066694_1048132513300005574SoilYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNYVGITIIRLANWVLARAMRLPLELSRALAKVQAATK*
Ga0066654_1008231613300005587SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALAN
Ga0066651_1000969323300006031SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRVTSWVLARAMRLPPELSRALAKVQAATK*
Ga0066696_1056650123300006032SoilMDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITITRLASWVLARAMQLPLELSRALANVQAATKRS*
Ga0070765_10088592213300006176SoilMEENWYALEQQIRDRLTEARAAARIRTLTAKLAPTARRPNAAGTTIVRLANWVLGRPMRSPLELSRPLAKVRAAMK*
Ga0066653_1035426813300006791SoilENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0066658_1002231823300006794SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066665_1017776533300006796SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPPELSRALANVQAATKRT*
Ga0066665_1082466313300006796SoilVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0066665_1086687613300006796SoilLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066665_1160887423300006796SoilMEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLCLGLSRALANVHAVTK*
Ga0066659_1032015133300006797SoilMEENWYAVEQQVRDRLNEARAAARTGALNRGLAPSGRRPNSVGITIIRLENWALARAMQLCLGLSRALANVRAVTK*
Ga0066659_1186916723300006797SoilMEENWYAVEQQIRDRLTDARAAARIRTMTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALAN
Ga0066660_1087468323300006800SoilMEENWYAGEQQVRDRLNEARAAARTGALNHGLAPSARRPNSVGITIIRLENWTLARAMQLFLGLSRALANVRAVTK*
Ga0075433_1047412613300006852Populus RhizosphereMEENWYAVEQQIRDRLTEARAGARTWTMTQGPTPAARRPHTVGITIIRLGSWVWARAVQLPLELSRGFASVRAAMKDTASHRRDSS
Ga0075425_10023253833300006854Populus RhizosphereMEENWYAVEQQVRDRLNEARAAARTGALNHGLAPSARRPNSVGTTIIRLANWALARPMQLFLRFSRALANVRAVTK*
Ga0075424_10192931913300006904Populus RhizosphereRDRLTEARARARTWTMTQGPTPAARRPHAVWITIIRLGSWVWARAVQLPLELSRGFASVRAAMKDTASHRRDSSKVADSNT*
Ga0099794_1079852613300007265Vadose Zone SoilMEENWYAVEQQVRDRLSEARAAARIRTLTRGLAPVARRPHAIRVAFTHLASWVWARTIELPAALSHALANARTATKR
Ga0066710_10047941333300009012Grasslands SoilMDEAWYIVEQQIRDRLTEALAAARIRTLTQELAPTARRRNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRS
Ga0066710_10094072123300009012Grasslands SoilMEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRLANWVLARAMQLALELSRALANVQAATKRT
Ga0066709_10045748423300009137Grasslands SoilMDEAWYIVEQQIRDRLTEALAAARIRTLTQELAPTARRPNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRS*
Ga0066709_10097511323300009137Grasslands SoilMEENWYAVEQQVRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRLANWVLARAMQLALELSRALANVQAATKRT*
Ga0075423_1076860523300009162Populus RhizosphereMEENWYAVEQQIRDRLTEARARARTWTMTQGPTPAARRPHAVWITIIRLGSWVWARAVQLPLELSRGFASVRAAMKDTASHRRDSSKVADGNT*
Ga0134109_1000148123300010320Grasslands SoilMEENWYPVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALAKVQAATK*
Ga0134109_1048775723300010320Grasslands SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMRLPLELSRALANVQAATKR
Ga0134067_1048340423300010321Grasslands SoilMDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITIIRLANWVLARAMRLPLELSRALANVQAATKRS*
Ga0134086_1014023813300010323Grasslands SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMRLPLELSRALAN
Ga0134086_1036401513300010323Grasslands SoilENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLVNWVLARAMQLPLELSRALAKVQAATK*
Ga0134064_1000149473300010325Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLASWVLARAMQLPLVLSRALANVQAATKRS*
Ga0134065_1013537523300010326Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELS
Ga0134080_1001243453300010333Grasslands SoilLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0134080_1003002123300010333Grasslands SoilMEENWYPVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMRLPLELSRALAKVQAATK*
Ga0134080_1065129223300010333Grasslands SoilMEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0134063_1018523423300010335Grasslands SoilMDEDWYTVEQQIRDRLTEARAAARIRTMTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0134071_1060488823300010336Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALASVQAATK*
Ga0134062_1017317323300010337Grasslands SoilMEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRS*
Ga0134066_1022776723300010364Grasslands SoilRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK*
Ga0137391_1004382043300011270Vadose Zone SoilMDEAWYLVEQQIRDRLTEARAAARLRTPTQKPAQTGRRPNSVGITISRLASWVLARAMQLSLGLSRVLANVRAVTK*
Ga0137389_1030937633300012096Vadose Zone SoilMEEDWDLEQQIRDRLTEARAAARIRIPTQKLAPTPRRQNSVGITIIRLSNWVLARAMQLSLELSRALANARAATKRG*
Ga0137388_1122228313300012189Vadose Zone SoilMEENMYALEQQVRDRLTEARTAARARALVQQLAPTARRPYSVGIAFSDLASRVLARAREWPLELSRALANARAATKRG*
Ga0137388_1166031423300012189Vadose Zone SoilMEEDWYTVEQQIRDRLTEARTAARIRSLSQGLAPAARRPHSVGTAFIRFASWVWARARELPPEGSGGVANVRTAREDTNHG*
Ga0137364_1021814223300012198Vadose Zone SoilVKAMEGDWYTVEQQIRDRLTEARAAVQIPTLTEKLAPRARRPNSVGITIIRLANWVLARAMQLRLEISRALANVQAATKRS*
Ga0137382_1085241723300012200Vadose Zone SoilMDEAWYIVEQQIRDRLTDARAAARMRPLTQKLALTARRRNSVGITIIRLANWVLARAMQLPLELSRALANVQATTKRS*
Ga0137379_1084149123300012209Vadose Zone SoilMEEKWYAVEQQVRDRLNEARAVARTGALNHGLAPSARRPNSVGITIIRLENWALARAMQLFLGLSRALANVRAVTK*
Ga0137371_1043893123300012356Vadose Zone SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNYVGITIIRLANWVLARAMRLPLELSR
Ga0137390_1125231823300012363Vadose Zone SoilMEENWYAVEQQVRDRLSEARAAARIRTLTRGLAPVARRPHAIRVAFTHLASWVWARTIELPAALSHALANARTATKRCQSTNALLVGKEPRP*
Ga0137358_1099435313300012582Vadose Zone SoilMEENWYAVEQQIRDRLTEARAAARTWSLIHGLAPSARRPYSITVTFIPLASWVLARALGLPLKLSRALASVRAATKRYRSTNALFGGKESRS*
Ga0137396_1076122513300012918Vadose Zone SoilRTLRWREPQGVKRMEENWYAIEQQIRERLSEARAGARTWTLTGGLVPAARRPHAVRVVFIRLRSWALARAMEFTTELSRALANVRTATKRRHSTNALLPGKKSRP*
Ga0137416_1030332223300012927Vadose Zone SoilMEENWYAVEQQIRDRLTEARAGARTWALTGGLVPAARRPHAVRVVFIRLRSWALARAMEFTTELSRALANVRTATKRRQSTHALLPGKKSRP*
Ga0137404_1002308763300012929Vadose Zone SoilMEENWYAIEQQIRDRLTEARAGARTWILTGGLVPAARRPHAVRVVFIRLRSWALARAMEFTTELSRALANVRTATKRRQSTNALLPGKKSRP*
Ga0137407_1229875213300012930Vadose Zone SoilMPTVPGKECTPMEENMYALEQQVRDRLTEARTAARTRALVQQLAPTARRPYSVGIAFSDLASRVLARAREWPLELSRAVANVRAATKRG*
Ga0134110_1005259623300012975Grasslands SoilMDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITITRLASWVLARAMQLPLVLSRALANVHAATKRS*
Ga0134110_1009508813300012975Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0134110_1028944833300012975Grasslands SoilVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLCLGLSRALANVRAVTK*
Ga0134087_1010798513300012977Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRVTSWVLARAMRLPLELSRALANVQAATKRT*
Ga0134075_1015052223300014154Grasslands SoilMEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRQANWVLARAMQLPLELSRALANVQAATKRT*
Ga0134075_1050331213300014154Grasslands SoilQGVKPMAENWYAVEQQVRDRLTEARAAARIRTLTQKLAPKARRPNFVGITIIRLANWVLARAMQLALELSRALANVQAATKRT*
Ga0134079_1039705213300014166Grasslands SoilPMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNYVGITIIRLANWVLARAMRLPPELSRALAKVQAATK*
Ga0134072_1035523623300015357Grasslands SoilMEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0134089_1024910213300015358Grasslands SoilMEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALASAMQLFLGLSRTLANVRAVTK*
Ga0134089_1048650223300015358Grasslands SoilLCRRAWKGLKPMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0134069_114891623300017654Grasslands SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRS
Ga0134112_1011022723300017656Grasslands SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK
Ga0134112_1014298223300017656Grasslands SoilQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRQANWVLARAMQLALELSRALANVQAATKRT
Ga0134083_1002972333300017659Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVMARAMRLPLELSRALANVQAATKRT
Ga0066655_1001494763300018431Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT
Ga0066655_1005859833300018431Grasslands SoilMEENWYAVEQQVRDRLNEARAAARTGALNRGLAPSGRRPNSVGITIIRLENWALARAMQLFLGLSRTLANVRAVTK
Ga0066655_1012031623300018431Grasslands SoilMEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRQANWVLARAMQLALELSRALANVQAATKRT
Ga0066667_1006598413300018433Grasslands SoilENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK
Ga0066667_1016736313300018433Grasslands SoilMEENWYAVEQQIRDRLTDARAAARIRTMTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSR
Ga0066667_1178681813300018433Grasslands SoilENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT
Ga0066667_1201009223300018433Grasslands SoilMDEAWYIVEQQIRDRLTEALAAARIRTLTQELAPTARRPNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRS
Ga0066662_1029434423300018468Grasslands SoilMEENWYAVEQQIRDRLTDARAAARIRSLTQKLAPTARRQYSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRT
Ga0066662_1061183123300018468Grasslands SoilMEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLCLGLSRALANVRAVTK
Ga0066669_1006056823300018482Grasslands SoilMEDNWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRVTSWVLARAMRLPPELSRALAKVQAATK
Ga0066669_1122007823300018482Grasslands SoilMEENWYPVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRS
Ga0210384_1001708613300021432SoilMEENWYALEQQIRDRLTEARAAARIRTLTAKLAPTARRPNAAGTTIVRLANWVLGRPMRSPWSSHARSPRCERR
Ga0207684_10001883183300025910Corn, Switchgrass And Miscanthus RhizosphereMEEDWYTVEQQVRDRLTEARAAARIWTLTPKPAATASHLNVVGITIIRLANWVLARAVRSPVDPSRALADVPVTTTRCESAAPERTSPCDWRSPDFRPAHRE
Ga0207646_1016767033300025922Corn, Switchgrass And Miscanthus RhizosphereMEENWYAVEQQVRDRLTEARAAARIRTLTQRLAPAARRPHAVRVAFTRLTSWVWARTIGLPTALSHALANVRTATKRCQSTSALLAGKESRP
Ga0207646_1029435623300025922Corn, Switchgrass And Miscanthus RhizosphereMEENMYALEQQVRDRLTEARTAARTRALVQQLAPTARRPYSVGIAFSDLASRVLARAREWPLELSRALANVRAATKRG
Ga0207646_1194244523300025922Corn, Switchgrass And Miscanthus RhizosphereMEENWYAVEQQIRDRVTEARAAARTRTLIHGLAPSARRPYSITIAFFRLASWVLARALGLPLKLSRALATSNVGGRNT
Ga0209350_100410523300026277Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNYVGITIIRLANWVLARAMRLPLELSRALAKVQAATK
Ga0209234_105806223300026295Grasslands SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK
Ga0209234_110730123300026295Grasslands SoilMDEAWYIVEQQIRDRLTEALAAARIRTLTQELAPTARRPNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRT
Ga0209235_104774833300026296Grasslands SoilMEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPKARRPNFVGITIIRLANWVLARAMQLALELSRALANVQAATKRT
Ga0209237_109197123300026297Grasslands SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT
Ga0209027_112435023300026300Grasslands SoilMEENWYAVEQQVRDRLNEARAAARTGALNHGLAPSARRPNSVGITIIRLENWALARAMQLCLGLSRALANVRAVTK
Ga0209468_1000537123300026306SoilMEENWYPVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT
Ga0209761_121595913300026313Grasslands SoilKPMEENWYAVEQQIRDRLTEARAAARIRSLTQKLAPTARRPNSVGITVIRLANWVLARAMLLPLELSRALANVQAATKRT
Ga0209152_1003691523300026325SoilMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRT
Ga0209803_102674923300026332SoilMEENWYAVEQQIRDRLTDARAAARIRSLTQKLAPTARRQYSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRI
Ga0209158_107176823300026333SoilMEENWYAAEQQVRDRLSEARAAARIRTLTRGLAPVARRPHAIRVAFTHLASWVWARTIELPAALSHALANARTATKRCQSTNALLVGKEPRP
Ga0209377_111509923300026334SoilMEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLCLGLSRALAN
Ga0257170_103707823300026351SoilMEEDWYAVEQQVRDRLNEARAAARTRTLIHGHALTARRPNSLGITIIRLENWVLACAMQLSLGLSRALANVRAVTK
Ga0257163_103266223300026359SoilMEENWYAVEQQIRDRLTEARAGARTWTLTGGLVPAARRPHAVRVVFIRLRSWALARAMELPTELSRALAYVRTATKRRQSTDALLPGKESRP
Ga0257168_104767513300026514SoilMEENWYAVEQQVRDRLSEARAAARIRTLTRGLVPAARRPLAVRIVFIGLTSWALARATKLSRALANVRTATKRWWQSTNAPLPGKESRR
Ga0257168_105647413300026514SoilMEEDWYTVEQQVRDRLTEARAAGRIRTLPPKLAPTARRPNVVGITIIGLANWVLARAMRSLVDLSRALADVTVTTTRCESAAEKGRRH
Ga0209808_123823423300026523SoilMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMRLPLELSRALAKVQAATK
Ga0209690_103836333300026524SoilMEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWTLARAMQLFLGLSRALANVRAVTK
Ga0209056_1025572433300026538SoilLKPMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT
Ga0209376_111723223300026540SoilMEENWYAVEQQIRDRLTDARAAARIRTMTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT
Ga0209156_1003230843300026547SoilMDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITIIRLASWVLARAMQLPLVLSRALANVQAATKRS
Ga0209161_1039420913300026548SoilMDEDWYTVEQQIRDRLTEARAAARIRALTEELAPTARRPTSVGIIRLASWVLARAMQLPPEL
Ga0209701_1055787213300027862Vadose Zone SoilMEENWYAVEQQIRDRLNEARAAARTRALIPKLAPSARRPYSIRLAVIRLAGRVLAQALELPLKLLRALDFTCAHANRR
Ga0209526_1004135333300028047Forest SoilMEENWYAVEQQIRDRLTEARARARTWTLIHGLAPSARRPYSITIALIPLASWVLARALWLSLKLSRALANARTATKRYQSTNALMGGKESRP
Ga0209526_1010673013300028047Forest SoilMEENWYAVEQQVRDRISEARAAARIRTLTRKVAPTARRPNSVGITISRLASWVSARAMQLSLGLSRALVNVAR
Ga0137415_1017964733300028536Vadose Zone SoilMEENWYAVEQQVRDRLSEARAAARIRTLTRGLAPVARRPHAIRVAFTHLASWVWARTIELPAALSHALANARTATKRCQSTNALLVGKEPRP
Ga0137415_1018056433300028536Vadose Zone SoilMEENWYAVEQQIRDRLTEARAGARTWALTGGLVPAARRPHAVRVVFIRLRSWALARAMEFTTELSRALANVRTATKRRQSTHALLPGKKSRP
Ga0307312_1016342323300028828SoilMEENWYAVEQQVRDRLSEARAAARIRTLTRGLVPAARRPHAVRIVFIGLTSWALARATKLSRALANVRTATKRWWQSANAPLPGKESRR
Ga0308309_1005372013300028906SoilMEENWYALEQQIRDRLTEARAAARIRTLTAKLAPTARRPNAAGTTIVRLANWVLGRPMRSPLELSRPLAKVRAAMK
(restricted) Ga0255311_110885713300031150Sandy SoilMEENWYAVEQQVRDRLNEARAAARTRALVQNLAPTARRPNSVGFTIIRLANWILARAMQLPPELSRALAKMRGTRERRESA
Ga0307469_1024767223300031720Hardwood Forest SoilMEENWYALEQQIRDRLTEARAAARIRTLTAKLAPTARRPNAAGTTILRLANWVLGRPMRSPLELSRPLAKVRAAMK
Ga0307473_1021031123300031820Hardwood Forest SoilMEENWYELEQRIRDRLTEARAAARVRTLTQRVAPTARRPNSVGITIIRLANWVLACALQLPLELSRALVKVRAAPK
Ga0307471_10027383223300032180Hardwood Forest SoilMEENWYAVEQQVRDRISEARAAARIRTLTRKVAPTARRPNSVGITISRLASWVSARAMQLSLGLSRALANVRAVTKVTSRERTPTMHQEETTSQRQPFKPGAW
Ga0307471_10103482623300032180Hardwood Forest SoilSSGFTALPDAAVPGKECTPMEENMYALEQQVRDRLTEARTAARTRALVQQLAPTARRPYSVGIAFSDLASRVLARAREWPLELSRALANVRAATKRG
Ga0307471_10179163513300032180Hardwood Forest SoilMEENWYAVEQQIRDRLTEARAGARTWAMTQGLAPAARRPHAVGITIIRLGNWVLARAMQLSLGLSRALANVRAVTKVTSREGTPTMHQEETT
Ga0307471_10368333413300032180Hardwood Forest SoilMEENWYAVEQQVRDRLNEARAAARTRTLIHRPALTACGPNSVRIIITHLKNEVLARAMQLSLGL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.