NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F032596


Overview

Basic Information
Family ID: F032596
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 179
Average Sequence Length: 93 residues
Representative Sequence: MLTVTERAAALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR
Number of Associated Samples: 127
Number of Associated Scaffolds: 179
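
The representative sequence can be saved as a FASTA record for downstream searches. The following is a minimal sketch: the record name `F032596_representative`, the helper `to_fasta`, and the 60-column line width are illustrative assumptions, not conventions prescribed by NMPFamsDB.

```python
import textwrap

# Representative sequence of family F032596, copied from Basic Information.
REPRESENTATIVE = (
    "MLTVTERAAALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISDEPDPDDEEFEQDGLRIF"
    "VEDVLVEPLDGRTLDVREAAEGTELVFR"
)


def to_fasta(seq_id: str, sequence: str, width: int = 60) -> str:
    """Return one FASTA record with the sequence wrapped at `width` columns."""
    # textwrap.wrap splits an unbroken string into chunks of at most `width`.
    body = "\n".join(textwrap.wrap(sequence, width))
    return f">{seq_id}\n{body}\n"


# Hypothetical record name, chosen only for this example.
record = to_fasta("F032596_representative", REPRESENTATIVE)
print(record)
print(len(REPRESENTATIVE), "residues")
```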

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 75.98 %
% of genes near scaffold ends (potentially truncated): 31.28 %
% of genes from short scaffolds (< 2000 bps): 87.15 %
Associated GOLD sequencing projects: 111
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.77


Most Common Taxonomy
Group: Unclassified (76.536 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (46.927 % of family members)
Environment Ontology (ENVO): Unclassified (51.955 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (55.307 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 16.53%, β-sheet: 32.23%, Coil/Unstructured: 51.24%

Predicted 3D Structure

pTM-score: 0.77

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
b.124.1.1: HesB-like domain | d1r94a_ | 1r94 | 0.80979
b.124.1.0: automated matches | d2apna_ | 2apn | 0.77882
b.124.1.1: HesB-like domain | d1nwba_ | 1nwb | 0.77221
b.124.1.0: automated matches | d1x0ga_ | 1x0g | 0.74412
b.124.1.0: automated matches | d2d2aa_ | 2d2a | 0.74059
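
TM-scores lie between 0 and 1, and a value above 0.5 is commonly read as a sign that two structures share the same fold. The sketch below applies that rule of thumb to the SCOPe matches listed above. The 0.5 cutoff, the `ScopeMatch` record type, and the helper `likely_same_fold` are assumptions made for illustration; the page itself does not state a threshold.

```python
from typing import List, NamedTuple


class ScopeMatch(NamedTuple):
    scop_family: str
    scop_domain: str
    pdb_id: str
    tm_score: float


# Rows copied from the "Structural matches with SCOPe domains" table.
MATCHES = [
    ScopeMatch("b.124.1.1: HesB-like domain", "d1r94a_", "1r94", 0.80979),
    ScopeMatch("b.124.1.0: automated matches", "d2apna_", "2apn", 0.77882),
    ScopeMatch("b.124.1.1: HesB-like domain", "d1nwba_", "1nwb", 0.77221),
    ScopeMatch("b.124.1.0: automated matches", "d1x0ga_", "1x0g", 0.74412),
    ScopeMatch("b.124.1.0: automated matches", "d2d2aa_", "2d2a", 0.74059),
]

# Assumed rule-of-thumb cutoff; not a value taken from NMPFamsDB.
SAME_FOLD_CUTOFF = 0.5


def likely_same_fold(matches: List[ScopeMatch],
                     cutoff: float = SAME_FOLD_CUTOFF) -> List[ScopeMatch]:
    """Keep the matches whose TM-score exceeds the cutoff."""
    return [m for m in matches if m.tm_score > cutoff]


hits = likely_same_fold(MATCHES)
best = max(MATCHES, key=lambda m: m.tm_score)
print(f"{len(hits)} of {len(MATCHES)} matches exceed {SAME_FOLD_CUTOFF}")
print(f"Best match: {best.scop_domain} ({best.scop_family}), TM-score {best.tm_score}")
```

All five matches fall in SCOP fold b.124, which agrees with the HesB/IscA-related Pfam and COG neighbours reported in the Gene Neighborhood section.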



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 179 Family Scaffolds
PF01521 | Fe-S_biosyn | 15.64
PF13767 | DUF4168 | 15.08
PF04972 | BON | 2.23
PF07969 | Amidohydro_3 | 2.23
PF11885 | DUF3405 | 0.56
PF12327 | FtsZ_C | 0.56
PF05697 | Trigger_N | 0.56
PF05698 | Trigger_C | 0.56
PF00211 | Guanylate_cyc | 0.56
PF00378 | ECH_1 | 0.56
PF00872 | Transposase_mut | 0.56
PF07110 | EthD | 0.56
PF02955 | GSH-S_ATP | 0.56
PF03030 | H_PPase | 0.56
PF01992 | vATP-synt_AC39 | 0.56
PF13649 | Methyltransf_25 | 0.56
PF05534 | HicB | 0.56
PF01368 | DHH | 0.56
PF04519 | Bactofilin | 0.56
PF13442 | Cytochrome_CBB3 | 0.56
PF03720 | UDPG_MGDP_dh_C | 0.56
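
The frequencies above are percentages of the 179 family scaffolds, so they can be converted back to whole scaffold counts. The sketch below assumes each percentage is a count divided by 179 and rounded to two decimals, which is an inference from the numbers rather than something the page states. The helper names are hypothetical.

```python
FAMILY_SCAFFOLDS = 179  # "Number of Associated Scaffolds" for family F032596

# A few rows copied from the Pfam table above: Pfam ID -> % frequency.
PFAM_PERCENT = {
    "PF01521": 15.64,
    "PF13767": 15.08,
    "PF04972": 2.23,
    "PF11885": 0.56,
}


def percent_to_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Recover the nearest whole number of scaffolds behind a percentage."""
    return round(percent / 100 * total)


def count_to_percent(count: int, total: int = FAMILY_SCAFFOLDS) -> float:
    """Express a scaffold count as a percentage rounded to two decimals."""
    return round(count / total * 100, 2)


counts = {pfam: percent_to_count(p) for pfam, p in PFAM_PERCENT.items()}
for pfam, n in counts.items():
    # Print the count next to the percentage it maps back to.
    print(pfam, n, count_to_percent(n))
```

Under that assumption, a frequency of 0.56 corresponds to a single scaffold, so most of the neighbouring domains in the table were each observed only once.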

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 179 Family Scaffolds
COG0316 | Fe-S cluster assembly iron-binding protein IscA | Posttranslational modification, protein turnover, chaperones [O] | 15.64
COG4841 | Uncharacterized conserved protein YneR, related to HesB/YadR/YfhF family | Function unknown [S] | 15.64
COG0189 | Glutathione synthase, LysX or RimK-type ligase, ATP-grasp superfamily | Translation, ribosomal structure and biogenesis [J] | 1.12
COG0544 | FKBP-type peptidyl-prolyl cis-trans isomerase (trigger factor) | Posttranslational modification, protein turnover, chaperones [O] | 1.12
COG1527 | Archaeal/vacuolar-type H+-ATPase subunit C/Vma6 | Energy production and conversion [C] | 0.56
COG1598 | Antitoxin component HicB of the HicAB toxin-antitoxin system | Defense mechanisms [V] | 0.56
COG1664 | Cytoskeletal protein CcmA, bactofilin family | Cytoskeleton [Z] | 0.56
COG2114 | Adenylate cyclase, class 3 | Signal transduction mechanisms [T] | 0.56
COG3328 | Transposase (or an inactivated derivative) | Mobilome: prophages, transposons [X] | 0.56
COG3808 | Na+ or H+-translocating membrane pyrophosphatase | Energy production and conversion [C] | 0.56
COG4226 | Predicted nuclease of the RNAse H fold, HicB family | General function prediction only [R] | 0.56



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 76.54 %
All Organisms | root | All Organisms | 23.46 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000364|INPhiseqgaiiFebDRAFT_105223759 | Not Available | 970
3300001593|JGI12635J15846_10151984 | Not Available | 1586
3300001867|JGI12627J18819_10464186 | Not Available | 519
3300002245|JGIcombinedJ26739_101392416 | Not Available | 594
3300002914|JGI25617J43924_10027722 | All Organisms → cellular organisms → Bacteria | 1998
3300004080|Ga0062385_10213693 | All Organisms → cellular organisms → Bacteria | 1050
3300005171|Ga0066677_10748160 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Propionibacteriales → Kribbellaceae → Kribbella → Kribbella flavida | 544
3300005332|Ga0066388_104313458 | Not Available | 725
3300005332|Ga0066388_105021868 | Not Available | 672
3300005436|Ga0070713_102034454 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Halobacteria → Halobacteriales → Haloarculaceae → Haloarcula → Haloarcula japonica | 557
3300005454|Ga0066687_10300408 | Not Available | 910
3300005467|Ga0070706_101999410 | Not Available | 525
3300005533|Ga0070734_10000141 | All Organisms → cellular organisms → Bacteria | 198178
3300005536|Ga0070697_100940847 | Not Available | 767
3300005537|Ga0070730_10326232 | All Organisms → cellular organisms → Bacteria | 1002
3300005555|Ga0066692_10566019 | Not Available | 717
3300005556|Ga0066707_10766556 | Not Available | 599
3300005559|Ga0066700_10220098 | All Organisms → cellular organisms → Bacteria | 1318
3300005591|Ga0070761_10377705 | Not Available | 862
3300005607|Ga0070740_10001100 | All Organisms → cellular organisms → Bacteria | 40602
3300005764|Ga0066903_105671990 | Not Available | 657
3300005893|Ga0075278_1050014 | Not Available | 628
3300006028|Ga0070717_11101248 | Not Available | 723
3300006052|Ga0075029_100321622 | All Organisms → cellular organisms → Bacteria | 991
3300006796|Ga0066665_10454375 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 1056
3300006797|Ga0066659_10648135 | Not Available | 860
3300006804|Ga0079221_10291271 | Not Available | 952
3300006893|Ga0073928_10004160 | All Organisms → cellular organisms → Bacteria | 21820
3300006903|Ga0075426_10521933 | Not Available | 883
3300006914|Ga0075436_100801193 | All Organisms → cellular organisms → Bacteria | 701
3300007258|Ga0099793_10644196 | Not Available | 533
3300007265|Ga0099794_10127388 | Not Available | 1284
3300007982|Ga0102924_1171057 | Not Available | 976
3300009038|Ga0099829_10286230 | Not Available | 1349
3300009038|Ga0099829_10512015 | All Organisms → cellular organisms → Bacteria | 996
3300009038|Ga0099829_10635690 | Not Available | 887
3300009088|Ga0099830_10245864 | Not Available | 1414
3300009088|Ga0099830_10507997 | Not Available | 984
3300009088|Ga0099830_11629635 | Not Available | 538
3300009089|Ga0099828_12023388 | Not Available | 503
3300009093|Ga0105240_10985722 | Not Available | 902
3300009093|Ga0105240_11753009 | Not Available | 648
3300009093|Ga0105240_12206033 | Not Available | 571
3300009137|Ga0066709_103524876 | Not Available | 568
3300009143|Ga0099792_10258061 | Not Available | 1018
3300010373|Ga0134128_10396281 | Not Available | 1541
3300010376|Ga0126381_100114637 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3474
3300010376|Ga0126381_101086015 | All Organisms → cellular organisms → Bacteria | 1156
3300010396|Ga0134126_10163116 | All Organisms → cellular organisms → Eukaryota → Viridiplantae | 2692
3300010396|Ga0134126_10644969 | Not Available | 1208
3300011120|Ga0150983_12549683 | Not Available | 547
3300011120|Ga0150983_14852461 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1241
3300011269|Ga0137392_10049734 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 3148
3300011269|Ga0137392_11618795 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Halobacteria → Halobacteriales → Haloarculaceae → Haloarcula → Haloarcula japonica | 506
3300011270|Ga0137391_10264905 | Not Available | 1487
3300011270|Ga0137391_10427966 | Not Available | 1128
3300011270|Ga0137391_10769783 | Not Available | 795
3300011271|Ga0137393_10118550 | Not Available | 2180
3300011271|Ga0137393_10616456 | Not Available | 930
3300012189|Ga0137388_10959971 | Not Available | 789
3300012189|Ga0137388_11140218 | Not Available | 717
3300012189|Ga0137388_11848358 | Not Available | 535
3300012199|Ga0137383_11134412 | Not Available | 565
3300012200|Ga0137382_10908399 | Not Available | 634
3300012200|Ga0137382_10996661 | Not Available | 601
3300012202|Ga0137363_10333539 | Not Available | 1252
3300012202|Ga0137363_10807765 | Not Available | 796
3300012202|Ga0137363_11156639 | Not Available | 658
3300012203|Ga0137399_10441057 | Not Available | 1088
3300012203|Ga0137399_10786138 | Not Available | 801
3300012203|Ga0137399_10864149 | Not Available | 762
3300012205|Ga0137362_10103589 | All Organisms → cellular organisms → Bacteria | 2399
3300012205|Ga0137362_10713442 | Not Available | 862
3300012205|Ga0137362_11395901 | Not Available | 586
3300012206|Ga0137380_10677270 | Not Available | 897
3300012208|Ga0137376_10200821 | Not Available | 1725
3300012210|Ga0137378_11622913 | Not Available | 555
3300012211|Ga0137377_11255975 | Not Available | 671
3300012351|Ga0137386_11150778 | Not Available | 545
3300012361|Ga0137360_10517751 | Not Available | 1016
3300012361|Ga0137360_10791637 | Not Available | 816
3300012361|Ga0137360_11279839 | Not Available | 634
3300012362|Ga0137361_10237216 | Not Available | 1659
3300012362|Ga0137361_11282099 | Not Available | 657
3300012363|Ga0137390_10031871 | All Organisms → cellular organisms → Bacteria | 5030
3300012363|Ga0137390_10401465 | Not Available | 1349
3300012363|Ga0137390_10945708 | Not Available | 815
3300012363|Ga0137390_11172278 | Not Available | 716
3300012363|Ga0137390_11711994 | Not Available | 563
3300012363|Ga0137390_12029282 | Not Available | 500
3300012582|Ga0137358_10320270 | Not Available | 1052
3300012582|Ga0137358_10490543 | Not Available | 828
3300012582|Ga0137358_10521116 | Not Available | 800
3300012685|Ga0137397_10054865 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 2866
3300012685|Ga0137397_10554870 | Not Available | 855
3300012917|Ga0137395_10770340 | Not Available | 697
3300012922|Ga0137394_10145293 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 2016
3300012923|Ga0137359_10017606 | All Organisms → cellular organisms → Bacteria | 6036
3300012923|Ga0137359_10280397 | Not Available | 1486
3300012923|Ga0137359_10472041 | All Organisms → cellular organisms → Bacteria | 1109
3300012923|Ga0137359_10474371 | Not Available | 1106
3300012924|Ga0137413_10694377 | Not Available | 771
3300012924|Ga0137413_11544838 | Not Available | 541
3300012925|Ga0137419_10981591 | Not Available | 699
3300012925|Ga0137419_11115373 | Not Available | 658
3300012927|Ga0137416_10701746 | Not Available | 889
3300012927|Ga0137416_11378070 | Not Available | 638
3300012929|Ga0137404_11976223 | Not Available | 544
3300012930|Ga0137407_11441084 | Not Available | 655
3300012944|Ga0137410_10648217 | Not Available | 876
3300012944|Ga0137410_11583681 | Not Available | 574
3300015241|Ga0137418_10899132 | Not Available | 651
3300015242|Ga0137412_10361026 | Not Available | 1130
3300015242|Ga0137412_10387859 | All Organisms → cellular organisms → Bacteria | 1083
3300016270|Ga0182036_10292152 | Not Available | 1236
3300018433|Ga0066667_11850951 | Not Available | 548
3300019279|Ga0184642_1723239 | Not Available | 590
3300020069|Ga0197907_11345890 | Not Available | 584
3300020070|Ga0206356_10502534 | Not Available | 542
3300020080|Ga0206350_11344525 | Not Available | 759
3300020581|Ga0210399_10438305 | Not Available | 1089
3300020581|Ga0210399_10668966 | Not Available | 855
3300021046|Ga0215015_10672737 | All Organisms → cellular organisms → Bacteria | 4348
3300021151|Ga0179584_1491857 | Not Available | 770
3300021171|Ga0210405_10130059 | Not Available | 1990
3300021178|Ga0210408_10606141 | Not Available | 867
3300021405|Ga0210387_11852621 | Not Available | 507
3300021407|Ga0210383_11244869 | Not Available | 624
3300021432|Ga0210384_11824842 | Not Available | 514
3300021560|Ga0126371_10949342 | Not Available | 1003
3300021560|Ga0126371_12809608 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Halobacteria → Halobacteriales → Haloarculaceae → Haloarcula → Haloarcula rubripromontorii | 590
3300022467|Ga0224712_10643833 | Not Available | 519
3300022525|Ga0242656_1023158 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 944
3300022527|Ga0242664_1046055 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 784
3300022529|Ga0242668_1119560 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 554
3300022557|Ga0212123_10006608 | All Organisms → cellular organisms → Bacteria | 19184
3300022717|Ga0242661_1160354 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Halobacteria → Halobacteriales → Haloarculaceae → Haloarcula → Haloarcula japonica | 510
3300022724|Ga0242665_10036966 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1235
3300022731|Ga0224563_1028675 | Not Available | 512
3300024288|Ga0179589_10482415 | Not Available | 574
3300025913|Ga0207695_10707722 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Verrucomicrobium → Verrucomicrobium spinosum | 888
3300025928|Ga0207700_11138000 | Not Available | 697
3300026514|Ga0257168_1028677 | Not Available | 1181
3300026551|Ga0209648_10847232 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Halobacteria → Halobacteriales → Haloarculaceae → Haloarcula → Haloarcula japonica | 500
3300026557|Ga0179587_10756053 | Not Available | 641
3300027587|Ga0209220_1156720 | Not Available | 588
3300027645|Ga0209117_1069376 | Not Available | 1006
3300027678|Ga0209011_1044997 | Not Available | 1363
3300027706|Ga0209581_1004605 | All Organisms → cellular organisms → Bacteria | 11009
3300027826|Ga0209060_10000123 | All Organisms → cellular organisms → Bacteria | 199167
3300027846|Ga0209180_10292630 | Not Available | 934
3300027846|Ga0209180_10356302 | All Organisms → cellular organisms → Bacteria | 834
3300027862|Ga0209701_10056238 | Not Available | 2512
3300027875|Ga0209283_10070556 | Not Available | 2249
3300027903|Ga0209488_10028449 | All Organisms → cellular organisms → Bacteria | 4070
3300027903|Ga0209488_10693782 | Not Available | 730
3300027903|Ga0209488_11071980 | Not Available | 552
3300028047|Ga0209526_10094344 | All Organisms → cellular organisms → Bacteria | 2106
3300028047|Ga0209526_10789328 | Not Available | 589
3300028536|Ga0137415_10264728 | Not Available | 1526
3300028536|Ga0137415_11272828 | Not Available | 553
3300030730|Ga0307482_1029243 | Not Available | 1216
3300031057|Ga0170834_105123150 | Not Available | 517
3300031090|Ga0265760_10193304 | Not Available | 684
3300031093|Ga0308197_10138125 | Not Available | 769
3300031231|Ga0170824_102780078 | Not Available | 631
3300031231|Ga0170824_126425172 | Not Available | 751
3300031446|Ga0170820_13015755 | Not Available | 987
3300031708|Ga0310686_109584873 | Not Available | 1190
3300031708|Ga0310686_111134388 | Not Available | 610
3300031708|Ga0310686_111697378 | Not Available | 2122
3300031715|Ga0307476_10149562 | Not Available | 1677
3300031718|Ga0307474_10807969 | Not Available | 742
3300031823|Ga0307478_10081713 | All Organisms → cellular organisms → Bacteria | 2468
3300031879|Ga0306919_11187935 | Not Available | 580
3300031912|Ga0306921_10249865 | Not Available | 2078
3300031941|Ga0310912_11379774 | Not Available | 533
3300032035|Ga0310911_10679290 | Not Available | 597
3300032180|Ga0307471_102172726 | Not Available | 699
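
Each scaffold identifier above joins a numeric IMG/M Taxon OID and a scaffold name with a `|` character, and the Taxon OID is the value listed in the Associated Samples table. The sketch below splits identifiers on that separator and counts family scaffolds per sample. The helper name `split_scaffold_id` and the short input list are illustrative assumptions; only a handful of rows are included.

```python
from collections import Counter
from typing import Tuple

# A few scaffold identifiers copied from the Associated Scaffolds table.
SCAFFOLD_IDS = [
    "3300000364|INPhiseqgaiiFebDRAFT_105223759",
    "3300005332|Ga0066388_104313458",
    "3300009038|Ga0099829_10286230",
    "3300009038|Ga0099829_10512015",
    "3300009038|Ga0099829_10635690",
]


def split_scaffold_id(scaffold_id: str) -> Tuple[str, str]:
    """Split 'TaxonOID|scaffold_name' into its two parts."""
    taxon_oid, sep, name = scaffold_id.partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, name


# Count how many family scaffolds come from each sample (Taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in SCAFFOLD_IDS)
for taxon_oid, n in per_sample.most_common():
    print(taxon_oid, n)
```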




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 46.93%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 8.38%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 5.59%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.91%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.91%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 2.79%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.79%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.79%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.23%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 2.23%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.23%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 2.23%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 2.23%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.12%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 1.68%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.68%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.68%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.68%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.56%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.56%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.56%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.56%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.56%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.56%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 0.56%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300001593 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 | Environmental
3300001867 | Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705) | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300004080 | Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005533 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005591 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 | Environmental
3300005607 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen15_06102014_R2 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005893 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_0N_202 | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006052 | Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006893 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG | Environmental
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300007982 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaG | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009093 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300016270 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300019279 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30 (Metagenome Metatranscriptome) | Environmental
3300020069 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-2 (Metagenome Metatranscriptome) (v2) | Environmental
3300020070 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-1 (Metagenome Metatranscriptome) (v2) | Environmental
3300020080 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5pm-4 (Metagenome Metatranscriptome) (v2) | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021151 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_06_16RNAfungal (Metagenome Metatranscriptome) | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022467Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5pm-2 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022525Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022527Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022529Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022557Paint Pots_combined assemblyEnvironmentalOpen in IMG/M
3300022717Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-11-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022731Soil microbial communities from Bohemian Forest, Czech Republic - CSU4EnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300025913Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027706Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen15_06102014_R2 (SPAdes)EnvironmentalOpen in IMG/M
3300027826Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300030730Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031090Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZI1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031093Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_198 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031879Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)EnvironmentalOpen in IMG/M
3300031912Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)EnvironmentalOpen in IMG/M
3300031941Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080EnvironmentalOpen in IMG/M
3300032035Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF170EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10522375923300000364SoilMLTVTKKAAAFLKVAKAAEGATRGAGIRLRRGAILDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
JGI12635J15846_1015198433300001593Forest SoilMLTVTKKAAALLKAAKAAEGATGDAGIRLRRGAIPPNDSGNLIVGFTISDEPAPDDEEFEQEGLRIFVEEALVEPLDGRTLDVQDANEDEGLELVFR*
JGI12627J18819_1046418613300001867Forest SoilMLTVTKKAVELLKAAKTVEGAAEDAGIRIRRGVAANESKISVGIAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
JGIcombinedJ26739_10139241623300002245Forest SoilAAALLKAAKAAEGATGDAGIRLRRGAIPNDSGNLLVGFTISDEPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVCDTNEDEGLELVFR*
JGI25617J43924_1002772233300002914Grasslands SoilMLTVTKKAAAVLKAEIAAEGAADDAGIRILRGVMPNESGIAVAFAISDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPEFVFR*
Ga0062385_1021369323300004080Bog Forest SoilMTMLTVTKKAAAFLKAAKVAEGATRGAGIRIRRDALPDESGKPSVGFTISEEPEPDDWEFEEEGLRIFVEDQLVQSLDDRILDVRDANEGLQLVFR*
Ga0066677_1074816013300005171SoilMLTVTERAAALLKAAKAAEGAPDDAGIRIRRGVTANESKISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGR
Ga0066388_10431345823300005332Tropical Forest SoilKAEQGATDDAGIRIRKGVMADESGVSVRFAISDAPDPDDEEFEQDGLRIFVEDVLIEPLDGRTLDVREAGEKTEFVFR*
Ga0066388_10502186823300005332Tropical Forest SoilLKAAAAEGAPQDAGIRILRGGMPNDSAAPAVGFVISDDPEPEDEEFEQDWLPFFVEDVLVEQLDGHTLGVRYADGEPELALR*
Ga0070713_10203445413300005436Corn, Switchgrass And Miscanthus RhizosphereMLTVTERAAALLKAAKAAEGAPDDAGIRIRRGVTANESKISVGFAISDEPDPDDEEFEQEGLRIFIEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0066687_1030040813300005454SoilMLTVTDRAAALVKAAKAAGGAPDDAGIRIRRGVTANESKISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0070706_10199941013300005467Corn, Switchgrass And Miscanthus RhizosphereAEGAADDAGIRIRRGVTANESKISVGFAIRDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0070734_100001411673300005533Surface SoilMLTITRRAAAVLKAAKAAEGAADRAGIRLRAGAPLYDSGVSVGFAITDAPAPKDMELEQDGLRIFIEDVLVEPLDGRTLDVRDAADSMELIFR*
Ga0070697_10094084723300005536Corn, Switchgrass And Miscanthus RhizosphereMLTVTKEAAAFLKVAKAAEGATRGAGIRLRRGAILDESGRPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLEGRTLDVRDANDGLQLVFR*
Ga0070730_1032623223300005537Surface SoilMLTVTRRAATLLKAAKFAEGATEDTGIRLRRGRMVSEPGKLAVGFAISPGPEPSDEQIEQDGLRIFVQDELVEALDGRTLDIRDDAGEVELVFR*
Ga0066692_1056601913300005555SoilMLTVTKKAAAFLKVAKAAEGATRGVGIRLRRDSIPDESGRPSVRFTFSAEPAPDDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANEGLELVFR*
Ga0066707_1076655613300005556SoilMLTVTKKAAALLKAAKAADDGARRGAGIRLRRGAIPDESGKPSVGFKISNEPDPDDWEFEQEGLRIFVEGALVEPLDGCTLDVRDANDGLQLVFRARG
Ga0066700_1022009823300005559SoilMLTVTKRAAAVLKAAKAAEGAANDAGIRIGRGVTANESKISVGFAISDEPDPDDEELELEGLRIFVEDVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0070761_1037770513300005591SoilMLTVTKKAAALLKAAKAAEGATRGAGIRLRRGAIPDDSGDVAVGLAICDEPDPNDEEFEQEGLRIFLEEDLVEPLEGRTLDVIDANEGLKLVFR*
Ga0070740_10001100373300005607Surface SoilMLTITRRAAAVLKAAKAAEGAADRAGIRLRAGAPLDDSGVSVGFAITDAPAPKDMELEQDGLRIFIEDVLVEPLDGRTLDVRDAADSMELIFR*
Ga0066903_10567199023300005764Tropical Forest SoilMLTVTKNAAAFLKVAKTAEGETDDAGIRIGKMAEVPGESEISIGFVVRDEPAPDDEEFEQHGLRFFIEDVLVEPLDGHTLDVCEAADGMELVLR*
Ga0075278_105001413300005893Rice Paddy SoilEGATDSAGIRLRMGGIPDDSGKVAIGLAICDEPDPNDEEFEQEGLRIFLEEELVETLENRTLDVTDAEKELKFVFR*
Ga0070717_1110124813300006028Corn, Switchgrass And Miscanthus RhizosphereMLTVTERAAALLKAAKAAEGAPDDAGIRIRSGVTANESKISVGFAISDEPDPDDEEFEQEGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0075029_10032162223300006052WatershedsMLTVTERAAALLKAAKAAEGAADDAGIRIRRGVTANESEISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0066665_1045437543300006796SoilMLTVTKKAAALLKAAKDAQGAADDAGIRIRKDVLPDESDKSGIAVGLAISDGPDPNDAEFEQEGLRIFVEDALVEPLDGRTLDVLDANDGMELVWR*
Ga0066659_1064813523300006797SoilMLTVTKEAAAFLKVAKAAEGATRGAGIRLRRGAILDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0079221_1029127113300006804Agricultural SoilLTVTKKAAAVLKAAKAAKGAPDDAGVRIQRGVGANESEIAVGFTISDEPDSADEEFEQNGLRIFVEDVLVERLDGRTLDAREADDGMELVFR*
Ga0073928_10004160293300006893Iron-Sulfur Acid SpringMLTVTERAAALLKAAKAAEGAADDAGIRIGRGVTANESEISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0075426_1052193333300006903Populus RhizosphereKAAKAAKGAPDDAGVRIQRGVGANESEIAVGFTISDEPDSADEEFEQNGLRIFVEDVLVERLDGRTLDAREADDGMELVFR*
Ga0075436_10080119313300006914Populus RhizosphereMLTVTKKAAAVLKAAKAAKGAPDDAGVRIQRGVGANESEIAVGFTISDEPDSADEEFEQNGLRIFVEDVLVERLDGRTLDAREADDGMELVFR*
Ga0099793_1064419613300007258Vadose Zone SoilTELFGGRLLIEAGFTIMLTVTKKAAAFLKVAKAAEGATRGAGIRLRRGAILDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0099794_1012738823300007265Vadose Zone SoilMLTVTKKAATVLKAEIAAEGAADDAGIRILRGVMPNESGIAVAFAISDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPEFVFR*
Ga0102924_117105723300007982Iron-Sulfur Acid SpringVLTVTKRAAALLKAAKAAEGAADNAGIRIRRGVKANESKISIGFAISDEPDPDDEELEQEGLRIFVEDVLIEPLDGRTLDVREASEGTEFVFR*
Ga0099829_1028623013300009038Vadose Zone SoilMLTVTKKAAALLKAEKAAEGAADDAGIRILRGAMPDEFRGAMPDESGIAVEFTIADDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPELVWR*
Ga0099829_1051201513300009038Vadose Zone SoilMLTVTKRAATLLKAAKLAEGAAEHAGIRIRRGLTTSEPGKLAVGFAISPGPEPSDEQIEQDGLRIFVQDELVEVLDGRTLDIHDTAEEVELVFR*
Ga0099829_1063569013300009038Vadose Zone SoilTRGAGIRLRRDALPDESGKPSVGFTISEEPDPDDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANDGLQLAFR*
Ga0099830_1024586433300009088Vadose Zone SoilMLTVTERAAALLKAAKAAEGAPDDAGIRIRSGVTANESKISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0099830_1050799713300009088Vadose Zone SoilMLTVTKKAAALLKAEKAAEGAADDAGIRILRGAMPDESGIAVEFTITDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPELVWR*
Ga0099830_1162963513300009088Vadose Zone SoilRAATLLKAAKLAEGATEHAGIRIRRGLTTSESGKVAVGFAISPGPEPSDEQIEQDGLRIFVENELVEVLDGRTLDIRDSAEEVELVFR*
Ga0099828_1202338813300009089Vadose Zone SoilMLTVTKRAATLLKAAKLAEGATEHAGIRIRRGLTTSEPGKLAVGFAISPGPEPSDEQIEQDGLR
Ga0105240_1098572213300009093Corn RhizosphereMLTVTKRASALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISNEPDPDDEEFEQHGLRIFVEDVLVETLDGRTLDVREAAEGTELVFR*
Ga0105240_1175300913300009093Corn RhizosphereAAALLKAAKAAEGATDSAGIRLRMGGIPDDSGKVAIGLAICDEPDPNDEEFEQEGLRIFLEEELVETLENRILDVTDANEELKLVFR*
Ga0105240_1220603313300009093Corn RhizosphereMLTVTKRASALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISNEPDPDDEEFEQHGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0066709_10352487613300009137Grasslands SoilKAAEGAPDDAGIRIRRGVTANESKISVGFAISDEPDPDDEEFEQEGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0099792_1025806123300009143Vadose Zone SoilMLTVTERAAALLKAAKAAEGAPDDAGIRIRRGVTANESKISVGFAISDEPDPDDEEFEQEGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0134128_1039628123300010373Terrestrial SoilMLTVTKRASTLLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISNEPDPDDEEFEQHGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0126381_10011463763300010376Tropical Forest SoilMLTITKKAAAVLKAAKAAEGATDNAGIRIRAGAMPDQSGVSVGFAISDAPDPDDMEIEQEGLHVFIQDVLVEPLDGRTLDVREAADGMELIFR*
Ga0126381_10108601523300010376Tropical Forest SoilMLNVTKKAAALLVAAKEAEGGSPSAGIRIRQGTTPQPGSGTVAIGFTISDEPQPDDEQFEQNGLRFFVEESLVEPLDGRTLDVNDVGDGPQLVFR*
Ga0134126_1016311663300010396Terrestrial SoilMVTVTKRASALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISNEPDPDDEEFEQHGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0134126_1064496913300010396Terrestrial SoilMLTITKRASALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISNEPDPDDEEFEQHGLRIFVEDVLVETLDGRTLDVREAAEGTELVFR*
Ga0150983_1254968323300011120Forest SoilQIWPQRRGLQHDGDKLKEVSHMLTVTKKAAALLKAAKAAEGAADDAGIRILRGVMPNESGIAVGLAISDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPEFVFR*
Ga0150983_1485246113300011120Forest SoilMVTVTKKAAAVLKAAKAAHGASPDAGIRILKGTVPNHPETLAVGFTITDDPRPDDEEFEEQGLRIFVEDALIEPLDGRTLDVRDANEGPELVWR*
Ga0137392_1004973433300011269Vadose Zone SoilMLTVTERAAALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0137392_1161879513300011269Vadose Zone SoilDMLTVTKKAAAVLKAEIAAEGAADDAGIRILRGVMPNESGIAVAFAISDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPEFVFR*
Ga0137391_1026490543300011270Vadose Zone SoilEVIDMLTVTKKAAALLKAAKAAEGATGGAGIRLRRGATPKDSEKLTVGFTISDEPDPDDEEFEQDGLRIFVEEALVEPLDGRTLDVRDADEGLELVFR*
Ga0137391_1042796633300011270Vadose Zone SoilMLTVTKKAAAFLKVAKAAEGATRGAGIRLRRDAMPDESGKPSVGFTFSNEPDPDDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANDGLQLVFR*
Ga0137391_1076978313300011270Vadose Zone SoilLKEVIDMLTVTKKAAALLKAAKAAEGATGGAGIRLRRGAITNDSEKLTVGFTISDEPDPDDEEFEQEGLRIFVEEALVEPLDGRTLDVRDANEGLELVFR*
Ga0137393_1011855033300011271Vadose Zone SoilMLTVTERAAALLKAAKAAEGAADDAGIRIRRGVTANESNISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0137393_1061645613300011271Vadose Zone SoilMVKVTREAAAVLKAAKAAHGATEDAGIRILKGSVPGEPGTLAVGFAITRDPRPDDEEFEQQGLRIFVEDALVEPLDGRTLDVRDDNEGPELVWR*
Ga0137388_1095997123300012189Vadose Zone SoilGFTIMLTVTERAAALLKAAKAAEGAPDDAGIRIRSGVTANESKISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0137388_1114021823300012189Vadose Zone SoilMLTVTKKAAAVLKAEIAADGAADDAGIRILRGVMPNESGIAVAFAISDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPEFVFR*
Ga0137388_1184835813300012189Vadose Zone SoilMLTVTKRAATLLKAAKLAEGATEHAGIRIRRGLTTSESGKVAVGFAISPGPEPSDEQIEQDGLRIFVENELVEVL
Ga0137383_1113441213300012199Vadose Zone SoilMLTVTKKAAALLKAAKVAQGAANEAGIRIRKDVMPDESDKAGIAVGLSISDRPGPNDAEFEQEGLRIFVEDALVEPLDGRTLDVLDADEM
Ga0137382_1090839913300012200Vadose Zone SoilMIDKSKEVHQMLTVTKKAAALLKAAKVAQGAADDAGIRIRKDVMPDESGDSGIAVGLAISDGPGPNDEEFEQEGLRIFVEDALVEPLDGRTLDVLDASDEDADDDGMELVWR*
Ga0137382_1099666123300012200Vadose Zone SoilSVMLTVTKKTAAFLKVAKAAEGATRGAGIRLRTDAIPDESGKPSVGFTFSDEPDPNDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANDGLQLVFR*
Ga0137363_1033353923300012202Vadose Zone SoilMVRVTKKAAALLKAAKVAQGAANEAGIRIRKDVMPDESDKAGIAVGLSISDRPGPNDAEFEQEGLRIFVEDALVEPLDGRTLDVLDANDGMELVWR*
Ga0137363_1080776513300012202Vadose Zone SoilMLTITKKAAAILKAAKAAEGAPDDAGIRIRKDAMTDDSGRLAVGLVITDDPSPDDEEFEQEGLRIFVEDALVEPLDGRTLDARDANEGPELIWR*
Ga0137363_1115663913300012202Vadose Zone SoilPEGRESRGKVAKAAEGATRGAGIRLRRDAIPDESGKPSVGFTFSNEPDPDDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANDGLQLVFR*
Ga0137399_1044105723300012203Vadose Zone SoilMIDKSKEVHQMLTVTKKAAALLKAAKVAQGAADDAGIRIRKDVMPDESGESGIAVGLAISDGPGPNDEEFEQEGLRIFVEDALVEPLDGRTLDVLDASDEDADDGMELVWR*
Ga0137399_1078613813300012203Vadose Zone SoilMLTVTKKAAAFLKVAKAAEGATRGVGIRLRRDSIPDESGRPSVGFTFSAEPAPDDWEFEQQGLRIFVEGVLVEPLDGR
Ga0137399_1086414923300012203Vadose Zone SoilMLTVTKRAAALLKAAKAAEGAADDAGIRIRRGVTANESNISVGFAISDEPDPDDEEFEQEGLRIFVEDVLVEPLDGRTLDVREAAEGTEFVFR*
Ga0137362_1010358933300012205Vadose Zone SoilMLTVTKKAAALLKAAKVAQGAANEAGIRIRKDVMPDESDKSGIAVGLSISDRPGPNDAEFEQEGLRIFVEDALVEPLDGRTLDVLDADEMELVWR*
Ga0137362_1071344213300012205Vadose Zone SoilMLTVTKKAAAFLKVAKAAEGARRGAGIRLRRDAIPDESGKPSVGFTISDEPDPDDWQFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0137362_1139590113300012205Vadose Zone SoilMLTVTKKAAALLRAEKAAEGAADDAGIRIRRSVMPNDSEIGIGLAITDEPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGLEFVFR*
Ga0137380_1067727023300012206Vadose Zone SoilMLTVTKEAAAFLKVAKAAEGARRGAGIRLRRGAILDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEDALVEPLDGRTLDVRDANDGLQLVFR*
Ga0137376_1020082123300012208Vadose Zone SoilMLTVTKQATALLKAAKAADDGARRGAGIRLRTDAIPDESGKPSVGFTFSDEPDPNDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANEGLQLVFR*
Ga0137378_1162291323300012210Vadose Zone SoilMLTVTKEAAAFLKVAKAAEGATRGAGIRLRRDSIPDESGKPSVGFTFSDEPAPDDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0137377_1125597523300012211Vadose Zone SoilMLTVTKKAAAFLKVAKAAEGATRGAGIRLRRDAIPDESGKPSVGFTFSNEPDPDDWEFEQEGLRIFVEGALVEPLDGCTLDVRDANDGLQLVFR*
Ga0137386_1115077813300012351Vadose Zone SoilMLTVTKKAAALLKAAKAAEGATRGAGIRLRRGAILDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0137360_1051775123300012361Vadose Zone SoilMLTVTKKAAALLKAAKVAQGAANEAGIRIRKDVMPDESDKAGIAVGLSISDRPGPNDAEFEQEGLRIFVEDALVEPLDGRTLDVLDADEMELVWR*
Ga0137360_1079163723300012361Vadose Zone SoilEGATRGVGIRLRRDSIPDESGKPSVGFTFSAEPAPDDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDVLQLVFR*
Ga0137360_1127983913300012361Vadose Zone SoilGARRGAGIRLRRGAIPDESGKPSVGFKISNEPDPDDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANDGLQLVFR*
Ga0137361_1023721623300012362Vadose Zone SoilMLTVTKKAAALLRAEKAAEGAADDAGIRIRRGVMPNDSEIGIGLAITDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGLEFVFR*
Ga0137361_1128209913300012362Vadose Zone SoilMLTVTKEAAAFLKVAKAAEGATRGAGIRLRRGAILDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQL
Ga0137390_1003187113300012363Vadose Zone SoilMLTVTKRAATLLKAAKLAEGAAEHAGIRIRRGLTTSEPGKLAVGFAISPGPEPSDEQIEQDGLRIFVQDELVEV
Ga0137390_1040146513300012363Vadose Zone SoilMLTVTERAAALLKAAKAAEGAPDDAGIRIRSGVTANESKISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTL
Ga0137390_1094570813300012363Vadose Zone SoilMLTITKKAAAILKAAKAAEGAPDDAGIRIRKDAMTDDSGRLAVGLVITEDPSPDDEEFEQEGLRIFVEDALVEPLDGRTLDARDANEGPELIWR*
Ga0137390_1117227823300012363Vadose Zone SoilMLTVTKKAAAFLKVAKAAEGATRGAGIRLRRDAMPDESGKPSVGFTFSNEPDPDDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANDGL
Ga0137390_1171199413300012363Vadose Zone SoilMLTVTKEAAAFLKVAKAAEGATRGAGIRLRRGAILDESGKPSVGFTFSNEPDPDDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANDGLQLVFR*
Ga0137390_1202928213300012363Vadose Zone SoilMLTVTKKAAALLKAAKVAQGAANEAGIRIRKDVMPDESDKSGIAVGLSISDRPGPNDAEFEQEGLRIFVEDALVEPLDGRTLDVLDADGMELVWR*
Ga0137358_1032027033300012582Vadose Zone SoilMLTVTERAAALLKAAKAAGGAADDAGIRIRRGVMANESKISVGFAISDEPDPDDEEFEQEGLRIFVEDVLVEPLDGRTLDVREAAEGTELVF
Ga0137358_1049054313300012582Vadose Zone SoilDFAPSMIDKLKEVHQMLTVTKKAAALLKAAKVAQGAADDAGIRIRKDVMPDESGKSGIAVGLAISDGPGPNDEEFEQEGLRIFVEDALVEPLDGRTLDVLDASDEDADEGMELVWR*
Ga0137358_1052111623300012582Vadose Zone SoilMLTVTKKAAAFLKVAKAAEGATRGVGIRLRRDSIPDESGRPSVGFTFSAEPAPDDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0137397_1005486523300012685Vadose Zone SoilMIDKSKEVHQMLTVTKKAAALLKAAKVAQGAADEAGIRIRKDVMPDESGNSGIAVGLAISDGPGPNDEEFEQEGLRIFVEDALVEPLDGRTLDVLDASDEDADDGMELVWR*
Ga0137397_1055487013300012685Vadose Zone SoilAIHARKRDFPPHQGTRTATELFGGRLLIEGGFTIMLTVTKKAAAFLKVAKAAEGATRGAGIRLRRGAILDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0137395_1077034013300012917Vadose Zone SoilMLTVTKKAAALLKAAKVAQGAANEAGIRIRKDVMPDESDKSGIAVGLSISDRPGPNDAEFEQEGLRIFVEDALVEPLDGRTLDGLDASDEDADDGMELVWR*
Ga0137394_1014529323300012922Vadose Zone SoilMIDKSKEVHQMLTVTKKAAALLKAAKVAQGAADDAGIRIRKDVMPDESGNSGIAVGLAISDGPGPNDEEFEQEGLRIFVEDALVEPLDGRTLDVLDASDEDADDGMELVWR*
Ga0137359_1001760683300012923Vadose Zone SoilAAFLKVAKAAEGATRGAGIRLRRDAIPDESGKPSVGFTFSNEPDPDDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANDGLQLVFR*
Ga0137359_1028039723300012923Vadose Zone SoilMLTVTKKAAAFLKVAKAAEGATRGVGIRLRRDSIPDESGRPSVRFTFSAEPAPDDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDVLQLVFR*
Ga0137359_1047204123300012923Vadose Zone SoilMLTVTKKAAAVLKAAKAAEGATNEAGIRIRKDAIVDDSGMLAAGIVITDEPEPEDEEFEQQGLRIFVEDALVEPLDGRTLDVRDANEGVELIWR*
Ga0137359_1047437123300012923Vadose Zone SoilMIDKLKEVHEMLTVTKKAAALLKAAKVAQGAANEAGIRIRKDVMPDESDKAGIAVGLSISDRPGPNDAEFEQEGLRIFVEDALVEPLDGRTLDVLDADGMELVWR*
Ga0137413_1069437713300012924Vadose Zone SoilMIDKLKEVHQMLTVTKKAAALLKAAKVAQGAADDAGIRIRKDVMPDESGKSGIAVGLAISDGPGPNDEEFEQEGLRIFVEDALVEPLDGRTLDVLDASDEDADDGMELVWR*
Ga0137413_1154483823300012924Vadose Zone SoilMLTVTERAAALLKAAKAAEGAADDAGIRIRRGVTPNESKISVGFAISDEPDPDDEEFEQEGLRIFVEDVLVEPLDGRTLDVREAA*
Ga0137419_1098159123300012925Vadose Zone SoilMLTVTKQAAAFLKVAKLAEGATRGAGIRLRRDALPDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDRLQLVFR*
Ga0137419_1111537323300012925Vadose Zone SoilMLTVAKKAAALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISDEPDPDDEEFEQVGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR*
Ga0137416_1070174623300012927Vadose Zone SoilMLTVTKEAAAFLKVAKAAEGATRGAGIRLRRGAILDESGKPSVGFTFSNEPDPEDWEFEQQGLCIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0137416_1137807013300012927Vadose Zone SoilAFGKSGCNVEACTFNDALKLKEVHEMLTVTKKAAALLKAAKAAHGLADNAGVRIRKDVMPNGSEIAVGIVINDDPDPEDKVFEQQGLRIFVEDALIEPLEGRILDVHEANEGPELVLR*
Ga0137404_1197622323300012929Vadose Zone SoilILIEGGFTIMLTVTKKAAAFLKVAKAAEGATRGAGIRLRTDAIPDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0137407_1144108413300012930Vadose Zone SoilSPFYRRSPLVEGGFTIMLTVTKEAAAFLKVAKAAEGATRGAGIRLRRGAILDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLDGRTLDVRDANDGLQLVFR*
Ga0137410_1064821723300012944Vadose Zone SoilMALEREPSHSEVRFLIEGGFTIMLTVTKKAAALLKAAKAAEGATRGAGIRLRSGAMPDDSGKLIVGLAISDEPDPDDEEFEQEGLRIFVEGALVEPLDGRTLDVRDADEGQLQLVFR*
Ga0137410_1158368123300012944Vadose Zone SoilMLTVTKKAAALLKAAKVAQGAADDAGIRIRKDVMPDEAGNSGIAVGLAISDGPGPNDEEFEQEGLRIFVEDALVEPLDSR
Ga0137418_1089913223300015241Vadose Zone SoilMLTVTKKAAALLKAAKAAHGLADNAGVRIRKDVMPNGSEIAVGIVINDDPDPEDKVFEQQGLRIFVEDALIEPLEGRILDVHEANEGPELVLR*
Ga0137412_1036102623300015242Vadose Zone SoilMLTVTKKAAALLKAAKAAHGLADNAGVRIRKDVMPNGSEIAVGIVINDDPDPEDKVFEQQGLRIFVEDALIEPLEGRILDVHEADEGPELVLR*
Ga0137412_1038785923300015242Vadose Zone SoilMLTVTERAAALLKAAKAAEGAPDDAGIRIRRGVMANESKISVGFAISDEPDPDDEEFEQEGLRIFVEDVLVEPLDGRTLDVREAA*
Ga0182036_1029215233300016270SoilVTKRAAALLKAAKAAEGGASDAGIRIRADKKAQVLEESGISIGFAIRDDPAPHDEELEQHGLRIFIEDVLVESLDGQILDVREAAEGTQLVFR
Ga0066667_1185095113300018433Grasslands SoilMLTVTKKAAGLLKAAKAAEGATDEAGIRIRRGVMPDEPGKVAIGFAISDVPDPDDEELEQDGLRIFVEDALVEPLDGRTLDVRDDGAGPELIFL
Ga0184642_172323923300019279Groundwater SedimentMLTVTKKAAALLKAAKAAEGASDAAIRNRRGVVADESRISVGFAISNEPDPDDEEFEQDGLRIFVEDTLVEPLDGFTLDVRIDDDGTEFVFR
Ga0197907_1134589013300020069Corn, Switchgrass And Miscanthus RhizosphereMLTVTKRASALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISNEPDPDDEEFEQHGLRIFVEDVLVETLDGRTLDVREAAEGTELVFL
Ga0206356_1050253413300020070Corn, Switchgrass And Miscanthus RhizosphereVTKEAVALLKAAKTAEGAADEAGIRIRRGVAANESKISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEHLDGRTLDVSEATEGTEFVFR
Ga0206350_1134452523300020080Corn, Switchgrass And Miscanthus RhizosphereMLTVTKRASALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISNEPDPDDEEFEQHGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR
Ga0210399_1043830523300020581SoilMLTVTKKAVELLKAAKTVEGAAEGAGIRIRRGVAANESKISVGIAISDEPDPDDEEFEQDGLRIFVEDVLVQPLDGRTLDVREAAEGTELVFR
Ga0210399_1066896623300020581SoilMLTVTRKAAAFLKVAKAAEGATRGAGIRLRRDALPDESGKPSVGFTISEEPDPDDWEFEQEGLRIFVEDKLVQPLDGRTLDVRDANEGLQLVFR
Ga0215015_1067273723300021046SoilMLTVTKRAAALLKAAKAAEGAADDAGIRILRGVMPNESGIAVGLAISDDPDPDDEEFEQEGLRIFVEEALVEPLDGRTLDVREADEGPEFVFR
Ga0179584_149185723300021151Vadose Zone SoilSLAGKRDFPSHQGTRTATELFGGRLLIEGGFTIMLTVTKNAAAFLKVAKAAEGATRGAGIRLRRDAIPDESGKPSVGFTFSNEPDPEDWEFEQQGLRIFVEGVLVEPLDGRTLDVCDANDGLQLVFR
Ga0210405_1013005923300021171SoilMLTVTERAAALLKAAKAAEGAADDAGIRIRRGVTANESEISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR
Ga0210408_1060614113300021178SoilMLTVTKKAAALLKAAKAAEGATGGAGIPLRRGAITNDSEKLTVGFTISDEPDPDDEEFEQEGLRIFVEEALVEPLDGRTLDVRDANEGLELVFR
Ga0210387_1185262123300021405SoilTKKAVELLKAAKTVEGAAEGAGIRIRRGVAANESKISVGIAISDEPDSDDEEFEQDGLRIFVEDVLVQPLDGRTLDVREAAEGTELVFR
Ga0210383_1124486913300021407SoilMLTVTRKAAAFLKVAKAAEGATRGAGIRLRRDALPDESGKPSVGFTISEEPDPDDWEFEQEGLRIFVEDKLVQPLDGRTLDVSDANEGLQLVFR
Ga0210384_1182484213300021432SoilMLTVTERAAALLKAAKAAEGAPDDAGIRIRRGVTANESKISVGFAISDEPDPDDEEFEQEGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR
Ga0126371_1094934223300021560Tropical Forest SoilMLTVTKNAAAFLKVAKTAEGETDDAGIRIGKMAEGPGESEISIGFVVRDEPAPDDEEFEQHGLRFFIEDVLVEPLDGHTLDVREAAEGMELVFR
Ga0126371_1280960813300021560Tropical Forest SoilMLTVTKEAAKLLKAAKAAEKAPENAGIRIRRWVESNGTGGVAVGFAIRDDPDPDDEELEQEGLRIFVQDALIEPLDGRILDVREANEGPELVFR
Ga0224712_1064383313300022467Corn, Switchgrass And Miscanthus RhizosphereMLTVTKRASALLKAAKVAEGAADDAGIRIRRGVTANESKISVGFAISNEPDPDDEEFEQHGLRIFVEDVLVETLDGRTLDVREAAEGTELVFR
Ga0242656_102315823300022525SoilAVLKAAKAAHGASPDAGIRILKGTVPNHPETLAVGFTITDDPRPDDEEFEQQGLRIFVEDALIEPLDGRTLDVRDANEGPELVWR
Ga0242664_104605513300022527SoilMVTVTKKAAAVLKAAKAAHGASPDAGIRILKGTVPNHPETLAVGFTITDDPRPDDEEFEEQGLRIFVEDALIEPLDGRTLDVRDANEGPELVWR
Ga0242668_111956023300022529SoilLKAAKAAHGASPDAGIRILKGTVPNHPETLAVGFTITDDPRPDDEEFEEQGLRIFVEDALIEPLDGRTLDVRDANEGPELVWR
Ga0212123_10006608223300022557Iron-Sulfur Acid SpringMLTVTERAAALLKAAKAAEGAADDAGIRIGRGVTANESEISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR
Ga0242661_116035413300022717SoilMLTLTKRAAAILKAAKAAEGAADDAGIRIRRAAMADESGISVGFAISEEPHSDDEEFEQQGLRIFVEDALVEPLDGRTLDVREADEGPELVWR
Ga0242665_1003696613300022724SoilHNMVTVTKKAAAVLKAAKAAHGASPDAGIRILKGTVPNHPETLAVGFTITDDPRPDDEEFEEQGLRIFVEDALIEPLDGRTLDVRDANEGPELVWR
Ga0224563_102867523300022731SoilMLTVTKKAAALLKAAKAAEGATRGAGIRLRRGAIPDDSGDVAVGLAICDEPDPNDEEFEQEGLRIFLEEDLVEPLEGRTLDVIDANEGLKLVFR
Ga0179589_1048241513300024288Vadose Zone SoilMLTVTKKAAAILKAAKAAQGAADDAGIRIRKGVMPDESDQDGIAVGIAISDRPAPSDQEFEQEGLRIFVEEELVEPLDGRTLDVLDANDEIELVWR
Ga0207695_1070772213300025913Corn RhizosphereMLTVTKRASALLKAAKAAEGAADDAGIRIRRGVTANESKISVGFAISNEPDPDDEEFEQHGLRIFVEDVLVERLDGRTLDVREAAEGTELVFR
Ga0207700_1113800023300025928Corn, Switchgrass And Miscanthus RhizosphereMLTVTERAAALLKAAKAAEGAPDDAGIRIRSGVTANESKISVGFAISDEPDPDDEEFEQEGLRIFIEDVLVEPLDGRTLDVREAAEGTELVFR
Ga0257168_102867713300026514SoilMLTVTKKAAAVLKAEIAAEGAADDAGIRILRGVMPNESGIAVAFAISDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPEFVFR
Ga0209648_1084723213300026551Grasslands SoilMLTVTERAAALLKAAKAAEGAADDAGIRIRRGVTANESNISVGFAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR
Ga0179587_1075605323300026557Vadose Zone SoilMLTVTKKAAAFLKVAKAAEGATRGAGIRLRRGATPNDFEKLTVGFTISDEPDPDDEESEQEGLRIFVEEALVEPLDGRTLDVRNANEGLELVFR
Ga0209220_115672013300027587Forest SoilMLTVTKKAAALLKAAKAAEGATGDAGIRLRRGAIPPNDSGNLIVGFTISDEPAPDDEEFEQEGLRIFVEEALVEPLDGRTLDVQDA
Ga0209117_106937613300027645Forest SoilMLTVTKKAAALLKAAKAAEGATGDAGIRLRRGAIPPNDSGNLIVGFTISDEPAPDDEEFEQEGLRIFVEEALVEPLDGRTLDVQDANEDEGLELVFR
Ga0209011_10449972	3300027678	Forest Soil	MLTVTKKAAALLKAAKAAEGATGDAGIRLRRGTIPPNDSGNLIVGFTISDEPAPDDEEFEQEGLRIFVEEALVEPLDGRTLDVQDANEDEGLELVFR
Ga0209581_100460511	3300027706	Surface Soil	MLTITRRAAAVLKAAKAAEGAADRAGIRLRAGAPLDDSGVSVGFAITDAPAPKDMELEQDGLRIFIEDVLVEPLDGRTLDVRDAADSMELIFR
Ga0209060_1000012337	3300027826	Surface Soil	MLTITRRAAAVLKAAKAAEGAADRAGIRLRAGAPLYDSGVSVGFAITDAPAPKDMELEQDGLRIFIEDVLVEPLDGRTLDVRDAADSMELIFR
Ga0209180_102926301	3300027846	Vadose Zone Soil	MLTVTKKAAALLKAEKAAEGAADDAGIRILRGAMPDEFRGAMPDESGIAVEFTIADDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPELVWR
Ga0209180_103563021	3300027846	Vadose Zone Soil	MLTVTKRAATLLKAAKLAEGATEHAGIRIRRGLTTSESGKVAVGFAISPGPEPSDEQIEQDGLRIFVQDELVEVLDGRTLDIHDTAEEVELVFR
Ga0209701_100562382	3300027862	Vadose Zone Soil	MLTVTKKAAALLKAEKAAEGAADDAGIRILRGAMPDESGIAVEFTITDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPELVWR
Ga0209283_100705561	3300027875	Vadose Zone Soil	HDMLTVTKKAAAVLKAEIAAEGAADDAGIRILRGVMPNESGIAVAFAISDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPEFVFR
Ga0209488_100284491	3300027903	Vadose Zone Soil	MLTVTKKAAAVLKAEIAAEGAADDAGIRILRGVMPNESRIAVAFAISDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPEFVFR
Ga0209488_106937822	3300027903	Vadose Zone Soil	MLVVTKKAAAFLKVAKAAEGATRGAGIRLRRDAVPDESGKPSVGFTFSNEPDPDDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANDGLQLVFR
Ga0209488_110719802	3300027903	Vadose Zone Soil	MLTVTKKAAALLKAAKAAHGLADNAGVRIRKDVMPNGSEIAVGIVINDDPDPEDKVFEQQGLRIFVEDALVEPLDGRTLDVREADEGPELVWR
Ga0209526_100943445	3300028047	Forest Soil	LKAAKAAEGAADDAGIRIRRGVTANESEISVGFAISDEPDPDDEEFEQEGLRIFIEDVLVEPLDGRTLDVREAAEGTELVFR
Ga0209526_107893282	3300028047	Forest Soil	AEGATGDAGIRLRRGAIPNDSGNLLVGFTISDEPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVCDTNEDEGLELVFR
Ga0137415_102647281	3300028536	Vadose Zone Soil	MLTVTKKAAAFLKVAKAAEGATRGAGIRLRRGAITNDSEKLTVGFTISDEPDPDDEESEQEGLRIFVEEALVEPLDGRTLDVRNANEGLELVFR
Ga0137415_112728281	3300028536	Vadose Zone Soil	MLTVTKKAAALLKAAKVAQGAANEAGIRIRKDVMPDESDKAGIAVGLSISDRPGPNDAEFEQEGLRIFVEDALVEPLDGRTLDVLDADGMELVWR
Ga0307482_10292431	3300030730	Hardwood Forest Soil	MLTVTKKAAALLKAEKAAEGAADDAGIRILRGAMPDEFRGAMPDESGIAVEFTITDDPDPDDEEFEQEGLRIFVEDALVEPLDGRTLDVREADEGPELVWR
Ga0170834_1051231501	3300031057	Forest Soil	MLTVTKKAAALLKAAKAANGAVEDAGIRIQKDGITNDSRIEVRVVITDDPDPDDEEFEQEGLRIFVEDALIEPLDGRTLDVRDANEGP
Ga0265760_101933041	3300031090	Soil	IMLTVTKKAAALLKAAKAAEGATRGAGIRLRRGAIPDDSGDVAVGLAICDEPDPNDEEFEQEGLRIFLEEDLVEPLEGRTLDVIDANEGLKLVFR
Ga0308197_101381251	3300031093	Soil	MLAVTKRAAALLKAVKAKEGVADDAGIRIRRGAMPTEPGKVAVGFAISDDPDPDDEEFEQEGLRIFVESALVEPLDGRTLDVRDANEGPELVFR
Ga0170824_1027800781	3300031231	Forest Soil	MLTVTKKAAALLKAAKAANGAVEDAGIRIQKDGITNDSRIEVRVVITDDPDPDDEEFEQEGLRIFVEDALIEPLDGRTLDVREADERPELVWR
Ga0170824_1264251722	3300031231	Forest Soil	MLTVTRKAAAFLKVAKAAEGATRGAGIRLRRDALPDESGKPSVGFTISEEPDPEDWEFEQEGLRIFVEGALVEPLDGRTLDVRDANDGLQLVFR
Ga0170820_130157552	3300031446	Forest Soil	MLTITERAAALLKAAKAAEGAPDDAGIRIRRGVTPNESKISVGFAISDEPDPDDEEFEQEGLRIFIEDVLVEPLDGRTLDVREAAEGTELVFR
Ga0310686_1095848732	3300031708	Soil	MITVTKKAAALLKAAKAAEGGTHGAGVRIRRGTWRKMAAVPGEPAISVGFAIRDEPAPDDEEFKQHGLRIFVEDALIEPLDGHTLDVREAAEGMELVFV
Ga0310686_1111343881	3300031708	Soil	AALLKAAKAAEGATRGAGIRLRRGAIPDDSGEVAVGLAICDEPDPNDEEFEQEGLRIFLEEDLVEPLEGRTLDVIDANEGLKLVFR
Ga0310686_1116973783	3300031708	Soil	MLTVTKKAVELLKAAKTVEGAAEDAGIRIRRGVAANESKISVGIAISDEPDPDDEEFEQDGLRIFVEDVLVEPLDGRTLDVREAAEGTELVFR
Ga0307476_101495621	3300031715	Hardwood Forest Soil	MLTVTKKAAALLKAAKAAEGATRGAGIRLRRGAIPDDSGNVAVGLAICDEPDPNDEEFEQEGLRIFLEEDLVEPLEGRTLDVIDANEGLKLVFR
Ga0307474_108079692	3300031718	Hardwood Forest Soil	MLTVTKKAAALLKAAKPAEGATRGSGIRLRRGAIPDDSGNVAVGLAICDEPDPNDEEFEQEGLRIFLEEDLVEPLEGRTLDVIDANEGLKLVFR
Ga0307478_100817132	3300031823	Hardwood Forest Soil	MLTVTKKAAALLKAAKAAEGAANEAGIRIRKEGMIENDGMLAVGLDIADEPEPDDEEFEQQGLRIFVEDALVEPLDGRTLDVSDANEGPELIWR
Ga0306919_111879351	3300031879	Soil	MLTVTKRAAALLKAAKAAEGGASNAGVRIRASNKAQLLEESGISIGFAIRDEPAPHDEELEQHGLRIFIEDVLVESLDGQ
Ga0306921_102498652	3300031912	Soil	MLTVTKRAAALLKAAKAAEGGASNAGVRIRASNKAQLLEESGISIGFAIRDEPAPHDEELEQHGLRIFIEDVLVESLDGQTLDVREAAEGTQLVFR
Ga0310912_113797741	3300031941	Soil	GYTIMLTVTKRAAALLKAAKAAEGGASNAGVRIRASNKAQLLEESGISIGFAIRDEPAPHDEELEQHGLRIFIEDVLVESLDGQILDVREAAEGTQLVFR
Ga0310911_106792902	3300032035	Soil	ALLKAAKAAEGGASDAGIRIRADKKAQVLEESGISIGFAIRDDPAPHDEELEQHGLRIFIEDVLVESLDGQILDVREAAEGTQLVFR
Ga0307471_1021727261	3300032180	Hardwood Forest Soil	GAGIRLRRDALPDESGKPSVGFTISEEPDPDDWEFEQEGLRMFVEGALVEPLDGRTLDVRDANDGLQLVFR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice