NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F046938

Metagenome Family F046938

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F046938
Family Type Metagenome
Number of Sequences 150
Average Sequence Length 90 residues
Representative Sequence DMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKA
Number of Associated Samples 95
Number of Associated Scaffolds 150

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 4.83 %
% of genes near scaffold ends (potentially truncated) 71.33 %
% of genes from short scaffolds (< 2000 bps) 76.00 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (42.667 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(29.333 % of family members)
Environment Ontology (ENVO) Unclassified
(64.667 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(68.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 48.67%    β-sheet: 0.00%    Coil/Unstructured: 51.33%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 150 Family Scaffolds
PF00296Bac_luciferase 12.67
PF132794HBT_2 4.00
PF00293NUDIX 3.33
PF07859Abhydrolase_3 2.67
PF00903Glyoxalase 2.67
PF08818DUF1801 2.67
PF08445FR47 2.00
PF12773DZR 2.00
PF01841Transglut_core 2.00
PF01909NTP_transf_2 1.33
PF00326Peptidase_S9 1.33
PF03950tRNA-synt_1c_C 0.67
PF14102Caps_synth_CapC 0.67
PF06224HTH_42 0.67
PF12695Abhydrolase_5 0.67
PF02945Endonuclease_7 0.67
PF01494FAD_binding_3 0.67
PF00324AA_permease 0.67
PF00202Aminotran_3 0.67
PF09828Chrome_Resist 0.67
PF01428zf-AN1 0.67
PF07366SnoaL 0.67
PF08245Mur_ligase_M 0.67

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 150 Family Scaffolds
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 12.67
COG0657Acetyl esterase/lipaseLipid transport and metabolism [I] 2.67
COG4430Uncharacterized conserved protein YdeI, YjbR/CyaY-like superfamily, DUF1801 familyFunction unknown [S] 2.67
COG5646Iron-binding protein Fra/YdhG, frataxin family (Fe-S cluster biosynthesis)Posttranslational modification, protein turnover, chaperones [O] 2.67
COG5649Uncharacterized conserved protein, DUF1801 domainFunction unknown [S] 2.67
COG06542-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductasesEnergy production and conversion [C] 1.33
COG0008Glutamyl- or glutaminyl-tRNA synthetaseTranslation, ribosomal structure and biogenesis [J] 0.67
COG0531Serine transporter YbeC, amino acid:H+ symporter familyAmino acid transport and metabolism [E] 0.67
COG0578Glycerol-3-phosphate dehydrogenaseEnergy production and conversion [C] 0.67
COG0644Dehydrogenase (flavoprotein)Energy production and conversion [C] 0.67
COG0665Glycine/D-amino acid oxidase (deaminating)Amino acid transport and metabolism [E] 0.67
COG0833Amino acid permeaseAmino acid transport and metabolism [E] 0.67
COG1113L-asparagine transporter or related permeaseAmino acid transport and metabolism [E] 0.67
COG1115Na+/alanine symporterAmino acid transport and metabolism [E] 0.67
COG3214DNA glycosylase YcaQ, repair of DNA interstrand crosslinksReplication, recombination and repair [L] 0.67
COG3582CDC48-associated ubiquitin-like protein CUZ1, contains AN1-type Zn-finger (protection from As/Sb toxicity)Defense mechanisms [V] 0.67


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms60.00 %
UnclassifiedrootN/A40.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002557|JGI25381J37097_1011233All Organisms → cellular organisms → Archaea1581Open in IMG/M
3300002558|JGI25385J37094_10164410Not Available594Open in IMG/M
3300002558|JGI25385J37094_10191285All Organisms → cellular organisms → Archaea548Open in IMG/M
3300002561|JGI25384J37096_10075065All Organisms → cellular organisms → Archaea1233Open in IMG/M
3300002561|JGI25384J37096_10112256All Organisms → cellular organisms → Archaea929Open in IMG/M
3300002562|JGI25382J37095_10002358Not Available6116Open in IMG/M
3300002562|JGI25382J37095_10123233Not Available881Open in IMG/M
3300002562|JGI25382J37095_10168497Not Available688Open in IMG/M
3300002562|JGI25382J37095_10239503Not Available548Open in IMG/M
3300002908|JGI25382J43887_10032064All Organisms → cellular organisms → Bacteria2840Open in IMG/M
3300002908|JGI25382J43887_10203086Not Available952Open in IMG/M
3300002908|JGI25382J43887_10285240Not Available735Open in IMG/M
3300002908|JGI25382J43887_10331628All Organisms → cellular organisms → Archaea → TACK group651Open in IMG/M
3300002908|JGI25382J43887_10457165Not Available541Open in IMG/M
3300002909|JGI25388J43891_1023557All Organisms → cellular organisms → Bacteria1057Open in IMG/M
3300002912|JGI25386J43895_10069063All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia969Open in IMG/M
3300002912|JGI25386J43895_10096109Not Available769Open in IMG/M
3300002912|JGI25386J43895_10156440Not Available570Open in IMG/M
3300002912|JGI25386J43895_10179447Not Available532Open in IMG/M
3300002914|JGI25617J43924_10075030All Organisms → cellular organisms → Archaea1234Open in IMG/M
3300005166|Ga0066674_10083026All Organisms → cellular organisms → Bacteria1476Open in IMG/M
3300005167|Ga0066672_10233765Not Available1179Open in IMG/M
3300005176|Ga0066679_10019353All Organisms → cellular organisms → Bacteria3578Open in IMG/M
3300005176|Ga0066679_10333837All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon990Open in IMG/M
3300005178|Ga0066688_10055391All Organisms → cellular organisms → Archaea2308Open in IMG/M
3300005178|Ga0066688_10703983All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. CNH099641Open in IMG/M
3300005178|Ga0066688_10781711All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon 13_2_20CM_2_52_21599Open in IMG/M
3300005181|Ga0066678_10040667All Organisms → cellular organisms → Bacteria2598Open in IMG/M
3300005186|Ga0066676_10718465Not Available680Open in IMG/M
3300005447|Ga0066689_10037131All Organisms → cellular organisms → Bacteria2533Open in IMG/M
3300005552|Ga0066701_10571808All Organisms → cellular organisms → Archaea692Open in IMG/M
3300005552|Ga0066701_10786341Not Available567Open in IMG/M
3300005554|Ga0066661_10229735All Organisms → cellular organisms → Bacteria1150Open in IMG/M
3300005556|Ga0066707_10217366All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales1238Open in IMG/M
3300005557|Ga0066704_10586666All Organisms → cellular organisms → Archaea719Open in IMG/M
3300005557|Ga0066704_10775975Not Available597Open in IMG/M
3300005557|Ga0066704_10808877All Organisms → cellular organisms → Archaea582Open in IMG/M
3300005557|Ga0066704_10917115All Organisms → cellular organisms → Bacteria541Open in IMG/M
3300005558|Ga0066698_10247389Not Available1230Open in IMG/M
3300005558|Ga0066698_10257562All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1205Open in IMG/M
3300005559|Ga0066700_10577050All Organisms → cellular organisms → Bacteria784Open in IMG/M
3300005568|Ga0066703_10083341All Organisms → cellular organisms → Archaea1853Open in IMG/M
3300005568|Ga0066703_10284210All Organisms → cellular organisms → Archaea1002Open in IMG/M
3300005568|Ga0066703_10806757All Organisms → cellular organisms → Archaea536Open in IMG/M
3300005598|Ga0066706_11017647Not Available638Open in IMG/M
3300006794|Ga0066658_10274775All Organisms → cellular organisms → Bacteria903Open in IMG/M
3300006796|Ga0066665_10111620All Organisms → cellular organisms → Archaea → TACK group2022Open in IMG/M
3300006797|Ga0066659_10037952All Organisms → cellular organisms → Bacteria2946Open in IMG/M
3300006797|Ga0066659_10837480All Organisms → cellular organisms → Bacteria764Open in IMG/M
3300006804|Ga0079221_11807899All Organisms → cellular organisms → Archaea500Open in IMG/M
3300006806|Ga0079220_10273788All Organisms → cellular organisms → Archaea1026Open in IMG/M
3300007255|Ga0099791_10372234Not Available686Open in IMG/M
3300007258|Ga0099793_10003636All Organisms → cellular organisms → Archaea → Euryarchaeota5381Open in IMG/M
3300007265|Ga0099794_10665352All Organisms → cellular organisms → Archaea553Open in IMG/M
3300009012|Ga0066710_101210916All Organisms → cellular organisms → Archaea → TACK group1170Open in IMG/M
3300009012|Ga0066710_102565058All Organisms → cellular organisms → Archaea → TACK group734Open in IMG/M
3300009012|Ga0066710_103194360Not Available630Open in IMG/M
3300009012|Ga0066710_104164203Not Available540Open in IMG/M
3300009038|Ga0099829_10188345All Organisms → cellular organisms → Archaea1664Open in IMG/M
3300009038|Ga0099829_10191673All Organisms → cellular organisms → Archaea1650Open in IMG/M
3300009088|Ga0099830_10199781Not Available1564Open in IMG/M
3300009162|Ga0075423_12869835Not Available528Open in IMG/M
3300010304|Ga0134088_10020761All Organisms → cellular organisms → Archaea2907Open in IMG/M
3300010304|Ga0134088_10234362Not Available882Open in IMG/M
3300010304|Ga0134088_10366932All Organisms → cellular organisms → Archaea700Open in IMG/M
3300010329|Ga0134111_10229663All Organisms → cellular organisms → Archaea757Open in IMG/M
3300010336|Ga0134071_10080799All Organisms → cellular organisms → Bacteria1522Open in IMG/M
3300011269|Ga0137392_10256033All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptosporangiales → Treboniaceae → Trebonia → Trebonia kvetii1441Open in IMG/M
3300011270|Ga0137391_10473157All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1063Open in IMG/M
3300011270|Ga0137391_11091446Not Available646Open in IMG/M
3300012189|Ga0137388_10731190Not Available919Open in IMG/M
3300012189|Ga0137388_10950186Not Available794Open in IMG/M
3300012201|Ga0137365_10707157Not Available736Open in IMG/M
3300012202|Ga0137363_11078106All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon683Open in IMG/M
3300012202|Ga0137363_11649691Not Available533Open in IMG/M
3300012203|Ga0137399_10442245All Organisms → cellular organisms → Bacteria1087Open in IMG/M
3300012206|Ga0137380_10035201All Organisms → cellular organisms → Bacteria4620Open in IMG/M
3300012206|Ga0137380_10288901All Organisms → cellular organisms → Archaea1471Open in IMG/M
3300012206|Ga0137380_10845729Not Available788Open in IMG/M
3300012206|Ga0137380_11590211Not Available538Open in IMG/M
3300012207|Ga0137381_10135354Not Available2112Open in IMG/M
3300012207|Ga0137381_10273796All Organisms → cellular organisms → Bacteria1468Open in IMG/M
3300012209|Ga0137379_10183138All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria2007Open in IMG/M
3300012210|Ga0137378_10141698Not Available2226Open in IMG/M
3300012210|Ga0137378_10199145Not Available1863Open in IMG/M
3300012210|Ga0137378_11504501Not Available584Open in IMG/M
3300012351|Ga0137386_10408686Not Available978Open in IMG/M
3300012357|Ga0137384_10099288Not Available2429Open in IMG/M
3300012358|Ga0137368_10182525All Organisms → cellular organisms → Archaea1512Open in IMG/M
3300012359|Ga0137385_10436717Not Available1113Open in IMG/M
3300012359|Ga0137385_10899054Not Available733Open in IMG/M
3300012359|Ga0137385_11433474Not Available554Open in IMG/M
3300012361|Ga0137360_11055332All Organisms → cellular organisms → Archaea → TACK group701Open in IMG/M
3300012918|Ga0137396_10776620Not Available705Open in IMG/M
3300012927|Ga0137416_10127070Not Available1961Open in IMG/M
3300012944|Ga0137410_10146627Not Available1794Open in IMG/M
3300012944|Ga0137410_10586524All Organisms → cellular organisms → Archaea920Open in IMG/M
3300012972|Ga0134077_10052084Not Available1513Open in IMG/M
3300012972|Ga0134077_10074232All Organisms → cellular organisms → Archaea → TACK group1287Open in IMG/M
3300012972|Ga0134077_10240832All Organisms → cellular organisms → Archaea746Open in IMG/M
3300012972|Ga0134077_10402335All Organisms → cellular organisms → Archaea592Open in IMG/M
3300012976|Ga0134076_10015917All Organisms → cellular organisms → Archaea2619Open in IMG/M
3300012976|Ga0134076_10146663All Organisms → cellular organisms → Archaea964Open in IMG/M
3300012976|Ga0134076_10192014All Organisms → cellular organisms → Archaea853Open in IMG/M
3300012977|Ga0134087_10517383All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon 13_2_20CM_2_52_21603Open in IMG/M
3300014154|Ga0134075_10018202All Organisms → cellular organisms → Archaea2777Open in IMG/M
3300015241|Ga0137418_11326219Not Available501Open in IMG/M
3300015245|Ga0137409_10137870All Organisms → cellular organisms → Archaea2240Open in IMG/M
3300015358|Ga0134089_10034710All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1791Open in IMG/M
3300017654|Ga0134069_1119846All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon 13_2_20CM_2_52_21866Open in IMG/M
3300017656|Ga0134112_10333469All Organisms → cellular organisms → Archaea615Open in IMG/M
3300017659|Ga0134083_10161506All Organisms → cellular organisms → Archaea → TACK group911Open in IMG/M
3300017934|Ga0187803_10400637Not Available556Open in IMG/M
3300018433|Ga0066667_10044909All Organisms → cellular organisms → Bacteria2619Open in IMG/M
3300018433|Ga0066667_10089529All Organisms → cellular organisms → Archaea → TACK group2012Open in IMG/M
3300018468|Ga0066662_10172072All Organisms → cellular organisms → Archaea1671Open in IMG/M
3300021088|Ga0210404_10155598All Organisms → cellular organisms → Archaea1201Open in IMG/M
3300026296|Ga0209235_1003242All Organisms → cellular organisms → Bacteria9073Open in IMG/M
3300026297|Ga0209237_1022698All Organisms → cellular organisms → Archaea → TACK group3550Open in IMG/M
3300026298|Ga0209236_1008461All Organisms → cellular organisms → Bacteria6137Open in IMG/M
3300026298|Ga0209236_1260743Not Available567Open in IMG/M
3300026309|Ga0209055_1006176All Organisms → cellular organisms → Archaea6804Open in IMG/M
3300026313|Ga0209761_1001911All Organisms → cellular organisms → Archaea13580Open in IMG/M
3300026313|Ga0209761_1006737All Organisms → cellular organisms → Archaea → TACK group7557Open in IMG/M
3300026326|Ga0209801_1074282All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1500Open in IMG/M
3300026326|Ga0209801_1099881All Organisms → cellular organisms → Archaea1256Open in IMG/M
3300026332|Ga0209803_1180353All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon796Open in IMG/M
3300026333|Ga0209158_1220632All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon655Open in IMG/M
3300026480|Ga0257177_1080215Not Available529Open in IMG/M
3300026499|Ga0257181_1086142Not Available548Open in IMG/M
3300026528|Ga0209378_1091436Not Available1357Open in IMG/M
3300026529|Ga0209806_1012652All Organisms → cellular organisms → Bacteria4500Open in IMG/M
3300026532|Ga0209160_1006255Not Available9242Open in IMG/M
3300026532|Ga0209160_1242863All Organisms → cellular organisms → Archaea616Open in IMG/M
3300026536|Ga0209058_1170843Not Available977Open in IMG/M
3300026538|Ga0209056_10009629Not Available9850Open in IMG/M
3300026552|Ga0209577_10126601All Organisms → cellular organisms → Archaea2038Open in IMG/M
3300027748|Ga0209689_1073865Not Available1825Open in IMG/M
3300027748|Ga0209689_1194855All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon897Open in IMG/M
3300027846|Ga0209180_10012620All Organisms → cellular organisms → Archaea4390Open in IMG/M
3300027846|Ga0209180_10283126All Organisms → cellular organisms → Archaea951Open in IMG/M
3300027846|Ga0209180_10341845Not Available854Open in IMG/M
3300031720|Ga0307469_10909667Not Available815Open in IMG/M
3300031753|Ga0307477_10809452All Organisms → cellular organisms → Archaea622Open in IMG/M
3300031962|Ga0307479_10547390All Organisms → cellular organisms → Archaea1140Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil29.33%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil28.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil23.33%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil12.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.00%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.33%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.67%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.67%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.67%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cmEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002909Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017934Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_3EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25381J37097_101123313300002557Grasslands SoilTFEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLGEGKATEPRALLGGTPRQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25385J37094_1016441023300002558Grasslands SoilGRSTTIEKLSQTITNFRDMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKA*
JGI25385J37094_1019128523300002558Grasslands SoilGRNPEEMKWILRVHNPLEGEKATEPRALLGGTPQQAVEDLPGLKELGIDHVFYDMNHPAQVPIETQLALLRRLVRLIKP*
JGI25384J37096_1007506513300002561Grasslands SoilRDMVRRAGRNPEEMKRILRVHNPLSKEKATEPRTLLGGTPQQASEDLPGLKELGIDHVFCDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25384J37096_1011225623300002561Grasslands SoilNNFRDMVGRAGRKPEEMKWILRVHNPLVEEKATEPPALLGGTPQQAAKDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
JGI25382J37095_1000235813300002562Grasslands SoilVHNVLDEEKAAEPRALLGGTPQQAAKDLPGLKDLGIDHVFYDMNHPAQVPIDNQLLLLRRLMRLIKN*
JGI25382J37095_1012323313300002562Grasslands SoilGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLGEEKATEPRALLGGTPQQAAKDLPRLKELGIDHVFYDMNHPAQVPXDTQLVLLRXLVRLIKA*
JGI25382J37095_1016849713300002562Grasslands SoilFRDMVRRAGRSPEEMKWILRVHDPLDEEKASEPRALLGGTPQQAAKDLPRLKDLGIDHIFYDMNHPAHVPIETQLVLLRRLVRLINAS*
JGI25382J37095_1023950323300002562Grasslands SoilDMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKA*
JGI25382J43887_1003206413300002908Grasslands SoilINNFRDMVRRAGRNPEEMKWILRVHNPLGEGKATEPRALLGGTPRQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25382J43887_1020308613300002908Grasslands SoilTNFRDMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKA*
JGI25382J43887_1028524023300002908Grasslands SoilAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLGEEKATEPRALLGGTPQQAAKDLPRLKELGIDHVFYDMNHPAQVPVDTQLVLLRKLVRLIKA*
JGI25382J43887_1033162813300002908Grasslands SoilQTINNFRDMVRRAGRDPEEMKWILRVHNPLEKGKGTEPRALLGGTPQQAATDLPRLKELGIDHIFYDMNHPAHVPIETQLVLLRRLVRLIKA*
JGI25382J43887_1045716523300002908Grasslands SoilLRVHNPLGEEKATEPRALLGGTPRQAAEDLPRLKELGIEHVFYDMNHPAHVPIDTQLVLLRRLVRLIKA*
JGI25388J43891_102355733300002909Grasslands SoilRDMVRRAGRNPEEIRWILRVHNPLSEEKGTEPRALLGGTSQQAAEDLPRLEELGIDDVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25386J43895_1006906323300002912Grasslands SoilEMKWILRVHNPLDEEKATEPRALLGGTPQQAAEDLPRLNELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25386J43895_1009610923300002912Grasslands SoilAARIADGIMPAAGRSTTIEKLSQTINNFGDMVRRAGRNPEEMKWILRVHNPLDEEKATEPRALLGGTPQQAAKDLPKLKELGIDHVFYDMNHPAQVPIDTQLVLLRKLVRLIKA*
JGI25386J43895_1015644023300002912Grasslands SoilAAGRGTTIEKLGQTINNFRDMVRRAGRHPEEMKWILRVHNPLSEEKAXEPRALXGGXPQXAAXDXPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
JGI25386J43895_1017944723300002912Grasslands SoilARIADGIMPAAGRSTTIEKLSXTINNFRDMVRRAGRSPEEMKWILRVHNPLDEEKASEPRALLGGTPQQAAEDLPRLKDLGIDHIFYDMNHPAHVPIETQLVLLRRLVRLINAS*
JGI25617J43924_1007503013300002914Grasslands SoilERQARIADGIMPAAGRSTTIEKLSQTINNFHDVVRRAGRNPREIRWILRVHNSLEKKTTEPRPLLGSTPQQAAKDLPRLKDIGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKRPRNALVGSDLFLVPAAKSVWLVSLGF*
Ga0066674_1008302633300005166SoilWILRVHNPLDKEKATEPRPLLGGTPQQAAEELPRLEELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0066672_1023376513300005167SoilMPAAAGSTTIEKLSQTIKDFHEKVRRAGRNPEEMKWILRVHNPLSGEKATEPRALLGGTPQQAAEDLPRLKELGIDHIFYDMNHPAQVPK*
Ga0066679_1001935313300005176SoilWILRVHNPLSEERATEPRALLGGTPQQAEEDLPRLKELGIDHVFYDMNHPARVPIDTQLVLLRRLMRMIKA*
Ga0066679_1033383723300005176SoilQTIKDFREKVRRAGRNPEEMKWILRVHNPLSGEKATEPRALLGGTPQQAAEDLPRLKELGIDHIFYDMNHPAQVPK*
Ga0066688_1005539153300005178SoilIADGIMPAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIRWILRVHNPLSEERATEPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLVLLRRLMRLIKA*
Ga0066688_1070398313300005178SoilIADGIMPAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIMWILRVHNPLSEERATEPRALLGGTPQQAEEDLPRLKELGIDHVFYDMNHPARVPIDTQLVLLRRLMRMIKA*
Ga0066688_1078171123300005178SoilRNPEEMKWILRVHNPLTEEKATEPRALLGGTPQQAAEDLPRLRELGIDHVFYHMNHPTQVPIDTQLVLLRRLLRMIKA*
Ga0066678_1004066713300005181SoilRIADGIMPAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIRWILRVHNPLSEERATEPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLALLRRLMRLIKA*
Ga0066676_1071846513300005186SoilMVRRVSRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTQLVLLRRLV
Ga0066686_1024578323300005446SoilLEEKKAAESRALLGGTPEQAAKDLPRLKELGVDHVFYDMNHPAHVPIDTQLVLLRRLVELIKD*
Ga0066689_1003713113300005447SoilTINNFREMVRRAGRNPDEMIWILRVHNPLDEEKATEPRALLGGTPQQAAEDLTRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0066701_1057180823300005552SoilEEMKWILRVHNPLAEEKATEPPALLGGTPEQAAKDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0066701_1078634113300005552SoilTINNFRDMVRRAGRHPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRKFVRLTKA*
Ga0066661_1022973513300005554SoilAARIADGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRSPDEMKWILRVHNPLDEEKASEPRALLGGAPQQAATDLPRLRELGIDHVFYDMNHPAHVPVETQLVLLRRLVRLIKASGA*
Ga0066707_1021736623300005556SoilMKWILRVHNPLTEEKAAEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0066704_1058666623300005557SoilKLSQTINSFQDMVRRAGRKPEEMKWILRVHNPLYEEKASEPRALLGGTPQQVAKDFPRVKELGIDHVFYDMNHPAHVPIDSQLVLLRRLVRLIKNN*
Ga0066704_1077597513300005557SoilADGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLEEKATEPRALLGGTPQQAAEDLPRVRELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLTKA*
Ga0066704_1080887713300005557SoilLSQTINSFQDMVRRAGRKPEEMKWFLRVHNPLYEEKASEPRALLGGTPQQAAGDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0066704_1091711513300005557SoilDGIMPAAGRSATIEKLSQTINTFQDMVRRAGRKPEEMKWILRVHNPLAEEKATEPHALLGSTPEQAAKDFLRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0066698_1024738913300005558SoilDMVRRAGRNSKEMKWILRVHNPMYEEKAAEPRALLGGTPQQAAEDLSRVRELGIGHVFYDMNHPAHVPIDTQLVLLRRLVRLIKP*
Ga0066698_1025756213300005558SoilMVRRVSRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTQLVLLRRLVRLIKA*
Ga0066700_1057705013300005559SoilDMVRRAGRKPEEMKWILRVHNPLYEEKASEPRALLGGTPQQVAKDFPRVKELGIDHVFYDMNHPAHVPIDSQLVLLRRLVRLIKNN*
Ga0066703_1008334113300005568SoilRIADGIMPAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIRWILRVHNPLSEERATEPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLVLLRRLMRLIKA*
Ga0066703_1028421013300005568SoilLERAARIADGIMPTAGRNTTIEKLSQTINNFRDIVRRAGHNPEGIKWILRVHNPLEEEKATETRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQVPIDTQLALLRRLVRLIKA*
Ga0066703_1080675713300005568SoilGRKPEEMKWILRVHNPLAEEKATEPHALLGSTPEQAAKDFLRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0066706_1101764713300005598SoilSLERAARIADGIMPAAAGSTMIEKLSQTINNFRDMVRRAGRHPEEMKWILRVHNPLDKEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAKVPIDTQLLLLRRLVRLIKA*
Ga0066658_1027477513300006794SoilTIEKLSQTINSFRDMVRRAGRNPEEIMWILRVHNPLSEERATEPRALLGGTPQQAEEDLPRLKELGIDHVFYDMNHPARVPIDTQLVLLRRLMRMIKA*
Ga0066665_1011162013300006796SoilNNFRDMVRRAGRDPEEMKWILRVHNPLEKGKGTEPRALLGGTPQQAATDLPRLKELGIDHIFYDMNHPAHVPIETQLVLLRRLVRLIKA*
Ga0066659_1003795243300006797SoilMKWILRVHNPLTEEKATEPRALLGGTPQQAAEDLPRLRELGIDHVFYHMNHPTQVPIDTQLVLLRRLLRMIKA*
Ga0066659_1083748013300006797SoilARLADGIMPAAARSATIEKLSQTINNFHDMARSAGRKPEEMKWILRVHNPLFEEKATEPGALLGGTPQQAAKDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0079221_1180789913300006804Agricultural SoilNPDELKWILRAHNTLNEEKASEPRPLLGGTPQQAVNDLPRLKELGIDHVFYDMNHPAQVPMETQLALLRRLVKLIKA*
Ga0079220_1027378813300006806Agricultural SoilMVRKAGRNPNEIKWILRVHNPLDEWKTSEPRGLLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAHVPIDTQLVLLRKLLKLAKS*
Ga0099791_1037223423300007255Vadose Zone SoilRWILRVHNPLTEEKAAEPRPLLGGTPQQAAKDLPRIKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVQLIKN*
Ga0099793_1000363673300007258Vadose Zone SoilMVRRAGRNPEELKWILRVHNTLDEEKATEPRPLLGGTPQQAAQDLPRLKELGIGHVFYDMNHPAQVPIDTQLVLLRRLMRLIKA*
Ga0099794_1066535223300007265Vadose Zone SoilEVDTESPRPLSEEKAAEPRALLGGTPQQAAQDLPRLKELSVDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKA*
Ga0066710_10121091613300009012Grasslands SoilRIADGIMPAAGRSTTIEKLNQTINNFRDMVRRAGRDPEEMKWILRVHNPLEKGKGTEPRALLGGTPQQAATDLPRLKELGIDHIFYDMNHPAHVPIKTQLVLLRRLVRLIKA
Ga0066710_10256505813300009012Grasslands SoilINNFRDMVRKANRNPDEMKWILRVHNVLEEKKAAESRALLGGTPEQAAKDLPRLKELGIDHVFYDMNHPAQVPIGTPLVLLRRLVELIKD
Ga0066710_10319436013300009012Grasslands SoilMVRRVSRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTPLVLLRRLVRLIKA
Ga0066710_10416420313300009012Grasslands SoilEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLGEGKATEPRALLGGTPRQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0099829_1018834513300009038Vadose Zone SoilARVADGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLAEEKVTEPPALLGGTPQQAAKDFARVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0099829_1019167313300009038Vadose Zone SoilAGRSTTIEKFSQTISNFRDMVRRAGRNPEELKWILRVHNPLDEEKAAGPRALLGGTPQQAAQDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKK*
Ga0099830_1019978113300009088Vadose Zone SoilAGRTTTIEKLSQTINNFRDMVRRAGRSPDEMKWILRVHNPLDEKKATEPRPLLGGTPQQAADLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLLRLIKA*
Ga0075423_1286983523300009162Populus RhizosphereEEIMRVHDILTGEKAAEPRALLGGTLQQAAEDLPRLKDLGIDPIFYNMNHPAQVPIDTQLSLLTKLIRLIKK*
Ga0134088_1002076113300010304Grasslands SoilEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTQLVLLRRLVRLIKA*
Ga0134088_1023436213300010304Grasslands SoilDMVQKTDRNPDEMKWILRVHNVPDEEKAAEHRALLGGTPEQAAEDLPRLKELGIDHVFHDMNHPAHVPIDTQLVLLRRLVRLINEWEYAA*
Ga0134088_1036693213300010304Grasslands SoilILRVHNPLDKEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAKVPIDTQLLLLRRLVRLIKA*
Ga0134111_1022966313300010329Grasslands SoilNNFRDMVRRAGRHLEEMKWILRVHNPLDKEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAKVPIDTQLLLLRRLVRLIKA*
Ga0134071_1008079933300010336Grasslands SoilMVRRVSRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0137392_1025603333300011269Vadose Zone SoilKLSQTINNFRDMVRKTDRNPDEMKWILRVHNVLDEEKAAETRALLGGTPEQASKDLPILKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKE*
Ga0137391_1047315713300011270Vadose Zone SoilKWILRVHNPLSEEKAAEPRALLGGTPQQAAEDLPRVRELGIDHVFYDMNHPAQVPIDTQLVLLGRLVQLIKN*
Ga0137391_1089009013300011270Vadose Zone SoilLDEEKAAGPRALLGGTPQQAAQDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKK*
Ga0137391_1109144623300011270Vadose Zone SoilMKWILRVHNPLEEEKASEPRALLGGTPQQASKDLPRLKELGINHVFYDMNHPAHVPIDTQLVLLRKLVRLIKH*
Ga0137388_1073119023300012189Vadose Zone SoilMPAAGRSTTIEKLSQTIKDFCDMVRRAGRNPEEMKWILRVHNPLDEKKATEPRPLLGGTPQQAADLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLLRLIKA*
Ga0137388_1095018623300012189Vadose Zone SoilMKWILRVHNVLDEEKAAEPRALLGGTPQQAAKDLPGLKDLGIDHVFYDMNHPAQVPIDTQLLLLRRLMRLIKN*
Ga0137365_1070715713300012201Vadose Zone SoilGMLSQTINNFHDMVRRAGRNPEEMKWILRVHNPLTEEKATEPRTLLGGTPEQAAEDLPRLKELGIDHVFYDMNHPAHVPINTQLVLLRKLMQIINA*
Ga0137363_1107810613300012202Vadose Zone SoilHNPLYEEKAAEPRALLGGTPQQAARDLPKLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLTKT*
Ga0137363_1164969113300012202Vadose Zone SoilERAARIGDGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLEEEKASEPRALLGGTPQQASEDLPRLKELGINHVFYDMNHPAHVPIDTQLVLLRRLVRLIKA*
Ga0137399_1044224523300012203Vadose Zone SoilLSQTINNFRDMVRRAGRDPDELKWILRVHNPLEEEKASEPRALLGGTPQQAAKDLPRLRELGKDHVFYDMNHPAHVPMETQLVLLRRLVRLIKASGS*
Ga0137380_1003520113300012206Vadose Zone SoilMVRKSGRNPDEMKWILRVHNVLDEEKAGDPRPLLGGTPEQAAKDLPRLKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLVKD*
Ga0137380_1028890123300012206Vadose Zone SoilMVRRAGRNPEEMKWILRVHNPLEEEKASEPRALLGGTPQQASEDLPRLKELGINHVFYDMNHPAHVPIDTQLVLLRRLVRLIRH*
Ga0137380_1084572913300012206Vadose Zone SoilMVRKADRNPDEMKWILRVHNVLDEEKAADPRALLGGTPEQAAKDFPRLKELGINHVFYDMNHPANVPIDTQLALLRRLVRLIKE*
Ga0137380_1159021123300012206Vadose Zone SoilDGIMPAGGRSTTIEKLSQTIKDFREKVRRAGRNPEEMKWILRVHNPLEGEKATEPRALLGGTPQHAVEDLPRLKELGIDHVFYDMNHPAQVPIETQLALLRRLVRLIKP*
Ga0137381_1013535413300012207Vadose Zone SoilLARAARIADGIMPAGGRSTTIEKLSQTINNFRDMVRRAARNPEEIRWILRVHNPLTEEKATEPRALLGGTPQQAAEDLPRLMELGIDHVFYDMNHPAQVPVDTQLVLLRRLVRLMKD*
Ga0137381_1027379613300012207Vadose Zone SoilMVQGAGRNPEEMKWILRVHNPLSEEKAKEPRALLGGTPKQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRL
Ga0137379_1018313823300012209Vadose Zone SoilMKWILRVHNPLSEEKAKEPRALLGGTPKQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIRH*
Ga0137378_1014169843300012210Vadose Zone SoilSQTINNFRDMVRKADRNPDEMKWILRVHNVLDEEKAEEPRALLGGAPEQAAEDLLRLKELGIDHVFYDMNHPAHVPINTQLALLRRLVRLIQRIESAA*
Ga0137378_1019914513300012210Vadose Zone SoilMVRKADRNPDEMRWILRVHNVLEEEKAAEPRALLGGTPEQAARDLPRLKELRIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKE*
Ga0137378_1150450113300012210Vadose Zone SoilIMPAAAGSTMIEKLSQTINNFRDMVRRAGRHPEEMKWILRVHNPLTEQKATESRTLLGGMPQQAAEDFPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0137386_1040868623300012351Vadose Zone SoilMVRKADRNPDEMKWILRVHNVLEEEKAAEPRALLGGTPEQAARDLPRLKELRIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIK*
Ga0137384_1009928823300012357Vadose Zone SoilMVRKADRNPDEMKWILRVHNVLGEEKAADPRALLGGTPEQAAKDLPRLKELGIDHVFYDINHPAHVPIDTQLVLLRRLVRLIKE*
Ga0137368_1018252513300012358Vadose Zone SoilINNFRALVRRAGRNQDEMRWILRVHNPLDEEKATEDRASLGGAPEQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRKLMRLIKN*
Ga0137385_1043671723300012359Vadose Zone SoilRAARIADGIMPAAGRSTTIEKLSQTIKDFRDMVRRAGRNPEEMKWILRVHNPLDEEKATDPRPLLGGTPQQAATDLPRLKELGIDHTFYDMNHPAQVPIDTQLVLLRRLMRLIKN*
Ga0137385_1089905413300012359Vadose Zone SoilMKWILRVHNPLEEEKASEPRALLGGTPQQASEDLPRLKELGINHVFYDMNHPAHVPIDTQLVLLRRLVRLIRH*
Ga0137385_1143347413300012359Vadose Zone SoilARIADGIMPAAGRSTTIEKLNQTINDFRDMVRRAGRNPEEMKWILRVHNPLSKEKATEPRTLLGGTPQQASEDLPGLKELGIDHVFCDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0137360_1105533213300012361Vadose Zone SoilDMVRRAGRSPEEMRWILRVHNPLDEEKATEPRTLLGGTPQQAAKDLPKLKDLGIDHIFYDMNHPAHVPIETQLVLLRRLVRLIKAQGGSR*
Ga0137396_1077662023300012918Vadose Zone SoilMVRRAGRNPEEIRWILRVHNALDEGKAAEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLALLRRLVRLINA*
Ga0137416_1012707033300012927Vadose Zone SoilMPAAAGGTTIEKLSQTINNFRDMVRIAGRNPEELKWILRVHNPLGEEKATEPRALLGGTPEQAARDLPRLKALGIDHIFYDMNHPAQVPIDTQLVLLRKLVRLIKA*
Ga0137410_1014662723300012944Vadose Zone SoilMKWILRVHNPLGEEKATEPRPLLGGTPQQAAQDLPRLKELGIDHAFYDMNHPAQVPIDTQLVLLRRLMRLIKA*
Ga0137410_1058652413300012944Vadose Zone SoilTINNFRDMVRRAGRNPEEIKWILRVHNPLEEKATEPRALLGGTPQQAAEDLPRVRELGIDHVFYDMNHPAHLPIDTQLVLLRKLVRLI*
Ga0134077_1005208413300012972Grasslands SoilMVRRASRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTQLVLLRRLVRLIKA*
Ga0134077_1007423233300012972Grasslands SoilMVQKADRNPDEMRWILRVHNVLEEEKATEPRALLGGAPEQAVTDLPRLKELGLDHVFYDMNHPAHVPIDTQLVLLRRLVELIKD*
Ga0134077_1024083213300012972Grasslands SoilSQTINNFRDLVRRAGRKPEEMKWILRVHNPLYEEKATEPPALLGGTPQQAAGDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIRNN*
Ga0134077_1040233513300012972Grasslands SoilAGRNPEEMKWILRVHNPLEEKATEPRALLGGTPQQAAEDLPRVRELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLTKA*
Ga0134076_1001591733300012976Grasslands SoilRAARIADGIMPAAGRSTTIEKLSQTINNFRDIVRRAGRTPEEMKWILRVHNPLAEEKATEPPALLGGTPQQAAKDLPRLKELGINHVFYDMNHPAQIPIDTQLVLLRRLVRLIKA*
Ga0134076_1014666323300012976Grasslands SoilKWILRVHNPLYEEKPREPPSLLGGTPEQAAKDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN*
Ga0134076_1019201413300012976Grasslands SoilILRVHNPLEEKAAEPRALLGGTPQQATEDLPRVRELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLTKA*
Ga0134087_1051738313300012977Grasslands SoilRAGRNPEEIRWILRVHNPLSEEKGTEPRALLGGTSQQAAEDLPRLEELGIDDVFYDMNHPAQVPIDTQLVLLRRLVRLIKA*
Ga0134075_1001820233300014154Grasslands SoilMVRRASRNPEEIKWILRVHNPLDEETAREPRALLGGTPQQAAKDLPRLKELGINHVFYDMNHPARIPIDTQLVLLRRLVRLIKA*
Ga0137418_1132621913300015241Vadose Zone SoilQTINSFSDMVRRAGRNPEEMKWILRVHNPLTEEKATEPRPLLGGTPQQAAQDLPRLKELGIDHVFYDMNHPAQVPIDSQLALLRRLVRLIKA*
Ga0137409_1013787023300015245Vadose Zone SoilMKWILRVHNPLGEEKATEPRPLLGGTPQQAAQDLPRLKELGIDHAFYDMNHPAQVPIDTQLVLLRRLMRLIKV*
Ga0134089_1003471013300015358Grasslands SoilMPAAGRSTTIEKLSQTINNFRDLVRKADRNPDEMRWILRVHNVLEEEKATEPRALLGGAPEQAVTDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRMIR*
Ga0134069_111984623300017654Grasslands SoilAGRNPEEMKWILRVHNPLDEEKATEPRPLLGGTPRQAAKDLPRLKERGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0134112_1033346923300017656Grasslands SoilMKWILRVHNPLDKEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAKVPIDTQLLLLRRLVRLIKA
Ga0134083_1016150623300017659Grasslands SoilEMKWILRVHNVLEEEKAVEPRALVGGAPEQAVTDLPRLRELGIDHVFYDMNHPAHVPINTQLVLLRRLVELIKD
Ga0187803_1040063723300017934Freshwater SedimentMVRRAGRNPAEMQWILRVHNTLDKEKATDPRPLLGGTPQQALEDLPRLKDIGIDHVFYDMNHPAHIPIDTQLLLLRRLMQLIKN
Ga0066667_1004490943300018433Grasslands SoilMKWILRVHNPLTEEKATEPRALLGGTPQQAAEDLPRLRELGIDHVFYHMNHPTQVPIDTQLVLLRRLLRMIKA
Ga0066667_1008952933300018433Grasslands SoilVAAARSTTLDKLSQTVNSFADMVRRAGRSPEEMKWILRVHNRLDEEKARASSVIGGGTPQKAATDLPRLKELGIDHIFYDMNHPAHVPIDTQLVLLRRLVLLIKA
Ga0066662_1017207213300018468Grasslands SoilDGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKRILRVHNPLDEEKATEPRALLGGTPQQAAEDLPRLNELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0210404_1015559813300021088SoilASMKRASRLADGILPAAGRKTTIEKLSQTINNFHDMVRRAGRNPEEMKWILRVHNPLEEKSTEPRALLGGTPQQAAKDLPRLKEIGIDHVFYDMNHPAGVPIDTQLLLLRRLMRLIKSQENSRVEKNPQA
Ga0207646_1093142313300025922Corn, Switchgrass And Miscanthus RhizosphereLDEEKAAEPRALLGGTPQQAAKDLPGLKDLGIDQVFYDMNHPAQVPIDNQLLLLRRLMRLIKN
Ga0209350_102149413300026277Grasslands SoilLSKEKATEPRMLLGGTPQQASEDLPGLKELGIDHVFCDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209235_1003242103300026296Grasslands SoilAGRSTTIEKLSQTINNFRDMVRRAGRSPEEMKWILRVHNPLDEEKASEPRALLGGTPQQAAEDLPRLKDLGIDHIFYDMNHPAHVPIETQLVLLRRLVRLINAS
Ga0209237_102269813300026297Grasslands SoilEKLNQTINNFRDMVRRAGRDPEEMKWILRVHNPLEKGKGTEPRALLGGTPQQAATDLPRLKELGIDHIFYDMNHPAHVPIETQLVLLRKLVRLIKA
Ga0209236_100846143300026298Grasslands SoilMPAAGRGTTIEKLGQTINNFRDMVRRAGRHPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209236_126074313300026298Grasslands SoilPEEMKWILRVHNPLSEEKAKAPRALLGGTPKQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLTKA
Ga0209055_100617633300026309SoilMPAAAGSTTIEKLSQTIKDFHEKVRRAGRNPEEMKWILRVHNPLSGEKATEPRALLGGTPQQAAEDLPRLKELGIDHIFYDMNHPAQVPK
Ga0209239_107669513300026310Grasslands SoilAEEKATEPPALLGGTPEQAAKDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN
Ga0209761_1001911143300026313Grasslands SoilLSQTINDFRDMVRRAGRNPEEMKRILRVHNPLSKEKATEPRMLLGGTPQQASEDLPGLKELGIDHVFCDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209761_100673763300026313Grasslands SoilIADGIMPAAAKSITIERLSQTINNFRDMVRRAGRDPEEMKWILRVHNPLEKGKGTEPRALLGGTPQQAATDLPRLKELGIDHIFYDMNHPAHVPIETQLVLLRRLVRLIKA
Ga0209801_107428223300026326SoilMVRRAGRNPEEMKWILRVHNPLTEEKATEPRALLGGTPQQAAEDLPRLRELGIDHVFYHMNHPTQVPIDTQLVLLRRLLRMIKA
Ga0209801_109988133300026326SoilVDPESPQPPEEIRWILRVHNPLSEERATDPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLVLLRRLMRLIKA
Ga0209803_118035323300026332SoilTINNFREMVRRAGRNPDEMIWILRVHNPLDEEKATEPRALLGGTPQQAAEDLTRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209158_122063223300026333SoilIKDFREKVRRAGRNPEEIKWILRVHNPLEGEKATEPRALLGGTPQQAVEDLPRLKELGIDHVFYDMNHPAQVPIETQLALLRRLVRLIKP
Ga0257177_108021513300026480SoilMKWILRVHNPLEKGKATEPRALLGGTPQQSATDLPRLKELGIDHVFYDMNHPAHVPIGTHLVLLRRLVRLIKA
Ga0257181_108614213300026499SoilAGRSTTIEKLSQTVNSFSDMVRRAGRNPEEMRWILRVHNPLTEEKAAEPRPLLGGTPQQAAKDLPRIKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVQLIRN
Ga0209378_109143623300026528SoilMKWILRVHNPLTEEKAAEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209806_101265263300026529SoilMPAAGRGTTIEKLGQTINNFRDMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209160_1006255133300026532SoilMVRRAGRNPEEMKWILRVHNPLSEEKATEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDT
Ga0209160_124286313300026532SoilLSQTINSFQDMVRRAGRKPEEMKWFLRVHNPLYEEKASEPRALLGGTPQQAAGDFPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN
Ga0209058_117084323300026536SoilMKWILRVHNPMYEEKAAEPRALLGGTPQQAAEDLSRVRELGIGHVFYDMNHPAHVPIDTQLVLLRRLVRLIKP
Ga0209056_10009629103300026538SoilLTEEKAAEPRALLGGTPQQAAEDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLVRLIKA
Ga0209577_1012660113300026552SoilSTTIEKLSQTINSFRDMVRRAGRNPEEIRWILRVHNPLSEERATEPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLVLLRRLMRLIKA
Ga0209689_107386533300027748SoilAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIRWILRVHNPLSEERATEPRALLGGTPQQAAEDLPRLRELGIDHVFYDMNHPAHVPIDTQLVLLRRLMRLIKA
Ga0209689_119485513300027748SoilAAGRSTTIEKLSQTINSFRDMVRRAGRNPEEIMWILRVHNPLSEERATEPRALLGGTPQQAEEDLPRLKELGIDHVFYDMNHPARVPIDTQLVLLRRLMRMIKA
Ga0209180_1001262013300027846Vadose Zone SoilKLSQTINNFRDMVRRAGRNPEELKWILRVHNPLEEEKAPEPQALLGGTPQQAAEVLPRVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKA
Ga0209180_1028312613300027846Vadose Zone SoilRVHNPLDEEKAAGPRALLGGTPQQAAQDLPRLKELGIDHVFYDMNHPAQVPIDTQLVLLRRLMRLIKK
Ga0209180_1034184523300027846Vadose Zone SoilAARIADGIMPAAGRSTTIEKLSQTINNFRDMVRRAGRNPEEMKWILRVHNPLAEEKVTEPPALLGGTPQQAAKDFARVKELGIDHVFYDMNHPAHVPIDTQLVLLRRLVRLIKNN
Ga0307469_1090966713300031720Hardwood Forest SoilRIADGIMPAAGRSTTIEKLSQTINSFHEMVRRAGRNPEEIMWILRVHNPLEEEKALEPRALLGGTPQQAANDLPRLKELGIDHVFYDMNHPAQVPIQTQLVLLRRLMRLIKA
Ga0307477_1080945223300031753Hardwood Forest SoilRVHNSLSDEKAAEPRALLGGMPQQAVNDLPRLRELGIDHVFYDMNHPDQVPIETQLALLRRLVKLIKA
Ga0307479_1054739013300031962Hardwood Forest SoilDELRWILRVHNSLDEEKAAEPRALLAGTPQQAVNDLPRLRELGIDHVFYDMNHPAQVPIETQLALLRRLVKLIKA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.