NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F087516

Overview

Basic Information
Family ID F087516
Family Type Metagenome / Metatranscriptome
Number of Sequences 110
Average Sequence Length 157 residues
Representative Sequence MNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Number of Associated Samples 83
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 68.18 %
% of genes near scaffold ends (potentially truncated) 32.73 %
% of genes from short scaffolds (< 2000 bps) 60.00 %
Associated GOLD sequencing projects 73
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.79
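
The percentages above are consistent with whole-number gene counts out of the family's 110 sequences. The following is a minimal sketch of that arithmetic, assuming each percentage is simply a count divided by 110 and rounded to two decimals; the page does not state how the values were computed, and the function name `implied_count` is hypothetical.

```python
FAMILY_SIZE = 110  # "Number of Sequences" from Basic Information

# Percentages as reported under Quality Assessment.
reported = {
    "valid RBS motifs": 68.18,
    "near scaffold ends": 32.73,
    "short scaffolds (< 2000 bps)": 60.00,
}


def implied_count(percent, total=FAMILY_SIZE):
    """Return the whole-number count closest to `percent` of `total`."""
    return round(percent * total / 100)


# Assumption: every percentage refers to the same 110 family members.
counts = {label: implied_count(p) for label, p in reported.items()}

for label, n in counts.items():
    # Recompute the percentage from the count to check the round trip.
    print(f"{label}: about {n} of {FAMILY_SIZE} genes ({100 * n / FAMILY_SIZE:.2f} %)")
```

Under that assumption, the reported values correspond to roughly 75, 36 and 66 genes respectively.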


Most Common Taxonomy
Group Bacteria (65.455 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (22.727 % of family members)
Environment Ontology (ENVO) Unclassified (28.182 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (55.455 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 22.63%, β-sheet: 27.37%, Coil/Unstructured: 50.00%

Predicted 3D Structure

pTM-score: 0.79

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
b.30.5.6: alpha-mannosidase, C-terminal domain | d3bvxa2 | 3bvx | 0.54468
d.104.1.1: Class II aminoacyl-tRNA synthetase (aaRS)-like, catalytic domain | d1yfsa2 | 1yfs | 0.54187
d.104.1.1: Class II aminoacyl-tRNA synthetase (aaRS)-like, catalytic domain | d1atia2 | 1ati | 0.53107
d.104.1.1: Class II aminoacyl-tRNA synthetase (aaRS)-like, catalytic domain | d3p8ta_ | 3p8t | 0.52796
d.104.1.0: automated matches | d1x54a2 | 1x54 | 0.52686


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 110 Family Scaffolds
PF00691 | OmpA | 11.82
PF07883 | Cupin_2 | 7.27
PF12276 | DUF3617 | 3.64
PF07732 | Cu-oxidase_3 | 2.73
PF07731 | Cu-oxidase_2 | 1.82
PF00873 | ACR_tran | 0.91
PF13635 | DUF4143 | 0.91
PF13248 | zf-ribbon_3 | 0.91
PF05362 | Lon_C | 0.91
PF00578 | AhpC-TSA | 0.91
PF13602 | ADH_zinc_N_2 | 0.91
PF00486 | Trans_reg_C | 0.91
PF01435 | Peptidase_M48 | 0.91
PF13189 | Cytidylate_kin2 | 0.91
PF13517 | FG-GAP_3 | 0.91
PF16640 | Big_3_5 | 0.91
PF00154 | RecA | 0.91
PF13181 | TPR_8 | 0.91
PF07676 | PD40 | 0.91
PF13424 | TPR_12 | 0.91
PF12840 | HTH_20 | 0.91
PF00753 | Lactamase_B | 0.91
PF02687 | FtsX | 0.91
PF03841 | SelA | 0.91
PF13240 | zinc_ribbon_2 | 0.91
PF01370 | Epimerase | 0.91
PF12833 | HTH_18 | 0.91

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 110 Family Scaffolds
COG2132 | Multicopper oxidase with three cupredoxin domains (includes cell division protein FtsP and spore coat protein CotA) | Cell cycle control, cell division, chromosome partitioning [D] | 4.55
COG0466 | ATP-dependent Lon protease, bacterial type | Posttranslational modification, protein turnover, chaperones [O] | 0.91
COG0468 | RecA/RadA recombinase | Replication, recombination and repair [L] | 0.91
COG1067 | Predicted ATP-dependent protease | Posttranslational modification, protein turnover, chaperones [O] | 0.91
COG1750 | Predicted archaeal serine protease, S18 family | General function prediction only [R] | 0.91
COG1921 | Seryl-tRNA(Sec) selenium transferase | Translation, ribosomal structure and biogenesis [J] | 0.91
COG3480 | Predicted secreted protein YlbL, contains PDZ domain | Signal transduction mechanisms [T] | 0.91


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 65.45 %
Unclassified | root | N/A | 34.55 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300002245|JGIcombinedJ26739_100027696 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4972
3300002245|JGIcombinedJ26739_100037482 | All Organisms → cellular organisms → Bacteria | 4343
3300003505|JGIcombinedJ51221_10025529 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2112
3300005434|Ga0070709_10851591 | Not Available | 718
3300005529|Ga0070741_10189892 | All Organisms → cellular organisms → Bacteria | 2012
3300005529|Ga0070741_10319024 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1452
3300005533|Ga0070734_10000011 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 658267
3300005533|Ga0070734_10217365 | All Organisms → cellular organisms → Bacteria | 1100
3300005534|Ga0070735_10021501 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 4637
3300005534|Ga0070735_10111479 | Not Available | 1715
3300005537|Ga0070730_10004580 | All Organisms → cellular organisms → Bacteria | 12077
3300005537|Ga0070730_10005192 | All Organisms → cellular organisms → Bacteria | 11271
3300005541|Ga0070733_10422124 | Not Available | 888
3300005542|Ga0070732_10148146 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales | 1397
3300005602|Ga0070762_10079387 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis → Candidatus Koribacter versatilis Ellin345 | 1863
3300005610|Ga0070763_10002776 | All Organisms → cellular organisms → Bacteria | 6342
3300005842|Ga0068858_100195128 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1913
3300005843|Ga0068860_100755363 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 984
3300005921|Ga0070766_10030082 | All Organisms → cellular organisms → Bacteria | 2949
3300006176|Ga0070765_100066059 | All Organisms → cellular organisms → Bacteria | 3015
3300006237|Ga0097621_100322414 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1369
3300006358|Ga0068871_100315452 | All Organisms → cellular organisms → Bacteria | 1375
3300006893|Ga0073928_10105378 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 2356
3300007982|Ga0102924_1021371 | All Organisms → cellular organisms → Bacteria | 4691
3300007982|Ga0102924_1211122 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 832
3300009093|Ga0105240_12703894 | Not Available | 512
3300009098|Ga0105245_12820606 | Not Available | 538
3300009174|Ga0105241_10230981 | All Organisms → cellular organisms → Bacteria | 1559
3300009176|Ga0105242_10429164 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 1240
3300010371|Ga0134125_10000143 | All Organisms → cellular organisms → Bacteria | 104747
3300010371|Ga0134125_10761316 | Not Available | 1066
3300010371|Ga0134125_11894482 | Not Available | 648
3300010373|Ga0134128_10067934 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4092
3300010396|Ga0134126_10102475 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3517
3300010396|Ga0134126_11867928 | Not Available | 658
3300010396|Ga0134126_11889190 | Not Available | 654
3300010397|Ga0134124_12534787 | Not Available | 555
3300017927|Ga0187824_10081552 | Not Available | 1026
3300017927|Ga0187824_10117140 | Not Available | 866
3300017927|Ga0187824_10183511 | All Organisms → cellular organisms → Bacteria | 705
3300017930|Ga0187825_10340151 | Not Available | 566
3300017936|Ga0187821_10077545 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Occallatibacter → Occallatibacter savannae | 1209
3300017936|Ga0187821_10134533 | Not Available | 927
3300017937|Ga0187809_10076493 | Not Available | 1101
3300017994|Ga0187822_10079604 | Not Available | 970
3300020579|Ga0210407_10000065 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 139249
3300020580|Ga0210403_10015877 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 6037
3300020581|Ga0210399_10018196 | All Organisms → cellular organisms → Bacteria | 5562
3300020582|Ga0210395_10123735 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1921
3300020583|Ga0210401_10233370 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis → Candidatus Koribacter versatilis Ellin345 | 1698
3300021168|Ga0210406_10004658 | All Organisms → cellular organisms → Bacteria | 14582
3300021170|Ga0210400_10260330 | Not Available | 1420
3300021178|Ga0210408_10289229 | Not Available | 1306
3300021180|Ga0210396_10149433 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2099
3300021180|Ga0210396_11050118 | Not Available | 688
3300021181|Ga0210388_10363037 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1273
3300021401|Ga0210393_10006816 | All Organisms → cellular organisms → Bacteria | 8983
3300021403|Ga0210397_10177958 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis → Candidatus Koribacter versatilis Ellin345 | 1506
3300021404|Ga0210389_10021545 | All Organisms → cellular organisms → Bacteria | 4977
3300021405|Ga0210387_10852529 | Not Available | 803
3300021406|Ga0210386_11260893 | Not Available | 623
3300021407|Ga0210383_10023322 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5204
3300021420|Ga0210394_10043569 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 3945
3300021432|Ga0210384_10285067 | Not Available | 1487
3300021433|Ga0210391_10077848 | All Organisms → cellular organisms → Bacteria | 2620
3300021474|Ga0210390_10156969 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis → Candidatus Koribacter versatilis Ellin345 | 1917
3300021477|Ga0210398_10002377 | All Organisms → cellular organisms → Bacteria | 18447
3300021477|Ga0210398_11125962 | Not Available | 622
3300021478|Ga0210402_10044590 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3865
3300021559|Ga0210409_10025515 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 5718
3300022557|Ga0212123_10020032 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 7706
3300022557|Ga0212123_10101542 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis → Candidatus Koribacter versatilis Ellin345 | 2347
3300025927|Ga0207687_11714515 | Not Available | 538
3300025934|Ga0207686_10333643 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 1137
3300026035|Ga0207703_10197466 | All Organisms → cellular organisms → Bacteria | 1786
3300027030|Ga0208240_1021996 | Not Available | 694
3300027297|Ga0208241_1020127 | Not Available | 994
3300027826|Ga0209060_10000025 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 511435
3300027855|Ga0209693_10252829 | Not Available | 863
3300027857|Ga0209166_10001611 | All Organisms → cellular organisms → Bacteria | 18442
3300027857|Ga0209166_10004970 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 9177
3300027879|Ga0209169_10691991 | Not Available | 529
3300027986|Ga0209168_10184230 | Not Available | 1049
3300028906|Ga0308309_10886547 | Not Available | 774
3300031090|Ga0265760_10017478 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2060
3300031708|Ga0310686_103973131 | Not Available | 620
3300031708|Ga0310686_110312751 | Not Available | 1198
3300031715|Ga0307476_10773139 | Not Available | 711
3300031718|Ga0307474_10417057 | Not Available | 1046
3300031718|Ga0307474_10720535 | Not Available | 788
3300031996|Ga0308176_10580959 | All Organisms → cellular organisms → Bacteria | 1149
3300032770|Ga0335085_10049469 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 5694
3300032770|Ga0335085_10429563 | All Organisms → cellular organisms → Bacteria | 1525
3300032783|Ga0335079_10001756 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 24765
3300032783|Ga0335079_10009577 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 11034
3300032783|Ga0335079_10022006 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 7297
3300032783|Ga0335079_10592586 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1170
3300032805|Ga0335078_10847927 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Occallatibacter → Occallatibacter savannae | 1106
3300032828|Ga0335080_10152477 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2559
3300032828|Ga0335080_10992045 | Not Available | 856
3300032829|Ga0335070_10018122 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 8167
3300032892|Ga0335081_10019029 | All Organisms → cellular organisms → Bacteria | 11102
3300032892|Ga0335081_10061236 | All Organisms → cellular organisms → Bacteria | 5829
3300032892|Ga0335081_11320226 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 812
3300032897|Ga0335071_11006274 | Not Available | 780
3300032898|Ga0335072_10831381 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Granulicella → unclassified Granulicella → Granulicella sp. S156 | 877
3300032954|Ga0335083_10089207 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 3078
3300033004|Ga0335084_10389447 | All Organisms → cellular organisms → Bacteria | 1436
3300033158|Ga0335077_10819548 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 946
3300033158|Ga0335077_12249142 | Not Available | 500
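
Each scaffold identifier in the table above joins a numeric sample Taxon OID and a scaffold name with a `|` character, for example `3300002245|JGIcombinedJ26739_100027696`. The following is a minimal sketch of splitting such identifiers and counting scaffolds per sample. The function name `split_scaffold_id` is hypothetical and is not part of NMPFamsDB or IMG/M, and the identifier layout is inferred from the rows shown here rather than from a published specification.

```python
from collections import Counter


def split_scaffold_id(scaffold_id):
    """Split '<taxon OID>|<scaffold name>' into its two parts.

    Assumption: the text before '|' is a numeric Taxon OID, as in
    every row of the Associated Scaffolds table.
    """
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


# Three identifiers copied from the table above.
scaffold_ids = [
    "3300002245|JGIcombinedJ26739_100027696",
    "3300002245|JGIcombinedJ26739_100037482",
    "3300005529|Ga0070741_10189892",
]

# Tally how many family scaffolds come from each sample.
scaffolds_per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)

for taxon_oid, n in sorted(scaffolds_per_sample.items()):
    print(taxon_oid, n)
```

Applied to all 110 rows, the same tally would show which samples contribute more than one scaffold to the family.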



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 22.73%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 17.27%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 12.73%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 7.27%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 7.27%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 6.36%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 4.55%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.55%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 3.64%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.73%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.73%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.73%
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 1.82%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.82%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.91%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.91%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300003505 | Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924) | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005533 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 | Environmental
3300005534 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005541 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300005610 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 | Environmental
3300005842 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 | Host-Associated
3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated
3300005921 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006237 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2) | Host-Associated
3300006358 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 | Host-Associated
3300006893 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG | Environmental
3300007982 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaG | Environmental
3300009093 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG | Host-Associated
3300009098 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG | Host-Associated
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300009176 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG | Host-Associated
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental
3300017930 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5 | Environmental
3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental
3300017937 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_4 | Environmental
3300017994 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021181 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300021474 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022557 | Paint Pots_combined assembly | Environmental
3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated
3300025934 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes) | Host-Associated
3300026035 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes) | Host-Associated
3300027030 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF041 (SPAdes) | Environmental
3300027297 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF047 (SPAdes) | Environmental
3300027826 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 (SPAdes) | Environmental
3300027855 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027879 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 (SPAdes) | Environmental
3300027986 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes) | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300031090 | Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZI1 (Metagenome Metatranscriptome) | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031996 | Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R2 | Environmental
3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental
3300032783 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3 | Environmental
3300032805 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2 | Environmental
3300032828 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4 | Environmental
3300032829 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3 | Environmental
3300032892 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5 | Environmental
3300032897 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.5 | Environmental
3300032898 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.1 | Environmental
3300032954 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2 | Environmental
3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental
3300033158 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.1 | Environmental

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_10002769643300002245Forest SoilMHFRLDDFEKEECPMKTIWLGAAVLAFALANAAGQAKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDGQVCRSKMQGNSYSLSNLMNPASGIKMEAAAAWYASRLSGYKKVQGYASGRSQIAFYNSDRTIVIFLTGQLGAQGENPDAFSVAYERYQPGISEKTIVGLTQGNRL*
JGIcombinedJ26739_10003748253300002245Forest SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTXVILXTGQLGAQGENTDAFSVSYERSQPGLSVKTIXGLTQGKFVCN*
JGIcombinedJ51221_1002552943300003505Forest SoilMXNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIXGLTQGKFVCN*
Ga0070709_1085159123300005434Corn, Switchgrass And Miscanthus RhizosphereMKISAIVAAILFLSLANAAGQAKKLTNDPLTGLPLSPATYGGPFQGNDPDKMPDGQVCRSKMQGNSYDLLKIKIDAATEWYSSHLSGFKKVQGYESGRSQTAFYNSDGTVVIFLTGQRGEQGENTAAYSVAYERYQPGISEKTIIGLTQGKFVCQ*
Ga0070741_1018989233300005529Surface SoilMKSLSLCLAVFVLAIVSAAGQAKKLTNDPLTGLPVSPAVLDSEPDKMPNSQVCKSKMQGEFYPLSNIMDPAKAIKMDAAAAWYASHLSGFTKVQGYESGRSQIAFYNADRTIVIFLTGQSGAPGENTGAYSVSYQRYQPGLSEKTIVGLTQGKMVCN*
Ga0070741_1031902413300005529Surface SoilGQAKVLTNDPLTGFPLIPATVVVENAGNTPIKMPDAQICKSKMQANFYDLYNYFSKRNIKVSEVNSWYSSHLPGFKKVEGYESGRSQTAFYNSDRTILIIVTGNKGAPGENTDAYSVAYERYQPGISEKTVTSLTQGKIVCQ*
Ga0070734_100000113823300005533Surface SoilLTFCLANATGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKLSDVQVCKSKGQGNFYSLSNIMNPASGLKMDAAAAWYASHLSGFKKVQGYESGRSQIAFYNSDKTIVILLTGQLGAEGENANAYGVSYERFQPGISEKAIGPNTGQVHLQLIGAERC*
Ga0070734_1021736523300005533Surface SoilMKTLYLSLVVFVIAIVSAAGQAKKATTDPLTGLPVIPAVLDNQPDKMPNAQVCKSKMQGDFYLLSSIMSPEKAIKMDAAAAWYASNLSGFTKVKGYESGRSQIAFYNADRTIVIFLTGQSGAQGENTGAYSVAYQRYQPGLSEKTIVGMTQGKMVCN*
Ga0070735_1002150123300005534Surface SoilVFTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKMQGNFYSLSNIMNPANGIKMDAAAEWYASHLSGYKKVQGYESGRTQIGFYNSDGSIVIFLTGQRGAQGENADAYSVAYERYQPGISEKTIVGLTQGKIACN*
Ga0070735_1011147923300005534Surface SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLALIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVCYERSQPGLSVKTIVGLTQGKFVCN*
Ga0070730_10004580113300005537Surface SoilMKNNVLVSVIVALGVASASGQAKVLTADPLTGLPLIPATVVFKNVGNEPDKIPDGQVCKSKMQGNVYSLSTVMNPASGIKMDAVAAWYASHLSGFKKIQGYADGRSQIAFYNSDGTLLIFIIGQLGAQGENTDAFSVAYGRYQPGLSVKTITGLTQGKFICQ*
Ga0070730_1000519233300005537Surface SoilMKNKLLAAAILTFCLANAAGQGKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDARVCKSKVQGDVYSLSNVMNPANGVKMEAAAAWYASHLSGYKKVQGYADGRSQIAFYNSNGTIVVFLTGQLGAQGESADAFSVTYERSQPGISEKTITSLTEGKIVCQ*
Ga0070733_1042212413300005541Surface SoilNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFEKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIVGLTQGKFVCN*
Ga0070732_1014814623300005542Surface SoilMKLTLLFATMLIFSLANAAGQGKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDARVCKSKVQGNVYSLSNVMNPAKGVKMEAAAAWYASHLSGYKKVQGYADGRSQIAFYNSDGTIIVFLTGQLGAQGENADAFSVSYVRYQPGTSEKAITSLTEGKIVCQ*
Ga0070762_1007938723300005602SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTVVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN*
Ga0070763_1000277643300005610SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTVVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIACLTQGKFVCN*
Ga0068858_10019512823300005842Switchgrass RhizosphereMKAICLCVAVFAATLASAAGQAKVLTADPLTGLPLIPATVLFKNVGNEPDKLPDGQVCKSKMQGNFYSLSNVMNPASGIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNSDATILIFIIGQRGAQGENTDAFSVAYERYQPGLFVKTVTGLTQGKFICQ*
Ga0068860_10075536323300005843Switchgrass RhizosphereQLQIGIESTHHPMRLRPTDLKMEDFSMKAICLCVAVFAATLASAAGQAKVLTADPLTGLPLIPATVLFKNVGNEPDKLPDGQVCKSKMQGNFYSLSNVMNPASGIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNSDATILIFIIGQRGAQGENTDAFSVAYERYQPGLFVKTVTGLTQGKFICQ*
Ga0070766_1003008243300005921SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIVGLTQGKFVCN*
Ga0070765_10006605923300006176SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKTPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTVVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIVGLTQGKFVCN*
Ga0097621_10032241413300006237Miscanthus RhizosphereLFKNVGNEPDKLPDGQVCKSKMQGNFYSLSNVMNPASGIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNSDATILIFIIGQRGAQGENTDAFSVAYERYQPGLSVKTVTGLTQGKFICQ*
Ga0068871_10031545233300006358Miscanthus RhizosphereMKAICLCVAVFAATLASAAGQAKVLTADPLTGLPLIPATVLFKNVGNEPDKLPDGQVCKSKMQGNFYSLSNVMNPASGIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNSDATILIFIIGQRGAQGENTDAFSVAYERYQPGLSVKTVTGLTQGKFICQ*
Ga0073928_1010537833300006893Iron-Sulfur Acid SpringMKMLCLCAVVLALTLANARGQSKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPEAQICKSKMQGEFYSLSSPYSKNKIKLDDATAWYASHLSGFKKVQGYESGRSQIAFYNSNGTGVIFLTGNNGAQGENTDAYSVAYERYQPGISEKTITSLTQGKIVCQ*
Ga0102924_102137153300007982Iron-Sulfur Acid SpringMEEFPMKMLCLCAVVLALTLANARGQSKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPEAQICKSKMQGEFYSLSSPYSKNKIKLDDATAWYASHLSGFKKVQGYESGRSQIAFYNSNGTGVIFLTGNNGAQGENTDAYSVAYERYQPGISEKTITSLTQGKIVCQ*
Ga0102924_121112213300007982Iron-Sulfur Acid SpringPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTVVILLTGQLVAQGENTDAFSVSYERSQPGLSVKTITSLTQGTFVCQ*
Ga0105240_1270389413300009093Corn RhizosphereKNDPLTGLPLIPATVQFKNVGNEPDKLPDAQVCKSKVQGNFYSLSNVMNPASRIKMDAAAAWYASHLSGFKKVQGYESGRSQIAFYKPDGTVVILLTGQLGAPDENADAFSVTYERPQPGLSEKTIISLTQGKIVCN*
Ga0105245_1282060613300009098Miscanthus RhizosphereAKVLTADPLTGLPLIPATVLFKNVGNEPDKLPDGQVCKSKMQGNFYSLSNVMNPASGIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNSDATILIFIIGQRGAQGENTDAFSVAYERYQPGLFVKTVTGLTQGKFICQ*
Ga0105241_1023098123300009174Corn RhizosphereMEKSMTNATVVAIVLVFASANASGQAKVLTNDPLTNFPLIPATVVVANAGNEPVKMPDGRVCKSAIQGNFYSLYNFFSKHNLKVSQAVAWYSSHLSGFNKVSGYGSGRSQTAFYNSDQTTVIIVTGNPGAAGEDTDAYSVAYERYLPGLSERTITSLTQGKIICN*
Ga0105242_1042916413300009176Miscanthus RhizosphereMKNAIVVAGILVLGLANAAGQGKVLTNDPLTGLPLIPATAIANKPVKMPDGQVCKSKMQGNFYSLFSPTSYFSKQTIKVSEVVGWYASHLSGFNKVSGYESRRSHTAFYSSDRTILIIVTGNPGATGENTDAYSVAYERYQPGLSEKTITSLTQGKIACQ*
Ga0134125_10000143173300010371Terrestrial SoilVKYTLLVAAIFVFGLANAAGQAKVLTNDPLTDLPLIPATVVTKNAGNEPVKMPGGQVCKSKMQANFYSLYNYFSKNNVKVSNAIAWYSSHLSGFNKVSGYESRRSQTAFYNSDRTIVIIVTGNPGAAGEDTDAYSVAYERYQPGISEKTIAGLTQGKMVCPN*
Ga0134125_1076131633300010371Terrestrial SoilMKNVLIIAAIFVFSSANGAAQAKTLTNDPLTGLPLIPATVLFKNVGNEPDEMPDVQVCKSKMQGDSYSLSTVMKPASVIKMDAAAAWYTSHLSGFKKVQGYESGRSQIAFYKPDGTSVVFLTGQRGAQGENSGAYAVAYERYQPGISEKTITGLTQGKMVCQ*
Ga0134125_1189448223300010371Terrestrial SoilMKNVLIIAAIFVFSSANGAAQAKTLTNDPLTGLPLIPATVLFKNVGNEPDKMPDAQLCKSKMQGDFYSLTTVMNPASVIKMDAAAAWYTSHLSGFKKVQGYESGRSQIAFYKPDGTRVVFLTGQRGAQGENSGAYAVAYERYQPGISEKTITGLTQ
Ga0134128_1006793433300010373Terrestrial SoilMKNVLIIAAIFVFSSANGAAQAKTLTNDPLTGLPLIPATVLFKNVGNEPDKMPDAQLCKSKMQGDFYSLTTVMNPASVIKMDAAAAWYTSHLSGFKKVQGYESGRSQIAFYKPDGTSVVFLTGQRGAQGENSGAYAVAYERYQPGISEKTITGLTQGKMVCQ*
Ga0134126_1010247553300010396Terrestrial SoilVKYTLLVAAIFVFGLANAAGQVKVLTNDPLTDLPLIPATVVTKNAGNEPVKMPGGQVCKSKMQANFYSLYNYFSKNNVKVSNAIAWYSSHLSGFNKVSGYESRRSQTAFYNSDRTIVIIVTGNPGAAGEDTDAYSVAYERYQPGISEKTIAGLTQGKMVCPN*
Ga0134126_1186792823300010396Terrestrial SoilMKNVLIIAAIFVFSSANGAAQAKTLTNDPLTGLPLIPATVLFKNVGNEPDEMPDVQVCKSKMQGDSYSLSTVMKPASVIKMDAAAAWYTSHLSGFKKVQGYESGRSQIAFYKPDGTSVVFLTGQRGAQGENSGAYAVAYERYQPGISEKTITGLTQGK
Ga0134126_1188919023300010396Terrestrial SoilMKNVLIIAAIFVFSSANGAAQAKTLTNDPLTGLPLIPATVLFKNVGNEPDKMPDAQLCKSKMQGDFYSLTTVMNPASVIKMDAAAAWYTSHLSGFKKVQGYESGRSQIAFYKPDGTRVVFLTGQRGAQGENSGAYAVAYERYQPGISEKTITGLTQGK
Ga0134124_1253478713300010397Terrestrial SoilMEDFSMKAICLCVAVFAATLASAAGQAKVLTADPLTGLPLIPATVLFKNVGNEPDKLPDGQVCKSKMQGNFYSLSNVMNPASGIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNSDATILIFIIGQRGAQGENTDAFSVAYERYQPGLFVKTVTGLTQ
Ga0187824_1008155213300017927Freshwater SedimentPSLHFRMEDSMKLTLLIATILIFSLATAAGQPKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDEQVCKSKMQGNSYSLSNVMNPENEIKMDAAAAWYVSHLSGYKKVQGYESGRSQIAFYNSDGTLVIFLTGQQGAQGENTNVFLVAYERYQPGLSVKTITGLTQGKFVCN
Ga0187824_1011714013300017927Freshwater SedimentMKTLCLYAAVLALTLVNASGQAKVLTNDPLTGLPLIPTTVLSKNVGNEPDKMPDGQVCKSKMQGNFYSLFSIMNPTGGIKMDAAATWYASHLSGYRKVQGYESGRSQIAFYNSDGTIVIFLTGQRSPQGENADAYAVAYERYQPGVSEKTITSLTQGKIICP
Ga0187824_1018351123300017927Freshwater SedimentMKQTILTATIFAFCLANAAGQEKVLTNDPLTGLPLIPATVVAKNAGNKPVKMPDGKVCRSNMQGNFYSLYNYFAQNKIKLNEAIVWYATHLSQFKKIQGYESGRSQIAFYNSDRTVVIFLTGNKGAPGEDTDAYSVAYERYQPGLSEKTIASLTQGKIACN
Ga0187825_1034015113300017930Freshwater SedimentLTGLPLIPATVQFKNVGNEPDKMPDEQVCKSKMQGNSYSLSNVMNPENEIKMDVAAAWYVSHLSGYKKVQGYESGRSQIAFYNSDGTLVIFLTGQQGAQGENTNVFLVAYERYQPGISVKTITGLTQGKFVCN
Ga0187821_1007754523300017936Freshwater SedimentVSVIVALGVTSATGQAKVLTNDPLTGLPLIPATVVVQNAGNEPVKMPVGQVCKSKMQGNFYDLYSYFSKHNVKVSNVVAWYVSHLSGFSKVSGYDSSRSQTAFYNSDRTILIIVTGNPGAAGTDTEAYSVAYERYQPGISEKTITSLTQGKIICP
Ga0187821_1013453323300017936Freshwater SedimentMRNNLLAAAILTFCLANAAGQAKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDGQVCKSKMQGNIYSLSNVMNPASRIKMDAAAAWYASHLSGYKKVQGNASGRSQIAFYNSDRTIVVFLIGQLEAQGESADAFSVAYQRYQPGLSEKAIDGMTQGNFICN
Ga0187809_1007649313300017937Freshwater SedimentELAPSLHFRMEDSMKLTLLIATILIFSLATAAGQPKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDEQVCKSKMQGNSYSLSNVMNPENEIKMDAAAAWYVSHLSGYKKVQGYESGRSQIAFYNSDGTLVIFLTGQQGAQGENTNVFLVAYERYQPGLSVKTITGLTQGKFVCN
Ga0187822_1007960423300017994Freshwater SedimentMKLTLLIATILIFSLATAAGQPKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDEQVCKSKMQGNSYSLSNVMNPENEIKMDVAAAWYVSHLSGYKKVQGYESGRSQIAFYNSDGTLVIFLTGQQGAQGENTNVFLVAYERYQPGLSVKTITGLTQGKFVCN
Ga0210407_10000065943300020579SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLALIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIVGLTQGKFVCN
Ga0210403_1001587763300020580SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLALIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVCYERSQPGLSVKTIVGLTQGKFVCN
Ga0210399_1001819623300020581SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVCYERSQPGLSVKTIVGLTQGKFVCN
Ga0210395_1012373523300020582SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGDFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNANGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0210401_1023337023300020583SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLALIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIACLTQGKFVCN
Ga0210406_1000465863300021168SoilMKTIWLGAAVLAFALANAAGQAKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDGQVCRSKMQGNSYSLSNLMNPASGIKMEAAAAWYASRLSGYKKVQGYASGRSQIAFYNSDRTIVIFWTGQLGAQGENPDAFSVAYQPGISEKTIVGLTQGNRL
Ga0210400_1026033013300021170SoilMHFRLDDFEKEECPMKTIWLGAAVLAFALANAAGQAKVLTNDPLTGLPPIPATVQFKNVGNEPDKMPDGQVCRSKMQGNSYSLSNLMNPASGIKMEAAAAWYASRLSGYKKVQGYASGRSQIAFYNSDRTIVIFLTGQLGAQGGNPD
Ga0210408_1028922913300021178SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLALIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDA
Ga0210396_1014943313300021180SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIVGLTQGKFVCN
Ga0210396_1105011823300021180SoilMKTIWLGAAVLAFALANAAGQAKVLTNDPLTGLPPIPATVQFKNVGNEPDKMPDGQVCRSKMQGNSYSLSNLMNPASGIKMEAAAAWYASRLSGYKKVQGYASGRSQIAFYNSDRTIVIFLTGQLGAQGGNPDAFSVAYERYQPGISEKTIVGLTQGNRL
Ga0210388_1036303733300021181SoilGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTVVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0210393_1000681633300021401SoilMKTIWLGAAVLAFALANAAGQAKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDGQVCRSKMQGNSYSLSNLMNPASGIKMEAAAAWYASRLSGYKKVQGYASGRSQIAFYNSDRTIVIFWTGQLGAQGENPDAFSVAYERYQPGISEKTIVGLTQGNRL
Ga0210397_1017795823300021403SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLALIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGEN
Ga0210389_1002154553300021404SoilVREIGAWNSLSTELRIGQLQLKRDNKFVLKERLTMHFRLDDFEKEECPMKTIWLGAAVLAFALANAAGQAKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDGQVCRSKMQGNSYSLSNLMNPASGIKMEAAAAWYASRLSGYKKVQGYASGRSQIAFYNSDRTIVIFWTGQLGAQGENPDAFSVAYERYQPGISEKTIVGLTQGNRL
Ga0210387_1085252923300021405SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTVVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0210386_1126089313300021406SoilAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVCYERSQPGLSVKTIVGLTQGKFVCN
Ga0210383_1002332253300021407SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTFVILLTGQLGAQGENTDAFSVSYERSQPGLAVKTIAGLTQGKFVCN
Ga0210394_1004356923300021420SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0210384_1028506713300021432SoilMKTIWLGAAVLAFALANAAGQAKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDGQVCRSKMQGNSYSLSNLMNPASGIKMEAAAAWYASRLSGYKKVQGYASGRSQIAFYNSDRTIVIFWTGQLGAQGENPNAFSVAYQPGISEKTIVGLTQGNRL
Ga0210391_1007784813300021433SoilATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0210390_1015696923300021474SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLALIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0210398_10002377163300021477SoilVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNANGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0210398_1112596213300021477SoilVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIACLTQGKFVCN
Ga0210402_1004459033300021478SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMEAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0210409_1002551543300021559SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLALIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGESTDAFSVCYERSQPGLSVKTIVGLTQGKFVCN
Ga0212123_1002003233300022557Iron-Sulfur Acid SpringMKMLCLCAVVLALTLANARGQSKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPEAQICKSKMQGEFYSLSSPYSKNKIKLDDATAWYASHLSGFKKVQGYESGRSQIAFYNSNGTGVIFLTGNNGAQGENTDAYSVAYERYQPGISEKTITSLTQGKIVCQ
Ga0212123_1010154233300022557Iron-Sulfur Acid SpringMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTFVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0207687_1171451513300025927Miscanthus RhizosphereAKVLTADPLTGLPLIPATVLFKNVGNEPDKLPDGQVCKSKMQGNFYSLSNVMNPASGIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNSDATILIFIIGQRGAQGENTDAFSVAYERYQPGLFVKTVTGLTQGKFICQ
Ga0207686_1033364313300025934Miscanthus RhizosphereMKNAIVVAGILVLGLANAAGQGKVLTNDPLTGLPLIPATAIANKPVKMPDGQVCKSKMQGNFYSLFSPTSYFSKQTIKVSEVVGWYASHLSGFNKVSGYESRRSHTAFYSSDRTILIIVTGNPGATGENTDAYSVAYERYQPGLSEKTITSLTQGKIACQ
Ga0207703_1019746633300026035Switchgrass RhizosphereMKAICLCVAVFAATLASAAGQAKVLTADPLTGLPLIPATVLFKNVGNEPDKLPDGQVCKSKMQGNFYSLSNVMNPASGIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNSDATILIFIIGQRGAQGENTDAFSVAYERYQPGLFVKTVTGLTQGKFICQ
Ga0208240_102199613300027030Forest SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNANGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0208241_102012723300027297Forest SoilVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNANGTLVILLTGQLGAQGENTDAFSVCYERSQPGLSVKTIVGLTQGKFVCN
Ga0209060_10000025193300027826Surface SoilLTFCLANATGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKLSDVQVCKSKGQGNFYSLSNIMNPASGLKMDAAAAWYASHLSGFKKVQGYESGRSQIAFYNSDKTIVILLTGQLGAEGENANAYGVSYERFQPGISEKAIGPNTGQVHLQLIGAERC
Ga0209693_1025282913300027855SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTVVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIA
Ga0209166_1000161173300027857Surface SoilMKNKLLAAAILTFCLANAAGQGKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDARVCKSKVQGDVYSLSNVMNPANGVKMEAAAAWYASHLSGYKKVQGYADGRSQIAFYNSNGTIVVFLTGQLGAQGESADAFSVTYERSQPGISEKTITSLTEGKIVCQ
Ga0209166_1000497073300027857Surface SoilMKNNVLVSVIVALGVASASGQAKVLTADPLTGLPLIPATVVFKNVGNEPDKIPDGQVCKSKMQGNVYSLSTVMNPASGIKMDAVAAWYASHLSGFKKIQGYADGRSQIAFYNSDGTLLIFIIGQLGAQGENTDAFSVAYGRYQPGLSVKTITGLTQGKFICQ
Ga0209169_1069199113300027879SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTVVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAG
Ga0209168_1018423023300027986Surface SoilVFTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKMQGNFYSLSNIMNPANGIKMDAAAEWYASHLSGYKKVQGYESGRTQIGFYNSDGSIVIFLTGQRGAQGENADAYSVAYERYQPGISEKTIVGLTQGKIACN
Ga0308309_1088654713300028906SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKTPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTVVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIACLTQG
Ga0265760_1001747853300031090SoilGLALIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTLVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIVGLTQGKFVCN
Ga0310686_10397313113300031708SoilNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGNFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNADGTVVILLTGQLGAQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0310686_11031275113300031708SoilMNNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNERDKLPDGQVCKSKMQGNFYSLSNVMNPANAVKIDAAAAWYASHLSGYKKIQGYADGRLQIAFFNSDRTIVIFLTGQAGAQGENTDAYSVAYEHYQPGISEKTITSLTQGKIVCQ
Ga0307476_1077313913300031715Hardwood Forest SoilMKTIWLGAAVLAFALANAAGQAKVLTNDPLTGLPPIPATVQFNVGNEPDKMPDGQVCRSKMQGNSYSLSNLMNPASGIKMEAAAAWYASRLSGYKKVQGYASGRSQIAFYNSDRTIVIFLTGQLGAQGGNPDAFSVAYERYQPGIS
Ga0307474_1041705733300031718Hardwood Forest SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKTQGDFYSLSTVMNPANAIKMDAAAAWYASHLSGFKKIQGYADGRSQIAFYNANGTLVILLTGQLGTQGENTDAFSVSYERSQPGLSVKTIAGLTQGKFVCN
Ga0307474_1072053513300031718Hardwood Forest SoilMKNTILTAAVIAFCLATAAGQAKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDGQVCKSKVQGNFYSLSNVMNPANAVKMDAAAWYALHLSGYKKVQGYESGRSQIAFYNSDGTLVIFLTGQLGAQGESTDAYSVAYEHYQPGISEKTITSLTQGKIVCP
Ga0308176_1058095923300031996SoilMKSNVLVSIIVALGVTSATGQPKVLTNDPLTGLPLIPATVQFKNVGNEPDKMPDGQVCKSKMQGNSYSLSNVMNPASGIKMNAAAAWYASHLSGFKKIQGYESGRSQIAFYNSAGTMVIFLTGQQGAQGENTDAFSVAYERYQPGLSEKTITSLTQGKIVCQ
Ga0335085_1004946913300032770SoilMEGWASPLRRVAYIFRTEKSMKNAIVGATILVLALANAAGQAKVLTNDPLTGLALIPATVVVDNAGNLPVKMPDGRVCKSKMQGNFYSLYNYFSKHNIKVSNVLAWYSSHLSGYHKVSGYESRRSQTAFYNSDRTILIIVTGNPGAAGEDTDAYSVAYERYEPGLSEKTATSLTQGKISC
Ga0335085_1042956343300032770SoilMKNAIVIAGILVLGLANAAGQGKVLTNDPLTGLPLIPATAIAKKPVKMPDGQVCKSKMQGNFYSLFSPTSYFSKQTIKVSEVVAWYASHLSGFNKVNGYESRRSQTVFYSSDRTILIIVTGNPGATGEDTDAYSVGYERYQPGLSEKTITSLTQGKIACQ
Ga0335079_10001756223300032783SoilMKAILCAVVMAVLLANAAGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQVCKSKMQGEFYSLSTVMNPANAIKMDAAAAWYASHLSGYTKVQGYESGRSQIAFYNSDGTIVIFLTGQRGAPGENTGASSVAYERYQPGLSEKTIDGLTHGKIICQ
Ga0335079_1000957743300032783SoilMKTSCIYAAMLIFTFTNATSQPKTLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQICKSKMQGNFYSLSTVMKPANGLKMDAAAAWYASHLSGFKKVQGYESGRTQIAFYNSDGTIVIFLTGQLGAQGENADAYSVAYERYQPGISEKTIVSLTQGKIICN
Ga0335079_1002200643300032783SoilMKLTLLIATMLIFSLASAAGQSKVLTNDPLTGLPLIPATVVVKNAGNEPVKMPDGQVCKSKMQGDFYSLYNYFSQNKIKLQEAIAWYASHLSGFTKVQGYESGRSQIAFYNSDRTVVIFLTGNNGAQGENTDAYSVAYEHYQPGISEKTITSLTQGKIVCQ
Ga0335079_1059258623300032783SoilMKNILLIASILTLGVAAQGQAAKVLTSDPLTGLPLIPATVQFKNVGNEPDKMPDVRVCKSKVQGNVYSLSNVMNPANGVKMEAAAAWYASHLSGYKKVQGYANGRSQIAFYNSDGTIVVFLIGQLGAQGENADAFSVTYERSQPGLSEKTITNLTQGKIVCQ
Ga0335078_1084792723300032805SoilMKNNLLAAAILTFCIAHAAGQAKVLTNDPLTGLALIPATVVVESAGNEPVKMPDGQVCRSKMQGNFYDLYSYFSKHNVKVSDVVAWYASHLSGFSKVSGYDSSRSQTAFYNSDRTILIIVTGNPGTKGADTEAYSVAYERYQPGLSEKTIESLTQGKIVCP
Ga0335080_1015247733300032828SoilMKNVLAAAILSFCLANAAGQAKVLTSDPLTGVPLIPATVLFKNVGNEPDKLPDGQVCKSKMQGNLYLLSNVMNPASALKMDAAAAWYAAHLSGYKKVAGYEGSRSQIAFYNSDKTIVVILTGQLGAKGENADAFSVAYERYQPGLSEKTISSLTQGKIACN
Ga0335080_1099204513300032828SoilTSQPKTLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQICKSKMQGNFYSLSTVMKPANGLKMDAAAAWYASHLSGFKKVQGYESGRTQIAFYNSDGTIVIFLTGQLGAQGENADAYSVAYERYQPGISEKTIVSLTQGKIICN
Ga0335070_1001812233300032829SoilMKLTLLIATMLVFSLSNAAGQAKVLTNDPLTGLPLIPATVVVKNAGNEPEKLPDAQVCKSKMQGDFYSLYNYFSQNKIKINEATVWYAAHLSGFKKVQGYESGRSQIAFFNSDRTIVIFLTGNKGAPGEDTEAYSVSYQRYQPGISEETIAGLTRGNIVCR
Ga0335081_1001902933300032892SoilMRNAVALAALLALSLTNAAGQGKTLTNDPLTGLPLIPATVQFKNVGNEPDKLPDVKVCKSKGQGNFYSLSTIMNPASALKMDAAAAWYASHLSGFKKVQGYESGRSQIAFYNADKTIVILLTGQLGAEGENADAYGVAYERFQPGISEKAIAGLTQGKFICY
Ga0335081_1006123653300032892SoilMKNILLIATILTLGVAAQGQAAKVLTSDPLTGLPLIPATVQFKNVGNEPDKMPDVRVCKSKVQGNVYSLSNVMNPANGVKMEAAAAWYASHLSGYKKVQGYANGRSQIAFYNSDGTIVVFLIGQLGAQGENADAFSVTYERSQPGLSEKTITNLTQGKIVCQ
Ga0335081_1132022623300032892SoilTGLPLIPATVVVKNAGNEPVKMPDGQVCKSKMQGDFYSLYNYFSQNKIKLQEAIAWYASHLSGFTKVQGYESGRSQIAFYNSDRTVVIFLTGNNGAQGENTDAYSVAYEHYQPGISEKTITSLTEGKIVCQ
Ga0335071_1100627423300032897SoilSLIATIGQQNARSWAALRGMVHIFRMEKSMTNAVALAALLALSLTNAAGQGKTLTNDPLTGLPLIPATVLFKNVGNEPDKLPDVKVCKSKGQGNFYSLSTIMNPASTLKMDAAAAWYASHLSGFKKVQGYESGRSQIAFYNSDKTIVILLTGQLGAEGENADAYGVTYERFQPGISEKAIAGLTQGKFICN
Ga0335072_1083138113300032898SoilMRSHSLIATIGQQMPDLGPALRGMVQIFRMERSMRNAVALAVLLALSLTNAAGQGKTLTNDPLTRLPLIPATVLFKNVGNEPDKMPDTRVCRSRVQGNVYSLSSVMNPANGVKMEAAAAWYASHLSGFKKVQGYADGRPQIAFFNSEGTIVVFLTGQLGAKGENTDAFSVSYERCQPGISEKAIAGLTQGEFICN
Ga0335083_1008920723300032954SoilMKLTLLIATMLIFSLACAAGQSKVLTNDPLTGLPLIPATVVVKNAGNEPVKMPDGQVCKSKMQGDFYSLYNYFSQNKIKLNDAVAWYASHLSGFTKVQGYESGRSQTAFYNSDRTVVIFLTGTNGAQGENTDAYSVAYEHYQPGISEKTITSLTQGKIVCQ
Ga0335084_1038944723300033004SoilMKNNLLTAAILTFCLAAATGQAKVLTNDPLTGLPLIPATVQFKNVGNEPDKLPDAQVCRSKVQGNFYSLSNVMNPASAIKMDAAAAWYASHLSGFKKVQGYESGRSQIAFYKPDGTVVILLTGQLGAPNENADAFSVAYGRYQPGISEKTIAGLTQGKFVCN
Ga0335077_1081954813300033158SoilYAAMLIFTFTNATSQPKTLTNDPLTGLPLIPATVLFKNVGNEPDKMPDGQICKSKMQGNFYSLSTVMKPANGLKMDAAAAWYASHLSGFKKVQGYESGRTQIAFYNSDGTIVIFLTGQLGAQGENADAYSVAYERYQPGISEKTIVSLTQGKIICN
Ga0335077_1224914213300033158SoilLYLCAAVLALTVVNAAGQAKVLTNDPLTGLPLIPATVVVENAGNEPVKMPDGQVCKSKMQGNFYDLYNYFSKHNVKVSNVVAWYASHLSGFNKVSGYDSRRSQTAFYNSDRTILIIVTGNPGAAGEDTDAYSVAYERYQPGLSEKTIVSLTQGKIVCN




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice