NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F069103

Overview

Basic Information
Family ID F069103
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 91 residues
Representative Sequence MALKSRVFDDLLYFASASRSKSAVTLAAVSFAVCHLVVMGTASSGVSADIDVEIPRQLVHFAAEFCRFALPLGFMVAGFAIRAKKAPPGQP
Number of Associated Samples 72
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 87.90 %
% of genes near scaffold ends (potentially truncated) 17.74 %
% of genes from short scaffolds (< 2000 bps) 67.74 %
Associated GOLD sequencing projects 66
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.35


Most Common Taxonomy
Group Bacteria (68.548 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (43.548 % of family members)
Environment Ontology (ENVO) Unclassified (42.742 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (69.355 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 60.50%, β-sheet: 0.00%, Coil/Unstructured: 39.50%

Predicted 3D Structure

pTM-score: 0.35

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
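The low-confidence rule quoted in the note (pTM < 0.7) can be written as a one-line check. This is only a minimal sketch: the function name `is_low_confidence_model` and the constant name are hypothetical and not part of NMPFamsDB, and the only value taken from this page is the 0.7 cutoff.

```python
# Cutoff taken from the note on this page ("pTM < 0.7"); the names are illustrative.
LOW_CONFIDENCE_PTM_CUTOFF = 0.7


def is_low_confidence_model(ptm_score: float,
                            cutoff: float = LOW_CONFIDENCE_PTM_CUTOFF) -> bool:
    """Return True when a predicted model falls below the pTM cutoff."""
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM-score is expected to lie between 0 and 1")
    return ptm_score < cutoff


# Family F069103 reports a pTM-score of 0.35.
print(is_low_confidence_model(0.35))
```

With the pTM-score of 0.35 reported for this family, the check returns `True`, which matches the "Low Quality Model" label.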


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 124 Family Scaffolds
PF00248 | Aldo_ket_red | 17.74
PF05656 | DUF805 | 8.87
PF07152 | YaeQ | 6.45
PF01979 | Amidohydro_1 | 4.03
PF02586 | SRAP | 2.42
PF07394 | DUF1501 | 2.42
PF04471 | Mrr_cat | 1.61
PF00486 | Trans_reg_C | 1.61
PF07045 | DUF1330 | 1.61
PF03033 | Glyco_transf_28 | 1.61
PF08530 | PepX_C | 1.61
PF04397 | LytTR | 1.61
PF00072 | Response_reg | 0.81
PF00149 | Metallophos | 0.81
PF13432 | TPR_16 | 0.81
PF11752 | DUF3309 | 0.81
PF09154 | Alpha-amy_C_pro | 0.81
PF01738 | DLH | 0.81
PF02633 | Creatininase | 0.81
PF07715 | Plug | 0.81
PF03176 | MMPL | 0.81
PF07224 | Chlorophyllase | 0.81
PF04951 | Peptidase_M55 | 0.81
PF05988 | DUF899 | 0.81
PF09922 | DUF2154 | 0.81
PF06580 | His_kinase | 0.81
PF00296 | Bac_luciferase | 0.81
PF01261 | AP_endonuc_2 | 0.81
PF16655 | PhoD_N | 0.81
PF00542 | Ribosomal_L12 | 0.81
PF09286 | Pro-kuma_activ | 0.81
PF00512 | HisKA | 0.81
PF12697 | Abhydrolase_6 | 0.81
PF09954 | DUF2188 | 0.81
PF03807 | F420_oxidored | 0.81
PF03918 | CcmH | 0.81
PF13473 | Cupredoxin_1 | 0.81
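
The frequencies in the Pfam table are percentages of the 124 family scaffolds, so they can be turned back into approximate scaffold counts. The sketch below assumes each percentage is simply count / 124 rounded to two decimals, which the page does not state explicitly; the helper name `scaffolds_from_percent` is hypothetical.

```python
TOTAL_SCAFFOLDS = 124  # "Number of Associated Scaffolds" for family F069103


def scaffolds_from_percent(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Estimate how many scaffolds a percentage frequency corresponds to.

    Assumes the percentage was computed as count / total and then rounded.
    """
    return round(percent / 100 * total)


# A few rows copied from the Pfam table above.
for name, percent in [("Aldo_ket_red", 17.74), ("DUF805", 8.87), ("Response_reg", 0.81)]:
    print(name, scaffolds_from_percent(percent))
```

Under that assumption, the many 0.81 % rows each correspond to a single scaffold, so those neighbouring domains are weak evidence on their own.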

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 124 Family Scaffolds
COG3152 | Uncharacterized membrane protein YhaH, DUF805 family | Function unknown [S] | 8.87
COG4681 | Uncharacterized conserved protein YaeQ, suppresses RfaH defect | Function unknown [S] | 6.45
COG2135 | ssDNA abasic site-binding protein YedK/HMCES, SRAP family | Replication, recombination and repair [L] | 2.42
COG2936 | Predicted acyl esterase | General function prediction only [R] | 1.61
COG5470 | Uncharacterized conserved protein, DUF1330 family | Function unknown [S] | 1.61
COG0222 | Ribosomal protein L7/L12 | Translation, ribosomal structure and biogenesis [J] | 0.81
COG1033 | Predicted exporter protein, RND superfamily | General function prediction only [R] | 0.81
COG1402 | Creatinine amidohydrolase/Fe(II)-dependent FAPy formamide hydrolase (riboflavin and F420 biosynthesis) | Coenzyme transport and metabolism [H] | 0.81
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 0.81
COG2409 | Predicted lipid transporter YdfJ, MMPL/SSD domain, RND superfamily | General function prediction only [R] | 0.81
COG2972 | Sensor histidine kinase YesM | Signal transduction mechanisms [T] | 0.81
COG3088 | Cytochrome c-type biogenesis protein CcmH/NrfF | Posttranslational modification, protein turnover, chaperones [O] | 0.81
COG3275 | Sensor histidine kinase, LytS/YehU family | Signal transduction mechanisms [T] | 0.81
COG4312 | Predicted dithiol-disulfide oxidoreductase, DUF899 family | General function prediction only [R] | 0.81
COG4934 | Serine protease, subtilase family | Posttranslational modification, protein turnover, chaperones [O] | 0.81


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 70.16 %
Unclassified | root | N/A | 29.84 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300001546|JGI12659J15293_10057090 | All Organisms → cellular organisms → Bacteria | 880
3300002245|JGIcombinedJ26739_100367417 | All Organisms → cellular organisms → Bacteria | 1319
3300002245|JGIcombinedJ26739_100411706 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 1230
3300004082|Ga0062384_100249088 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Suessiales | 1076
3300004091|Ga0062387_100591803 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 793
3300004092|Ga0062389_101297414 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium HGW-Gammaproteobacteria-7 | 913
3300004092|Ga0062389_101895820 | All Organisms → cellular organisms → Bacteria | 774
3300004092|Ga0062389_102148358 | Not Available | 733
3300004092|Ga0062389_104083602 | Not Available | 549
3300005591|Ga0070761_10253818 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1051
3300005602|Ga0070762_10000286 | All Organisms → cellular organisms → Bacteria | 20252
3300006176|Ga0070765_100200173 | All Organisms → cellular organisms → Bacteria | 1812
3300009633|Ga0116129_1027134 | Not Available | 1936
3300012683|Ga0137398_10110677 | All Organisms → cellular organisms → Bacteria | 1740
3300012924|Ga0137413_10017991 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3655
3300012925|Ga0137419_11398018 | Not Available | 591
3300012929|Ga0137404_12122261 | Not Available | 525
3300014501|Ga0182024_11102968 | Not Available | 937
3300014501|Ga0182024_12128766 | All Organisms → cellular organisms → Bacteria | 616
3300015160|Ga0167642_1011501 | Not Available | 1544
3300015206|Ga0167644_1079566 | Not Available | 1044
3300020004|Ga0193755_1139621 | All Organisms → cellular organisms → Bacteria | 743
3300020021|Ga0193726_1000009 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 744665
3300020021|Ga0193726_1000581 | All Organisms → cellular organisms → Bacteria | 37344
3300020021|Ga0193726_1014292 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Oxalobacteraceae → Massilia group → Duganella → unclassified Duganella → Duganella sp. SG902 | 4160
3300020027|Ga0193752_1211317 | All Organisms → cellular organisms → Bacteria | 731
3300020034|Ga0193753_10001023 | All Organisms → cellular organisms → Bacteria | 25018
3300020579|Ga0210407_10100410 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 2198
3300020580|Ga0210403_10029480 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4381
3300020580|Ga0210403_10051576 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3290
3300020580|Ga0210403_10171864 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1775
3300020580|Ga0210403_10360838 | Not Available | 1189
3300020581|Ga0210399_10081144 | All Organisms → cellular organisms → Bacteria | 2636
3300020581|Ga0210399_10252239 | Not Available | 1471
3300020582|Ga0210395_10375180 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1070
3300020582|Ga0210395_11026956 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 610
3300020583|Ga0210401_10016670 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 7084
3300020583|Ga0210401_10320226 | Not Available | 1412
3300020583|Ga0210401_10669450 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 899
3300021168|Ga0210406_10180438 | Not Available | 1759
3300021168|Ga0210406_10320482 | Not Available | 1256
3300021168|Ga0210406_11011617 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 618
3300021178|Ga0210408_11085853 | Not Available | 616
3300021178|Ga0210408_11101859 | Not Available | 611
3300021180|Ga0210396_10089685 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → unclassified Steroidobacteraceae → Steroidobacteraceae bacterium | 2787
3300021181|Ga0210388_10001706 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 17277
3300021181|Ga0210388_10358293 | Not Available | 1282
3300021181|Ga0210388_11460742 | Not Available | 572
3300021401|Ga0210393_10011286 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 6960
3300021401|Ga0210393_10069798 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Beijerinckiaceae → Methylocella → Methylocella silvestris → Methylocella silvestris BL2 | 2773
3300021401|Ga0210393_11047780 | Not Available | 659
3300021402|Ga0210385_11489043 | All Organisms → cellular organisms → Bacteria | 517
3300021403|Ga0210397_10026254 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 3612
3300021403|Ga0210397_10054604 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2572
3300021403|Ga0210397_10983466 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 654
3300021404|Ga0210389_10135152 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1909
3300021404|Ga0210389_10358507 | Not Available | 1145
3300021404|Ga0210389_11273437 | Not Available | 564
3300021404|Ga0210389_11491602 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 514
3300021405|Ga0210387_10004672 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 10483
3300021405|Ga0210387_10004792 | All Organisms → cellular organisms → Bacteria | 10363
3300021405|Ga0210387_10117446 | Not Available | 2243
3300021405|Ga0210387_10880438 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 788
3300021407|Ga0210383_10261618 | Not Available | 1486
3300021420|Ga0210394_10003031 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 21932
3300021420|Ga0210394_10003749 | All Organisms → cellular organisms → Bacteria | 18556
3300021420|Ga0210394_10306800 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1393
3300021474|Ga0210390_10177846 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1797
3300021474|Ga0210390_10360958 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1229
3300021475|Ga0210392_10347512 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1072
3300021475|Ga0210392_10368907 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1041
3300021475|Ga0210392_11056112 | Not Available | 608
3300021477|Ga0210398_10004050 | All Organisms → cellular organisms → Bacteria | 13871
3300021478|Ga0210402_10198101 | All Organisms → cellular organisms → Bacteria | 1845
3300021479|Ga0210410_10395588 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → Steroidobacter | 1238
3300022533|Ga0242662_10093221 | All Organisms → cellular organisms → Bacteria | 848
3300025633|Ga0208480_1076200 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 824
3300027117|Ga0209732_1020602 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1119
3300027528|Ga0208985_1000644 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5259
3300027545|Ga0209008_1085532 | All Organisms → cellular organisms → Bacteria | 707
3300027559|Ga0209222_1066076 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 695
3300027619|Ga0209330_1000302 | All Organisms → cellular organisms → Bacteria | 23262
3300027652|Ga0209007_1001484 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7005
3300027684|Ga0209626_1081185 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 831
3300027895|Ga0209624_10000830 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 26201
3300028016|Ga0265354_1000288 | All Organisms → cellular organisms → Bacteria | 9069
3300028021|Ga0265352_1002227 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Suessiales | 966
3300028036|Ga0265355_1000483 | Not Available | 2094
3300029910|Ga0311369_10076457 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3484
3300030730|Ga0307482_1126254 | Not Available | 726
3300030738|Ga0265462_11181222 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 681
3300030741|Ga0265459_14289161 | Not Available | 500
3300030862|Ga0265753_1048430 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium jicamae | 747
3300030923|Ga0138296_1483222 | Not Available | 525
3300030923|Ga0138296_1857133 | Not Available | 632
3300031015|Ga0138298_1561111 | Not Available | 545
3300031057|Ga0170834_103796921 | Not Available | 953
3300031090|Ga0265760_10108492 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 883
3300031128|Ga0170823_10485938 | Not Available | 568
3300031231|Ga0170824_104098553 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Eisenbacteria → Candidatus Eisenbacteria bacterium | 2408
3300031231|Ga0170824_105046881 | Not Available | 509
3300031231|Ga0170824_115294286 | All Organisms → cellular organisms → Bacteria | 573
3300031231|Ga0170824_127188899 | Not Available | 543
3300031446|Ga0170820_11055667 | Not Available | 524
3300031446|Ga0170820_17542760 | All Organisms → cellular organisms → Bacteria | 1091
3300031474|Ga0170818_108284213 | All Organisms → cellular organisms → Bacteria | 674
3300031708|Ga0310686_101656683 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 726
3300031708|Ga0310686_102257766 | All Organisms → cellular organisms → Bacteria | 10956
3300031708|Ga0310686_103577601 | All Organisms → cellular organisms → Bacteria | 26767
3300031708|Ga0310686_107557373 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 8125
3300031708|Ga0310686_108725333 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium jicamae | 565
3300031708|Ga0310686_109776483 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 21691
3300031708|Ga0310686_111184529 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 799
3300031708|Ga0310686_116089199 | All Organisms → cellular organisms → Bacteria | 28097
3300031708|Ga0310686_116513254 | Not Available | 502
3300031708|Ga0310686_118598797 | All Organisms → cellular organisms → Bacteria | 4054
3300031715|Ga0307476_10002658 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 10605
3300031715|Ga0307476_10132800 | All Organisms → cellular organisms → Bacteria | 1779
3300031715|Ga0307476_10844217 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → Steroidobacter | 677
3300031715|Ga0307476_11075734 | Not Available | 591
3300031718|Ga0307474_10001340 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 17912
3300031718|Ga0307474_10036655 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae → Mizugakiibacter → Mizugakiibacter sediminis | 3610
3300031823|Ga0307478_10669678 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 869
3300034124|Ga0370483_0313918 | Not Available | 541



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 43.55%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 15.32%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 8.87%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 7.26%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.45%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 4.84%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 3.23%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 2.42%
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 1.61%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 1.61%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere | 1.61%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 0.81%
Arctic Peat Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil | 0.81%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.81%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 0.81%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001546 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O1 | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300004082 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3 | Environmental
3300004091 | Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300005591 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300009633 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_10 | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental
3300015160 | Arctic soil microbial communities from a glacier forefield, Russell Glacier, Kangerlussuaq, Greenland (Sample G7C, Adjacent to main proglacial river, mid transect (Watson river)) | Environmental
3300015206 | Arctic soil microbial communities from a glacier forefield, Russell Glacier, Kangerlussuaq, Greenland (Sample G8B, Adjacent to main proglacial river, end of transect (Watson river)) | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300020021 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1 | Environmental
3300020027 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1c1 | Environmental
3300020034 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1c2 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021181 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021402 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-O | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021474 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300022533 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2) | Environmental
3300025633 | Arctic peat soil from Barrow, Alaska - NGEE Surface sample F53-1 shallow-092012 (SPAdes) | Environmental
3300027117 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_O1 (SPAdes) | Environmental
3300027528 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_O1 (SPAdes) | Environmental
3300027545 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O3 (SPAdes) | Environmental
3300027559 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_O3 (SPAdes) | Environmental
3300027619 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O3 (SPAdes) | Environmental
3300027652 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_O2 (SPAdes) | Environmental
3300027684 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2 (SPAdes) | Environmental
3300027895 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O1 (SPAdes) | Environmental
3300028016 | Rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZE1 | Host-Associated
3300028021 | Soil microbial communities from Maridalen valley, Oslo, Norway - NSE5 | Environmental
3300028036 | Rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZE2 | Host-Associated
3300029910 | III_Palsa_E2 coassembly | Environmental
3300030730 | Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 (Metagenome Metatranscriptome) | Environmental
3300030738 | Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada VDE Co-assembly | Environmental
3300030741 | Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada ANR Co-assembly | Environmental
3300030862 | Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSE5 (Metagenome Metatranscriptome) | Environmental
3300030923 | Forest soil microbial communities from Spain - ITS-tags Site 9-Mixed-thinned forest site A3_MS_autumn Metatranscriptome (Eukaryote Community Metatranscriptome) (version 2) | Environmental
3300031015 | Forest soil microbial communities from Spain - ITS-tags Site 9-Mixed-thinned forest site A9_MS_autumn Metatranscriptome (Eukaryote Community Metatranscriptome) (version 2) | Environmental
3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031090 | Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZI1 (Metagenome Metatranscriptome) | Environmental
3300031128 | Oak Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031474 | Fir Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300034124 | Peat soil microbial communities from wetlands in Alaska, United States - Goldstream_06D_14 | Environmental

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12659J15293_100570903 3300001546 Forest Soil MAPKSRIFDELLRFASTSRSRSMVTLAAVCFAVCHLIAMGTGSAAAGGTTDLEGEIPRQLIHFVAELCRFALPLGVM
JGIcombinedJ26739_1003674172 3300002245 Forest Soil MAPKSRIFDELLRFASTSRSRSTVTLAAVSFAICHLVAMGTSPAAAGGTADLAGEIPRQLIHFVAELCRFALPLAVMVVGMIHGRSKQSPLS*
JGIcombinedJ26739_1004117062 3300002245 Forest Soil MARKSRIFDELLRFASTSRSRSMVTLAAVCFAVCHLIAMGTGSAAAGGTTDLEGEIPRQLIHFVAELCRFALPLGVMVVGMIHGRSKQSPLS*
Ga0062384_1002490881 3300004082 Bog Forest Soil MMPKSQIFDDLQRFASTSRSKSMVTLAAVSFAICHLIAMVTSPAAAGGAADLDGEIPRQLIHFVAELCRFALPLGVMVVGMIHGRSKQSPVS*
Ga0062387_1005918031 3300004091 Bog Forest Soil MMPKSQIFDDLQRFASTSRSKSMVTLAAVSFAICHLIALVTSPAAAGGAADLDGEIPRQLIHFVAELCRFALPLGVMVVGMIHGRSKQSPVS*
Ga0062389_1012974142 3300004092 Bog Forest Soil MAPKAQLFDDLLNFVSTQRSRSTVTLAAVSFAICHVIVMGTAPSPAGAADLDAEIPRQLMHFAAELFRFALPLVFLVAGLIIRARPAQIPSPKG*
Ga0062389_1018958201 3300004092 Bog Forest Soil MAPKSRIFDELLRFASTSRSRSTVTLAAVCFAVCHLVAMGTNAAAGGGTADLEGEIPRELIHFVAELCRFALPLGVMLVGMIQGRSKQSPLS*
Ga0062389_1021483582 3300004092 Bog Forest Soil MALKSSIFDELLDFAATSRSKSAVTLAATSFAICQLVVMGTASPGASADLDAEMPRRLIHFAAELCQFALPLGFMVAGVVIRAKKTPPSQPKR*
Ga0062389_1040836022 3300004092 Bog Forest Soil MALKLRVFDDLLDFAATSRSKSAVTHAAIAFAICQLVVMGTAASGVSADLDAVIPRQLIHFAAELCQFALPLGFMVTGLWIRAKK
Ga0070761_102538182 3300005591 Soil MALKSQVFDDLLHFAATSRSRSTVTLAAVSFAVCHLIAMGSASAGAGGSADLGADIPRQLIHFVAELCRFALPLGVMIVGLISARSKQSTES*
Ga0070762_1000028610 3300005602 Soil MAPASRIFDELLRFASTSRSRSMVTLAAVSFAICHLIAMATAASGGTTDLEGEIPRQLIHFVAELCRFALPLGVMAVGMIHGRSKQSPLS*
Ga0070765_1002001733 3300006176 Soil MLEPEWKRLNFKDGFMADKSQVFDGLLHFASTPQSRSMVTLAAVSFAVCHLVVMGTSAAPMNGTADSDLEIPRQLMHFAAELCRFALPLGVVVVGIIRGRSKQSTGAPRARL*
Ga0116129_10271343 3300009633 Peatland MAPKSRIFDELLRFASTSRSRSMVTLAAVCFAVCHLIAMGTGSAAAGGTTDLEGEIPRQLIHFVADLCRFALPLGVMVVGMIHGRSKQSPLS*
Ga0137398_101106773 3300012683 Vadose Zone Soil LKSRVFDDLLQFASTARSKTAVTLAAVSFAICHLIVMGTASLGASADLDLEIPRQIIHFAAELCRFALPLGFMVAGFAIHSKKARPR*
Ga0137413_100179915 3300012924 Vadose Zone Soil MALKTRVFDDLLQFASTARSKTAVTLAAVSFAICHLIVMGTASLGASADLDLEIPRQIIHFAAELCRFALPLGFMVAGFAIHSKKARPR*
Ga0137419_113980181 3300012925 Vadose Zone Soil MALKPRVFADLLQFASTARSKSAVTLAAVSFAICHLIVMGTASLGASADLDLEIPRQIIHFAAELCRFALPLGFMVAGFAIDSKKARPR*
Ga0137404_121222611 3300012929 Vadose Zone Soil MALKSRVDDLLHFASASRSKSAVTLAAVSFAICHLVVMGTAPHGASADIDAEIPRQIIYFAAELCRFALPLAFMLAGFAIRAKKAPPNQPKT*
Ga0182024_111029682 3300014501 Permafrost LKSRIFDDLLHFASTSRSRSTVTLAAVSFAVCHLVVMGTSSAPASGSADLQGDIPRQLIHFAAELCRFALPLGVMVIGMIQQHTRR*
Ga0182024_121287661 3300014501 Permafrost NDSMARKSHFVDDLLRLTSASRSRSAVTLAAVSFAICHFLVMGTASAAVSADLDAEIPLQLIHFAAVVCRFALPLGFMVVGFTTHAKAVRADQRKR*
Ga0167642_10115012 3300015160 Glacier Forefield Soil MAVKSQVFNDLLHFASTSRSKSTVTLAAVSFAVCHLVVMGTYAAPVSGTADLDLEIPRQLVHFAAELCRFALPLGVVIVGMIRGWSKQSTLS*
Ga0167644_10795662 3300015206 Glacier Forefield Soil TGSMAVKSQVFNDLLHFASTSRSKSTVTLAAVSFAVCHLVVMGTYAAPVSGTADLDLEIPRQLVHFAAELCRFALPLGVVIVGMIRGRSKQSTLS*
Ga0193755_11396211 3300020004 Soil MALNSRVFDDLLHFTSASRSKTAVILAAVSFAVCHLVVMGTSSPAASADLDAEIPRQIIYFAAELCRFALPLGFMVAGFAIRAKKSPPSPPKK
Ga0193726_1000009195 3300020021 Soil MAVKSPVFDDLLHLASTSRSKSTVTLAAVSFAVCHLIVMGTYAAPSSGTADDLEIPRQLVHFAAELCRFALPLGVVIVGMIRGWSKQSTLP
Ga0193726_100058117 3300020021 Soil MANLQSAWDSFLHIVSTSRSRSAVTLAAVSFAVCHLVVMGSGSSPAGITADLDVEIPRQLLHFGAVICRFALPLGFLVAGFATRTKTARISHRKR
Ga0193726_10142926 3300020021 Soil MAYRRGQLSDLLQLASTSRLRSAVTLAAVSFAVCHLVSMATGPGSAGGSADLDIEIPRQLVHFAAELCRFALPLGFMVAGLAIPAKTRPPGKPKQ
Ga0193752_12113172 3300020027 Soil MALKSRVFDDLLHFASASRSRSAVTLAAVSFAICHLVVMGTASHGGTADLDAEIPRQIIYFAAELGRFTLPLAFMVAGFAIRAKKAPPSQPKR
Ga0193753_1000102322 3300020034 Soil MAFKSQVFDDLLHFASASRSKSAVTLAAVSFAICHLVVMGTRSLGASADLDVELPRQIIYFAAELCRFALPLGFIVAGFATRAKKASPSQPKR
Ga0210407_101004102 3300020579 Soil MALKSRVFDDFLHFASTSRSKSAVTMAAVSFAICQLIVMGTASSGVSADLDAEIPRQLIHFAAELCQFALPLGFMVAGVVIRAKKAPAEK
Ga0210403_100294806 3300020580 Soil MALKSRVFDDFLHFASTSRSKSAVTMAAVSFAICQLIVMGTASSGVSADLDVEIPRQLVHFAAELCQFVLPLGFMVAGVVIRAKKAPAEK
Ga0210403_100515764 3300020580 Soil MALKSHFVDDLLRVASASRSRTAVTFAAVSFAICHFIVMGTANAAVTADLDAEIPLQLVHFAAVVCRFALPLGFMVVGFATHGKAVRAGQRKS
Ga0210403_101718643 3300020580 Soil MALKSRVFDDLLHFASASRSRSAVTLAAVSFAVCHLAVMGTVSSGASADVDVEIPRQLVHFAAELCRFALPLGFMVAGFAIRAKKPTPGRLER
Ga0210403_103608383 3300020580 Soil MAFKPRVFDRLLQFASAARSKSAVTLAAVSFAICHLVVMGTAASGTSTDLDAEIPRQLVHFAAELCRFALPLGFMVAGFAGRAKTSGNRRRI
Ga0210399_100811445 3300020581 Soil MAFKPRVFDRLLQFASAARSKSAVTLAAASFAICHLVVMGTAASGTSTDLDAEIPRQLVHFAAELCRFALPLGFMVAGFAGRAKTSGNRRRIS
Ga0210399_102522392 3300020581 Soil MALKSHFVDDLLRVASASRSRTAVTLAAVSFAICHFIVMGTANAAVTADLDAEIPLQLVHFAAVVCRFALPLGFMVVGFATHGKAVRAGQRKS
Ga0210395_103751802 3300020582 Soil MALKSRILDDLLHFASASRARSTVTLAAVCFAVCHLVVMGTGSTPAGGSADLQAEMPRQLIRFAAELCRFALPLGVMVVGMVQQLARR
Ga0210395_110269561 3300020582 Soil MAPKSRIFDDLLNFASTSRSRSTVMWAAVCFAICHLVAMGTHAATAGGSADLEGEIPRQVIHFVAELCRFALPLGVMLVGMFHGRSKQARFS
Ga0210401_100166704 3300020583 Soil MALKPRVFDNLLHIASTARSKSAITLAAVSFAVCHLVAISTATSGASADLDAEIPRQLLHFAAELCRFVLPLGFMVAGFAGRAKTSRNRRPK
Ga0210401_103202261 3300020583 Soil MALKSRVLDDLLDFASTSRSKSAVTLAAISFAICQLVVMGTASSSMSADLDAEIPRRLIHFAAELCQFALPLGFMVAGVVIRDKKAPASRVKS
Ga0210401_106694502 3300020583 Soil MALKSRVFDDFLHFASTSRSKSAVTMAAVSFAICQLIVMGTASSGVSADLDAEIPRQLVHFAAELCQFVLPLGFMVAGVVIRAKKAPAKK
Ga0210406_101804384 3300021168 Soil MALKSQVFDDLLHFASTSRSKSAVTLAAVSFAICHLVVMGTGSSGDSADMDVEIPRRLIHFAAELCRFALPLGFMLAGLAIRAKKAPPPGQLER
Ga0210406_103204822 3300021168 Soil MALKSRVFDSLLQFASATRSKSAVTLAAVSFAICHLVVMGTASSGVSTDLDAEIPRQLVHYAAELCRFVLPLGFMVAGFAGRAKTSRNRRRIS
Ga0210406_110116172 3300021168 Soil SMAVKSQVFDDFLHFASTSRSRSIVTLAAVSFAVCHVVVMGSNPVPASVIADLDAEIPRQLVHFVAELCRFALPLGVVVVGMLRGRPKHST
Ga0210408_110858532 3300021178 Soil MAFKPRVFDRLLQFASAARSKSAVTLAAVSFAICHLVVMGTAASGTSTDLDAEIPRQLVHFVAELCRFALPLGFMVAGFAGRAKTSGNRRRIS
Ga0210408_111018591 3300021178 Soil MALKPRVFDNLLHIASTARSKSAITLAAVSFAVCHLVAMGTATSGASADSDVEIPRQLLHFAAELCRFVLPLGFMVAGFAGRAKTSRNRRPK
Ga0210396_100896853 3300021180 Soil MALKTRVFDDFLHFASTSRSKSAVTMAAVSFAICQLIVMGTASSGVSADLDAEIPRQLIHFAAELCQFALPLGFMVAGVVIRAKKAPAEK
Ga0210388_100017066 3300021181 Soil MAPASRIFDELLRFASTSRSRSMVTLAAVSFAICHLIAMATAASGGTTDLEGEIPRQLIHFVAELCRFALPLGVMAVGMIHGRSKQSPLS
Ga0210388_103582931 3300021181 Soil MLEPEWKRLNFKDGFMADKSQVFDGLLHFASTPQSRSMVTLAAVSFAVCHLVVMGTSAAPMNGTADSDLEIPRQLMHFAAELCRFALPLGVVVVGIIRGRSKQSTGAPRARL
Ga0210388_114607421 3300021181 Soil MAVKSQAFDDFLHFASTSRSRSMVTLAAVSFAVCHLVVMGTSPAPVGGTADLDLEIPRQLVHFAAELCRFALPLGVVIVGMIRSRSKKSPRN
Ga0210393_100112866 3300021401 Soil MAHKSRVFGDLLDFVSSSRSRSAVTLAALSFAICHCVVLGTEPAFSGVTTDLDAEIPRQIIHFAAELCRFALPLGFMVAGFATRAKAARSIQRKR
Ga0210393_100697982 3300021401 Soil MAHKSRVFGDLLDFVSTSRSRPAVTLAALSFAICHFVILGTEPASPGVAADLDAEIPRQLIHFAAELCRFALPLGFMVAGFATRAKTARSTQRRR
Ga0210393_110477802 3300021401 Soil MALKTRVFDDFLHFASTSRSKSAVTMAAVSFAICQLVVMGTASSGVSADLDAEIPRQLVHFAAELCQFVLPLGFMVAGVVIRAKKAPAEK
Ga0210385_114890432 3300021402 Soil TGSMALKSQVFDDLLHFASTSRSKSAVTLAAVSFAICHLVVMGTGSSGDSADVDVEIPRRLIHFAAELCRFALPLGFMLAGLAIRAKKAPPPGQLER
Ga0210397_100262543 3300021403 Soil MAHRRGRLSGILQLASTARLRWAVTLAAVSFAVCHLVSMGTGPGPTGGAADLDAEIPRQLVHFAAELCRFALPLGFMLAGLATPAKTRAGQVKR
Ga0210397_100546041 3300021403 Soil MALKSRVFDDLLYFASASRSKSAVTLAAVSFAVCHLVVMGTASSGVSADIDVEIPRQLVHFAAEFCRFALPLGFMVAGFAIRAKKAPPGQP
Ga0210397_109834661 3300021403 Soil MEFKSRIRDDLLYFVSTSRSKSAITLAAVSFAICHLVVIGTASFGVSADLDVEVPRQLIYFIAELCRFALPLGFMVAAFHVKKAPRSQLKK
Ga0210389_101351522 3300021404 Soil MAFEPRVFDRLLQFASAARSKSAVTLAAASFAICHLVVMGTAASGTSTDLDAEIPRQLVHFAAELCRFALPLGFMVAGFAGRAKTSGNRRRIS
Ga0210389_103585072 3300021404 Soil MAAKSQAFDELLHFASTSRSRSMVTLAAVSFAVCHVIVMGTGGSPMSGAADLDVEIPRQLIHFAAELCRFALPLGVVVVGMIRGRSKQSTLS
Ga0210389_112734371 3300021404 Soil MAYRRGQLSDLLQLASTSRWRSAVTLAAVSFAVCHLVSMATGPGSTGGSADLDVEIPRQIIHFAAELCRFALPLGFM
Ga0210389_114916022 3300021404 Soil MAPASRIFDELLRFASTSRSRSMVTLAAVSFAICHLIAMATAASGGTTDLEGEIPRQLIHFVAVLCRFALPLGV
Ga0210387_100046725 3300021405 Soil MADKSQVFDGLLHFASTPQSRSMVTLAAVSFAVCHLVVMGTSAAPMNGTADSDLEIPRQLMHFAAELCRFALPLGVVVVGIIRGRSKQSTGAPRARS
Ga0210387_100047925 3300021405 Soil MAFKPRVFDRLLQFASAARSKSAVTLAAVSFAICHLVVMGTAASGTSTDLDAEIPRQLVHFAAELCRFALPLGFMVAGFAGRAKTSGNRRRIS
Ga0210387_101174463 3300021405 Soil MAAKSQLFDDFLHFASTSRSKSTVTLAAVSFAVCHLVVMGTYAAPVSGTADLDLEIPRQLVHFAAELCRFALPLGVVVVGMIRGGSK
Ga0210387_108804381 3300021405 Soil MAHKSRVFGDLLDFVSTSRSRSAVTLAALAFAICHCVVLGTEPAFSGVTTDLDAEIPRQIIHFAAELCRFALPLGFMVAGFATRAKAARSIQRKR
Ga0210383_102616182 3300021407 Soil MAFEPRVFDRLLQFASAARSKSAVTLAAVSFAICHLVVMGTAASGTSTDLDAEIPRQLVHFAAELCRFALPLGFMVAGFAGRAKTSGNRRRIS
Ga0210394_100030317 3300021420 Soil MALKSRVCDDLLHFASTSRTKSAVTLAAVSFAICHLAVMGTASSSVSADLDAEIPRQLIHFVAELCRFALPLGFVVAGFAIRAKKVPPSQFNR
Ga0210394_1000374916 3300021420 Soil MALKSRVLDDLLDFASTSRSKSAVTLAAISFAICQLVVMGTDSSGVGSDLDSEIPRRLIHFAAELCQFALPLGFMVAGVVIRAKKAPASQLKK
Ga0210394_103068003 3300021420 Soil MALRFRFVDDLLRVASASRSRSAVTLAAISFAICHVIVMGTATASGAVSSDLDAEIPRQLIHFAAVVCRFALPLGFMVAGFTAHARAVRAGQRKR
Ga0210390_101778462 3300021474 Soil MALKSRDFDDLLHVASTSRSKSAVTLAAVSFAVCHLAVMGTASSGSRADLDAEIPRQLIHFAAEFCRFALPLGFMVAGFAIRAKKPPPRQLKR
Ga0210390_103609583 3300021474 Soil MALKPRVFDDLLHFASTSRSRSTVTLAAISFAVCHLIVMGTGSAGFGGPADLGVDIPRQLIHFVAELCRFALPLGVMLVGLIRS
Ga0210392_103475122 3300021475 Soil MVPIARVFDHLLHFASASRSKSAVTLAAVSFAICHLAVIWTASSGVVADMDVEIPRQLFHFAAELCRFVLPLGFMVAGFARPKNTPPGPLKR
Ga0210392_103689072 3300021475 Soil MAHKSRVFGDILDFVSSSRSRSAVTLAALSFAICHCVVLGTEPAFSGVTTDLDAEIPRQIIHFAAELCRFALPLGFMVAGFAT
Ga0210392_110561122 3300021475 Soil MAYRRGQLSDLLQLVSTSRLRSAVTLAAVSFAICHLVSMGTGTGPAGGSADLDVEIPRQLIHFAAALCRFVLPLGFMVAGLAIPAKIRSPGKLKQ
Ga0210398_100040504 3300021477 Soil MALKSQVFDELLHFASTSRSRSTVTLAAVSFAVCHLFVMGTGSAPANGTADLSGEIPRQIIHFVAELCRFALPLAVMVVGIIQPLKRR
Ga0210402_101981012 3300021478 Soil MALKSRIVGDLLHFASTARSRSAVTLAAVSFAICHLVAVATGSAPTSGTADLDVEIPRQLIHFAAEFCRFALPLGFMLARFATRASIARISRGKR
Ga0210410_103955882 3300021479 Soil MALRSRVFGNLLHFASAARSKSAISFAAVSFAICHLVVMGTASSGASADLDVEIPRQLIHFAAELCRFALPLGFMVAGFARRAKGPQ
Ga0242662_100932212 3300022533 Soil MALKSHFVDDLLRVASASRSRTAVTLAAVSFAICHFIVMGTANAAVTADLDAEIPLQLVHFAAVVCRFALPLGFMVVGFATHSKAVRAGQRRK
Ga0208480_10762002 3300025633 Arctic Peat Soil MALKSRVFDDLLHFASTPRSRSTVTLAAVSFAVCHLFVMGTDSTPVDLPDAIPRQLLHFAADLCRFALPLGVMVVGMLQHHKRR
Ga0209732_10206022 3300027117 Forest Soil MAPKSRIFDELLRFASTSRSRSTVTLAAVSFAICHLVAMGTSPAAAGGTADLAGEIPRQLIHFVAELCRFALPLAVMVVGMIHGRSKQSPLS
Ga0208985_10006443 3300027528 Forest Soil MAYRRGQLSDLLQLASTSRLRSAVTLAAVSFAVCHLVSMATGPGSAGGSADLDVEIPRQLIHFAAELCRFALPLGFMVAGLAIPAKTRPPGKLKQ
Ga0209008_10855321 3300027545 Forest Soil MARKSRIFDELLRFASTSRSRSMVTLAAVCFAVCHLIAMGTGSAAAGGTTDLEGEIPRQLIHFVAELCRFALPLGVMVVGMIHGRSKQSPLS
Ga0209222_10660761 3300027559 Forest Soil MAPKSRIFDELLRFASTSRSRTTVTLAAVSFAICHLVAMGTSPAAAGGTADLAGEIPRQLIHFVAELCRFALPLAVMVVGMIHGRSKQSPLS
Ga0209330_100030214 3300027619 Forest Soil MAPKSRIFDELLRFASTSRSRSMVTLAAVCFAVCHLIAMGTGSAAAGGTTDLEGEIPRQLIHFVAELCRFALPLGVMVVGMIHGRSKQSPLS
Ga0209007_10014841 3300027652 Forest Soil MAPASRIFDELLRFASTSRSRSMVTLAAVSFAICHLIAMATAASGGTTDLEGEIPRQLIHFVAELCRFALPLGVMVVGMIHGRSKQSPLS
Ga0209626_10811852 3300027684 Forest Soil MAHKSHVFGDLLGFVSTSRSRSAVTLAALSFAICHFVVLGTGPASPGVTADLDAEIPSQLIHFAAELCRFVLPLGFMVAGFATRAKTARSTQRPR
Ga0209624_1000083010 3300027895 Forest Soil MAHRSRVFGDLLDFVSTSRSRSAVTLAALSFAICHFVVLGTGPASPGVTADLDAEIPSQLIHFAAELCRFALPIGFMVAGFATRAKTARSTQRPR
Ga0265354_10002883 3300028016 Rhizosphere MALKSRVFDDLLHFASTSRSRSTVTLAAVSFAVCHLVVMGTGSVSVGGTADLGVDIPRQLIHFVAELCRFALPLGVMLVGLIRS
Ga0265352_10022271 3300028021 Soil MALKSQVFDDLLHFASTSRSRSTVTLAAVSFAVCHLFVMGTGAAPANGTADLSGEIPRQLIHFVAELCRFALPLAVMVVGIILPLKRR
Ga0265355_10004832 3300028036 Rhizosphere MAPKSRIFDELLRFASTSRSRSMVTLAAVSFAICHLVAMGTGSAAAGGTTDLEGEIPRQLIHFVAELCRFALPLGVMVVGMIHGRSKQSPLS
Ga0311369_100764573 3300029910 Palsa MAFKSRVFDDLLHFAATSRMRSTVTLAAVSFAVCHLVAMGSASAGAGGSADSGADIPRQLIHFVAELCRFALPLGVMVVGLISAPSKQSTEL
Ga0307482_11262542 3300030730 Hardwood Forest Soil MALKPRVFDNLLHIASTARSKSAITLAAVSFAVCHLVAISTATSGASADLDAEIPRQLLHFAAELCRFVLPLGFMVAGFACRAKTSRNR
Ga0265462_111812222 3300030738 Soil MALKAGIFDDLLHFASTSRSRSAVILAAVSFAVCHLVVMGTGPALASGTVDLQGDMPRQLIHFAAELCRFALPLGVMVVGMVQQLTRR
Ga0265459_142891611 3300030741 Soil MALKSRVFNGLLHFASTSRSRSTVTLAAISFAVCHLVVMGTGAANPSASGDLQGEIPRQLIHFAAELCRFGLPLGVMAVGMIHQLTRR
Ga0265753_10484302 3300030862 Soil MAHKSRVFGDLLDFVSTSRSRSAVTLAALSFAICHSVVLGTEPAYPGVTADLDAEIPIQIIHFAAELCRFALPLGFMVAGFATRAKTAQSTQRKK
Ga0138296_14832221 3300030923 Soil APRIGPMANRRGQLSDLLQLASTSRLRSAVTLAAVSFAVCHLVSMATGPGSAGGSADLEVEIPSQIIHFAAELCRFALPLGFMVVGLAIPAKARSPGKLKK
Ga0138296_18571332 3300030923 Soil PGNSRNDSMALKAHFVEDLLRFASASRSRSAVTLAAVSFAICHFVVLGTATANAAVTADLDAEIPRQLIHFAAVVCRFALPLGFMVAGFTTHAKAVRAGQRKG
Ga0138298_15611112 3300031015 Soil MANRRGQLSDLLQLASTSRLRSAVTLAAVSFAVCHLVSMATGPGSAGGSADLEVEIPRQIIHFAAELCRFALPLGFMVVGLAIPAKARSPGKLKK
Ga0170834_1037969211 3300031057 Forest Soil MALKSRVFDDLLHFASASRSKSAVTLAAVSFAVCHLVVVGNASSGVSADIDVEIPRQLIHFGAELCRFALPLGFMVAGLALRAKKAPPSRLKR
Ga0265760_101084922 3300031090 Soil MAPIARVFDDLLHFTSTSRSRSTVTLAAVSFAICHLFVMSTGAAPANEAADLPGEIPRQLIHFVAELCRFALPLGVMVVGMIPQFKRR
Ga0170823_104859382 3300031128 Forest Soil MALKSRVFDDLLHFASASRSKSAVTLAAVSFAVCHLVVVGTASSGVSADIDVEIPRQLIHFGAELCRFALPLGFMVAGLALRAKKAPPSRLKR
Ga0170824_1040985533 3300031231 Forest Soil MALKSRVFNDLLHFASDARLKSAVTLAAVSFAICHLVVMGTASLGASSDLDHEIPRQIIHFAAELCRFALPLGFMVAGFANHSKKSRPR
Ga0170824_1050468811 3300031231 Forest Soil GDALTPRKGSMALKSRVFDDLLHFASASRSKSAVTLAAVSFAVCHLVVVGNASSGVSADIDVEIPRQLIHFGAELCRFALPLGFMVAGLALRAKKAPPSRLKR
Ga0170824_1152942862 3300031231 Forest Soil LLHFASASRSKAAVTLAAVSFAICHLVVMGTTSFGPSANLEVEIPRQIIYFTAELCRFALPFGFMVAGFAIRAKKAPTGQIGR
Ga0170824_1271888991 3300031231 Forest Soil NTGALTPRIGSMALKSRVDDLLRFASASRSKSAVTLAAVSFAVCHLIVMGTASLGASADLDVEIPRQIIHYAAELCRFLLPLGFMVAGFAIRAKKARPRHQ
Ga0170820_110556672 3300031446 Forest Soil MALKSRVFNDLLHFASDARLKSAVTLAAVSFAICHLVVMGTASLGASADLDHEIPRQIIHFAAELCRFALPLGFMVAGFAIHSKKSRPR
Ga0170820_175427602 3300031446 Forest Soil MALKSRVDDLLRFASASRSKSAVTLAAVSFAVCHLIVMGTASLGASADLDVEIPRQIIHYAAELCRFLLPLGFMVAGFAIRAKKARPRHQ
Ga0170818_1082842132 3300031474 Forest Soil MALKTRVFDGLLHFASASRSKAAVTLAAVSFAICHLVVMGTTSFGPSANLEVEIPRQIIYFTAELCRFALPFGFMVAGFAIRAKKAPTGQIGR
Ga0310686_1016566832 3300031708 Soil MAPKSRIFDELLRFASTSRSRSMVTLAAVSFAICHLVAMGTSSAAGGGTAVLEAEIPRQLIHFVAELCRFALPLAVMVVGIMQPLKRR
Ga0310686_1022577663 3300031708 Soil VALKSQVFDGLLHFAATSRSKSAVTLAAVSFAICQLVVMATASSGVSADLDTEIPRQLIHFAAELCQFVLPLGFMVAGVAIRTKKTPPSQL
Ga0310686_10357760115 3300031708 Soil MALKPGLFDDLLHFASTSRSRSTVTLAAVSFAVCHLIIMATGPSPASGSADEIPRQLIHFFAELCRFALPLSVMAIGLIQGLSKRSTL
Ga0310686_1075573738 3300031708 Soil MAPKSRIVDELLRFASTSRSRSTVTLAALSFAICHLLAMGTSTAAAGGTADLDGEIARQLIHFVAELCRFALPLGVMLFGMIHGRSKQSPLS
Ga0310686_1087253331 3300031708 Soil MAHKSRVFGDLLDFVSTSRSRSAVTLAALSFAICHFVILGTEPASPGVTADLDAEIPRQLIHFAAELCRFALPLGFMVAGFATRAKTARSTQRKR
Ga0310686_10977648323 3300031708 Soil MALKSQVFDDLLYFASTSRSRSTVTLAAVSFAICHLFVMGTDSAPTNGTADLSGEIPRQLIHFAAELCRFALPLVVMVVGIIQPLKRR
Ga0310686_1111845291 3300031708 Soil LLRFASTSRSRSMVTLAAVCFAVCHLIAMGTGSAAAGGTTDLEGEIPRQLIHFVAELCRFALPLGVMVVGMIHGRSKQSPLS
Ga0310686_11608919920 3300031708 Soil MALKPRVFDDLLHFASTSRSRSTVTLAAISFAVCHLIVMGTGSAGFSGSADLGVDIPRQLIHFVAELCRFALPLGVMLVGLIRS
Ga0310686_1165132541 3300031708 Soil SRVLDDFLRFASTSRSKSAVTLAATSFAIFQLIVMGTAPSGVSADLDAETPRQLIHFAAELCQFALPLGFMVAGVAIRAKKAPPSRFER
Ga0310686_1185987973 3300031708 Soil MALNSRVFDDLLHFASTSRSRATVTMAAVSFAICHLIVMGTANGTADLGIEIPRQLIHFFAELCRFALPLGVMVVGLIQRRSKQSTRP
Ga0307476_100026583 3300031715 Hardwood Forest Soil MALKPRVFDDLLHFASTSRSRSTVTLAAISFAVCHLIVMGTGSAGFDGTADLGLDIPRQLIHFVAELCRFALPLGVMLVGLIRS
Ga0307476_101328004 3300031715 Hardwood Forest Soil MAPKSQIFDDLLRFASTSRSRSTVTWAAVCFAICHLIAMGTNAAAAGGTADADEIPRQLIHFVAELCRFALPLGVMLVGLIHGRSKQSPLS
Ga0307476_108442171 3300031715 Hardwood Forest Soil AVASRQSNHNGDALTPGNGSMALKPRVFDNLLHIASTARSKSAITLAAVSFAVCHLVAISTATSGASADLDAEIPRQLLHFAAELCRFVLPLGFMVAGFAGRAKTSRNRRPK
Ga0307476_110757341 3300031715 Hardwood Forest Soil MAAKPQAFDQLLHFASTSQSRSMVTLAAVSFAVCHLIVMGTSAAPMSGSADLDAEIPRQLIHFAAELCRFVLPLGVVVVGMIRGRPKQSTLP
Ga0307474_1000134014 3300031718 Hardwood Forest Soil MAPKSRIFDELLRFASTSRSRSMVTLAAVSFAVCHLIAMGTATAGGTTDLEGEIPRQLIHFVAELCRFALPLGVMVVGMIHGRSKQSPLS
Ga0307474_100366552 3300031718 Hardwood Forest Soil MAVKSQVFGDLLHFASTPRSKSTVTLAAVSFAVCHLVVMGTNAAPVSGTADLDLEIPRQLLHFAAELCRFALPLGVVIVGMIRGWSKQSSLS
Ga0307478_106696782 3300031823 Hardwood Forest Soil MALKPRVFDDLLHFASTSRSRSTVTLAAISFAVCHLIVMGTGSAGFDGTADLGLDIPRQLIHFAAELCRFALPLGVMLVGLIRS
Ga0370483_0313918_220_480 3300034124 Untreated Peat Soil MAPKFRVFDELLHFVSTSRSKSTVTLAAVSFAVCHLAAMASGPAAGVAADSDLEIPRQLVHFAAEFCRFVLPLGFMVAGFATRAKN




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice