NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F080249

Overview

Basic Information
Family ID F080249
Family Type Metagenome / Metatranscriptome
Number of Sequences 115
Average Sequence Length 73 residues
Representative Sequence MGDDKKKTGTPDRNLISFKEKYEFDYAVKQLQKQVPDTTKQEAKQALTEAAKKISPSEGREKIMRAARKKLRS
Number of Associated Samples 96
Number of Associated Scaffolds 115

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 52.83 %
% of genes near scaffold ends (potentially truncated) 16.52 %
% of genes from short scaffolds (< 2000 bps) 55.65 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.63


Most Common Taxonomy
Group Bacteria (87.826 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater (11.304 % of family members)
Environment Ontology (ENVO) Unclassified (26.087 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (30.435 % of family members)



Structure & Topology

Predicted Topology & Secondary Structure

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 46.53%, β-sheet: 0.00%, Coil/Unstructured: 53.47%

Predicted 3D Structure

pTM-score: 0.63

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
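
The cut-off quoted in the note above can be written as a one-line check. This is only a sketch: the function name and constant are hypothetical and are not part of NMPFamsDB; the only value taken from the page is the pTM < 0.7 threshold.

```python
# Hypothetical helper, not an NMPFamsDB API: applies the pTM < 0.7
# cut-off quoted in the "Low Quality Model" note.
LOW_CONFIDENCE_PTM = 0.7  # threshold taken from the page text


def is_low_confidence(ptm: float) -> bool:
    """Return True when a model's pTM-score falls below the cut-off."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM-score is expected to lie in [0, 1]")
    return ptm < LOW_CONFIDENCE_PTM


# Family F080249 reports a pTM-score of 0.63.
print(is_low_confidence(0.63))
```

A score exactly at 0.7 is not flagged, because the note uses a strict inequality.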


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 115 Family Scaffolds
PF01726 | LexA_DNA_bind | 4.35
PF00496 | SBP_bac_5 | 1.74
PF02195 | ParBc | 1.74
PF14082 | DUF4263 | 1.74
PF03401 | TctC | 0.87
PF08843 | AbiEii | 0.87
PF03887 | YfbU | 0.87
PF01555 | N6_N4_Mtase | 0.87
PF13458 | Peripla_BP_6 | 0.87
PF06421 | LepA_C | 0.87
PF13489 | Methyltransf_23 | 0.87
PF13776 | DUF4172 | 0.87
PF00353 | HemolysinCabind | 0.87
PF00903 | Glyoxalase | 0.87
PF12705 | PDDEXK_1 | 0.87
PF00899 | ThiF | 0.87
PF00892 | EamA | 0.87
PF01402 | RHH_1 | 0.87
PF01588 | tRNA_bind | 0.87
PF00145 | DNA_methylase | 0.87
PF10412 | TrwB_AAD_bind | 0.87
PF12307 | DUF3631 | 0.87
PF00270 | DEAD | 0.87
PF03473 | MOSC | 0.87
PF07715 | Plug | 0.87
PF13177 | DNA_pol3_delta2 | 0.87
PF00271 | Helicase_C | 0.87
PF03404 | Mo-co_dimer | 0.87
PF06808 | DctM | 0.87
PF00378 | ECH_1 | 0.87
PF00872 | Transposase_mut | 0.87
PF00589 | Phage_integrase | 0.87
PF08327 | AHSA1 | 0.87
PF13393 | tRNA-synt_His | 0.87
PF05713 | MobC | 0.87
PF13031 | DUF3892 | 0.87
PF10881 | DUF2726 | 0.87
PF00580 | UvrD-helicase | 0.87
PF01381 | HTH_3 | 0.87
PF09848 | DUF2075 | 0.87
PF05990 | DUF900 | 0.87
PF04909 | Amidohydro_2 | 0.87
PF03713 | DUF305 | 0.87
PF02796 | HTH_7 | 0.87
PF12161 | HsdM_N | 0.87
PF13649 | Methyltransf_25 | 0.87
PF09665 | RE_Alw26IDE | 0.87
PF08937 | DUF1863 | 0.87
PF13676 | TIR_2 | 0.87
PF00202 | Aminotran_3 | 0.87
PF01207 | Dus | 0.87
PF01098 | FTSW_RODA_SPOVE | 0.87
PF00132 | Hexapep | 0.87
PF00176 | SNF2-rel_dom | 0.87
PF16189 | Creatinase_N_2 | 0.87
PF04998 | RNA_pol_Rpb1_5 | 0.87
PF01844 | HNH | 0.87
PF09492 | Pec_lyase | 0.87
PF07669 | Eco57I | 0.87
PF08808 | RES | 0.87
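
The frequencies above are percentages of the 115 family scaffolds, so each one can be converted back to a whole number of scaffolds. The sketch below assumes the page rounds to two decimal places; the function name is hypothetical.

```python
TOTAL_SCAFFOLDS = 115  # from the column header "% Frequency in 115 Family Scaffolds"


def scaffold_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Convert a rounded percent frequency into a whole number of scaffolds."""
    return round(percent / 100 * total)


# The three distinct frequencies that occur in the Pfam table.
for pfam_id, percent in [("PF01726", 4.35), ("PF00496", 1.74), ("PF03401", 0.87)]:
    print(pfam_id, scaffold_count(percent))
```

Read this way, 4.35 % corresponds to 5 scaffolds, 1.74 % to 2 scaffolds, and 0.87 % to a single scaffold.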

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 115 Family Scaffolds
COG0210 | Superfamily I DNA or RNA helicase | Replication, recombination and repair [L] | 0.87
COG0270 | DNA-cytosine methylase | Replication, recombination and repair [L] | 0.87
COG0481 | Translation elongation factor EF-4, membrane-bound GTPase | Translation, ribosomal structure and biogenesis [J] | 0.87
COG0772 | Peptidoglycan polymerase FtsW/RodA/SpoVE | Cell cycle control, cell division, chromosome partitioning [D] | 0.87
COG0863 | DNA modification methylase | Replication, recombination and repair [L] | 0.87
COG1041 | tRNA G10 N-methylase Trm11 | Translation, ribosomal structure and biogenesis [J] | 0.87
COG1074 | 3’-5’ helicase subunit RecB of the DNA repair enzyme RecBCD (exonuclease V) | Replication, recombination and repair [L] | 0.87
COG2189 | Adenine specific DNA methylase Mod | Replication, recombination and repair [L] | 0.87
COG2253 | Predicted nucleotidyltransferase component of viral defense system | Defense mechanisms [V] | 0.87
COG2517 | Predicted RNA-binding protein, contains C-terminal EMAP domain | General function prediction only [R] | 0.87
COG3181 | Tripartite-type tricarboxylate transporter, extracytoplasmic receptor component TctC | Energy production and conversion [C] | 0.87
COG3328 | Transposase (or an inactivated derivative) | Mobilome: prophages, transposons [X] | 0.87
COG3544 | Uncharacterized conserved protein, DUF305 family | Function unknown [S] | 0.87
COG3973 | DNA helicase IV | Replication, recombination and repair [L] | 0.87
COG4782 | Esterase/lipase superfamily enzyme | General function prediction only [R] | 0.87
COG5654 | Predicted toxin component of a toxin-antitoxin system, contains RES domain | Defense mechanisms [V] | 0.87
COG0042 | tRNA-dihydrouridine synthase | Translation, ribosomal structure and biogenesis [J] | 0.87
COG0073 | tRNA-binding EMAP/Myf domain | Translation, ribosomal structure and biogenesis [J] | 0.87
COG0086 | DNA-directed RNA polymerase, beta' subunit/160 kD subunit | Transcription [K] | 0.87


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 87.83 %
Unclassified | root | N/A | 12.17 %


Associated Scaffolds


Scaffold | Taxonomy | Length
2010549000|RicEn_FSXC3319_g1 | All Organisms → cellular organisms → Bacteria | 798
3300000571|JGI1358J11329_10010129 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Rubrobacteria → Rubrobacterales → Rubrobacteraceae → Rubrobacter → Rubrobacter xylanophilus | 5748
3300000571|JGI1358J11329_10070268 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1259
3300000574|JGI1357J11328_10042026 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Parcubacteria | 1900
3300001380|JGI1356J14229_10203481 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → unclassified Bacteroidetes → Bacteroidetes bacterium | 564
3300001605|Draft_10137896 | Not Available | 1616
3300001605|Draft_10668548 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 524
3300002461|AADWTP_10030597 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4407
3300002502|C687J35174_10793266 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → unclassified Bacteroidetes → Bacteroidetes bacterium | 568
3300003313|P32013IDBA_1018464 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 2720
3300003319|soilL2_10207108 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1757
3300003320|rootH2_10186964 | All Organisms → cellular organisms → Bacteria | 1511
3300004210|Ga0066639_10347205 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Nomurabacteria | 884
3300005336|Ga0070680_100006611 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 8815
3300005336|Ga0070680_100759888 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Paenibacillus → Paenibacillus antibioticophila | 835
3300005337|Ga0070682_100618771 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 858
3300005337|Ga0070682_101028346 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 685
3300005441|Ga0070700_100480156 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 952
3300005455|Ga0070663_100256920 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1385
3300005458|Ga0070681_10035025 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5042
3300005518|Ga0070699_101476828 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → unclassified Parcubacteria group → Parcubacteria group bacterium RIFCSPHIGHO2_01_FULL_47_10b | 623
3300005539|Ga0068853_100014546 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6450
3300005544|Ga0070686_101004395 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 684
3300006178|Ga0075367_10153825 | All Organisms → cellular organisms → Bacteria | 1428
3300006353|Ga0075370_10932383 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 531
3300006417|Ga0069787_10117000 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 16895
3300006847|Ga0075431_101306433 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 686
3300006871|Ga0075434_100270375 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1719
3300007351|Ga0104751_1220776 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 655
3300008886|Ga0115930_1000327 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 70744
3300009031|Ga0103682_10018444 | All Organisms → cellular organisms → Bacteria | 4324
3300009093|Ga0105240_10420849 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1501
3300009101|Ga0105247_11363617 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 572
3300009156|Ga0111538_11822330 | All Organisms → cellular organisms → Bacteria | 766
3300009692|Ga0116171_10370891 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → unclassified Bacteroidetes → Bacteroidetes bacterium | 767
3300010045|Ga0126311_10002747 | All Organisms → cellular organisms → Bacteria | 8909
3300010356|Ga0116237_10090906 | All Organisms → cellular organisms → Bacteria | 3166
3300010371|Ga0134125_10620628 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1194
3300010373|Ga0134128_10002813 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 21377
3300010397|Ga0134124_11313829 | All Organisms → cellular organisms → Bacteria | 747
3300010397|Ga0134124_11822844 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 643
3300010397|Ga0134124_12590652 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 550
3300012202|Ga0137363_10043387 | All Organisms → cellular organisms → Bacteria | 3181
3300012202|Ga0137363_10385705 | Not Available | 1165
3300012205|Ga0137362_10803121 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 806
3300012212|Ga0150985_106845717 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 767
3300012212|Ga0150985_119547222 | All Organisms → cellular organisms → Bacteria | 2026
3300012469|Ga0150984_103830429 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 771
3300012469|Ga0150984_109032288 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 656
3300012469|Ga0150984_115325741 | All Organisms → cellular organisms → Bacteria | 1669
3300012685|Ga0137397_10200357 | All Organisms → cellular organisms → Bacteria | 1485
3300012943|Ga0164241_10000027 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 400514
3300012943|Ga0164241_10004280 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 13921
3300012943|Ga0164241_10441592 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 935
3300012986|Ga0164304_10619789 | All Organisms → cellular organisms → Bacteria | 810
3300013297|Ga0157378_12100590 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 615
3300013772|Ga0120158_10285981 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 805
3300014059|Ga0119868_1017927 | All Organisms → cellular organisms → Bacteria | 2286
3300014208|Ga0172379_10011802 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Peregrinibacteria | 10335
3300014208|Ga0172379_10634703 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → unclassified Bacteroidetes → Bacteroidetes bacterium | 658
3300017651|Ga0182742_1002396 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 20804
3300018429|Ga0190272_11027362 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 790
3300018481|Ga0190271_10616410 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1205
3300019360|Ga0187894_10069601 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1973
3300019775|Ga0197853_1041513 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 51391
3300020027|Ga0193752_1006768 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae | 6690
3300020213|Ga0163152_10209485 | All Organisms → cellular organisms → Bacteria | 1052
3300020215|Ga0196963_10187964 | Not Available | 891
3300020591|Ga0180223_1001428 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root1462 | 18511
3300021074|Ga0194044_10001161 | All Organisms → cellular organisms → Bacteria | 13413
3300021339|Ga0193706_1013353 | All Organisms → cellular organisms → Bacteria | 2729
3300021363|Ga0193699_10463802 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 520
3300021403|Ga0210397_11357512 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 552
3300021605|Ga0194054_10251031 | Not Available | 585
3300022549|Ga0212091_10469112 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 524
3300023049|Ga0136601_1000836 | All Organisms → cellular organisms → Bacteria | 11554
3300023270|Ga0247784_1000047 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 43042
3300024347|Ga0179591_1146559 | All Organisms → cellular organisms → Bacteria | 4059
3300025015|Ga0209210_1030223 | All Organisms → cellular organisms → Bacteria | 2033
3300025317|Ga0209541_10100981 | All Organisms → cellular organisms → Bacteria | 2931
3300025323|Ga0209542_10543610 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → unclassified Bacteroidetes → Bacteroidetes bacterium | 852
3300025536|Ga0207952_1044967 | All Organisms → cellular organisms → Bacteria | 784
3300025912|Ga0207707_10004898 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae | 11766
3300026041|Ga0207639_10008725 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6959
3300027815|Ga0209726_10277003 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 832
3300027819|Ga0209514_10008628 | All Organisms → cellular organisms → Bacteria | 11508
3300027819|Ga0209514_10029460 | All Organisms → cellular organisms → Bacteria | 4444
3300027819|Ga0209514_10047240 | All Organisms → cellular organisms → Bacteria | 3059
3300027835|Ga0209515_10021767 | All Organisms → cellular organisms → Bacteria | 6031
3300027835|Ga0209515_10043331 | All Organisms → cellular organisms → Bacteria | 3597
3300027835|Ga0209515_10162376 | All Organisms → cellular organisms → Bacteria | 1383
3300027909|Ga0209382_11067047 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 837
3300028028|Ga0265292_1000252 | All Organisms → cellular organisms → Bacteria | 91748
3300028738|Ga0302292_1180313 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Parcubacteria → unclassified Parcubacteria → Candidatus Parcubacteria bacterium | 703
3300029288|Ga0265297_10253177 | All Organisms → cellular organisms → Bacteria → Spirochaetes → unclassified Spirochaetota → Spirochaetota bacterium | 1251
3300029989|Ga0311365_10705767 | Not Available | 874
3300031521|Ga0311364_10631917 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1079
3300031740|Ga0307468_100010938 | All Organisms → cellular organisms → Bacteria | 3552
3300031949|Ga0214473_11484110 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 685
3300032144|Ga0315910_10003941 | All Organisms → cellular organisms → Bacteria | 12503
3300032157|Ga0315912_11295626 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 576
3300032173|Ga0315268_10001563 | All Organisms → cellular organisms → Bacteria | 26325
3300032770|Ga0335085_10000076 | All Organisms → cellular organisms → Bacteria | 339163
3300033134|Ga0335073_10572971 | All Organisms → cellular organisms → Bacteria | 1269
3300033233|Ga0334722_10001016 | All Organisms → cellular organisms → Bacteria | 37292
3300034155|Ga0370498_032206 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium 65-7 | 1134
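
Each scaffold identifier above appears to join a numeric prefix and a scaffold name with a `|` character, and the prefix matches the Taxon OID values listed under Associated Samples. Assuming that reading is correct, the sketch below splits identifiers and groups scaffolds by sample; the function name is hypothetical and the three identifiers are copied from the table.

```python
from collections import defaultdict


def split_scaffold_id(identifier: str) -> tuple[str, str]:
    """Split 'taxonOID|scaffoldName' into its two parts (assumed layout)."""
    taxon_oid, sep, scaffold_name = identifier.partition("|")
    if not sep or not taxon_oid or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return taxon_oid, scaffold_name


# Three identifiers copied from the Associated Scaffolds table.
identifiers = [
    "3300000571|JGI1358J11329_10010129",
    "3300000571|JGI1358J11329_10070268",
    "3300000574|JGI1357J11328_10042026",
]

# Group scaffold names under the sample (Taxon OID) they come from.
scaffolds_by_sample = defaultdict(list)
for identifier in identifiers:
    taxon_oid, scaffold_name = split_scaffold_id(identifier)
    scaffolds_by_sample[taxon_oid].append(scaffold_name)

for taxon_oid, names in scaffolds_by_sample.items():
    print(taxon_oid, len(names), names)
```

Grouping this way helps explain why the family reports more scaffolds than associated samples: a single sample can contribute several scaffolds.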

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 11.30%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 9.57%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 5.22%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 4.35%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 4.35%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.48%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater | 2.61%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 2.61%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 2.61%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 1.74%
Anoxic Zone Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Anoxic Zone Freshwater | 1.74%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.74%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater | 1.74%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 1.74%
Compost | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Compost | 1.74%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.74%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.74%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 1.74%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 1.74%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 1.74%
Populus Endosphere | Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere | 1.74%
Landfill Leachate | Engineered → Solid Waste → Landfill → Unclassified → Unclassified → Landfill Leachate | 1.74%
Anaerobic Digestor Sludge | Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge | 1.74%
Hydrocarbon Resource Environments | Engineered → Wastewater → Industrial Wastewater → Petrochemical → Unclassified → Hydrocarbon Resource Environments | 1.74%
Freshwater Microbial Mat | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Microbial Mat | 0.87%
Freshwater | Environmental → Aquatic → Freshwater → Drinking Water → Unclassified → Freshwater | 0.87%
Groundwater | Environmental → Aquatic → Freshwater → Drinking Water → Chlorinated → Groundwater | 0.87%
Contaminated Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Contaminated Groundwater | 0.87%
Deep Ocean | Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean | 0.87%
Saline Lake | Environmental → Aquatic → Non-Marine Saline And Alkaline → Saline → Unclassified → Saline Lake | 0.87%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 0.87%
Ore Pile And Mine Drainage Contaminated Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Ore Pile And Mine Drainage Contaminated Soil | 0.87%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.87%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 0.87%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.87%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.87%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.87%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.87%
Bog | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Bog | 0.87%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.87%
Soil | Environmental → Terrestrial → Soil → Sand → Desert → Soil | 0.87%
Serpentinite Rock And Fluid | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Serpentinite Rock And Fluid | 0.87%
Deep Subsurface Aquifer | Environmental → Terrestrial → Deep Subsurface → Aquifer → Unclassified → Deep Subsurface Aquifer | 0.87%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.87%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.87%
Micrasterias Crux-Melitensis (Mzch 98) Associated | Host-Associated → Microbial → Bacteria → Unclassified → Unclassified → Micrasterias Crux-Melitensis (Mzch 98) Associated | 0.87%
Rice Endophytes | Host-Associated → Plants → Rhizoplane → Endophytes → Unclassified → Rice Endophytes | 0.87%
Sugarcane Root And Bulk Soil | Host-Associated → Plants → Rhizome → Unclassified → Unclassified → Sugarcane Root And Bulk Soil | 0.87%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.87%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.87%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.87%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.87%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.87%
Activated Sludge | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Activated Sludge | 0.87%
Enhanced Biological Phosphorus Removal Bioreactor | Engineered → Wastewater → Nutrient Removal → Biological Phosphorus Removal → Activated Sludge → Enhanced Biological Phosphorus Removal Bioreactor | 0.87%
Enriched Sediment | Engineered → Lab Enrichment → Defined Media → Anaerobic Media → Unclassified → Enriched Sediment | 0.87%




Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2010549000Rice endophytes microbial communities from Berkeley, California, USAHost-AssociatedOpen in IMG/M
3300000571Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 mEnvironmentalOpen in IMG/M
3300000574Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 mEnvironmentalOpen in IMG/M
3300000961Marine microbial communities from the Deep Pacific Ocean - MP1649EnvironmentalOpen in IMG/M
3300001380Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW37 contaminated, 5.8 mEnvironmentalOpen in IMG/M
3300001605Tailings pond microbial communities from Northern Alberta - Syncrude Mildred Lake Settling BasinEngineeredOpen in IMG/M
3300002461Freshwater microbial communities from a drinking water treatment plant in Ann Arbor, Michigan, USAEnvironmentalOpen in IMG/M
3300002502Soil microbial communities from Rifle, Colorado - Rifle CSP2_plank highO2_0.1EnvironmentalOpen in IMG/M
3300003313Ore pile and mine drainage contaminated soil microbial communities from Mina do Sossego, Brazil - P3 sampleEnvironmentalOpen in IMG/M
3300003319Sugarcane bulk soil Sample L2EnvironmentalOpen in IMG/M
3300003320Sugarcane root Sample H2Host-AssociatedOpen in IMG/M
3300004210Groundwater microbial communities from aquifer - Crystal Geyser CG10_big_fil_rev_8/21/14_0.10EnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005337Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3L metaGEnvironmentalOpen in IMG/M
3300005441Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaGEnvironmentalOpen in IMG/M
3300005455Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaGHost-AssociatedOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005539Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2Host-AssociatedOpen in IMG/M
3300005544Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaGEnvironmentalOpen in IMG/M
3300006178Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. TD hybrid TD303-2Host-AssociatedOpen in IMG/M
3300006353Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. TD hybrid TD303-5Host-AssociatedOpen in IMG/M
3300006417Combined Assembly of Gp0110018, Gp0110022, Gp0110020EngineeredOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300007351Combined Assembly of Gp0115775, Gp0115815EnvironmentalOpen in IMG/M
3300008886Microbial communities associated with unicellular green alga Micrasterias crux-melitensis, Germany - (MZCH: 98)Host-AssociatedOpen in IMG/M
3300009031Microbial communities from groundwater in Rifle, Colorado, USA - 3D_0.1umEnvironmentalOpen in IMG/M
3300009093Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaGHost-AssociatedOpen in IMG/M
3300009101Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaGHost-AssociatedOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009692Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from Japan - AD_JPNHW2_MetaGEngineeredOpen in IMG/M
3300010045Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot61EnvironmentalOpen in IMG/M
3300010356AD_USDEcaEngineeredOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012943Backyard soil microbial communities from Emeryville, California, USA - Original compost - Back yard soil (BY)EnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300013122 (restricted)Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_10.3mEnvironmentalOpen in IMG/M
3300013123 (restricted)Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_022012_11mEnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300013772Permafrost microbial communities from Nunavut, Canada - A10_80_0.25MEnvironmentalOpen in IMG/M
3300014059Activated sludge microbial communities from Shanghai, China - membrane bioreactor - Membrane foulantsEngineeredOpen in IMG/M
3300014208Groundwater microbial communities from an aquifer near a municipal landfill in Southern Ontario, Canada - Groundwater well OW334 metaGEnvironmentalOpen in IMG/M
3300014838Permafrost microbial communities from Stordalen Mire, Sweden - 812S3M metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300017651Enriched Miracle-Growth compost microbial communities from Emeryville, California, USA - eDNA 5th pass 37_C BE-Lig MG (version 2)EnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018481Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 TEnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019775Lab enriched sediment microbial communities from hydrocarbon-contaminated retail site, Toronto, Canada - S1, HI.1247_001EngineeredOpen in IMG/M
3300020027Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1c1EnvironmentalOpen in IMG/M
3300020213Freshwater microbial mat bacterial communities from Lake Vanda, McMurdo Dry Valleys, Antarctica - Oligotrophic Lake LV.19.MP8.IB-2EnvironmentalOpen in IMG/M
3300020215Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_5EnvironmentalOpen in IMG/M
3300020415Marine microbial communities from Tara Oceans - TARA_B100001146 (ERX555973-ERR599166)EnvironmentalOpen in IMG/M
3300020443Marine microbial communities from Tara Oceans - TARA_B100001179 (ERX556000-ERR598944)EnvironmentalOpen in IMG/M
3300020591Enriched Organic Plus compost microbial communities from Emeryville, California, USA - eDNA 5th pass 30_C Kraft OP (version 2)EnvironmentalOpen in IMG/M
3300021074Anoxic zone freshwater microbial communities from boreal shield lake in IISD Experimental Lakes Area, Ontario, Canada - Jun2016-L442-17mEnvironmentalOpen in IMG/M
3300021339Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c1EnvironmentalOpen in IMG/M
3300021363Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3c2EnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021605Anoxic zone freshwater microbial communities from boreal shield lake in IISD Experimental Lakes Area, Ontario, Canada - Sep2016-L227-10mEnvironmentalOpen in IMG/M
3300022549Cold Creek_combined assemblyEnvironmentalOpen in IMG/M
3300023049Saline lake microbial communities from Rauer Islands, Antarctica - Metagenome Filla 2 #699EnvironmentalOpen in IMG/M
3300023270Plant litter microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-L169-409R-5EnvironmentalOpen in IMG/M
3300024347Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025015Contaminated groundwater microbial communities from Rifle, Colorado, USA - Rifle Groundwater A1 (SPAdes)EnvironmentalOpen in IMG/M
3300025317Groundwater microbial communities from Rifle, Colorado - Rifle CSP2_plank lowO2_0.1 (SPAdes)EnvironmentalOpen in IMG/M
3300025323Soil microbial communities from Rifle, Colorado - Rifle CSP2_plank highO2_0.1 (SPAdes)EnvironmentalOpen in IMG/M
3300025536Serpentinite rock and fluid subsurface biosphere microbial communities from McLaughlin Reserve, California, USA - CR12Aug_8Ca (SPAdes)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026041Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027819Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW37 contaminated, 5.8 m (SPAdes)EnvironmentalOpen in IMG/M
3300027835Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028028Leachate microbial communities from a municipal landfill in Southern Ontario, Canada - Leachate well 296AEngineeredOpen in IMG/M
3300028738Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Fen_N2_1EnvironmentalOpen in IMG/M
3300029288Leachate microbial communities from a municipal landfill in Southern Ontario, Canada - Leachate well 137-91EngineeredOpen in IMG/M
3300029989III_Fen_N1 coassemblyEnvironmentalOpen in IMG/M
3300031521III_Fen_E2 coassemblyEnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032144Garden soil microbial communities collected in Santa Monica, California, United States - Edamame soilEnvironmentalOpen in IMG/M
3300032157Garden soil microbial communities collected in Santa Monica, California, United States - V. faba soilEnvironmentalOpen in IMG/M
3300032173Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C1_topEnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300033134Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.2EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M
3300034155Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_05D_17EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of the features below for restricted sequences requires a license from the corresponding author(s) of the relevant datasets.

Protein ID | Sample Taxon ID | Habitat | Sequence
RicEn_196220 | 2010549000 | Rice Endophytes | MIDATQRETPMTDNKQNKGTPDRNLISFKQKYEFDYAVKQLQKQIPDTTRQEAKDALAAAAKKISPSEGREKIMRAARKTLRD
JGI1358J11329_100101292 | 3300000571 | Groundwater | MADNKNKVKEDRNLISFKENYEVYYAVNQLKKQFTDETKADIKEALFDAAKQVSPSEGREKIMRLARKELNN*
JGI1358J11329_100702682 | 3300000571 | Groundwater | MTDNKKKTGKPDSYLISFKERYEFDYAVGQLQKQVSDTTKQEAKDALTKAAKKVNPSEGREKIMRAARKILNS*
JGI1357J11328_1000220211 | 3300000574 | Groundwater | MADNKNKTGKPDIYLISFKEAYEVNYAVKQLQKQFPEKTKQVVKEALFEAAKIVDPSEGREKVMKQARKNLRD*
JGI1357J11328_100420262 | 3300000574 | Groundwater | MADDKKKVKEDRNLISFKEHYEVYYAVNQLKKQFPDETKADIKDALFDAAKQVKPSEGREKIMRLARKDLRD*
JGI12026J13078_10122782 | 3300000961 | Deep Ocean | MADNKSKRGKPDSYLISFKERYEVDYAVKQLQKQFPAKTKTEVKKALTKAASEIDPSRGRERVMKGARKLIKK*
JGI1356J14229_102034811 | 3300001380 | Groundwater | MSDDKKKVKEDRNLISFKENYEVDYAVNQLKKQFPDETKKEIKEALFDAAKQVSPSEGREKIMRIARKD
Draft_101378962 | 3300001605 | Hydrocarbon Resource Environments | MKLTTGEGKTMTDDKKNRGNPDRNLISFKEKYEFDYAVKQLQKQVPDTTRQEAKEALVSAAKQISPSEGREKIMRAARKVLRD*
Draft_106685481 | 3300001605 | Hydrocarbon Resource Environments | GGEYGLMGSASINETHHRGGKTMADDKKNRGNPDRNLISFKEKYEFDYAVKQLQKQVPDTTRQDAKAALTAAAKQISPSEGREKIMRAARKVLRD*
AADWTP_100305977 | 3300002461 | Freshwater | MADDKKNIGKPDRNLISFKEKYEFNYAAKQLQNQVADTTRQEAKDALTAAAKKVSPSEGREKIMRAARKILRD*
C687J35174_107932661 | 3300002502 | Soil | MADDKNKKKEDANLISFKENYEVYYAVNQLKKEFPDETKADIKEALFDAAKQVKPSEGREKIMRLVRKDLKD*
P32013IDBA_10184643 | 3300003313 | Ore Pile And Mine Drainage Contaminated Soil | VADDKSKTGTPDRNLISFKQKYEVDYAVRQLQKRFPEETRAAVKEALFSAAKKVSPSEGREKVMRAASKKLG*
soilL2_102071081 | 3300003319 | Sugarcane Root And Bulk Soil | MADDKKNVGNPDRNLISFKEKYEFDYAAKQLQKQVPDTTRQEAKEALTKAARQVSPSEGREKVMRAARKILKD*
rootH2_101869641 | 3300003320 | Sugarcane Root And Bulk Soil | MADDKKKVGRQDDNLISFKQKYEFAYAQATKNKFPDETKQEVKDALTDAARKISPSEGREKIMRQARKNQGLVLSQLYK
Ga0066639_103472052 | 3300004210 | Groundwater | MADNKNKVKEDRNLISFKENYEVYYAVNQLKKQFPDETKSNIKEALFDAAKQVSPSEGREKIMRLTRKELNS*
Ga0070680_1000066112 | 3300005336 | Corn Rhizosphere | MADDKKKVGQPDRNLISFKEKYEFDYAAKQLQKQVPDTTKQEARNALIKAARQVSPSEGREKVMRAARKILKD*
Ga0070680_1007598881 | 3300005336 | Corn Rhizosphere | DRNLISFKEKYEFDYAVRQLQKQVSDTTKQEARNALTKAARQVSPSEGREKVMRAARKILKD*
Ga0070682_1006187712 | 3300005337 | Corn Rhizosphere | MADNKQKRGTPDRNLISFREKYEFNYAVKQLQKQVSNTTRQEAKDALTAAAKKISPSEGREKIMRAARKKLRT*
Ga0070682_1010283462 | 3300005337 | Corn Rhizosphere | MIRNKRGAPDRNLISFREKYEFEYALKQLQKQVPESSNHAAKEALTEAARKVSPSEGREKIMRALPAKS*
Ga0070700_1004801561 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | KYEFDYAVKQLQKQVPDTTKQEAKQALTEAAKKISPSEGREKIMRAARKKLRG*
Ga0070663_1002569201 | 3300005455 | Corn Rhizosphere | MADDKKKVGQPDRNLISFEEKYEFDYAVRQLQKQVSDTTKQEARNALTKAARQVSPSEGREKVMRAARKILKD*
Ga0070681_100350252 | 3300005458 | Corn Rhizosphere | MADDKKKVGQPDRNLISFKEKYEFDYAVRQLQKQVSDTTKQEARNALTKAARQVSPSEGREKVMRAARKILKD*
Ga0070699_1014768282 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MSDNKKNIGKPDRNLISFKENYEVNYAVNQLKKQFPEETKQDTKDALFQAAKKVEPSEGREKIMREARKKLRD*
Ga0068853_1000145464 | 3300005539 | Corn Rhizosphere | MDTPMPDDKRNIGKPDRNLISFKEKYEFNYALKQLQKQFPDETKQEVKDALTSAAKKVAPSEGREKIMQQARKNLRE*
Ga0070686_1010043951 | 3300005544 | Switchgrass Rhizosphere | MTDNKKKTGKPDSYLISFKQKYEFDYAAKQLQKQVKGTTRQEAKDALTAAAKKISPSEGREKIMRAARKKLRS*
Ga0075367_101538251 | 3300006178 | Populus Endosphere | MSDNKQNRGNPDRNLISFKQKYEFDYAVKQLQKQVPDTTRQEAKDALTKAARKISPSEGREKIMRAALKDLR
Ga0075370_109323832 | 3300006353 | Populus Endosphere | SDNKQNRGNPDRNLISFKQKYEFDYAVKQLQKQVPDTTRQEAKDALTKAARKISPSEGREKIMRAALKDLRD*
Ga0069787_1011700016 | 3300006417 | Enhanced Biological Phosphorus Removal Bioreactor | MADDKTKVGRQDDNLISFKQKYEVDYAVNQLKKAFPDESRKDVKEALFKAARENSPSEGREKIMRAARKNLRG*
Ga0075431_1013064332 | 3300006847 | Populus Rhizosphere | MADDKKNVGSPDRNLISFKEKYEFDYAAKQLQKQVPDITRQEAKEALTRAARQVSPSEGREKVMRAARKILKS*
Ga0075434_1002703753 | 3300006871 | Populus Rhizosphere | MVDNKKKTGKPDSYLISFKEKYEFNYAVKQLQKQVAGTTRQEAKDALTTAAKKISPSEGREKIMRRARKILRD*
Ga0104751_12207761 | 3300007351 | Deep Subsurface Aquifer | MADNKKNTGKPDSYLISFKQKYEFNYAVKQLQNQVSDTTRQEAKEALTKAATKVSPSEGREKVMRAARKIL
Ga0115930_100032739 | 3300008886 | Micrasterias Crux-Melitensis (Mzch 98) Associated | MADDKSKRGKPDSYLISFKEKYEFEYAAKQLQKQVAGTTRQEARDALTVAAKKISPSEGREKIMRAARKNLRD*
Ga0103682_100184441 | 3300009031 | Groundwater | MADNKNKIQEDKNLISFKQKYEVDYAIKQLKKQFPNETKIDVKEALFDAAKQVKPSEGREKIMRIARKELRD*
Ga0105240_104208492 | 3300009093 | Corn Rhizosphere | MDTPMPDDKRNIGKPDRNLISFKEKYEFNYALKQLQKQIPDETKQEVKDALTSAAKKVAPSEGREKIMQQARKNLRE*
Ga0105247_113636171 | 3300009101 | Switchgrass Rhizosphere | GRPDSYLISFKQKYEFNYAVKQLQKQVEGTTRQQAKDALTAAAKKISPSEGREKIMRAARKRLRS*
Ga0111538_118223302 | 3300009156 | Populus Rhizosphere | LISFKEKYEFDYAAKQLQKQVPDTTRQEAKDALTAVAKKVSPSEGREKIMRAARKILRD*
Ga0116171_103708912 | 3300009692 | Anaerobic Digestor Sludge | MKVKKDMADDKIKKKEDANLISFKENYEVNYAVNQLKKEFPDESKQDIKDALFDAAKKVKPGEGREKIMQLARKNLK*
Ga0126311_1000274712 | 3300010045 | Serpentine Soil | MRMTDNKKKTGKPDSYLISFKQKYEFNYAVKQLQKQVEGTTRQQAKDALTTAAKKISPSEGREKIMRAARKRLRS*
Ga0116237_100909066 | 3300010356 | Anaerobic Digestor Sludge | MVDDKNKIKEDKNLISFKEKWEFDYAVNQLQKQVPDTTKQEAKDALIAAAKKISPSEGRDKIMRQARKNLKD*
Ga0134125_106206282 | 3300010371 | Terrestrial Soil | MADNKQKRGTPDRNLISFREKYEFNYAVKQLQKQVSNTTRQEAKDALTAAAKKISPSEGREKILRAARKKLRT*
Ga0134128_1000281315 | 3300010373 | Terrestrial Soil | MADDKKNVGQPDRNLISFKEKYEFDYAAKQLQKQVADTTKQEARDALTKAARQVSPSEGREKVMRAARKILKD*
Ga0134124_113138292 | 3300010397 | Terrestrial Soil | MADNKKNIGKPDRNLINFKENYEVYYAVNQLKKQFPDETKTDIKEALFDAAKKVAPSEGREKVMKIARKILQD*
Ga0134124_118228442 | 3300010397 | Terrestrial Soil | MGDNKQKRGNPDRYLISFKEKYEFDYAAKQLQKQVPETTRQEARDALSAVAKKISPSEGREKIMRAARKRLRD*
Ga0134124_125906521 | 3300010397 | Terrestrial Soil | MGDDKKKTGTPDRNLISFKEKYEFDYAVKQLQKQVPDTTKQEAKQALTEAAKKISPSEGREKIMRAARKKLRS*
Ga0137363_100433871 | 3300012202 | Vadose Zone Soil | MAMSDNKKNIGKPDRNLISFKENYEVDYAVKQLQKQVPDTTKQEAKDALFAAAKKVAPSEGREKIMRLARKALQA*
Ga0137363_103857052 | 3300012202 | Vadose Zone Soil | MTDNKKNIGNPDRYLISFKEKYEVNYAVTQLQKQVPGTTRPEAKEALFEAAKQIDPSRGREKIMAAARKDLRS*
Ga0137362_108031211 | 3300012205 | Vadose Zone Soil | MGDNKKNTGKPDRNLISFKEKYEFNYAVKQLQKQVPDTTRQEAKDALSAAARRISPSEGREKIMRAARKILK*
Ga0150985_1068457171 | 3300012212 | Avena Fatua Rhizosphere | KTKRGAPDRNLINFKEKYEFDYAVKQLQNQVPDTTKQEAKEALTEAARKISPSEGREKIMREARKTLKD*
Ga0150985_1195472223 | 3300012212 | Avena Fatua Rhizosphere | MGDDKTKRGTPDRNLISFKENYEFDYAVRQLQNQVPDTTKQQAKEALTEAARKTSPSEGREKIMRVARKILKE*
Ga0150984_1038304291 | 3300012469 | Avena Fatua Rhizosphere | DKTKRGSPDRNLINFKEKYEFNYAVRQLQNQVPDTTKQDAKQALTEAAKKISPSEGREKIMREARKILKD*
Ga0150984_1090322881 | 3300012469 | Avena Fatua Rhizosphere | PDRNLINFKEKYEFDYAVKQLQNQVPETTKQAAREALTDAARKISPSEGREKIMREARKILKD*
Ga0150984_1153257411 | 3300012469 | Avena Fatua Rhizosphere | DKTKRGTPDRNLISFKENYEFDYAVRQLQNQVPDTTKQQAKEALTEAARKTSPSEGREKIMRVARKILKE*
Ga0137397_102003573 | 3300012685 | Vadose Zone Soil | MADDKTKRGKEDRNLISFKEKYEVDYAVNQLKKEFPDETKTDIKESLFDAAKQVSPSEGREKIMRLARKDLND*
Ga0164241_1000002719 | 3300012943 | Soil | MADDKKNVGHPDRNLISFKQRYEFDYAAKQLQKQVADTTRQEARDALTEAARLVSPSEGREKVMRAARKILKD*
Ga0164241_1000428014 | 3300012943 | Soil | MADDKTKVGQPDRNLISFKEKYEFDYAAKQLQKQVPDTTRQEARDALTEAARRVSPSEGREKVMRAARKVLKS*
Ga0164241_104415922 | 3300012943 | Soil | MSDDKTKRGTPDRNLINFKEKYEFSYAVKQLQKQVPETTKQHAKDALVEAARKISPSEGREKIMRAARKILND*
Ga0164304_106197891 | 3300012986 | Soil | MADDKKKQKEDRNLISFKEKYEVDYAVNQLKKQFPDQTKTDVKEALFDAAKKVAPSEGREKIMRLARKTLKD*
(restricted) Ga0172374_10558743 | 3300013122 | Freshwater | QVAIIHRVKGCNYYLILIKMADNKNKTKEDRNLISFKQNYEVYYAVNQLKKQFPDETKIEIKEILFDAARKVSPSEGREKIMRLARKELKN*
(restricted) Ga0172368_100215204 | 3300013123 | Freshwater | VAIIHRVKGCNYYLILIKMADNKNKTKEDRNLISFKQNYEVYYAVNQLKKQFPDETKIEIKEILFDAARKVSPSEGREKIMRLARKELKN*
Ga0157378_121005902 | 3300013297 | Miscanthus Rhizosphere | MADNKKKTGKPDSYLISFKEKYEFDYAANQLQKQVPDTTKQEAKNALTEAAKKVSPSEGREKVMRAARKILRS*
Ga0120158_102859812 | 3300013772 | Permafrost | MADDKKNIGKPDRNLISFKEKYEFNYAVNQLQKQIPDTTKQESKDALNKAAKQISPSEGREKIMRAARKNLRS*
Ga0119868_10179271 | 3300014059 | Activated Sludge | RNLISFKQKYEFDYAVKQLQKQVPDTTKQEAKEALTKAAKKISPSEGRAKIMRVAKKDLRS*
Ga0172379_1001180212 | 3300014208 | Groundwater | MADNKKNIGKPDRNLISFKENYEVYYAVNQLQKQIPGTTKREAKDALFKAAKKTSPSEGREKIMRIARKDLRN*
Ga0172379_106347032 | 3300014208 | Groundwater | MDIMADDKNKIKEDRNLISFKENYEVYYAVNQLKKQFPDETKQAIKEALFEAAKQVKPSEGREKIMRIVRKDLRD*
Ga0182030_102257942 | 3300014838 | Bog | MSDNKKKTGKPDSYLISFKEKYEVDYAVKQLQKQFPDEKKQVVKAALIKAATEVKPSDGREKIMKQARKDLKS*
Ga0157379_111371962 | 3300014968 | Switchgrass Rhizosphere | MTDNKKKTGKPDSYLISFKEKYEVDYAVKQLQKEFPNEAKQKVKDALIEAAKQIDPSDGREKIMRQARKDLKD*
Ga0182742_10023968 | 3300017651 | Compost | MADDKKKTGNPDRNLIAFNQKYEFNYAVRQLQKQIPDTTRQEAKEALTKAARQVSPSEGREKVMRAARKILKD
Ga0190272_110273622 | 3300018429 | Soil | MADNKKKIGVPDRNLISFKEKYEFDYAVKQVQKKAPDGTTKQEAKDALTAAAKKISPSEGRKKIMREALKRLKD
Ga0190271_106164102 | 3300018481 | Soil | MADDKKNVGQPDRNLISFKQKYEFNYAAAQLQKQVPDTTRQEAKDALTKAARQVTPSEGREKVMRAARKFLKD
Ga0187894_100696012 | 3300019360 | Microbial Mat On Rocks | MSDDKSKVGKPDRYLISFKQKYEFDYAVNQLRKKFPEENQTAVKKALTEAAKKVSPSEGREKVMREARRKLRD
Ga0197853_104151345 | 3300019775 | Enriched Sediment | MADDKKKVGRQDDNLIAFKQRYEFDYAAKQLQKQVVGATRQEAKEALTKAAKKVSPSEGREKVMREARKILRD
Ga0193752_10067687 | 3300020027 | Soil | MSDDKRNIGKPDRNLISFNQKYEFDYAVGQLQKQFPSETKQEVKDALTEAARRVSPSEGREKIMRAARKNLRD
Ga0163152_102094852 | 3300020213 | Freshwater Microbial Mat | MADDKKNIGKPDRNLISFKEKYEVNYAVNQLQKQIPDTTKQEAKDALFKAAKQISPSEGREKIMRAARKDLRS
Ga0196963_101879642 | 3300020215 | Soil | MTDNKKKIGKPDRNLISFKENWEFSYAIRQLLVKNPRATKESVKDALTKAARKVEPSEG
Ga0211553_100267862 | 3300020415 | Marine | MADNKSKRGKPDSYLISFKERYEVDYAVKQLQKQFSTKTKTEVKKALTKAASEIDPSRGRERIMNGARKILKK
Ga0211544_101894622 | 3300020443 | Marine | MADNKSKRGKPDSYLISFKERYEVDYAVKQLQKQFPAKTKTEVKKALTKAANEIDPSRGRERIMKGARKLIKK
Ga0180223_10014289 | 3300020591 | Compost | MADDKQNRGSPDRNLISFKQRYEFNYAVKQLQKQVSDTTRQEAKDALTAAAKANSPSEGREKIMRAARKILRD
Ga0194044_100011619 | 3300021074 | Anoxic Zone Freshwater | MADNKNMVKEDRNLISFKENYEVYYAVNQLKKQFPDETKADIKEVLFGAAKQVSPSEGREKIMRLTRKELNS
Ga0193706_10133532 | 3300021339 | Soil | MADDKKNVGQPDRNLISFKEKYEFDYAARQLQKQFPGETKQDVKEALTDAAKKVSPSEGREKIMRQARKNLRD
Ga0193699_104638021 | 3300021363 | Soil | MADDKKNVGQPDRNLISFKEKYEFDYAVRQLKTQFPDETKQDVKEALTEAAKKVSPSEGREKIMRQARKN
Ga0210397_113575122 | 3300021403 | Soil | REYLCYNIFYMSDNKKNTGKPDSYLISFKEKYEVNYAVNQLQKEFPNEKKQVVKDALFEAAKKVNPSDGRENIMRQARKNLKN
Ga0194054_102510312 | 3300021605 | Anoxic Zone Freshwater | MSDNKNKVGWQDDNLISFKQDYEVNYAVKQLQKQFPEETKNEVKKALFDAAKKISPSEGREKIMKLARKNLRN
Ga0212091_104691121 | 3300022549 | Groundwater | MADDKKKTGAPDRNLISFKQKYEFDYAVGQLRKQFPNEPRQEVKDALTAAATRISPSEGREKIMREARKRLKD
Ga0136601_100083611 | 3300023049 | Saline Lake | MSDNKKNTGNPDRYLISFKEKYEVEYAIKQLQKKFPDKTRDKITKALNDAAKEVDPSRGREKIMIKARKNLR
Ga0247784_100004733 | 3300023270 | Plant Litter | MADDKTKTGKEDDNLISFKQKYEVDYAVKQLQKQVPDTTRREAKDALFEAAKKTSPSEGREKVMRAARKILKD
Ga0179591_11465598 | 3300024347 | Vadose Zone Soil | MADDKKNIGKPDRNLISFKEKYEFDYAVKQLQNQVSDTTRQEAKDALTAAAKKISPSEGREKIMRAARKQLRD
Ga0209210_10302231 | 3300025015 | Contaminated Groundwater | MADDKKKVKEDRNLISFKEHYEVYYAVNQLKKQFPDETKADIKDALFDAAKQVKPSEGREKIMRLARKDLRD
Ga0209541_101009813 | 3300025317 | Groundwater | MADDKNKVKEDRNLISFKENYEVYYAVNQLKKQFPDETKSDIKEALFDAAKQVSPSEGREKIMRLTRKELNS
Ga0209542_105436101 | 3300025323 | Soil | MADDKNKKKEDANLISFKENYEVYYAVNQLKKEFPDETKADIKEALFDAAKQVKPSEGREKIMRLVRKDLKD
Ga0207952_10449672 | 3300025536 | Serpentinite Rock And Fluid | MADDKKKVKEDRNLISFKENYEVNYAVNQLKKQFPDETKVDIKEALFDAAKQVSPSEGREKIMRLARKDLRD
Ga0207707_100048986 | 3300025912 | Corn Rhizosphere | MADDKKKVGQPDRNLISFKEKYEFDYAVRQLQKQVSDTTKQEARNALTKAARQVSPSEGREKVMRAARKILKD
Ga0207639_100087255 | 3300026041 | Corn Rhizosphere | MDTPMPDDKRNIGKPDRNLISFKEKYEFNYALKQLQKQFPDETKQEVKDALTSAAKKVAPSEGREKIMQQARKNLRE
Ga0209726_1000028912 | 3300027815 | Groundwater | MADNKNKTGKPDIYLISFKEAYEVNYAVKQLQKQFPEKTKQVVKEALFEAAKIVDPSEGREKVMKQARKNLRD
Ga0209726_102770032 | 3300027815 | Groundwater | MADDKKKIGVPDRNLISFKERYEFDYAVKQLQKQVPEATRQAAKEALTDAAKKVSPSEGREKIMRQARKNLRD
Ga0209514_1000862811 | 3300027819 | Groundwater | MADDKTKRGVPDRNLISFKEKYEFDYAVKQLQNQVADTTRQEAKDALTAAAKKISPSEGREKIMREAKKNLRD
Ga0209514_100294605 | 3300027819 | Groundwater | MSDDKKKVKEDRNLISFKENYEVDYAVNQLKKQFPDETKKEIKEALFDAAKQVSPSEGREKIMRIARKDLKD
Ga0209514_100472404 | 3300027819 | Groundwater | MADNKKKIKEDANLISFKENYEVYYAVNQLKKQFPDETKADIKDALFDAAKQVKPSEGREKIMRLARKELND
Ga0209515_100217677 | 3300027835 | Groundwater | MADNKNKVKEDRNLISFKENYEVYYAVNQLKKQFTDETKADIKEALFDAAKQVSPSEGREKIMRLARKELNN
Ga0209515_100433317 | 3300027835 | Groundwater | MTDNKKKTGKPDSYLISFKERYEFDYAVGQLQKQVSDTTKQEAKDALTKAAKKVNPSEGREKIMRAARKILNS
Ga0209515_101623762 | 3300027835 | Groundwater | MLTSMADDKKNIGKPDRNLISFKEKWEFDYAIRQLHVQDPDGTKQQAKNALTKAAKRIEPSEGRKRIMKEALKILKDN
Ga0209382_110670473 | 3300027909 | Populus Rhizosphere | MADDKKNVGSPDRNLISFKEKYEFDYAAKQLQKQVPDITRQEAKEALTRAARQVSPSEGREKVMRAARKILKN
Ga0265292_100025274 | 3300028028 | Landfill Leachate | MADDKTKIGKRDRNLISFKQKYEVAYAVNQLKKQFPDETKQDIKDVLFEAAKKISPSEGRDKIMRLARKNLQN
Ga0302292_11803132 | 3300028738 | Fen | MSDNKNKIGRQDRNLISFKEKYEVDYAVRQLQKQFPDEAKQDVKKALIKAAIAVNPSEGREKIMRQARKNLK
Ga0265297_102531772 | 3300029288 | Landfill Leachate | MSDNKKNIGKPDRNLISFKERYEFDYAVKQLQKKFPDETKQEVKDALTKAAKKVEPSEGREKIMREARKNLR
Ga0311365_107057671 | 3300029989 | Fen | MADDKKNVGQPDRSLISFKEKYEFNYAAKQLQKQVPDTNKQEARDALTKAARQVSPSEGREKVMRAARKILRD
Ga0311364_106319171 | 3300031521 | Fen | MADNKKNIGKPDRNLISFKEKYEFDYAVKQLQKQVSNTTKQEAKDALVAAAKAVSPSEGREKIMRAALKKLKN
Ga0307468_1000109382 | 3300031740 | Hardwood Forest Soil | MADDKKNVGPADRHLISFKQKYEFDYAARQLQKQVPDATRQEARNALARAARQESPSEGRERIMRAARKILRA
Ga0214473_114841101 | 3300031949 | Soil | ISFRQKYEFDYAVRQLQKQFQDTTRQEAKDALTEAARRISPSEGREKIMRAARKTLRN
Ga0315910_1000394111 | 3300032144 | Soil | MADDKKKIGREDANVISFKQKYEVDYAVNQLQKAFPNETKKDVKEALFEAARKISPSEGREKIMRAARKNLRD
Ga0315912_112956261 | 3300032157 | Soil | MADNKNKTGSPDRNLISFKQKYEFDYAVKQVQNKGPEGTTKQEAKDALTAAARKISPSEGRDKIMRDALKRLK
Ga0315268_1000156324 | 3300032173 | Sediment | MTDDKKNIGKPDRNLISFKEKYEVDYAVNQLQKQVPGTTKQEAREALFDAARKVSPSEGREKIMRAARKDLRD
Ga0335085_10000076296 | 3300032770 | Soil | MTDNKKNRGNPDRYLISFKEKYEFRYAAKQVQKKAPQGTTLQKATAALTAAAKKISPSEGRKKIMRAALKILKK
Ga0335073_105729711 | 3300033134 | Soil | MADDKKNRGNPDRYLISFKEKWEFAYAVKQLKKKAADATTQEAKDALTAAARKISPSEGRKKIMREALKKLKG
Ga0334722_1000101628 | 3300033233 | Sediment | LATLFNHLTLRYDCYMTDDKKNIGKPDRNLISFKEKYEVDYAVNQLQKQVPGTTKQEAREALFDAARKVSPSEGREKIMRAARKDLRD
Ga0370498_032206_352_573 | 3300034155 | Untreated Peat Soil | MADDKTKIGKPDRNLISFKERYEFDYAVWQLQKQFPDETKKDVKDALTAAARKVSPSEGREKIMRQARKSLRD




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice