NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F074798


Overview

Basic Information
Family ID F074798
Family Type Metagenome / Metatranscriptome
Number of Sequences 119
Average Sequence Length 47 residues
Representative Sequence MVHGRYGEYKVLVDGEVVIDGGPLTALGVVPASRKVVEAVRARLSR
Number of Associated Samples 105
Number of Associated Scaffolds 119
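
The representative sequence listed above is 46 residues long, one residue short of the family average of 47. A minimal Python sketch for checking the length and tallying residue composition follows; the variable names are illustrative and are not part of NMPFamsDB.

```python
from collections import Counter

# Representative sequence of family F074798, copied from the Basic Information block.
REPRESENTATIVE = "MVHGRYGEYKVLVDGEVVIDGGPLTALGVVPASRKVVEAVRARLSR"

# Sequence length in residues.
length = len(REPRESENTATIVE)

# Count how often each amino acid (one-letter code) occurs.
composition = Counter(REPRESENTATIVE)

print(length)
print(composition.most_common(3))
```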

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 62.61 %
% of genes near scaffold ends (potentially truncated) 36.97 %
% of genes from short scaffolds (< 2000 bps) 79.83 %
Associated GOLD sequencing projects 99
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.58


Most Common Taxonomy
Group Bacteria (66.387 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere
(12.605 % of family members)
Environment Ontology (ENVO) Unclassified
(28.571 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(45.378 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 24.32%, β-sheet: 12.16%, Coil/Unstructured: 63.51%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.58

Low Quality Model:

This family has a low-confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
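
The confidence rules on this page (four pLDDT bands and a pTM cutoff of 0.7) can be sketched in Python as below. The function names are hypothetical, and the handling of non-integer scores that fall between two bands (for example 50.5) is an assumption, since the page lists integer band edges only.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to one of the four listed bands."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    # Assumption: a fractional score between two bands goes to the upper band.
    if score <= 50:
        return "0-50"
    if score <= 70:
        return "51-70"
    if score <= 90:
        return "71-90"
    return "91-100"


def is_low_confidence(ptm: float, cutoff: float = 0.7) -> bool:
    """Return True when a model falls under the page's 'pTM < 0.7' rule."""
    return ptm < cutoff


# The model for family F074798 has a pTM-score of 0.58.
print(is_low_confidence(0.58))
```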



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 119 Family Scaffolds
PF03576 | Peptidase_S58 | 2.52
PF01989 | AcnX_swivel_put | 2.52
PF00472 | RF-1 | 1.68
PF13379 | NMT1_2 | 1.68
PF03551 | PadR | 1.68
PF01738 | DLH | 1.68
PF13620 | CarboxypepD_reg | 1.68
PF03473 | MOSC | 1.68
PF00144 | Beta-lactamase | 1.68
PF07676 | PD40 | 1.68
PF14534 | DUF4440 | 1.68
PF12704 | MacB_PCD | 1.68
PF02687 | FtsX | 1.68
PF06197 | DUF998 | 1.68
PF07721 | TPR_4 | 1.68
PF04235 | DUF418 | 1.68
PF13564 | DoxX_2 | 0.84
PF00449 | Urease_alpha | 0.84
PF00199 | Catalase | 0.84
PF00583 | Acetyltransf_1 | 0.84
PF11918 | Peptidase_S41_N | 0.84
PF13485 | Peptidase_MA_2 | 0.84
PF12441 | CopG_antitoxin | 0.84
PF00903 | Glyoxalase | 0.84
PF03572 | Peptidase_S41 | 0.84
PF08837 | DUF1810 | 0.84
PF01494 | FAD_binding_3 | 0.84
PF01609 | DDE_Tnp_1 | 0.84
PF04191 | PEMT | 0.84
PF13420 | Acetyltransf_4 | 0.84
PF04140 | ICMT | 0.84
PF13614 | AAA_31 | 0.84
PF01381 | HTH_3 | 0.84
PF00701 | DHDPS | 0.84
PF02517 | Rce1-like | 0.84
PF02896 | PEP-utilizers_C | 0.84
PF07606 | DUF1569 | 0.84
PF13520 | AA_permease_2 | 0.84
PF13649 | Methyltransf_25 | 0.84
PF00753 | Lactamase_B | 0.84
PF12697 | Abhydrolase_6 | 0.84
PF02861 | Clp_N | 0.84
PF08734 | GYD | 0.84
PF10502 | Peptidase_S26 | 0.84
PF07927 | HicA_toxin | 0.84
PF06826 | Asp-Al_Ex | 0.84
PF10604 | Polyketide_cyc2 | 0.84
PF00782 | DSPc | 0.84
PF09285 | Elong-fact-P_C | 0.84
PF12833 | HTH_18 | 0.84
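
The frequencies above are percentages of the 119 family scaffolds, so each value should correspond to a whole number of scaffolds. A small Python sketch of that conversion is given here; it assumes the denominator is exactly 119, as the column header states, and the function name is illustrative.

```python
def scaffolds_from_percent(percent: float, total: int = 119) -> int:
    """Convert a '% Frequency' value back to a whole number of scaffolds."""
    # The page rounds percentages to two decimals, so round to the nearest integer.
    return round(percent / 100 * total)


# The three distinct frequency values that occur in the Pfam table.
for pct in (2.52, 1.68, 0.84):
    print(pct, scaffolds_from_percent(pct))
```

Under that assumption, the three frequency levels in the table correspond to 3, 2 and 1 scaffolds respectively.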

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 119 Family Scaffolds
COG3191 | L-aminopeptidase/D-esterase | Amino acid transport and metabolism [E] | 5.04
COG1786 | Mevalonate 5-phosphate dehydratase subunit 2, swiveling domain (modified mevalonate pathway) | Lipid transport and metabolism [I] | 2.52
COG0216 | Protein chain release factor RF1 | Translation, ribosomal structure and biogenesis [J] | 1.68
COG0329 | 4-hydroxy-tetrahydrodipicolinate synthase/N-acetylneuraminate lyase | Cell wall/membrane/envelope biogenesis [M] | 1.68
COG0654 | 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases | Energy production and conversion [C] | 1.68
COG1186 | Protein chain release factor PrfB | Translation, ribosomal structure and biogenesis [J] | 1.68
COG1680 | CubicO group peptidase, beta-lactamase class C family | Defense mechanisms [V] | 1.68
COG1686 | D-alanyl-D-alanine carboxypeptidase | Cell wall/membrane/envelope biogenesis [M] | 1.68
COG1695 | DNA-binding transcriptional regulator, PadR family | Transcription [K] | 1.68
COG1733 | DNA-binding transcriptional regulator, HxlR family | Transcription [K] | 1.68
COG1846 | DNA-binding transcriptional regulator, MarR family | Transcription [K] | 1.68
COG2311 | Uncharacterized membrane protein YeiB | Function unknown [S] | 1.68
COG2367 | Beta-lactamase class A | Defense mechanisms [V] | 1.68
COG3371 | Uncharacterized membrane protein | Function unknown [S] | 1.68
COG0542 | ATP-dependent Clp protease, ATP-binding subunit ClpA | Posttranslational modification, protein turnover, chaperones [O] | 0.84
COG0578 | Glycerol-3-phosphate dehydrogenase | Energy production and conversion [C] | 0.84
COG0644 | Dehydrogenase (flavoprotein) | Energy production and conversion [C] | 0.84
COG0665 | Glycine/D-amino acid oxidase (deaminating) | Amino acid transport and metabolism [E] | 0.84
COG0753 | Catalase | Inorganic ion transport and metabolism [P] | 0.84
COG0793 | C-terminal processing protease CtpA/Prc, contains a PDZ domain | Posttranslational modification, protein turnover, chaperones [O] | 0.84
COG0804 | Urease alpha subunit | Amino acid transport and metabolism [E] | 0.84
COG1266 | Membrane protease YdiL, CAAX protease family | Posttranslational modification, protein turnover, chaperones [O] | 0.84
COG1724 | Predicted RNA binding protein YcfA, dsRBD-like fold, HicA-like mRNA interferase family | General function prediction only [R] | 0.84
COG2985 | Uncharacterized membrane protein YbjL, putative transporter | General function prediction only [R] | 0.84
COG3039 | Transposase and inactivated derivatives, IS5 family | Mobilome: prophages, transposons [X] | 0.84
COG3293 | Transposase | Mobilome: prophages, transposons [X] | 0.84
COG3385 | IS4 transposase InsG | Mobilome: prophages, transposons [X] | 0.84
COG4274 | Uncharacterized conserved protein, contains GYD domain | Function unknown [S] | 0.84
COG4449 | Predicted protease, Abi (CAAX) family | General function prediction only [R] | 0.84
COG5421 | Transposase | Mobilome: prophages, transposons [X] | 0.84
COG5433 | Predicted transposase YbfD/YdcC associated with H repeats | Mobilome: prophages, transposons [X] | 0.84
COG5579 | Uncharacterized conserved protein, DUF1810 family | Function unknown [S] | 0.84
COG5659 | SRSO17 transposase | Mobilome: prophages, transposons [X] | 0.84



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 66.39 %
Unclassified | root | N/A | 33.61 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300000953|JGI11615J12901_10067307 | All Organisms → cellular organisms → Bacteria | 6437
3300004019|Ga0055439_10260096 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium | 567
3300004114|Ga0062593_100589273 | All Organisms → cellular organisms → Bacteria | 1057
3300004114|Ga0062593_100894962 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 896
3300004114|Ga0062593_101814375 | Not Available | 671
3300004156|Ga0062589_100646538 | All Organisms → cellular organisms → Bacteria | 928
3300004156|Ga0062589_101193242 | All Organisms → cellular organisms → Bacteria | 728
3300004157|Ga0062590_101032530 | All Organisms → cellular organisms → Bacteria | 785
3300004463|Ga0063356_100544821 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1550
3300004463|Ga0063356_101092443 | Not Available | 1148
3300004463|Ga0063356_103038985 | Not Available | 723
3300004643|Ga0062591_100183431 | Not Available | 1512
3300004643|Ga0062591_100184520 | All Organisms → cellular organisms → Bacteria | 1509
3300004643|Ga0062591_103002352 | Not Available | 501
3300005294|Ga0065705_10265910 | All Organisms → cellular organisms → Bacteria | 1146
3300005328|Ga0070676_11427022 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 531
3300005332|Ga0066388_101714783 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → Blastocatellales → Pyrinomonadaceae → unclassified Pyrinomonadaceae → Pyrinomonadaceae bacterium | 1112
3300005334|Ga0068869_100129388 | All Organisms → cellular organisms → Bacteria | 1939
3300005340|Ga0070689_100079258 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 2576
3300005345|Ga0070692_10204833 | Not Available | 1158
3300005354|Ga0070675_100273781 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1482
3300005406|Ga0070703_10044890 | All Organisms → cellular organisms → Bacteria | 1391
3300005444|Ga0070694_101064659 | Not Available | 673
3300005471|Ga0070698_100409986 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_12_FULL_67_14b | 1289
3300005536|Ga0070697_102076596 | Not Available | 509
3300005546|Ga0070696_101211991 | Not Available | 638
3300005549|Ga0070704_100737973 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 876
3300005566|Ga0066693_10139884 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micrococcales → Micrococcaceae → Arthrobacter → unclassified Arthrobacter → Arthrobacter sp. KBS0703 | 903
3300005719|Ga0068861_102101097 | All Organisms → cellular organisms → Bacteria | 565
3300005843|Ga0068860_102768117 | Not Available | 509
3300006046|Ga0066652_100054724 | All Organisms → cellular organisms → Bacteria | 3026
3300006049|Ga0075417_10518679 | All Organisms → cellular organisms → Bacteria | 600
3300006640|Ga0075527_10107405 | Not Available | 774
3300006845|Ga0075421_100448940 | All Organisms → cellular organisms → Bacteria | 1541
3300006845|Ga0075421_101168563 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 860
3300006847|Ga0075431_102199203 | Not Available | 506
3300006852|Ga0075433_10058167 | All Organisms → cellular organisms → Bacteria | 3381
3300006852|Ga0075433_10104801 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2505
3300006852|Ga0075433_11272284 | All Organisms → cellular organisms → Bacteria | 638
3300006854|Ga0075425_100743988 | All Organisms → cellular organisms → Bacteria | 1125
3300006854|Ga0075425_101333532 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → Blastocatellales → Pyrinomonadaceae → unclassified Pyrinomonadaceae → Pyrinomonadaceae bacterium | 813
3300006880|Ga0075429_100922358 | Not Available | 764
3300006894|Ga0079215_10121030 | All Organisms → cellular organisms → Bacteria | 1194
3300006904|Ga0075424_100477790 | All Organisms → cellular organisms → Bacteria | 1331
3300007076|Ga0075435_100349472 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1267
3300007076|Ga0075435_100698561 | All Organisms → cellular organisms → Bacteria | 882
3300009091|Ga0102851_13515306 | Not Available | 503
3300009147|Ga0114129_12288176 | All Organisms → cellular organisms → Bacteria | 649
3300009177|Ga0105248_12995893 | Not Available | 538
3300009609|Ga0105347_1002108 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 7994
3300009840|Ga0126313_10893375 | Not Available | 725
3300010045|Ga0126311_10593065 | Not Available | 876
3300010166|Ga0126306_10597064 | All Organisms → cellular organisms → Bacteria | 880
3300010293|Ga0116204_1295736 | Not Available | 503
3300010373|Ga0134128_11823245 | Not Available | 669
3300010375|Ga0105239_13278432 | Not Available | 527
3300010399|Ga0134127_11314081 | Not Available | 792
3300012163|Ga0137355_1046053 | All Organisms → cellular organisms → Bacteria | 817
3300012173|Ga0137327_1071295 | All Organisms → cellular organisms → Bacteria | 771
3300012201|Ga0137365_10114858 | All Organisms → cellular organisms → Bacteria | 2024
3300012206|Ga0137380_10118135 | All Organisms → cellular organisms → Bacteria | 2423
3300012469|Ga0150984_100876618 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae → Gemmatimonas → unclassified Gemmatimonas → Gemmatimonas sp. SG8_38_2 | 737
3300012532|Ga0137373_10290657 | Not Available | 1303
3300012916|Ga0157310_10028927 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1493
3300012951|Ga0164300_10225589 | Not Available | 935
3300012961|Ga0164302_11059760 | Not Available | 637
3300014300|Ga0075321_1035405 | All Organisms → cellular organisms → Bacteria | 836
3300015371|Ga0132258_13001870 | All Organisms → cellular organisms → Bacteria | 1169
3300015374|Ga0132255_100877535 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1340
3300018051|Ga0184620_10106350 | All Organisms → cellular organisms → Bacteria | 871
3300018061|Ga0184619_10243656 | Not Available | 827
3300018075|Ga0184632_10470491 | Not Available | 519
3300019878|Ga0193715_1003669 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium AA13 | 3331
3300019879|Ga0193723_1014455 | All Organisms → cellular organisms → Bacteria | 2470
3300020003|Ga0193739_1088784 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 784
3300020068|Ga0184649_1484096 | All Organisms → cellular organisms → Bacteria | 855
3300021090|Ga0210377_10086219 | All Organisms → cellular organisms → Bacteria | 2114
3300021344|Ga0193719_10015223 | All Organisms → cellular organisms → Bacteria | 3260
3300021344|Ga0193719_10205349 | Not Available | 840
3300025115|Ga0209835_1180377 | Not Available | 503
3300025324|Ga0209640_10060036 | All Organisms → cellular organisms → Bacteria | 3296
3300025899|Ga0207642_10051347 | All Organisms → cellular organisms → Bacteria | 1865
3300025908|Ga0207643_11112577 | Not Available | 509
3300025918|Ga0207662_10386855 | Not Available | 946
3300025923|Ga0207681_10311967 | All Organisms → cellular organisms → Bacteria | 1248
3300025927|Ga0207687_11703535 | All Organisms → cellular organisms → Bacteria | 540
3300025933|Ga0207706_10346421 | All Organisms → cellular organisms → Bacteria | 1292
3300026075|Ga0207708_10330685 | All Organisms → cellular organisms → Bacteria | 1246
3300026142|Ga0207698_11613226 | Not Available | 664
3300027360|Ga0209969_1001865 | All Organisms → cellular organisms → Bacteria | 2902
3300027378|Ga0209981_1070881 | All Organisms → cellular organisms → Bacteria | 538
3300027513|Ga0208685_1009562 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Vicinamibacteria → Vicinamibacterales → Vicinamibacteraceae → Luteitalea → Luteitalea pratensis | 2386
3300027636|Ga0214469_1162893 | All Organisms → cellular organisms → Bacteria | 625
3300027682|Ga0209971_1033683 | All Organisms → cellular organisms → Bacteria | 1231
3300027717|Ga0209998_10009124 | All Organisms → cellular organisms → Bacteria | 2056
(restricted) 3300027799|Ga0233416_10038176 | All Organisms → cellular organisms → Bacteria | 1612
(restricted) 3300027799|Ga0233416_10043635 | Not Available | 1510
3300027886|Ga0209486_10271739 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 987
3300027909|Ga0209382_12184765 | All Organisms → cellular organisms → Bacteria | 524
3300028380|Ga0268265_10042388 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3377
3300028819|Ga0307296_10371670 | Not Available | 781
3300028824|Ga0307310_10079112 | All Organisms → cellular organisms → Bacteria | 1428
3300030620|Ga0302046_10128611 | All Organisms → cellular organisms → Bacteria | 2073
3300031228|Ga0299914_10208663 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1731
3300031538|Ga0310888_10093717 | Not Available | 1513
3300031562|Ga0310886_10212165 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Vicinamibacteria → Vicinamibacterales → Vicinamibacteraceae → Luteitalea → Luteitalea pratensis | 1063
3300031716|Ga0310813_11365105 | All Organisms → cellular organisms → Bacteria | 657
3300031731|Ga0307405_11099287 | All Organisms → cellular organisms → Bacteria | 683
3300031824|Ga0307413_11829524 | All Organisms → cellular organisms → Bacteria | 544
3300031854|Ga0310904_10075415 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Vicinamibacteria → Vicinamibacterales → Vicinamibacteraceae → Luteitalea → Luteitalea pratensis | 1775
3300031903|Ga0307407_10297419 | All Organisms → cellular organisms → Bacteria | 1124
3300031949|Ga0214473_11941683 | Not Available | 576
3300032075|Ga0310890_10265824 | Not Available | 1216
3300033407|Ga0214472_10012839 | All Organisms → cellular organisms → Bacteria | 8701
3300033475|Ga0310811_10112300 | All Organisms → cellular organisms → Bacteria | 3508
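
Each scaffold identifier in this table appears to join a numeric taxon OID and a scaffold name with a "|" character, and the taxon OID matches the first column of the Associated Samples table. The Python sketch below splits an identifier on that basis; the `ScaffoldRef` type and `parse_scaffold_ref` function are hypothetical helpers, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # numeric dataset identifier, e.g. "3300000953"
    scaffold_name: str  # scaffold name within that dataset
    restricted: bool    # True when the row carries a "(restricted)" marker


def parse_scaffold_ref(text: str) -> ScaffoldRef:
    """Split 'taxon_oid|scaffold_name', tolerating a '(restricted)' prefix."""
    text = text.strip()
    marker = "(restricted)"
    restricted = text.startswith(marker)
    if restricted:
        text = text[len(marker):].strip()
    taxon_oid, sep, scaffold_name = text.partition("|")
    # Assumption: the taxon OID is purely numeric, as in every row of the table.
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unrecognised scaffold identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_name, restricted)


print(parse_scaffold_ref("3300000953|JGI11615J12901_10067307"))
```

Keeping the taxon OID as a string avoids any loss of formatting and lets it serve directly as a lookup key into the sample list.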

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 12.61%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.76%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 7.56%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 6.72%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 5.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 4.20%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 3.36%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 3.36%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 2.52%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 2.52%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 2.52%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 2.52%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.52%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 2.52%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 2.52%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.68%
Anoxic Lake Water | Environmental → Aquatic → Freshwater → Lake → Unclassified → Anoxic Lake Water | 1.68%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.68%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.68%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.68%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.68%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.68%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.68%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 1.68%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.84%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.84%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.84%
Arctic Peat Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil | 0.84%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.84%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.84%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.84%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.84%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.84%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.84%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.84%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.84%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.84%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.84%



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000953Soil microbial communities from Great Prairies - Kansas Corn soilEnvironmentalOpen in IMG/M
3300004019Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005328Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaGHost-AssociatedOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005340Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaGEnvironmentalOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005354Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaGHost-AssociatedOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005719Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2Host-AssociatedOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006640Arctic peat soil microbial communities from the Barrow Environmental Observatory site, Barrow, Alaska, USA - NGEE Permafrost305-11BEnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300006894Agricultural soil microbial communities from Utah to study Nitrogen management - NC ControlEnvironmentalOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300009091Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 3 metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009177Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaGHost-AssociatedOpen in IMG/M
3300009609Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890EnvironmentalOpen in IMG/M
3300009840Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot105AEnvironmentalOpen in IMG/M
3300010045Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot61EnvironmentalOpen in IMG/M
3300010166Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot27EnvironmentalOpen in IMG/M
3300010293Anoxic lake water microbial communities from Lake Kivu, Rwanda to study Microbial Dark Matter (Phase II) - Lake Kivu water 52m metaGEnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010375Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaGHost-AssociatedOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300012163Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT800_2EnvironmentalOpen in IMG/M
3300012173Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT517_2EnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012916Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S213-509R-2EnvironmentalOpen in IMG/M
3300012951Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300014300Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D1EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300018051Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300019878Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2m2EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2EnvironmentalOpen in IMG/M
3300020003Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a2EnvironmentalOpen in IMG/M
3300020068Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2EnvironmentalOpen in IMG/M
3300025115Anoxic lake water microbial communities from Lake Kivu, Rwanda to study Microbial Dark Matter (Phase II) - Lake Kivu water 52m metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025908Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025918Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025923Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026075Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026142Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026535Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq)EnvironmentalOpen in IMG/M
3300027360Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027378Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027513Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890 (SPAdes)EnvironmentalOpen in IMG/M
3300027636Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT153D57 HiSeqEnvironmentalOpen in IMG/M
3300027682Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027703Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 81 (SPAdes)EnvironmentalOpen in IMG/M
3300027717Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027886Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031228Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT153D57EnvironmentalOpen in IMG/M
3300031538Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1EnvironmentalOpen in IMG/M
3300031562Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D3EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031731Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-1Host-AssociatedOpen in IMG/M
3300031824Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-2Host-AssociatedOpen in IMG/M
3300031854Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D1EnvironmentalOpen in IMG/M
3300031903Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-1Host-AssociatedOpen in IMG/M
3300031939Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.P.R2EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300033407Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175EnvironmentalOpen in IMG/M
3300033475Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YCEnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any features of the restricted sequences below requires a license from the corresponding author(s) of the datasets.

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI11615J12901_100673077 | 3300000953 | Soil | MIHGKYGEFKILVDNEIVVDGGGLGFLGVLPSGREVLKV
Ga0055439_102600962 | 3300004019 | Natural And Restored Wetlands | MVHGRYGEYQVLVDGEVVIDGGMLAMLGVVPARQKVVDAVRT
Ga0062593_1005892733 | 3300004114 | Soil | MVHGHYAEYKVLVDGQIVADGGALTALGVVPSSNKIVEAVRAKLTGESSA*
Ga0062593_1008949622 | 3300004114 | Soil | MIHGDYAEYKVLVDGEVVIQGGALTALGIAPSGRKVLDAVRARLAEAR*
Ga0062593_1018143752 | 3300004114 | Soil | MVHGRYAEYQVLVDGECVADGGALAVLGIVPARKKVVEAIRAQLIRSDASSPPTTR*
Ga0062589_1006465382 | 3300004156 | Soil | LGIEPEMVHGRYGEYKVLVDGKVVIDGGPLTALGVVPASKKVVEAVRARLSR*
Ga0062589_1011932421 | 3300004156 | Soil | MVHGQYGEYKVLVDGETVIDGGALAALGIVPARRKVVEAVRAHLSR*
Ga0062590_1010325301 | 3300004157 | Soil | MVHGQYGEYKVLVDGETVIDGGALAALGIVPARRKVVEAVRAHLST*
Ga0063356_1005448212 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MVHGRYGEFTVLVDGETVVDGGPLAVVGVLPSGPRVVAAVRARLSG*
Ga0063356_1010924431 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MVRGRYGELKVLVDGETVVDGGTLAAFGVLPSGRKIVEVVRDRLSR*
Ga0063356_1030389852 | 3300004463 | Arabidopsis Thaliana Rhizosphere | QHGRYGEYKVLVDGETVVDGGALAVLGAVPSGRKVVAAVRDRLSV*
Ga0062591_1001834311 | 3300004643 | Soil | MVHGRYAEYQVLVDGECVADGGALAVLGIVPARKKVVEAIRAQLIRSDATSQPTTR*
Ga0062591_1001845202 | 3300004643 | Soil | MVRGRYGEFLVLVDGETVVDGGALAALGVLPSGRKVLDAVRARLSG*
Ga0062591_1030023521 | 3300004643 | Soil | MVHGRYGEYKVLVDGKVVIDGGPLTALGVVPASKKVVEAVRARLSR*
Ga0065705_102659102 | 3300005294 | Switchgrass Rhizosphere | MIHGRYGEYQVLVDGEVAIDGGALATLGIVPARRKVVEIVRARLSKVSDG*
Ga0070676_114270221 | 3300005328 | Miscanthus Rhizosphere | MVHGRYGEYKVLVDGEVVIDGGPLTALGVVPASKKVVEAVRARLSR*
Ga0066388_1017147832 | 3300005332 | Tropical Forest Soil | MIHGGYGEFTVLVDGETVVDSGALAALGVLPSAAKVAKAVQ
Ga0068869_1001293882 | 3300005334 | Miscanthus Rhizosphere | MIRGKYGEFKILVDGETVIDAGALAVVGVLPSGRKIVEAVRARLGS*
Ga0070689_1000792582 | 3300005340 | Switchgrass Rhizosphere | MIPGRYAEYKVLVDGKVVIDGGALTALGVVPGSKKVVDTVRAHLK*
Ga0070692_102048332 | 3300005345 | Corn, Switchgrass And Miscanthus Rhizosphere | MIRGKYGEFKILVDGEAVIDAGALAVVGVLPSGRKIVEAVRARLGS*
Ga0070675_1002737812 | 3300005354 | Miscanthus Rhizosphere | MVPGHYAEYKVLVDGKVVIDGGALTALGVVPGSKKVVDTVRAHLK*
Ga0070703_100448902 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | MIRGKYGEFKILVDGETVIDAGALAVVGVLPSGRKIVEAVRARLGR*
Ga0070694_1010646592 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | VEMVRGHYGEYKVLVDGAIVVDGGKLAALGVLPSGRKVVEVVRASLARSAPG*
Ga0070698_1004099862 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MVPGGFGEFRVLVDGDPVIEGGAFAALGVLPSGRKVLEAVRARLASL*
Ga0070697_1020765961 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | LVHGPYGQYKVLVDGEVVIDGGSLAFLGVLPSMPTIVETVRTRLAKN
Ga0070696_1012119911 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | MIRGKYGEFKILVDGETMIDAGALAVVGVLPSGRKIVEAVRARLGS*
Ga0070704_1007379732 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | STEVDLVHGPYGQYKVLVDGEVIIDGGSLAFLGVLPSMPTIVETVRTRLAKNG*
Ga0066693_101398842 | 3300005566 | Soil | LAIDVEMVHGRYGEYKVLVDGQTVIDGGPLTALGVVPASRKVVEAVRAHIAR*
Ga0068861_1021010971 | 3300005719 | Switchgrass Rhizosphere | RDLGIEPEMVHGRYGEYKVLVDGEVVIDGGPLTALGVVPASKKVVDAVRARLAR*
Ga0068860_1027681171 | 3300005843 | Switchgrass Rhizosphere | MVHGRYGEYKVLVDGEVVIDGGPLTALGVVPASRKVVDAVRARLSR*
Ga0066652_1000547243 | 3300006046 | Soil | MQHGSYAEYKVLVDGQTVINGGALTALGIVPARRKVVEVVRAHLSPR*
Ga0075417_105186791 | 3300006049 | Populus Rhizosphere | MIHGGYGEFTVLVDGETVVDGGAFAALGVLPSGAKVVKAVK
Ga0075527_101074051 | 3300006640 | Arctic Peat Soil | RYGEYKVLVDGEIVVDGGALAVLGVLPSGRKSVSAVRDRLSRS*
Ga0075421_1004489403 | 3300006845 | Populus Rhizosphere | MIHGGYGEFTVLVDGETVVDGGAFAALGVLPSGAKVVKAVKGRLSGTSQVRPS*
Ga0075421_1011685632 | 3300006845 | Populus Rhizosphere | MVKGRYGEFQVLVDGEVVVDGGALAALGVLPSGRNVVAVVRTKLSG*
Ga0075431_1021992031 | 3300006847 | Populus Rhizosphere | MIHGGYGEFTVLVDGETVVDGGAFAALGVLPSGAKVVKAVKGRLSGTSRPS*
Ga0075433_100581671 | 3300006852 | Populus Rhizosphere | MVHGGYGEFTVLVDGETAVDGGALAALGVLPSGATVVKTVKERLAAREDRLL*
Ga0075433_101048013 | 3300006852 | Populus Rhizosphere | MVHGPYGQFHVLVDGETLIDGGALAALGVLPSARKIVDAVRQRLAA*
Ga0075433_112722842 | 3300006852 | Populus Rhizosphere | MVHGRYGEYKIEVDGETVVDGGALAVLGIASSGRKAVEAVRARLTKP*
Ga0075425_1007439881 | 3300006854 | Populus Rhizosphere | MVHGRYGEYKIEVDGETVVDGGALAVLGIASSGRKAVEAV
Ga0075425_1013335322 | 3300006854 | Populus Rhizosphere | MVRGGYGEFTVLVDGETVVDGGALAALGVLPSGAKVVKAVKERLSSGTGPSAS*
Ga0075429_1009223582 | 3300006880 | Populus Rhizosphere | MVHGPYGQFHVLVEGELVIDGGPLAALGVLPSSRKVVEVVRARLAAA*
Ga0079215_101210303 | 3300006894 | Agricultural Soil | YKVLVDDETVIDGGALTALGVVPGDNKVIAAVRDRLSRA*
Ga0075424_1004777903 | 3300006904 | Populus Rhizosphere | MVHGPYGQFHILVDGETLIDGGALAALGVLPSARKIVDAVRQRLAA*
Ga0075435_1003494722 | 3300007076 | Populus Rhizosphere | MVHGGYGEFTVLVDGETAVDGGALAALGVLPSGATVVKTVKERLGAREDRLL*
Ga0075435_1006985611 | 3300007076 | Populus Rhizosphere | MVHGRYGEYKIEVDGETVVDGGALAVLGIASSGRKAVEAVRARL
Ga0102851_135153061 | 3300009091 | Freshwater Wetlands | GQFKVLVDGQTAIDAGGWAALGILPSGRKVVDAVREKLSP*
Ga0114129_122881762 | 3300009147 | Populus Rhizosphere | MVHGRYGEYKVLVDGETVVDGGALAALGIVPSSGKIVNAVRDRLSA
Ga0105248_129958932 | 3300009177 | Switchgrass Rhizosphere | MVHGRYGEYKVLVDGELVIAGGPLTALGVVPASKKVVEAVRARLSR*
Ga0105347_10021085 | 3300009609 | Soil | MVKGRYGEFQVLVDGEPVVDAGPLAALGVLPSGRKVIAAVRAKLG*
Ga0126313_108933752 | 3300009840 | Serpentine Soil | VRGRYGELKVLVDGETVVDGGTLAALGVLPSARKIVQAVRDRLSR*
Ga0126311_105930652 | 3300010045 | Serpentine Soil | MEHGSYGEYKVLVDGQTVIDGGALTALGIVPARRKVVEVVRTYLSPR*
Ga0126306_105970642 | 3300010166 | Serpentine Soil | DVDMVRGRYGELKVLVDGETVVDGGTLAAFGVLSSGRKIVEVVRDRLSR*
Ga0116204_12957361 | 3300010293 | Anoxic Lake Water | LVHGRYGEYKVLVDGKVAVDGGAGVILGIVPSAGAVVAAVRERLRK*
Ga0134128_118232451 | 3300010373 | Terrestrial Soil | SYGEYKVLVDGQTVIDGGALAALGIVPARRKVVETVRASLLP*
Ga0105239_132784322 | 3300010375 | Corn Rhizosphere | YGEYKVLVDGEVVVNGGALAALGVLPSAHKTVKVVRDRLSRLAG*
Ga0134127_113140812 | 3300010399 | Terrestrial Soil | MIRGGYGEFTVLVDGETVVDGGALAALGVLPSGAKVVKAVKERLSGSRANN*
Ga0137355_10460532 | 3300012163 | Soil | VRGPYGQFKVLVDGETTVDGGALAALGVLPSGRKVVEAVRTRLSAERS*
Ga0137327_10712953 | 3300012173 | Soil | GIEVEMEHGSYGEYKVLVDGQTVVDGGALTALGVVPARRKVVETVRAHLSA*
Ga0137365_101148582 | 3300012201 | Vadose Zone Soil | MVHGRYGEYKVLVDGQTVIDGGALTTLGVVPARRKIVEAVRAHLSP*
Ga0137380_101181351 | 3300012206 | Vadose Zone Soil | MLKGKYGEFRVLVDGQTVVDGGALAALGVLPSARKVVEAVRARLGSSTRDSSS*
Ga0150984_1008766182 | 3300012469 | Avena Fatua Rhizosphere | VELVQGHYGEYTVLVDDHTVIDGGPLTALGVVPASQKVVDAVREAIAK*
Ga0137373_102906571 | 3300012532 | Vadose Zone Soil | MVPGRYGQFLVQVDGQTVVDAGAWAVLGLLPSGRKVVEAVKARLP*
Ga0157310_100289272 | 3300012916 | Soil | MIKGRYGEFRVLVDGQTVVDGGALAALGVLPSARRVVDAVKAKLD*
Ga0164300_102255892 | 3300012951 | Soil | MVHGHYAEYKVLVDGQIVADGGALTALGVVPSSNKIVEAVRARLKSEPSA*
Ga0164302_110597602 | 3300012961 | Soil | MIEGRHGEFKVLVDGETVVDAGALAALGVLPSGRKVVDAVKAKLG*
Ga0075321_10354051 | 3300014300 | Natural And Restored Wetlands | MVRGRYGEYKVLVDGATVVDGGKLAALGVLPSGRKVIEA
Ga0132258_130018702 | 3300015371 | Arabidopsis Rhizosphere | MVHGRYGEYKVLVDGETVVDGGALAALGIVPSGHKVLSAVRDRLSK*
Ga0132256_1016210512 | 3300015372 | Arabidopsis Rhizosphere | MVRGGYGEFKVLVDGTPVIDGGALAALGVLPSARRIIR
Ga0132255_1008775351 | 3300015374 | Arabidopsis Rhizosphere | MVHGRYGEYKVLVDGEVVIDGGPLTALGGVPASKKVVEAVRARLSR*
Ga0184620_101063501 | 3300018051 | Groundwater Sediment | LGIDVEMVHGRYGEYKILVDGQTVIDGGALTALGVVPTGRKVVEAVRAHLSPE
Ga0184619_102436563 | 3300018061 | Groundwater Sediment | EYRILVDGQTVIDGGALTVLGIVPARRTAVEAVRAHLSRDSGANAV
Ga0184632_104704912 | 3300018075 | Groundwater Sediment | VEMVHGRYGEYKILVDGETAIDGGALTALGIVPARRKVVEAVRARLSRGKG
Ga0193715_10036691 | 3300019878 | Soil | MVHGRYGEYRILVDGETVIDGGALTVLGIVPARRKAVEAVRAHLSRRPGPSSKDR
Ga0193723_10144551 | 3300019879 | Soil | MVRGRYGEFLVLVDGETVIDGGALAALGVLPSGRKVLDAVRARLSG
Ga0193739_10887842 | 3300020003 | Soil | MVRGGVGEFKVLVDGDTVIEGGIFAALGVLPSGRKIVDAVRARLAR
Ga0184649_14840961 | 3300020068 | Groundwater Sediment | VDLAHGPYGQFKVLVDGATIVDGGKWAALGVLPSGPEVVVAVRARLGTT
Ga0210377_100862191 | 3300021090 | Groundwater Sediment | LVGGPYGQFKVLVDGETAVDGGKWAALGVLPSGRTVVEAVRGKLGKP
Ga0193719_100152233 | 3300021344 | Soil | MEHGFYGEYKVLVDGQTVIDGGALTALGIVPARRRVVETVRAHLSP
Ga0193719_102053491 | 3300021344 | Soil | MVHGRYGEYRILVDGETVIDGGALTVLGIVPARRKAVEAVRAHLSRRP
Ga0209835_11803772 | 3300025115 | Anoxic Lake Water | LVHGRYGEYKVLVDGKVAVDGGAGVILGIVPSAGAVVAAVRERLRK
Ga0209640_100600363 | 3300025324 | Soil | MVRGGYGQFKVLVDGDTVVDGGALAALGVLPSYRKVLAAVQARYRGQPSNP
Ga0207642_100513472 | 3300025899 | Miscanthus Rhizosphere | MVHGRYGEYKVLVDGEVVIDGGPLTALGVVPASKKVVEAVRARLSR
Ga0207643_111125771 | 3300025908 | Miscanthus Rhizosphere | MIRGKYGEFKILVDGETVIDAGALAVVGVLPSGRKIVEAVR
Ga0207662_103868552 | 3300025918 | Switchgrass Rhizosphere | MVHGHYAEYKVLVDGQIVADGGALTALGVVPSSNKIVEAVRAKLTGESSA
Ga0207681_103119672 | 3300025923 | Switchgrass Rhizosphere | MVHGRYGEYKVLVDGEVVIDGGPLTALGVVPASRKVVEAVRARLSR
Ga0207687_117035352 | 3300025927 | Miscanthus Rhizosphere | MVHGRYGEYKVLVDGEVVIDGGPLTALGVVPASKKVVEAVR
Ga0207706_103464212 | 3300025933 | Corn Rhizosphere | MVHGRYGEYKVLVDGEVVIDGGPLTALGVVPASRKVVDAVRARLSR
Ga0207708_103306851 | 3300026075 | Corn, Switchgrass And Miscanthus Rhizosphere | MVHGRYGEYKVLVDGKVVIDGGPLTALGVVPASKKVVEAVRARLSR
Ga0207698_116132262 | 3300026142 | Corn Rhizosphere | RRDLGIEPEMVHGRYGEYKVLVDGKVVIDGGPLTALGVVPASRKVVDAVRARLSR
Ga0256867_100528112 | 3300026535 | Soil | MIHGRYGEYKILVDDDVVIDAGALSAIGIVPSDRKVVDAVRRRLA
Ga0209969_10018653 | 3300027360 | Arabidopsis Thaliana Rhizosphere | MIHGRYGEYQVLVDGEVAIDGGALATLGIVPARRKVVEVVRARLSKVSGG
Ga0209981_10708812 | 3300027378 | Arabidopsis Thaliana Rhizosphere | MIHGRYGEYQVLVDGEVAIDGGALATLGIVPARRKVVEVVRARLSKVSDG
Ga0208685_10095623 | 3300027513 | Soil | RYGEFQVLVDGEPVVDAGPLAALGVLPSGRKVIAAVRAKLG
Ga0214469_11628932 | 3300027636 | Soil | LRTEVEMVRGRYGEFQVLVDGKTVVDGGALAALGVLPSGRKVLDAVRATLSG
Ga0209971_10336832 | 3300027682 | Arabidopsis Thaliana Rhizosphere | MVHGRYGEYKVLVDGETVVDGGALAALGIVPSGHKVLSAVRDRLSK
Ga0207862_10058333 | 3300027703 | Tropical Forest Soil | MVRGRYGEFKILVDGQTVIDGGAAAFLGVLPSGRKVVAAVQAVMSS
Ga0209998_100091243 | 3300027717 | Arabidopsis Thaliana Rhizosphere | MIHGRYGEHQVLVDGEVAIDGGALATLGIVPARRKVVEVVRARLSKVSGG
(restricted) Ga0233416_100381762 | 3300027799 | Sediment | MDVDMVRGRYAEFKVLVDGETVIDGGAMAFLGVLPSGRKVVDAVRERLSGR
(restricted) Ga0233416_100436352 | 3300027799 | Sediment | MVRGRYAEFKVLVDGQTVIDGGALAALGILPSGRKIVEAVRGRLSR
Ga0209486_102717391 | 3300027886 | Agricultural Soil | GRYGEYKILVDNVVVVDGGPLLALGVMPAARKTVAAVRTKLGL
Ga0209382_121847651 | 3300027909 | Populus Rhizosphere | MIHGGYGEFTVLVDGETVVDGGAFAALGVLPSGAKVVKAVKGRLSGTSQVRPS
Ga0268265_100423882 | 3300028380 | Switchgrass Rhizosphere | MIRGKYGEFKILVDGETVIDAGALAVVGVLPSGRKIVEAVRARLGS
Ga0307296_103716703 | 3300028819 | Soil | VEMVHGRYGEYTILVDGQTVIDGGALTALGVVPTGRKVVEAVRAHLSPE
Ga0307310_100791124 | 3300028824 | Soil | IDVEMVHGRYGEYTILVDGQTVIDGGALTALGVVPTGRKVVEAVRAHLSPE
Ga0302046_101286114 | 3300030620 | Soil | LVGGPYGQFKVLVDGETAVDGGRWAALGVLPSGRTVVEAVRGKLGKP
Ga0299914_102086632 | 3300031228 | Soil | MVRGRYGEFQVLVDGKTVVDGGALAALGVLPSGRKVLAAVRATLSG
Ga0310888_100937172 | 3300031538 | Soil | MVHGRYAEYQVLVDGQTVIDGGALTALGIVPARRKVVEAIRARLQS
Ga0310886_102121651 | 3300031562 | Soil | MIRGRYGQFKVLMDGETVVDGGALAALGVLPSSRKVVDTVRAKNSQ
Ga0310813_113651051 | 3300031716 | Soil | MVHGRYGEYKIEVDGATVVDGGALAVLGIASSGRKAVEAVRARL
Ga0307405_110992872 | 3300031731 | Rhizosphere | MIHGRYAEYQVLVDGETVIDGGALTALGVVPARRKVVEAIRARLEKNDKART
Ga0307413_118295241 | 3300031824 | Rhizosphere | MVRGRYGQFKVLVDGEPVIDAGALAALGVLPSAGRVIEAVRDRLAG
Ga0310904_100754152 | 3300031854 | Soil | MIRGRYGQFKVLMDGETVVDGGALAALGVLPSSRKVVDTVRAKNSQEKRDAPN
Ga0307407_102974193 | 3300031903 | Rhizosphere | DVDMVRGRYGQFKVLVDGEPVIDAGALAALGVLPSAGRVIEAVRDRLAG
Ga0308174_113187622 | 3300031939 | Soil | MVHGQYGEYKVLVDGETVIDGGALAALGIVPARRKV
Ga0214473_119416831 | 3300031949 | Soil | EYKVLVDGEIVVDGGALAILGLLPSARKTVSAVRDRISR
Ga0310890_102658242 | 3300032075 | Soil | VEFSRPPGLVDGQTVVDAGALAALGVLPSGRKVVDTVKAQLG
Ga0214472_100128391 | 3300033407 | Soil | LVGGPYGQFKVLVDGETAVDGGRWAALGVLPSGRTVV
Ga0310811_101123003 | 3300033475 | Soil | MVHGRYGEYQVLVDGQTVIDGGALAALGIVPARTKVVEAVRFHLAQ




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice