NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F078894


Overview

Basic Information
Family ID F078894
Family Type Metagenome / Metatranscriptome
Number of Sequences 116
Average Sequence Length 75 residues
Representative Sequence LEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK
Number of Associated Samples 100
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 4.59 %
% of genes near scaffold ends (potentially truncated) 87.07 %
% of genes from short scaffolds (< 2000 bps) 87.07 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.72


Most Common Taxonomy
Group Bacteria (85.345 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (12.931 % of family members)
Environment Ontology (ENVO) Unclassified (36.207 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (42.241 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 60.78%, β-sheet: 0.00%, Coil/Unstructured: 39.22%

Predicted 3D Structure

pTM-score: 0.72

Structural matches with SCOPe domains

SCOP family	SCOP domain	Representative PDB	TM-score
a.8.7.1: Type IV secretion system protein TraC	d1r8ia_	1r8i	0.787
a.29.13.1: Bacillus cereus metalloprotein-like	d3dbya1	3dby	0.77604
a.29.13.1: Bacillus cereus metalloprotein-like	d3d19a2	3d19	0.77092
e.18.1.1: Nickel-iron hydrogenase, large subunit	d4u9hl_	4u9h	0.76463
a.29.13.1: Bacillus cereus metalloprotein-like	d3d19a1	3d19	0.76274



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 116 Family Scaffolds
PF06831	H2TH	10.34
PF06827	zf-FPG_IleRS	5.17
PF00497	SBP_bac_3	3.45
PF01145	Band_7	2.59
PF03404	Mo-co_dimer	2.59
PF00072	Response_reg	1.72
PF02390	Methyltransf_4	1.72
PF13302	Acetyltransf_3	1.72
PF00296	Bac_luciferase	1.72
PF12847	Methyltransf_18	0.86
PF00873	ACR_tran	0.86
PF04392	ABC_sub_bind	0.86
PF05239	PRC	0.86
PF01078	Mg_chelatase	0.86
PF00174	Oxidored_molyb	0.86
PF01156	IU_nuc_hydro	0.86
PF03729	DUF308	0.86
PF11897	DUF3417	0.86
PF03992	ABM	0.86
PF00589	Phage_integrase	0.86
PF00699	Urease_beta	0.86
PF02518	HATPase_c	0.86
PF13442	Cytochrome_CBB3	0.86
PF13557	Phenol_MetA_deg	0.86
PF13419	HAD_2	0.86
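
The frequencies above are consistent with whole-scaffold counts out of the 116 family scaffolds (for example, 10.34 % corresponds to 12 of 116). The following is a minimal sketch of that arithmetic in Python. The function names and the small `pfam_percent` sample are illustrative assumptions, not part of NMPFamsDB, and the counts are inferred from the rounded percentages rather than reported by the database.

```python
# Total taken from "Number of Associated Scaffolds" in the Overview section.
TOTAL_SCAFFOLDS = 116


def scaffold_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Infer the nearest whole number of scaffolds from a rounded percentage."""
    return round(percent * total / 100)


def percent_frequency(count: int, total: int = TOTAL_SCAFFOLDS) -> float:
    """Convert a scaffold count back to a percentage with two decimals."""
    return round(100 * count / total, 2)


# A few rows copied from the Pfam table (hypothetical sample, not the full table).
pfam_percent = {
    "PF06831": 10.34,
    "PF06827": 5.17,
    "PF00497": 3.45,
    "PF01145": 2.59,
    "PF00072": 1.72,
    "PF12847": 0.86,
}

pfam_counts = {pfam: scaffold_count(p) for pfam, p in pfam_percent.items()}

for pfam, n in pfam_counts.items():
    print(f"{pfam}: {n} of {TOTAL_SCAFFOLDS} scaffolds ({percent_frequency(n)} %)")
```

Under this reading, the lowest value in the table (0.86 %) corresponds to a domain seen next to the family on a single scaffold.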

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 116 Family Scaffolds
COG0266	Formamidopyrimidine-DNA glycosylase	Replication, recombination and repair [L]	10.34
COG0030	16S rRNA A1518 and A1519 N6-dimethyltransferase RsmA/KsgA/DIM1 (may also have DNA glycosylase/AP lyase activity)	Translation, ribosomal structure and biogenesis [J]	1.72
COG0220	tRNA G46 N7-methylase TrmB	Translation, ribosomal structure and biogenesis [J]	1.72
COG2141	Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)	Coenzyme transport and metabolism [H]	1.72
COG2226	Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenG	Coenzyme transport and metabolism [H]	1.72
COG4122	tRNA 5-hydroxyU34 O-methylase TrmR/YrrM	Translation, ribosomal structure and biogenesis [J]	1.72
COG0832	Urease beta subunit	Amino acid transport and metabolism [E]	0.86
COG1957	Inosine-uridine nucleoside N-ribohydrolase	Nucleotide transport and metabolism [F]	0.86
COG2041	Molybdopterin-dependent catalytic subunit of periplasmic DMSO/TMAO and protein-methionine-sulfoxide reductases	Energy production and conversion [C]	0.86
COG2984	ABC-type uncharacterized transport system, periplasmic component	General function prediction only [R]	0.86
COG3247	Acid resistance membrane protein HdeD, DUF308 family	General function prediction only [R]	0.86
COG3915	Uncharacterized conserved protein	Function unknown [S]	0.86



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	85.34 %
Unclassified	root	N/A	14.66 %


Associated Scaffolds


Scaffold	Taxonomy	Length
3300000033|ICChiseqgaiiDRAFT_c2341157	All Organisms → cellular organisms → Bacteria	1763
3300000364|INPhiseqgaiiFebDRAFT_104842389	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	850
3300000550|F24TB_10183710	All Organisms → cellular organisms → Bacteria	2244
3300000955|JGI1027J12803_109368259	All Organisms → cellular organisms → Bacteria	1867
3300000956|JGI10216J12902_100432820	All Organisms → cellular organisms → Bacteria	1642
3300003505|JGIcombinedJ51221_10384317	All Organisms → cellular organisms → Bacteria	570
3300004268|Ga0066398_10097718	All Organisms → cellular organisms → Bacteria	676
3300004643|Ga0062591_101329816	All Organisms → cellular organisms → Bacteria → Proteobacteria	708
3300005186|Ga0066676_10395066	All Organisms → cellular organisms → Bacteria	932
3300005330|Ga0070690_100648116	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	806
3300005343|Ga0070687_100706318	All Organisms → cellular organisms → Bacteria	705
3300005436|Ga0070713_100887755	All Organisms → cellular organisms → Bacteria	857
3300005440|Ga0070705_101659704	All Organisms → cellular organisms → Bacteria	539
3300005445|Ga0070708_101788112	All Organisms → cellular organisms → Bacteria	570
3300005445|Ga0070708_101834786	All Organisms → cellular organisms → Bacteria	562
3300005445|Ga0070708_102152167	All Organisms → cellular organisms → Bacteria	515
3300005447|Ga0066689_10209342	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1187
3300005467|Ga0070706_101219017	All Organisms → cellular organisms → Bacteria	691
3300005467|Ga0070706_101298341	All Organisms → cellular organisms → Bacteria	667
3300005536|Ga0070697_101958032	All Organisms → cellular organisms → Bacteria	524
3300005576|Ga0066708_10931005	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	541
3300005586|Ga0066691_10527458	All Organisms → cellular organisms → Bacteria	705
3300005713|Ga0066905_100121010	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	1826
3300005719|Ga0068861_100131505	All Organisms → cellular organisms → Bacteria	2032
3300005841|Ga0068863_100289751	All Organisms → cellular organisms → Bacteria	1587
3300005841|Ga0068863_100450155	All Organisms → cellular organisms → Bacteria	1264
3300005843|Ga0068860_100899890	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → Planctomycetaceae → Rubinisphaera → unclassified Rubinisphaera → Rubinisphaera sp.	901
3300005844|Ga0068862_100310485	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	1453
3300006034|Ga0066656_10184096	All Organisms → cellular organisms → Bacteria	1323
3300006058|Ga0075432_10237600	All Organisms → cellular organisms → Bacteria	735
3300006196|Ga0075422_10332909	Not Available	658
3300006800|Ga0066660_11678146	All Organisms → cellular organisms → Bacteria	506
3300006881|Ga0068865_100145364	All Organisms → cellular organisms → Bacteria	1793
3300006903|Ga0075426_11016209	All Organisms → cellular organisms → Bacteria	627
3300006903|Ga0075426_11055634	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	615
3300006969|Ga0075419_10703544	All Organisms → cellular organisms → Bacteria	717
3300007265|Ga0099794_10074134	All Organisms → cellular organisms → Bacteria	1671
3300009147|Ga0114129_13360301	All Organisms → cellular organisms → Bacteria	516
3300009174|Ga0105241_12268166	All Organisms → cellular organisms → Bacteria	540
3300009176|Ga0105242_11385512	Not Available	730
3300009804|Ga0105063_1005752	All Organisms → cellular organisms → Bacteria	1204
3300009837|Ga0105058_1158754	All Organisms → cellular organisms → Bacteria	552
3300010046|Ga0126384_10405100	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1153
3300010359|Ga0126376_10912490	All Organisms → cellular organisms → Bacteria	870
3300010359|Ga0126376_13150465	All Organisms → cellular organisms → Bacteria	510
3300010362|Ga0126377_10127345	All Organisms → cellular organisms → Bacteria → Proteobacteria	2362
3300010362|Ga0126377_13016585	Not Available	543
3300010362|Ga0126377_13417278	All Organisms → cellular organisms → Bacteria	513
3300010397|Ga0134124_12924055	All Organisms → cellular organisms → Bacteria	521
3300011429|Ga0137455_1061460	All Organisms → cellular organisms → Bacteria	1070
3300012096|Ga0137389_10905661	All Organisms → cellular organisms → Bacteria	757
3300012363|Ga0137390_11554600	All Organisms → cellular organisms → Bacteria	601
3300012363|Ga0137390_11749848	All Organisms → cellular organisms → Bacteria	554
3300012901|Ga0157288_10130880	All Organisms → cellular organisms → Bacteria	724
3300012922|Ga0137394_10538590	Not Available	990
3300012930|Ga0137407_10124381	All Organisms → cellular organisms → Bacteria → Proteobacteria	2245
3300012986|Ga0164304_10772295	All Organisms → cellular organisms → Bacteria	738
3300013306|Ga0163162_10060305	All Organisms → cellular organisms → Bacteria	3829
3300014326|Ga0157380_10691976	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → Planctomycetaceae → Rubinisphaera → unclassified Rubinisphaera → Rubinisphaera sp.	1023
3300015258|Ga0180093_1084408	All Organisms → cellular organisms → Bacteria	762
3300015372|Ga0132256_103400654	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	535
3300018027|Ga0184605_10084048	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	1391
3300018028|Ga0184608_10334789	All Organisms → cellular organisms → Bacteria	665
3300018028|Ga0184608_10343738	All Organisms → cellular organisms → Bacteria	655
3300019228|Ga0180119_1267068	All Organisms → cellular organisms → Bacteria	561
3300019881|Ga0193707_1041970	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium elkanii	1476
3300020004|Ga0193755_1132087	Not Available	771
3300020006|Ga0193735_1167875	All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → Variovorax → unclassified Variovorax → Variovorax sp. Sphag1AA	552
3300021344|Ga0193719_10160189	Not Available	969
3300023057|Ga0247797_1038996	All Organisms → cellular organisms → Bacteria	662
3300024283|Ga0247670_1064865	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	663
3300025910|Ga0207684_10096079	All Organisms → cellular organisms → Bacteria	2529
3300025911|Ga0207654_11032480	All Organisms → cellular organisms → Bacteria	598
3300025922|Ga0207646_10541535	All Organisms → cellular organisms → Bacteria	1047
3300025922|Ga0207646_11608683	All Organisms → cellular organisms → Bacteria	560
3300025928|Ga0207700_10454929	All Organisms → cellular organisms → Bacteria	1129
3300025934|Ga0207686_10702817	All Organisms → cellular organisms → Bacteria	804
3300026118|Ga0207675_100077434	All Organisms → cellular organisms → Bacteria	3115
3300026335|Ga0209804_1331076	All Organisms → cellular organisms → Bacteria	512
3300026342|Ga0209057_1075157	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1453
3300026355|Ga0257149_1060709	All Organisms → cellular organisms → Bacteria	550
3300026507|Ga0257165_1050033	All Organisms → cellular organisms → Bacteria	750
3300026551|Ga0209648_10012253	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	7459
3300026552|Ga0209577_10522168	All Organisms → cellular organisms → Bacteria	776
3300026557|Ga0179587_10329521	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium	988
3300027013|Ga0209884_1001965	All Organisms → cellular organisms → Bacteria	1689
3300027277|Ga0209846_1038609	Not Available	748
3300027378|Ga0209981_1038146	All Organisms → cellular organisms → Bacteria	715
3300027873|Ga0209814_10520887	All Organisms → cellular organisms → Bacteria	528
3300027880|Ga0209481_10505415	All Organisms → cellular organisms → Bacteria	624
3300027950|Ga0209885_1004252	All Organisms → cellular organisms → Bacteria	1406
3300028380|Ga0268265_11359539	All Organisms → cellular organisms → Bacteria	711
3300028381|Ga0268264_10953866	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium → unclassified Mesorhizobium → Mesorhizobium sp. J18	863
3300028673|Ga0257175_1101126	All Organisms → cellular organisms → Bacteria	565
3300028771|Ga0307320_10358116	All Organisms → cellular organisms → Bacteria	583
(restricted) 3300031150|Ga0255311_1147590	All Organisms → cellular organisms → Bacteria	521
3300031170|Ga0307498_10353123	All Organisms → cellular organisms → Bacteria	565
3300031198|Ga0307500_10176928	Not Available	624
3300031199|Ga0307495_10216339	All Organisms → cellular organisms → Bacteria	534
3300031562|Ga0310886_10449228	Not Available	769
3300031720|Ga0307469_10533625	All Organisms → cellular organisms → Bacteria	1036
3300031720|Ga0307469_10758173	Not Available	886
3300031720|Ga0307469_11408269	All Organisms → cellular organisms → Bacteria	665
3300031720|Ga0307469_12100026	All Organisms → cellular organisms → Bacteria	549
3300031720|Ga0307469_12331104	All Organisms → cellular organisms → Bacteria	522
3300031740|Ga0307468_101544907	All Organisms → cellular organisms → Bacteria	617
3300031908|Ga0310900_11638672	All Organisms → cellular organisms → Bacteria	545
3300032003|Ga0310897_10655593	All Organisms → cellular organisms → Bacteria	523
3300033004|Ga0335084_11074330	All Organisms → cellular organisms → Bacteria	808
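
Each scaffold identifier above appears to combine a numeric prefix with a scaffold name, separated by `|`, and the prefix matches a Taxon OID listed under Associated Samples (for example, 3300000033). The sketch below splits an identifier on that assumption. The `ScaffoldRef` type and `parse_scaffold_id` function are hypothetical helpers written for illustration and are not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Hypothetical container for the two parts of a scaffold identifier."""
    taxon_oid: str
    scaffold_name: str


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its parts.

    Assumes the layout seen in the table above; a leading '(restricted)'
    marker, if present, is dropped before splitting.
    """
    cleaned = scaffold_id.replace("(restricted)", "").strip()
    taxon_oid, sep, name = cleaned.partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, name)


ref = parse_scaffold_id("3300000033|ICChiseqgaiiDRAFT_c2341157")
print(ref.taxon_oid, ref.scaffold_name)
```

Grouping scaffolds by `taxon_oid` in this way would be one route to joining the scaffold table to the sample table, provided the prefix assumption holds for every row.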

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	12.93%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	10.34%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	8.62%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	7.76%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	6.90%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	6.90%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	6.03%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	5.17%
Groundwater Sand	Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand	4.31%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	3.45%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	2.59%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil	2.59%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere	2.59%
Soil	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil	1.72%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil	1.72%
Switchgrass Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere	1.72%
Corn Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere	1.72%
Groundwater Sediment	Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment	0.86%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	0.86%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	0.86%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	0.86%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	0.86%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	0.86%
Soil	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil	0.86%
Soil	Environmental → Terrestrial → Soil → Loam → Grasslands → Soil	0.86%
Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil	0.86%
Sandy Soil	Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil	0.86%
Arabidopsis Thaliana Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere	0.86%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere	0.86%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	0.86%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere	0.86%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere	0.86%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID	Sample Name	Habitat Type
3300000033	Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil	Environmental
3300000364	Soil microbial communities from Great Prairies - Iowa, Native Prairie soil	Environmental
3300000550	Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly	Environmental
3300000955	Soil microbial communities from Great Prairies - Iowa, Native Prairie soil	Environmental
3300000956	Soil microbial communities from Great Prairies - Kansas, Native Prairie soil	Environmental
3300003505	Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924)	Environmental
3300004268	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBio	Environmental
3300004643	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3	Environmental
3300005186	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125	Environmental
3300005330	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG	Environmental
3300005343	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG	Environmental
3300005436	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG	Environmental
3300005440	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG	Environmental
3300005445	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG	Environmental
3300005447	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138	Environmental
3300005467	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG	Environmental
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005576	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157	Environmental
3300005586	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140	Environmental
3300005713	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)	Environmental
3300005719	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2	Host-Associated
3300005841	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2	Host-Associated
3300005843	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2	Host-Associated
3300005844	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2	Host-Associated
3300006034	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105	Environmental
3300006058	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1	Host-Associated
3300006196	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1	Host-Associated
3300006580	Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLPC (Metagenome Metatranscriptome)	Environmental
3300006800	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109	Environmental
3300006881	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2	Host-Associated
3300006903	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5	Host-Associated
3300006969	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3	Host-Associated
3300007265	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1	Environmental
3300009098	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG	Host-Associated
3300009147	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)	Host-Associated
3300009174	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG	Host-Associated
3300009176	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG	Host-Associated
3300009804	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40	Environmental
3300009837	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30	Environmental
3300010046	Tropical forest soil microbial communities from Panama - MetaG Plot_36	Environmental
3300010359	Tropical forest soil microbial communities from Panama - MetaG Plot_15	Environmental
3300010362	Tropical forest soil microbial communities from Panama - MetaG Plot_22	Environmental
3300010397	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4	Environmental
3300011429	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2	Environmental
3300012096	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG	Environmental
3300012202	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG	Environmental
3300012363	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG	Environmental
3300012901	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S119-311C-1	Environmental
3300012922	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG	Environmental
3300012930	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)	Environmental
3300012944	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)	Environmental
3300012986	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG	Environmental
3300013306	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG	Host-Associated
3300014326	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaG	Host-Associated
3300015258	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_1Da	Environmental
3300015372	Soil combined assembly	Host-Associated
3300018027	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex	Environmental
3300018028	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex	Environmental
3300019228	Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT790_16_1Ra (Metagenome Metatranscriptome)	Environmental
3300019881	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2	Environmental
3300020004	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2	Environmental
3300020006	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m2	Environmental
3300021344	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2	Environmental
3300023057	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S136-409B-6	Environmental
3300024283	Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK11	Environmental
3300025910	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)	Environmental
3300025911	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG (SPAdes)	Host-Associated
3300025922	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)	Environmental
3300025928	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)	Environmental
3300025934	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)	Host-Associated
3300026118	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)	Host-Associated
3300026328	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)	Environmental
3300026333	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)	Environmental
3300026335	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)	Environmental
3300026342	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)	Environmental
3300026355	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-A	Environmental
3300026507	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-B	Environmental
3300026551	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)	Environmental
3300026552	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)	Environmental
3300026557	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal	Environmental
3300027013	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40 (SPAdes)	Environmental
3300027277	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30 (SPAdes)	Environmental
3300027378	Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 PM (SPAdes)	Host-Associated
3300027873	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)	Host-Associated
3300027880	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes)	Host-Associated
3300027950	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50 (SPAdes)	Environmental
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028673Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-BEnvironmentalOpen in IMG/M
3300028771Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031170Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 12_SEnvironmentalOpen in IMG/M
3300031198Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 14_SEnvironmentalOpen in IMG/M
3300031199Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_SEnvironmentalOpen in IMG/M
3300031562Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031908Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1EnvironmentalOpen in IMG/M
3300032003Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D1EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any features of the restricted sequences listed below requires obtaining a license from the corresponding author(s) of the respective datasets.

Protein ID Sample Taxon ID Habitat Sequence
ICChiseqgaiiDRAFT_234115723300000033SoilMKQNPSLSPADKGKVDTLLPQANSLNTELAKPQAEPSRLTQLAGQLGDLQKQVGSLKGMIK*
INPhiseqgaiiFebDRAFT_10484238913300000364SoilPGVGSLPSSMIPDKAALLEQAKKILADLMAMKQDPKLPAADKAKVDTLIPKAQSANTELAKPQVEPSKLTQLASQLGDLQKQVAAIKPAAVR*
F24TB_1018371043300000550SoilQAKQLVADLTAMKQNPSLSPADKGKVDSLLPKANSLNTELAKPQVEPSRLTQLAGQLGDLQKQVGSLKGMIK*
JGI1027J12803_10936825923300000955SoilMKSSGKLGAADTAKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK*
JGI10216J12902_10043282013300000956SoilDLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGAMK*
JGIcombinedJ51221_1038431713300003505Forest SoilQSQKLVTELTSMKSGGKLGAADAGKVDTLLPKATAVNTELAKPQVEPSRLTQLAGQLADLQKQAASLKGLMK*
Ga0066398_1009771813300004268Tropical Forest SoilSVHAQLPGVSSLIPDKATLLEQAKKLLAELTEMKQDPKVSPADKSKVDKLIPQATAVNTELAKPQVEPSKLTKLAAQLGDLQKQVAAIKR*
Ga0062591_10132981623300004643SoilQIPGMGSLSLDKGALLEQAKQLVAELTAMKQNPSLSPADKGKVDSLLPKANSLNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKSTIK*
Ga0066676_1039506613300005186SoilLEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK*
Ga0070690_10064811613300005330Switchgrass RhizosphereAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKEVGALKGAMK*
Ga0070687_10070631823300005343Switchgrass RhizosphereSMLPDKTQLLEQAQKLVTDLTSMKSSGKLGAADTAKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVAALKGAIK*
Ga0070713_10088775533300005436Corn, Switchgrass And Miscanthus RhizosphereQLLEQAKKLVTELTAMKSSGKLNAADTSKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVTALRGSVK*
Ga0070705_10165970423300005440Corn, Switchgrass And Miscanthus RhizosphereGSMLPDKATLLDQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKEVGALKGAMK*
Ga0070708_10178811213300005445Corn, Switchgrass And Miscanthus RhizosphereQMQIPGVGSMLPDKAQLLEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLVGQLGDLQKQVGALKGVMK*
Ga0070708_10183478613300005445Corn, Switchgrass And Miscanthus RhizosphereVGSMLPDKTQLLEQAQKLVTELTAMKSSGKLGAADAGKVDALLPKATALNTELAKPQVPPSRLTQLAGQLGDLQKQVGALQGVMK*
Ga0070708_10215216713300005445Corn, Switchgrass And Miscanthus RhizosphereVGSMLPDKTQLLEQAQKLVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTQLAGQLGDLQKQVGALRGLMK*
Ga0066689_1020934223300005447SoilLAQMPSVGSMLPDKAQLLEQGQKLVADLTSMKSSGKLGAADVGKVDSLLPKATALNTELAKPEVAPSRLAQLAGQLGDLQKQVGALKGLMK*
Ga0070706_10121901723300005467Corn, Switchgrass And Miscanthus RhizospherePSVGSMLPDKAQLLEQGQKLVADLTSMKSSGKLGAADVGKVDSLLPKATALNTELDKPEVAPSRLAQLAGQLGDLQKQVGALKGLMK*
Ga0070706_10129834113300005467Corn, Switchgrass And Miscanthus RhizosphereVGSMLPDKTQLLEQAQKLVTELTAMKSSGKLGAADAGKVDALLPKATALNTELAKPQVPPSRLAQLAGQLGDLQKQAGALQGLMK*
Ga0070697_10195803213300005536Corn, Switchgrass And Miscanthus RhizosphereAQLLEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLVGQLGDLQKQVGALKGVMK*
Ga0066708_1093100513300005576SoilAQKLVADLTSMKSSGKLGTADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK*
Ga0066691_1052745833300005586SoilLEQAQKLVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTKLAGQLGDLQKQVGALRGLMK*
Ga0066905_10012101013300005713Tropical Forest SoilDKGALLEQAKQLVSELTALKENPNLSPADKSKVDSLLPKANSLNTELAKPQVEPNRLTQLAGQLGDLQKQVGSLKGMIGK*
Ga0068861_10013150543300005719Switchgrass RhizospherePSVSSLIPDKATLLEQGKKLLADLTAMRQDPKLPAADKSKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQRQYAALKGN*
Ga0068863_10028975113300005841Switchgrass RhizosphereAMRQDPKLPAADKSKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQRQYAALKGN
Ga0068863_10045015513300005841Switchgrass RhizosphereQMQIPSVGSMLPDKATLLDQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKEVGALKGAMK*
Ga0068860_10089989023300005843Switchgrass RhizosphereDLTSMKSSGKLGAADTAKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVAALKGAIK*
Ga0068862_10031048513300005844Switchgrass RhizosphereVAAQIPGMGSLSLDKGALLEQAKQLVAELTAMKQNPSLSPADKGKVDSLLPKANSLNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKSTIK*
Ga0066656_1018409633300006034SoilMTRGTEVVAFLTAWKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK*
Ga0075432_1023760013300006058Populus RhizosphereAFAQLPNIGSMLPDKTQLLEQAKKLVTELTAMKSSGKLSAADTSKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVTALRGSVK*
Ga0075422_1033290913300006196Populus RhizosphereLTAMKQNPSLSPADKGKVDSLLPKANSLNTELAKPQVEPSRLTQLAGQLGDLQKQVGSLKGTIK*
Ga0074049_1306920613300006580SoilFPAADKSKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQRQYAALKGN*
Ga0066660_1167814613300006800SoilVGSMLPDKAQLLEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLPQLAGQLGDLQKQVGALKGLMK*
Ga0068865_10014536413300006881Miscanthus RhizosphereADLTAMKQDPKLPAADKSKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQRQYAALKGN*
Ga0075426_1101620923300006903Populus RhizosphereLTSMKSSGKLGAADSAKVDSLLPKATAVNTELAKPQVEPSRMTQLAGQLGDLQKQVGALKGVMK*
Ga0075426_1105563423300006903Populus RhizosphereTLAQMQIPGVGSMLPDKAQLLEQAQKLLADLTSMKSSGRLGATDTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKEVGALKGVMK*
Ga0075419_1070354413300006969Populus RhizosphereDPKLPAADKSKVDALIPKASSVSSELAKPQVEPSRLTQLAGQLTDLQKQYASLKGH*
Ga0099794_1007413433300007265Vadose Zone SoilMLPDKTQLLEQAQKLVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTQLAGQLGDLQKQVGALRGVMK*
Ga0105245_1097561323300009098Miscanthus RhizosphereAADKSKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQRQYAALKGN*
Ga0114129_1336030113300009147Populus RhizosphereQIPGVGSMLPDKTQLLEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGAMK*
Ga0105241_1226816613300009174Corn RhizosphereQLPGMSSLIPDKATLLEQGKKLLADLTAMKQDPKLPAADKTKVDAMIPKATAVNTELAKPQVEPSKLTQLAAQLGDLQKQYASLRGN*
Ga0105242_1138551213300009176Miscanthus RhizosphereLIPDKATLLEQGKKLLADLTAMKQDPKLPAADKTKVDAMIPKATAVNTELAKPQVEPSKLTQLAAQLGDLQKQYASLRGN*
Ga0105063_100575213300009804Groundwater SandLPDKAQLLEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK*
Ga0105058_115875413300009837Groundwater SandSSMIPDKTALLEQAKKPLAELTAMKQDPKLPAANKSKVDTLLPQATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGVLKGLGR*
Ga0126384_1040510023300010046Tropical Forest SoilGAQMPSVGSLIPDKAALLEQAKKILADLMAMKQDPKLPAADKAKVDTLIPKAQNVNTELAKPQVEPSKLTQLASQLGDLQKQVAAIKPAALR*
Ga0126376_1091249023300010359Tropical Forest SoilLLEQAQQLVTELTSMKSSGKLGAADSAKVDSLLPKATAVNTELAQPQVEPSRLAQLASQLGDLQKQVAALTGAMK*
Ga0126376_1290677513300010359Tropical Forest SoilDKSKVDALIPQATSLNSELAKPQTEPSRLAQLASQLGDLQKQLAVLKGGGK*
Ga0126376_1315046513300010359Tropical Forest SoilTSLDKGALLEQAKQLVSELTALKENPSLSPADKGKVESLLPKANSLNSELAKPQVEPSRLTQLATQLGDLQKQVGSLKGMIGK*
Ga0126377_1012734543300010362Tropical Forest SoilDKSALLEQAKQLVAELTAMKQNPNLSAADKSKVDSLLPKANSLNTELAKPQVEPNRLTQLAGQLGDLQKQVGSLKGMIGK*
Ga0126377_1301658513300010362Tropical Forest SoilMLPDKTMLLEQGKKLVADLTSMKSSGKLSAADTKQVDSMLPKANALNTELAKPQVEPSKLTKLASQLGDLQKQASALQGKMK*
Ga0126377_1341727823300010362Tropical Forest SoilMIPDKTALLEQAKKLLADLTALKQDPKLPAADKTKVDAMIPKATAVNTELAKPQVEPSKLTQLATQLGDLQKQYASLRGN*
Ga0134124_1292405513300010397Terrestrial SoilLDKGALLEQAKQLVAELTAMKQNPSLSPADKGKVDSLLPKANSLNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKSTIK*
Ga0137455_106146023300011429SoilLEQAQKLVTELTSMKNSGKLSTADSAKVNTMLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQYSALKGGMK*
Ga0137389_1090566133300012096Vadose Zone SoilSMLPDKTQLLEQAQKLVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLAQLAGQLGDLQKQAGALQGLMK*
Ga0137363_1155683823300012202Vadose Zone SoilSSGKLNAADTTKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVTALRGSVK*
Ga0137390_1155460013300012363Vadose Zone SoilVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTQLAGQLGDLQKQVGALQGLMK*
Ga0137390_1174984823300012363Vadose Zone SoilVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVQPSRLTQLAGQLGDLQKQVGALQGLMK*
Ga0157288_1013088033300012901SoilGEKLVADLTSMKSSGKLDAADTQKVDSMLPKANALNTELAKPQVEPSKLTQLAGQLGDLQKQAGALQGKMK*
Ga0137394_1053859023300012922Vadose Zone SoilMPDKAALLEQGKKLLADLTAMKQDPKLPAADKTKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQKQYASLKGMGN*
Ga0137407_1012438113300012930Vadose Zone SoilKLVADLTSMKSSGKLGPADVGKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGLMK*
Ga0137410_1172380313300012944Vadose Zone SoilSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK*
Ga0164304_1077229533300012986SoilLTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTQLAGQLGDLQKQVGALRGVMK*
Ga0163162_1006030553300013306Switchgrass RhizosphereQDPKLPAADKSKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQRQYAALKGN*
Ga0157380_1069197613300014326Switchgrass RhizosphereDKTQLLEQAQKLVTDLTSLKSSGKLGAADTAKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVAALKGAMK*
Ga0180093_108440823300015258SoilSMKSSGKLTATDSAKVNTMLPKATAVNTELAKPQVDPSRLTQLAGQLGDLQKQYSALKGGMK*
Ga0132256_10340065413300015372Arabidopsis RhizosphereITLAQMQIPGVGSMLPDKAQLLEQAQKLVSDLTSMKSSGKLGAADTAKVDSLLPKATAVNNELAKPQVEPSRLAQLAGQLGDLQKQVGALKGVMP*
Ga0184605_1008404833300018027Groundwater SedimentEQAQKLVADLTSMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVEPSRLTQLASQLGDLQKQVGALKGLMK
Ga0184608_1033478913300018028Groundwater SedimentTAMKQDPKLPAADKSKVDKLLPQATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGVLKGMGR
Ga0184608_1034373813300018028Groundwater SedimentQAQKLVADLTSMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVEPSRLTQLASQLGDLQKQVGALKGLMK
Ga0180119_126706813300019228Groundwater SedimentKLVADLTSMKSSGNLSAADTAKVNTMLPKATAVNTELAKPQVEPSRLTQLAGQLGDLHKQYSALKGGMK
Ga0193707_104197033300019881SoilMKSSGKLGATDAGKVDSLLPKATALNTELAKPQVEPSRLTQLASQLGDLQKQVGALKGLM
Ga0193755_113208723300020004SoilGVGSLPSASSLMPDKAALLEQGKKLLADLTAMKQDPKLPAADKTKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQKQYASLKGMGN
Ga0193735_116787513300020006SoilLVTELTAMKSSGKLGAADASKVDSLLPKATAVNTELAKPEVPPSRLAQLAGQLGDLQKQVGSLQGLMK
Ga0193719_1016018913300021344SoilKTVLLEQGKKLLSELTAMKQDPKLPAADKSKVDTLIPQATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGMGR
Ga0247797_103899613300023057SoilPDKTMLLEQGKKLVADLTSMKSSGKLDAADTQKVDSMLPKANALNTELAKPQVEPSKLTKLAGQLGDLQKQAGALQGKMK
Ga0247670_106486523300024283SoilMLPDKATLLDQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKEVGALKGAMK
Ga0207684_1009607963300025910Corn, Switchgrass And Miscanthus RhizosphereKTQLLEQAQKLVTELTAMKSSGKLGAADAGKVDALLPKATALNTELAKPQVPPSRLAQLAGQLGDLQKQAGALQGLMK
Ga0207654_1103248013300025911Corn RhizosphereQLPGMSSLIPDKATLLEQGKKLLADLTAMKQDPKLPAADKTKVDAMIPKATAVNTELAKPQVEPSKLTQLAAQLGDLQKQYASLRGN
Ga0207646_1054153513300025922Corn, Switchgrass And Miscanthus RhizosphereLAQMQIPGVGSMLPDKAQLLEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK
Ga0207646_1160868323300025922Corn, Switchgrass And Miscanthus RhizospherePNVGSMLPDKTQLLEQAQKLVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTQLAGQLGDLQKQVGALRGLMK
Ga0207700_1045492933300025928Corn, Switchgrass And Miscanthus RhizosphereQLLEQAKKLVTELTAMKSSGKLNAADTSKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVTALRGSVK
Ga0207686_1070281723300025934Miscanthus RhizosphereLIPDKATLLEQGKKLLADLTAMKQDPKLPAADKTKVDAMIPKATAVNTELAKPQVEPSKLTQLAAQLGDLQKQYASLRGN
Ga0207675_10007743443300026118Switchgrass RhizosphereQGKKLLADLTAMRQDPKLPAADKSKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQRQYAALKGN
Ga0209802_100623413300026328SoilSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK
Ga0209158_125679123300026333SoilKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTKLAGQLGDLQKQVGALRGLMK
Ga0209804_133107613300026335SoilQIPGVGSMLPDKAQLLEQAQKLVADLTSMKTSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK
Ga0209057_107515733300026342SoilMTRGTEVVAFLTAWKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK
Ga0257149_106070913300026355SoilKLVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTQLAGQLGDLQKQVGALRGVMK
Ga0257165_105003333300026507SoilALAQLPNVGSMLPDKTQLLEQAQKLVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTQLAGQLGDLQKQVGALRGVMK
Ga0209648_1001225393300026551Grasslands SoilQLPSIGSMLPDKTQLLEQAQKLVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTQLAGQLGDLQKQVGALQGLMK
Ga0209577_1052216813300026552SoilVGSMLPDKAQLLEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLPQLAGQLGDLQKQVGALKGVMK
Ga0179587_1032952113300026557Vadose Zone SoilLVADLTSMKSSGKLGPADVGKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGLMK
Ga0209884_100196513300027013Groundwater SandMLPDKAQLLEQAQKLVADLTSMKSGGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK
Ga0209846_103860923300027277Groundwater SandMKSSGKLGAADTAKVASLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVM
Ga0209981_103814613300027378Arabidopsis Thaliana RhizosphereTSMKSSGKLDAADTQKVDSMLPKANALNTELAKPQVEPSKLTQLAGQLGDLQKQAGALQGKMK
Ga0209814_1052088723300027873Populus RhizosphereMKQDPKLPAADKSKVDALIPNASSVSSELAKPQVEPSRLTQLAGQLTDLQKQYASLKGH
Ga0209481_1050541513300027880Populus RhizosphereQDPKLPAADKSKVDALIPKASSVSSELAKPQVEPSRLTQLAGQLTDLQKQYASLKGH
Ga0209885_100425233300027950Groundwater SandAMLPDKAQLLEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK
Ga0268265_1135953913300028380Switchgrass RhizosphereVGSMLPDKTQLLEQAQKLVTDLTSMKSSGKLGAADTAKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVAALKGAIK
Ga0268264_1095386613300028381Switchgrass RhizosphereLVTDLTSMKSSGKLGAADTAKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVAALKGAIK
Ga0257175_110112623300028673SoilLPSIGSMLPDKTQLLEQAQKLVTELTAMKSSGKLGAADAGKVDSLLPKATALNTELAKPQVPPSRLTQLAGQLGDLQKQVGALQGLMK
Ga0307320_1035811613300028771SoilLTSMKSSCKLGAADAGKVDSLLPKATALNTELAKPQVEPSRLTQLASQLGDLQKQVGALKGLMK
(restricted) Ga0255311_114759013300031150Sandy SoilDKAQLLEQGQKLVADLTSMKSSGKLGAADVGKVDSLLPKATALNTELAKPEVAPSRLAQLAGQLGDLQKQVGALKGLMK
Ga0307498_1035312323300031170SoilSIPSVSSLIPDKATLLEQGKKLLADLTAMKQDPKLPAADKSKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQRQYAALKGN
Ga0307500_1017692823300031198SoilPDKAALLEQGKKLLADLTAMKQDPKLPAADKTKVDAMIPKTTAVNTELAKPQVEPSKLTQLAAQLGDLQKQYASLRGN
Ga0307495_1021633923300031199SoilAQLPGVGSIPSVSSLIPDKATLLEQGKKLLADLTAMKQDPKLPAADKSKVDTLIPKATAVNTELAKPEVEPSRLTKLAGQLGDLQRQYAALKGN
Ga0310886_1044922823300031562SoilTAMKQDPKLPAADKTKVDSLIPKTTAVNTELAKPQVEPSRLTQLAGQLGDLQKQYASLTG
Ga0307469_1053362513300031720Hardwood Forest SoilQAQKLVADLASMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK
Ga0307469_1075817323300031720Hardwood Forest SoilKLLADLTAMKQDPKLPAADKTKVDSLIPKTTAVNTELAKPQVEPSRLTQLAGQLGDLQKQYASLTGK
Ga0307469_1140826913300031720Hardwood Forest SoilAMKQDPKLPAADKTKVDALIPKATSVNTELAKPQVEPSRLTQLAGQLGDLQKQAAALKGVAR
Ga0307469_1210002613300031720Hardwood Forest SoilDLTSMKSSGKLGAADTAKVDSLLPKATALNTELAKPQVEPSRLTQLAGQLGDLQKQVGALKGVMK
Ga0307469_1233110423300031720Hardwood Forest SoilSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLVGQLGDLQKQVGALKGVMK
Ga0307468_10154490713300031740Hardwood Forest SoilMLPDKAQLLEQAQKLVADLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLVGQLGDLQKQVGALKGVMK
Ga0310900_1163867223300031908SoilDLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALQAAMK
Ga0310897_1065559313300032003SoilKLLEQAQKLVTDLTSMKSSGKLGAADTAKVDSLLPKATAVNTELAKPQVEPSRLTQLAGQLGDLQKQVGALQAAMK
Ga0335084_1107433023300033004SoilLLEQGKKLLADLTAMKQDPSVSAADKSKVDALIPKATSVNTELAKPQVEPSKLTQLASQLGDLQKQYAALKGH




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice