NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F105979

Metagenome / Metatranscriptome Family F105979

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105979
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 45 residues
Representative Sequence MNDFTLTTPTQEPGLATATQVVYQWNYDPEVEELRRLYVKAAEAQW
Number of Associated Samples 91
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 97.00 %
% of genes near scaffold ends (potentially truncated) 98.00 %
% of genes from short scaffolds (< 2000 bps) 90.00 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score0.35

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (96.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(11.000 % of family members)
Environment Ontology (ENVO) Unclassified
(32.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(35.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 32.43%    β-sheet: 0.00%    Coil/Unstructured: 67.57%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.35
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF07729FCD 32.00
PF00392GntR 11.00
PF00501AMP-binding 2.00
PF09413DUF2007 2.00
PF10095DUF2333 2.00
PF11583AurF 2.00
PF00106adh_short 1.00
PF00730HhH-GPD 1.00
PF00107ADH_zinc_N 1.00
PF00578AhpC-TSA 1.00
PF06628Catalase-rel 1.00
PF01966HD 1.00
PF14561TPR_20 1.00
PF09335SNARE_assoc 1.00
PF13517FG-GAP_3 1.00
PF03405FA_desaturase_2 1.00
PF13738Pyr_redox_3 1.00
PF00691OmpA 1.00
PF07298NnrU 1.00
PF13193AMP-binding_C 1.00
PF00216Bac_DNA_binding 1.00
PF07040DUF1326 1.00
PF00300His_Phos_1 1.00
PF00561Abhydrolase_1 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG1802DNA-binding transcriptional regulator, GntR familyTranscription [K] 32.00
COG2186DNA-binding transcriptional regulator, FadR familyTranscription [K] 32.00
COG01223-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylaseReplication, recombination and repair [L] 1.00
COG0177Endonuclease IIIReplication, recombination and repair [L] 1.00
COG0398Uncharacterized membrane protein YdjX, related to fungal oxalate transporter, TVP38/TMEM64 familyFunction unknown [S] 1.00
COG0586Membrane integrity protein DedA, putative transporter, DedA/Tvp38 familyCell wall/membrane/envelope biogenesis [M] 1.00
COG0753CatalaseInorganic ion transport and metabolism [P] 1.00
COG0776Bacterial nucleoid DNA-binding protein IHF-alphaReplication, recombination and repair [L] 1.00
COG1059Thermostable 8-oxoguanine DNA glycosylaseReplication, recombination and repair [L] 1.00
COG1194Adenine-specific DNA glycosylase, acts on AG and A-oxoG pairsReplication, recombination and repair [L] 1.00
COG1238Uncharacterized membrane protein YqaA, VTT domainFunction unknown [S] 1.00
COG22313-Methyladenine DNA glycosylase, HhH-GPD/Endo3 superfamilyReplication, recombination and repair [L] 1.00
COG4094Uncharacterized membrane proteinFunction unknown [S] 1.00
COG5588Uncharacterized conserved protein, DUF1326 domainFunction unknown [S] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms96.00 %
UnclassifiedrootN/A4.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2035918004|FACENC_F56XM5W01DQ64TAll Organisms → cellular organisms → Bacteria → Proteobacteria524Open in IMG/M
3300000559|F14TC_100329335All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1682Open in IMG/M
3300004019|Ga0055439_10067761All Organisms → cellular organisms → Bacteria → Proteobacteria1004Open in IMG/M
3300004027|Ga0055459_10055515All Organisms → cellular organisms → Bacteria955Open in IMG/M
3300004463|Ga0063356_100738936All Organisms → cellular organisms → Bacteria → Proteobacteria1361Open in IMG/M
3300005183|Ga0068993_10303702All Organisms → cellular organisms → Bacteria → Proteobacteria577Open in IMG/M
3300005332|Ga0066388_100522947All Organisms → cellular organisms → Bacteria1819Open in IMG/M
3300005332|Ga0066388_106952323All Organisms → cellular organisms → Bacteria569Open in IMG/M
3300005347|Ga0070668_100849169All Organisms → cellular organisms → Bacteria → Proteobacteria814Open in IMG/M
3300005364|Ga0070673_100902460All Organisms → cellular organisms → Bacteria → Proteobacteria819Open in IMG/M
3300005441|Ga0070700_100447962All Organisms → cellular organisms → Bacteria982Open in IMG/M
3300005446|Ga0066686_10613396All Organisms → cellular organisms → Bacteria740Open in IMG/M
3300005545|Ga0070695_101121662All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi644Open in IMG/M
3300005548|Ga0070665_100204354All Organisms → cellular organisms → Bacteria1976Open in IMG/M
3300005553|Ga0066695_10810994All Organisms → cellular organisms → Bacteria → Proteobacteria539Open in IMG/M
3300005559|Ga0066700_10349576All Organisms → cellular organisms → Bacteria1044Open in IMG/M
3300005576|Ga0066708_10948175All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium536Open in IMG/M
3300005617|Ga0068859_101393040All Organisms → cellular organisms → Bacteria → Proteobacteria773Open in IMG/M
3300005713|Ga0066905_100365953All Organisms → cellular organisms → Bacteria → Proteobacteria1157Open in IMG/M
3300005713|Ga0066905_101179612All Organisms → cellular organisms → Bacteria → Proteobacteria683Open in IMG/M
3300005764|Ga0066903_100555582All Organisms → cellular organisms → Bacteria1971Open in IMG/M
3300005764|Ga0066903_108161186All Organisms → cellular organisms → Bacteria536Open in IMG/M
3300005843|Ga0068860_100827420All Organisms → cellular organisms → Bacteria → Proteobacteria940Open in IMG/M
3300005937|Ga0081455_10852433All Organisms → cellular organisms → Bacteria → Proteobacteria569Open in IMG/M
3300006046|Ga0066652_101497480All Organisms → cellular organisms → Bacteria → Proteobacteria626Open in IMG/M
3300006163|Ga0070715_10485261All Organisms → cellular organisms → Bacteria → Proteobacteria704Open in IMG/M
3300006465|Ga0082250_11248006All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium501Open in IMG/M
3300006865|Ga0073934_10041203All Organisms → cellular organisms → Bacteria4181Open in IMG/M
3300009090|Ga0099827_11127699All Organisms → cellular organisms → Bacteria → Proteobacteria681Open in IMG/M
3300009094|Ga0111539_13037603Not Available542Open in IMG/M
3300009137|Ga0066709_101811866All Organisms → cellular organisms → Bacteria → Proteobacteria856Open in IMG/M
3300009137|Ga0066709_104357768All Organisms → cellular organisms → Bacteria → Proteobacteria516Open in IMG/M
3300009156|Ga0111538_11752728All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium782Open in IMG/M
3300009162|Ga0075423_11294528All Organisms → cellular organisms → Bacteria → Proteobacteria779Open in IMG/M
3300010047|Ga0126382_11330186Not Available651Open in IMG/M
3300010047|Ga0126382_12351765All Organisms → cellular organisms → Bacteria → Proteobacteria516Open in IMG/M
3300010303|Ga0134082_10252780All Organisms → cellular organisms → Bacteria → Proteobacteria730Open in IMG/M
3300010360|Ga0126372_12200641All Organisms → cellular organisms → Bacteria → Proteobacteria600Open in IMG/M
3300010362|Ga0126377_12317881All Organisms → cellular organisms → Bacteria → Proteobacteria613Open in IMG/M
3300010366|Ga0126379_13210533All Organisms → cellular organisms → Bacteria547Open in IMG/M
3300010376|Ga0126381_104474448All Organisms → cellular organisms → Bacteria → Proteobacteria540Open in IMG/M
3300010400|Ga0134122_12168607All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium598Open in IMG/M
3300011271|Ga0137393_11193993All Organisms → cellular organisms → Bacteria646Open in IMG/M
3300012205|Ga0137362_11142828All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium661Open in IMG/M
3300012205|Ga0137362_11425035All Organisms → cellular organisms → Bacteria579Open in IMG/M
3300012349|Ga0137387_11181408All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium541Open in IMG/M
3300012351|Ga0137386_10977737All Organisms → cellular organisms → Bacteria → Proteobacteria603Open in IMG/M
3300012362|Ga0137361_10168650All Organisms → cellular organisms → Bacteria → Proteobacteria1967Open in IMG/M
3300012582|Ga0137358_10022062All Organisms → cellular organisms → Bacteria4110Open in IMG/M
3300012685|Ga0137397_10775900All Organisms → cellular organisms → Bacteria711Open in IMG/M
3300012923|Ga0137359_10456404All Organisms → cellular organisms → Bacteria1130Open in IMG/M
3300012975|Ga0134110_10262350All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria738Open in IMG/M
3300012977|Ga0134087_10031231All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria2016Open in IMG/M
3300014155|Ga0181524_10146972Not Available1226Open in IMG/M
3300014638|Ga0181536_10065266All Organisms → cellular organisms → Bacteria2262Open in IMG/M
3300014885|Ga0180063_1192771All Organisms → cellular organisms → Bacteria → Proteobacteria655Open in IMG/M
3300015259|Ga0180085_1226195All Organisms → cellular organisms → Bacteria554Open in IMG/M
3300015264|Ga0137403_10987978All Organisms → cellular organisms → Bacteria → Proteobacteria689Open in IMG/M
3300018058|Ga0187766_11246812Not Available540Open in IMG/M
3300018468|Ga0066662_12098854All Organisms → cellular organisms → Bacteria → Proteobacteria592Open in IMG/M
3300019487|Ga0187893_10353882All Organisms → cellular organisms → Bacteria → Proteobacteria1016Open in IMG/M
3300020220|Ga0194119_10899827All Organisms → cellular organisms → Bacteria → Proteobacteria515Open in IMG/M
3300021859|Ga0210334_10941793All Organisms → cellular organisms → Bacteria4405Open in IMG/M
(restricted) 3300024054|Ga0233425_10242805All Organisms → cellular organisms → Bacteria → Proteobacteria841Open in IMG/M
3300025550|Ga0210098_1098809All Organisms → cellular organisms → Bacteria → Proteobacteria503Open in IMG/M
3300025900|Ga0207710_10343214All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium760Open in IMG/M
3300025910|Ga0207684_11655064All Organisms → cellular organisms → Bacteria → Proteobacteria517Open in IMG/M
3300025933|Ga0207706_10970237All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Rhizobium/Agrobacterium group → Rhizobium715Open in IMG/M
3300025936|Ga0207670_11071298All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium680Open in IMG/M
3300025938|Ga0207704_11327472All Organisms → cellular organisms → Bacteria → Proteobacteria615Open in IMG/M
3300025972|Ga0207668_11005436All Organisms → cellular organisms → Bacteria745Open in IMG/M
3300025986|Ga0207658_11458796All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium626Open in IMG/M
3300026035|Ga0207703_10086952All Organisms → cellular organisms → Bacteria → Proteobacteria2620Open in IMG/M
3300026088|Ga0207641_12401648All Organisms → cellular organisms → Bacteria526Open in IMG/M
3300026529|Ga0209806_1298801All Organisms → cellular organisms → Bacteria → Proteobacteria541Open in IMG/M
3300026548|Ga0209161_10070034All Organisms → cellular organisms → Bacteria2207Open in IMG/M
3300027770|Ga0209086_10027302All Organisms → cellular organisms → Bacteria3464Open in IMG/M
3300027900|Ga0209253_10340312All Organisms → cellular organisms → Bacteria → Proteobacteria1154Open in IMG/M
3300027905|Ga0209415_10647236All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium770Open in IMG/M
(restricted) 3300027977|Ga0247834_1307756All Organisms → cellular organisms → Bacteria → Proteobacteria544Open in IMG/M
3300028379|Ga0268266_10535160All Organisms → cellular organisms → Bacteria → Proteobacteria1121Open in IMG/M
3300028380|Ga0268265_11116010All Organisms → cellular organisms → Bacteria → Proteobacteria783Open in IMG/M
3300031242|Ga0265329_10109930All Organisms → cellular organisms → Bacteria → Proteobacteria880Open in IMG/M
3300031344|Ga0265316_10946893All Organisms → cellular organisms → Bacteria → Proteobacteria600Open in IMG/M
3300031681|Ga0318572_10424188All Organisms → cellular organisms → Bacteria → Proteobacteria791Open in IMG/M
3300031708|Ga0310686_109077153All Organisms → cellular organisms → Bacteria691Open in IMG/M
3300031769|Ga0318526_10070173All Organisms → cellular organisms → Bacteria → Proteobacteria1368Open in IMG/M
3300031805|Ga0318497_10188674All Organisms → cellular organisms → Bacteria → Proteobacteria1137Open in IMG/M
3300031942|Ga0310916_11303014All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium598Open in IMG/M
3300031949|Ga0214473_10747215All Organisms → cellular organisms → Bacteria1061Open in IMG/M
3300031949|Ga0214473_11310853All Organisms → cellular organisms → Bacteria → Proteobacteria742Open in IMG/M
3300032180|Ga0307471_101889837All Organisms → cellular organisms → Bacteria → Proteobacteria747Open in IMG/M
3300032180|Ga0307471_103194771All Organisms → cellular organisms → Bacteria → Proteobacteria581Open in IMG/M
3300032180|Ga0307471_103348651All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium568Open in IMG/M
3300032205|Ga0307472_100951754All Organisms → cellular organisms → Bacteria → Proteobacteria800Open in IMG/M
3300032261|Ga0306920_103904888All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium543Open in IMG/M
3300032829|Ga0335070_10098049All Organisms → cellular organisms → Bacteria → Proteobacteria3075Open in IMG/M
3300032955|Ga0335076_10728836All Organisms → cellular organisms → Bacteria → Proteobacteria873Open in IMG/M
3300033486|Ga0316624_10063422All Organisms → cellular organisms → Bacteria2433Open in IMG/M
3300034165|Ga0364942_0270909All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium555Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil11.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil7.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil6.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil6.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere6.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere5.00%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands4.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.00%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil3.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere3.00%
FreshwaterEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater2.00%
BogEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog2.00%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil2.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil2.00%
RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere2.00%
Freshwater Lake SedimentEnvironmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Lake Sediment1.00%
Freshwater LakeEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake1.00%
Freshwater LakeEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake1.00%
SedimentEnvironmental → Aquatic → Marine → Oceanic → Sediment → Sediment1.00%
EstuarineEnvironmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine1.00%
Hot Spring SedimentEnvironmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.00%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.00%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.00%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere1.00%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks1.00%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.00%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere1.00%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere1.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.00%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2035918004Soil microbial communities from sample at FACE Site 2 North Carolina CO2-EnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300004019Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2EnvironmentalOpen in IMG/M
3300004027Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Joice_CattailNLC_D2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005183Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005347Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaGHost-AssociatedOpen in IMG/M
3300005364Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaGHost-AssociatedOpen in IMG/M
3300005441Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005548Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaGHost-AssociatedOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005617Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2Host-AssociatedOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006465Deep-sea sediment bacterial and archaeal communities from Fram Strait - Hausgarten IXEnvironmentalOpen in IMG/M
3300006865Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014155Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin05_60_metaGEnvironmentalOpen in IMG/M
3300014638Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin17_60_metaGEnvironmentalOpen in IMG/M
3300014885Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10DEnvironmentalOpen in IMG/M
3300015259Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10DEnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018058Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300020220Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015018 Mahale Deep Cast 100mEnvironmentalOpen in IMG/M
3300021859Metatranscriptome of estuarine sediment microbial communities from the Columbia River estuary, Oregon, United States ? S.306 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300024054 (restricted)Freshwater microbial communities from Lake Towuti, South Sulawesi, Indonesia - Watercolumn_Towuti2014_140_MGEnvironmentalOpen in IMG/M
3300025550Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Joice_ThreeSqA_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025900Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025936Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025972Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025986Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026035Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027770Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_130207_XF_MetaG (SPAdes)EnvironmentalOpen in IMG/M
3300027900Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies - BRP12 BR (SPAdes)EnvironmentalOpen in IMG/M
3300027905Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes)EnvironmentalOpen in IMG/M
3300027977 (restricted)Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_12mEnvironmentalOpen in IMG/M
3300028379Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300031242Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-16-27 metaGHost-AssociatedOpen in IMG/M
3300031344Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-5-22 metaGHost-AssociatedOpen in IMG/M
3300031681Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031769Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f24EnvironmentalOpen in IMG/M
3300031805Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f23EnvironmentalOpen in IMG/M
3300031942Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300032829Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3EnvironmentalOpen in IMG/M
3300032955Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.5EnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M
3300034165Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
FACENCA_68721102035918004SoilMNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNLYVKAAEAQWVGARDIEWDRPI
F14TC_10032933533300000559SoilMSDFTLNTTTQEPGLGTAMQVVYQWNYDPEVEELRRLYVKAAEAQWVAERD
Ga0055439_1006776113300004019Natural And Restored WetlandsMNDFTLHSSTQTPGLATAMQVIYQWNYDSEVDELRRLYVKGTEAQWIAER
Ga0055459_1005551513300004027Natural And Restored WetlandsMSDFKIETTTQEADLPTAMQVIYQWNYDPEVEELRNLYVKAAEAQWIG
Ga0063356_10073893613300004463Arabidopsis Thaliana RhizosphereMRDFGLTTDTQEPELDTAMKVIYQWSYDPEVDELRRLYVKAAEA
Ga0068993_1030370213300005183Natural And Restored WetlandsMKNGAFQLDTPTQEPGLATAMEVIYQWNYDSEVEELRRLYVKAAE
Ga0066388_10052294733300005332Tropical Forest SoilMNEFTVQTPTQETGLATAMEVVYQWNYDAEVEELRNLYVK
Ga0066388_10695232323300005332Tropical Forest SoilMSEFMLDTPTQEPDLESAMKVVYQWNYGSEVEELRRLYVKAAEAQWVAERDID
Ga0070668_10084916923300005347Switchgrass RhizosphereMSEFTLTTPSQEPGYQTAMEVVFQWNYDPELEELRNLYVKAAE
Ga0070673_10090246023300005364Switchgrass RhizosphereMTTEFTIETPTQEPGYGTAMEVVFQWNYEPEVEELRRLYAK
Ga0070700_10044796223300005441Corn, Switchgrass And Miscanthus RhizosphereMSEFMLKTATQEPDLATAMQVVYQWNYDPEVEELRNLYVKA
Ga0066686_1061339613300005446SoilMNDFTLTTPTQEPGLATSMQVVYQWNYDPEVEELRRLYVKAAEAQWVAERDLDW
Ga0070695_10112166213300005545Corn, Switchgrass And Miscanthus RhizosphereMSDFTLQTPTQEPDLESAMRVVYQWNYDPEVEQLRSLYVKAAEAQWISNRD
Ga0070665_10020435433300005548Switchgrass RhizosphereMSEFTLKTPSQEPGYQTAMEVVFQWNYDTELEELRN
Ga0066695_1081099423300005553SoilMNDFTLTTPTQEPGLATATQVVYQWNYDPEVEELRRLYVKAAEAQWI
Ga0066700_1034957613300005559SoilMNDFTLTTSTQEPGLATAMAVVYQWNYDAEVDELRRLYVKAAEAQWI
Ga0066708_1094817523300005576SoilMSEFTIDTPTQEPGLETAMKVVYQWNYDPEVEELRRLYVKAAEAQWISERDVDWNRPIDH
Ga0068859_10139304013300005617Switchgrass RhizosphereMSEFTLTSATQEPELPTAMKVVYQWNYEPEVEELR
Ga0066905_10036595333300005713Tropical Forest SoilMNEFTLTTPTQEPDLGTAMKVVYQWNYGSEVEELR
Ga0066905_10117961223300005713Tropical Forest SoilMNDFELTTPTQEPDLGTAMKVVYQWNYGSEVEELRRLYVKAAEAQWVAE
Ga0066903_10055558213300005764Tropical Forest SoilMSDFTLATTTQEPNLDTAMKVVYQWNYDPEVEELRRLY
Ga0066903_10816118613300005764Tropical Forest SoilMSDFTVTTSTQEPELDTAMKVVYQWNYEPEVEELRRLYVKAAEAQW
Ga0068860_10082742013300005843Switchgrass RhizosphereMSEFTLTTPSQEPGYQTAMEVVFQWNYDPELEELR
Ga0081455_1085243313300005937Tabebuia Heterophylla RhizosphereMKSDGFNLQTPTQEPGLATAMEVIYQWNYDSEVEELRR
Ga0066652_10149748023300006046SoilMSEFTLSSPSQEPGYQTAMEVVFQWNYDPEVEELRNLYVKAAEAQW
Ga0070715_1048526123300006163Corn, Switchgrass And Miscanthus RhizosphereMNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNLYV
Ga0082250_1124800623300006465SedimentMKLQTETQEPDMETAMKVVYQWNYGSELEELRRLYVKGAELQWVA
Ga0073934_1004120363300006865Hot Spring SedimentMNEFRLRTATQEPDLDTAMKIIYQWNYDPEVEELRRLYIKAAEAQWIAERD
Ga0099827_1112769923300009090Vadose Zone SoilMNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNLYVKAAEAQWIGARDI
Ga0111539_1303760323300009094Populus RhizosphereMTTEFTIETPTQEPGYGTAMEVVFQWNYEPEVEELR
Ga0066709_10181186613300009137Grasslands SoilMNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNLYVK
Ga0066709_10435776813300009137Grasslands SoilMTVQTSDFTIQTPTQEPGLGTAMEVVYQWNYDPEVEELRSLY
Ga0111538_1175272823300009156Populus RhizosphereMSDFTLRTPTQEPDLATAMQVVYQWKYDPEVEELR
Ga0075423_1129452813300009162Populus RhizosphereMSEFTVQTPTQETGIDTAMQVVYQWNYDPEVEELRSLYVKAA
Ga0126382_1133018613300010047Tropical Forest SoilMNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNL
Ga0126382_1235176523300010047Tropical Forest SoilMETLTLQTPTQDPGLATAMEVVYQWNYDPEVEELRNLYVKAAEAQ
Ga0134082_1025278013300010303Grasslands SoilMNDFKLTTPTQEPGLASAMQVVYQWNYDPEVEELRRLY
Ga0126372_1220064123300010360Tropical Forest SoilMNDFMLETPTQEPDLASAMKVVYQWNYGSEVEELRRLYVKAAEAQWVAE
Ga0126377_1231788113300010362Tropical Forest SoilMNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNLYVKAAEAQWVGARDI
Ga0126379_1321053323300010366Tropical Forest SoilMSDFRLQTDTQEPGLDTAMKVVYQWNYDPEVEELRRLYVKAAEAQWVSE
Ga0126381_10447444813300010376Tropical Forest SoilMSDFTLRTPTQEPGYQTAMEVVFQWNYDPEVEELRNLYVKAAE
Ga0134122_1216860713300010400Terrestrial SoilMSEFTLNTPTQEPELDTAMKVVFQWNYDPEVDELRRLYVKAAEAQWISSRDLDW
Ga0137393_1119399313300011271Vadose Zone SoilMSTLPIRTETQEPDLETAMKVVYQWNYDSEVDELRR
Ga0137362_1114282823300012205Vadose Zone SoilMSDAFTLKTSTQEPDLPTAMQIVYQWNYDPEVEELRNLYVK
Ga0137362_1142503523300012205Vadose Zone SoilMSDFTTVTATQEPALDTAMKVVYQWNYDPEVEELR
Ga0137387_1118140813300012349Vadose Zone SoilMNDFTLTTPTQEPGLATSMQVVYQWNYDPEVDELRRLYVKAAEA*
Ga0137386_1097773713300012351Vadose Zone SoilMNDFTLTTPTQEPGLATSMQVVYQWNYDPEVEELRRLYVKAAEAQWVAER
Ga0137361_1016865033300012362Vadose Zone SoilMPKERSMSDDFTLKTPTQEPDLPTAMQVIYQWNYDPEIEELRNLYVKAAEAQWIGAKDL
Ga0137358_1002206213300012582Vadose Zone SoilMSDDFTLKTPTQEPDLPTAMQVIYQWNYDPEIEELRNLYVKAAEAQWIGAKDLDWNRE
Ga0137397_1077590023300012685Vadose Zone SoilMEEPPMKEFTLQTATQEPGLGTAMEVVYQWNYDVEVDELRSLY
Ga0137359_1045640413300012923Vadose Zone SoilMDDFKLTTATQEPPLETAMKVVYQWNYDPEVEELRRLY
Ga0134110_1026235013300012975Grasslands SoilMSDFTIETPTQEPGLETAMKVVYQWNYDPEVEELRRLYVKAAEAQWISEATSI
Ga0134087_1003123113300012977Grasslands SoilMNDFTLTTPTQEPGLATATQVVYQWNYDPEVEELRRLYVKAAEAQW
Ga0181524_1014697223300014155BogMSQFDINSATQEPDLATAMKVVYQWNYGSEVEELR
Ga0181536_1006526613300014638BogMSQFDINTATQEPDLATAMKVVYQWNYGSEVEELRHLYVKALEAQ
Ga0180063_119277113300014885SoilMDDFSLNTPTQEPGLETAMKVVYQWDYDPQVEELRRLYVKAAEAQWIADRDI
Ga0180085_122619523300015259SoilMSEFTLKTTTQEPDLATAMQVVYQWNYDAEVEELRNLYVKAAEAQWIGEKHLDW
Ga0137403_1098797823300015264Vadose Zone SoilMNDFTLTTPTQEPGLASAMQVVYQWNYDPEVDELRRLYVKAAEAQWV
Ga0187766_1124681213300018058Tropical PeatlandMSEFTVTTATQEAGLDTAMQVVYQWNYEPQVDELRRLYVKATEAQ
Ga0066662_1209885413300018468Grasslands SoilMNDFTLTTSTQEPGLATAMAVVYPWNYDAEVDELRRL
Ga0187893_1035388223300019487Microbial Mat On RocksMSDFSLATATQEPGLDTAMKVVYQWDYEPQVEELRRLYVKAA
Ga0194119_1089982713300020220Freshwater LakeMSDFHIQTETQEPPLDTAMQVIYQWSYDPEVDELRNL
Ga0210334_1094179333300021859EstuarineMSEFKLRTETQEPDLDTAMKVVYQWNYDPEVEELRRLYHKATE
(restricted) Ga0233425_1024280513300024054FreshwaterMSDFTLETPTQEPGLATAMQVVYQWSYEPEVDELRNLYVKGAEAQWVATRDIDWDRDID
Ga0210098_109880913300025550Natural And Restored WetlandsMSDFKIETTTQEADLPTAMQVIYQWNYDPEVEELRNLY
Ga0207710_1034321423300025900Switchgrass RhizosphereMSDFTVKTSTQEPDLSTAMQIVYQWNYEPEVDELRNLYVKAAEAQWV
Ga0207684_1165506423300025910Corn, Switchgrass And Miscanthus RhizosphereMKNGAFSLDSPTQEPGLATAMEVIYQWNYDSEVEELRRLYVKAAEA
Ga0207706_1097023713300025933Corn RhizosphereMIDFTLNTTTQEPGLGTAMQVVYQWNYDPEVEELRRLYVKAAEAQWVAERDL
Ga0207670_1107129813300025936Switchgrass RhizosphereMSEFMLKTATQEPDLATAMQVVYQWNYDPEVEELRNLYVKAAEGQWIGA
Ga0207704_1132747223300025938Miscanthus RhizosphereMNEFTLQTPTQEPGYETAMEVVFQWNYDPEVEELRNLYVKAAEAQW
Ga0207668_1100543613300025972Switchgrass RhizosphereMTTEFTIETPTQEPGYGTAMEVVFQWNYEPEVEELRRLYAKATEAQWI
Ga0207658_1145879623300025986Switchgrass RhizosphereMRDFGLTTDTQEPELDTAMKVIYQWSYDPEVDELRRLYVKAAEAQWVHRVATRAPVP
Ga0207703_1008695253300026035Switchgrass RhizosphereMSDFTLNTMTQEPGLGTAMQVVYQWNYDPEVEELRRLYVKAAEAQWVAER
Ga0207641_1240164823300026088Switchgrass RhizosphereMTTEFTIETPTQEPGYGTAMEVVFQWNYEPEVEELRRL
Ga0209806_129880113300026529SoilMNDFTLTTPTQEPGLASAMQVVYQWNYDPEVEELRRLYVKAAEAQWVSERDLDWSRP
Ga0209161_1007003433300026548SoilMNDFTLTTPTQEPGLASAMQVVYQWNYDPEVDELRRLYVKAAEAQ
Ga0209086_1002730213300027770Freshwater LakeMSEFQLTTGTEEPGLDTAMKVVYQWNYEPDVEELRSLYHKAT
Ga0209253_1034031213300027900Freshwater Lake SedimentMSDFSLTTSTQEPDLGTAMKVVYQWNYGSEVEELRRLYVKAAEAQWVATR
Ga0209415_1064723623300027905Peatlands SoilMSEFTITPATQDPGFETAMKVVYQWNYGSEVEELRRLYVKA
(restricted) Ga0247834_130775623300027977FreshwaterMSDFQLTTTTQEPGFDTAMQVIYQWKYDPDVEELRDLYHKATQLQWV
Ga0268266_1053516033300028379Switchgrass RhizosphereMSEFTLKTPSQEPGYQTAMEVVFQWNYDTELEELRNLYVKAAEAQWI
Ga0268265_1111601013300028380Switchgrass RhizosphereMDDFTLNTATQEAGLDTAMKVVYQWNYEPEVEELRRLYMKAT
Ga0265329_1010993023300031242RhizosphereMSDFTITTGTQEPPLDTAMQVIYQWNYDPEVEELRNLYVKAAEAQWV
Ga0265316_1094689313300031344RhizosphereMDDFTLETATQEPGIATAMQVIYQWNYEPQVEELRRLYGKAT
Ga0318572_1042418823300031681SoilMSEFRLDTPTQEPDLESAMTVVYQWNYGSEVEELRRLYVKAAEAQWVAERDI
Ga0310686_10907715323300031708SoilMNQFDITSATQEPGLSTAMQVVYQWNYGSEVEELR
Ga0318526_1007017313300031769SoilMSEFMLDTPTQEPDLESAMKVVYQWNYGSEVDELRRLYVK
Ga0318497_1018867413300031805SoilMSEFMLDTPTQEPDLESAMKVVYQWNYGSEVDELR
Ga0310916_1130301423300031942SoilMNEFTLTTRTQEPDLGTAMKVVYQWNYGSEVEELRRLYVKA
Ga0214473_1074721523300031949SoilMREFTLETPTQDPPLETAMRVVYQWSYEPEVEELRRLYLKAVEAQWIAARDIDWE
Ga0214473_1131085313300031949SoilMRDFTLTTDTQEPALDAAMKVIYQWNYDPEVEELRRLYVKAADAPWVSE
Ga0307471_10188983713300032180Hardwood Forest SoilMSEFTLTTPSQEPGYQTAMEVVFQWNYDPELEELRNLYVKAAEAQW
Ga0307471_10319477113300032180Hardwood Forest SoilMKPFSLHTTTQEPGLGTAMEVVYQWNYDAEVDELRNLYVKAAEA
Ga0307471_10334865123300032180Hardwood Forest SoilMRDFSLSTDTQEPELDTAMKVIYQWSYDPEVEELRRLYVKAAEAQWVSERDLDWNRSIDH
Ga0307472_10095175423300032205Hardwood Forest SoilMNDFTLASPTQEPGLATAMHVVYQWNYDPEVDELRRLYVKAAEAQWIADRD
Ga0306920_10390488813300032261SoilMNEFTLTTPTQEPDLGTAMKVVYQWNYGSEVEELRRLYVKAAEA
Ga0335070_1009804913300032829SoilMSAFTVKTPTQEPDLATAMQVVYQWNYDPEVEELRNLYVKAAEAQWISNRDLD
Ga0335076_1072883623300032955SoilMSDFNVRTTTQEPDLDTAMKVIYQWNYEPEVEELRRLYVKAAD
Ga0316624_1006342213300033486SoilMSDFKLTTPTQEPDLDTAMKVIYQWNYDPEVEELRRLYVKATEAQW
Ga0364942_0270909_3_1553300034165SedimentMSEFTVKTSTQEPDLPTAMQVVYQWNYDTDVEELRNLYVKAAEAQWIGAKH


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.