NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F100800

Metagenome Family F100800

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F100800
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 86 residues
Representative Sequence IEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAEVQVSFDPRWHFPTYVYTRALPGPDMWAVIEARGLRPI
Number of Associated Samples 77
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 12.87 %
% of genes near scaffold ends (potentially truncated) 76.47 %
% of genes from short scaffolds (< 2000 bps) 82.35 %
Associated GOLD sequencing projects 73
AlphaFold2 3D model prediction Yes
3D model pTM-score0.27

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.020 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(42.157 % of family members)
Environment Ontology (ENVO) Unclassified
(47.059 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(46.078 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 11.82%    β-sheet: 16.36%    Coil/Unstructured: 71.82%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.27
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 102 Family Scaffolds
PF00903Glyoxalase 4.90
PF01565FAD_binding_4 4.90
PF00089Trypsin 3.92
PF01408GFO_IDH_MocA 2.94
PF069833-dmu-9_3-mt 1.96
PF07715Plug 1.96
PF04389Peptidase_M28 1.96
PF07676PD40 1.96
PF02585PIG-L 0.98
PF13783DUF4177 0.98
PF04993TfoX_N 0.98
PF01979Amidohydro_1 0.98
PF00583Acetyltransf_1 0.98
PF07992Pyr_redox_2 0.98
PF04226Transgly_assoc 0.98
PF03544TonB_C 0.98
PF07045DUF1330 0.98
PF08309LVIVD 0.98
PF00403HMA 0.98
PF08031BBE 0.98
PF09413DUF2007 0.98
PF03965Penicillinase_R 0.98
PF14499DUF4437 0.98
PF14534DUF4440 0.98
PF01613Flavin_Reduct 0.98
PF01872RibD_C 0.98
PF09594GT87 0.98
PF08241Methyltransf_11 0.98
PF01592NifU_N 0.98
PF07394DUF1501 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 102 Family Scaffolds
COG2764Zn-dependent glyoxalase, PhnB familyEnergy production and conversion [C] 1.96
COG3865Glyoxalase superfamily enzyme, possible 3-demethylubiquinone-9 3-methyltransferaseGeneral function prediction only [R] 1.96
COG0262Dihydrofolate reductaseCoenzyme transport and metabolism [H] 0.98
COG0277FAD/FMN-containing lactate dehydrogenase/glycolate oxidaseEnergy production and conversion [C] 0.98
COG0810Periplasmic protein TonB, links inner and outer membranesCell wall/membrane/envelope biogenesis [M] 0.98
COG0822Fe-S cluster assembly scaffold protein IscU, NifU familyPosttranslational modification, protein turnover, chaperones [O] 0.98
COG1846DNA-binding transcriptional regulator, MarR familyTranscription [K] 0.98
COG1853FMN reductase RutF, DIM6/NTAB familyEnergy production and conversion [C] 0.98
COG1985Pyrimidine reductase, riboflavin biosynthesisCoenzyme transport and metabolism [H] 0.98
COG2120N-acetylglucosaminyl deacetylase, LmbE familyCarbohydrate transport and metabolism [G] 0.98
COG2217Cation-transporting P-type ATPaseInorganic ion transport and metabolism [P] 0.98
COG2261Uncharacterized membrane protein YeaQ/YmgE, transglycosylase-associated protein familyGeneral function prediction only [R] 0.98
COG2608Copper chaperone CopZInorganic ion transport and metabolism [P] 0.98
COG3070Transcriptional regulator of competence genes, TfoX/Sxy familyTranscription [K] 0.98
COG3682Transcriptional regulator, CopY/TcrY familyTranscription [K] 0.98
COG5276Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domainFunction unknown [S] 0.98
COG5470Uncharacterized conserved protein, DUF1330 familyFunction unknown [S] 0.98


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms99.02 %
UnclassifiedrootN/A0.98 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002560|JGI25383J37093_10110619All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium787Open in IMG/M
3300002907|JGI25613J43889_10089942All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11812Open in IMG/M
3300005171|Ga0066677_10632486All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium604Open in IMG/M
3300005184|Ga0066671_10102223All Organisms → cellular organisms → Bacteria1591Open in IMG/M
3300005187|Ga0066675_10656656All Organisms → cellular organisms → Bacteria788Open in IMG/M
3300005187|Ga0066675_11126219All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium585Open in IMG/M
3300005446|Ga0066686_10093305All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111927Open in IMG/M
3300005451|Ga0066681_10945443All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium515Open in IMG/M
3300005467|Ga0070706_101142094All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11716Open in IMG/M
3300005518|Ga0070699_100393405All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111252Open in IMG/M
3300005518|Ga0070699_100494794All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111110Open in IMG/M
3300005556|Ga0066707_10220673All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111229Open in IMG/M
3300005556|Ga0066707_10891718All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11545Open in IMG/M
3300005561|Ga0066699_10680114All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11735Open in IMG/M
3300005574|Ga0066694_10093299All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111403Open in IMG/M
3300006796|Ga0066665_10205379All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium1532Open in IMG/M
3300006796|Ga0066665_10366783All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111180Open in IMG/M
3300006797|Ga0066659_11345927All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11596Open in IMG/M
3300007255|Ga0099791_10055426All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes1779Open in IMG/M
3300007258|Ga0099793_10102195All Organisms → cellular organisms → Bacteria1327Open in IMG/M
3300007788|Ga0099795_10210618All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11823Open in IMG/M
3300009012|Ga0066710_101098133All Organisms → cellular organisms → Bacteria1230Open in IMG/M
3300009012|Ga0066710_101249379All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111151Open in IMG/M
3300009012|Ga0066710_101859458All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11905Open in IMG/M
3300009012|Ga0066710_102855954All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11680Open in IMG/M
3300009089|Ga0099828_10448641All Organisms → cellular organisms → Bacteria1163Open in IMG/M
3300010303|Ga0134082_10349378All Organisms → cellular organisms → Bacteria → environmental samples → uncultured bacterium626Open in IMG/M
3300010329|Ga0134111_10008225All Organisms → cellular organisms → Bacteria3268Open in IMG/M
3300010333|Ga0134080_10007202All Organisms → cellular organisms → Bacteria3878Open in IMG/M
3300010337|Ga0134062_10229594All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11856Open in IMG/M
3300010371|Ga0134125_11794478All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11667Open in IMG/M
3300011269|Ga0137392_10673828All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11857Open in IMG/M
3300011431|Ga0137438_1059442All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium1141Open in IMG/M
3300012096|Ga0137389_10987776All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11722Open in IMG/M
3300012203|Ga0137399_11350645All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11598Open in IMG/M
3300012203|Ga0137399_11362660All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11595Open in IMG/M
3300012205|Ga0137362_10617834All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11934Open in IMG/M
3300012206|Ga0137380_10098864All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_112666Open in IMG/M
3300012206|Ga0137380_10299586All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111441Open in IMG/M
3300012207|Ga0137381_10098377All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_112478Open in IMG/M
3300012207|Ga0137381_10113890All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_112304Open in IMG/M
3300012207|Ga0137381_10999179All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11722Open in IMG/M
3300012208|Ga0137376_10086306All Organisms → cellular organisms → Bacteria2640Open in IMG/M
3300012208|Ga0137376_10677058All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11890Open in IMG/M
3300012209|Ga0137379_10719974All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11903Open in IMG/M
3300012209|Ga0137379_11622233All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11545Open in IMG/M
3300012209|Ga0137379_11691853All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11529Open in IMG/M
3300012211|Ga0137377_10825657All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11859Open in IMG/M
3300012285|Ga0137370_10147149All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111360Open in IMG/M
3300012351|Ga0137386_10684970All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium737Open in IMG/M
3300012356|Ga0137371_10028398All Organisms → cellular organisms → Bacteria4315Open in IMG/M
3300012357|Ga0137384_10868203All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11728Open in IMG/M
3300012360|Ga0137375_10342805All Organisms → cellular organisms → Bacteria1330Open in IMG/M
3300012361|Ga0137360_10791371All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11816Open in IMG/M
3300012361|Ga0137360_11080947All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium693Open in IMG/M
3300012362|Ga0137361_10205878All Organisms → cellular organisms → Bacteria1781Open in IMG/M
3300012362|Ga0137361_10845799All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11831Open in IMG/M
3300012532|Ga0137373_10026767All Organisms → cellular organisms → Bacteria5672Open in IMG/M
3300012683|Ga0137398_10310363All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111062Open in IMG/M
3300012683|Ga0137398_10447627All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11883Open in IMG/M
3300012685|Ga0137397_10668886All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium772Open in IMG/M
3300012918|Ga0137396_10094078All Organisms → cellular organisms → Bacteria2130Open in IMG/M
3300012918|Ga0137396_10512898All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11888Open in IMG/M
3300012918|Ga0137396_10644923All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Archangiaceae → Stigmatella → Stigmatella aurantiaca782Open in IMG/M
3300012925|Ga0137419_11155553All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11647Open in IMG/M
3300012925|Ga0137419_11828749All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11520Open in IMG/M
3300012944|Ga0137410_10010687All Organisms → cellular organisms → Bacteria6215Open in IMG/M
3300013758|Ga0120147_1011980All Organisms → cellular organisms → Bacteria1798Open in IMG/M
3300013772|Ga0120158_10512192All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11525Open in IMG/M
3300015162|Ga0167653_1000143All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae22371Open in IMG/M
3300018000|Ga0184604_10053228All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111125Open in IMG/M
3300018054|Ga0184621_10089239All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111080Open in IMG/M
3300018054|Ga0184621_10209011All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11699Open in IMG/M
3300018061|Ga0184619_10007876All Organisms → cellular organisms → Bacteria4094Open in IMG/M
3300018061|Ga0184619_10480230All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11551Open in IMG/M
3300018075|Ga0184632_10067219All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111558Open in IMG/M
3300018076|Ga0184609_10493792All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11558Open in IMG/M
3300018433|Ga0066667_10044824All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_112621Open in IMG/M
3300018433|Ga0066667_10640188All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes889Open in IMG/M
3300018482|Ga0066669_10149737All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111715Open in IMG/M
3300018482|Ga0066669_11951099All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11548Open in IMG/M
3300019865|Ga0193748_1003129All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium1257Open in IMG/M
3300019877|Ga0193722_1075039All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium835Open in IMG/M
3300019883|Ga0193725_1012525All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium2389Open in IMG/M
3300019885|Ga0193747_1002492All Organisms → cellular organisms → Bacteria4667Open in IMG/M
3300020001|Ga0193731_1097475All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11759Open in IMG/M
3300020002|Ga0193730_1151261All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11613Open in IMG/M
3300022694|Ga0222623_10199197All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11777Open in IMG/M
3300022756|Ga0222622_10483294All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium883Open in IMG/M
3300022756|Ga0222622_10590373All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11801Open in IMG/M
3300025885|Ga0207653_10031290All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111718Open in IMG/M
3300026301|Ga0209238_1027779All Organisms → cellular organisms → Bacteria2116Open in IMG/M
3300026528|Ga0209378_1036046All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium2546Open in IMG/M
3300026542|Ga0209805_1321469All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium587Open in IMG/M
3300027655|Ga0209388_1236068All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11500Open in IMG/M
3300027875|Ga0209283_10785141All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11587Open in IMG/M
3300027903|Ga0209488_10878697All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11630Open in IMG/M
3300028536|Ga0137415_10433157All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_111120Open in IMG/M
3300028705|Ga0307276_10190030All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium537Open in IMG/M
3300028824|Ga0307310_10328815All Organisms → cellular organisms → Bacteria748Open in IMG/M
3300028828|Ga0307312_10761997All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11641Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil42.16%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil13.73%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil10.78%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil9.80%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment6.86%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.92%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.92%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment2.94%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost1.96%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil0.98%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.98%
Glacier Forefield SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil0.98%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.98%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011431Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT157_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300013758Permafrost microbial communities from Nunavut, Canada - A24_65cm_12MEnvironmentalOpen in IMG/M
3300013772Permafrost microbial communities from Nunavut, Canada - A10_80_0.25MEnvironmentalOpen in IMG/M
3300015162Arctic soil microbial communities from a glacier forefield, Storglaci?ren, Tarfala, Sweden (Sample st-4c, rock/ice/stream interface)EnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019865Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1s1EnvironmentalOpen in IMG/M
3300019877Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m1EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2EnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1m2EnvironmentalOpen in IMG/M
3300020001Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a2EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a1EnvironmentalOpen in IMG/M
3300020009Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2s1EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028705Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_115EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25383J37093_1011061913300002560Grasslands SoilDRAGQAATLSDWNTFSIDGLFDNLERSVEHDGAVQVAFDPRWHFPAYARSVALPGPDAWTIIEARGLRPLGGPPD*
JGI25613J43889_1008994213300002907Grasslands SoilQPLRAWDKAGNSASLTDWNTFNIDGLFDSLERSADIKGQVQIAFDPHWHFPKYVHSVVLPGPDAWSTIEVRGFRPI*
Ga0066677_1063248623300005171SoilMEVRSNQPLRAWDRTGKSVPITDWNTFSIDGLYDDLERAADINGEAQIAFDPRWHFPTYVHTVALPGPDMWSVIEVRA
Ga0066671_1010222343300005184SoilVRNGMALRAWERTGQPVALTDWNTFSIDGLYEILERELDTNGEIQAAFDARWHFPKYVRTRMLPGPDAWSITEVRALRPI*
Ga0066675_1065665623300005187SoilAWERSGQPAAIADWNTFSIDGLYDNLDRAADINGEAQIAFDPRWHFPKYVRTVAIPGPDAWSIIELRALRPI*
Ga0066675_1112621923300005187SoilMEVRNGMALRAWERTGQPVALTDWNTFSIDGLYEILERELDTNGEIQTAFDARWHFPKYVRTRMLPGPDAWSITEVRALRPI*
Ga0066686_1009330513300005446SoilRGWLLMEVRSNQPLRAWDRTGKYVPITDWNTFSIDGLYDDLERAADINGEAQIAFDPRWHFPTYVHTVALPGPDMWSVIEVRALRPIGLAARPPGRARA*
Ga0066681_1094544313300005451SoilMEVRDAMALRAWERTGQPAAVTDWNTFSIDGLYEILERELDTNGEIQTAFDARWHFPKYVRTRMLPGPDAWSITEVRALRPI*
Ga0070706_10114209423300005467Corn, Switchgrass And Miscanthus RhizosphereRGWLLMDVRANQPLRAWDRAGRSFALTDWNTFSIDGLYDNLERSADINGQVQIAFDPHWHFPKYVYSVLLPGPDAWSTIEVRGFRPI*
Ga0070699_10039340523300005518Corn, Switchgrass And Miscanthus RhizosphereNQPLRAWDRAGRSFALTDWNTFSIDGLYDNLERSADINGQVQIAFDPHWHFPKYVYSVLLPGPDAWSTIEVRGFRPI*
Ga0070699_10049479423300005518Corn, Switchgrass And Miscanthus RhizosphereNQPLRAWDRAGRSFALTDWNTFSIDGLYDNLERSADINGRVQIAFDPRWHFPKFVYSVVHPGPDAWSTVEVRGFRPI*
Ga0066707_1022067313300005556SoilPPDYRFLVRTGCFCPGMRGWLLMEVRSGQPLRAWDRAGRSAPLSDWDTFSIDGLFDMLERSADHDAAVQVTFDPRWHFPTYIHTRALPGPDMWAVIEARGLRPI*
Ga0066707_1089171813300005556SoilLRAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPRREFLRQ*
Ga0066699_1068011433300005561SoilDSLRTALRAEHALWRANSSSDYRFLLRTACFCPGGRGWLLIEVRTGQPLRAWDRAGTSAPLSDWDTFSIDGLFDMLERSAERDAVVQITFDPRWHFPTYVYTRALPGPDMWAIIEARGFRPF*
Ga0066694_1009329933300005574SoilRGWLLIEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAAVQVSFDPRWHFPTYVYTRALPGPDMWAVIEARGLRPI*
Ga0066665_1020537913300006796SoilGQPLRAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPSREFLRQ*
Ga0066665_1036678333300006796SoilAWDRTGRSAPLSDWDTFSIDGLFDFLERSADRDAEVQVSFDPRWHFPTYVYTRVLPGPDMWSVIEARGLRPI*
Ga0066659_1134592723300006797SoilMDVRKSQPLRAWDPTGKVVAISDWNTFSIDGLFDNIERSIDRDNVVQVAFDPRWHFPAFVHTVALPGPDMWATIDARALRRSTP*
Ga0099791_1005542613300007255Vadose Zone SoilAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAVVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPI*
Ga0099793_1010219523300007258Vadose Zone SoilLIEVRSGRPLRAWDRAGKSAPIIDWDTFSIDGLYDNLERMADIQGEVQIAFDSRWHFPRYVSAVASPGPDMWSVTEVRGFRPI*
Ga0099795_1021061833300007788Vadose Zone SoilMDVRADQPLRAWDKAGNSASLTDWNTFNIDGLFDSLERSADIKGQVQIAFDPHWHFPKYVHSVVLPGPDAWSTIEVRGFRPI*
Ga0066710_10109813313300009012Grasslands SoilNQPLRAWDRTGKSVPITDWNTFSIDGLYDDLERAADINGEAQIAFDPRWHFPTYVHTVALPGPDMWSIIEVRALRPI
Ga0066710_10124937923300009012Grasslands SoilPPDYRFLVRTGCFCPGMRGWLLMEVRSGQPLRAWDRAGRSAPLSDWDTFSIDGLFDMLERSADHDAAVQVTFDPRWHFPTYIHTRALPGPDMWAVIEARGLRPI
Ga0066710_10185945823300009012Grasslands SoilMEVHSNQPLRAWDRTGKSVPITDWNTLSIDGLYDDLERAADINGAAQIAFDPRWHFPTYVHTVALPGPEMWSVIEVRALRPIGLAARPPGRARA
Ga0066710_10285595423300009012Grasslands SoilMEVRSGQPLRAWDRTGRAAALTDWNTLSIDGLFDNLERSVTMDGVVKVAFDRRWHFPTYVYTVALPGPDTWSITEALGFRPI
Ga0099828_1044864123300009089Vadose Zone SoilRAWDRAGKSADLTDWSTFSIDGLYDNLERSADINGQVQIAFDPLWHVPKYVYAVALPGPDMWSTIEVRGLRPI*
Ga0134082_1034937813300010303Grasslands SoilRPATISDWNMFGIDELFDNVERSIDRVSVVEIAFDPRWHFPAYVRSVALPGPDAWSIIDARALRRQTP*
Ga0134111_1000822513300010329Grasslands SoilDYRFLVRTGCFCPGARGWLLIEVRRGQPLRAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPRREFLRQ*
Ga0134080_1000720213300010333Grasslands SoilIEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAEVQVSFDPRWHFPTYVYTRALPGPDMWAVIEARGLRPI*
Ga0134062_1022959423300010337Grasslands SoilGQPLRAWDRAGRSAPLSDWDTFSIDGLFDMLERSADHDAAVQVTFDPRWHFPTYIHTRALPGPDMWAVIEARGLRPI*
Ga0134125_1179447813300010371Terrestrial SoilCFCPGVRGWLLMEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDMVERSADRDAAVQVGFDPRWHFPTYVYTRALPGPDMWAIIEARGFRPF*
Ga0137392_1067382833300011269Vadose Zone SoilGWLLMDVRANQPLRAWDRAGRSFALTDWNTFSIDGLYDNLERSADINGQVQIAFDPHWHVPKYVYAVALPGPDMWSTIEVRGLRPI*
Ga0137438_105944223300011431SoilMDVRSGQPLRAWDKTGKSAALTDWNTLSIDGLYDNLERTADINGEVQVAFDPRWHFPRYVRTTALPGPDMWSVVDVRGFHDLSAPR*
Ga0137389_1098777623300012096Vadose Zone SoilEVRRGQPLRAWDRAGRSAPLRDWDTFSIDGLFDMLERSADHAAAVQVSFDPRWHFPTYIHTRALPGPDMWTVIEARGLRPM*
Ga0137399_1135064523300012203Vadose Zone SoilLLMEVRRGQPLRAWDRAGRSAALSDWDTFSIDGLFAMLERSADHDAAVQVRFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPI*
Ga0137399_1136266013300012203Vadose Zone SoilPGVRGWLLMEVRRGQPLRAWDRAGRSAPLRDWDTFSIDGLFDMLERWADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPI*
Ga0137362_1061783413300012205Vadose Zone SoilMEVRAGQPLRAWDKAGNSAALTWNTFSIDGLYDNLERSADINGQVQIAFDPRWHFPKYVYSVVLPGPDAWSTIEVRGFRPI*
Ga0137380_1009886443300012206Vadose Zone SoilCFCPGVRGWLLIEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAAVQVSFDPHWHFPTYVYTRGLPGPDMWTVIEARGLRPI*
Ga0137380_1029958633300012206Vadose Zone SoilLRAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPI*
Ga0137381_1009837743300012207Vadose Zone SoilGWLLIEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAAVQVSFDPHWHFPTYVYTRGLPGPDMWTVIEARGLRPI*
Ga0137381_1011389053300012207Vadose Zone SoilRTGCFCPGVRGWLLIEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAEVQVSFDPRWHFPTYVYTRALPGPDMWAVIEARGLRPI*
Ga0137381_1099917913300012207Vadose Zone SoilVRRSQPLRAWDRAGQVAAPSDWNTFGIDGLFDNLERSVDRDGMVQVAFDPRWHFPAYIRTVALPGPDAWTIIEARGLRPM*
Ga0137376_1008630613300012208Vadose Zone SoilRGWLLIEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAEVQVSFDPRSHFPTYVYTRALPGPDMWAVIEARGLRPI*
Ga0137376_1067705813300012208Vadose Zone SoilSSPPDYRFLVRTGCFCPGVRGWLLIEVRRGQPLRAWDRAGRAAPLSDWDTFSIDGLFDFLERSADRDAAVQVSFDPRWHFPTYVYTRALPGPDMWAVIEARGLRPI*
Ga0137379_1071997413300012209Vadose Zone SoilRARQPLRAWDVSGKSAALIDWNTFSIDGLYDDLERAADINGNAQIAFDPRWHFPTYVRTVAVPGPDAWSIIEVRALRPI*
Ga0137379_1162223313300012209Vadose Zone SoilSSPPDYRFLVRTGCLCPGVRGWLLMEVRRGQPLRAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPI*
Ga0137379_1169185313300012209Vadose Zone SoilRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAAVQVSFDPHWHFPTYVYTRGLPGPDMWTVIEARGLRPI*
Ga0137377_1082565713300012211Vadose Zone SoilMEVRSGQPLRAWDRTGKAAALTDWNTLSIDGLFDNLERSVTMDGVVKVAFDRRWHFPTYVYTVALPGPDTWSITEALGFRPI*
Ga0137370_1014714913300012285Vadose Zone SoilTGCFCPGVRGWLLIEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAEVQVSFDPRWHFPTYVYTRVLPGPDMWSVIEARGLRPI*
Ga0137386_1068497013300012351Vadose Zone SoilGKSAALIDWNTFSIDGLYDDLERAAEINGGAQIAFDPRWHFPTYVHTVALPGPDMWSVIEVRALRPIGLAARPSGRARA*
Ga0137371_1002839863300012356Vadose Zone SoilQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAAVQVSFDPRWHFPTYVYTRALPGPDMWVVIEARGLRPI*
Ga0137384_1086820323300012357Vadose Zone SoilMEVRAGQPLRAWDKAGNSAALTDWNMFSIDGLYDNLERSADINGQVQIAFDPRWHFPKYVHSVVLPGPDAWSTIEVRGFRPT*
Ga0137375_1034280533300012360Vadose Zone SoilMEVRSDRPLRAWDRTGNAVSLTDWNTLSIDGLFDNLERSADRDGLVQVAFDPRWHFPAYVRTVALPGPDAWAIIEARALRPI*
Ga0137360_1079137113300012361Vadose Zone SoilMDVRAGQPLRAWDRAGKSAGLTDWNTVSVDGLYDNLERSADTNGQVQIAFDPRWHFPKHVYSVVLPGPDAWSTIEVRGFRPI*
Ga0137360_1108094713300012361Vadose Zone SoilDRAGKAATLSDWNTFSIDGLFDNLERSVEHDGAVQVAFDPRWHFPAYARSVALPGPDAWTIIEARGLRPLGGPPD*
Ga0137361_1020587843300012362Vadose Zone SoilLLIEVRRGQPLRAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIDARGVWPI*
Ga0137361_1084579913300012362Vadose Zone SoilTGCFCPGVRGWLLIEVRKGQPLRAWDRAGRSAALSDRDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPM*
Ga0137373_1002676713300012532Vadose Zone SoilRLACFGPGVRGWLLIDVRSVRPLRAWDKTGNAVALTDWHTLSIDGLFDNLERSADRDGLVQVAFDPRWHFPAYVRTVALPGPDAWAIIEARGLRPI*
Ga0137398_1031036333300012683Vadose Zone SoilPGTRGWLLMDVRADQPLRAWDKAGNSASLTDWNTFNIDGLYNSLERSADIKGQVQIAFDPHWHFPKYVHSVVLPGPDAWSTIEVRGFRPTRL*
Ga0137398_1044762713300012683Vadose Zone SoilPGTRGWLLMDVRADQPLRAWDKAGNSASLTDWNTFNIDGLYDSLERSADIKGQVRIAFDPRWHFPKYVYTVALPGPDMWSTIEVRGFRPI*
Ga0137397_1066888623300012685Vadose Zone SoilMDVRGGKLVRAWDRTGESAALTDWNTFSIDGLYDNLERTADINGQVQIAFDPRWHFPKYVATTVFPGPDAWSIVEVRGFRPI*
Ga0137396_1009407833300012918Vadose Zone SoilLIEVRSGQSLRAWDRAGKPAPITDWDTFSIDGLYDNLQRTADIPGEVKIAFDPRWHFPKYVSAAASPGPDMWSVTEVRGFRPI*
Ga0137396_1051289813300012918Vadose Zone SoilFCPGVRGWLLMEVRRGQPLRAWDRAGRSAALSDWDTFSIDGLFDMLERSGDHDAAVQVSFDPRWHFPTYIYIRALPGPDMWAVIEARGLRPI*
Ga0137396_1064492313300012918Vadose Zone SoilRAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIYTRGLPGPDMWAVIEARGLRPI*
Ga0137419_1115555313300012925Vadose Zone SoilFCPGVRGWLLMEVRRGQPLRAWDRAGRSAPLRDWDTFSIDGLFDMLERWADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPI*
Ga0137419_1182874913300012925Vadose Zone SoilAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPI*
Ga0137410_1001068713300012944Vadose Zone SoilMRAWDRAGKAATLSDWNTFSIDGLFDNLERSVEHDGAVQVAFDPRWHFPAYARSVALPGPDAWTIIEARGLRPLGGSPD*
Ga0120147_101198043300013758PermafrostLIEVRSSRPLRAWDRAGKSAALTDWNTFSIDGLYDNLERTADNVGQVQIAFDPRWHFPKYVYTVVLPGPDAWSTIEVRGLRPI*
Ga0120158_1051219213300013772PermafrostTRGWLLIEVRSSQPLRAWDRAGRSAALTDWNTFSIDGLYDNLERAADNVGEVQIAFDPRWHFPKYVYTVVLPGPDAWSTIEVRGLRPI*
Ga0167653_1000143163300015162Glacier Forefield SoilMDVRSGQPLRAWDRAGKAAALTDWNTFSIDGLFDSLDKTADINGEVRIAFDPRWHFPTYVSISALPGPDMWSLIEARALRPN*
Ga0184604_1005322823300018000Groundwater SedimentRGWLLVEVRSGQPLRAWDRTGKPAALTDWNTFSIDGLYDNLERAAEIDGRVKIAFDPRWHFPTYVHTVALPGPDAWSIIEARALRPL
Ga0184621_1008923923300018054Groundwater SedimentRTGCFCPGVRGWLLMEVRRGQPLRAWDPAGRSAPLSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIRTRALPGPDMWAVIEARGLRPI
Ga0184621_1020901123300018054Groundwater SedimentWLLIEVHTGQPLRAWDRAGRSAPLSDWDTFSIDGLFDMVERSTDRDAPVQVSFDPRWHFPTYVYTRALPGPDMWAIIEARGFRPF
Ga0184619_1000787613300018061Groundwater SedimentEVRSGQPLRAWDRAGKSAALTDWNTFSIDGLYDNLERSLDRDARVQIAFDPRWHFPRYVSTVVLPGPDAWSVVEVRAFRPI
Ga0184619_1048023023300018061Groundwater SedimentLLMEVRSGQPLRAWDRAGKSAALTDWNTFSIDGLYDNLERTADINGQVQIAFDPRWHFPRYVYTVALPGPDMWSIIEVQGFRPN
Ga0184632_1006721933300018075Groundwater SedimentSGWLMMEVRNGRLLRASDSGGKSAPLTDWNTFSIDGLFDHLERTAEIDGVVQVAFDPHWHFPSYVSTVRLPGPDTWATIEARGLRPI
Ga0184609_1049379223300018076Groundwater SedimentGGRGWLLMEVRRGQPLRAWDRAGRSAGLSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIRTRALPGPDMWAVIEARGLRPI
Ga0066667_1004482463300018433Grasslands SoilVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLDRSADRDAAVQVSFDPRWHFPTYVYTRALPGPDMWAVIEARGLRPI
Ga0066667_1064018823300018433Grasslands SoilRTALRRERALWRANSLSDYRFLVRTACFCPGGRGWLLIEVRTGQPLRAWDRVGRSAPLSDFDTFSIDGLFDMLERSADRDAAVQVSFDPRWHFPTYVYTRALTGPDMWAIIEARGFRPF
Ga0066669_1014973743300018482Grasslands SoilLIEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLERSADRDAEVQVSFDPRWHFPTYVYTRALPGPDMWAVIEARGLRPI
Ga0066669_1195109913300018482Grasslands SoilGVRGWLLIEVRRGQPLRAWDQAGRSAPLSDWDTFSIDGLFDILARSADHDAVVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPI
Ga0193748_100312933300019865SoilMEVRSGQPLRAWDRTGKSVGLTDWNTVSIDGLYDNLERTAGINGEALIAFDPRWHFPRYVRSVTLPGPDAWSITEARALRPI
Ga0193722_107503923300019877SoilPGTRGWLLMDVRDSKLVRAWDRTGKSVPLTDWNTLSIDGLYDNLERSAGINGQAQIAFDPRWHFPRFAYTVVAPGPDAWSTIEVRGFRPI
Ga0193725_101252513300019883SoilMEVRDSKLVRAWDRTGKSVPLTDWNTLSIDGLYDSLDRSTDINGQVQIAFDPRWRFPRFVHTVVAPGPDAWSTIEVRGFRPT
Ga0193747_100249233300019885SoilMDVRSGQPLRAWDRSGKSAALTDWNTLNIDGLYDNLERTADINGEVQIAFDPRWHFPKYVRTTVLPGPDMWSIIDVRGFRDLSATR
Ga0193731_109747523300020001SoilCFCPGTRGWLLMEVRSGHALRAWDKAGKSAALTDWNTFSIDGLYDNLGRSADIKGQVRIAFDPRWHFPKYVYTVALPGPDMWSVIEVRGFRPI
Ga0193730_115126123300020002SoilMEVRSGHALRAWDKAGKSAALTDWNTFSIDGLYDNLGRSADIKGQVRIAFDPRWHFPKYVYTVALPGPDMWSVIEVRGFRPI
Ga0193740_105708313300020009SoilPETRGWLLMEVRSGEPLRAWDRTGRQVALTDWNTLSIDGLYDNLERPDRDGGVQIDFDPRWHFPTFIGTSAARGPDTWSTTEARALRAIK
Ga0222623_1019919723300022694Groundwater SedimentGGRGWLLMEVRRGQPLRAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIRTRALPGPDMWAVIEARGLRPI
Ga0222622_1048329423300022756Groundwater SedimentRGWLLMDVRSGQPLRAWDRSGKSAALTDWNTLSIDGLYDNLERTADINGEVQIAFDPRWHFPKYVRTTGLPGPDMWSVIDVRGFHDLSAP
Ga0222622_1059037313300022756Groundwater SedimentWLLIEVRSGQPLRAWDKAGKSAALTDWNTFSIDGLYDNLEQSVDRNGQVQIAFDPRWHFPRYVGTVTLPGPDAWSNTEVRGFRPLP
Ga0207653_1003129013300025885Corn, Switchgrass And Miscanthus RhizosphereNSLSNYRFLLRTGCFCPGRRGWLLIEVHTGQPLRAWDRAGRSAPLSDWDTFTVDGLFDMVERSADRDAAVQVGFDPRWHFPTYVYTRALPGPDMWAIIEARGFRPF
Ga0209238_102777913300026301Grasslands SoilLLIEVRRGQPLRAWDRAGRSAPLSDWDTFSIDGLFDFLDRSADRDAAVQVSFDPRWHFPTYVYTRALPGPDMWAVIEARGLRPI
Ga0209378_103604643300026528SoilMRGWLLMEVRRGQPLRAWDRAGRSAALSDWDTFSIDGLFDMLERSADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPRREFLRQ
Ga0209805_132146923300026542SoilMEVRRGQPLRAWDVTGKPAALTDWNTFSIDGLYDDLERAADINGAAQIAFDSRWHFPTYVHTVALPGPDMWSIIEARAVRPI
Ga0209388_123606813300027655Vadose Zone SoilMRAWDRAGKAATLSDWNTFSIDGLFDNLERSVEHDGAVQVAFDPRWHFPAYARSVALPGPDAWTIIEARGLRPLGGPPD
Ga0209283_1078514113300027875Vadose Zone SoilRAWDRAGKSADLTDWSTFSIDGLYDNLERSADINGQVQIAFDPHWHVPKYVYAVALPGPDMWSTIEVRGLRPI
Ga0209488_1087869723300027903Vadose Zone SoilLVRTGCFCPGVRGWLLIEVRTGQPLRAWDRAGRSAPLSAWDTFSIDGLFDFLERSADRDAEVQVSFDPRWHFPTYVYTRALPGPDTWSVIEARGLRPI
Ga0137415_1043315723300028536Vadose Zone SoilPPDYRFLVRTGCFCPGARGWLLMEVRRGQPLRAWDRAGRSAPLRDWDTFSIDGLFDMLERWADHDAAVQVSFDPRWHFPTYIYTRALPGPDMWAVIEARGLRPI
Ga0307276_1019003023300028705SoilLIEVRSGQPLRAWDRTGKSAALTDWNTFSIDGLFDNLERTAGINGQVQIAFDPRWHLPRYVTTTVLPGPDTWSITEVRGLRPIR
Ga0307310_1032881513300028824SoilTGKSAALSDWNTFSIDGLYDQLERADIKGQVQIAFDPRWHFPKYVYTVVLPGPDAWSTVEAQGFRPMR
Ga0307312_1076199723300028828SoilRGWLLMDVRSGQPLRAWDRSGKSAALTDWNTLNIDGLYDNLERTADINGEVQIAFDPRWHFPKYVRTTVLPGPDMWSIIDVRGFRDLSATR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.