G-quadruplexes (G4s) play crucial roles in key biological functions, including transcription, replication, telomere maintenance, and genomic instability, and have been an important component of organismal evolution.
Quadrupia is a comprehensive database of G4 sequences for a large collection of reference genomes from GenBank and RefSeq, enabling their study across the tree of life. Each genome has been annotated using two distinct methods (regular expressions and G4hunter) for the detection of G4 sequences. In addition, the database contains a curated list of G4 sequence clusters, created using the linclust algorithm of MMseqs2.
Through Quadrupia, users can:
Chantzi, N.+, Nayak, A.+, Baltoumas, F.A.+, Aplakidou, E., Liew, S.W., Galuh, J.E., Montgomery, A., Moeckel1, C., Mouratidis, I., Sazed, S.A., Guiblet, W., Karmiris-ObrataĆski, P., Wang, G., Zaravinos, A., Vasquez, K.M, Kwok, C.K., Pavlopoulos, G.A., Georgakopoulos-Soares, I. (2024)
Quadrupia: Derivation of G-quadruplexes for organismal genomes across the tree of life
Submitte
Genomes | Total G-quadruplexes | Predicted by regular expressions | Predicted with G4Hunter |
---|---|---|---|
89,465 | 159,334,907 | 33,392,376 | 125,942,531 |