scispace - formally typeset
Open AccessJournal ArticleDOI

pqsfinder: an exhaustive and imperfection-tolerant search tool for potential quadruplex-forming sequences in R.

Reads0
Chats0
TLDR
A newly developed Bioconductor package for identifying potential quadruplex‐forming sequences (PQS), which allows for sequence searches that accommodate possible divergences from the optimal G4 base composition and demonstrates that the algorithm behind the searches has a 96% accuracy.
Abstract
Motivation: G-quadruplexes (G4s) are one of the non-B DNA structures easily observed in vitro and assumed to form in vivo. The latest experiments with G4-specific antibodies and G4-unwinding helicase mutants confirm this conjecture. These four-stranded structures have also been shown to influence a range of molecular processes in cells. As G4s are intensively studied, it is often desirable to screen DNA sequences and pinpoint the precise locations where they might form. Results: We describe and have tested a newly-developed Bioconductor package for identifying potential quadruplex-forming sequences (PQS). The package is easy-to-use, flexible and customizable. It allows for sequence searches that accommodate possible divergences from the optimal G4 base composition. A novel aspect of our research was the creation and training (parametrization) of an advanced scoring model which resulted in increased precision compared to similar tools. We demonstrate that the algorithm behind the searches has a 96% accuracy on 392 currently known and experimentally observed G4 structures. We also carried out searches against the recent G4-seq data to verify how well we can identify the structures detected by that technology. The correlation with pqsfinder predictionswas 0.622, higher than the correlation 0.491 obtained with the second best G4Hunter. Availability:http://bioconductor.org/packages/pqsfinder/ This paper is based on pqsfinder-1.4.1.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Whole genome experimental maps of DNA G-quadruplexes in multiple species.

TL;DR: An improved version of a G-quadruplex sequencing method is employed to generate whole genome G4 maps for 12 species that include widely studied model organisms and also pathogens of clinical relevance, and reveals that the enrichment of OQs in gene promoters is particular to mammals such as mouse and human, among the species studied.
Journal ArticleDOI

A guide to computational methods for G-quadruplex prediction

TL;DR: The present review aims at providing an updated overview of the current open-source G-quadruplex prediction algorithms and straightforward examples of their implementation, and proposing other estimates which consider non-canonical sequences and/or structure propensity and stability.
Journal ArticleDOI

G4Hunter web application: a web server for G-quadruplex prediction.

TL;DR: A web version of the G4Hunter application is developed that allows retrieval of gene/nucleotide sequence entries from NCBI databases and provides complete characterization of localization and quadruplex propensity of quadruplex-forming sequences.
Journal ArticleDOI

Detecting RNA G-Quadruplexes (rG4s) in the Transcriptome.

TL;DR: Methodologies including predictive algorithms and structure-based sequencing have enabled the detection and mapping of rG4 structures on a transcriptome-wide scale at high sensitivity and resolution and the associated findings in relation to rG 4-related biological mechanisms are discussed.
Journal ArticleDOI

Non-B DNA: a major contributor to small- and large-scale variation in nucleotide substitution frequencies across the genome.

TL;DR: In this article, the authors conducted a comprehensive analysis of nucleotide substitution frequencies at non-B DNA loci within non-coding, non-repetitive genome regions, their ±2 kb flanking regions, and 1-Megabase windows, using human-orangutan divergence and human single-nucleotide polymorphisms.
References
More filters
Journal ArticleDOI

Mfold web server for nucleic acid folding and hybridization prediction

TL;DR: The objective of this web server is to provide easy access to RNA and DNA folding and hybridization software to the scientific community at large by making use of universally available web GUIs (Graphical User Interfaces).
Journal ArticleDOI

Software for computing and annotating genomic ranges.

TL;DR: This work describes Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions, including those for sequence analysis, differential expression analysis and visualization.
Journal ArticleDOI

A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics

TL;DR: Six of the studies are actually in remarkable agreement with one another and explanations are provided in cases where discrepancies remain, and a single set of parameters, derived from 108 oligonucleotide duplexes, adequately describes polymer and oligomer thermodynamics.
Journal ArticleDOI

An overview of the Amber biomolecular simulation package

TL;DR: The most recent developments, since version 9 was released in April 2006, of the Amber and AmberTools MD software packages are outlined, referred to here as simply the Amber package.
Related Papers (5)