Discovering Motifs in Ranked Lists of DNA Sequences

doi:10.1371/JOURNAL.PCBI.0030039

Open AccessJournal ArticleDOI

Discovering Motifs in Ranked Lists of DNA Sequences

Eran Eden, +5 more

- 23 Mar 2007 -

PLOS Computational Biology

- Vol. 3, Iss: 3

Chats0

TLDR

The implementation of this framework in a software application, termed DRIM (discovery of rank imbalanced motifs), which identifies sequence motifs in lists of ranked DNA sequences, is demonstrated, demonstrating that the statistical framework embodied in the DRIM software tool is highly effective for identifying regulatory sequence elements in a variety of applications.

Abstract:

Computational methods for discovery of sequence elements that are enriched in a target set compared with a background set are fundamental in molecular biology research. One example is the discovery of transcription factor binding motifs that are inferred from ChIP–chip (chromatin immuno-precipitation on a microarray) measurements. Several major challenges in sequence motif discovery still require consideration: (i) the need for a principled approach to partitioning the data into target and background sets; (ii) the lack of rigorous models and of an exact p-value for measuring motif enrichment; (iii) the need for an appropriate framework for accounting for motif multiplicity; (iv) the tendency, in many of the existing methods, to report presumably significant motifs even when applied to randomly generated data. In this paper we present a statistical framework for discovering enriched sequence elements in ranked lists that resolves these four issues. We demonstrate the implementation of this framework in a software application, termed DRIM (discovery of rank imbalanced motifs), which identifies sequence motifs in lists of ranked DNA sequences. We applied DRIM to ChIP–chip and CpG methylation data and obtained the following results. (i) Identification of 50 novel putative transcription factor (TF) binding sites in yeast ChIP–chip data. The biological function of some of them was further investigated to gain new insights on transcription regulation networks in yeast. For example, our discoveries enable the elucidation of the network of the TF ARO80. Another finding concerns a systematic TF binding enhancement to sequences containing CA repeats. (ii) Discovery of novel motifs in human cancer CpG methylation data. Remarkably, most of these motifs are similar to DNA sequence elements bound by the Polycomb complex that promotes histone methylation. Our findings thus support a model in which histone methylation and CpG methylation are mechanistically linked. Overall, we demonstrate that the statistical framework embodied in the DRIM software tool is highly effective for identifying regulatory sequence elements in a variety of applications ranging from expression and ChIP–chip to CpG methylation data. DRIM is publicly available at http://bioinfo.cs.technion.ac.il/drim.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists

Eran Eden, +7 more

- 03 Feb 2009 -

BMC Bioinformatics

TL;DR: GOrilla is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets, and its unique features and advantages over other threshold free enrichment tools include rigorous statistics, fast running time and an effective graphical representation.

...read moreread less

Journal ArticleDOI

Molecular and genetic properties of tumors associated with local immune cytolytic activity.

Michael S. Rooney, +5 more

- 15 Jan 2015 -

Cell

TL;DR: The genetic findings provide evidence for immunoediting in tumors and uncover mechanisms of tumor-intrinsic resistance to cytolytic activity, suggesting immune-mediated elimination.

...read moreread less

Journal ArticleDOI

Host microbiota constantly control maturation and function of microglia in the CNS

Daniel Erny, +19 more

- 01 Jul 2015 -

Nature Neuroscience

TL;DR: It is determined that short-chain fatty acids (SCFA), microbiota-derived bacterial fermentation products, regulated microglia homeostasis and mice deficient for the SCFA receptor FFAR2 mirroredmicroglia defects found under GF conditions, suggesting that host bacteria vitally regulate microglian maturation and function.

...read moreread less

Journal ArticleDOI

Microglia development follows a stepwise program to regulate brain homeostasis

Orit Matcovitch-Natan, +29 more

- 19 Aug 2016 -

Science

TL;DR: It is found that microglia from germ-free mice exhibited dysregulation of dozens of genes associated with the adult phase and immune response, including MAFB, which led to disruption of homeostasis in adulthood and increased expression of interferon and inflammation pathways.

...read moreread less

Journal ArticleDOI

Single-Cell Transcriptomic Analysis of Human Lung Provides Insights into the Pathobiology of Pulmonary Fibrosis.

Paul A. Reyfman, +48 more

- 15 Jun 2019 -

American Journal of Respiratory and Crit...

TL;DR: The results support the feasibility of discovery-based approaches using next-generation sequencing technologies to identify signaling pathways for targeting in the development of personalized therapies for patients with pulmonary fibrosis.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Fitting a mixture model by expectation maximization to discover motifs in biopolymers.

Timothy L. Bailey, +1 more

TL;DR: The algorithm described in this paper discovers one or more motifs in a collection of DNA or protein sequences by using the technique of expectation maximization to fit a two-component finite mixture model to the set of sequences.

...read moreread less

Journal ArticleDOI

Transcriptional Regulatory Networks in Saccharomyces cerevisiae

Tong Ihn Lee, +20 more

- 25 Oct 2002 -

Science

TL;DR: This work determines how most of the transcriptional regulators encoded in the eukaryote Saccharomyces cerevisiae associate with genes across the genome in living cells, and identifies network motifs, the simplest units of network architecture, and demonstrates that an automated process can use motifs to assemble a transcriptional regulatory network structure.

...read moreread less

Journal ArticleDOI

Control of developmental regulators by Polycomb in human embryonic stem cells.

Tong Ihn Lee, +28 more

- 21 Apr 2006 -

Cell

TL;DR: It is found that PRC2 target genes are preferentially activated during ES cell differentiation and that the ES cell regulators OCT4, SOX2, and NANOG cooccupy a significant subset of these genes.

...read moreread less