scispace - formally typeset
Search or ask a question
Author

Maite Huarte

Bio: Maite Huarte is an academic researcher from University of Navarra. The author has contributed to research in topics: Regulation of gene expression & RNA. The author has an hindex of 32, co-authored 58 publications receiving 16841 citations. Previous affiliations of Maite Huarte include Beth Israel Deaconess Medical Center & Harvard University.


Papers
More filters
Journal ArticleDOI
12 Mar 2009-Nature
TL;DR: It is demonstrated that specific lincRNAs are transcriptionally regulated by key transcription factors in these processes such as p53, NFκB, Sox2, Oct4 (also known as Pou5f1) and Nanog, defining a unique collection of functional linc RNAs that are highly conserved and implicated in diverse biological processes.
Abstract: There is growing recognition that mammalian cells produce many thousands of large intergenic transcripts. However, the functional significance of these transcripts has been particularly controversial. Although there are some well-characterized examples, most (>95%) show little evidence of evolutionary conservation and have been suggested to represent transcriptional noise. Here we report a new approach to identifying large non-coding RNAs using chromatin-state maps to discover discrete transcriptional units intervening known protein-coding loci. Our approach identified ~1,600 large multi-exonic RNAs across four mouse cell types. In sharp contrast to previous collections, these large intervening non-coding RNAs (lincRNAs) show strong purifying selection in their genomic loci, exonic sequences and promoter regions, with greater than 95% showing clear evolutionary conservation. We also developed a functional genomics approach that assigns putative functions to each lincRNA, demonstrating a diverse range of roles for lincRNAs in processes from embryonic stem cell pluripotency to cell proliferation. We obtained independent functional validation for the predictions for over 100 lincRNAs, using cell-based assays. In particular, we demonstrate that specific lincRNAs are transcriptionally regulated by key transcription factors in these processes such as p53, NFκB, Sox2, Oct4 (also known as Pou5f1) and Nanog. Together, these results define a unique collection of functional lincRNAs that are highly conserved and implicated in diverse biological processes.

3,875 citations

Journal ArticleDOI
TL;DR: A model in which some lincRNAs guide chromatin-modifying complexes to specific genomic loci to regulate gene expression is proposed, and it is shown that siRNA-mediated depletion of certain linc RNAs associated with PRC2 leads to changes in gene expression.
Abstract: We recently showed that the mammalian genome encodes >1,000 large intergenic noncoding (linc)RNAs that are clearly conserved across mammals and, thus, functional. Gene expression patterns have implicated these lincRNAs in diverse biological processes, including cell-cycle regulation, immune surveillance, and embryonic stem cell pluripotency. However, the mechanism by which these lincRNAs function is unknown. Here, we expand the catalog of human lincRNAs to ≈3,300 by analyzing chromatin-state maps of various human cell types. Inspired by the observation that the well-characterized lincRNA HOTAIR binds the polycomb repressive complex (PRC)2, we tested whether many lincRNAs are physically associated with PRC2. Remarkably, we observe that ≈20% of lincRNAs expressed in various cell types are bound by PRC2, and that additional lincRNAs are bound by other chromatin-modifying complexes. Also, we show that siRNA-mediated depletion of certain lincRNAs associated with PRC2 leads to changes in gene expression, and that the up-regulated genes are enriched for those normally silenced by PRC2. We propose a model in which some lincRNAs guide chromatin-modifying complexes to specific genomic loci to regulate gene expression.

2,738 citations

Journal ArticleDOI
TL;DR: The strategies that led to the identification of cancer-related lncRNAs and the methodologies and challenges involving the study of these molecules are discussed, as well as the imminent applications of these findings to the clinic.
Abstract: It is increasingly evident that many of the genomic mutations in cancer reside inside regions that do not encode proteins. However, these regions are often transcribed into long noncoding RNAs (lncRNAs). The recent application of next-generation sequencing to a growing number of cancer transcriptomes has indeed revealed thousands of lncRNAs whose aberrant expression is associated with different cancer types. Among the few that have been functionally characterized, several have been linked to malignant transformation. Notably, these lncRNAs have key roles in gene regulation and thus affect various aspects of cellular homeostasis, including proliferation, survival, migration or genomic stability. This review aims to summarize current knowledge of lncRNAs from the cancer perspective. It discusses the strategies that led to the identification of cancer-related lncRNAs and the methodologies and challenges involving the study of these molecules, as well as the imminent applications of these findings to the clinic.

2,067 citations

Journal ArticleDOI
06 Aug 2010-Cell
TL;DR: A model whereby transcription factors activate lincRNAs that serve as key repressors by physically associating with repressive complexes and modulate their localization to sets of previously active genes is proposed.

1,768 citations

Journal ArticleDOI
TL;DR: A review of the mechanisms of lncRNA biogenesis, localization and functions in transcriptional, post-transcriptional and other modes of gene regulation, and their potential therapeutic applications is presented in this article.
Abstract: Evidence accumulated over the past decade shows that long non-coding RNAs (lncRNAs) are widely expressed and have key roles in gene regulation. Recent studies have begun to unravel how the biogenesis of lncRNAs is distinct from that of mRNAs and is linked with their specific subcellular localizations and functions. Depending on their localization and their specific interactions with DNA, RNA and proteins, lncRNAs can modulate chromatin function, regulate the assembly and function of membraneless nuclear bodies, alter the stability and translation of cytoplasmic mRNAs and interfere with signalling pathways. Many of these functions ultimately affect gene expression in diverse biological and physiopathological contexts, such as in neuronal disorders, immune responses and cancer. Tissue-specific and condition-specific expression patterns suggest that lncRNAs are potential biomarkers and provide a rationale to target them clinically. In this Review, we discuss the mechanisms of lncRNA biogenesis, localization and functions in transcriptional, post-transcriptional and other modes of gene regulation, and their potential therapeutic applications.

1,630 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: The results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.
Abstract: High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed >430 million paired 75-bp RNA-Seq reads from a mouse myoblast cell line over a differentiation time series. We detected 13,692 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Over the time series, 330 genes showed complete switches in the dominant transcription start site (TSS) or splice isoform, and we observed more subtle shifts in 1,304 other genes. These results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.

13,337 citations

Journal ArticleDOI
TL;DR: In this article, the authors present an approach for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.
Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.

10,798 citations

Journal ArticleDOI
23 Feb 2007-Cell
TL;DR: The surface of nucleosomes is studded with a multiplicity of modifications that can dictate the higher-order chromatin structure in which DNA is packaged and can orchestrate the ordered recruitment of enzyme complexes to manipulate DNA.

10,046 citations

Journal ArticleDOI
19 Mar 2010-Cell
TL;DR: The role of PRRs, their signaling pathways, and how they control inflammatory responses are discussed.

6,987 citations

Journal ArticleDOI
TL;DR: The Integrative Genomics Viewer (IGV) is a high-performance viewer that efficiently handles large heterogeneous data sets, while providing a smooth and intuitive user experience at all levels of genome resolution.
Abstract: Data visualization is an essential component of genomic data analysis. However, the size and diversity of the data sets produced by today’s sequencing and array-based profiling methods present major challenges to visualization tools. The Integrative Genomics Viewer (IGV) is a high-performance viewer that efficiently handles large heterogeneous data sets, while providing a smooth and intuitive user experience at all levels of genome resolution. A key characteristic of IGV is its focus on the integrative nature of genomic studies, with support for both array-based and next-generation sequencing data, and the integration of clinical and phenotypic data. Although IGV is often used to view genomic data from public sources, its primary emphasis is to support researchers who wish to visualize and explore their own data sets or those from colleagues. To that end, IGV supports flexible loading of local and remote data sets, and is optimized to provide high-performance data visualization and exploration on standard desktop systems. IGV is freely available for download from http://www.broadinstitute.org/igv, under a GNU LGPL open-source license.

6,930 citations