Home
/
Authors
/
Meagan Fastuca

Author

Meagan Fastuca

Bio: Meagan Fastuca is an academic researcher from Cold Spring Harbor Laboratory. The author has contributed to research in topics: Gene & Genome. The author has an hindex of 6, co-authored 6 publications receiving 5534 citations.

Topics: Gene, Genome, Human genome, ENCODE, Transcriptome ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Landscape of transcription in human cells

[...]

Sarah Djebali, Carrie A. Davis¹, Angelika Merkel, Alexander Dobin¹, Timo Lassmann, Ali Mortazavi², Ali Mortazavi³, Andrea Tanzer, Julien Lagarde, Wei Lin¹, Felix Schlesinger¹, Chenghai Xue¹, Georgi K. Marinov², Jainab Khatun⁴, Brian A. Williams², Chris Zaleski¹, Joel Rozowsky⁵, Marion S. Röder, Felix Kokocinski⁶, Rehab F. Abdelhamid, Tyler Alioto, Igor Antoshechkin², Michael T. Baer¹, Nadav Bar⁷, Philippe Batut¹, Kimberly Bell¹, Ian Bell⁸, Sudipto K. Chakrabortty¹, Xian Chen⁹, Jacqueline Chrast¹⁰, Joao Curado, Thomas Derrien, Jorg Drenkow¹, Erica Dumais⁸, Jacqueline Dumais⁸, Radha Duttagupta⁸, Emilie Falconnet¹¹, Meagan Fastuca¹, Kata Fejes-Toth¹, Pedro G. Ferreira, Sylvain Foissac⁸, Melissa J. Fullwood¹², Hui Gao⁸, David Gonzalez, Assaf Gordon¹, Harsha P. Gunawardena⁹, Cédric Howald¹⁰, Sonali Jha¹, Rory Johnson, Philipp Kapranov⁸, Brandon King², Colin Kingswood, Oscar Junhong Luo¹², Eddie Park³, Kimberly Persaud¹, Jonathan B. Preall¹, Paolo Ribeca, Brian A. Risk⁴, Daniel Robyr¹¹, Michael Sammeth, Lorian Schaffer², Lei-Hoon See¹, Atif Shahab¹², Jørgen Skancke⁷, Ana Maria Suzuki, Hazuki Takahashi, Hagen Tilgner¹³, Diane Trout², Nathalie Walters¹⁰, Huaien Wang¹, John A. Wrobel⁴, Yanbao Yu⁹, Xiaoan Ruan¹², Yoshihide Hayashizaki, Jennifer Harrow⁶, Mark Gerstein⁵, Tim Hubbard⁶, Alexandre Reymond¹⁰, Stylianos E. Antonarakis¹¹, Gregory J. Hannon¹, Morgan C. Giddings⁹, Morgan C. Giddings⁴, Yijun Ruan¹², Barbara J. Wold², Piero Carninci, Roderic Guigó¹⁴, Thomas R. Gingeras⁸, Thomas R. Gingeras¹ - Show less +84 more•Institutions (14)

Cold Spring Harbor Laboratory¹, California Institute of Technology², University of California, Irvine³, Florida State University College of Arts and Sciences⁴, Yale University⁵, Wellcome Trust Sanger Institute⁶, Norwegian University of Science and Technology⁷, Affymetrix⁸, University of North Carolina at Chapel Hill⁹, University of Lausanne¹⁰, University of Geneva¹¹, Genome Institute of Singapore¹², Stanford University¹³, Pompeu Fabra University¹⁴

06 Sep 2012-Nature

TL;DR: Evidence that three-quarters of the human genome is capable of being transcribed is reported, as well as observations about the range and levels of expression, localization, processing fates, regulatory regions and modifications of almost all currently annotated and thousands of previously unannotated RNAs that prompt a redefinition of the concept of a gene.

...read moreread less

Abstract: Eukaryotic cells make many types of primary and processed RNAs that are found either in specific subcellular compartments or throughout the cells. A complete catalogue of these RNAs is not yet available and their characteristic subcellular localizations are also poorly understood. Because RNA represents the direct output of the genetic information encoded by genomes and a significant proportion of a cell's regulatory capabilities are focused on its synthesis, processing, transport, modification and translation, the generation of such a catalogue is crucial for understanding genome function. Here we report evidence that three-quarters of the human genome is capable of being transcribed, as well as observations about the range and levels of expression, localization, processing fates, regulatory regions and modifications of almost all currently annotated and thousands of previously unannotated RNAs. These observations, taken together, prompt a redefinition of the concept of a gene.

...read moreread less

4,450 citations

Journal Article•DOI•

A comparative encyclopedia of DNA elements in the mouse genome

[...]

Feng Yue¹, Feng Yue², Yong Cheng³, Alessandra Breschi, Jeff Vierstra⁴, Weisheng Wu¹, Weisheng Wu⁵, Tyrone Ryba⁶, Tyrone Ryba⁷, Richard Sandstrom⁴, Zhihai Ma³, Carrie A. Davis⁸, Benjamin D. Pope⁷, Yin Shen², Dmitri D. Pervouchine, Sarah Djebali, Robert E. Thurman⁴, Rajinder Kaul⁴, Eric Rynes⁴, Anthony Kirilusha⁹, Georgi K. Marinov⁹, Brian A. Williams⁹, Diane Trout⁹, Henry Amrhein⁹, Katherine I. Fisher-Aylor⁹, Igor Antoshechkin⁹, Gilberto DeSalvo⁹, Lei Hoon See⁸, Meagan Fastuca⁸, Jorg Drenkow⁸, Chris Zaleski⁸, Alexander Dobin⁸, Pablo Prieto, Julien Lagarde, Giovanni Bussotti, Andrea Tanzer¹⁰, Olgert Denas¹¹, Kanwei Li¹¹, M. A. Bender¹², M. A. Bender⁴, Miaohua Zhang¹², Rachel Byron¹², Mark Groudine¹², Mark Groudine⁴, David McCleary², Long Pham², Zhen Ye², Samantha Kuan², Lee Edsall², Yi-Chieh Wu¹³, Matthew D. Rasmussen¹³, Mukul S. Bansal¹³, Manolis Kellis¹⁴, Manolis Kellis¹³, Cheryl A. Keller¹, Christapher S. Morrissey¹, Tejaswini Mishra¹, Deepti Jain¹, Nergiz Dogan¹, Robert S. Harris¹, Philip Cayting³, Trupti Kawli³, Alan P. Boyle⁵, Alan P. Boyle³, Ghia Euskirchen³, Anshul Kundaje³, Shin Lin³, Yiing Lin³, Camden Jansen¹⁵, Venkat S. Malladi³, Melissa S. Cline¹⁶, Drew T. Erickson³, Vanessa M. Kirkup¹⁶, Katrina Learned¹⁶, Cricket A. Sloan³, Kate R. Rosenbloom¹⁶, Beatriz Lacerda de Sousa¹⁷, Kathryn Beal, Miguel Pignatelli, Paul Flicek, Jin Lian¹⁸, Tamer Kahveci¹⁹, Dongwon Lee²⁰, W. James Kent¹⁶, Miguel Santos¹⁷, Javier Herrero²¹, Cedric Notredame, Audra K. Johnson⁴, Shinny Vong⁴, Kristen Lee⁴, Daniel Bates⁴, Fidencio Neri⁴, Morgan Diegel⁴, Theresa K. Canfield⁴, Peter J. Sabo⁴, Matthew S. Wilken⁴, Thomas A. Reh⁴, Erika Giste⁴, Anthony Shafer⁴, Tanya Kutyavin⁴, Eric Haugen⁴, Douglas Dunn⁴, Alex Reynolds⁴, Shane Neph⁴, Richard Humbert⁴, R. Scott Hansen⁴, Marella F. T. R. de Bruijn²², Licia Selleri²³, Alexander Y. Rudensky²⁴, Steven Z. Josefowicz²⁴, Robert M. Samstein²⁴, Evan E. Eichler⁴, Stuart H. Orkin²⁵, Dana N. Levasseur²⁶, Thalia Papayannopoulou⁴, Kai Hsin Chang⁴, Arthur I. Skoultchi²⁷, Srikanta Gosh²⁷, Christine M. Disteche⁴, Piper M. Treuting⁴, Yanli Wang¹, Mitchell J. Weiss, Gerd A. Blobel²⁸, Xiaoyi Cao², Sheng Zhong², Ting Wang²⁹, Peter J. Good³⁰, Rebecca F. Lowdon³⁰, Rebecca F. Lowdon²⁹, Leslie B. Adams³¹, Leslie B. Adams³⁰, Xiao Qiao Zhou³⁰, Michael J. Pazin³⁰, Elise A. Feingold³⁰, Barbara J. Wold⁹, James Taylor¹¹, Ali Mortazavi¹⁵, Sherman M. Weissman¹⁸, John A. Stamatoyannopoulos⁴, Michael Snyder³, Roderic Guigó, Thomas R. Gingeras⁸, David M. Gilbert⁷, Ross C. Hardison¹, Michael A. Beer²⁰, Bing Ren² - Show less +142 more•Institutions (31)

Pennsylvania State University¹, University of California, San Diego², Stanford University³, University of Washington⁴, University of Michigan⁵, New College of Florida⁶, Florida State University⁷, Cold Spring Harbor Laboratory⁸, California Institute of Technology⁹, University of Vienna¹⁰, Emory University¹¹, Fred Hutchinson Cancer Research Center¹², Massachusetts Institute of Technology¹³, Broad Institute¹⁴, University of California, Irvine¹⁵, University of California, Santa Cruz¹⁶, University of California, San Francisco¹⁷, Yale University¹⁸, University of Florida¹⁹, Johns Hopkins University²⁰, University College London²¹, University of Oxford²², Cornell University²³, Memorial Sloan Kettering Cancer Center²⁴, Harvard University²⁵, University of Iowa²⁶, Yeshiva University²⁷, University of Pennsylvania²⁸, Washington University in St. Louis²⁹, National Institutes of Health³⁰, University of North Carolina at Chapel Hill³¹

20 Nov 2014-Nature

TL;DR: The mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types as mentioned in this paper.

...read moreread less

Abstract: The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases

...read moreread less

1,335 citations

Journal Article•DOI•

An encyclopedia of mouse DNA elements (Mouse ENCODE)

[...]

John A. Stamatoyannopoulos¹, Michael Snyder², Ross C. Hardison³, Bing Ren⁴, Thomas R. Gingeras⁵, David M. Gilbert⁶, Mark Groudine⁷, M. A. Bender⁷, Rajinder Kaul¹, Theresa K. Canfield¹, Erica Giste¹, Audra K. Johnson¹, Mia Zhang⁷, Gayathri Balasundaram⁷, Rachel Byron⁷, Vaughan Roach¹, Peter J. Sabo¹, Richard Sandstrom¹, A Sandra Stehling¹, Robert E. Thurman¹, Sherman M. Weissman⁸, Philip Cayting⁸, Manoj Hariharan², Jin Lian⁸, Yong Cheng², Stephen G. Landt², Zhihai Ma², Barbara J. Wold⁹, Job Dekker¹⁰, Gregory E. Crawford¹¹, Cheryl A. Keller³, Weisheng Wu³, Christopher T. Morrissey³, Swathi Ashok Kumar³, Tejaswini Mishra³, Deepti Jain³, Marta Byrska-Bishop³, Daniel Blankenberg³, Bryan R. Lajoie², Gaurav Jain¹⁰, Amartya Sanyal¹⁰, Kaun-Bei Chen¹¹, Olgert Denas¹¹, James Taylor¹², Gerd A. Blobel¹³, Mitchell J. Weiss¹³, Max Pimkin¹³, Wulan Deng¹³, Georgi K. Marinov⁹, Brian A. Williams⁹, Katherine I. Fisher-Aylor⁹, Gilberto DeSalvo⁹, Anthony Kiralusha⁹, Diane Trout⁹, Henry Amrhein⁹, Ali Mortazavi¹⁴, Lee Edsall⁴, David McCleary⁴, Samantha Kuan⁴, Yin Shen⁴, Feng Yue⁴, Zhen Ye⁴, Carrie A. Davis⁵, Chris Zaleski⁵, Sonali Jha⁵, Chenghai Xue⁵, Alexander Dobin⁵, Wei Lin⁵, Meagan Fastuca⁵, Huaien Wang⁵, Roderic Guigó, Sarah Djebali, Julien Lagarde, Tyrone Ryba⁶, Takayo Sasaki⁶, Venkat S. Malladi¹⁵, Melissa S. Cline¹⁵, Vanessa M. Kirkup¹⁵, Katrina Learned¹⁵, Kate R. Rosenbloom¹⁵, W. James Kent¹⁵, Elise A. Feingold¹⁶, Peter J. Good¹⁶, Michael J. Pazin¹⁶, Rebecca F. Lowdon¹⁶, Leslie B Adams¹⁶ - Show less +82 more•Institutions (16)

University of Washington¹, Stanford University², Pennsylvania State University³, University of California, San Diego⁴, Cold Spring Harbor Laboratory⁵, Florida State University⁶, Fred Hutchinson Cancer Research Center⁷, Yale University⁸, California Institute of Technology⁹, University of Massachusetts Medical School¹⁰, Duke University¹¹, Emory University¹², Children's Hospital of Philadelphia¹³, University of California, Irvine¹⁴, University of California, Santa Cruz¹⁵, National Institutes of Health¹⁶

13 Aug 2012-Genome Biology

TL;DR: The Mouse E NCODE Consortium is applying the same experimental pipelines developed for human ENCODE to annotate the mouse genome to enable a broad range of mouse genomics efforts.

...read moreread less

Abstract: To complement the human Encyclopedia of DNA Elements (ENCODE) project and to enable a broad range of mouse genomics efforts, the Mouse ENCODE Consortium is applying the same experimental pipelines developed for human ENCODE to annotate the mouse genome

...read moreread less

445 citations

A comparative encyclopedia of DNA elements in the mouse genome

[...]

Feng Yue¹, Feng Yue², Yong Cheng³, Alessandra Breschi, Jeff Vierstra⁴, Weisheng Wu⁵, Weisheng Wu¹, Tyrone Ryba⁶, Tyrone Ryba⁷, Richard Sandstrom⁴, Zhihai Ma³, Carrie A. Davis⁸, Benjamin D. Pope⁷, Yin Shen², Dmitri D. Pervouchine, Sarah Djebali, Robert E. Thurman⁴, Rajinder Kaul⁴, Eric Rynes⁴, Anthony Kirilusha⁹, Georgi K. Marinov⁹, Brian A. Williams⁹, Diane Trout⁹, Henry Amrhein⁹, Katherine I. Fisher-Aylor⁹, Igor Antoshechkin⁹, Gilberto DeSalvo⁹, Lei Hoon See⁸, Meagan Fastuca⁸, Jorg Drenkow⁸, Chris Zaleski⁸, Alexander Dobin⁸, Pablo Prieto, Julien Lagarde, Giovanni Bussotti, Andrea Tanzer¹⁰, Olgert Denas¹¹, Kanwei Li¹¹, M. A. Bender¹², M. A. Bender⁴, Miaohua Zhang¹², Rachel Byron¹², Mark Groudine¹², Mark Groudine⁴, David McCleary², Long Pham², Zhen Ye², Samantha Kuan², Lee Edsall², Yi-Chieh Wu¹³, Matthew D. Rasmussen¹³, Mukul S. Bansal¹³, Manolis Kellis¹³, Manolis Kellis¹⁴, Cheryl A. Keller¹, Christapher S. Morrissey¹, Tejaswini Mishra¹, Deepti Jain¹, Nergiz Dogan¹, Robert S. Harris¹, Philip Cayting³, Trupti Kawli³, Alan P. Boyle⁵, Alan P. Boyle³, Ghia Euskirchen³, Anshul Kundaje³, Shin Lin³, Yiing Lin³, Camden Jansen¹⁵, Venkat S. Malladi³, Melissa S. Cline¹⁶, Drew T. Erickson³, Vanessa M. Kirkup¹⁶, Katrina Learned¹⁶, Cricket A. Sloan³, Kate R. Rosenbloom¹⁶, Beatriz Lacerda de Sousa¹⁷, Kathryn Beal, Miguel Pignatelli, Paul Flicek, Jin Lian¹⁸, Tamer Kahveci¹⁹, Dongwon Lee²⁰, W. James Kent¹⁶, Miguel Santos¹⁷, Javier Herrero²¹, Cedric Notredame, Audra K. Johnson⁴, Shinny Vong⁴, Kristen Lee⁴, Daniel Bates⁴, Fidencio Neri⁴, Morgan Diegel⁴, Theresa K. Canfield⁴, Peter J. Sabo⁴, Matthew S. Wilken⁴, Thomas A. Reh⁴, Erika Giste⁴, Anthony Shafer⁴, Tanya Kutyavin⁴, Eric Haugen⁴, Douglas Dunn⁴, Alex Reynolds⁴, Shane Neph⁴, Richard Humbert⁴, R. Scott Hansen⁴, Marella F. T. R. de Bruijn²², Licia Selleri²³, Alexander Y. Rudensky²⁴, Steven Z. Josefowicz²⁴, Robert M. Samstein²⁴, Evan E. Eichler⁴, Stuart H. Orkin²⁵, Dana N. Levasseur²⁶, Thalia Papayannopoulou⁴, Kai Hsin Chang⁴, Arthur I. Skoultchi²⁷, Srikanta Gosh²⁷, Christine M. Disteche⁴, Piper M. Treuting⁴, Yanli Wang¹, Mitchell J. Weiss, Gerd A. Blobel²⁸, Xiaoyi Cao², Sheng Zhong², Ting Wang²⁹, Peter J. Good³⁰, Rebecca F. Lowdon³⁰, Rebecca F. Lowdon²⁹, Leslie B. Adams³⁰, Leslie B. Adams³¹, Xiao Qiao Zhou³⁰, Michael J. Pazin³⁰, Elise A. Feingold³⁰, Barbara J. Wold⁹, James Taylor¹¹, Ali Mortazavi¹⁵, Sherman M. Weissman¹⁸, John A. Stamatoyannopoulos⁴, Michael Snyder³, Roderic Guigó, Thomas R. Gingeras⁸, David M. Gilbert⁷, Ross C. Hardison¹, Michael A. Beer²⁰, Bing Ren² - Show less +142 more•Institutions (31)

01 Nov 2014

TL;DR: By comparing with the human genome, this work not only confirms substantial conservation in the newly annotated potential functional sequences, but also finds a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization.

...read moreread less

Abstract: The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.

...read moreread less

226 citations

Journal Article•DOI•

Enhanced transcriptome maps from multiple mouse tissues reveal evolutionary constraint in gene expression

[...]

Dmitri D. Pervouchine¹, Sarah Djebali, Alessandra Breschi, Carrie A. Davis², Pablo Prieto Barja, Alexander Dobin², Andrea Tanzer³, Julien Lagarde, Chris Zaleski², Lei Hoon See², Meagan Fastuca², Jorg Drenkow², Huaien Wang², Giovanni Bussotti, Baikang Pei⁴, Suganthi Balasubramanian⁴, Jean Monlong⁵, Arif Harmanci⁴, Mark Gerstein⁴, Michael A. Beer⁶, Cedric Notredame, Roderic Guigó, Thomas R. Gingeras² - Show less +19 more•Institutions (6)

Moscow State University¹, Cold Spring Harbor Laboratory², University of Vienna³, Yale University⁴, McGill University⁵, Johns Hopkins University⁶

13 Jan 2015-Nature Communications

TL;DR: In this article, the authors characterize, by RNA sequencing, the transcriptional profiles of a large and heterogeneous collection of mouse tissues, augmenting the mouse transcriptome with thousands of novel transcript candidates.

...read moreread less

Abstract: Mice have been a long-standing model for human biology and disease. Here we characterize, by RNA sequencing, the transcriptional profiles of a large and heterogeneous collection of mouse tissues, augmenting the mouse transcriptome with thousands of novel transcript candidates. Comparison with transcriptome profiles in human cell lines reveals substantial conservation of transcriptional programmes, and uncovers a distinct class of genes with levels of expression that have been constrained early in vertebrate evolution. This core set of genes captures a substantial fraction of the transcriptional output of mammalian cells, and participates in basic functional and structural housekeeping processes common to all cell types. Perturbation of these constrained genes is associated with significant phenotypes including embryonic lethality and cancer. Evolutionary constraint in gene expression levels is not reflected in the conservation of the genomic sequences, but is associated with conserved epigenetic marking, as well as with characteristic post-transcriptional regulatory programme, in which sub-cellular localization and alternative splicing play comparatively large roles.

...read moreread less

81 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

STAR: ultrafast universal RNA-seq aligner

[...]

Alexander Dobin¹, Carrie A. Davis¹, Felix Schlesinger¹, Jorg Drenkow¹, Chris Zaleski¹, Sonali Jha¹, Philippe Batut¹, Mark Chaisson¹, Thomas R. Gingeras¹ - Show less +5 more•Institutions (1)

Cold Spring Harbor Laboratory¹

01 Jan 2013-Bioinformatics

TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.

...read moreread less

Abstract: Motivation Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. Results To align our large (>80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. Availability and implementation STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.

...read moreread less

30,684 citations

Journal Article•DOI•

An integrated encyclopedia of DNA elements in the human genome

[...]

Principal investigators¹, Nhgri groups², Data production leads³, Lead analysts³•Institutions (3)

Wellcome Trust¹, University of Washington², Pennsylvania State University³

06 Sep 2012-Nature

TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of the authors' genes and genome, and is an expansive resource of functional annotations for biomedical research.

...read moreread less

Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

...read moreread less

13,548 citations

Journal Article•

An integrated encyclopedia of DNA elements in the human genome.

[...]

ENCODEConsortium

01 Jan 2012-Nature

...read moreread less

8,106 citations

Journal Article•DOI•

voom: precision weights unlock linear model analysis tools for RNA-seq read counts

[...]

Charity W. Law¹, Charity W. Law², Yunshun Chen², Yunshun Chen¹, Wei Shi¹, Wei Shi², Gordon K. Smyth¹, Gordon K. Smyth² - Show less +4 more•Institutions (2)

University of Melbourne¹, Walter and Eliza Hall Institute of Medical Research²

03 Feb 2014-Genome Biology

TL;DR: New normal linear modeling strategies are presented for analyzing read counts from RNA-seq experiments, and the voom method estimates the mean-variance relationship of the log-counts, generates a precision weight for each observation and enters these into the limma empirical Bayes analysis pipeline.

...read moreread less

Abstract: New normal linear modeling strategies are presented for analyzing read counts from RNA-seq experiments. The voom method estimates the mean-variance relationship of the log-counts, generates a precision weight for each observation and enters these into the limma empirical Bayes analysis pipeline. This opens access for RNA-seq analysts to a large body of methodology developed for microarrays. Simulation studies show that voom performs as well or better than count-based RNA-seq methods even when the data are generated according to the assumptions of the earlier methods. Two case studies illustrate the use of linear modeling and gene set testing methods.

...read moreread less

4,475 citations

Journal Article•DOI•

The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression.

[...]

Thomas Derrien, Rory Johnson, Giovanni Bussotti, Andrea Tanzer, Sarah Djebali, Hagen Tilgner, Gregory Guernec¹, David C. Martin, Angelika Merkel, David G. Knowles, Julien Lagarde, Lavanya Veeravalli², Xiaoan Ruan², Yijun Ruan², Timo Lassmann, Piero Carninci, James B. Brown³, Leonard Lipovich⁴, José M. González⁵, Mark G. Thomas⁵, Carrie A. Davis⁶, Ramin Shiekhattar⁷, Thomas R. Gingeras⁶, Tim Hubbard⁵, Cedric Notredame, Jennifer Harrow⁵, Roderic Guigó⁸ - Show less +23 more•Institutions (8)

Institut national de la recherche agronomique¹, Agency for Science, Technology and Research², University of California, Berkeley³, Wayne State University⁴, Wellcome Trust Sanger Institute⁵, Cold Spring Harbor Laboratory⁶, Wistar Institute⁷, Pompeu Fabra University⁸

01 Sep 2012-Genome Research

TL;DR: The most complete human lncRNA annotation to date is presented, produced by the GENCODE consortium within the framework of the ENCODE project and comprising 9277 manually annotated genes producing 14,880 transcripts, and expression correlation analysis indicates that lncRNAs show particularly striking positive correlation with the expression of antisense coding genes.

...read moreread less

Abstract: The human genome contains many thousands of long noncoding RNAs (lncRNAs). While several studies have demonstrated compelling biological and disease roles for individual examples, analytical and experimental approaches to investigate these genes have been hampered by the lack of comprehensive lncRNA annotation. Here, we present and analyze the most complete human lncRNA annotation to date, produced by the GENCODE consortium within the framework of the ENCODE project and comprising 9277 manually annotated genes producing 14,880 transcripts. Our analyses indicate that lncRNAs are generated through pathways similar to that of protein-coding genes, with similar histone-modification profiles, splicing signals, and exon/intron lengths. In contrast to protein-coding genes, however, lncRNAs display a striking bias toward two-exon transcripts, they are predominantly localized in the chromatin and nucleus, and a fraction appear to be preferentially processed into small RNAs. They are under stronger selective pressure than neutrally evolving sequences-particularly in their promoter regions, which display levels of selection comparable to protein-coding genes. Importantly, about one-third seem to have arisen within the primate lineage. Comprehensive analysis of their expression in multiple human organs and brain regions shows that lncRNAs are generally lower expressed than protein-coding genes, and display more tissue-specific expression patterns, with a large fraction of tissue-specific lncRNAs expressed in the brain. Expression correlation analysis indicates that lncRNAs show particularly striking positive correlation with the expression of antisense coding genes. This GENCODE annotation represents a valuable resource for future studies of lncRNAs.

...read moreread less

4,291 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse