Author

Alicia Oshlack

Other affiliations: Walter and Eliza Hall Institute of Medical Research, Monash University, Victorian Life Sciences Computation Initiative

Bio: Alicia Oshlack is an academic researcher at the Peter MacCallum Cancer Centre. The author has contributed to research on topics including regulation of gene expression and population. The author has an h-index of 49 and has co-authored 149 publications, which have received 17,971 citations. Previous affiliations of Alicia Oshlack include the Walter and Eliza Hall Institute of Medical Research and Monash University.

Topics: Regulation of gene expression, Population, Gene, Bioconductor, Transcriptome

Papers published on a yearly basis (chart not reproduced; years shown: 2001, 2002, 2005 to 2023)

Papers

Journal Article

A scaling normalization method for differential expression analysis of RNA-seq data

Mark D. Robinson¹,², Alicia Oshlack²

Garvan Institute of Medical Research¹, Walter and Eliza Hall Institute of Medical Research²

02 Mar 2010-Genome Biology

TL;DR: A simple and effective method for performing normalization is outlined and dramatically improved results for inferring differential expression in simulated and publicly available data sets are shown.

Abstract: The fine detail provided by sequencing-based transcriptome surveys suggests that RNA-seq is likely to become the platform of choice for interrogating steady state RNA. In order to discover biologically important changes in expression, we show that normalization continues to be an essential step in the analysis. We outline a simple and effective method for performing normalization and show dramatically improved results for inferring differential expression in simulated and publicly available data sets.
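The abstract does not spell out the method. As a rough illustration only, a scaling normalization can be sketched as estimating one factor per sample from a trimmed mean of log expression ratios against a reference sample. The function name, the 30% trim fraction, and the unweighted mean below are assumptions of this sketch, not the paper's exact procedure or any package's API.

```python
import math


def trimmed_mean_scale_factor(sample, reference, trim=0.3):
    """Estimate a scaling factor for `sample` relative to `reference`.

    Both arguments are lists of raw read counts, one entry per gene,
    in the same gene order. Simplified sketch: unweighted, and it trims
    only on the log ratios.
    """
    n_sample, n_reference = sum(sample), sum(reference)
    log_ratios = []
    for y_s, y_r in zip(sample, reference):
        # Genes with a zero count in either sample have no finite log ratio.
        if y_s > 0 and y_r > 0:
            log_ratios.append(math.log2((y_s / n_sample) / (y_r / n_reference)))
    if not log_ratios:
        return 1.0
    log_ratios.sort()
    # Drop the most extreme log ratios at both ends before averaging.
    k = int(len(log_ratios) * trim)
    kept = log_ratios[k:len(log_ratios) - k] or log_ratios
    return 2 ** (sum(kept) / len(kept))
```

Multiplying a sample's library size by this factor gives an effective library size in which a few very highly expressed genes no longer distort the comparison of the remaining genes.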

6,042 citations

Journal Article

Gene ontology analysis for RNA-seq: accounting for selection bias

Matthew D. Young¹, Matthew Wakefield¹, Gordon K. Smyth¹, Alicia Oshlack¹

Walter and Eliza Hall Institute of Medical Research¹

04 Feb 2010-Genome Biology

TL;DR: Application of GOseq to a prostate cancer data set shows that GOseq dramatically changes the results, highlighting categories more consistent with the known biology.

Abstract: We present GOseq, an application for performing Gene Ontology (GO) analysis on RNA-seq data. GO analysis is widely used to reduce complexity and highlight biological processes in genome-wide expression studies, but standard methods give biased results on RNA-seq data due to over-detection of differential expression for long and highly expressed transcripts. Application of GOseq to a prostate cancer data set shows that GOseq dramatically changes the results, highlighting categories more consistent with the known biology.

5,034 citations

Journal Article

A comparison of background correction methods for two-colour microarrays

Matthew E. Ritchie¹, Jeremy D. Silver², Alicia Oshlack², Melissa L. Holmes², Dileepa Diyagama², Andrew J. Holloway², Gordon K. Smyth²

University of Cambridge¹, Walter and Eliza Hall Institute of Medical Research²

20 Sep 2007-Bioinformatics

TL;DR: The model-based correction methods are shown to be markedly superior to the usual practice of subtracting local background estimates, and methods which stabilize the variances of the log-ratios along the intensity range perform the best.

Abstract: Motivation: Microarray data must be background corrected to remove the effects of non-specific binding or spatial heterogeneity across the array, but this practice typically causes other problems such as negative corrected intensities and high variability of low intensity log-ratios. Different estimators of background, and various model-based processing methods, are compared in this study in search of the best option for differential expression analyses of small microarray experiments. Results: Using data where some independent truth in gene expression is known, eight different background correction alternatives are compared, in terms of precision and bias of the resulting gene expression measures, and in terms of their ability to detect differentially expressed genes as judged by two popular algorithms, SAM and limma eBayes. A new background processing method (normexp) is introduced which is based on a convolution model. The model-based correction methods are shown to be markedly superior to the usual practice of subtracting local background estimates. Methods which stabilize the variances of the log-ratios along the intensity range perform the best. The normexp+offset method is found to give the lowest false discovery rate overall, followed by morph and vsn. Like vsn, normexp is applicable to most types of two-colour microarray data. Availability: The background correction methods compared in this article are available in the R package limma (Smyth, 2005) from http://www.bioconductor.org. Contact: smyth@wehi.edu.au Supplementary information: Supplementary data are available from http://bioinf.wehi.edu.au/resources/webReferences.html.

946 citations

Journal Article

SWAN: Subset-quantile Within Array Normalization for Illumina Infinium HumanMethylation450 BeadChips

Jovana Maksimovic, Lavinia Gordon, Alicia Oshlack

15 Jun 2012-Genome Biology

TL;DR: Subset-quantile Within Array Normalization (SWAN) is presented, a new method that substantially improves the results from this platform by reducing technical variation within and between arrays.

Abstract: DNA methylation is the most widely studied epigenetic mark and is known to be essential to normal development and frequently disrupted in disease. The Illumina HumanMethylation450 BeadChip assays the methylation status of CpGs at 485,577 sites across the genome. Here we present Subset-quantile Within Array Normalization (SWAN), a new method that substantially improves the results from this platform by reducing technical variation within and between arrays. SWAN is available in the minfi Bioconductor package.

734 citations

Journal Article

From RNA-seq reads to differential expression results

Alicia Oshlack¹, Mark D. Robinson¹,², Matthew D. Young¹

Walter and Eliza Hall Institute of Medical Research¹, Garvan Institute of Medical Research²

22 Dec 2010-Genome Biology

TL;DR: Many methods and tools are available for preprocessing high-throughput RNA sequencing data and detecting differential expression.

Abstract: Many methods and tools are available for preprocessing high-throughput RNA sequencing data and detecting differential expression.

731 citations

Cited by

Journal Article

limma powers differential expression analyses for RNA-sequencing and microarray studies

Matthew E. Ritchie¹, Belinda Phipson², Di Wu³, Yifang Hu¹, Charity W. Law⁴, Wei Shi¹, Gordon K. Smyth¹,⁵

Walter and Eliza Hall Institute of Medical Research¹, Royal Children's Hospital², Harvard University³, University of Zurich⁴, University of Melbourne⁵

20 Apr 2015-Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

22,147 citations

Journal Article

featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features

Yang Liao¹, Gordon K. Smyth¹, Wei Shi¹

Walter and Eliza Hall Institute of Medical Research¹

01 Apr 2014-Bioinformatics

TL;DR: featureCounts is a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments; it implements highly efficient chromosome hashing and feature blocking techniques.

Abstract: MOTIVATION: Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature. RESULTS: We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works with either single or paired-end reads and provides a wide range of options appropriate for different sequencing applications. AVAILABILITY AND IMPLEMENTATION: featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.
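Read summarization as described in the abstract can be illustrated with a toy sketch. The code below is not featureCounts and does not reproduce its algorithms; the function name, the tuple layouts, and the rule that a read overlapping more than one gene is left unassigned are all assumptions made for illustration. Grouping features by chromosome in a dictionary is only a loose analogue of the chromosome hashing the abstract mentions.

```python
from collections import defaultdict


def count_reads(features, reads):
    """Count reads per gene by interval overlap.

    features: list of (gene_id, chrom, start, end) tuples
    reads:    list of (chrom, start, end) tuples
    Coordinates are treated as 1-based and inclusive.
    """
    # Group features by chromosome so each read is only compared
    # against features on its own chromosome.
    by_chrom = defaultdict(list)
    for gene_id, chrom, start, end in features:
        by_chrom[chrom].append((start, end, gene_id))

    counts = {gene_id: 0 for gene_id, _, _, _ in features}
    for chrom, r_start, r_end in reads:
        hits = {
            gene_id
            for start, end, gene_id in by_chrom.get(chrom, [])
            if r_start <= end and start <= r_end
        }
        # Assumption of this sketch: ambiguous reads (more than one
        # gene) and reads hitting no gene are not counted.
        if len(hits) == 1:
            counts[hits.pop()] += 1
    return counts
```

The linear scan over a chromosome's features keeps the sketch short; a real tool would use an indexed structure to avoid comparing every read with every feature.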

14,103 citations

Journal Article

Differential expression analysis for sequence count data

Simon Anders, Wolfgang Huber

27 Oct 2010-Genome Biology

TL;DR: A method based on the negative binomial distribution, with variance and mean linked by local regression, is proposed and an implementation, DESeq, as an R/Bioconductor package is presented.

Abstract: High-throughput sequencing assays such as RNA-Seq, ChIP-Seq or barcode counting provide quantitative readouts in the form of count data. To infer differential signal in such data correctly and with good statistical power, estimation of data variability throughout the dynamic range and a suitable error model are required. We propose a method based on the negative binomial distribution, with variance and mean linked by local regression and present an implementation, DESeq, as an R/Bioconductor package.
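To make the role of the negative binomial concrete, the sketch below uses the common mean and dispersion parameterization, in which the variance is mu + dispersion * mu^2 and therefore exceeds the Poisson variance mu. This is a generic illustration under that assumption; it does not reproduce DESeq's local regression between variance and mean, and the function names are hypothetical.

```python
def nb_variance(mu, dispersion):
    """Variance of a negative binomial with mean `mu` and the given dispersion."""
    return mu + dispersion * mu ** 2


def nb_size_prob(mu, dispersion):
    """Convert (mean, dispersion) to the (size, prob) parameterization.

    `size` is the number of successes and `prob` the success probability,
    so that mean = size * (1 - prob) / prob.
    """
    size = 1.0 / dispersion
    prob = size / (size + mu)
    return size, prob
```

For a gene with a mean count of 100 and dispersion 0.1, the variance under this model is 1100, eleven times what a Poisson model would assume.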

13,356 citations

Journal Article

Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation

Cole Trapnell¹,², Brian A. Williams³, Geo Pertea², Ali Mortazavi³, Gordon Kwan³, Marijke J. van Baren⁴, Steven L. Salzberg², Barbara J. Wold³, Lior Pachter¹

University of California, Berkeley¹, University of Maryland, College Park², California Institute of Technology³, Washington University in St. Louis⁴

01 May 2010-Nature Biotechnology

TL;DR: The results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.

Abstract: High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed >430 million paired 75-bp RNA-Seq reads from a mouse myoblast cell line over a differentiation time series. We detected 13,692 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Over the time series, 330 genes showed complete switches in the dominant transcription start site (TSS) or splice isoform, and we observed more subtle shifts in 1,304 other genes. These results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.

13,337 citations

Journal Article

De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis

Brian J. Haas¹, Alexie Papanicolaou², Moran Yassour³,⁴, Manfred Grabherr⁵, Philip D. Blood⁶, Joshua C. Bowden², M. B. Couger⁷, David Eccles⁸, Bo Li⁹, Matthias Lieber¹⁰, Matthew D. MacManes¹¹, Michael Ott², Joshua Orvis, Nathalie Pochet³,¹², Francesco Strozzi¹³, Nathan T. Weeks¹⁴, Rick Westerman¹⁵, Thomas William, Colin N. Dewey⁹, Robert Henschel¹⁶, Richard D. LeDuc¹⁶, Nir Friedman⁴, Aviv Regev³

Broad Institute¹, Commonwealth Scientific and Industrial Research Organisation², Massachusetts Institute of Technology³, Hebrew University of Jerusalem⁴, Science for Life Laboratory⁵, Pittsburgh Supercomputing Center⁶, Oklahoma State University–Stillwater⁷, Griffith University⁸, University of Wisconsin-Madison⁹, Dresden University of Technology¹⁰, California Institute for Quantitative Biosciences¹¹, Flanders Institute for Biotechnology¹², Parco Tecnologico Padano¹³, United States Department of Agriculture¹⁴, Purdue University¹⁵, Indiana University¹⁶

01 Aug 2013-Nature Protocols

TL;DR: This protocol provides a workflow for genome-independent transcriptome analysis leveraging the Trinity platform and presents Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes.

Abstract: De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on 'non-model organisms' of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.

6,369 citations
