Home
/
Authors
/
Pablo Tamayo

Author

Pablo Tamayo

Other affiliations: University of California, Berkeley, Harvard University, Massachusetts Institute of Technology ...read more

Bio: Pablo Tamayo is an academic researcher from University of California, San Diego. The author has contributed to research in topics: Gene expression profiling & Cancer. The author has an hindex of 72, co-authored 177 publications receiving 97318 citations. Previous affiliations of Pablo Tamayo include University of California, Berkeley & Harvard University.

Topics: Gene expression profiling, Cancer, Gene, RNA interference, Medulloblastoma ...read more

Papers published on a yearly basis

2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Molecular classification of multiple tumor types.

[...]

Chen-Hsiang Yeang¹, Sridhar Ramaswamy¹, Pablo Tamayo¹, Sayan Mukherjee¹, Ryan Rifkin¹, Michael Angelo¹, Michael Reich¹, Eric S. Lander¹, Jill P. Mesirov¹, Todd R. Golub¹ - Show less +6 more•Institutions (1)

Massachusetts Institute of Technology¹

01 Jun 2001-Bioinformatics

TL;DR: This work obtained 190 samples from 14 tumor classes and generated a combined expression dataset containing 16063 genes for each of those samples, and performed multi-class classification by combining the outputs of binary classifiers.

...read moreread less

Abstract: Using gene expression data to classify tumor types is a very promising tool in cancer diagnosis. Previous works show several pairs of tumor types can be successfully distinguished by their gene expression patterns (Golub et al. 1999, Ben-Dor et al. 2000, Alizadeh et al. 2000). However, the simultaneous classification across a heterogeneous set of tumor types has not been well studied yet. We obtained 190 samples from 14 tumor classes and generated a combined expression dataset containing 16063 genes for each of those samples. We performed multi-class classification by combining the outputs of binary classifiers. Three binary classifiers (k-nearest neighbors, weighted voting, and support vector machines) were applied in conjunction with three combination scenarios (one-vs-all, all-pairs, hierarchical partitioning). We achieved the best cross validation error rate of 18.75% and the best test error rate of 21.74% by using the one-vs-all support vector machine algorithm. The results demonstrate the feasibility of performing clinically useful classification from samples of multiple tumor types.

...read moreread less

305 citations

Journal Article•DOI•

Estimating dataset size requirements for classifying DNA microarray data.

[...]

Sayan Mukherjee¹, Pablo Tamayo, Simon Rogers, Ryan Rifkin, Anna Engle, Colin Campbell, Todd R. Golub, Jill P. Mesirov - Show less +4 more•Institutions (1)

Massachusetts Institute of Technology¹

01 Jan 2003-Journal of Computational Biology

TL;DR: A statistical methodology for estimating dataset size requirements for classifying microarray data using learning curves is introduced, based on fitting inverse power-law models to construct empirical learning curves.

...read moreread less

Abstract: A statistical methodology for estimating dataset size requirements for classifying microarray data using learning curves is introduced. The goal is to use existing classification results to estimate dataset size requirements for future classification experiments and to evaluate the gain in accuracy and significance of classifiers built with additional data. The method is based on fitting inverse power-law models to construct empirical learning curves. It also includes a permutation test procedure to assess the statistical significance of classification performance for a given dataset size. This procedure is applied to several molecular classification problems representing a broad spectrum of levels of complexity.

...read moreread less

274 citations

Journal Article•DOI•

Erratum: Parallel genome-scale loss of function screens in 216 cancer cell lines for the identification of context-specific genetic dependencies

[...]

Glenn S. Cowley, Barbara A. Weir, Francisca Vazquez, Pablo Tamayo, Justine A. Scott, Scott F. Rusin, Alexandra East-Seletsky, Levi D. Ali, William F.J Gerath, Sarah E Pantel, Patrick H. Lizotte, Guozhi Jiang, Jessica Hsiao, Aviad Tsherniak, Elizabeth Dwinell, Simon Aoyama, Michael Okamoto, William F. Harrington, Ellen Gelfand, Thomas M Green, Mark J Tomko, Shuba Gopal, Terence C. Wong, Hubo Li, Sara Howell, Nicolas Stransky, Ted Liefeld, Dongkeun Jang, Jonathan Bistline, Barbara Hill Meyers, Scott A. Armstrong, Kenneth C. Anderson, Kimberly Stegmaier, Michael R. Reich, David Pellman, Jesse S. Boehm, Jill P. Mesirov, Todd R. Golub, David E. Root, William C. Hahn - Show less +36 more

11 Nov 2014-Scientific Data

TL;DR: The original version of this Data Descriptor contained a typographical error in the spelling of the author Terence C. Wong, which was incorrectly given as Terrence C Wong as discussed by the authors.

...read moreread less

Abstract: Scientific Data 1:140035 doi: 10.1038/sdata.2014.35 (2014); Published 30 September 2014; Updated 11 November 2014 The original version of this Data Descriptor contained a typographical error in the spelling of the author Terence C. Wong, which was incorrectly given as Terrence C. Wong. This has now been corrected in the PDF and HTML versions of the Data Descriptor.

...read moreread less

244 citations

Journal Article•DOI•

Distinct physiological states of Plasmodium falciparum in malaria-infected patients

[...]

Johanna P. Daily¹, Daniel Scanfeld², Nathalie Pochet², Nathalie Pochet¹, K. G. Le Roch³, David Plouffe⁴, Michael Kamal², Ousmane Sarr, S. Mboup, Omar Ndir⁵, David Wypij¹, K. Levasseur, Edwin L. Thomas², Pablo Tamayo², Carolyn K. Dong, Yingyao Zhou⁴, Eric S. Lander², Daouda Ndiaye⁵, Dyann F. Wirth, Elizabeth A. Winzeler⁴, Elizabeth A. Winzeler⁶, Jill P. Mesirov², Aviv Regev² - Show less +19 more•Institutions (6)

Harvard University¹, Massachusetts Institute of Technology², University of California, Riverside³, Genomics Institute of the Novartis Research Foundation⁴, Cheikh Anta Diop University⁵, Scripps Research Institute⁶

13 Dec 2007-Nature

TL;DR: A large study of in vivo expression profiles of parasites derived directly from blood samples from infected patients reveals a previously unknown physiological diversity in the in vivo biology of the malaria parasite, and indicates in vivo and in vitro studies to determine how this variation may affect disease manifestations and treatment.

...read moreread less

Abstract: Infection with the malaria parasite Plasmodium falciparum leads to widely different clinical conditions in children, ranging from mild flu-like symptoms to coma and death. Despite the immense medical implications, the genetic and molecular basis of this diversity remains largely unknown. Studies of in vitro gene expression have found few transcriptional differences between different parasite strains. Here we present a large study of in vivo expression profiles of parasites derived directly from blood samples from infected patients. The in vivo expression profiles define three distinct transcriptional states. The biological basis of these states can be interpreted by comparison with an extensive compendium of expression data in the yeast Saccharomyces cerevisiae. The three states in vivo closely resemble, first, active growth based on glycolytic metabolism, second, a starvation response accompanied by metabolism of alternative carbon sources, and third, an environmental stress response. The glycolytic state is highly similar to the known profile of the ring stage in vitro, but the other states have not been observed in vitro. The results reveal a previously unknown physiological diversity in the in vivo biology of the malaria parasite, in particular evidence for a functional mitochondrion in the asexual-stage parasite, and indicate in vivo and in vitro studies to determine how this variation may affect disease manifestations and treatment.

...read moreread less

243 citations

Journal Article•DOI•

Compendium of Immune Signatures Identifies Conserved and Species-Specific Biology in Response to Inflammation

[...]

Jernej Godec¹, Yan Tan², Yan Tan³, Arthur Liberzon³, Pablo Tamayo³, Sanchita Bhattacharya⁴, Atul J. Butte⁴, Jill P. Mesirov², Jill P. Mesirov³, W. Nicholas Haining³, W. Nicholas Haining¹ - Show less +7 more•Institutions (4)

Harvard University¹, Boston University², Broad Institute³, University of California, San Francisco⁴

19 Jan 2016-Immunity

TL;DR: Analysis using ImmuneSigDB identified signatures induced in activated myeloid cells and differentiating lymphocytes that were highly conserved between humans and mice and species-specific biological processes in the sepsis transcriptional response.

...read moreread less

226 citations

1
2
3
4
…
5
6
7
8
9
10
11
…
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

[...]

Aravind Subramanian¹, Pablo Tamayo¹, Vamsi K. Mootha², Sayan Mukherjee³, Benjamin L. Ebert², Michael A. Gillette², Amanda G. Paulovich⁴, Scott L. Pomeroy², Todd R. Golub², Eric S. Lander¹, Jill P. Mesirov¹ - Show less +7 more•Institutions (4)

Massachusetts Institute of Technology¹, Harvard University², Duke University³, Fred Hutchinson Cancer Research Center⁴

25 Oct 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The Gene Set Enrichment Analysis (GSEA) method as discussed by the authors focuses on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation.

...read moreread less

Abstract: Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.

...read moreread less

34,830 citations

Journal Article•DOI•

Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks

[...]

Paul Shannon¹, Andrew Markiel, Owen Ozier, Nitin S. Baliga, Jonathan T. Wang, Daniel Ramage, Nada Amin, Benno Schwikowski, Trey Ideker - Show less +5 more•Institutions (1)

Institute for Systems Biology¹

01 Nov 2003-Genome Research

TL;DR: Several case studies of Cytoscape plug-ins are surveyed, including a search for interaction pathways correlating with changes in gene expression, a study of protein complexes involved in cellular recovery to DNA damage, inference of a combined physical/functional interaction network for Halobacterium, and an interface to detailed stochastic/kinetic gene regulatory models.

...read moreread less

Abstract: Cytoscape is an open source software project for integrating biomolecular interaction networks with high-throughput expression data and other molecular states into a unified conceptual framework. Although applicable to any system of molecular components and interactions, Cytoscape is most powerful when used in conjunction with large databases of protein-protein, protein-DNA, and genetic interactions that are increasingly available for humans and model organisms. Cytoscape's software Core provides basic functionality to layout and query the network; to visually integrate the network with expression profiles, phenotypes, and other molecular states; and to link the network to databases of functional annotations. The Core is extensible through a straightforward plug-in architecture, allowing rapid development of additional computational analyses and features. Several case studies of Cytoscape plug-ins are surveyed, including a search for interaction pathways correlating with changes in gene expression, a study of protein complexes involved in cellular recovery to DNA damage, inference of a combined physical/functional interaction network for Halobacterium, and an interface to detailed stochastic/kinetic gene regulatory models.

...read moreread less

32,980 citations

Journal Article•DOI•

Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

[...]

Da-Wei Huang¹, Brad T. Sherman¹, Richard A. Lempicki¹•Institutions (1)

Science Applications International Corporation¹

01 Jan 2009-Nature Protocols

TL;DR: By following this protocol, investigators are able to gain an in-depth understanding of the biological themes in lists of genes that are enriched in genome-scale studies.

...read moreread less

Abstract: DAVID bioinformatics resources consists of an integrated biological knowledgebase and analytic tools aimed at systematically extracting biological meaning from large gene/protein lists. This protocol explains how to use DAVID, a high-throughput and integrated data-mining environment, to analyze gene lists derived from high-throughput genomic experiments. The procedure first requires uploading a gene list containing any number of common gene identifiers followed by analysis using one or more text and pathway-mining tools such as gene functional classification, functional annotation chart or clustering and functional annotation table. By following this protocol, investigators are able to gain an in-depth understanding of the biological themes in lists of genes that are enriched in genome-scale studies.

...read moreread less

31,015 citations

Journal Article•DOI•

limma powers differential expression analyses for RNA-sequencing and microarray studies

[...]

Matthew E. Ritchie¹, Belinda Phipson², Di Wu³, Yifang Hu¹, Charity W. Law⁴, Wei Shi¹, Gordon K. Smyth⁵, Gordon K. Smyth¹ - Show less +4 more•Institutions (5)

Walter and Eliza Hall Institute of Medical Research¹, Royal Children's Hospital², Harvard University³, University of Zurich⁴, University of Melbourne⁵

20 Apr 2015-Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

...read moreread less

Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

...read moreread less

22,147 citations

Journal Article•DOI•

Regularization and variable selection via the elastic net

[...]

Hui Zou¹, Trevor Hastie¹•Institutions (1)

Stanford University¹

01 Apr 2005-Journal of The Royal Statistical Society Series B-statistical Methodology

TL;DR: It is shown that the elastic net often outperforms the lasso, while enjoying a similar sparsity of representation, and an algorithm called LARS‐EN is proposed for computing elastic net regularization paths efficiently, much like algorithm LARS does for the lamba.

...read moreread less

Abstract: Summary. We propose the elastic net, a new regularization and variable selection method. Real world data and a simulation study show that the elastic net often outperforms the lasso, while enjoying a similar sparsity of representation. In addition, the elastic net encourages a grouping effect, where strongly correlated predictors tend to be in or out of the model together.The elastic net is particularly useful when the number of predictors (p) is much bigger than the number of observations (n). By contrast, the lasso is not a very satisfactory variable selection method in the

...read moreread less

16,538 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse