Home
/
Authors
/
Sohrab P. Shah

Author

Sohrab P. Shah

Other affiliations: University of British Columbia, BC Cancer Agency, BC Cancer Research Centre ...read more

Bio: Sohrab P. Shah is an academic researcher from Memorial Sloan Kettering Cancer Center. The author has contributed to research in topics: Cancer & Diffuse large B-cell lymphoma. The author has an hindex of 68, co-authored 179 publications receiving 25390 citations. Previous affiliations of Sohrab P. Shah include University of British Columbia & BC Cancer Agency.

Topics: Cancer, Diffuse large B-cell lymphoma, Genome, Lymphoma, Germline mutation ...read more

Papers published on a yearly basis

2023
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups

[...]

Christina Curtis¹, Christina Curtis², Sohrab P. Shah³, Suet-Feung Chin², Gulisa Turashvili³, Oscar M. Rueda², Mark J Dunning, Doug Speed¹, Doug Speed², Andy G. Lynch², Shamith A. Samarajiwa², Yinyin Yuan², Stefan Gräf², Gavin Ha³, Gholamreza Haffari³, Ali Bashashati³, Roslin Russell, Steven McKinney³, Anita Langerød⁴, Andrew R. Green⁵, Elena Provenzano², Gordon C. Wishart², Sarah E Pinder⁶, Peter H. Watson⁷, Peter H. Watson³, Florian Markowetz², Leigh C. Murphy⁷, Ian O. Ellis⁵, Arnie Purushotham⁶, Arnie Purushotham⁸, Anne Lise Børresen-Dale⁹, Anne Lise Børresen-Dale⁴, James D. Brenton, Simon Tavaré, Carlos Caldas, Samuel Aparicio³ - Show less +32 more•Institutions (9)

University of Southern California¹, University of Cambridge², University of British Columbia³, Oslo University Hospital⁴, University of Nottingham⁵, King's College London⁶, University of Manitoba⁷, Guy's and St Thomas' NHS Foundation Trust⁸, University of Oslo⁹

21 Jun 2012-Nature

TL;DR: The results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome, and identify novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort.

...read moreread less

Abstract: The elucidation of breast cancer subgroups and their molecular drivers requires integrated views of the genome and transcriptome from representative numbers of patients. We present an integrated analysis of copy number and gene expression in a discovery and validation set of 997 and 995 primary breast tumours, respectively, with long-term clinical follow-up. Inherited variants (copy number variants and single nucleotide polymorphisms) and acquired somatic copy number aberrations (CNAs) were associated with expression in 40% of genes, with the landscape dominated by cisand trans-acting CNAs. By delineating expression outlier genes driven in cis by CNAs, we identified putative cancer genes, including deletions in PPP2R2A, MTAP and MAP2K4. Unsupervised analysis of paired DNA–RNA profiles revealed novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort. These include a high-risk, oestrogen-receptor-positive 11q13/14 cis-acting subgroup and a favourable prognosis subgroup devoid of CNAs. Trans-acting aberration hotspots were found to modulate subgroup-specific gene networks, including a TCR deletion-mediated adaptive immune response in the ‘CNA-devoid’ subgroup and a basal-specific chromosome 5 deletion-associated mitotic network. Our results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome.

...read moreread less

4,722 citations

Journal Article•DOI•

The clonal and mutational evolution spectrum of primary triple-negative breast cancers

[...]

Sohrab P. Shah¹, Andrew Roth¹, Rodrigo Goya, Arusha Oloumi¹, Gavin Ha¹, Yongjun Zhao, Gulisa Turashvili¹, Jiarui Ding¹, Kane Tse, Gholamreza Haffari¹, Ali Bashashati¹, Leah M Prentice¹, Jaswinder Khattra¹, Angela Burleigh¹, Damian Yap¹, Virginie Bernard, Andrew McPherson¹, Karey Shumansky¹, Anamaria Crisan¹, Ryan Giuliany¹, Alireza Heravi-Moussavi¹, Jamie Rosner¹, Daniel Lai¹, Inanc Birol, Richard Varhol, Angela Tam, Noreen Dhalla, Thomas Zeng, Kevin C. Ma, Simon K. Chan, Malachi Griffith, Annie Moradian, S.-W. Grace Cheng, Gregg B. Morin¹, Peter H. Watson¹, Karen A. Gelmon², Stephen Chia², Suet-Feung Chin³, Christina Curtis⁴, Christina Curtis³, Oscar M. Rueda³, Paul D.P. Pharoah, Sambasivarao Damaraju⁵, John R. Mackey⁵, Kelly Hoon⁶, Timothy T. Harkins⁶, Vasisht Tadigotla⁶, Mahvash Sigaroudinia⁷, Philippe Gascard⁷, Thea D. Tlsty⁷, Joseph F. Costello⁷, Irmtraud M. Meyer¹, Connie J. Eaves², Wyeth W. Wasserman¹, Steven J.M. Jones¹, Steven J.M. Jones⁸, David G. Huntsman¹, David G. Huntsman², Martin Hirst¹, Carlos Caldas, Marco A. Marra¹, Samuel Aparicio¹ - Show less +58 more•Institutions (8)

University of British Columbia¹, BC Cancer Agency², University of Cambridge³, University of Southern California⁴, University of Alberta⁵, Life Technologies⁶, University of California, San Francisco⁷, Simon Fraser University⁸

21 Jun 2012-Nature

TL;DR: It is shown that understanding the biology and therapeutic responses of patients with TNBC will require the determination of individual tumour clonal genotypes, and for the first time in an epithelial tumour subtype, the relative abundance of clonal frequencies among cases representative of the population is determined.

...read moreread less

Abstract: Primary triple-negative breast cancers (TNBCs), a tumour type defined by lack of oestrogen receptor, progesterone receptor and ERBB2 gene amplification, represent approximately 16% of all breast cancers. Here we show in 104 TNBC cases that at the time of diagnosis these cancers exhibit a wide and continuous spectrum of genomic evolution, with some having only a handful of coding somatic aberrations in a few pathways, whereas others contain hundreds of coding somatic mutations. High-throughput RNA sequencing (RNA-seq) revealed that only approximately 36% of mutations are expressed. Using deep re-sequencing measurements of allelic abundance for 2,414 somatic mutations, we determine for the first time-to our knowledge-in an epithelial tumour subtype, the relative abundance of clonal frequencies among cases representative of the population. We show that TNBCs vary widely in their clonal frequencies at the time of diagnosis, with the basal subtype of TNBC showing more variation than non-basal TNBC. Although p53 (also known as TP53), PIK3CA and PTEN somatic mutations seem to be clonally dominant compared to other genes, in some tumours their clonal frequencies are incompatible with founder status. Mutations in cytoskeletal, cell shape and motility proteins occurred at lower clonal frequencies, suggesting that they occurred later during tumour progression. Taken together, our results show that understanding the biology and therapeutic responses of patients with TNBC will require the determination of individual tumour clonal genotypes.

...read moreread less

1,821 citations

Journal Article•DOI•

ARID1A mutations in endometriosis-associated ovarian carcinomas.

[...]

Kimberly C. Wiegand, Sohrab P. Shah¹, Osama M. Al-Agha¹, Yongjun Zhao, Kane Tse, Thomas Zeng, Janine Senz¹, Melissa K. McConechy¹, Michael S. Anglesio¹, Steve E. Kalloger¹, Winnie Yang¹, Alireza Heravi-Moussavi¹, Ryan Giuliany¹, Christine Chow, John Fee¹, Abdalnasser Zayed¹, Leah M Prentice¹, Nataliya Melnyk¹, Gulisa Turashvili¹, Allen Delaney, Jason Madore², Stephen Yip¹, Andrew McPherson¹, Gavin Ha¹, Lynda Bell¹, Sian Fereday³, Angela Tam, Laura Galletta³, Patricia N. Tonin⁴, Diane Provencher², Dianne Miller¹, Steven J.M. Jones, Richard A. Moore, Gregg B. Morin¹, Gregg B. Morin⁵, Arusha Oloumi¹, Niki Boyd¹, Samuel Aparicio¹, Ie Ming Shih, Anne Marie Mes-Masson², David D.L. Bowtell³, David D.L. Bowtell⁶, Martin Hirst, Blake Gilks¹, Marco A. Marra¹, Marco A. Marra⁵, David G. Huntsman¹ - Show less +43 more•Institutions (6)

University of British Columbia¹, Université de Montréal², Peter MacCallum Cancer Centre³, McGill University⁴, Simon Fraser University⁵, University of Melbourne⁶

14 Oct 2010-The New England Journal of Medicine

TL;DR: These data implicate ARID1A as a tumor-suppressor gene frequently disrupted in ovarian clear-cell and endometrioid carcinomas.

...read moreread less

Abstract: Background Ovarian clear-cell and endometrioid carcinomas may arise from endometriosis, but the molecular events involved in this transformation have not been described. Methods We sequenced the whole transcriptomes of 18 ovarian clear-cell carcinomas and 1 ovarian clear-cell carcinoma cell line and found somatic mutations in ARID1A (the AT-rich interactive domain 1A [SWI-like] gene) in 6 of the samples. ARID1A encodes BAF250a, a key component of the SWI–SNF chromatin remodeling complex. We sequenced ARID1A in an additional 210 ovarian carcinomas and a second ovarian clear-cell carcinoma cell line and measured BAF250a expression by means of immunohistochemical analysis in an additional 455 ovarian carcinomas. Results ARID1A mutations were seen in 55 of 119 ovarian clear-cell carcinomas (46%), 10 of 33 endometrioid carcinomas (30%), and none of the 76 high-grade serous ovarian carcinomas. Seventeen carcinomas had two somatic mutations each. Loss of the BAF250a protein correlated strongly with the ovarian c...

...read moreread less

1,485 citations

Journal Article•DOI•

Somatic mutations altering EZH2 (Tyr641) in follicular and diffuse large B-cell lymphomas of germinal-center origin

[...]

Ryan D. Morin¹, Nathalie A. Johnson¹, Tesa M. Severson¹, Andrew J. Mungall¹, Jianghong An¹, Rodrigo Goya¹, Paul Je¹, Merrill Boyle¹, Bruce Woolcock¹, Florian Kuchenbauer¹, Damian Yap¹, Humphries Rk¹, Obi L. Griffith¹, Sohrab P. Shah¹, Hao Zhu, Kimbara M, Shashkin P, Charlot Jf, Tcherpakov M, Richard Corbett¹, Angela K Y Tam¹, Richard Varhol¹, Duane E Smailus¹, Michelle Moksa¹, Yongjun Zhao¹, Allen Delaney¹, Hong Qian¹, Inanc Birol¹, Jacquie Schein¹, Richard A. Moore¹, Robert A. Holt¹, Douglas E. Horsman², Joseph M. Connors¹, Joseph M. Connors², Steven J.M. Jones¹, Samuel Aparicio¹, Martin Hirst¹, Randy D. Gascoyne², Marco A. Marra¹, Marco A. Marra² - Show less +36 more•Institutions (2)

BC Cancer Agency¹, University of British Columbia²

01 Feb 2010-Nature Genetics

TL;DR: Recurrent somatic mutations affecting the polycomb-group oncogene EZH2, which encodes a histone methyltransferase responsible for trimethylating Lys27 of histone H3 (H3K27), are reported, consistent with the notion that EZh2 proteins with mutant Tyr641 have reduced enzymatic activity in vitro.

...read moreread less

Abstract: Marco Marra and colleagues identify somatic mutations in EZH2 in diffuse large B-cell lymphomas and follicular lymphomas. EZH2 is a histone methyltransferase that participates in trimethylation of H3 Lys27 (H3K27) as part of the PRC2 complex. The mutations alter a single tyrosine residue in the SET domain of EZH2 and reduce the ability of PRC2 to trimethylate H3K27 in vitro.

...read moreread less

1,468 citations

Journal Article•DOI•

The somatic mutation profiles of 2,433 breast cancers refines their genomic and transcriptomic landscapes

[...]

Bernard Pereira¹, Suet-Feung Chin¹, Oscar M. Rueda¹, Hans Kristian Moen Vollan², Elena Provenzano¹, Helen Bardwell¹, Michelle Pugh, Linda Jones¹, Roslin Russell¹, Stephen John Sammut¹, Dana W.Y. Tsui¹, Bin Liu¹, Sarah-Jane Dawson³, Sarah-Jane Dawson¹, Jean Abraham¹, Helen Northen⁴, John F. Peden⁴, Abhik Mukherjee⁵, Gulisa Turashvili⁶, Andrew R. Green⁵, Steve McKinney⁷, Arusha Oloumi⁷, Sohrab P. Shah⁷, Nitzan Rosenfeld¹, Leigh C. Murphy, David R. Bentley⁴, Ian O. Ellis⁵, Arnie Purushotham⁸, Sarah E Pinder⁸, Anne Lise Børresen-Dale², Anne Lise Børresen-Dale⁹, Helena M. Earl¹, Paul D.P. Pharoah¹, Mark T. Ross⁴, Samuel Aparicio⁷, Carlos Caldas - Show less +32 more•Institutions (9)

University of Cambridge¹, Oslo University Hospital², Peter MacCallum Cancer Centre³, Illumina⁴, University of Nottingham⁵, Queen's University⁶, BC Cancer Research Centre⁷, Guy's and St Thomas' NHS Foundation Trust⁸, The Breast Cancer Research Foundation⁹

10 May 2016-Nature Communications

TL;DR: This study sequence 173 genes in 2,433 primary breast tumours that have copy number aberration, gene expression and long-term clinical follow-up data, and determines associations between mutations, driver CNA profiles, clinical-pathological parameters and survival.

...read moreread less

Abstract: The genomic landscape of breast cancer is complex, and inter- and intra-tumour heterogeneity are important challenges in treating the disease. In this study, we sequence 173 genes in 2,433 primary breast tumours that have copy number aberration (CNA), gene expression and long-term clinical follow-up data. We identify 40 mutation-driver (Mut-driver) genes, and determine associations between mutations, driver CNA profiles, clinical-pathological parameters and survival. We assess the clonal states of Mut-driver mutations, and estimate levels of intra-tumour heterogeneity using mutant-allele fractions. Associations between PIK3CA mutations and reduced survival are identified in three subgroups of ER-positive cancer (defined by amplification of 17q23, 11q13–14 or 8q24). High levels of intra-tumour heterogeneity are in general associated with a worse outcome, but highly aggressive tumours with 11q13–14 amplification have low levels of intra-tumour heterogeneity. These results emphasize the importance of genome-based stratification of breast cancer, and have important implications for designing therapeutic strategies.

...read moreread less

1,205 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36

Collapse

Cited by

PDF

Open Access

More filters

疟原虫var基因转换速率变化导致抗原变异[英]／Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A

[...]

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfPMP1）与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作�ly.

...read moreread less

Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1（PfPMP1）与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员，通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

...read moreread less

18,940 citations

Journal Article•DOI•

The blockade of immune checkpoints in cancer immunotherapy

[...]

Drew M. Pardoll¹•Institutions (1)

Johns Hopkins University School of Medicine¹

22 Mar 2012-Nature Reviews Cancer

TL;DR: Preliminary clinical findings with blockers of additional immune-checkpoint proteins, such as programmed cell death protein 1 (PD1), indicate broad and diverse opportunities to enhance antitumour immunity with the potential to produce durable clinical responses.

...read moreread less

Abstract: Immune checkpoints refer to the plethora of inhibitory pathways that are crucial to maintaining self-tolerance. Tumour cells induce immune checkpoints to evade immunosurveillance. This Review discusses the progress in targeting immune checkpoints, the considerations for combinatorial therapy and the potential for additional immune-checkpoint targets.

...read moreread less

10,602 citations

Pattern Recognition and Machine Learning

[...]

Christopher M. Bishop¹•Institutions (1)

Microsoft¹

01 Jan 2006

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

...read moreread less

10,141 citations

Journal Article•DOI•

Comprehensive molecular portraits of human breast tumours

[...]

Daniel C. Koboldt¹, Robert S. Fulton¹, Michael D. McLellan¹, Heather Schmidt¹ +352 more•Institutions (35)

04 Oct 2012-Nature

TL;DR: The ability to integrate information across platforms provided key insights into previously defined gene expression subtypes and demonstrated the existence of four main breast cancer classes when combining data from five platforms, each of which shows significant molecular heterogeneity.

...read moreread less

Abstract: We analysed primary breast cancers by genomic DNA copy number arrays, DNA methylation, exome sequencing, messenger RNA arrays, microRNA sequencing and reverse-phase protein arrays. Our ability to integrate information across platforms provided key insights into previously defined gene expression subtypes and demonstrated the existence of four main breast cancer classes when combining data from five platforms, each of which shows significant molecular heterogeneity. Somatic mutations in only three genes (TP53, PIK3CA and GATA3) occurred at >10% incidence across all breast cancers; however, there were numerous subtype-associated and novel gene mutations including the enrichment of specific mutations in GATA3, PIK3CA and MAP3K1 with the luminal A subtype. We identified two novel protein-expression-defined subgroups, possibly produced by stromal/microenvironmental elements, and integrated analyses identified specific signalling pathways dominant in each molecular subtype including a HER2/phosphorylated HER2/EGFR/phosphorylated EGFR signature within the HER2-enriched expression subtype. Comparison of basal-like breast tumours with high-grade serous ovarian tumours showed many molecular commonalities, indicating a related aetiology and similar therapeutic opportunities. The biological finding of the four main breast cancer subtypes caused by different subsets of genetic and epigenetic abnormalities raises the hypothesis that much of the clinically observable plasticity and heterogeneity occurs within, and not across, these major biological subtypes of breast cancer.

...read moreread less

9,355 citations

Journal Article•DOI•

StringTie enables improved reconstruction of a transcriptome from RNA-seq reads

[...]

Mihaela Pertea¹, Geo Pertea¹, Corina Antonescu¹, Tsung Cheng Chang², Joshua T. Mendell², Steven L. Salzberg¹ - Show less +2 more•Institutions (2)

Johns Hopkins University¹, University of Texas Southwestern Medical Center²

01 Mar 2015-Nature Biotechnology

TL;DR: StringTie, a computational method that applies a network flow algorithm originally developed in optimization theory, together with optional de novo assembly, to assemble these complex data sets into transcripts produces more complete and accurate reconstructions of genes and better estimates of expression levels.

...read moreread less

Abstract: Methods used to sequence the transcriptome often produce more than 200 million short sequences. We introduce StringTie, a computational method that applies a network flow algorithm originally developed in optimization theory, together with optional de novo assembly, to assemble these complex data sets into transcripts. When used to analyze both simulated and real data sets, StringTie produces more complete and accurate reconstructions of genes and better estimates of expression levels, compared with other leading transcript assembly programs including Cufflinks, IsoLasso, Scripture and Traph. For example, on 90 million reads from human blood, StringTie correctly assembled 10,990 transcripts, whereas the next best assembly was of 7,187 transcripts by Cufflinks, which is a 53% increase in transcripts assembled. On a simulated data set, StringTie correctly assembled 7,559 transcripts, which is 20% more than the 6,310 assembled by Cufflinks. As well as producing a more complete transcriptome assembly, StringTie runs faster on all data sets tested to date compared with other assembly software, including Cufflinks.

...read moreread less

6,594 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse