Genomic Landscape of Non-Small Cell Lung Cancer in Smokers and Never-Smokers

doi:10.1016/J.CELL.2012.08.024

Journal Article•DOI•

Ramaswamy Govindan¹, Li Ding¹, Malachi Griffith¹, Janakiraman Subramanian¹, Nathan D. Dees¹, Krishna L. Kanchi¹, Christopher G. Maher¹, Robert S. Fulton¹, Lucinda Fulton¹, John W. Wallis¹, Ken Chen², Jason Walker¹, Sandra McDonald¹, Ron Bose¹, David M. Ornitz¹, Dong Hai Xiong³, Ming You³, David J. Dooling¹, Mark A. Watson¹, Elaine R. Mardis¹, Richard K. Wilson¹

Washington University in St. Louis¹, University of Texas MD Anderson Cancer Center², Medical College of Wisconsin³

14 Sep 2012-Cell (Cell Press)-Vol. 150, Iss: 6, pp 1121-1134

TL;DR: Cell-cycle and JAK-STAT pathways are significantly altered in lung cancer, along with perturbations in 54 genes that are potentially targetable with currently available drugs, including ROS1 and ALK, as well as novel metabolic enzymes.

About: This article was published in Cell on 2012-09-14 and is currently open access. It has received 1021 citations to date. The article focuses on the topics: Cancer & KRAS.

Citations

Journal Article•DOI•

Cancer Genome Landscapes

Bert Vogelstein¹, Nickolas Papadopoulos¹, Victor E. Velculescu¹, Shibin Zhou¹, Luis A. Diaz¹, Kenneth W. Kinzler¹

Howard Hughes Medical Institute¹

29 Mar 2013-Science

TL;DR: Comprehensive sequencing has revealed the genomic landscapes of common forms of human cancer, which consist of a small number of “mountains” (genes altered in a high percentage of tumors) and a much larger number of “hills” (genes altered infrequently).

Abstract: Over the past decade, comprehensive sequencing efforts have revealed the genomic landscapes of common forms of human cancer. For most cancer types, this landscape consists of a small number of “mountains” (genes altered in a high percentage of tumors) and a much larger number of “hills” (genes altered infrequently). To date, these studies have revealed ~140 genes that, when altered by intragenic mutations, can promote or “drive” tumorigenesis. A typical tumor contains two to eight of these “driver gene” mutations; the remaining mutations are passengers that confer no selective growth advantage. Driver genes can be classified into 12 signaling pathways that regulate three core cellular processes: cell fate, cell survival, and genome maintenance. A better understanding of these pathways is one of the most pressing needs in basic cancer research. Even now, however, our knowledge of cancer genomes is sufficient to guide the development of more effective approaches for reducing cancer morbidity and mortality.

6,441 citations

Journal Article•DOI•

Mutational landscape determines sensitivity to PD-1 blockade in non–small cell lung cancer

Naiyer A. Rizvi¹,², Matthew D. Hellmann¹,², Alexandra Snyder¹,², Pia Kvistborg³, Vladimir Makarov¹, Jonathan J. Havel¹, William Lee¹, Jianda Yuan¹, Phillip Wong¹, Teresa S. Ho¹, Martin L. Miller¹, Natasha Rekhtman¹, Andre L. Moreira¹, Fawzia Ibrahim¹, Cameron Bruggeman⁴, Billel Gasmi¹, Roberta Zappasodi¹, Yuka Maeda¹, Chris Sander¹, Edward B. Garon⁵, Taha Merghoub¹, Jedd D. Wolchok¹,², Ton N. Schumacher³, Timothy A. Chan¹,²

Memorial Sloan Kettering Cancer Center¹, Cornell University², Netherlands Cancer Institute³, Columbia University⁴, University of California, Los Angeles⁵

03 Apr 2015-Science

TL;DR: Treatment efficacy was associated with a higher number of mutations in the tumors, and a tumor-specific T cell response paralleled tumor regression in one patient, suggesting that the genomic landscape of lung cancers shapes response to anti–PD-1 therapy.

Abstract: Immune checkpoint inhibitors, which unleash a patient’s own T cells to kill tumors, are revolutionizing cancer treatment. To unravel the genomic determinants of response to this therapy, we used whole-exome sequencing of non–small cell lung cancers treated with pembrolizumab, an antibody targeting programmed cell death-1 (PD-1). In two independent cohorts, higher nonsynonymous mutation burden in tumors was associated with improved objective response, durable clinical benefit, and progression-free survival. Efficacy also correlated with the molecular smoking signature, higher neoantigen burden, and DNA repair pathway mutations; each factor was also associated with mutation burden. In one responder, neoantigen-specific CD8+ T cell responses paralleled tumor regression, suggesting that anti–PD-1 therapy enhances neoantigen-specific T cell reactivity. Our results suggest that the genomic landscape of lung cancers shapes response to anti–PD-1 therapy.

6,215 citations

Journal Article•DOI•

Comprehensive molecular profiling of lung adenocarcinoma: The cancer genome atlas research network

Eric A. Collisson¹, Joshua D. Campbell², Angela N. Brooks²,³ +315 more•Institutions (41)

01 Jan 2014-Nature

TL;DR: In this paper, the authors report molecular profiling of 230 resected lung adenocarcinomas using messenger RNA, microRNA and DNA sequencing integrated with copy number, methylation and proteomic analyses.

Abstract: Adenocarcinoma of the lung is the leading cause of cancer death worldwide. Here we report molecular profiling of 230 resected lung adenocarcinomas using messenger RNA, microRNA and DNA sequencing integrated with copy number, methylation and proteomic analyses. High rates of somatic mutation were seen (mean 8.9 mutations per megabase). Eighteen genes were statistically significantly mutated, including RIT1 activating mutations and newly described loss-of-function MGA mutations which are mutually exclusive with focal MYC amplification. EGFR mutations were more frequent in female patients, whereas mutations in RBM10 were more common in males. Aberrations in NF1, MET, ERBB2 and RIT1 occurred in 13% of cases and were enriched in samples otherwise lacking an activated oncogene, suggesting a driver role for these events in certain tumours. DNA and mRNA sequence from the same tumour highlighted splicing alterations driven by somatic genomic changes, including exon 14 skipping in MET mRNA in 4% of cases. MAPK and PI(3)K pathway activity, when measured at the protein level, was explained by known mutations in only a fraction of cases, suggesting additional, unexplained mechanisms of pathway activation. These data establish a foundation for classification and further investigations of lung adenocarcinoma molecular pathogenesis.

4,104 citations

Journal Article•DOI•

Integrated genomic characterization of endometrial carcinoma

Gad Getz¹, Stacey Gabriel¹, Kristian Cibulskis¹, Eric S. Lander¹ +280 more•Institutions (31)

02 May 2013-Nature

TL;DR: In this paper, the authors performed an integrated genomic, transcriptomic and proteomic characterization of 373 endometrial carcinomas using array- and sequencing-based technologies, and classified them into four categories: POLE ultramutated, microsatellite instability hypermutated, copy-number low, and copy-number high.

Abstract: We performed an integrated genomic, transcriptomic and proteomic characterization of 373 endometrial carcinomas using array- and sequencing-based technologies. Uterine serous tumours and ∼25% of high-grade endometrioid tumours had extensive copy number alterations, few DNA methylation changes, low oestrogen receptor/progesterone receptor levels, and frequent TP53 mutations. Most endometrioid tumours had few copy number alterations or TP53 mutations, but frequent mutations in PTEN, CTNNB1, PIK3CA, ARID1A and KRAS and novel mutations in the SWI/SNF chromatin remodelling complex gene ARID5B. A subset of endometrioid tumours that we identified had a markedly increased transversion mutation frequency and newly identified hotspot mutations in POLE. Our results classified endometrial cancers into four categories: POLE ultramutated, microsatellite instability hypermutated, copy-number low, and copy-number high. Uterine serous carcinomas share genomic features with ovarian serous and basal-like breast carcinomas. We demonstrated that the genomic features of endometrial carcinomas permit a reclassification that may affect post-surgical adjuvant treatment for women with aggressive tumours.

3,719 citations

Comprehensive molecular profiling of lung adenocarcinoma

Eric S. Lander

01 Jul 2014

TL;DR: High rates of somatic mutation were seen, including RIT1 activating mutations and newly described loss-of-function MGA mutations which are mutually exclusive with focal MYC amplification, and MAPK and PI(3)K pathway activity was explained by known mutations in only a fraction of cases, suggesting additional, unexplained mechanisms of pathway activation.

Abstract: Adenocarcinoma of the lung is the leading cause of cancer death worldwide. Here we report molecular profiling of 230 resected lung adenocarcinomas using messenger RNA, microRNA and DNA sequencing integrated with copy number, methylation and proteomic analyses. High rates of somatic mutation were seen (mean 8.9 mutations per megabase). Eighteen genes were statistically significantly mutated, including RIT1 activating mutations and newly described loss-of-function MGA mutations which are mutually exclusive with focal MYC amplification. EGFR mutations were more frequent in female patients, whereas mutations in RBM10 were more common in males. Aberrations in NF1, MET, ERBB2 and RIT1 occurred in 13% of cases and were enriched in samples otherwise lacking an activated oncogene, suggesting a driver role for these events in certain tumours. DNA and mRNA sequence from the same tumour highlighted splicing alterations driven by somatic genomic changes, including exon 14 skipping in MET mRNA in 4% of cases. MAPK and PI(3)K pathway activity, when measured at the protein level, was explained by known mutations in only a fraction of cases, suggesting additional, unexplained mechanisms of pathway activation. These data establish a foundation for classification and further investigations of lung adenocarcinoma molecular pathogenesis.

2,847 citations

References

Journal Article•DOI•

The Sequence Alignment/Map format and SAMtools

Heng Li¹, Bob Handsaker², Alec Wysoker², T. J. Fennell², Jue Ruan³, Nils Homer², Gabor T. Marth⁴, Gonçalo R. Abecasis², Richard Durbin¹

Wellcome Trust Sanger Institute¹, University of California, Los Angeles², Chinese Academy of Sciences³, Boston College⁴

01 Aug 2009-Bioinformatics

TL;DR: SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant calling and alignment viewing, and thus provides universal tools for processing read alignments.

Abstract: Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: [email protected]

45,957 citations

Journal Article•DOI•

KEGG: Kyoto Encyclopedia of Genes and Genomes

Minoru Kanehisa¹, Susumu Goto¹•Institutions (1)

Kyoto University¹

01 Jan 1999-Nucleic Acids Research

TL;DR: The Kyoto Encyclopedia of Genes and Genomes (KEGG) is a knowledge base for systematic analysis of gene functions in terms of the networks of genes and molecules.

Abstract: Kyoto Encyclopedia of Genes and Genomes (KEGG) is a knowledge base for systematic analysis of gene functions in terms of the networks of genes and molecules. The major component of KEGG is the PATHWAY database that consists of graphical diagrams of biochemical pathways including most of the known metabolic pathways and some of the known regulatory pathways. The pathway information is also represented by the ortholog group tables summarizing orthologous and paralogous gene groups among different organisms. KEGG maintains the GENES database for the gene catalogs of all organisms with complete genomes and selected organisms with partial genomes, which are continuously re-annotated, as well as the LIGAND database for chemical compounds and enzymes. Each gene catalog is associated with the graphical genome map for chromosomal locations that is represented by Java applet. In addition to the data collection efforts, KEGG develops and provides various computational tools, such as for reconstructing biochemical pathways from the complete genome sequence and for predicting gene regulatory networks from the gene expression profiles. The KEGG databases are daily updated and made freely available (http://www.genome.ad.jp/kegg/).

24,024 citations

Journal Article•DOI•

Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008.

Jacques Ferlay¹, Hai-Rim Shin¹, Freddie Bray¹, David Forman¹, Colin Mathers², Donald Maxwell Parkin³

International Agency for Research on Cancer¹, World Health Organization², University of Oxford³

15 Dec 2010-International Journal of Cancer

TL;DR: The results for 20 world regions are presented, summarizing the global patterns for the eight most common cancers, and striking differences in the patterns of cancer from region to region are observed.

Abstract: Estimates of the worldwide incidence and mortality from 27 cancers in 2008 have been prepared for 182 countries as part of the GLOBOCAN series published by the International Agency for Research on Cancer. In this article, we present the results for 20 world regions, summarizing the global patterns for the eight most common cancers. Overall, an estimated 12.7 million new cancer cases and 7.6 million cancer deaths occur in 2008, with 56% of new cancer cases and 63% of the cancer deaths occurring in the less developed regions of the world. The most commonly diagnosed cancers worldwide are lung (1.61 million, 12.7% of the total), breast (1.38 million, 10.9%) and colorectal cancers (1.23 million, 9.7%). The most common causes of cancer death are lung cancer (1.38 million, 18.2% of the total), stomach cancer (738,000 deaths, 9.7%) and liver cancer (696,000 deaths, 9.2%). Cancer is neither rare anywhere in the world, nor mainly confined to high-resource countries. Striking differences in the patterns of cancer from region to region are observed.

21,040 citations

Journal Article•DOI•

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

20,557 citations

"Genomic Landscape of Non-Small Cell..." cites this paper as background:

...Point mutations, small (<30 bp) indels, copy number alterations, and structural variants (SVs) were discovered by using various computational approaches (Chen et al., 2009; Larson et al., 2012; Li et al., 2009; McKenna et al., 2010; Ye et al., 2009)....

Journal Article•DOI•

TopHat: discovering splice junctions with RNA-Seq

Cole Trapnell¹, Lior Pachter¹, Steven L. Salzberg¹•Institutions (1)

University of Maryland, College Park¹

01 May 2009-Bioinformatics

TL;DR: The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer.

Abstract: Motivation: A new protocol for sequencing the messenger RNA in a cell, known as RNA-Seq, generates millions of short sequence fragments in a single run. These fragments, or ‘reads’, can be used to measure levels of gene expression and to identify novel splice variants of genes. However, current software for aligning RNA-Seq data to a genome relies on known splice junctions and cannot identify novel ones. TopHat is an efficient read-mapping algorithm designed to align reads from an RNA-Seq experiment to a reference genome without relying on known splice sites. Results: We mapped the RNA-Seq reads from a recent mammalian RNA-Seq experiment and recovered more than 72% of the splice junctions reported by the annotation-based software from that study, along with nearly 20 000 previously unreported junctions. The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer. We describe several challenges unique to ab initio splice site discovery from RNA-Seq reads that will require further algorithm development. Availability: TopHat is free, open-source software available from http://tophat.cbcb.umd.edu Contact: cole@cs.umd.edu Supplementary information: Supplementary data are available at Bioinformatics online.

11,473 citations