Home
/
Authors
/
Yao He

Author

Yao He

Bio: Yao He is an academic researcher from Peking University. The author has contributed to research in topics: Genome & Gene. The author has an hindex of 11, co-authored 13 publications receiving 1574 citations. Previous affiliations of Yao He include Tsinghua University.

Topics: Genome, Gene, Medicine, Genomics, Computational biology ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Pan-cancer analysis of whole genomes

[...]

Peter J. Campbell¹, Gad Getz², Jan O. Korbel³, Joshua M. Stuart⁴ +1329 more•Institutions (238)

06 Feb 2020-Nature

TL;DR: The flagship paper of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium describes the generation of the integrative analyses of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types, the structures for international data sharing and standardized analyses, and the main scientific findings from across the consortium studies.

...read moreread less

Abstract: Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale1,2,3. Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, facilitated by international data sharing using compute clouds. On average, cancer genomes contained 4–5 driver mutations when combining coding and non-coding genomic elements; however, in around 5% of cases no drivers were identified, suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which many clustered structural variants arise in a single catastrophic event, is frequently an early event in tumour evolution; in acral melanoma, for example, these events precede most somatic point mutations and affect several cancer-associated genes simultaneously. Cancers with abnormal telomere maintenance often originate from tissues with low replicative activity and show several mechanisms of preventing telomere attrition to critical levels. Common and rare germline variants affect patterns of somatic mutation, including point mutations, structural variants and somatic retrotransposition. A collection of papers from the PCAWG Consortium describes non-coding mutations that drive cancer beyond those in the TERT promoter4; identifies new signatures of mutational processes that cause base substitutions, small insertions and deletions and structural variation5,6; analyses timings and patterns of tumour evolution7; describes the diverse transcriptional consequences of somatic mutation on splicing, expression levels, fusion genes and promoter activity8,9; and evaluates a range of more-specialized features of cancer genomes8,10,11,12,13,14,15,16,17,18.

...read moreread less

1,600 citations

Journal Article•DOI•

Landscape and Dynamics of Single Immune Cells in Hepatocellular Carcinoma.

[...]

Qiming Zhang¹, Yao He¹, Nan Luo², Shashank J. Patel³, Yanjie Han¹, Ranran Gao¹, Madhura Modak³, Sebastian Carotta³, Christian Haslinger³, David Kind³, Gregory W. Peet³, Guojie Zhong¹, Shuangjia Lu¹, Weihua Zhu¹, Yilei Mao⁴, Mengmeng Xiao¹, Michael Bergmann⁵, Xueda Hu¹, Sid P. Kerkar³, Anne Vogt³, Stefan Pflanz³, Kang Liu³, Jirun Peng², Jirun Peng¹, Xianwen Ren¹, Zemin Zhang¹ - Show less +22 more•Institutions (5)

Peking University¹, Capital Medical University², Boehringer Ingelheim³, Peking Union Medical College Hospital⁴, Medical University of Vienna⁵

31 Oct 2019-Cell

TL;DR: The dynamic properties of diverse CD45+ cell types revealed by this study add new dimensions to the immune landscape of HCC.

...read moreread less

700 citations

Journal Article•DOI•

Single-Cell Analyses Inform Mechanisms of Myeloid-Targeted Therapies in Colon Cancer.

[...]

Lei Zhang¹, Ziyi Li¹, Katarzyna M. Skrzypczynska², Qiao Fang¹, Wei Zhang¹, Sarah A. O’Brien², Yao He¹, Lynn Wang², Qiming Zhang¹, Aeryon Kim², Ranran Gao¹, Jessica Orf², Tao Wang¹, Deepali V. Sawant², Jiajinlong Kang¹, Dev Bhatt², Daniel Lu², Chi-Ming Li², Aaron S. Rapaport², Kristy Perez², Yingjiang Ye¹, Shan Wang¹, Xueda Hu¹, Xianwen Ren¹, Wenjun Ouyang², Zhanlong Shen¹, Jackson G. Egen², Zemin Zhang¹, Xin Yu² - Show less +25 more•Institutions (2)

Peking University¹, Amgen²

16 Apr 2020-Cell

TL;DR: This comprehensive analysis of key myeloid subsets in human and mouse identifies critical cellular interactions regulating tumor immunity and defines mechanisms underlying myeloids-targeted immunotherapies currently undergoing clinical testing.

...read moreread less

549 citations

Journal Article•DOI•

Genomic basis for RNA alterations in cancer

[...]

Claudia Calabrese¹, Natalie R. Davidson, Deniz Demircioğlu², Deniz Demircioğlu³, Nuno A. Fonseca¹, Yao He⁴, André Kahles, Kjong-Van Lehmann, Fenglin Liu⁴, Yuichi Shiraishi⁵, Cameron M. Soulette⁶, Lara Urban¹, Liliana Greger¹, Siliang Li, Dongbing Liu, Marc D. Perry⁷, Marc D. Perry⁸, Qian Xiang⁷, Fan Zhang⁴, Junjun Zhang⁷, Peter Bailey⁹, Serap Erkek, Katherine A. Hoadley¹⁰, Yong Hou, Matthew R. Huska¹¹, Helena Kilpinen¹², Jan O. Korbel, Maximillian G. Marin⁶, Julia Markowski¹¹, Tannistha Nandi², Qiang Pan-Hammarström¹³, Chandra Sekhar Pedamallu¹⁴, Chandra Sekhar Pedamallu¹⁵, Reiner Siebert¹⁶, Stefan G. Stark, Hong Su, Patrick Tan³, Patrick Tan², Sebastian M. Waszak, Christina K. Yung⁷, Shida Zhu, Philip Awadalla¹⁷, Philip Awadalla⁷, Chad J. Creighton¹⁸, Matthew Meyerson¹⁴, Matthew Meyerson¹⁵, B. F. Francis Ouellette¹⁷, Kui Wu, Huanming Yang, Alvis Brazma¹, Angela N. Brooks¹⁴, Angela N. Brooks⁶, Angela N. Brooks¹⁵, Jonathan Göke², Gunnar Rätsch, Roland F. Schwarz, Oliver Stegle¹⁹, Oliver Stegle¹, Zemin Zhang⁴ - Show less +55 more•Institutions (19)

European Bioinformatics Institute¹, Genome Institute of Singapore², National University of Singapore³, Peking University⁴, University of Tokyo⁵, University of California, Santa Cruz⁶, Ontario Institute for Cancer Research⁷, University of California, San Francisco⁸, University of Glasgow⁹, University of North Carolina at Chapel Hill¹⁰, Max Delbrück Center for Molecular Medicine¹¹, University College London¹², Karolinska Institutet¹³, Broad Institute¹⁴, Harvard University¹⁵, University of Ulm¹⁶, University of Toronto¹⁷, Baylor College of Medicine¹⁸, German Cancer Research Center¹⁹

06 Feb 2020-Nature

TL;DR: The most comprehensive catalogue of cancer-associated gene alterations to date, obtained by characterizing tumour transcriptomes from 1,188 donors of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Gome Atlas (TCGA) was presented in this article.

...read moreread less

Abstract: Transcript alterations often result from somatic changes in cancer genomes1. Various forms of RNA alterations have been described in cancer, including overexpression2, altered splicing3 and gene fusions4; however, it is difficult to attribute these to underlying genomic changes owing to heterogeneity among patients and tumour types, and the relatively small cohorts of patients for whom samples have been analysed by both transcriptome and whole-genome sequencing. Here we present, to our knowledge, the most comprehensive catalogue of cancer-associated gene alterations to date, obtained by characterizing tumour transcriptomes from 1,188 donors of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA)5. Using matched whole-genome sequencing data, we associated several categories of RNA alterations with germline and somatic DNA alterations, and identified probable genetic mechanisms. Somatic copy-number alterations were the major drivers of variations in total gene and allele-specific expression. We identified 649 associations of somatic single-nucleotide variants with gene expression in cis, of which 68.4% involved associations with flanking non-coding regions of the gene. We found 1,900 splicing alterations associated with somatic mutations, including the formation of exons within introns in proximity to Alu elements. In addition, 82% of gene fusions were associated with structural variants, including 75 of a new class, termed 'bridged' fusions, in which a third genomic location bridges two genes. We observed transcriptomic alteration signatures that differ between cancer types and have associations with variations in DNA mutational signatures. This compendium of RNA alterations in the genomic context provides a rich resource for identifying genes and mechanisms that are functionally implicated in cancer.

...read moreread less

259 citations

Journal Article•DOI•

Direct Comparative Analyses of 10X Genomics Chromium and Smart-seq2.

[...]

Xiliang Wang¹, Yao He¹, Qiming Zhang¹, Xianwen Ren¹, Zemin Zhang¹ - Show less +1 more•Institutions (1)

Peking University¹

02 Mar 2021-Genomics, Proteomics & Bioinformatics

TL;DR: In this article, the authors compared the performance of droplet-based 10X Genomics Chromium (10X) and plate-based Smart-seq2 full-length methods.

...read moreread less

116 citations

1
2
3
4
…

Cited by

PDF

Open Access

More filters

Integrative analysis of 111 reference human epigenomes

[...]

Anshul Kundaje, Wouter Meuleman, Jason Ernst, Angela Yen, Pouya Kheradpour, Zhizhuo Zhang, Jianrong Wang, Lucas D. Ward, Abhishek Sarkar, Gerald Quon, Matthew L. Eaton, Yi-Chieh Wu, Andreas R. Pfenning, Xinchen Wang, Melina Claussnitzer, Yaping Liu, Mukul S. Bansal, Soheil Feizi-Khankandi, Ah Ram Kim, Richard C Sallari, Nicholas A Sinnott-Armstrong, Laurie A. Boyer, Elizabeta Gjoneska, Li-Huei Tsai, Manolis Kellis - Show less +21 more

01 Feb 2015

TL;DR: In this article, the authors describe the integrative analysis of 111 reference human epigenomes generated as part of the NIH Roadmap Epigenomics Consortium, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression.

...read moreread less

Abstract: The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.

...read moreread less

4,409 citations

Journal Article•DOI•

Visualizing and interpreting cancer genomics data via the Xena platform.

[...]

Mary Goldman¹, Brian Craft¹, Mim Hastie, Kristupas Repečka², Fran McDade, Akhil Kamath³, Ayan Banerjee⁴, Yunhai Luo⁵, Dave Rogers, Angela N. Brooks¹, Jingchun Zhu¹, David Haussler¹ - Show less +8 more•Institutions (5)

University of California, Santa Cruz¹, Vilnius University², Birla Institute of Technology and Science³, National Institute of Technology, Durgapur⁴, Stanford University⁵

22 May 2020-Nature Biotechnology

TL;DR: Xena’s Visual Spreadsheet visualization integrates gene-centric and genomic-coordinate-centric views across multiple data modalities, providing a deep, comprehensive view of genomic events within a cohort of tumors.

...read moreread less

Abstract: To the Editor — There is a great need for easy-to-use cancer genomics visualization tools for both large public data resources such as TCGA (The Cancer Genome Atlas)1 and the GDC (Genomic Data Commons)2, as well as smaller-scale datasets generated by individual labs. Commonly used interactive visualization tools are either web-based portals or desktop applications. Data portals have a dedicated back end and are a powerful means of viewing centrally hosted resource datasets (for example, Xena’s predecessor, the University of California, Santa Cruz (UCSC) Cancer Browser (currently retired3), cBioPortal4, ICGC (International Cancer Genomics Consortium) Data Portal5, GDC Data Portal2). However, researchers wishing to use a data portal to explore their own data have to either redeploy the entire platform, a difficult task even for bioinformaticians, or upload private data to a server outside the user’s control, a non-starter for protected patient data, such as germline variants (for example, MAGI (Mutation Annotation and Genome Interpretation6), WebMeV7 or Ordino8). Desktop tools can view a user’s own data securely (for example, Integrated Genomics Viewer (IGV)9, Gitools10), but lack well-maintained, prebuilt files for the ever-evolving and expanding public data resources. This dichotomy between data portals and desktop tools highlights the challenge of using a single platform for both large public data and smaller-scale datasets generated by individual labs. Complicating this dichotomy is the expanding amount, and complexity, of cancer genomics data resulting from numerous technological advances, including lower-cost high-throughput sequencing and single-cell-based technologies. Cancer genomics datasets are now being generated using new assays, such as whole-genome sequencing11, DNA methylation whole-genome bisulfite sequencing12 and ATAC-seq (Assay for Transposase-Accessible Chromatin using sequencing13). Visualizing and exploring these diverse data modalities is important but challenging, especially as many tools have traditionally specialized in only one or perhaps a few data types. And although these complex datasets generate insights individually, integration with other omics datasets is crucial to help researchers discover and validate findings. UCSC Xena was developed as a high-performance visualization and analysis tool for both large public repositories and private datasets. It was built to scale with the current and future data growth and complexity. Xena’s privacy-aware architecture enables cancer researchers of all computational backgrounds to explore large, diverse datasets. Researchers use the same system to securely explore their own data, together or separately from the public data, all the while keeping private data secure. The system easily supports many tens of thousands of samples and has been tested with up to a million cells. The simple and flexible architecture supports a variety of common and uncommon data types. Xena’s Visual Spreadsheet visualization integrates gene-centric and genomic-coordinate-centric views across multiple data modalities, providing a deep, comprehensive view of genomic events within a cohort of tumors. UCSC Xena (http://xena.ucsc.edu) has two components: the front end Xena Browser and the back end Xena Hubs (Fig. 1). The web-based Xena Browser empowers biologists to explore data across multiple Xena Hubs with a variety of visualizations and analyses. The back end Xena Hubs host genomics data from laptops, public servers, behind a firewall, or in the cloud, and can be public or private (Supplementary Fig. 1). The Xena Browser receives data simultaneously from multiple Xena Hubs and integrates them into a single coherent visualization within the browser. A private Xena Hub is a hub installed on a user’s own computer (Supplementary Fig. 2). It is configured to only respond to requests from the computer’s localhost network interface (that is, http://127.0.0.1). This ensures that the hub only communicates with the computer on which the hub is installed. A public hub is configured to respond to requests from external computers. There are two types of public Xena Hubs (Supplementary Fig. 2). The first type is an open-public hub, which is a public hub accessible by everyone. While we host several open-public hubs (Supplementary Table 1), users can also set up their own as a way to share data. An example of one is the Treehouse Hub set up by the Childhood Cancer Initiative to share pediatric cancer RNA-seq gene expression data (Supplementary Note). The second type W eb s er ve r

...read moreread less

1,644 citations

Journal Article•DOI•

Pan-cancer analysis of whole genomes

[...]

Peter J. Campbell¹, Gad Getz², Jan O. Korbel³, Joshua M. Stuart⁴ +1329 more•Institutions (238)

06 Feb 2020-Nature

...read moreread less

1,600 citations

Journal Article•DOI•

The Repertoire of Mutational Signatures in Human Cancer

[...]

Ludmil B. Alexandrov¹, Jaegil Kim², Nicholas J. Haradhvala², Nicholas J. Haradhvala³, Mi Ni Huang⁴, Alvin Wei Tian Ng⁴, Yang Wu⁴, Arnoud Boot⁴, Kyle R. Covington⁵, Dmitry A. Gordenin⁶, Erik N. Bergstrom¹, S M Ashiqul Islam¹, Nuria Lopez-Bigas⁷, Nuria Lopez-Bigas⁸, Leszek J. Klimczak⁶, John R. McPherson⁴, Sandro Morganella⁹, Radhakrishnan Sabarinathan¹⁰, Radhakrishnan Sabarinathan⁸, David A. Wheeler⁵, Ville Mustonen¹¹, Gad Getz, Steven G. Rozen⁴, Michael R. Stratton⁹ - Show less +20 more•Institutions (11)

University of California, San Diego¹, Broad Institute², Harvard University³, National University of Singapore⁴, Baylor College of Medicine⁵, National Institutes of Health⁶, Catalan Institution for Research and Advanced Studies⁷, Pompeu Fabra University⁸, Wellcome Trust Sanger Institute⁹, National Centre for Biological Sciences¹⁰, University of Helsinki¹¹

05 Feb 2020-Nature

TL;DR: The characterization of 4,645 whole-genome and 19,184 exome sequences, covering most types of cancer, identifies 81 single-base substitution, doublet- base substitution and small-insertion-and-deletion mutational signatures, providing a systematic overview of the mutational processes that contribute to cancer development.

...read moreread less

Abstract: Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature1. Here, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The substantial size of our dataset, compared with previous analyses3–15, enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associated—but distinct—DNA damage, repair and/or replication mechanisms. By estimating the contribution of each signature to the mutational catalogues of individual cancer genomes, we revealed associations of signatures to exogenous or endogenous exposures, as well as to defective DNA-maintenance processes. However, many signatures are of unknown cause. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer. The characterization of 4,645 whole-genome and 19,184 exome sequences, covering most types of cancer, identifies 81 single-base substitution, doublet-base substitution and small-insertion-and-deletion mutational signatures, providing a systematic overview of the mutational processes that contribute to cancer development.

...read moreread less

1,521 citations

Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq

[...]

Itay Tirosh, Benjamin Izar, Daniel J. Treacy, John J. Trombetta, Asaf Rotem, Christopher Rodman, Christine G. Lian, George F. Murphy, Mohammad Fallahi-Sichani, Ken Dutton-Regester, Jia-Ren Lin, Ofir Cohen, Parin Shah, Diana Lu, Alexandra-Chloé Villani, Aleksandr Andreev, E.M. Van Allen, Monica M. Bertagnolli, Peter K. Sorger, Ryan J. Sullivan, Keith T. Flaherty, Dennie T. Frederick, Judit Jané-Valbuena, Orit Rozenblatt-Rosen, Sanjay M. Prakadan, Marc H. Wadsworth, Alex S. Genshaft, Travis K. Hughes, Carly G. K. Ziegler, Samuel W. Kazer, Alethe Gaillard de Saint Germain, Kellie E. Kolb, Cory M. Johannessen, Clifford H. Yoon, Alex K. Shalek, Aviv Regev, Levi A. Garraway - Show less +33 more

01 Apr 2016

TL;DR: Tirosh et al. as discussed by the authors applied single-cell RNA sequencing (RNA-seq) to 4645 single cells isolated from 19 patients, profiling malignant, immune, stromal, and endothelial cells.

...read moreread less

Abstract: Single-cell expression profiles of melanoma Tumors harbor multiple cell types that are thought to play a role in the development of resistance to drug treatments. Tirosh et al. used single-cell sequencing to investigate the distribution of these differing genetic profiles within melanomas. Many cells harbored heterogeneous genetic programs that reflected two different states of genetic expression, one of which was linked to resistance development. Following drug treatment, the resistance-linked expression state was found at a much higher level. Furthermore, the environment of the melanoma cells affected their gene expression programs. Science, this issue p. 189 Melanoma cells show transcriptional heterogeneity. To explore the distinct genotypic and phenotypic states of melanoma tumors, we applied single-cell RNA sequencing (RNA-seq) to 4645 single cells isolated from 19 patients, profiling malignant, immune, stromal, and endothelial cells. Malignant cells within the same tumor displayed transcriptional heterogeneity associated with the cell cycle, spatial context, and a drug-resistance program. In particular, all tumors harbored malignant cells from two distinct transcriptional cell states, such that tumors characterized by high levels of the MITF transcription factor also contained cells with low MITF and elevated levels of the AXL kinase. Single-cell analyses suggested distinct tumor microenvironmental patterns, including cell-to-cell interactions. Analysis of tumor-infiltrating T cells revealed exhaustion programs, their connection to T cell activation and clonal expansion, and their variability across patients. Overall, we begin to unravel the cellular ecosystem of tumors and how single-cell genomics offers insights with implications for both targeted and immune therapies.

...read moreread less

823 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse