Author

Hanlee P. Ji

Other affiliations: Arizona State University, University of Washington, VA Palo Alto Healthcare System

Bio: Hanlee P. Ji is an academic researcher from Stanford University. The author has contributed to research on the topics Genome and Cancer, has an h-index of 38, and has co-authored 155 publications that have received 12,020 citations. Previous affiliations of Hanlee P. Ji include Arizona State University and the University of Washington.

Topics: Genome, Cancer, Genomics, Biology, Exome sequencing

Papers published on a yearly basis

[Chart: papers per year; years shown: 1993, 1997-1999, 2001, 2006-2023]

Papers

Journal Article

Next-generation DNA sequencing.

Jay Shendure¹, Hanlee P. Ji²

University of Washington¹, Stanford University²

01 Oct 2008-Nature Biotechnology

TL;DR: Next-generation DNA sequencing has the potential to dramatically accelerate biological and biomedical research, by enabling the comprehensive analysis of genomes, transcriptomes and interactomes to become inexpensive, routine and widespread, rather than requiring significant production-scale efforts.

Abstract: DNA sequence represents a single format onto which a broad range of biological phenomena can be projected for high-throughput data collection. Over the past three years, massively parallel DNA sequencing platforms have become widely available, reducing the cost of DNA sequencing by over two orders of magnitude, and democratizing the field by putting the sequencing capacity of a major genome center in the hands of individual investigators. These new technologies are rapidly evolving, and near-term challenges include the development of robust protocols for generating sequencing libraries, building effective new approaches to data-analysis, and often a rethinking of experimental design. Next-generation DNA sequencing has the potential to dramatically accelerate biological and biomedical research, by enabling the comprehensive analysis of genomes, transcriptomes and interactomes to become inexpensive, routine and widespread, rather than requiring significant production-scale efforts.

4,458 citations

Journal Article

The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements

Leming Shi¹, Laura H. Reid, Wendell D. Jones, Richard Shippy², Janet A. Warrington³, Shawn C. Baker⁴, Patrick J. Collins⁵, Francoise de Longueville, Ernest S. Kawasaki⁶, Kathleen Y. Lee⁷, Yuling Luo, Yongming Andrew Sun⁷, James C. Willey⁸, Robert Setterquist⁷, Gavin M. Fischer⁹, Weida Tong¹, Yvonne P. Dragan¹, David J. Dix¹⁰, Felix W. Frueh¹, Federico Goodsaid¹, Damir Herman⁶, Roderick V. Jensen¹¹, Charles D. Johnson, Edward K. Lobenhofer¹², Raj K. Puri¹, Uwe Scherf¹, Jean Thierry-Mieg⁶, Charles Wang¹³, Michael A Wilson⁷, Paul K. Wolber⁵, Lu Zhang⁷, William Slikker¹, Shashi Amur¹, Wenjun Bao¹⁴, Catalin Barbacioru⁷, Anne Bergstrom Lucas⁵, Vincent Bertholet, Cecilie Boysen, Bud Bromley, Donna Brown, Alan Brunner², Roger D. Canales⁷, Xiaoxi Megan Cao, Thomas A. Cebula¹, James J. Chen¹, Jing Cheng, Tzu Ming Chu¹⁴, Eugene Chudin⁴, John F. Corson⁵, J. Christopher Corton¹⁰, Lisa J. Croner¹⁵, Christopher Davies³, Timothy Davison, Glenda C. Delenstarr⁵, Xutao Deng¹³, David Dorris⁷, Aron Charles Eklund¹¹, Xiaohui Fan¹, Hong Fang, Stephanie Fulmer-Smentek⁵, James C. Fuscoe¹, Kathryn Gallagher¹⁰, Weigong Ge¹, Lei Guo¹, Xu Guo³, Janet Hager¹⁶, Paul K. Haje, Jing Han¹, Tao Han¹, Heather Harbottle¹, Stephen C. Harris¹, Eli Hatchwell¹⁷, Craig A. Hauser¹⁸, Susan D. Hester¹⁰, Huixiao Hong, Patrick Hurban¹², Scott A. Jackson¹, Hanlee P. Ji¹⁹, Charles R. Knight, Winston Patrick Kuo²⁰, J. Eugene LeClerc¹, Shawn Levy²¹, Quan Zhen Li, Chunmei Liu³, Ying Liu²², Michael Lombardi¹¹, Yunqing Ma, Scott R. Magnuson, Botoul Maqsodi, Timothy K. McDaniel³, Nan Mei¹, Ola Myklebost²³, Baitang Ning¹, Natalia Novoradovskaya⁹, Michael S. Orr¹, Terry Osborn, Adam Papallo¹¹, Tucker A. Patterson¹, Roger Perkins, Elizabeth Herness Peters, Ron L. Peterson²⁴, Kenneth L. Philips¹², P. Scott Pine¹, Lajos Pusztai²⁵, Feng Qian, Hongzu Ren¹⁰, Mitch Rosen¹⁰, Barry A. Rosenzweig¹, Raymond R. Samaha⁷, Mark Schena, Gary P. Schroth, Svetlana Shchegrova⁵, Dave D. Smith²⁶, Frank Staedtler²⁴, Zhenqiang Su¹, Hongmei Sun, Zoltan Szallasi²⁰, Zivana Tezak¹, Danielle Thierry-Mieg⁶, Karol L. Thompson¹, Irina Tikhonova¹⁶, Yaron Turpaz³, Beena Vallanat¹⁰, Christophe Van, Stephen J. Walker²⁷, Sue Jane Wang¹, Yonghong Wang⁶, Russell D. Wolfinger¹⁴, Alexander Wong⁵, Jie Wu, Chunlin Xiao⁷, Qian Xie, Jun Xu¹³, Wen Yang, Liang Zhang, Sheng Zhong²⁸, Yaping Zong

Food and Drug Administration¹, GE Healthcare², Thermo Fisher Scientific³, Illumina⁴, Agilent Technologies⁵, National Institutes of Health⁶, Applied Biosystems⁷, University of Toledo⁸, Stratagene⁹, United States Environmental Protection Agency¹⁰, University of Massachusetts Boston¹¹, Clinical Data, Inc¹², University of California, Los Angeles¹³, SAS Institute¹⁴, Biogen Idec¹⁵, Yale University¹⁶, Cold Spring Harbor Laboratory¹⁷, Discovery Institute¹⁸, Stanford University¹⁹, Harvard University²⁰, Vanderbilt University²¹, University of Texas at Dallas²², University of Oslo²³, Novartis²⁴, University of Texas MD Anderson Cancer Center²⁵, Luminex Corporation²⁶, Wake Forest University²⁷, University of Illinois at Urbana–Champaign²⁸

01 Sep 2006-Nature Biotechnology

TL;DR: This study describes the experimental design and probe mapping efforts behind the MicroArray Quality Control project and shows intraplatform consistency across test sites as well as a high level of interplatform concordance in terms of genes identified as differentially expressed.

Abstract: Over the last decade, the introduction of microarray technology has had a profound impact on gene expression research. The publication of studies with dissimilar or altogether contradictory results, obtained using different microarray platforms to analyze identical RNA samples, has raised concerns about the reliability of this technology. The MicroArray Quality Control (MAQC) project was initiated to address these concerns, as well as other performance and data analysis issues. Expression data on four titration pools from two distinct reference RNA samples were generated at multiple test sites using a variety of microarray-based and alternative technology platforms. Here we describe the experimental design and probe mapping efforts behind the MAQC project. We show intraplatform consistency across test sites as well as a high level of interplatform concordance in terms of genes identified as differentially expressed. This study provides a resource that represents an important first step toward establishing a framework for the use of microarrays in clinical and regulatory settings.

1,987 citations

Journal Article

Haplotyping germline and cancer genomes with high-throughput linked-read sequencing

Grace X.Y. Zheng, Billy T. Lau¹, Michael Schnall-Levin, Mirna Jarosz, John Bell¹, Christopher Hindson, Sofia Kyriazopoulou-Panagiotopoulou, Donald A. Masquelier, Landon Merrill, Jessica M. Terry, Patrice A Mudivarti, Paul Wyatt, Rajiv Bharadwaj, Anthony J. Makarewicz, Yuan Li, Phillip Belgrader, Andrew D. Price, Adam Lowe, Patrick Marks, Gerard M Vurens, Paul Hardenbol, Luz Montesclaros, Melissa Luo, Lawrence Greenfield, Alexander Wong, David E Birch, Steven W Short, Keith Bjornson, Pranav Patel, Erik S. Hopmans¹, Christina Wood¹, Sukhvinder Kaur, Glenn K. Lockwood, David Stafford, Joshua Delaney, Indira Wu, Heather Ordonez, Susan M. Grimes¹, Stephanie Greer¹, Josephine Y Lee, Kamila Belhocine, Kristina Giorda, William Haynes Heaton, Geoffrey P. McDermott, Zachary Bent, Francesca Meschi, Nikola O Kondov, Ryan Wilson, Jorge Bernate, Shawn Gauby, Alex Kindwall, Clara Bermejo, Adrian Fehr, Adrian Chan, Serge Saxonov, Kevin D. Ness, Benjamin J. Hindson, Hanlee P. Ji¹

Stanford University¹

01 Mar 2016-Nature Biotechnology

TL;DR: A microfluidics-based, linked-read sequencing technology that can phase and haplotype germline and cancer genomes using nanograms of input DNA is presented.

Abstract: Haplotyping of human chromosomes is a prerequisite for cataloguing the full repertoire of genetic variation. We present a microfluidics-based, linked-read sequencing technology that can phase and haplotype germline and cancer genomes using nanograms of input DNA. This high-throughput platform prepares barcoded libraries for short-read sequencing and computationally reconstructs long-range haplotype and structural variant information. We generate haplotype blocks in a nuclear trio that are concordant with expected inheritance patterns and phase a set of structural variants. We also resolve the structure of the EML4-ALK gene fusion in the NCI-H2228 cancer cell line using phased exome sequencing. Finally, we assign genetic aberrations to specific megabase-scale haplotypes generated from whole-genome sequencing of a primary colorectal adenocarcinoma. This approach resolves haplotype information using up to 100 times less genomic DNA than some methods and enables the accurate detection of structural variants.

686 citations

Journal Article

Pan-cancer analysis of the extent and consequences of intratumor heterogeneity

Noemi Andor¹, Trevor A. Graham², Marnix Jansen², Li C. Xia¹, C. Athena Aktipis³,⁴, Claudia Petritsch⁴, Hanlee P. Ji¹,³, Carlo C. Maley³,⁴

Stanford University¹, Queen Mary University of London², Arizona State University³, University of California, San Francisco⁴

01 Jan 2016-Nature Medicine

TL;DR: Intratumor heterogeneity and genomic instability have the potential to be useful measures that can universally be applied to all cancers.

Abstract: Intratumor heterogeneity (ITH) drives neoplastic progression and therapeutic resistance. We used the bioinformatics tools 'expanding ploidy and allele frequency on nested subpopulations' (EXPANDS) and PyClone to detect clones that are present at a ≥10% frequency in 1,165 exome sequences from tumors in The Cancer Genome Atlas. 86% of tumors across 12 cancer types had at least two clones. ITH in the morphology of nuclei was associated with genetic ITH (Spearman's correlation coefficient, ρ = 0.24-0.41; P [...] >2 clones coexisted in the same tumor sample (HR = 1.49, 95% CI: 1.20-1.87). In two independent data sets, copy-number alterations affecting either [...] 75% of a tumor's genome predicted reduced risk (HR = 0.15, 95% CI: 0.08-0.29). Mortality risk also declined when >4 clones coexisted in the sample, suggesting a trade-off between the costs and benefits of genomic instability. ITH and genomic instability thus have the potential to be useful measures that can universally be applied to all cancers.

630 citations

Journal Article

Oncogenic transformation of diverse gastrointestinal tissues in primary organoid culture

Xingnan Li¹, Lincoln Nadauld¹, Akifumi Ootani¹, David C Corney¹, Reetesh K. Pai¹, Olivier Gevaert¹, Michael A. Cantrell¹, Paul G. Rack¹, James T. Neal¹, Carol W.M. Chan¹, Trevor M. Yeung¹, Xue Gong¹, Jenny Yuan¹, Julie Wilhelmy¹, Sylvie Robine², Laura D. Attardi¹, Sylvia K. Plevritis¹, Kenneth E. Hung³, Chang-Zheng Chen¹, Hanlee P. Ji¹, Calvin J. Kuo¹

Stanford University¹, Curie Institute², Tufts University³

01 Jul 2014-Nature Medicine

TL;DR: The general utility of a highly tractable primary organoid system for cancer modeling and driver oncogene validation in diverse gastrointestinal tissues is demonstrated.

Abstract: The application of primary organoid cultures containing epithelial and mesenchymal elements to cancer modeling holds promise for combining the accurate multilineage differentiation and physiology of in vivo systems with the facile in vitro manipulation of transformed cell lines. Here we used a single air-liquid interface culture method without modification to engineer oncogenic mutations into primary epithelial and mesenchymal organoids from mouse colon, stomach and pancreas. Pancreatic and gastric organoids exhibited dysplasia as a result of expression of Kras carrying the G12D mutation (Kras(G12D)), p53 loss or both and readily generated adenocarcinoma after in vivo transplantation. In contrast, primary colon organoids required combinatorial Apc, p53, Kras(G12D) and Smad4 mutations for progressive transformation to invasive adenocarcinoma-like histology in vitro and tumorigenicity in vivo, recapitulating multi-hit models of colorectal cancer (CRC), as compared to the more promiscuous transformation of small intestinal organoids. Colon organoid culture functionally validated the microRNA miR-483 as a dominant driver oncogene at the IGF2 (insulin-like growth factor-2) 11p15.5 CRC amplicon, inducing dysplasia in vitro and tumorigenicity in vivo. These studies demonstrate the general utility of a highly tractable primary organoid system for cancer modeling and driver oncogene validation in diverse gastrointestinal tissues.

334 citations

Cited by

Journal Article

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.
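
As a rough illustration of the map/reduce split the abstract describes for a coverage calculator, the sketch below maps each read to per-position counts and then reduces those partial counts into one table. This is not the GATK API (GATK is a Java framework); the function names, the `(chrom, start, end)` read tuples, and the half-open coordinates are all assumptions made for this example.

```python
from collections import Counter
from functools import reduce


def map_read(read):
    """Map step: one read -> a partial coverage table.

    `read` is a hypothetical (chrom, start, end) tuple with a half-open
    interval, so positions start .. end-1 are covered.
    """
    chrom, start, end = read
    return Counter({(chrom, pos): 1 for pos in range(start, end)})


def reduce_coverage(acc, partial):
    """Reduce step: fold one partial table into the running total."""
    acc.update(partial)  # Counter.update adds counts rather than replacing them
    return acc


def coverage(reads):
    """Per-position depth over all reads, written as map followed by reduce."""
    return reduce(reduce_coverage, map(map_read, reads), Counter())


if __name__ == "__main__":
    depth = coverage([("chr1", 0, 3), ("chr1", 2, 5)])
    print(depth[("chr1", 2)])  # position 2 is covered by both reads
```

Because the map step looks at one read at a time and the reduce step only merges tables, the same two functions could in principle be run over separate chunks of reads and merged afterwards, which is the kind of separation between analysis logic and data management that the abstract emphasizes.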

20,557 citations

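Changes in the switching rate of Plasmodium var genes lead to antigenic variation [English] / Paul H, Robert P, Christodoulou Z, et al // Proc Natl Acad Sci U S A

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfEMP1 interacts with one or more receptors on infected erythrocytes, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion.

Abstract: Antigenic variation allows many pathogenic microorganisms to evade the host immune response. Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1), expressed on the surface of infected erythrocytes, interacts with one or more receptors on infected erythrocytes, endothelial cells, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion. The var gene family encodes about 60 members per haploid genome, and initiating transcription of different var gene variants provides the molecular basis for antigenic variation.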

18,940 citations

Journal Article

RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome

Bo Li¹, Colin N. Dewey¹

University of Wisconsin-Madison¹

04 Aug 2011-BMC Bioinformatics

TL;DR: It is shown that accurate gene-level abundance estimates are best obtained with large numbers of short single-end reads, and estimates of the relative frequencies of isoforms within single genes may be improved through the use of paired-end reads, depending on the number of possible splice forms for each gene.

Abstract: RNA-Seq is revolutionizing the way transcript abundances are measured. A key challenge in transcript quantification from RNA-Seq data is the handling of reads that map to multiple genes or isoforms. This issue is particularly important for quantification with de novo transcriptome assemblies in the absence of sequenced genomes, as it is difficult to determine which transcripts are isoforms of the same gene. A second significant issue is the design of RNA-Seq experiments, in terms of the number of reads, read length, and whether reads come from one or both ends of cDNA fragments. We present RSEM, a user-friendly software package for quantifying gene and isoform abundances from single-end or paired-end RNA-Seq data. RSEM outputs abundance estimates, 95% credibility intervals, and visualization files and can also simulate RNA-Seq data. In contrast to other existing tools, the software does not require a reference genome. Thus, in combination with a de novo transcriptome assembler, RSEM enables accurate transcript quantification for species without sequenced genomes. On simulated and real data sets, RSEM has superior or comparable performance to quantification methods that rely on a reference genome. Taking advantage of RSEM's ability to effectively use ambiguously-mapping reads, we show that accurate gene-level abundance estimates are best obtained with large numbers of short single-end reads. On the other hand, estimates of the relative frequencies of isoforms within single genes may be improved through the use of paired-end reads, depending on the number of possible splice forms for each gene. RSEM is an accurate and user-friendly software tool for quantifying transcript abundances from RNA-Seq data. As it does not rely on the existence of a reference genome, it is particularly useful for quantification with de novo transcriptome assemblies. In addition, RSEM has enabled valuable guidance for cost-efficient design of quantification experiments with RNA-Seq, which is currently relatively expensive.

14,524 citations

Journal Article

featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features

Yang Liao¹, Gordon K. Smyth¹, Wei Shi¹

Walter and Eliza Hall Institute of Medical Research¹

01 Apr 2014-Bioinformatics

TL;DR: featureCounts is a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments; it implements highly efficient chromosome hashing and feature blocking techniques.

Abstract: MOTIVATION: Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature. RESULTS: We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works with either single or paired-end reads and provides a wide range of options appropriate for different sequencing applications. AVAILABILITY AND IMPLEMENTATION: featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.
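
To make "read summarization" concrete, here is a minimal sketch that counts aligned reads per genomic feature. It is a toy under stated assumptions, not featureCounts itself: the tuple layouts, the half-open coordinates, and the rule of skipping reads that overlap more than one feature are choices made for this example, and the per-chromosome dictionary only gestures at the idea of narrowing the search by chromosome rather than reproducing the program's hashing and blocking techniques.

```python
def summarize_reads(reads, features):
    """Count reads per feature (toy read summarization).

    reads:    iterable of (chrom, start, end), half-open intervals (assumed layout)
    features: iterable of (name, chrom, start, end), half-open intervals (assumed layout)
    Reads overlapping zero features or more than one feature are not counted.
    """
    features = list(features)

    # Group features by chromosome so each read is compared only
    # against features on its own chromosome.
    by_chrom = {}
    for name, chrom, start, end in features:
        by_chrom.setdefault(chrom, []).append((start, end, name))

    counts = {name: 0 for name, _chrom, _start, _end in features}
    for chrom, r_start, r_end in reads:
        hits = [
            name
            for f_start, f_end, name in by_chrom.get(chrom, [])
            if r_start < f_end and f_start < r_end  # half-open overlap test
        ]
        if len(hits) == 1:  # unambiguous assignment only
            counts[hits[0]] += 1
    return counts


if __name__ == "__main__":
    features = [("geneA", "chr1", 100, 200), ("geneB", "chr1", 150, 300)]
    reads = [("chr1", 90, 120), ("chr1", 160, 180), ("chr1", 250, 260)]
    print(summarize_reads(reads, features))
```

The linear scan over a chromosome's features is quadratic in the worst case; a real tool would use an indexed structure, which is where the efficiency claims in the abstract come from.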

14,103 citations

Journal Article

DnaSP v5

Pablo Librado¹, Julio Rozas¹

University of Barcelona¹

01 Jun 2009-Bioinformatics

TL;DR: Version 5 implements a number of new features and analytical methods allowing extensive DNA polymorphism analyses on large datasets, including visualizing sliding window results integrated with available genome annotations in the UCSC browser.

Abstract: Motivation: DnaSP is a software package for a comprehensive analysis of DNA polymorphism data. Version 5 implements a number of new features and analytical methods allowing extensive DNA polymorphism analyses on large datasets. Among other features, the newly implemented methods allow for: (i) analyses on multiple data files; (ii) haplotype phasing; (iii) analyses on insertion/deletion polymorphism data; (iv) visualizing sliding window results integrated with available genome annotations in the UCSC browser. Availability: Freely available to academic users from: http://www.ub.edu/dnasp

13,511 citations
