Author

Kai How Farh

Other affiliations: Broad Institute, University of Southern California

Bio: Kai How Farh is an academic researcher from Harvard University. The author has contributed to research on topics including copy-number variation and genome-wide association studies. The author has an h-index of 5 and has co-authored 5 publications receiving 6167 citations. Previous affiliations of Kai How Farh include Broad Institute and University of Southern California.

Papers

Journal Article

Integrative analysis of 111 reference human epigenomes

Anshul Kundaje¹, Wouter Meuleman¹,², Jason Ernst³, Misha Bilenky⁴, Angela Yen¹,², Alireza Heravi-Moussavi⁴, Pouya Kheradpour¹,², Zhizhuo Zhang¹,², Jianrong Wang¹,², Michael J. Ziller², Viren Amin⁵, John W. Whitaker, Matthew D. Schultz⁶, Lucas D. Ward¹,², Abhishek Sarkar¹,², Gerald Quon¹,², Richard Sandstrom⁷, Matthew L. Eaton¹,², Yi-Chieh Wu¹,², Andreas R. Pfenning¹,², Xinchen Wang¹,², Melina Claussnitzer¹,², Yaping Liu¹,², Cristian Coarfa⁵, R. Alan Harris⁵, Noam Shoresh², Charles B. Epstein², Elizabeta Gjoneska¹,², Danny Leung⁸, Wei Xie⁸, R. David Hawkins⁸, Ryan Lister⁶, Chibo Hong⁹, Philippe Gascard⁹, Andrew J. Mungall⁴, Richard A. Moore⁴, Eric Chuah⁴, Angela Tam⁴, Theresa K. Canfield⁷, R. Scott Hansen⁷, Rajinder Kaul⁷, Peter J. Sabo⁷, Mukul S. Bansal¹,²,¹⁰, Annaick Carles⁴, Jesse R. Dixon⁸, Kai How Farh², Soheil Feizi¹,², Rosa Karlic¹¹, Ah Ram Kim¹,², Ashwinikumar Kulkarni¹², Daofeng Li¹³, Rebecca F. Lowdon¹³, Ginell Elliott¹³, Tim R. Mercer¹⁴, Shane Neph⁷, Vitor Onuchic⁵, Paz Polak²,¹⁵, Nisha Rajagopal⁸, Pradipta R. Ray¹², Richard C Sallari¹,², Kyle Siebenthall⁷, Nicholas A Sinnott-Armstrong¹,², Michael Stevens¹³, Robert E. Thurman⁷, Jie Wu¹⁶, Bo Zhang¹³, Xin Zhou¹³, Arthur E. Beaudet⁵, Laurie A. Boyer¹, Philip L. De Jager²,¹⁵, Peggy J. Farnham¹⁷, Susan J. Fisher⁹, David Haussler¹⁸, Steven J.M. Jones⁴,¹⁹, Wei Li⁵, Marco A. Marra⁴, Michael T. McManus⁹, Shamil R. Sunyaev²,¹⁵, James A. Thomson²⁰, Thea D. Tlsty⁹, Li-Huei Tsai¹,², Wei Wang, Robert A. Waterland⁵, Michael Q. Zhang²¹, Lisa Helbling Chadwick²², Bradley E. Bernstein²,⁶,¹⁵, Joseph F. Costello⁹, Joseph R. Ecker¹¹, Martin Hirst⁴, Alexander Meissner², Aleksandar Milosavljevic⁵, Bing Ren⁸, John A. Stamatoyannopoulos⁷, Ting Wang¹³, Manolis Kellis¹,² +120 more•Institutions (22)

Massachusetts Institute of Technology¹, Broad Institute², University of California, Los Angeles³, University of British Columbia⁴, Baylor College of Medicine⁵, Howard Hughes Medical Institute⁶, University of Washington⁷, Ludwig Institute for Cancer Research⁸, University of California, San Francisco⁹, University of Connecticut¹⁰, University of Zagreb¹¹, University of Texas at Austin¹², Washington University in St. Louis¹³, University of Queensland¹⁴, Harvard University¹⁵, Cold Spring Harbor Laboratory¹⁶, University of Southern California¹⁷, University of California, Santa Cruz¹⁸, Simon Fraser University¹⁹, Morgridge Institute for Research²⁰, University of Texas at Dallas²¹, National Institutes of Health²²

19 Feb 2015-Nature

TL;DR: It is shown that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease.

Abstract: The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.

5,037 citations

Journal Article

Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores

Bjarni J. Vilhjálmsson¹, Jian Yang², Hilary K. Finucane³, Alexander Gusev⁴ +391 more•Institutions (14)

01 Oct 2015-American Journal of Human Genetics

TL;DR: LDpred is introduced, a method that infers the posterior mean effect size of each marker by using a prior on effect sizes and LD information from an external reference panel, and outperforms the approach of pruning followed by thresholding, particularly at large sample sizes.

Abstract: Polygenic risk scores have shown great promise in predicting complex disease risk and will become more accurate as training sample sizes increase. The standard approach for calculating risk scores involves linkage disequilibrium (LD)-based marker pruning and applying a p value threshold to association statistics, but this discards information and can reduce predictive accuracy. We introduce LDpred, a method that infers the posterior mean effect size of each marker by using a prior on effect sizes and LD information from an external reference panel. Theory and simulations show that LDpred outperforms the approach of pruning followed by thresholding, particularly at large sample sizes. Accordingly, predicted R² increased from 20.1% to 25.3% in a large schizophrenia dataset and from 9.8% to 12.0% in a large multiple sclerosis dataset. A similar relative improvement in accuracy was observed for three additional large disease datasets and for non-European schizophrenia samples. The advantage of LDpred over existing methods will grow as sample sizes increase.

1,088 citations
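
The abstract above contrasts LDpred with the standard approach of LD-based pruning followed by p-value thresholding. As a rough illustration of that baseline only (not of LDpred itself, and not the authors' code), the sketch below uses a greedy, p-value-ordered variant of pruning. The function names, the `r2_threshold` and `p_threshold` defaults, and the toy inputs are all assumptions made for this example.

```python
def prune_and_threshold(pvalues, ld_r2, p_threshold=0.05, r2_threshold=0.2):
    """Return indices of markers kept after thresholding and greedy LD pruning.

    pvalues: per-marker association p-values.
    ld_r2:   square matrix (list of lists) of pairwise LD r^2 values.
    Both thresholds are illustrative defaults, not values from the paper.
    """
    # Visit markers from most to least significant.
    order = sorted(range(len(pvalues)), key=lambda i: pvalues[i])
    kept = []
    for i in order:
        if pvalues[i] > p_threshold:
            break  # every remaining marker is even less significant
        # Keep a marker only if it is not in high LD with one already kept.
        if all(ld_r2[i][j] < r2_threshold for j in kept):
            kept.append(i)
    return kept


def risk_score(genotype, betas, kept):
    """Sum of effect size times allele count over the retained markers."""
    return sum(betas[i] * genotype[i] for i in kept)


# Toy data (hypothetical): markers 0 and 1 are in high LD with each other.
betas = [0.5, 0.4, -0.3, 0.1]
pvalues = [1e-8, 1e-6, 1e-4, 0.5]
ld_r2 = [
    [1.0, 0.9, 0.0, 0.0],
    [0.9, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]
kept = prune_and_threshold(pvalues, ld_r2)
score = risk_score([2, 1, 1, 0], betas, kept)
```

In this toy case marker 1 is dropped because it is in high LD with the more significant marker 0, and marker 3 is dropped by the p-value threshold. Discarding markers in this way is the loss of information that the abstract says LDpred is designed to avoid.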

Journal Article

Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects

Christian R. Marshall, Daniel P. Howrigan¹,², Daniele Merico +326 more•Institutions (98)

01 Jan 2017-Nature Genetics

TL;DR: In this article, a centralized analysis pipeline was applied to a SCZ cohort of 21,094 cases and 20,227 controls, and a global enrichment of copy number variants (CNVs) was observed in cases (odds ratio (OR) = 1.11, P = 5.7 × 10⁻¹⁵), which persisted after excluding loci implicated in previous studies.

Abstract: Copy number variants (CNVs) have been strongly implicated in the genetic etiology of schizophrenia (SCZ). However, genome-wide investigation of the contribution of CNV to risk has been hampered by limited sample sizes. We sought to address this obstacle by applying a centralized analysis pipeline to a SCZ cohort of 21,094 cases and 20,227 controls. A global enrichment of CNV burden was observed in cases (odds ratio (OR) = 1.11, P = 5.7 × 10⁻¹⁵), which persisted after excluding loci implicated in previous studies (OR = 1.07, P = 1.7 × 10⁻⁶). CNV burden was enriched for genes associated with synaptic function (OR = 1.68, P = 2.8 × 10⁻¹¹) and neurobehavioral phenotypes in mouse (OR = 1.18, P = 7.3 × 10⁻⁵). Genome-wide significant evidence was obtained for eight loci, including 1q21.1, 2p16.3 (NRXN1), 3q29, 7q11.2, 15q13.3, distal 16p11.2, proximal 16p11.2 and 22q11.2. Suggestive support was found for eight additional candidate susceptibility and protective loci, which consisted predominantly of CNVs mediated by nonallelic homologous recombination.

774 citations

Posted Content

A contribution of novel CNVs to schizophrenia from a genome-wide study of 41,321 subjects

Christian R. Marshall¹, Daniel P. Howrigan², Daniele Merico¹, Bhooma Thiruvahindrapuram¹ +252 more•Institutions (87)

23 Feb 2016-bioRxiv

TL;DR: A collaborative effort in which a centralized analysis pipeline is applied to a SCZ cohort, finding support at a suggestive level for nine additional candidate susceptibility and protective loci, which consist predominantly of CNVs mediated by non-allelic homologous recombination (NAHR).

Abstract: Genomic copy number variants (CNVs) have been strongly implicated in the etiology of schizophrenia (SCZ). However, apart from a small number of risk variants, elucidation of the CNV contribution to risk has been difficult due to the rarity of risk alleles, all occurring in less than 1% of cases. We sought to address this obstacle through a collaborative effort in which we applied a centralized analysis pipeline to a SCZ cohort of 21,094 cases and 20,227 controls. We observed a global enrichment of CNV burden in cases (OR = 1.11, P = 5.7e-15), which persisted after excluding loci implicated in previous studies (OR = 1.07, P = 1.7e-6). CNV burden is also enriched for genes associated with synaptic function (OR = 1.68, P = 2.8e-11) and neurobehavioral phenotypes in mouse (OR = 1.18, P = 7.3e-5). We identified genome-wide significant support for eight loci, including 1q21.1, 2p16.3 (NRXN1), 3q29, 7q11.2, 15q13.3, distal 16p11.2, proximal 16p11.2 and 22q11.2. We find support at a suggestive level for nine additional candidate susceptibility and protective loci, which consist predominantly of CNVs mediated by non-allelic homologous recombination (NAHR).

764 citations

Journal Article

Age at first birth in women is genetically associated with increased risk of schizophrenia

Guiyan Ni¹,², Jacob Gratten³, Naomi R. Wray³ +362 more•Institutions (106)

05 Jul 2018-Scientific Reports

TL;DR: The results suggest that early, and perhaps also late, age at first birth in women is associated with increased genetic risk for schizophrenia in the UK Biobank sample, offering new insights into the complex bio-social risk architecture underpinning the association between parental age and offspring mental health.

Abstract: Previous studies have shown an increased risk for mental health problems in children born to both younger and older parents compared to children of average-aged parents. We previously used a novel design to reveal a latent mechanism of genetic association between schizophrenia and age at first birth in women (AFB). Here, we use independent data from the UK Biobank (N = 38,892) to replicate the finding of an association between predicted genetic risk of schizophrenia and AFB in women, and to estimate the genetic correlation between schizophrenia and AFB in women stratified into younger and older groups. We find evidence for an association between predicted genetic risk of schizophrenia and AFB in women (P-value = 1.12E-05), and we show genetic heterogeneity between younger and older AFB groups (P-value = 3.45E-03). The genetic correlation between schizophrenia and AFB in the younger AFB group is -0.16 (SE = 0.04) while that between schizophrenia and AFB in the older AFB group is 0.14 (SE = 0.08). Our results suggest that early, and perhaps also late, age at first birth in women is associated with increased genetic risk for schizophrenia in the UK Biobank sample. These findings contribute new insights into factors contributing to the complex bio-social risk architecture underpinning the association between parental age and offspring mental health.

16 citations

Cited by

Journal Article

The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans

Kristin G. Ardlie, David S. DeLuca, Ayellet V. Segrè, Timothy J. Sullivan, Taylor Young, Ellen Gelfand, Casandra A. Trowbridge, Julian Maller, Taru Tukiainen, Monkol Lek, Lucas D. Ward, Pouya Kheradpour, Benjamin Iriarte, Yan Meng, Cameron D. Palmer, Tõnu Esko, Wendy Winckler, Joel N. Hirschhorn, Manolis Kellis, Daniel G. MacArthur, Gad Getz, Andrey A. Shabalin, Gen Li, Yi-Hui Zhou, Andrew B. Nobel, Ivan Rusyn, Fred A. Wright, Tuuli Lappalainen, Pedro G. Ferreira, Halit Ongen, Manuel A. Rivas, Alexis Battle, Sara Mostafavi, Jean Monlong, Michael Sammeth, Marta Melé, Ferran Reverter, Jakob M. Goldmann, Daphne Koller, Roderic Guigó, Mark I. McCarthy, Emmanouil T. Dermitzakis, Eric R. Gamazon, Hae Kyung Im, Anuar Konkashbaev, Dan L. Nicolae, Nancy J. Cox, Timothée Flutre, Xiaoquan Wen, Matthew Stephens, Jonathan K. Pritchard, Zhidong Tu, Bin Zhang, Tao Huang, Quan Long, Luan Lin, Jialiang Yang, Jun Zhu, Jun Liu, Amanda Brown, Bernadette Mestichelli, Denee Tidwell, Edmund Lo, Mike Salvatore, Saboor Shad, Jeffrey A. Thomas, John T. Lonsdale, Michael T. Moser, Bryan Gillard, Ellen Karasik, Kimberly Ramsey, Christopher Choi, Barbara A. Foster, John Syron, Johnell Fleming, Harold Magazine, Rick Hasz, Gary Walters, Jason Bridge, Mark Miklos, Susan L. Sullivan, Laura Barker, Heather M. Traino, Maghboeba Mosavel, Laura A. Siminoff, Dana R. Valley, Daniel C. Rohrer, Scott D. Jewell, Philip A. Branton, Leslie H. Sobin, Mary Barcus, Liqun Qi, Jeffrey McLean, Pushpa Hariharan, Ki Sung Um, Shenpei Wu, David Tabor, Charles Shive, Anna M. Smith, Stephen A. Buia, Anita H. Undale, Karna Robinson, Nancy Roche, Kimberly M. Valentino, Angela Britton, Robin Burges, Debra Bradbury, Kenneth W. Hambright, John Seleski, Greg E. Korzeniewski, Kenyon Erickson, Yvonne Marcus, Jorge Tejada, Mehran Taherian, Chunrong Lu, Margaret J. Basile, Deborah C. Mash, Simona Volpi, Jeffery P. Struewing, Gary F. Temple, Joy T. Boyer, Deborah Colantuoni, Roger Little, Susan E. Koester, Latarsha J. Carithers, Helen M. Moore, Ping Guan, Carolyn C. Compton, Sherilyn Sawyer, Joanne P. Demchok, Jimmie B. Vaught, Chana A. Rabiner, Nicole C. Lockhart +129 more

08 May 2015-Science

TL;DR: The landscape of gene expression across tissues is described, thousands of tissue-specific and shared regulatory expression quantitative trait loci (eQTL) variants are cataloged, complex network relationships are described, and signals from genome-wide association studies explained by eQTLs are identified.

Abstract: Understanding the functional consequences of genetic variation, and how it affects complex human disease and quantitative traits, remains a critical challenge for biomedicine. We present an analysi...

4,418 citations

Journal Article

deepTools2: a next generation web server for deep-sequencing data analysis

Fidel Ramírez¹, Devon Ryan¹, Björn Grüning², Vivek Bhardwaj¹, Fabian Kilpert¹, Andreas S. Richter¹, Steffen Heyne¹, Friederike Dündar³, Thomas Manke¹ +5 more•Institutions (3)

Max Planck Society¹, University of Freiburg², Cornell University³

08 Jul 2016-Nucleic Acids Research

TL;DR: An update to the Galaxy-based web server deepTools, which allows users to perform complete bioinformatic workflows ranging from quality controls and normalizations of aligned reads to integrative analyses, including clustering and visualization approaches, is presented.

Abstract: We present an update to our Galaxy-based web server for processing and visualizing deeply sequenced data. Its core tool set, deepTools, allows users to perform complete bioinformatic workflows ranging from quality controls and normalizations of aligned reads to integrative analyses, including clustering and visualization approaches. Since we first described our deepTools Galaxy server in 2014, we have implemented new solutions for many requests from the community and our users. Here, we introduce significant enhancements and new tools to further improve data visualization and interpretation. deepTools continue to be open to all users and freely available as a web service at deeptools.ie-freiburg.mpg.de. The new deepTools2 suite can be easily deployed within any Galaxy framework via the toolshed repository, and we also provide source code for command line usage under Linux and Mac OS X. A public and documented API for access to deepTools functionality is also available.

4,359 citations

Journal Article

Genetic effects on gene expression across human tissues.

Enhancing GTEx (eGTEx) groups¹, NIH Common Fund², NHGRI, Biospecimen Core Resource—VARI, ELSI study, Genome Browser Data Integration Visualization—EBI, Lead analysts, Alexis Battle³, Christopher D. Brown⁴, Barbara E. Engelhardt¹, Stephen B. Montgomery² +7 more•Institutions (4)

Princeton University¹, Stanford University², Johns Hopkins University³, University of Pennsylvania⁴

12 Oct 2017-Nature

TL;DR: It is found that local genetic variation affects gene expression levels for the majority of genes, and inter-chromosomal genetic effects for 93 genes and 112 loci are identified, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease.

Abstract: Characterization of the molecular function of the human genome and its variation across individuals is essential for identifying the cellular mechanisms that underlie human genetic traits and diseases. The Genotype-Tissue Expression (GTEx) project aims to characterize variation in gene expression levels across individuals and diverse tissues of the human body, many of which are not easily accessible. Here we describe genetic effects on gene expression levels across 44 human tissues. We find that local genetic variation affects gene expression levels for the majority of genes, and we further identify inter-chromosomal genetic effects for 93 genes and 112 loci. On the basis of the identified genetic effects, we characterize patterns of tissue specificity, compare local and distal effects, and evaluate the functional properties of the genetic effects. We also demonstrate that multi-tissue, multi-individual data can be used to identify genes and pathways affected by human disease-associated variation, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease.

3,289 citations

Journal Article

Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq

Itay Tirosh¹, Benjamin Izar¹,², Sanjay M. Prakadan, Marc H. Wadsworth, Daniel J. Treacy¹, John J. Trombetta¹, Asaf Rotem¹,², Christopher Rodman¹, Christine G. Lian³, George F. Murphy³, Mohammad Fallahi-Sichani², Ken Dutton-Regester¹,²,⁴, Jia-Ren Lin², Ofir Cohen¹, Parin Shah², Diana Lu¹, Alex S. Genshaft, Travis K. Hughes, Carly G. K. Ziegler, Samuel W. Kazer, Aleth Gaillard, Kellie E. Kolb, Alexandra-Chloé Villani¹, Cory M. Johannessen¹, Aleksandr Andreev¹, Eliezer M. Van Allen¹,², Monica M. Bertagnolli²,³, Peter K. Sorger², Ryan J. Sullivan², Keith T. Flaherty², Dennie T. Frederick², Judit Jané-Valbuena¹, Charles H. Yoon²,³, Orit Rozenblatt-Rosen¹, Alex K. Shalek, Aviv Regev¹,⁵,⁶, Levi A. Garraway +42 more•Institutions (6)

Broad Institute¹, Harvard University², Brigham and Women's Hospital³, QIMR Berghofer Medical Research Institute⁴, Massachusetts Institute of Technology⁵, Howard Hughes Medical Institute⁶

08 Apr 2016-Science

TL;DR: Single-cell RNA-seq of melanoma tumors begins to unravel the cellular ecosystem of tumors and shows how single-cell genomics offers insights with implications for both targeted and immune therapies.

Abstract: To explore the distinct genotypic and phenotypic states of melanoma tumors, we applied single-cell RNA sequencing (RNA-seq) to 4645 single cells isolated from 19 patients, profiling malignant, immune, stromal, and endothelial cells. Malignant cells within the same tumor displayed transcriptional heterogeneity associated with the cell cycle, spatial context, and a drug-resistance program. In particular, all tumors harbored malignant cells from two distinct transcriptional cell states, such that tumors characterized by high levels of the MITF transcription factor also contained cells with low MITF and elevated levels of the AXL kinase. Single-cell analyses suggested distinct tumor microenvironmental patterns, including cell-to-cell interactions. Analysis of tumor-infiltrating T cells revealed exhaustion programs, their connection to T cell activation and clonal expansion, and their variability across patients. Overall, we begin to unravel the cellular ecosystem of tumors and how single-cell genomics offers insights with implications for both targeted and immune therapies.

3,061 citations

Journal Article

10 Years of GWAS Discovery: Biology, Function, and Translation

Peter M. Visscher¹, Naomi R. Wray¹, Qian Zhang¹, Pamela Sklar², Mark I. McCarthy³, Matthew A. Brown⁴, Jian Yang¹ +3 more•Institutions (4)

University of Queensland¹, Icahn School of Medicine at Mount Sinai², Wellcome Trust Centre for Human Genetics³, Queensland University of Technology⁴

06 Jul 2017-American Journal of Human Genetics

TL;DR: The remarkable range of discoveries that GWASs have facilitated in population and complex-trait genetics, the biology of diseases, and translation toward new therapeutics is reviewed.

Abstract: Application of the experimental design of genome-wide association studies (GWASs) is now 10 years old (young), and here we review the remarkable range of discoveries it has facilitated in population and complex-trait genetics, the biology of diseases, and translation toward new therapeutics. We predict the likely discoveries in the next 10 years, when GWASs will be based on millions of samples with array data imputed to a large fully sequenced reference panel and on hundreds of thousands of samples with whole-genome sequencing data.

2,669 citations
