scispace - formally typeset
Search or ask a question
Author

Hongyu Zhao

Bio: Hongyu Zhao is an academic researcher from Yale University. The author has contributed to research in topics: Medicine & Genome-wide association study. The author has an hindex of 93, co-authored 710 publications receiving 33183 citations. Previous affiliations of Hongyu Zhao include University of North Carolina at Chapel Hill & Boston University.


Papers
More filters
Journal ArticleDOI
Luke Jostins1, Stephan Ripke2, Rinse K. Weersma3, Richard H. Duerr4, Dermot P.B. McGovern5, Ken Y. Hui6, James Lee7, L. Philip Schumm8, Yashoda Sharma6, Carl A. Anderson1, Jonah Essers9, Mitja Mitrovic3, Kaida Ning6, Isabelle Cleynen10, Emilie Theatre11, Sarah L. Spain12, Soumya Raychaudhuri9, Philippe Goyette13, Zhi Wei14, Clara Abraham6, Jean-Paul Achkar15, Tariq Ahmad16, Leila Amininejad17, Ashwin N. Ananthakrishnan9, Vibeke Andersen18, Jane M. Andrews19, Leonard Baidoo4, Tobias Balschun20, Peter A. Bampton21, Alain Bitton22, Gabrielle Boucher13, Stephan Brand23, Carsten Büning24, Ariella Cohain25, Sven Cichon26, Mauro D'Amato27, Dirk De Jong3, Kathy L Devaney9, Marla Dubinsky5, Cathryn Edwards28, David Ellinghaus20, Lynnette R. Ferguson29, Denis Franchimont17, Karin Fransen3, Richard B. Gearry30, Michel Georges11, Christian Gieger, Jürgen Glas22, Talin Haritunians5, Ailsa Hart31, Christopher J. Hawkey32, Matija Hedl6, Xinli Hu9, Tom H. Karlsen33, Limas Kupčinskas34, Subra Kugathasan35, Anna Latiano36, Debby Laukens37, Ian C. Lawrance38, Charlie W. Lees39, Edouard Louis11, Gillian Mahy40, John C. Mansfield41, Angharad R. Morgan29, Craig Mowat42, William G. Newman43, Orazio Palmieri36, Cyriel Y. Ponsioen44, Uroš Potočnik45, Natalie J. Prescott6, Miguel Regueiro4, Jerome I. Rotter5, Richard K Russell46, Jeremy D. Sanderson47, Miquel Sans, Jack Satsangi39, Stefan Schreiber20, Lisa A. Simms48, Jurgita Sventoraityte34, Stephan R. Targan, Kent D. Taylor5, Mark Tremelling49, Hein W. Verspaget50, Martine De Vos37, Cisca Wijmenga3, David C. Wilson39, Juliane Winkelmann51, Ramnik J. Xavier9, Sebastian Zeissig20, Bin Zhang25, Clarence K. Zhang6, Hongyu Zhao6, Mark S. Silverberg52, Vito Annese, Hakon Hakonarson53, Steven R. Brant54, Graham L. Radford-Smith55, Christopher G. Mathew12, John D. Rioux13, Eric E. Schadt25, Mark J. Daly2, Andre Franke20, Miles Parkes7, Severine Vermeire10, Jeffrey C. Barrett1, Judy H. Cho6 
Wellcome Trust Sanger Institute1, Broad Institute2, University of Groningen3, University of Pittsburgh4, Cedars-Sinai Medical Center5, Yale University6, University of Cambridge7, University of Chicago8, Harvard University9, Katholieke Universiteit Leuven10, University of Liège11, King's College London12, Université de Montréal13, New Jersey Institute of Technology14, Cleveland Clinic15, Peninsula College of Medicine and Dentistry16, Université libre de Bruxelles17, Aarhus University18, University of Adelaide19, University of Kiel20, Flinders University21, McGill University22, Ludwig Maximilian University of Munich23, Charité24, Icahn School of Medicine at Mount Sinai25, University of Bonn26, Karolinska Institutet27, Torbay Hospital28, University of Auckland29, Christchurch Hospital30, Imperial College London31, Queen's University32, University of Oslo33, Lithuanian University of Health Sciences34, Emory University35, Casa Sollievo della Sofferenza36, Ghent University37, University of Western Australia38, University of Edinburgh39, Queensland Health40, Newcastle University41, University of Dundee42, University of Manchester43, University of Amsterdam44, University of Maribor45, Royal Hospital for Sick Children46, Guy's and St Thomas' NHS Foundation Trust47, QIMR Berghofer Medical Research Institute48, Norfolk and Norwich University Hospital49, Leiden University50, Technische Universität München51, University of Toronto52, University of Pennsylvania53, Johns Hopkins University54, University of Queensland55
01 Nov 2012-Nature
TL;DR: A meta-analysis of Crohn’s disease and ulcerative colitis genome-wide association scans is undertaken, followed by extensive validation of significant findings, with a combined total of more than 75,000 cases and controls.
Abstract: Crohn's disease and ulcerative colitis, the two common forms of inflammatory bowel disease (IBD), affect over 2.5 million people of European ancestry, with rising prevalence in other populations. Genome-wide association studies and subsequent meta-analyses of these two diseases as separate phenotypes have implicated previously unsuspected mechanisms, such as autophagy, in their pathogenesis and showed that some IBD loci are shared with other inflammatory diseases. Here we expand on the knowledge of relevant pathways by undertaking a meta-analysis of Crohn's disease and ulcerative colitis genome-wide association scans, followed by extensive validation of significant findings, with a combined total of more than 75,000 cases and controls. We identify 71 new associations, for a total of 163 IBD loci, that meet genome-wide significance thresholds. Most loci contribute to both phenotypes, and both directional (consistently favouring one allele over the course of human history) and balancing (favouring the retention of both alleles within populations) selection effects are evident. Many IBD loci are also implicated in other immune-mediated disorders, most notably with ankylosing spondylitis and psoriasis. We also observe considerable overlap between susceptibility loci for IBD and mycobacterial infection. Gene co-expression network analysis emphasizes this relationship, with pathways shared between host responses to mycobacteria and those predisposing to IBD.

4,094 citations

Journal ArticleDOI
TL;DR: It is demonstrated that in vivo association of HY5 with promoter targets is not altered under distinct light qualities or during light-to-dark transition, and a model in which HY5 is a high hierarchical regulator of the transcriptional cascades for photomorphogenesis is supported.
Abstract: The transcription factor LONG HYPOCOTYL5 (HY5) acts downstream of multiple families of the photoreceptors and promotes photomorphogenesis. Although it is well accepted that HY5 acts to regulate target gene expression, in vivo binding of HY5 to any of its target gene promoters has yet to be demonstrated. Here, we used a chromatin immunoprecipitation procedure to verify suspected in vivo HY5 binding sites. We demonstrated that in vivo association of HY5 with promoter targets is not altered under distinct light qualities or during light-to-dark transition. Coupled with DNA chip hybridization using a high-density 60-nucleotide oligomer microarray that contains one probe for every 500 nucleotides over the entire Arabidopsis thaliana genome, we mapped genome-wide in vivo HY5 binding sites. This analysis showed that HY5 binds preferentially to promoter regions in vivo and revealed >3000 chromosomal sites as putative HY5 binding targets. HY5 binding targets tend to be enriched in the early light-responsive genes and transcription factor genes. Our data thus support a model in which HY5 is a high hierarchical regulator of the transcriptional cascades for photomorphogenesis.

812 citations

Journal ArticleDOI
TL;DR: Using comparative genomics, genetics and biochemistry, subjects with mutations proven or inferred to be functional are identified, and many rare alleles that alter renal salt handling in blood pressure variation in the general population are implicate.
Abstract: The effects of alleles in many genes are believed to contribute to common complex diseases such as hypertension. Whether risk alleles comprise a small number of common variants or many rare independent mutations at trait loci is largely unknown. We screened members of the Framingham Heart Study (FHS) for variation in three genes-SLC12A3 (NCCT), SLC12A1 (NKCC2) and KCNJ1 (ROMK)-causing rare recessive diseases featuring large reductions in blood pressure. Using comparative genomics, genetics and biochemistry, we identified subjects with mutations proven or inferred to be functional. These mutations, all heterozygous and rare, produce clinically significant blood pressure reduction and protect from development of hypertension. Our findings implicate many rare alleles that alter renal salt handling in blood pressure variation in the general population, and identify alleles with health benefit that are nonetheless under purifying selection. These findings have implications for the genetic architecture of hypertension and other common complex traits.

784 citations

Journal ArticleDOI
13 Jun 2013-Nature
TL;DR: Comparing the incidence of de novo mutations in severe CHD cases and controls by analysing exome sequencing of parent–offspring trios suggests that several hundreds of genes collectively contribute to approximately 10% of severeCHD.
Abstract: Exome sequencing of patients with congenital heart disease (CHD) and their unaffected parents reveals an excess of strong-effect, protein-altering de novo mutations in genes expressed in the developing heart, many of which regulate chromatin modification in key developmental genes; collectively, these mutations are predicted to account for approximately 10% of severe CHD cases. This paper demonstrates that de novo mutations with large effect have a role in the pathogenesis of at least 10% of cases of congenital heart disease (CHD). Using exome sequence analysis in parent–offspring trios Richard Lifton and colleagues compared the frequency of de novo mutations, identified by exome sequencing, in 362 CHD parent–offspring trios and 264 control trios. Gene ontology analysis demonstrated significant enrichment of de novo protein-altering mutation of genes involved in chromatin modification, notably a marked enrichment of genes involved in the production, removal and reading of methylation of histone H3K4 and H3K27. Congenital heart disease (CHD) is the most frequent birth defect, affecting 0.8% of live births1. Many cases occur sporadically and impair reproductive fitness, suggesting a role for de novo mutations. Here we compare the incidence of de novo mutations in 362 severe CHD cases and 264 controls by analysing exome sequencing of parent–offspring trios. CHD cases show a significant excess of protein-altering de novo mutations in genes expressed in the developing heart, with an odds ratio of 7.5 for damaging (premature termination, frameshift, splice site) mutations. Similar odds ratios are seen across the main classes of severe CHD. We find a marked excess of de novo mutations in genes involved in the production, removal or reading of histone 3 lysine 4 (H3K4) methylation, or ubiquitination of H2BK120, which is required for H3K4 methylation2,3,4. There are also two de novo mutations in SMAD2, which regulates H3K27 methylation in the embryonic left–right organizer5. The combination of both activating (H3K4 methylation) and inactivating (H3K27 methylation) chromatin marks characterizes ‘poised’ promoters and enhancers, which regulate expression of key developmental genes6. These findings implicate de novo point mutations in several hundreds of genes that collectively contribute to approximately 10% of severe CHD.

778 citations

Journal ArticleDOI
TL;DR: A study of the prevalence, awareness, treatment, and control of hypertension in China and assessed their variations across many subpopulations, finding that among Chinese adults aged 35-75 years, nearly half have hypertension, fewer than a third are being treated, and fewer than one in twelve are in control of their blood pressure.

695 citations


Cited by
More filters
Journal ArticleDOI
Eric S. Lander1, Lauren Linton1, Bruce W. Birren1, Chad Nusbaum1  +245 moreInstitutions (29)
15 Feb 2001-Nature
TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.
Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations

Journal ArticleDOI
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

13,246 citations

Christopher M. Bishop1
01 Jan 2006
TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

Journal Article
TL;DR: For the next few weeks the course is going to be exploring a field that’s actually older than classical population genetics, although the approach it’ll be taking to it involves the use of population genetic machinery.
Abstract: So far in this course we have dealt entirely with the evolution of characters that are controlled by simple Mendelian inheritance at a single locus. There are notes on the course website about gametic disequilibrium and how allele frequencies change at two loci simultaneously, but we didn’t discuss them. In every example we’ve considered we’ve imagined that we could understand something about evolution by examining the evolution of a single gene. That’s the domain of classical population genetics. For the next few weeks we’re going to be exploring a field that’s actually older than classical population genetics, although the approach we’ll be taking to it involves the use of population genetic machinery. If you know a little about the history of evolutionary biology, you may know that after the rediscovery of Mendel’s work in 1900 there was a heated debate between the “biometricians” (e.g., Galton and Pearson) and the “Mendelians” (e.g., de Vries, Correns, Bateson, and Morgan). Biometricians asserted that the really important variation in evolution didn’t follow Mendelian rules. Height, weight, skin color, and similar traits seemed to

9,847 citations

Journal ArticleDOI
Stephan Ripke1, Stephan Ripke2, Benjamin M. Neale1, Benjamin M. Neale2  +351 moreInstitutions (102)
24 Jul 2014-Nature
TL;DR: Associations at DRD2 and several genes involved in glutamatergic neurotransmission highlight molecules of known and potential therapeutic relevance to schizophrenia, and are consistent with leading pathophysiological hypotheses.
Abstract: Schizophrenia is a highly heritable disorder. Genetic risk is conferred by a large number of alleles, including common alleles of small effect that might be detected by genome-wide association studies. Here we report a multi-stage schizophrenia genome-wide association study of up to 36,989 cases and 113,075 controls. We identify 128 independent associations spanning 108 conservatively defined loci that meet genome-wide significance, 83 of which have not been previously reported. Associations were enriched among genes expressed in brain, providing biological plausibility for the findings. Many findings have the potential to provide entirely new insights into aetiology, but associations at DRD2 and several genes involved in glutamatergic neurotransmission highlight molecules of known and potential therapeutic relevance to schizophrenia, and are consistent with leading pathophysiological hypotheses. Independent of genes expressed in brain, associations were enriched among genes expressed in tissues that have important roles in immunity, providing support for the speculated link between the immune system and schizophrenia.

6,809 citations