Author
Xia Yang
Other affiliations: Georgia State University, University of California, Vanderbilt University ...read more
Bio: Xia Yang is an academic researcher from University of California, Los Angeles. The author has contributed to research in topics: Genome-wide association study & Transcriptome. The author has an hindex of 42, co-authored 146 publications receiving 9230 citations. Previous affiliations of Xia Yang include Georgia State University & University of California.
Papers published on a yearly basis
Papers
More filters
••
TL;DR: It is shown that this approach can predict transcriptional responses to single gene–perturbation experiments using gene-expression data in the context of a segregating mouse population and the utility of this approach is demonstrated by identifying and experimentally validating the involvement of three new genes in susceptibility to obesity.
Abstract: A key goal of biomedical research is to elucidate the complex network of gene interactions underlying complex traits such as common human diseases. Here we detail a multistep procedure for identifying potential key drivers of complex traits that integrates DNA-variation and gene-expression data with other complex trait data in segregating mouse populations. Ordering gene expression traits relative to one another and relative to other complex traits is achieved by systematically testing whether variations in DNA that lead to variations in relative transcript abundances statistically support an independent, causative or reactive function relative to the complex traits under consideration. We show that this approach can predict transcriptional responses to single gene–perturbation experiments using gene-expression data in the context of a segregating mouse population. We also demonstrate the utility of this approach by identifying and experimentally validating the involvement of three new genes in susceptibility to obesity.
1,066 citations
••
TL;DR: This genome-wide association study of gene expression resulted in the detection of more than 6,000 associations between SNP genotypes and liver gene expression traits, where many of the corresponding genes identified have already been implicated in a number of human diseases.
Abstract: Genetic variants that are associated with common human diseases do not lead directly to disease, but instead act on intermediate, molecular phenotypes that in turn induce changes in higher-order disease traits. Therefore, identifying the molecular phenotypes that vary in response to changes in DNA and that also associate with changes in disease traits has the potential to provide the functional information required to not only identify and validate the susceptibility genes that are directly affected by changes in DNA, but also to understand the molecular networks in which such genes operate and how changes in these networks lead to changes in disease traits. Toward that end, we profiled more than 39,000 transcripts and we genotyped 782,476 unique single nucleotide polymorphisms (SNPs) in more than 400 human liver samples to characterize the genetic architecture of gene expression in the human liver, a metabolically active tissue that is important in a number of common human diseases, including obesity, diabetes, and atherosclerosis. This genome-wide association study of gene expression resulted in the detection of more than 6,000 associations between SNP genotypes and liver gene expression traits, where many of the corresponding genes identified have already been implicated in a number of human diseases. The utility of these data for elucidating the causes of common human diseases is demonstrated by integrating them with genotypic and expression data from other human and mouse populations. This provides much-needed functional support for the candidate susceptibility genes being identified at a growing number of genetic loci that have been identified as key drivers of disease from genome-wide association studies of disease. By using an integrative genomics approach, we highlight how the gene RPS26 and not ERBB3 is supported by our data as the most likely susceptibility gene for a novel type 1 diabetes locus recently identified in a large-scale, genome-wide association study. We also identify SORT1 and CELSR2 as candidate susceptibility genes for a locus recently associated with coronary artery disease and plasma low-density lipoprotein cholesterol levels in the process.
992 citations
••
TL;DR: Application of this method to liver and adipose gene expression data generated from a segregating mouse population results in the identification of a macrophage-enriched network supported as having a causal relationship with disease traits associated with metabolic syndrome.
Abstract: Identifying variations in DNA that increase susceptibility to disease is one of the primary aims of genetic studies using a forward genetics approach. However, identification of disease-susceptibility genes by means of such studies provides limited functional information on how genes lead to disease. In fact, in most cases there is an absence of functional information altogether, preventing a definitive identification of the susceptibility gene or genes. Here we develop an alternative to the classic forward genetics approach for dissecting complex disease traits where, instead of identifying susceptibility genes directly affected by variations in DNA, we identify gene networks that are perturbed by susceptibility loci and that in turn lead to disease. Application of this method to liver and adipose gene expression data generated from a segregating mouse population results in the identification of a macrophage-enriched network supported as having a causal relationship with disease traits associated with metabolic syndrome. Three genes in this network, lipoprotein lipase (Lpl), lactamase β (Lactb) and protein phosphatase 1-like (Ppm1l), are validated as previously unknown obesity genes, strengthening the association between this network and metabolic disease traits. Our analysis provides direct experimental support that complex traits such as obesity are emergent properties of molecular networks that are modulated by complex genetic loci and environmental factors. Complex human diseases result from the interplay of many genetic and environmental factors. To build up a picture of the factors contributing to one such disease, obesity, gene expression was evaluated as a quantitative trait in blood and adipose tissue samples from hundreds of Icelandic subjects aged 18 to 85. The results reveal a tendency to certain characteristic patterns of gene activation in the fatty tissues — though to a much lesser extent in the blood — of people with a higher body mass index. A transcriptional network constructed from the adipose tissue data has significant overlap with a network based on mouse adipose tissue data. Experimental support for the idea that complex diseases are emergent properties of molecular networks influenced by genes and environment comes from a study in mice. Mice were examined for disturbances in genetic expression networks that correlate with metabolic traits associated with obesity, diabetes and atherosclerosis. Three genes — Lpl, Lactb and Ppm1l — were identified as previously unknown obesity genes. This 'molecular network' approach raises the prospect that therapies might be directed at whole 'disease networks', rather than at one or two specific genes. Standard approaches to identify the genetic changes that lead to disease are reversed by examination of genetic networks for perturbations that are associated with disease states, and following up candidate genes from there. This begins with three genes in mice that lead to obesity when mutated, demonstrating that complex genetic–environmental traits can be dissected with this new approach.
886 citations
••
TL;DR: Genetic analyses provided evidence of the global regulation of subsets of the sexually dimorphic genes, as the transcript levels of a large number of these genes were controlled by several expression quantitative trait loci (eQTL) hotspots that exhibited tissue-specific control.
Abstract: We report a comprehensive analysis of gene expression differences between sexes in multiple somatic tissues of 334 mice derived from an intercross between inbred mouse strains C57BL/6J and C3H/HeJ. The analysis of a large number of individuals provided the power to detect relatively small differences in expression between sexes, and the use of an intercross allowed analysis of the genetic control of sexually dimorphic gene expression. Microarray analysis of 23,574 transcripts revealed that the extent of sexual dimorphism in gene expression was much greater than previously recognized. Thus, thousands of genes showed sexual dimorphism in liver, adipose, and muscle, and hundreds of genes were sexually dimorphic in brain. These genes exhibited highly tissue-specific patterns of expression and were enriched for distinct pathways represented in the Gene Ontology database. They also showed evidence of chromosomal enrichment, not only on the sex chromosomes, but also on several autosomes. Genetic analyses provided evidence of the global regulation of subsets of the sexually dimorphic genes, as the transcript levels of a large number of these genes were controlled by several expression quantitative trait loci (eQTL) hotspots that exhibited tissue-specific control. Moreover, many tissue-specific transcription factor binding sites were found to be enriched in the sexually dimorphic genes.
801 citations
••
TL;DR: This study provides direct evidence for a mechanistic link between genetic determinants and activity of Pon1 with systemic oxidative stress and prospective cardiovascular risk, indicating a potential mechanism for the atheroprotective function of PON1.
Abstract: Context Paraoxonase 1 (PON1) is reported to have antioxidant and cardioprotective properties. The relationship between PON1 genotypes and functional activity with systemic measures of oxidative stress and cardiovascular disease (CVD) risk in humans has not been systematically investigated. Objective To investigate the relationship of genetic and biochemical determinants of PON1 activity with systemic measures of oxidative stress and CVD risk in humans. Design, Setting, and Participants The association between systemic PON1 activity measures and a functional polymorphism (Q192R) resulting in high PON1 activity with prevalent CVD and future major adverse cardiac events (myocardial infarction, stroke, or death) was evaluated in 1399 sequential consenting patients undergoing diagnostic coronary angiography between September 2002 and November 2003 at the Cleveland Clinic. Patients were followed up until December 2006. Systemic levels of multiple structurally defined fatty acid oxidation products were also measured by mass spectrometry in 150 age-, sex-, and race-matched patients and compared with regard to PON1 genotype and activity. Main Outcome Measures Relationship between a functional PON1 polymorphism and PON1 activity with global indices of systemic oxidative stress and risk of CVD. Results The PON1 genotype demonstrated significant dose-dependent associations (QQ192 > QR192 > RR192) with decreased levels of serum PON1 activity and with increased levels of systemic indices of oxidative stress. Compared with participants with either the PON1 RR192 or QR192 genotype, participants with the QQ192 genotype demonstrated an increased risk of all-cause mortality (43/681 deaths [6.75%] in RR192 and QR192 and 62/584 deaths [11.1%] in QQ192; adjusted hazard ratio, 2.05; 95% confidence interval [CI], 1.32-3.18) and of major adverse cardiac events (88/681 events [13.6%] in RR192 and QR192 and 102/584 events [18.0%] in QQ192; adjusted hazard ratio, 1.48; 95% CI, 1.09-2.03; P = .01). The incidence of major adverse cardiac events was significantly lower in participants in the highest PON1 activity quartile (23/315 [7.3%]) and 235/324 [7.7%] for paraoxonase and arylesterase, respectively) compared with those in the lowest activity quartile (78/311 [25.1%] and 75/319 [23.5%]; P Conclusion This study provides direct evidence for a mechanistic link between genetic determinants and activity of PON1 with systemic oxidative stress and prospective cardiovascular risk, indicating a potential mechanism for the atheroprotective function of PON1.
497 citations
Cited by
More filters
••
TL;DR: WRITING GROUP MEMBERS Emelia J. Benjamin, MD, SCM, FAHA Michael J. Reeves, PhD Matthew Ritchey, PT, DPT, OCS, MPH Carlos J. Jiménez, ScD, SM Lori Chaffin Jordan,MD, PhD Suzanne E. Judd, PhD
Abstract: WRITING GROUP MEMBERS Emelia J. Benjamin, MD, SCM, FAHA Michael J. Blaha, MD, MPH Stephanie E. Chiuve, ScD Mary Cushman, MD, MSc, FAHA Sandeep R. Das, MD, MPH, FAHA Rajat Deo, MD, MTR Sarah D. de Ferranti, MD, MPH James Floyd, MD, MS Myriam Fornage, PhD, FAHA Cathleen Gillespie, MS Carmen R. Isasi, MD, PhD, FAHA Monik C. Jiménez, ScD, SM Lori Chaffin Jordan, MD, PhD Suzanne E. Judd, PhD Daniel Lackland, DrPH, FAHA Judith H. Lichtman, PhD, MPH, FAHA Lynda Lisabeth, PhD, MPH, FAHA Simin Liu, MD, ScD, FAHA Chris T. Longenecker, MD Rachel H. Mackey, PhD, MPH, FAHA Kunihiro Matsushita, MD, PhD, FAHA Dariush Mozaffarian, MD, DrPH, FAHA Michael E. Mussolino, PhD, FAHA Khurram Nasir, MD, MPH, FAHA Robert W. Neumar, MD, PhD, FAHA Latha Palaniappan, MD, MS, FAHA Dilip K. Pandey, MBBS, MS, PhD, FAHA Ravi R. Thiagarajan, MD, MPH Mathew J. Reeves, PhD Matthew Ritchey, PT, DPT, OCS, MPH Carlos J. Rodriguez, MD, MPH, FAHA Gregory A. Roth, MD, MPH Wayne D. Rosamond, PhD, FAHA Comilla Sasson, MD, PhD, FAHA Amytis Towfighi, MD Connie W. Tsao, MD, MPH Melanie B. Turner, MPH Salim S. Virani, MD, PhD, FAHA Jenifer H. Voeks, PhD Joshua Z. Willey, MD, MS John T. Wilkins, MD Jason HY. Wu, MSc, PhD, FAHA Heather M. Alger, PhD Sally S. Wong, PhD, RD, CDN, FAHA Paul Muntner, PhD, MHSc On behalf of the American Heart Association Statistics Committee and Stroke Statistics Subcommittee Heart Disease and Stroke Statistics—2017 Update
7,190 citations
••
Drexel University1, Yeshiva University2, Roswell Park Cancer Institute3, Virginia Commonwealth University4, Van Andel Institute5, Science Applications International Corporation6, Massachusetts Institute of Technology7, Harvard University8, University of Miami9, Icahn School of Medicine at Mount Sinai10, University of Chicago11, Howard Hughes Medical Institute12, University of Geneva13, Stanford University14, University of Oxford15, University of North Carolina at Chapel Hill16, National Institutes of Health17
TL;DR: The Genotype-Tissue Expression (GTEx) project is described, which will establish a resource database and associated tissue bank for the scientific community to study the relationship between genetic variation and gene expression in human tissues.
Abstract: Genome-wide association studies have identified thousands of loci for common diseases, but, for the majority of these, the mechanisms underlying disease susceptibility remain unknown. Most associated variants are not correlated with protein-coding changes, suggesting that polymorphisms in regulatory regions probably contribute to many disease phenotypes. Here we describe the Genotype-Tissue Expression (GTEx) project, which will establish a resource database and associated tissue bank for the scientific community to study the relationship between genetic variation and gene expression in human tissues.
6,545 citations
••
TL;DR: This work introduces Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner and constitutes a starting point to build pathway-centric models of biology.
Abstract: Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets. To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments. GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org
.
6,125 citations
••
TL;DR: March 5, 2019 e1 WRITING GROUP MEMBERS Emelia J. Virani, MD, PhD, FAHA, Chair Elect On behalf of the American Heart Association Council on Epidemiology and Prevention Statistics Committee and Stroke Statistics Subcommittee.
Abstract: March 5, 2019 e1 WRITING GROUP MEMBERS Emelia J. Benjamin, MD, ScM, FAHA, Chair Paul Muntner, PhD, MHS, FAHA, Vice Chair Alvaro Alonso, MD, PhD, FAHA Marcio S. Bittencourt, MD, PhD, MPH Clifton W. Callaway, MD, FAHA April P. Carson, PhD, MSPH, FAHA Alanna M. Chamberlain, PhD Alexander R. Chang, MD, MS Susan Cheng, MD, MMSc, MPH, FAHA Sandeep R. Das, MD, MPH, MBA, FAHA Francesca N. Delling, MD, MPH Luc Djousse, MD, ScD, MPH Mitchell S.V. Elkind, MD, MS, FAHA Jane F. Ferguson, PhD, FAHA Myriam Fornage, PhD, FAHA Lori Chaffin Jordan, MD, PhD, FAHA Sadiya S. Khan, MD, MSc Brett M. Kissela, MD, MS Kristen L. Knutson, PhD Tak W. Kwan, MD, FAHA Daniel T. Lackland, DrPH, FAHA Tené T. Lewis, PhD Judith H. Lichtman, PhD, MPH, FAHA Chris T. Longenecker, MD Matthew Shane Loop, PhD Pamela L. Lutsey, PhD, MPH, FAHA Seth S. Martin, MD, MHS, FAHA Kunihiro Matsushita, MD, PhD, FAHA Andrew E. Moran, MD, MPH, FAHA Michael E. Mussolino, PhD, FAHA Martin O’Flaherty, MD, MSc, PhD Ambarish Pandey, MD, MSCS Amanda M. Perak, MD, MS Wayne D. Rosamond, PhD, MS, FAHA Gregory A. Roth, MD, MPH, FAHA Uchechukwu K.A. Sampson, MD, MBA, MPH, FAHA Gary M. Satou, MD, FAHA Emily B. Schroeder, MD, PhD, FAHA Svati H. Shah, MD, MHS, FAHA Nicole L. Spartano, PhD Andrew Stokes, PhD David L. Tirschwell, MD, MS, MSc, FAHA Connie W. Tsao, MD, MPH, Vice Chair Elect Mintu P. Turakhia, MD, MAS, FAHA Lisa B. VanWagner, MD, MSc, FAST John T. Wilkins, MD, MS, FAHA Sally S. Wong, PhD, RD, CDN, FAHA Salim S. Virani, MD, PhD, FAHA, Chair Elect On behalf of the American Heart Association Council on Epidemiology and Prevention Statistics Committee and Stroke Statistics Subcommittee
5,739 citations
••
TL;DR: The Statistical Update represents the most up-to-date statistics related to heart disease, stroke, and the cardiovascular risk factors listed in the AHA's My Life Check - Life’s Simple 7, which include core health behaviors and health factors that contribute to cardiovascular health.
Abstract: Each chapter listed in the Table of Contents (see next page) is a hyperlink to that chapter. The reader clicks the chapter name to access that chapter.
Each chapter listed here is a hyperlink. Click on the chapter name to be taken to that chapter.
Each year, the American Heart Association (AHA), in conjunction with the Centers for Disease Control and Prevention, the National Institutes of Health, and other government agencies, brings together in a single document the most up-to-date statistics related to heart disease, stroke, and the cardiovascular risk factors listed in the AHA’s My Life Check - Life’s Simple 7 (Figure1), which include core health behaviors (smoking, physical activity, diet, and weight) and health factors (cholesterol, blood pressure [BP], and glucose control) that contribute to cardiovascular health. The Statistical Update represents …
5,102 citations