Showing papers by "Yingrui Li published in 2014"

PDF

Open Access

Journal Article•DOI•

Phylogenomics resolves the timing and pattern of insect evolution

[...]

Bernhard Misof, Shanlin Liu, Karen Meusemann¹, Ralph S. Peters, Alexander Donath, Christoph Mayer, Paul B. Frandsen², Jessica L. Ware², Tomas Flouri³, Rolf G. Beutel⁴, Oliver Niehuis, Malte Petersen, Fernando Izquierdo-Carrasco³, Torsten Wappler⁵, Jes Rust⁵, Andre J. Aberer³, Ulrike Aspöck⁶, Ulrike Aspöck⁷, Horst Aspöck⁶, Daniela Bartel⁶, Alexander Blanke⁸, Simon Berger³, Alexander Böhm⁶, Thomas R. Buckley⁹, Brett Calcott¹⁰, Junqing Chen, Frank Friedrich¹¹, Makiko Fukui¹², Mari Fujita⁸, Carola Greve, Peter Grobe, Shengchang Gu, Ying Huang, Lars S. Jermiin¹, Akito Y. Kawahara¹³, Lars Krogmann¹⁴, Martin Kubiak¹¹, Robert Lanfear¹⁵, Robert Lanfear¹⁶, Robert Lanfear¹⁷, Harald Letsch⁶, Yiyuan Li, Zhenyu Li, Jiguang Li, Haorong Lu, Ryuichiro Machida⁸, Yuta Mashimo⁸, Pashalia Kapli¹⁸, Pashalia Kapli³, Duane D. McKenna¹⁹, Guanliang Meng, Yasutaka Nakagaki⁸, José Luis Navarrete-Heredia²⁰, Michael Ott²¹, Yanxiang Ou, Günther Pass⁶, Lars Podsiadlowski⁵, Hans Pohl⁴, Björn M. von Reumont²², Kai Schütte¹¹, Kaoru Sekiya⁸, Shota Shimizu⁸, Adam Slipinski¹, Alexandros Stamatakis³, Alexandros Stamatakis²³, Wenhui Song, Xu Su, Nikolaus U. Szucsich⁶, Meihua Tan, Xuemei Tan, Min Tang, Jingbo Tang, Gerald Timelthaler⁶, Shigekazu Tomizuka⁸, Michelle D. Trautwein²⁴, Xiaoli Tong²⁵, Toshiki Uchifune⁸, Manfred Walzl⁶, Brian M. Wiegmann²⁶, Jeanne Wilbrandt, Benjamin Wipfler⁴, Thomas K. F. Wong¹, Qiong Wu, Gengxiong Wu, Yinlong Xie, Shenzhou Yang, Qing Yang, David K. Yeates¹, Kazunori Yoshizawa²⁷, Qing Zhang, Rui Zhang, Wenwei Zhang, Yunhui Zhang, Jing Zhao, Chengran Zhou, Lili Zhou, Tanja Ziesmann, Shijie Zou, Yingrui Li, Xun Xu, Yong Zhang, Huanming Yang, Jian Wang, Jun Wang, Karl M. Kjer², Xin Zhou - Show less +102 more•Institutions (27)

Commonwealth Scientific and Industrial Research Organisation¹, Rutgers University², Heidelberg Institute for Theoretical Studies³, University of Jena⁴, University of Bonn⁵, University of Vienna⁶, Naturhistorisches Museum⁷, University of Tsukuba⁸, Landcare Research⁹, Johns Hopkins University¹⁰, University of Hamburg¹¹, Ehime University¹², Florida Museum of Natural History¹³, Staatliches Museum für Naturkunde Stuttgart¹⁴, National Evolutionary Synthesis Center¹⁵, Australian National University¹⁶, Macquarie University¹⁷, American Museum of Natural History¹⁸, University of Memphis¹⁹, University of Guadalajara²⁰, Bavarian Academy of Sciences and Humanities²¹, Natural History Museum²², Karlsruhe Institute of Technology²³, California Academy of Sciences²⁴, South China Agricultural University²⁵, North Carolina State University²⁶, Hokkaido University²⁷

07 Nov 2014-Science

TL;DR: The phylogeny of all major insect lineages reveals how and when insects diversified and provides a comprehensive reliable scaffold for future comparative analyses of evolutionary innovations among insects.

...read moreread less

Abstract: Insects are the most speciose group of animals, but the phylogenetic relationships of many major lineages remain unresolved. We inferred the phylogeny of insects from 1478 protein-coding genes. Phylogenomic analyses of nucleotide and amino acid sequences, with site-specific nucleotide or domain-specific amino acid substitution models, produced statistically robust and congruent results resolving previously controversial phylogenetic relations hips. We dated the origin of insects to the Early Ordovician [~479 million years ago (Ma)], of insect flight to the Early Devonian (~406 Ma), of major extant lineages to the Mississippian (~345 Ma), and the major diversification of holometabolous insects to the Early Cretaceous. Our phylogenomic study provides a comprehensive reliable scaffold for future comparative analyses of evolutionary innovations among insects.

...read moreread less

1,998 citations

Journal Article•DOI•

Identification of genomic alterations in oesophageal squamous cell cancer

[...]

Yongmei Song¹, Lin Li, Yunwei Ou¹, Zhibo Gao, En-Min Li², Xiangchun Li, Weimin Zhang¹, Jiaqian Wang, Li-Yan Xu³, Yong Zhou, Xiao-Juan Ma¹, Lingyan Liu¹, Zitong Zhao¹, Xuanlin Huang, Jing Fan¹, Lijia Dong¹, Gang Chen, Liying Ma¹, Jie Yang, Chen Longyun, Minghui He, Li Miao, Xuehan Zhuang, Kai Huang, Kunlong Qiu, Guangliang Yin, Guangwu Guo, Qiang Feng, Peishan Chen, Zhi-Yong Wu⁴, Jian-Yi Wu², Ling Ma¹, Jinyang Zhao, Longhai Luo, Ming Fu¹, Bainan Xu⁵, Bo Chen³, Yingrui Li, Tong Tong¹, Mingrong Wang¹, Zhihua Liu¹, Dongxin Lin¹, Xiuqing Zhang, Huanming Yang, Jun Wang, Qimin Zhan¹ - Show less +42 more•Institutions (5)

Peking Union Medical College¹, Laboratory of Molecular Biology², Shantou University³, Sun Yat-sen University⁴, Chinese PLA General Hospital⁵

01 May 2014-Nature

TL;DR: Genomic analyses suggest that ESCC and head and neck squamous cell carcinoma share some common pathogenic mechanisms, and ESCC development is associated with alcohol drinking, and novel biological markers and tumorigenic pathways that would greatly improve therapeutic strategies for ESCC are explored.

...read moreread less

Abstract: Oesophageal cancer is one of the most aggressive cancers and is the sixth leading cause of cancer death worldwide(1). Approximately 70% of global oesophageal cancer cases occur in China, with oesophageal squamous cell carcinoma (ESCC) being the histopathological form in the vast majority of cases (>90%)(2,3). Currently, there are limited clinical approaches for the early diagnosis and treatment of ESCC, resulting in a 10% five-year survival rate for patients. However, the full repertoire of genomic events leading to the pathogenesis of ESCC remains unclear. Here we describe a comprehensive genomic analysis of 158 ESCC cases, as part of the International Cancer Genome Consortium research project. We conducted whole-genome sequencing in 17 ESCC cases and whole-exome sequencing in 71 cases, of which 53 cases, plus an additional 70 ESCC cases not used in the whole-genome and whole-exome sequencing, were subjected to array comparative genomic hybridization analysis. We identified eight significantly mutated genes, of which six are well known tumour-associated genes (TP53, RB1, CDKN2A, PIK3CA, NOTCH1, NFE2L2), and two have not previously been described in ESCC (ADAM29 and FAM135B). Notably, FAM135B is identified as a novel cancer-implicated gene as assayed for its ability to promote malignancy of ESCC cells. Additionally, MIR548K, a microRNA encoded in the amplified 11q13.3-13.4 region, is characterized as a novel oncogene, and functional assays demonstrate that MIR548K enhances malignant phenotypes of ESCC cells. Moreover, we have found that several important histone regulator genes (MLL2 (also called KMT2D), ASH1L, MLL3 (KMT2C), SETD1B, CREBBP and EP300) are frequently altered in ESCC. Pathway assessment reveals that somatic aberrations are mainly involved in the Wnt, cell cycle and Notch pathways. Genomic analyses suggest that ESCC and head and neck squamous cell carcinoma share some common pathogenic mechanisms, and ESCC development is associated with alcohol drinking. This study has explored novel biological markers and tumorigenic pathways that would greatly improve therapeutic strategies for ESCC.

...read moreread less

853 citations

Journal Article•DOI•

Altitude adaptation in Tibetans caused by introgression of Denisovan-like DNA

[...]

Emilia Huerta-Sanchez¹, Xin Jin², Asan, Zhuoma Bianba, Benjamin M. Peter¹, Nicolas Vinckenbosch¹, Yu Liang, Xin Yi, Mingze He³, Mehmet Somel⁴, Peixiang Ni, Bo Wang, Xiaohua Ou, Huasang, Jiangbai Luosang, Zha Xi Ping Cuo, Kui Li, Guoyi Gao, Ye Yin, Wei Wang, Xiuqing Zhang, Xun Xu, Huanming Yang⁵, Yingrui Li, Jian Wang, Jun Wang⁶, Rasmus Nielsen¹ - Show less +23 more•Institutions (6)

University of California, Berkeley¹, South China University of Technology², Iowa State University³, Middle East Technical University⁴, King Abdulaziz University⁵, Macau University of Science and Technology⁶

14 Aug 2014-Nature

TL;DR: Re-sequencing the region around EPAS1 in 40 Tibetan and 40 Han individuals finds that this gene has a highly unusual haplotype structure that can only be convincingly explained by introgression of DNA from Denisovan or Denisovan-related individuals into humans.

...read moreread less

Abstract: As modern humans migrated out of Africa, they encountered many new environmental conditions, including greater temperature extremes, different pathogens and higher altitudes. These diverse environments are likely to have acted as agents of natural selection and to have led to local adaptations. One of the most celebrated examples in humans is the adaptation of Tibetans to the hypoxic environment of the high-altitude Tibetan plateau. A hypoxia pathway gene, EPAS1, was previously identified as having the most extreme signature of positive selection in Tibetans, and was shown to be associated with differences in haemoglobin concentration at high altitude. Re-sequencing the region around EPAS1 in 40 Tibetan and 40 Han individuals, we find that this gene has a highly unusual haplotype structure that can only be convincingly explained by introgression of DNA from Denisovan or Denisovan-related individuals into humans. Scanning a larger set of worldwide populations, we find that the selected haplotype is only found in Denisovans and in Tibetans, and at very low frequency among Han Chinese. Furthermore, the length of the haplotype, and the fact that it is not found in any other populations, makes it unlikely that the haplotype sharing between Tibetans and Denisovans was caused by incomplete ancestral lineage sorting rather than introgression. Our findings illustrate that admixture with other hominin species has provided genetic variation that helped humans to adapt to new environments.

...read moreread less

851 citations

Journal Article•DOI•

SOAPdenovo-Trans: de novo transcriptome assembly with short RNA-Seq reads

[...]

Yinlong Xie¹, Yinlong Xie², Gengxiong Wu, Jingbo Tang³, Ruibang Luo², Jordan Patterson⁴, Shanlin Liu, Weihua Huang, Guangzhu He, Shengchang Gu, Shengkang Li, Xin Zhou, Tak-Wah Lam², Yingrui Li, Xun Xu, Gane Ka-Shu Wong⁴, Jun Wang - Show less +13 more•Institutions (4)

South China University of Technology¹, University of Hong Kong², Central South University³, University of Alberta⁴

15 Jun 2014-Bioinformatics

TL;DR: The conclusion is that SOAPdenovo-Trans provides higher contiguity, lower redundancy and faster execution, compared with two other popular transcriptome assemblers.

...read moreread less

Abstract: Motivation: Transcriptome sequencing has long been the favored method for quickly and inexpensively obtaining a large number of gene sequences from an organism with no reference genome. Owing to the rapid increase in throughputs and decrease in costs of next- generation sequencing, RNA-Seq in particular has become the method of choice. However, the very short reads (e.g. 2 � 90 bp paired ends) from next generation sequencing makes de novo assembly to recover complete or full-length transcript sequences an algorithmic challenge. Results: Here, we present SOAPdenovo-Trans, a de novo transcriptome assembler designed specifically for RNA-Seq. We evaluated its performance on transcriptome datasets from rice and mouse. Using as our benchmarks the known transcripts from these wellannotated genomes (sequenced a decade ago), we assessed how SOAPdenovo-Trans and two other popular transcriptome assemblers handled such practical issues as alternative splicing and variable expression levels. Our conclusion is that SOAPdenovo-Trans provides higher contiguity, lower redundancy and faster execution. Availability and implementation: Source code and user manual are available at http://sourceforge.net/projects/soapdenovotrans/. Contact: xieyl@genomics.cn or bgi-soap@googlegroups.com Supplementary information: Supplementary data are available at Bioinformatics online.

...read moreread less

730 citations

Journal Article•DOI•

Whole-genome sequence variation, population structure and demographic history of the Dutch population

[...]

Laurent C. Francioli¹, Androniki Menelaou¹, Sara L. Pulit¹, Freerk van Dijk¹, Pier Francesco Palamara², Clara C. Elbers¹, Pieter B. Neerincx¹, Kai Ye³, Kai Ye⁴, Victor Guryev, Wigard P. Kloosterman¹, Patrick Deelen¹, Abdel Abdellaoui⁵, Elisabeth M. van Leeuwen⁶, Mannis van Oven⁶, Martijn Vermaat⁴, Mingkun Li⁷, Jeroen F. J. Laros⁴, Lennart C. Karssen⁶, Alexandros Kanterakis¹, Najaf Amin⁶, Jouke-Jan Hottenga⁵, Eric-Wubbo Lameijer⁴, Mathijs Kattenberg⁵, Martijn Dijkstra¹, Heorhiy Byelas¹, Jessica van Setten⁸, Barbera D. C. van Schaik⁵, Jan Bot, Isaac J. Nijman¹, Ivo Renkens¹, Tobias Marschall⁹, Alexander Schönhuth, Jayne Y. Hehir-Kwa¹⁰, Robert E. Handsaker¹⁰, Robert E. Handsaker¹¹, Paz Polak¹⁰, Mashaal Sohail¹⁰, Mashaal Sohail¹², Dana Vuzman¹², Fereydoun Hormozdiari, David van Enckevort, Hailiang Mei⁶, Vyacheslav Koval⁴, Matthijs Moed¹, K. Joeri van der Velde¹, Fernando Rivadeneira¹², Fernando Rivadeneira¹⁰, Fernando Rivadeneira⁶, Karol Estrada⁶, Carolina Medina-Gomez⁶, Aaron Isaacs¹¹, Aaron Isaacs¹⁰, Steven A. McCarroll⁴, Marian Beekman⁴, Anton J. M. de Craen⁴, H. Eka D. Suchiman⁴, Albert Hofman⁶, Ben A. Oostra⁶, André G. Uitterlinden⁶, Gonneke Willemsen⁵, Mathieu Platteel¹, Jan H. Veldink⁸, Leonard H. van den Berg¹³, Steven J. Pitts¹³, Shobha Potluri¹³, Purnima Sundar¹³, David R. Cox¹⁰, David R. Cox¹², Shamil R. Sunyaev⁴, Johan T. den Dunnen⁷, Mark Stoneking⁷, Peter de Knijff⁴, Manfred Kayser⁶, Qibin Li¹⁴, Yingrui Li¹⁴, Yuanping Du¹⁴, Ruoyan Chen¹⁴, Hongzhi Cao¹⁴, Ning Li, Sujie Cao, Jun Wang¹⁵, Jasper A. Bovenberg, Itsik Pe'er², P. Eline Slagboom⁴, Cornelia M. van Duijn⁶, Dorret I. Boomsma⁵, Gert-Jan B. van Ommen⁴, Paul I.W. de Bakker¹, Paul I.W. de Bakker⁸, Morris A. Swertz, Cisca Wijmenga - Show less +88 more•Institutions (15)

University of Groningen¹, Columbia University², University of Washington³, Leiden University⁴, University of Amsterdam⁵, Erasmus University Rotterdam⁶, Max Planck Society⁷, Utrecht University⁸, Centrum Wiskunde & Informatica⁹, Radboud University Nijmegen¹⁰, Massachusetts Institute of Technology¹¹, Harvard University¹², Pfizer¹³, Beijing Institute of Genomics¹⁴, University of Copenhagen¹⁵

01 Jun 2014-Nature Genetics

TL;DR: The Genome of the Netherlands (GoNL) Project is described, in which the whole genomes of 250 Dutch parent-offspring families were sequenced and a haplotype map of 20.4 million single-nucleotide variants and 1.2 million insertions and deletions were constructed.

...read moreread less

Abstract: Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch parent-offspring families and constructed a haplotype map of 20.4 million single-nucleotide variants and 1.2 million insertions and deletions. The intermediate coverage (∼13×) and trio design enabled extensive characterization of structural variation, including midsize events (30-500 bp) previously poorly catalogued and de novo mutations. We demonstrate that the quality of the haplotypes boosts imputation accuracy in independent samples, especially for lower frequency alleles. Population genetic analyses demonstrate fine-scale structure across the country and support multiple ancient migrations, consistent with historical changes in sea level and flooding. The GoNL Project illustrates how single-population whole-genome sequencing can provide detailed characterization of genetic variation and may guide the design of future population studies.

...read moreread less

677 citations

Journal Article•DOI•

Whole-genome sequencing of cultivated and wild peppers provides insights into Capsicum domestication and specialization

[...]

Cheng Qin¹, Cheng Qin², Changshui Yu, Yaou Shen¹, Xiaodong Fang³, Xiaodong Fang⁴, Lang Chen, Jiumeng Min³, Jiaowen Cheng², Shancen Zhao³, Meng Xu³, Yong Luo, Yulan Yang³, Zhiming Wu⁵, Likai Mao³, Haiyang Wu³, Changying Ling-Hu, Huangkai Zhou³, Haijian Lin¹, Sandra Isabel González-Morales⁶, Diana L. Trejo-Saavedra⁶, Hao Tian, Xin Tang², Maojun Zhao¹, Zhiyong Huang³, Anwei Zhou, Xiaoming Yao³, Junjie Cui², Wenqi Li³, Zhe Chen¹, Yongqiang Feng, Yongchao Niu³, Shimin Bi, Xiuwei Yang, Weipeng Li², Huimin Cai³, Xirong Luo, Salvador Montes-Hernández, Marco Antonio Leyva-González⁶, Zhiqiang Xiong³, Xiujing He¹, Lijun Bai³, Shu Tan², Xiangqun Tang, Dan Liu³, Jinwen Liu³, Shangxing Zhang, Maoshan Chen³, Lu Zhang³, Lu Zhang⁷, Li Zhang², Yinchao Zhang¹, Weiqin Liao, Yan Zhang³, Min Wang, Xiaodan Lv³, Bo Wen³, Hongjun Liu¹, Hemi Luan³, Yonggang Zhang, Shuang Yang³, Xiaodian Wang, Jiaohui Xu³, Xueqin Li, Shuai Cheng Li⁷, Junyi Wang³, Alain Palloix, Paul W. Bosland⁸, Yingrui Li³, Anders Krogh⁴, Rafael F. Rivera-Bustamante⁶, Luis Herrera-Estrella⁶, Ye Yin³, Jiping Yu, Kailin Hu², Zhiming Zhang¹ - Show less +72 more•Institutions (8)

Sichuan Agricultural University¹, South China Agricultural University², Beijing Genomics Institute³, University of Copenhagen⁴, Zhongkai University of Agriculture and Engineering⁵, CINVESTAV⁶, City University of Hong Kong⁷, New Mexico State University⁸

08 Apr 2014-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: It is found that dosage compensation effect of tandem duplication genes probably contributed to the pungent diversification in pepper and the Capsicum reference genome provides crucial information for the study of not only the evolution of the pepper genome but also, the Solanaceae family.

...read moreread less

Abstract: As an economic crop, pepper satisfies people’s spicy taste and has medicinal uses worldwide. To gain a better understanding of Capsicum evolution, domestication, and specialization, we present here the genome sequence of the cultivated pepper Zunla-1 (C. annuum L.) and its wild progenitor Chiltepin (C. annuum var. glabriusculum). We estimate that the pepper genome expanded ∼0.3 Mya (with respect to the genome of other Solanaceae) by a rapid amplification of retrotransposons elements, resulting in a genome comprised of ∼81% repetitive sequences. Approximately 79% of 3.48-Gb scaffolds containing 34,476 protein-coding genes were anchored to chromosomes by a high-density genetic map. Comparison of cultivated and wild pepper genomes with 20 resequencing accessions revealed molecular footprints of artificial selection, providing us with a list of candidate domestication genes. We also found that dosage compensation effect of tandem duplication genes probably contributed to the pungent diversification in pepper. The Capsicum reference genome provides crucial information for the study of not only the evolution of the pepper genome but also, the Solanaceae family, and it will facilitate the establishment of more effective pepper breeding programs.

...read moreread less

593 citations

Journal Article•DOI•

The locust genome provides insight into swarm formation and long-distance flight

[...]

Xianhui Wang¹, Xiaodong Fang, Pengcheng Yang¹, Xuanting Jiang, Feng Jiang¹, Dejian Zhao¹, Bolei Li¹, Feng Cui¹, Jianing Wei¹, Chuan Ma¹, Wang Yundan¹, Jing He¹, Yuan Luo¹, Zhifeng Wang¹, Xiaojiao Guo¹, Wei Guo¹, Xuesong Wang¹, Yi Zhang¹, Meiling Yang¹, Shuguang Hao¹, Bing Chen¹, Zongyuan Ma¹, Dan Yu¹, Zhiqiang Xiong, Yabing Zhu, Dingding Fan, Lijuan Han, Bo Wang, Yuanxin Chen, Junwen Wang, Lan Yang, Wei Zhao, Yue Feng, Guanxing Chen, Jinmin Lian, Qiye Li, Zhiyong Huang, Xiaoming Yao, Na Lv¹, Guojie Zhang, Yingrui Li, Jian Wang, Jun Wang, Baoli Zhu¹, Le Kang¹ - Show less +41 more•Institutions (1)

Chinese Academy of Sciences¹

14 Jan 2014-Nature Communications

TL;DR: A draft 6.5 Gb genome sequence of Locusta migratoria is presented, which is the largest animal genome sequenced so far, and complex regulatory mechanisms involved in microtubule dynamic-mediated synapse plasticity during phase change are revealed.

...read moreread less

Abstract: Locusts are one of the world's most destructive agricultural pests and represent a useful model system in entomology. Here we present a draft 6.5 Gb genome sequence of Locusta migratoria, which is the largest animal genome sequenced so far. Our findings indicate that the large genome size of L. migratoria is likely to be because of transposable element proliferation combined with slow rates of loss for these elements. Methylome and transcriptome analyses reveal complex regulatory mechanisms involved in microtubule dynamic-mediated synapse plasticity during phase change. We find significant expansion of gene families associated with energy consumption and detoxification, consistent with long-distance flight capacity and phytophagy. We report hundreds of potential insecticide target genes, including cys-loop ligand-gated ion channels, G-protein-coupled receptors and lethal genes. The L. migratoria genome sequence offers new insights into the biology and sustainable management of this pest species, and will promote its wide use as a model system.

...read moreread less

431 citations

Journal Article•DOI•

The Genome of the Netherlands: design, and project goals

[...]

Dorret I. Boomsma¹, Cisca Wijmenga, Eline Slagboom², Morris A. Swertz, Lennart C. Karssen³, Abdel Abdellaoui¹, Kai Ye², Victor Guryev⁴, Martijn Vermaat⁵, Freerk van Dijk⁶, Laurent C. Francioli⁷, Jouke-Jan Hottenga¹, Jeroen F. J. Laros⁵, Qibin Li, Yingrui Li, Hongzhi Cao, Ruoyan Chen, Yuanping Du, Ning Li, Sujie Cao, Jessica van Setten⁷, Androniki Menelaou⁷, Sara L. Pulit⁷, Jayne Y. Hehir-Kwa⁸, Marian Beekman⁵, Clara C. Elbers⁷, Heorhiy Byelas⁶, Anton J. M. de Craen⁵, Patrick Deelen⁶, Martijn Dijkstra⁶, Johan T. den Dunnen⁵, Peter de Knijff⁵, Jeanine J. Houwing-Duistermaat⁵, Vyacheslav Koval³, Karol Estrada³, Albert Hofman³, Alexandros Kanterakis⁶, David van Enckevort⁹, Hailiang Mai⁹, Mathijs Kattenberg¹, Elisabeth M. van Leeuwen³, Pieter B. Neerincx⁶, Ben A. Oostra³, Fernanodo Rivadeneira³, H. Eka D. Suchiman², André G. Uitterlinden³, Gonneke Willemsen¹, Bruce H. R. Wolffenbuttel⁶, Jun Wang¹⁰, Paul I.W. de Bakker⁷, Gert-Jan B. van Ommen⁵, Cornelia M. van Duijn³ - Show less +48 more•Institutions (10)

VU University Amsterdam¹, Leiden University Medical Center², Erasmus University Rotterdam³, University of Groningen⁴, Leiden University⁵, University Medical Center Groningen⁶, Utrecht University⁷, Radboud University Nijmegen⁸, Netherlands Bioinformatics Centre⁹, University of Copenhagen¹⁰

01 Feb 2014-European Journal of Human Genetics

TL;DR: The Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL, is described, a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population.

...read moreread less

Abstract: Within the Netherlands a national network of biobanks has been established (Biobanking and Biomolecular Research Infrastructure-Netherlands (BBMRI-NL)) as a national node of the European BBMRI. One of the aims of BBMRI-NL is to enrich biobanks with different types of molecular and phenotype data. Here, we describe the Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL. GoNL is a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population. The parent-offspring trios include adult individuals ranging in age from 19 to 87 years (mean=53 years; SD=16 years) from birth cohorts 1910-1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14-15x. The family-based design represents a unique resource to assess the frequency of regional variants, accurately reconstruct haplotypes by family-based phasing, characterize short indels and complex structural variants, and establish the rate of de novo mutational events. GoNL will also serve as a reference panel for imputation in the available genome-wide association studies in Dutch and other cohorts to refine association signals and uncover population-specific variants. GoNL will create a catalog of human genetic variation in this sample that is uniquely characterized with respect to micro-geographic location and a wide range of phenotypes. The resource will be made available to the research and medical community to guide the interpretation of sequencing projects. The present paper summarizes the global characteristics of the project.

...read moreread less

267 citations

Journal Article•DOI•

Genome sequencing of the high oil crop sesame provides insight into oil biosynthesis

[...]

Linhai Wang¹, Sheng Yu², Chaobo Tong¹, Yingzhong Zhao¹, Yan Liu, Chi Song², Yanxin Zhang¹, Xudong Zhang², Ying Wang², Wei Hua¹, Donghua Li¹, Dan Li², Fang Li², Jingyin Yu¹, Chunyan Xu², Xuelian Han², Shunmou Huang¹, Shuaishuai Tai², Junyi Wang², Xun Xu², Yingrui Li², Shengyi Liu¹, Rajeev K. Varshney³, Rajeev K. Varshney⁴, Jun Wang², Jun Wang⁵, Xiurong Zhang¹ - Show less +23 more•Institutions (5)

Crops Research Institute¹, Beijing Genomics Institute², International Maize and Wheat Improvement Center³, International Crops Research Institute for the Semi-Arid Tropics⁴, University of Copenhagen⁵

27 Feb 2014-Genome Biology

TL;DR: The sesame genome will facilitate future research on the evolution of eudicots, as well as the study of lipid biosynthesis and potential genetic improvement of sesame, an important species from the order Lamiales and a high oil crop.

...read moreread less

Abstract: Background: Sesame, Sesamum indicum L., is considered the queen of oilseeds for its high oil content and quality, and is grown widely in tropical and subtropical areas as an important source of oil and protein. However, the molecular biology of sesame is largely unexplored. Results: Here, we report a high-quality genome sequence of sesame assembled de novo with a contig N50 of 52.2 kb and a scaffold N50 of 2.1 Mb, containing an estimated 27,148 genes. The results reveal novel, independent whole genome duplication and the absence of the Toll/interleukin-1 receptor domain in resistance genes. Candidate genes and oil biosynthetic pathways contributing to high oil content were discovered by comparative genomic and transcriptomic analyses. These revealed the expansion of type 1 lipid transfer genes by tandem duplication, the contraction of lipid degradation genes, and the differential expression of essential genes in the triacylglycerol biosynthesis pathway, particularly in the early stage of seed development. Resequencing data in 29 sesame accessions from 12 countries suggested that the high genetic diversity of lipid-related genes might be associated with the wide variation in oil content. Additionally, the results shed light on the pivotal stage of seed development, oil accumulation and potential key genes for sesamin production, an important pharmacological constituent of sesame. Conclusions: As an important species from the order Lamiales and a high oil crop, the sesame genome will facilitate future research on the evolution of eudicots, as well as the study of lipid biosynthesis and potential genetic improvement of sesame.

...read moreread less

225 citations

Journal Article•DOI•

A large-scale screen for coding variants predisposing to psoriasis.

[...]

Huayang Tang¹, Xin Jin², Yang Li¹, Hui Jiang, Xianfa Tang¹, Xu Yang, Hui Cheng¹, Ying Qiu, Gang Chen¹, Junpu Mei, Fusheng Zhou¹, Renhua Wu, Xianbo Zuo¹, Yong Zhang, Xiaodong Zheng¹, Qi Cai, Xianyong Yin¹, Cheng Quan¹, Haojing Shao, Yong Cui¹, Fangzhen Tian, Xia Zhao, Hong Liu³, Feng-Li Xiao¹, Fengping Xu, Jianwen Han⁴, Dongmei Shi, Anping Zhang¹, Cheng Zhou⁵, Qibin Li, Xing Fan¹, Liya Lin, Hongqing Tian³, Zaixing Wang¹, Huiling Fu, Fang Wang⁵, Baoqi Yang³, Shaowei Huang, Bo Liang¹, Xuefeng Xie, Yunqing Ren¹, Qingquan Gu, Guangdong Wen⁵, Yulin Sun³, Xueli Wu, Lin Dang⁶, Min Xia, Junjun Shan, Tianhang Li, Lin Yang, Xiuyun Zhang, Yu-Zhen Li⁶, Chundi He⁷, Ai-E Xu, Liping Wei⁵, Xiaohang Zhao³, Xinghua Gao⁷, Jinhua Xu⁸, Furen Zhang³, Jianzhong Zhang⁵, Yingrui Li, Liangdan Sun¹, Jianjun Liu¹, Runsheng Chen⁹, Sen Yang¹, Jun Wang¹⁰, Xuejun Zhang⁸ - Show less +63 more•Institutions (10)

Anhui Medical University¹, South China University of Technology², Peking Union Medical College³, Inner Mongolia Medical University⁴, Peking University⁵, Harbin Medical University⁶, China Medical University (PRC)⁷, Fudan University⁸, Chinese Academy of Sciences⁹, University of Copenhagen¹⁰

01 Jan 2014-Nature Genetics

TL;DR: Single-variant and gene-based association analyses of nonsynonymous SNVs did not identify newly associated genes for psoriasis in the regions subjected to targeted resequencing, which suggests that coding variants in the 1,326 targeted genes contribute only a limited fraction of the overall genetic risk for Psoriasis.

...read moreread less

Abstract: To explore the contribution of functional coding variants to psoriasis, we analyzed nonsynonymous single-nucleotide variants (SNVs) across the genome by exome sequencing in 781 psoriasis cases and 676 controls and through follow-up validation in 1,326 candidate genes by targeted sequencing in 9,946 psoriasis cases and 9,906 controls from the Chinese population. We discovered two independent missense SNVs in IL23R and GJB2 of low frequency and five common missense SNVs in LCE3D, ERAP1, CARD14 and ZNF816A associated with psoriasis at genome-wide significance. Rare missense SNVs in FUT2 and TARBP1 were also observed with suggestive evidence of association. Single-variant and gene-based association analyses of nonsynonymous SNVs did not identify newly associated genes for psoriasis in the regions subjected to targeted resequencing. This suggests that coding variants in the 1,326 targeted genes contribute only a limited fraction of the overall genetic risk for psoriasis.

...read moreread less

191 citations

Journal Article•DOI•

Activating Hotspot L205R Mutation in PRKACA and Adrenal Cushing's Syndrome

[...]

Yanan Cao¹, Minghui He, Zhibo Gao, Ying Peng¹, Yanli Li¹, Lin Li, Weiwei Zhou¹, Xiangchun Li, Xu Zhong¹, Yiming Lei, Tingwei Su¹, Hang Wang, Yiran Jiang¹, Lin Yang, Wei Wei¹, Xu Yang, Xiuli Jiang¹, Li Liu, He Juan¹, Junna Ye¹, Qing Wei¹, Yingrui Li, Weiqing Wang¹, Jun Wang, Guang Ning¹ - Show less +21 more•Institutions (1)

Shanghai Jiao Tong University¹

23 May 2014-Science

TL;DR: In this article, the authors performed whole-exome sequencing of 49 blood-tumor pairs and RNA sequencing of 44 tumors from cortisol-producing adenomas (ACAs), adrenocorticotropic hormone-independent macronodular hyperplasias (AIMAHs), and Adrenocortical oncocytomas (ADOs) and identified a hotspot in the PRKACA gene with a L205R mutation in 69.2% (27 out of 39) of ACAs and validated in 65.5% of a total of 87

...read moreread less

Abstract: Adrenal Cushing's syndrome is caused by excess production of glucocorticoid from adrenocortical tumors and hyperplasias, which leads to metabolic disorders. We performed whole-exome sequencing of 49 blood-tumor pairs and RNA sequencing of 44 tumors from cortisol-producing adrenocortical adenomas (ACAs), adrenocorticotropic hormone-independent macronodular adrenocortical hyperplasias (AIMAHs), and adrenocortical oncocytomas (ADOs). We identified a hotspot in the PRKACA gene with a L205R mutation in 69.2% (27 out of 39) of ACAs and validated in 65.5% of a total of 87 ACAs. Our data revealed that the activating L205R mutation, which locates in the P+1 loop of the protein kinase A (PKA) catalytic subunit, promoted PKA substrate phosphorylation and target gene expression. Moreover, we discovered the recurrently mutated gene DOT1L in AIMAHs and CLASP2 in ADOs. Collectively, these data highlight potentially functional mutated genes in adrenal Cushing's syndrome.

...read moreread less

Journal Article•DOI•

Targeted Gene Correction Minimally Impacts Whole-Genome Mutational Load in Human-Disease-Specific Induced Pluripotent Stem Cell Clones

[...]

Keiichiro Suzuki¹, Chang Yu, Jing Qu², Mo Li¹, Xiaotian Yao, Tingting Yuan², April Goebl¹, Senwei Tang³, Ruotong Ren², Emi Aizawa¹, Fan Zhang⁴, Xiuling Xu², Rupa Devi Soligalla¹, Feng Chen, Jessica Kim¹, Na Young Kim¹, Hsin-Kai Liao¹, Christopher Benner¹, Concepcion Rodriguez Esteban¹, Yabin Jin, Guang-Hui Liu², Yingrui Li, Juan Carlos Izpisua Belmonte¹ - Show less +19 more•Institutions (4)

Salk Institute for Biological Studies¹, Chinese Academy of Sciences², The Chinese University of Hong Kong³, University of Michigan⁴

03 Jul 2014-Cell Stem Cell

TL;DR: With careful monitoring via whole-genome sequencing it is possible to apply genome editing to human pluripotent cells with minimal impact on genomic mutational load, and a TALEN-HDAdV hybrid vector is developed, which significantly increased gene-correction efficiency in hiPSCs.

...read moreread less

Journal Article•DOI•

The South Asian Genome

[...]

John C. Chambers¹, James Abbott², Weihua Zhang², Ernest Turro³, William R. Scott², Sian-Tsung Tan², Uzma Afzal², Saima Afaq², Marie Loh², Benjamin Lehne², Paul F. O'Reilly², Kyle J. Gaulton⁴, Richard D. Pearson⁴, Xinzhong Li², Anita Lavery², Jana Vandrovcova², Mark N. Wass², Kathryn Miller⁵, Joban Sehmi², Laticia Oozageer⁵, Ishminder K. Kooner⁵, Abtehale Al-Hussaini², Rebecca Mills⁵, Jagvir Grewal⁵, Vasileios F. Panoulas¹, Alexandra M. Lewin², Korrinne Northwood², Gurpreet Singh Wander, Frank Geoghegan⁵, Yingrui Li, Jun Wang, Timothy J. Aitman², Mark I. McCarthy⁴, James Scott², Sarah Butcher², Paul Elliott¹, Jaspal S. Kooner¹ - Show less +33 more•Institutions (5)

Imperial College Healthcare¹, Imperial College London², University of Cambridge³, Wellcome Trust Centre for Human Genetics⁴, Ealing Hospital⁵

12 Aug 2014-PLOS ONE

TL;DR: This catalogue of SNPs and indels amongst South Asians provides the first comprehensive map of genetic variation in this major human population, and reveals evidence for selective pressures on genes involved in skin biology, metabolism, infection and immunity.

...read moreread less

Abstract: The genetic sequence variation of people from the Indian subcontinent who comprise one-quarter of the world's population, is not well described. We carried out whole genome sequencing of 168 South Asians, along with whole-exome sequencing of 147 South Asians to provide deeper characterisation of coding regions. We identify 12,962,155 autosomal sequence variants, including 2,946,861 new SNPs and 312,738 novel indels. This catalogue of SNPs and indels amongst South Asians provides the first comprehensive map of genetic variation in this major human population, and reveals evidence for selective pressures on genes involved in skin biology, metabolism, infection and immunity. Our results will accelerate the search for the genetic variants underlying susceptibility to disorders such as type-2 diabetes and cardiovascular disease which are highly prevalent amongst South Asians.

...read moreread less

Journal Article•DOI•

Discovery of biclonal origin and a novel oncogene SLC12A5 in colon cancer by single-cell sequencing

[...]

Chang Yu, Jun Yu¹, Xiaotian Yao, William K.K. Wu¹, Youyong Lu², Senwei Tang¹, Xiangchun Li, Li Bao, Xiaoxing Li¹, Yong Hou³, Renhua Wu, Min Jian, Ruoyan Chen⁴, Fan Zhang⁵, Lixia Xu¹, Fan Fan, Jun He¹, Qiaoyi Liang¹, Hongyi Wang², Xueda Hu, Minghui He, Xiang Zhang¹, Hancheng Zheng, Qibin Li, Hanjie Wu, Yan Chen, Xu Yang, Shida Zhu, Xun Xu, Huanming Yang, Jian Wang, Xiuqing Zhang, Joseph J.Y. Sung¹, Yingrui Li, Jun Wang⁶ - Show less +31 more•Institutions (6)

The Chinese University of Hong Kong¹, Peking University², Southeast University³, University of Hong Kong⁴, University of Michigan⁵, Macau University of Science and Technology⁶

04 Apr 2014-Cell Research

TL;DR: This study provides the first exome-wide evidence at single-cell level supporting that colon cancer could be of a biclonal origin, and suggests that low-prevalence mutations in a cohort may also play important protumorigenic roles at the individual level.

...read moreread less

Abstract: Single-cell sequencing is a powerful tool for delineating clonal relationship and identifying key driver genes for personalized cancer management. Here we performed single-cell sequencing analysis of a case of colon cancer. Population genetics analyses identified two independent clones in tumor cell population. The major tumor clone harbored APC and TP53 mutations as early oncogenic events, whereas the minor clone contained preponderant CDC27 and PABPC1 mutations. The absence of APC and TP53 mutations in the minor clone supports that these two clones were derived from two cellular origins. Examination of somatic mutation allele frequency spectra of additional 21 whole-tissue exome-sequenced cases revealed the heterogeneity of clonal origins in colon cancer. Next, we identified a mutated gene SLC12A5 that showed a high frequency of mutation at the single-cell level but exhibited low prevalence at the population level. Functional characterization of mutant SLC12A5 revealed its potential oncogenic effect in colon cancer. Our study provides the first exome-wide evidence at single-cell level supporting that colon cancer could be of a biclonal origin, and suggests that low-prevalence mutations in a cohort may also play important protumorigenic roles at the individual level.

...read moreread less

Journal Article•DOI•

Diverse modes of genomic alteration in hepatocellular carcinoma

[...]

Suchit Jhunjhunwala¹, Zhaoshi Jiang¹, Eric Stawiski¹, Florian Gnad¹, Jinfeng Liu¹, Oleg Mayba¹, Pan Du¹, Jingyu Diao¹, Stephanie Johnson¹, Kwong-Fai Wong², Zhibo Gao, Yingrui Li, Thomas D. Wu¹, Sharookh B. Kapadia¹, Zora Modrusan¹, Dorothy French¹, John M. Luk², John M. Luk³, John M. Luk⁴, Somasekar Seshagiri¹, Zemin Zhang¹ - Show less +17 more•Institutions (4)

Genentech¹, University of Hong Kong², National University of Singapore³, Agency for Science, Technology and Research⁴

26 Aug 2014-Genome Biology

TL;DR: Deep-sequence 42 HCC patients with a combination of whole genome, exome and transcriptome sequencing identify the mutational landscape of HCC and find frequent mutations in TP53, CTNNB1 and AXIN1, and rare but likely functional mutations in BAP1 and IDH1.

...read moreread less

Abstract: Background Hepatocellular carcinoma (HCC) is a heterogeneous disease with high mortality rate. Recent genomic studies have identified TP53, AXIN1, and CTNNB1 as the most frequently mutated genes. Lower frequency mutations have been reported in ARID1A, ARID2 and JAK1. In addition, hepatitis B virus (HBV) integrations into the human genome have been associated with HCC.

...read moreread less

Journal Article•DOI•

Concurrent alterations in TERT, KDM6A, and the BRCA pathway in bladder cancer.

[...]

Michael L. Nickerson¹, Garrett M. Dancik², Kate M. Im¹, Michael G. Edwards³, Sevilay Turan¹, Joseph Brown, Christina T. Ruiz-Rodriguez¹, Charles Owens², James C. Costello², Guangwu Guo, Shirley Tsang, Yingrui Li, Quan Zhou, Zhiming Cai, Lee E. Moore¹, M. Scott Lucia², Michael Dean¹, Dan Theodorescu² - Show less +14 more•Institutions (3)

National Institutes of Health¹, University of Colorado Boulder², University of Colorado Denver³

15 Sep 2014-Clinical Cancer Research

TL;DR: This study is the first to identify frequent BAP1 and BRCA pathway alterations in bladder cancer, show TERT promoter alterations are independent of other bladder cancer gene alterations, and show KDM6A loss is a driver of the bladder cancer phenotype.

...read moreread less

Abstract: Purpose: Genetic analysis of bladder cancer has revealed a number of frequently altered genes, including frequent alterations of the telomerase ( TERT ) gene promoter, although few altered genes have been functionally evaluated. Our objective is to characterize alterations observed by exome sequencing and sequencing of the TERT promoter, and to examine the functional relevance of histone lysine (K)–specific demethylase 6A ( KDM6A/UTX ), a frequently mutated histone demethylase, in bladder cancer. Experimental Design: We analyzed bladder cancer samples from 54 U.S. patients by exome and targeted sequencing and confirmed somatic variants using normal tissue from the same patient. We examined the biologic function of KDM6A using in vivo and in vitro assays. Results: We observed frequent somatic alterations in BRCA1 associated protein-1 (BAP1) in 15% of tumors, including deleterious alterations to the deubiquitinase active site and the nuclear localization signal. BAP1 mutations contribute to a high frequency of tumors with breast cancer (BRCA) DNA repair pathway alterations and were significantly associated with papillary histologic features in tumors. BAP1 and KDM6A mutations significantly co-occurred in tumors. Somatic variants altering the TERT promoter were found in 69% of tumors but were not correlated with alterations in other bladder cancer genes. We examined the function of KDM6A , altered in 24% of tumors, and show depletion in human bladder cancer cells, enhanced in vitro proliferation, in vivo tumor growth, and cell migration. Conclusions: This study is the first to identify frequent BAP1 and BRCA pathway alterations in bladder cancer, show TERT promoter alterations are independent of other bladder cancer gene alterations, and show KDM6A loss is a driver of the bladder cancer phenotype. Clin Cancer Res; 20(18); 4935–48. ©2014 AACR .

...read moreread less

Journal Article•DOI•

High-coverage sequencing and annotated assemblies of the budgerigar genome

[...]

Ganeshkumar Ganapathy¹, Jason T. Howard¹, James M. Ward², Jianwen Li, Bo Li, Yingrui Li, Yingqi Xiong, Yong Zhang, Shiguo Zhou³, David C. Schwartz³, Michael C. Schatz⁴, Robert Aboukhalil⁴, Olivier Fedrigo¹, Lisa Bukovnik¹, Ty Wang², Greg Wray¹, Isabelle Rasolonjatovo⁵, Roger Winer, James R. Knight, Sergey Koren⁶, Wesley C. Warren⁷, Guojie Zhang, Adam M. Phillippy⁶, Erich D. Jarvis¹ - Show less +20 more•Institutions (7)

Duke University¹, National Institutes of Health², University of Wisconsin-Madison³, Cold Spring Harbor Laboratory⁴, Illumina⁵, University of Maryland, College Park⁶, Washington University in St. Louis⁷

08 Jul 2014-GigaScience

TL;DR: Across several quality metrics, these budgerigar assemblies are comparable to or better than the chicken and zebra finch genome assemblies built from traditional Sanger sequencing reads, and are sufficient to analyze regions that are difficult to sequence and assemble.

...read moreread less

Abstract: Background: Parrots belong to a group of behaviorally advanced vertebrates and have an advanced ability of vocal learning relative to other vocal-learning birds. They can imitate human speech, synchronize their body movements to a rhythmic beat, and understand complex concepts of referential meaning to sounds. However, little is known about the genetics of these traits. Elucidating the genetic bases would require whole genome sequencing and a robust assembly of a parrot genome. Findings: We present a genomic resource for the budgerigar, an Australian Parakeet (Melopsittacus undulatus) – the most widely studied parrot species in neuroscience and behavior. We present genomic sequence data that includes over 300× raw read coverage from multiple sequencing technologies and chromosome optical maps from a single male animal. The reads and optical maps were used to create three hybrid assemblies representing some of the largest genomic scaffolds to date for a bird; two of which were annotated based on similarities to reference sets of non-redundant human, zebra finch and chicken proteins, and budgerigar transcriptome sequence assemblies. The sequence reads for this project were in part generated and used for both the Assemblathon 2 competition and the first de novo assembly of a giga-scale vertebrate genome utilizing PacBio single-molecule sequencing. Conclusions: Across several quality metrics, these budgerigar assemblies are comparable to or better than the chicken and zebra finch genome assemblies built from traditional Sanger sequencing reads, and are sufficient to analyze regions that are difficult to sequence and assemble, including those not yet assembled in prior bird genomes, and promoter regions of genes differentially regulated in vocal learning brain regions. This work provides valuable data and material for genome technology development and for investigating the genomics of complex behavioral traits.

...read moreread less

Journal Article•DOI•

Whole genome sequencing of Ethiopian highlanders reveals conserved hypoxia tolerance genes

[...]

Nitin Udpa¹, Roy Ronen¹, Dan Zhou¹, Junbin Liang, Tsering Stobdan¹, Otto Appenzeller², Ye Yin, Yuanping Du, Lixia Guo, Rui Cao, Yu Wang, Xin Jin, Chen Huang, Wenlong Jia, Dandan Cao, Guangwu Guo, Victoria E. Claydon³, Roger Hainsworth⁴, Jorge L. Gamboa⁵, Mehila Zibenigus⁶, Guta Zenebe⁶, Jin Xue¹, Siqi Liu⁷, Kelly A. Frazer¹, Yingrui Li, Vineet Bafna¹, Gabriel G. Haddad¹, Gabriel G. Haddad⁸ - Show less +24 more•Institutions (8)

University of California, San Diego¹, Marathon Oil², University of British Columbia³, University of Leeds⁴, Vanderbilt University Medical Center⁵, Addis Ababa University⁶, Beijing Institute of Genomics⁷, Boston Children's Hospital⁸

20 Feb 2014-Genome Biology

TL;DR: The first whole genome resequencing-based analysis identifying genes that likely modulate high altitude adaptation in native Ethiopians residing at 3,500 m above sea level on Bale Plateau or Chennek field in Ethiopia highlights the importance of whole genome sequencing for investigating adaptation by natural selection.

...read moreread less

Abstract: Although it has long been proposed that genetic factors contribute to adaptation to high altitude, such factors remain largely unverified. Recent advances in high-throughput sequencing have made it feasible to analyze genome-wide patterns of genetic variation in human populations. Since traditionally such studies surveyed only a small fraction of the genome, interpretation of the results was limited. We report here the results of the first whole genome resequencing-based analysis identifying genes that likely modulate high altitude adaptation in native Ethiopians residing at 3,500 m above sea level on Bale Plateau or Chennek field in Ethiopia. Using cross-population tests of selection, we identify regions with a significant loss of diversity, indicative of a selective sweep. We focus on a 208 kbp gene-rich region on chromosome 19, which is significant in both of the Ethiopian subpopulations sampled. This region contains eight protein-coding genes and spans 135 SNPs. To elucidate its potential role in hypoxia tolerance, we experimentally tested whether individual genes from the region affect hypoxia tolerance in Drosophila. Three genes significantly impact survival rates in low oxygen: cic, an ortholog of human CIC, Hsl, an ortholog of human LIPE, and Paf-AHα, an ortholog of human PAFAH1B3. Our study reveals evolutionarily conserved genes that modulate hypoxia tolerance. In addition, we show that many of our results would likely be unattainable using data from exome sequencing or microarray studies. This highlights the importance of whole genome sequencing for investigating adaptation by natural selection.

...read moreread less

Journal Article•DOI•

Sequencing-based approach identified three new susceptibility loci for psoriasis

[...]

Yujun Sheng¹, Xin Jin², Jinhua Xu¹, Jinping Gao³, Xiaoqing Du³, Dawei Duan³, Bing Li³, Jinhua Zhao³, Wenying Zhan³, Huayang Tang³, Xianfa Tang³, Yang Li³, Hui Cheng³, Xianbo Zuo³, Junpu Mei, Fusheng Zhou³, Bo Liang³, Gang Chen³, Changbing Shen³, Hongzhou Cui³, Xiaoguang Zhang³, Change Zhang³, Wenjun Wang³, Xiaodong Zheng³, Xing Fan³, Zaixing Wang³, Feng-Li Xiao³, Yong Cui³, Yingrui Li, Jun Wang⁴, Sen Yang³, Lei Xu⁵, Liangdan Sun³, Xuejun Zhang³ - Show less +30 more•Institutions (5)

Fudan University¹, South China University of Technology², Chinese Ministry of Education³, University of Copenhagen⁴, The Chinese University of Hong Kong⁵

09 Jul 2014-Nature Communications

TL;DR: The results of this study increase the number of confirmed Psoriasis risk loci and provide novel insight into the pathogenesis of psoriasis.

...read moreread less

Abstract: In a previous large-scale exome sequencing analysis for psoriasis, we discovered seven common and low-frequency missense variants within six genes with genome-wide significance. Here we describe an in-depth analysis of noncoding variants based on sequencing data (10,727 cases and 10,582 controls) with replication in an independent cohort of Han Chinese individuals consisting of 4,480 cases and 6,521 controls to identify additional psoriasis susceptibility loci. We confirmed four known psoriasis susceptibility loci (IL12B, IFIH1, ERAP1 and RNF114; 2.30 × 10(-20)≤P≤2.41 × 10(-7)) and identified three new susceptibility loci: 4q24 (NFKB1) at rs1020760 (P=2.19 × 10(-8)), 12p13.3 (CD27-LAG3) at rs758739 (P=4.08 × 10(-8)) and 17q12 (IKZF3) at rs10852936 (P=1.96 × 10(-8)). Two suggestive loci, 3p21.31 and 17q25, are also identified with P<1.00 × 10(-6). The results of this study increase the number of confirmed psoriasis risk loci and provide novel insight into the pathogenesis of psoriasis.

...read moreread less

Phylogenomics Resolves The Timing And Pattern Of Insect Evolution: Supplementary File Archives.

[...]

Bernhard Misof, Shanlin Liu, Karen Meusemann, Ralph S. Peters, Alexander Donath, Christoph Mayer, Paul B. Frandsen, Jessica L. Ware, Tomas Flouri, Rolf G. Beutel, Oliver Niehuis, Malte Petersen, Fernando Izquierdo-Carrasco, Torsten Wappler, Jes Rust, Andre J. Aberer, Ulrike Aspöck, Horst Aspöck, Daniela Bartel, Alexander Blanke, Simon Berger, Alexander Böhm, Thomas R. Buckley, Brett Calcott, Junqing Chen, Frank Friedrich, Makiko Fukui, Mari Fujita, Carola Greve, Peter Grobe, Shengchang Gu, Ying Huang, Lars S. Jermiin, Akito Y. Kawahara, Lars Krogmann, Martin Kubiak, Robert Lanfear, Harald Letsch, Yiyuan Li, Zhenyu Li, Jiguang Li, Haorong Lu, Ryuichiro Machida, Yuta Mashimo, Pashalia Kapli, Duane D. McKenna, Guanliang Meng, Yasutaka Nakagaki, José Luis Navarrete-Heredia, Michael Ott, Yanxiang Ou, Günther Pass, Lars Podsiadlowski, Hans Pohl, Björn M. von Reumont, Kai Schütte, Kaoru Sekiya, Shota Shimizu, Adam Slipinski, Alexandros Stamatakis, Wenhui Song, Xu Su, Nikolaus U. Szucsich, Meihua Tan, Xuemei Tan, Min Tang, Jingbo Tang, Gerald Timelthaler, Shigekazu Tomizuka, Michelle D. Trautwein, Xiaoli Tong, Toshiki Uchifune, Manfred Walzl, Brian M. Wiegmann, Jeanne Wilbrandt, Benjamin Wipfler, Thomas K. F. Wong, Qiong Wu, Gengxiong Wu, Yinlong Xie, Shenzhou Yang, Qing Yang, David K. Yeates, Kazunori Yoshizawa, Qing Zhang, Rui Zhang, Wenwei Zhang, Yunhui Zhang, Jing Zhao, Chengran Zhou, Lili Zhou, Tanja Ziesmann, Shijie Zou, Yingrui Li, Xun Xu, Yong Zhang, Huanming Yang, Jian Wang, Jun Wang, Karl M. Kjer, Xin Zhou - Show less +97 more

01 Jan 2014

TL;DR: A phylogenetic analysis of protein-coding genes from all major insect orders and close relatives was performed by Misof et al. as discussed by the authors, who used this resolved phylogenetic tree together with fossil analysis to date the origin of insects to ~479 million years ago and to resolve longcontroversial subjects in insect phylogeny.

...read moreread less

Abstract: Toward an insect evolution resolution Insects are the most diverse group of animals, with the largest number of species. However, many of the evolutionary relationships between insect species have been controversial and difficult to resolve. Misof et al. performed a phylogenomic analysis of protein-coding genes from all major insect orders and close relatives, resolving the placement of taxa. The authors used this resolved phylogenetic tree together with fossil analysis to date the origin of insects to ~479 million years ago and to resolve long-controversial subjects in insect phylogeny. Science, this issue p. 763 The phylogeny of all major insect lineages reveals how and when insects diversified. Insects are the most speciose group of animals, but the phylogenetic relationships of many major lineages remain unresolved. We inferred the phylogeny of insects from 1478 protein-coding genes. Phylogenomic analyses of nucleotide and amino acid sequences, with site-specific nucleotide or domain-specific amino acid substitution models, produced statistically robust and congruent results resolving previously controversial phylogenetic relations hips. We dated the origin of insects to the Early Ordovician [~479 million years ago (Ma)], of insect flight to the Early Devonian (~406 Ma), of major extant lineages to the Mississippian (~345 Ma), and the major diversification of holometabolous insects to the Early Cretaceous. Our phylogenomic study provides a comprehensive reliable scaffold for future comparative analyses of evolutionary innovations among insects.

...read moreread less

Journal Article•DOI•

Whole-genome sequencing of matched primary and metastatic hepatocellular carcinomas

[...]

Limei Ouyang, Jeeyun Lee¹, Cheol-Keun Park¹, Mao Mao², Yujian Shi, Zhuolin Gong, Hancheng Zheng, Yingrui Li, Yonggang Zhao, Guangbiao Wang, Huiling Fu, Jhingook Kim¹, Ho Yeong Lim¹ - Show less +9 more•Institutions (2)

Samsung Medical Center¹, Pfizer²

09 Jan 2014-BMC Medical Genomics

TL;DR: Preservations in genomic profiles from liver primary tumors to metachronous lung metastases indicate that the genomic features during tumorigenesis may be retained during metastasis, which may explain the clinical observation that both primary and metastatic tumors are usually sensitive or resistant to the same systemic treatments.

...read moreread less

Abstract: To gain biological insights into lung metastases from hepatocellular carcinoma (HCC), we compared the whole-genome sequencing profiles of primary HCC and paired lung metastases. We used whole-genome sequencing at 33X-43X coverage to profile somatic mutations in primary HCC (HBV+) and metachronous lung metastases (> 2 years interval). In total, 5,027-13,961 and 5,275-12,624 somatic single-nucleotide variants (SNVs) were detected in primary HCC and lung metastases, respectively. Generally, 38.88-78.49% of SNVs detected in metastases were present in primary tumors. We identified 65–221 structural variations (SVs) in primary tumors and 60–232 SVs in metastases. Comparison of these SVs shows very similar and largely overlapped mutated segments between primary and metastatic tumors. Copy number alterations between primary and metastatic pairs were also found to be closely related. Together, these preservations in genomic profiles from liver primary tumors to metachronous lung metastases indicate that the genomic features during tumorigenesis may be retained during metastasis. We found very similar genomic alterations between primary and metastatic tumors, with a few mutations found specifically in lung metastases, which may explain the clinical observation that both primary and metastatic tumors are usually sensitive or resistant to the same systemic treatments.

...read moreread less

Journal Article•DOI•

Exome capture from saliva produces high quality genomic and metagenomic data

[...]

Jeffrey M. Kidd¹, Jeffrey M. Kidd², Thomas J. Sharpton³, Thomas J. Sharpton⁴, Dean Bobo⁵, Paul Norman¹, Alicia R. Martin¹, Meredith L. Carpenter¹, Martin Sikora¹, Christopher R. Gignoux³, Neda Nemat-Gorgani¹, Alexandra Adams¹, Moraima Guadalupe⁶, Xiaosen Guo, Qiang Feng, Yingrui Li, Xiao Liu, Peter Parham¹, Eileen G. Hoal⁷, Marcus W. Feldman¹, Katherine S. Pollard³, Jeffrey D. Wall³, Carlos Bustamante¹, Brenna M. Henn⁵, Brenna M. Henn¹ - Show less +21 more•Institutions (7)

Stanford University¹, University of Michigan², University of California, San Francisco³, Oregon State University⁴, Stony Brook University⁵, Agilent Technologies⁶, Stellenbosch University⁷

04 Apr 2014-BMC Genomics

TL;DR: It is shown that exome capture of saliva-derived DNA yields sufficient non-human sequences to characterize oral microbial communities, including detection of bacteria linked to oral disease (e.g. Prevotella melaninogenica).

...read moreread less

Abstract: Targeted capture of genomic regions reduces sequencing cost while generating higher coverage by allowing biomedical researchers to focus on specific loci of interest, such as exons. Targeted capture also has the potential to facilitate the generation of genomic data from DNA collected via saliva or buccal cells. DNA samples derived from these cell types tend to have a lower human DNA yield, may be degraded from age and/or have contamination from bacteria or other ambient oral microbiota. However, thousands of samples have been previously collected from these cell types, and saliva collection has the advantage that it is a non-invasive and appropriate for a wide variety of research. We demonstrate successful enrichment and sequencing of 15 South African KhoeSan exomes and 2 full genomes with samples initially derived from saliva. The expanded exome dataset enables us to characterize genetic diversity free from ascertainment bias for multiple KhoeSan populations, including new exome data from six HGDP Namibian San, revealing substantial population structure across the Kalahari Desert region. Additionally, we discover and independently verify thirty-one previously unknown KIR alleles using methods we developed to accurately map and call the highly polymorphic HLA and KIR loci from exome capture data. Finally, we show that exome capture of saliva-derived DNA yields sufficient non-human sequences to characterize oral microbial communities, including detection of bacteria linked to oral disease (e.g. Prevotella melaninogenica). For comparison, two samples were sequenced using standard full genome library preparation without exome capture and we found no systematic bias of metagenomic information between exome-captured and non-captured data. DNA from human saliva samples, collected and extracted using standard procedures, can be used to successfully sequence high quality human exomes, and metagenomic data can be derived from non-human reads. We find that individuals from the Kalahari carry a higher oral pathogenic microbial load than samples surveyed in the Human Microbiome Project. Additionally, rare variants present in the exomes suggest strong population structure across different KhoeSan populations.

...read moreread less

Journal Article•DOI•

Variation and association to diabetes in 2000 full mtDNA sequences mined from an exome study in a Danish population

[...]

Shengting Li¹, Søren Besenbacher¹, Yingrui Li, Karsten Kristiansen², Niels Grarup², Anders Albrechtsen², Thomas Sparsø², Thorfinn Sand Korneliussen², Torben Hansen², Jun Wang, Rasmus Nielsen³, Oluf Pedersen², Lars Bolund¹, Mikkel H. Schierup¹ - Show less +10 more•Institutions (3)

Aarhus University¹, University of Copenhagen², University of California, Berkeley³

22 Jan 2014-European Journal of Human Genetics

TL;DR: Full mtDNA sequences are mined from an exome capture data set of 2000 Danes, showing that it is possible to get high-quality full-genome sequences of the mitochondrion from this resource and characterising the variation found in the mtDNA sequence in Danes.

...read moreread less

Abstract: In this paper, we mine full mtDNA sequences from an exome capture data set of 2000 Danes, showing that it is possible to get high-quality full-genome sequences of the mitochondrion from this resource. The sample includes 1000 individuals with type 2 diabetes and 1000 controls. We characterise the variation found in the mtDNA sequence in Danes and relate the variation to diabetes risk as well as to several blood phenotypes of the controls but find no significant associations. We report 2025 polymorphisms, of which 393 have not been reported previously. These 393 mutations are both very rare and estimated to be caused by very recent mutations but individuals with type 2 diabetes do not possess more of these variants. Population genetics analysis using Bayesian skyline plot shows a recent history of rapid population growth in the Danish population in accordance with the fact that >40% of variable sites are observed as singletons.

...read moreread less

Journal Article•DOI•

Whole-exome sequencing for the identification of susceptibility genes of Kashin-Beck disease.

[...]

Zhenxing Yang¹, Yu Xu, Hongrong Luo¹, Xiaohong Ma¹, Qiang Wang¹, Yingcheng Wang¹, Wei Deng¹, Tao Jiang, Guangqing Sun, Tingting He, Jingchu Hu, Yingrui Li, Jun Wang, Tao Li¹, Xun Hu¹ - Show less +11 more•Institutions (1)

Sichuan University¹

28 Apr 2014-PLOS ONE

TL;DR: HLA-DRB1 and CD2AP gene were identified to be among the susceptibility genes of KBD, thus supporting the role of the autoimmune response in KBD and the possibility of shared etiology between osteoarthritis, rheumatoid arthritis, and KBD.

...read moreread less

Abstract: Objective To identify and investigate the susceptibility genes of Kashin–Beck disease (KBD) in Chinese population. Methods Whole-exome capturing and sequencing technology was used for the detection of genetic variations in 19 individuals from six families with high incidence of KBD. A total of 44 polymorphisms from 41 genes were genotyped from a total of 144 cases and 144 controls by using MassARRAY under the standard protocol from Sequenom. Association was applied on the data by using PLINK1.07. Results In the sequencing stage, each sample showed approximately 70-fold coverage, thus covering more than 99% of the target regions. Among the single nucleotide polymorphisms (SNPs) used in the transmission disequilibrium test, 108 had a p-value of <0.01, whereas 1056 had a p-value of <0.05. Kyoto Encyclopedia of Genes and Genomes(KEGG) pathway analysis indicates that these SNPs focus on three major pathways: regulation of actin cytoskeleton, focal adhesion, and metabolic pathways. In the validation stage, single locus effects revealed that two of these polymorphisms (rs7745040 and rs9275295) in the human leukocyte antigen (HLA)-DRB1 gene and one polymorphism (rs9473132) in CD2-associated protein (CD2AP) gene have a significant statistical association with KBD. Conclusions HLA-DRB1 and CD2AP gene were identified to be among the susceptibility genes of KBD, thus supporting the role of the autoimmune response in KBD and the possibility of shared etiology between osteoarthritis, rheumatoid arthritis, and KBD.

...read moreread less

Journal Article•DOI•

Two-step source tracing strategy of Yersinia pestis and its historical epidemiology in a specific region.

[...]

Yanfeng Yan, Hu Wang, Dongfang Li, Xianwei Yang, Zuyun Wang, Zhizhen Qi, Qingwen Zhang, Baizhong Cui, Zhaobiao Guo, Chang Yu, Jun Wang, Jian Wang, Guangming Liu¹, Yajun Song, Yingrui Li, Yujun Cui, Ruifu Yang - Show less +13 more•Institutions (1)

National University of Defense Technology¹

09 Jan 2014-PLOS ONE

TL;DR: The analytical strategy developed here will be of great help in fighting against the outbreaks of emerging infectious diseases, by pinpointing the source of pathogens rapidly with genomic epidemiological data and microbial forensics information.

...read moreread less

Abstract: Source tracing of pathogens is critical for the control and prevention of infectious diseases. Genome sequencing by high throughput technologies is currently feasible and popular, leading to the burst of deciphered bacterial genome sequences. Utilizing the flooding genomic data for source tracing of pathogens in outbreaks is promising, and challenging as well. Here, we employed Yersinia pestis genomes from a plague outbreak at Xinghai county of China in 2009 as an example, to develop a simple two-step strategy for rapid source tracing of the outbreak. The first step was to define the phylogenetic position of the outbreak strains in a whole species tree, and the next step was to provide a detailed relationship across the outbreak strains and their suspected relatives. Through this strategy, we observed that the Xinghai plague outbreak was caused by Y. pestis that circulated in the local plague focus, where the majority of historical plague epidemics in the Qinghai-Tibet Plateau may originate from. The analytical strategy developed here will be of great help in fighting against the outbreaks of emerging infectious diseases, by pinpointing the source of pathogens rapidly with genomic epidemiological data and microbial forensics information.

...read moreread less

Journal Article•DOI•

Correction: The genome sequence of the ground tit Pseudopodoces humilis provides insights into its adaptation to high altitude

[...]

Qingle Cai, Xiaoju Qian, Yongshan Lang, Yadan Luo, Jiaohui Xu, Shengkai Pan, Yuanyuan Hui, Caiyun Gou, Yue Cai, Meirong Hao, Jinyang Zhao, Songbo Wang, Zhaobao Wang, Xinming Zhang, Rongjun He, Jinchao Liu, Longhai Luo, Yingrui Li, Jun Wang¹, Jun Wang² - Show less +16 more•Institutions (2)

University of Copenhagen¹, King Abdulaziz University²

14 Feb 2014-Genome Biology

TL;DR: The phylogeny of the ground tit was confirmed as not belonging to the Corvidae family but to the Paridae family, which reflects the classification of this species to the Estrildidae family.

...read moreread less

Abstract: 1. Fumin Lei is no longer listed as an author of this article. Instead, his helpful input is noted in the acknowledgements section. 2. The provisional version of this article mistakenly stated that zebra finch belongs to the Paridae family. We have now corrected this error to reflect the classification of this species to the Estrildidae family. 3. In the abstract of the provisional version of the article we stated that the phylogeny of the ground tit was confirmed as belonging to the Paridae family. We have now re-phrased this sentence to say that ground tit phylogeny was confirmed as not belonging to the Corvidae family. 4. In the conclusions of the provisional version of the article we stated that the phylogeny of the ground tit was confirmed as not belonging to the Corvidae family but to the Paridae family. We have now re-phrased this conclusion to say that ground tit phylogeny was confirmed as not belonging to the Corvidae family.

...read moreread less

Data from: Phylogenomics resolves the timing and pattern of insect evolution

[...]

01 Jan 2014

Patent•

Method of Gap Closing in Nucleotide Sequence and Apparatus Thereof

[...]

Binghang Liu, Zhenyu Li, Yanxiang Chen, Yingrui Li, Jian Wang, Jun Wang, Huanming Yang - Show less +3 more

27 Nov 2014

TL;DR: In this paper, the authors proposed a method of gap closing in nucleotide sequence, which consists of selecting reads having an overlap with one end of the first contig close to the gap as a set of reads for gap closing, selecting reads with a shortest overlap with the first-closest contig in the set of read candidates, and determining whether reads having no overlapping relationship with the candidate read present in the read candidates present for gap-closing.

...read moreread less

Abstract: Provided is a method of gap closing in nucleotide sequence. The nucleic acid sequence comprises a first contig at one end of a gap in an unassembled region, and a second contig at the other end of the gap in the unassembled region. The method comprises: selecting reads having an overlap with one end of the first contig close to the gap as a set of reads for gap closing; selecting reads having a shortest overlap with the first contig in the set of reads for gap closing as a candidate read; determining whether reads having an overlapping length with the first contig shorter than an overlapping length between the candidate read and the first contig present in the set of reads for gap closing, and determining whether reads having no overlapping relationship with the candidate read present in the set of reads for gap closing; obtaining a result of presenting an extension conflict, and determining an unconfident candidate read, if reads having an overlapping length with the first contig shorter than an overlapping length between the candidate read and the first contig present in the set of reads for gap closing, reads having no overlapping relationship with the candidate read present in the set of reads for gap closing, or both reads having an overlapping length with the first contig shorter than an overlapping length between the candidate read and the first contig, and reads having no overlapping relationship with the candidate read present in the set of reads for gap closing; reselecting the candidate read until obtaining a confident candidate read, if the candidate read is unconfident; connecting the confident candidate read to the first contig, to form a new first contig; determining whether one end of the new first contig close to the gap has an overlap with one end of the second contig close to the gap; performing the step of selecting the set of reads for gap closing on the basis of the new first contig, if the one end of the new first contig close to the gap has no overlap with the one end of the second contig close to the gap, wherein the first contig in the step of selecting the set of reads for gap closing is replaced with the new first contig; connecting the new first contig to the second contig to complete gap closing, if one end of the new first contig close to the gap has an overlap with one end of the second contig close to the gap.

...read moreread less