Showing papers by "Yingrui Li" published in 2010

Journal Article•DOI•

A human gut microbial gene catalogue established by metagenomic sequencing

Junjie Qin¹, Ruiqiang Li¹, Jeroen Raes², Manimozhiyan Arumugam, Kristoffer Sølvsten Burgdorf, Chaysavanh Manichanh, Trine Nielsen, Nicolas Pons³, Florence Levenez³, Takuji Yamada, Daniel R. Mende, Junhua Li¹, Junming Xu¹, Shaochuan Li¹, Dongfang Li¹, Jianjun Cao¹, Bo Wang¹, Huiqing Liang¹, Huisong Zheng¹, Yinlong Xie¹, Julien Tap³, Patricia Lepage³, Marcelo Bertalan, Jean-Michel Batto³, Torben Hansen, Denis Le Paslier, Allan Linneberg, H. Bjørn Nielsen, Eric Pelletier, Pierre Renault³, Thomas Sicheritz-Pontén, Keith Turner⁴, Hongmei Zhu¹, Chang Yu¹, Shengting Li¹, Min Jian¹, Yan Zhou¹, Yingrui Li¹, Xiuqing Zhang¹, Songgang Li¹, Nan Qin¹, Huanming Yang¹, Jian Wang¹, Søren Brunak, Joël Doré³, Francisco Guarner⁵, Karsten Kristiansen, Oluf Pedersen, Julian Parkhill, Jean Weissenbach, Peer Bork, S. Dusko Ehrlich³, Jun Wang¹ - Show less +49 more•Institutions (5)

Beijing Genomics Institute¹, Vrije Universiteit Brussel², Institut national de la recherche agronomique³, Wellcome Trust Sanger Institute⁴, Vall d'Hebron University Hospital⁵

04 Mar 2010-Nature

TL;DR: The Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence, from faecal samples of 124 European individuals are described, indicating that the entire cohort harbours between 1,000 and 1,150 prevalent bacterial species and each individual at least 160 such species.

Abstract: To understand the impact of gut microbes on human health and well-being it is crucial to assess their genetic potential. Here we describe the Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence, from faecal samples of 124 European individuals. The gene set, ~150 times larger than the human gene complement, contains an overwhelming majority of the prevalent (more frequent) microbial genes of the cohort and probably includes a large proportion of the prevalent human intestinal microbial genes. The genes are largely shared among individuals of the cohort. Over 99% of the genes are bacterial, indicating that the entire cohort harbours between 1,000 and 1,150 prevalent bacterial species and each individual at least 160 such species, which are also largely shared. We define and describe the minimal gut metagenome and the minimal gut bacterial genome in terms of functions present in all individuals and most bacteria, respectively.

9,268 citations

Journal Article•DOI•

De novo assembly of human genomes with massively parallel short read sequencing

Ruiqiang Li¹, Hongmei Zhu, Jue Ruan, Wubin Qian, Xiaodong Fang, Zhongbin Shi, Yingrui Li, Shengting Li², Gao Shan, Karsten Kristiansen, Songgang Li, Huanming Yang, Jing Wang, Jun Wang - Show less +10 more•Institutions (2)

Beijing Genomics Institute¹, Aarhus University²

01 Feb 2010-Genome Research

TL;DR: The development of this de novo short read assembly method creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost-effective way.

Abstract: Next-generation massively parallel DNA sequencing technologies provide ultrahigh throughput at a substantially lower unit data cost; however, the data are very short read length sequences, making de novo assembly extremely challenging. Here, we describe a novel method for de novo assembly of large genomes from short read sequences. We successfully assembled both the Asian and African human genome sequences, achieving an N50 contig size of 7.4 and 5.9 kilobases (kb) and scaffold of 446.3 and 61.9 kb, respectively. The development of this de novo short read assembly method creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost-effective way.
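
The N50 figures quoted in this abstract summarise how contiguous an assembly is. As a minimal sketch (not the authors' code), N50 can be computed as the length of the contig at which the running total of lengths, taken from longest to shortest, first reaches half of the total assembled length. The function name `n50` and the toy lengths below are illustrative assumptions, not data from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of contig (or scaffold) lengths.

    N50 is the length L such that contigs of length >= L together
    account for at least half of the total assembled bases.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)   # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy assembly (lengths in base pairs), for illustration only.
toy_contigs = [8000, 7400, 6000, 3000, 1000]
print(n50(toy_contigs))  # 7400
```

The same function applies to scaffold lengths; only the input list changes.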

2,760 citations

Journal Article•DOI•

Sequencing of 50 Human Exomes Reveals Adaptation to High Altitude

Xin Yi, Yu Liang¹, Emilia Huerta-Sanchez², Xin Jin³, Zha Xi Ping Cuo¹, John E. Pool², John E. Pool⁴, Xun Xu, Hui Jiang, Nicolas Vinckenbosch², Thorfinn Sand Korneliussen⁵, Hancheng Zheng³, Tao Liu, Weiming He³, Kui Li¹, Ruibang Luo³, Xifang Nie, Honglong Wu⁶, Meiru Zhao, Hongzhi Cao⁶, Jing Zou, Ying Shan³, Shuzheng Li, Qi Yang, Asan¹, Peixiang Ni, Geng Tian¹, Junming Xu, Xiao Liu, Tao Jiang⁶, Renhua Wu, Guangyu Zhou, Meifang Tang, Junjie Qin, Tong Wang, Shuijian Feng, Guohong Li, Huasang, Jiangbai Luosang, Wei Wang, Fang Chen, Yading Wang, Xiaoguang Zheng¹, Zhuo Li, Zhuoma Bianba, Ge Yang, Xiznping Wang, Shuhui Tang, Guoyi Gao, Yong Chen, Zhen Luo, Lamu Gusang, Zheng Cao, Qinghui Zhang, Wei-Han OuYang, Xiaoli Ren, Huiqing Liang, Huisong Zheng, Yebo Huang, Jingxiang Li, Lars Bolund, Karsten Kristiansen⁵, Yingrui Li, Yong Zhang, Xiuqing Zhang, Ruiqiang Li⁵, Songgang Li, Huanming Yang, Rasmus Nielsen⁵, Rasmus Nielsen², Jun Wang⁵, Jing Wang - Show less +68 more•Institutions (6)

Chinese Academy of Sciences¹, University of California, Berkeley², South China University of Technology³, University of California, Davis⁴, University of Copenhagen⁵, Shenzhen University⁶

02 Jul 2010-Science

TL;DR: A population genomic survey has revealed a functionally important locus in genetic adaptation to high altitude, and the strongest signal of natural selection came from endothelial Per-Arnt-Sim domain protein 1 (EPAS1), a transcription factor involved in response to hypoxia.

Abstract: Residents of the Tibetan Plateau show heritable adaptations to extreme altitude. We sequenced 50 exomes of ethnic Tibetans, encompassing coding sequences of 92% of human genes, with an average coverage of 18x per individual. Genes showing population-specific allele frequency changes, which represent strong candidates for altitude adaptation, were identified. The strongest signal of natural selection came from endothelial Per-Arnt-Sim (PAS) domain protein 1 (EPAS1), a transcription factor involved in response to hypoxia. One single-nucleotide polymorphism (SNP) at EPAS1 shows a 78% frequency difference between Tibetan and Han samples, representing the fastest allele frequency change observed at any human gene to date. This SNP's association with erythrocyte abundance supports the role of EPAS1 in adaptation to hypoxia. Thus, a population genomic survey has revealed a functionally important locus in genetic adaptation to high altitude.

1,325 citations

Journal Article•DOI•

The sequence and de novo assembly of the giant panda genome

Ruiqiang Li, Wei Fan, Geng Tian¹, Hongmei Zhu, Lin He², Lin He³, Jing Cai¹, Jing Cai⁴, Quanfei Huang, Qingle Cai⁵, Bo Li, Yinqi Bai, Zhihe Zhang⁶, Ya-Ping Zhang⁴, Wen Wang⁴, Jun Li, Fuwen Wei¹, Heng Li⁷, Min Jian, Jianwen Li, Zhaolei Zhang⁸, Rasmus Nielsen⁹, Dawei Li, Wanjun Gu¹⁰, Zhentao Yang, Zhaoling Xuan, Oliver A. Ryder, Frederick C. Leung¹¹, Yan Zhou, Jianjun Cao, Xiao Sun¹⁰, Yonggui Fu¹², Xiaodong Fang, Xiaosen Guo, Bo Wang, Rong Hou⁶, Fujun Shen⁶, Bo Mu, Peixiang Ni, Runmao Lin, Wubin Qian, Guo-Dong Wang⁴, Guo-Dong Wang¹, Chang Yu, Wenhui Nie⁴, Jinhuan Wang⁴, Zhigang Wu, Huiqing Liang, Jiumeng Min⁵, Qi Wu¹, Shifeng Cheng⁵, Jue Ruan¹, Mingwei Wang, Zhongbin Shi, Ming Wen, Binghang Liu, Xiaoli Ren, Huisong Zheng, Dong Dong⁸, Kathleen Cook⁸, Gao Shan, Hao Zhang, Carolin Kosiol¹³, Xueying Xie¹⁰, Zuhong Lu¹⁰, Hancheng Zheng, Yingrui Li¹, Cynthia C. Steiner, Tommy Tsan-Yuk Lam¹¹, Siyuan Lin, Qinghui Zhang, Guoqing Li, Jing Tian, Timing Gong, Hongde Liu¹⁰, Dejin Zhang¹⁰, Lin Fang, Chen Ye, Juanbin Zhang, Wenbo Hu¹², Anlong Xu¹², Yuanyuan Ren, Guojie Zhang¹, Guojie Zhang⁴, Michael William Bruford¹⁴, Qibin Li¹, Lijia Ma¹, Yiran Guo¹, Na An, Yujie Hu¹, Yang Zheng¹, Yongyong Shi², Zhiqiang Li², Qing Liu, Yanling Chen, Jing Zhao, Ning Qu⁵, Shancen Zhao, Feng Tian, Xiaoling Wang, Haiyin Wang, Lizhi Xu, Xiao Liu, Tomas Vinar¹⁵, Yajun Wang¹⁶, Tak-Wah Lam¹¹, Siu-Ming Yiu¹¹, Shiping Liu¹⁷, Hemin Zhang, Desheng Li, Yan Huang, Xia Wang, Guohua Yang, Zhi Jiang, Junyi Wang, Nan Qin, Li Li, Jingxiang Li, Lars Bolund, Karsten Kristiansen¹⁸, Gane Ka-Shu Wong¹⁹, Maynard V. Olson²⁰, Xiuqing Zhang, Songgang Li, Huanming Yang, Jing Wang, Jun Wang¹⁸ - Show less +123 more•Institutions (20)

21 Jan 2010-Nature

TL;DR: Using next-generation sequencing technology alone, a draft sequence of the giant panda genome is generated and assembled, indicating that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition.

Abstract: Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes.

1,109 citations

Journal Article•DOI•

Ancient human genome sequence of an extinct Palaeo-Eskimo

Morten Rasmussen¹, Yingrui Li¹, Stinus Lindgreen¹, Jakob Skou Pedersen¹, Anders Albrechtsen¹, Ida Moltke¹, Mait Metspalu², Ene Metspalu², Toomas Kivisild³, Toomas Kivisild², Ramneek Gupta⁴, Marcelo Bertalan⁴, Kasper Nielsen⁴, M. Thomas P. Gilbert¹, Yong Wang⁵, Maanasa Raghavan⁶, Maanasa Raghavan¹, Paula F. Campos¹, Hanne Munkholm Kamp¹, Andrew Wilson⁷, Andrew Gledhill⁷, Silvana R. Tridico⁸, Silvana R. Tridico⁹, Michael Bunce⁸, Eline D. Lorenzen¹, Jonas Binladen¹, Xiaosen Guo¹, Jing Zhao¹, Xiuqing Zhang¹, Hao Zhang¹, Zhuo Li¹, Minfeng Chen¹, Ludovic Orlando¹⁰, Karsten Kristiansen¹, Mads Bak¹, Niels Tommerup¹, Christian Bendixen¹¹, Tracey Pierre³, Bjarne Grønnow, Morten Meldgaard¹, Claus Andreasen, S. A. Fedorova², S. A. Fedorova¹², Ludmila P. Osipova¹³, Thomas Higham⁶, Christopher Bronk Ramsey⁷, Thomas Hansen¹, Finn Cilius Nielsen¹, Michael H. Crawford¹⁴, Søren Brunak⁴, Søren Brunak¹, Thomas Sicheritz-Pontén⁴, Richard Villems², Rasmus Nielsen⁵, Rasmus Nielsen¹, Anders Krogh¹, Jun Wang¹, Eske Willerslev¹ - Show less +54 more•Institutions (14)

University of Copenhagen¹, Estonian Biocentre², University of Cambridge³, Technical University of Denmark⁴, University of California, Berkeley⁵, University of Oxford⁶, University of Bradford⁷, Murdoch University⁸, Australian Federal Police⁹, École normale supérieure de Lyon¹⁰, Aarhus University¹¹, Russian Academy¹², Russian Academy of Sciences¹³, University of Kansas¹⁴

11 Feb 2010-Nature

TL;DR: This genome sequence of an ancient human obtained from ∼4,000-year-old permafrost-preserved hair provides evidence for a migration from Siberia into the New World some 5,500 years ago, independent of that giving rise to the modern Native Americans and Inuit.

Abstract: We report here the genome sequence of an ancient human. Obtained from approximately 4,000-year-old permafrost-preserved hair, the genome represents a male individual from the first known culture to settle in Greenland. Sequenced to an average depth of 20x, we recover 79% of the diploid genome, an amount close to the practical limit of current sequencing technologies. We identify 353,151 high-confidence single-nucleotide polymorphisms (SNPs), of which 6.8% have not been reported previously. We estimate raw read contamination to be no higher than 0.8%. We use functional SNP assessment to assign possible phenotypic characteristics of the individual that belonged to a culture whose location has yielded only trace human remains. We compare the high-confidence SNPs to those of contemporary populations to find the populations most closely related to the individual. This provides evidence for a migration from Siberia into the New World some 5,500 years ago, independent of that giving rise to the modern Native Americans and Inuit.

749 citations

A map of human genome variation from population-scale sequencing

Richard Durbin, David Altshuler, Gonçalo R. Abecasis, David R. Bentley +358 more

01 Oct 2010

TL;DR: The pilot phase of the 1000 Genomes Project is presented, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms, and the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants are described.

599 citations

Journal Article•DOI•

The DNA Methylome of Human Peripheral Blood Mononuclear Cells

Yingrui Li, Jingde Zhu¹, Geng Tian², Geng Tian³, Ning Li, Qibin Li, Mingzhi Ye, Hancheng Zheng, Jian-Xin Yu¹, Honglong Wu, Jihua Sun, Hongyu Zhang¹, Quan Chen, Ruibang Luo⁴, Minfeng Chen, Yinghua He¹, Xin Jin⁴, Qinghui Zhang, Chang Yu, Guangyu Zhou, Jinfeng Sun¹, Yebo Huang, Huisong Zheng, Hongzhi Cao, Xiaoyu Zhou¹, Shicheng Guo¹, Xueda Hu, Xin Li⁵, Karsten Kristiansen⁶, Lars Bolund⁷, Jiujin Xu, Wen-Wen Wang⁵, Huanming Yang, Jing Wang, Ruiqiang Li, Stephan Beck⁸, Jun-Jun Wang⁶, Xiuqing Zhang - Show less +34 more•Institutions (8)

Shanghai Jiao Tong University¹, Beijing Institute of Genomics², Chinese Academy of Sciences³, South China University of Technology⁴, Kunming Institute of Zoology⁵, University of Copenhagen⁶, Aarhus University⁷, University College London⁸

09 Nov 2010-PLOS Biology

TL;DR: Analysis across the genome of patterns of DNA methylation reveals a rich landscape of allele-specific epigenetic modification and consequent effects on allele-specific gene expression.

Abstract: DNA methylation plays an important role in biological processes in human health and disease. Recent technological advances allow unbiased whole-genome DNA methylation (methylome) analysis to be carried out on human cells. Using whole-genome bisulfite sequencing at 24.7-fold coverage (12.3-fold per strand), we report a comprehensive (92.62%) methylome and analysis of the unique sequences in human peripheral blood mononuclear cells (PBMC) from the same Asian individual whose genome was deciphered in the YH project. PBMC constitute an important source for clinical blood tests world-wide. We found that 68.4% of CpG sites and […] 80% displayed allele-specific expression (ASE). These data demonstrate that ASM is a recurrent phenomenon and is highly correlated with ASE in human PBMCs. Together with recently reported similar studies, our study provides a comprehensive resource for future epigenomic research and confirms new sequencing technology as a paradigm for large-scale epigenomics studies.

336 citations

Journal Article•DOI•

Single base-resolution methylome of the silkworm reveals a sparse epigenomic map

Hui Xiang¹, Jingde Zhu², Jingde Zhu³, Quan Chen, Fangyin Dai⁴, Xin Li¹, Muwang Li, Hongyu Zhang³, Guojie Zhang, Dong Li⁴, Yang Dong¹, Li Zhao¹, Ying Lin⁴, Daojun Cheng⁴, Jian Yu³, Jinfeng Sun³, Xiaoyu Zhou³, Kelong Ma³, Yinghua He³, Yangxing Zhao³, Shicheng Guo³, Mingzhi Ye, Guangwu Guo, Yingrui Li, Ruiqiang Li, Xiuqing Zhang, Lijia Ma, Karsten Kristiansen⁵, Qiuhong Guo⁶, Jianhao Jiang⁶, Stephan Beck⁷, Qingyou Xia⁴, Wen Wang¹, Jun Wang⁵ - Show less +30 more•Institutions (7)

Kunming Institute of Zoology¹, Fudan University², Shanghai Jiao Tong University³, Southwest University⁴, University of Copenhagen⁵, Chinese Academy of Sciences⁶, University College London⁷

01 May 2010-Nature Biotechnology

TL;DR: The methylome of a model insect, the silkworm Bombyx mori, is surveyed at single-base resolution using Illumina high-throughput bisulfite sequencing (MethylC-Seq), finding that transposable elements, promoters and ribosomal DNAs are hypomethylated, but in contrast, genomic loci matching small RNAs in gene bodies are densely methylated.

Abstract: Epigenetic regulation in insects may have effects on diverse biological processes. Here we survey the methylome of a model insect, the silkworm Bombyx mori, at single-base resolution using Illumina high-throughput bisulfite sequencing (MethylC-Seq). We conservatively estimate that 0.11% of genomic cytosines are methylcytosines, all of which probably occur in CG dinucleotides. CG methylation is substantially enriched in gene bodies and is positively correlated with gene expression levels, suggesting it has a positive role in gene transcription. We find that transposable elements, promoters and ribosomal DNAs are hypomethylated, but in contrast, genomic loci matching small RNAs in gene bodies are densely methylated. This work contributes to our understanding of epigenetics in insects, and in contrast to previous studies of the highly methylated genomes of Arabidopsis and human, demonstrates a strategy for sequencing the epigenomes of organisms such as insects that have low levels of methylation.

321 citations

Journal Article•DOI•

Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants

Yingrui Li, Nicolas Vinckenbosch¹, Geng Tian, Emilia Huerta-Sanchez¹, Tao Jiang, Hui Jiang, Anders Albrechtsen², Gitte Andersen, Hongzhi Cao, Thorfinn Sand Korneliussen², Niels Grarup, Yiran Guo, Ines Hellman³, Xin Jin⁴, Qibin Li, Jiangtao Liu, Xiao Liu, Thomas Sparsø, Meifang Tang, Honglong Wu, Renhua Wu, Chang Yu, Hancheng Zheng⁴, Arne Astrup², Lars Bolund⁵, Lars Bolund⁶, Johan Holmkvist, Torben Jørgensen⁷, Torben Jørgensen⁸, Karsten Kristiansen², Ole Schmitz⁹, Ole Schmitz⁵, Thue W. Schwartz², Xiuqing Zhang, Ruiqiang Li², Huanming Yang, Jing Wang, Torben Hansen¹⁰, Oluf Pedersen⁵, Oluf Pedersen², Rasmus Nielsen², Rasmus Nielsen¹, Jun Wang² - Show less +39 more•Institutions (10)

University of California, Berkeley¹, University of Copenhagen², University of Vienna³, South China University of Technology⁴, Aarhus University⁵, The Breast Cancer Research Foundation⁶, Glostrup Hospital⁷, Health Science University⁸, Aarhus University Hospital⁹, University of Southern Denmark¹⁰

01 Nov 2010-Nature Genetics

TL;DR: Exome sequencing of 200 individuals from Denmark with targeted capture of 18,654 coding genes and sequence coverage of each individual exome at an average depth of 12-fold is reported, suggesting that deleterious substitutions are primarily recessive.

Abstract: Targeted capture combined with massively parallel exome sequencing is a promising approach to identify genetic variants implicated in human traits. We report exome sequencing of 200 individuals from Denmark with targeted capture of 18,654 coding genes and sequence coverage of each individual exome at an average depth of 12-fold. On average, about 95% of the target regions were covered by at least one read. We identified 121,870 SNPs in the sample population, including 53,081 coding SNPs (cSNPs). Using a statistical method for SNP calling and an estimation of allelic frequencies based on our population data, we derived the allele frequency spectrum of cSNPs with a minor allele frequency greater than 0.02. We identified a 1.8-fold excess of deleterious, non-synonymous cSNPs over synonymous cSNPs in the low-frequency range (minor allele frequencies between 2% and 5%). This excess was more pronounced for X-linked SNPs, suggesting that deleterious substitutions are primarily recessive.
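
To make the "low-frequency range" in this abstract concrete, here is a minimal sketch of how a minor allele frequency (MAF) could be computed from allele counts and tested against the 2% to 5% window. It is an illustration only: the study estimated frequencies with a statistical method suited to 12-fold sequencing data, not by direct counting, and the function names, the inclusive bounds and the example counts are all assumptions.

```python
def minor_allele_frequency(alt_count, total_alleles):
    """MAF from a count of alternate alleles among all sampled alleles."""
    if total_alleles <= 0:
        raise ValueError("total_alleles must be positive")
    if not 0 <= alt_count <= total_alleles:
        raise ValueError("alt_count must lie between 0 and total_alleles")
    freq = alt_count / total_alleles
    # The minor allele is whichever of the two alleles is rarer.
    return min(freq, 1 - freq)


def is_low_frequency(maf, low=0.02, high=0.05):
    """True if the MAF falls in the window (bounds assumed inclusive)."""
    return low <= maf <= high


# 200 diploid individuals carry 400 alleles at an autosomal site.
# The count of 12 alternate alleles is a made-up example.
maf = minor_allele_frequency(alt_count=12, total_alleles=400)
print(is_low_frequency(maf))  # True
```

For X-linked sites the number of sampled alleles depends on the sex of each individual, so `total_alleles` would not simply be twice the sample size.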

319 citations

Journal Article•DOI•

TGM6 identified as a novel causative gene of spinocerebellar ataxias using exome sequencing

Junling Wang¹, Xu Yang, Kun Xia, Zheng mao Hu, Ling Weng¹, Xin Jin², Hong Jiang¹, Peng Zhang, Lu Shen¹, Ji Feng Guo¹, Nan Li¹, Yingrui Li, Li fang Lei¹, Jie Zhou¹, Juan Du¹, Ya Fang Zhou¹, Qian Pan, Jing Wang, Jun Wang³, Rui Qiang Li³, Bei Sha Tang¹ - Show less +17 more•Institutions (3)

Central South University¹, South China University of Technology², University of Copenhagen³

01 Dec 2010-Brain

TL;DR: The finding of TGM6 as a novel causative gene of spinocerebellar ataxia illustrates whole-exome sequencing of affected individuals from one family as an effective and cost efficient method for mapping genes of rare Mendelian disorders and the use of linkage analysis and exome sequencing for further improving efficiency.

Abstract: Autosomal-dominant spinocerebellar ataxias constitute a large, heterogeneous group of progressive neurodegenerative diseases with multiple types. To date, classical genetic studies have revealed 31 distinct genetic forms of spinocerebellar ataxias and identified 19 causative genes. Traditional positional cloning strategies, however, have limitations for finding causative genes of rare Mendelian disorders. Here, we used a combined strategy of exome sequencing and linkage analysis to identify a novel spinocerebellar ataxia causative gene, TGM6. We sequenced the whole exome of four patients in a Chinese four-generation spinocerebellar ataxia family and identified a missense mutation, c.1550T-G transition (L517W), in exon 10 of TGM6. This change is at a highly conserved position, is predicted to have a functional impact, and completely cosegregated with the phenotype. The exome results were validated using linkage analysis. The mutation we identified using exome sequencing was located in the same region (20p13-12.2) as that identified by linkage analysis, which cross-validated TGM6 as the causative spinocerebellar ataxia gene in this family. We also showed that the causative gene could be mapped by a combined method of linkage analysis and sequencing of one sample from the family. We further confirmed our finding by identifying another missense mutation c.980A-G transition (D327G) in exon seven of TGM6 in an additional spinocerebellar ataxia family, which also cosegregated with the phenotype. Both mutations were absent in 500 normal unaffected individuals of matched geographical ancestry. The finding of TGM6 as a novel causative gene of spinocerebellar ataxia illustrates whole-exome sequencing of affected individuals from one family as an effective and cost efficient method for mapping genes of rare Mendelian disorders and the use of linkage analysis and exome sequencing for further improving efficiency.

283 citations

Journal Article•

Single base-resolution methylome of the silkworm reveals a sparse epigenomic map (vol 28, pg 516, 2010)

Hui Xiang, Jingde Zhu, Quan Chen, Fangyin Dai, Xin Li, Muwang Li, Hongyu Zhang, Guojie Zhang, Dong Li, Yang Dong, Li Zhao, Ying Lin, Daojun Cheng, Ja Yu, Jinfeng Sun, Xiaoyu Zhou, KL Ma, Yinghua He, Yangxing Zhao, Shicheng Guo, Mingzhi Ye, Guangwu Guo, Yingrui Li, RQ Li, Xiuqing Zhang, LJ Ma, K Kristiansen, Qiuhong Guo, JH Jiang, S Beck, Qingyou Xia, Wen Wang, Juan Wang - Show less +29 more

01 Jul 2010-Nature Biotechnology

Journal Article•DOI•

Building the sequence map of the human pan-genome

Ruiqiang Li¹, Yingrui Li, Hancheng Zheng², Ruibang Luo², Hongmei Zhu, Qibin Li, Wubin Qian, Yuanyuan Ren, Geng Tian, Jinxiang Li, Guangyu Zhou, Xuan Zhu, Honglong Wu³, Junjie Qin, Xin Jin², Dongfang Li³, Hongzhi Cao³, Xueda Hu, Hélène Blanche⁴, Howard M. Cann⁴, Xiuqing Zhang, Songgang Li, Lars Bolund⁵, Karsten Kristiansen¹, Huanming Yang, Jun Wang¹, Jing Wang - Show less +23 more•Institutions (5)

University of Copenhagen¹, South China University of Technology², Shenzhen University³, Fondation Jean Dausset Centre d'Etude du Polymorphisme Humain⁴, Aarhus University⁵

01 Jan 2010-Nature Biotechnology

TL;DR: It is estimated that a complete human pan-genome would contain ∼19–40 Mb of novel sequence not present in the extant reference genome, indicating the importance of using complete genome sequencing and de novo assembly.

Journal Article•DOI•

Whole genome DNA methylation analysis based on high throughput sequencing technology.

Ning Li¹, Mingzhi Ye¹, Yingrui Li¹, Zhixiang Yan¹, Lee M. Butcher², Jihua Sun¹, Xu Han¹, Quan Chen¹, Xiuqing Zhang¹, Jun Wang¹, Jun Wang³ - Show less +7 more•Institutions (3)

Beijing Genomics Institute¹, University College London², University of Copenhagen³

01 Nov 2010-Methods

TL;DR: It is found that 3 gigabases (Gbp) of uniquely mapped 45 bp paired-end MeDIP-seq or MBD-seq reads is the minimum requirement and a cost-effective strategy for methylome pattern analysis.

Journal Article•DOI•

Design of association studies with pooled or un-pooled next-generation sequencing data.

Su Yeon Kim¹, Yingrui Li², Yiran Guo², Ruiqiang Li², Johan Holmkvist, Torben Hansen³, Oluf Pedersen⁴, Oluf Pedersen⁵, Jun Wang⁵, Jun Wang², Rasmus Nielsen⁵, Rasmus Nielsen², Rasmus Nielsen¹ - Show less +9 more•Institutions (5)

University of California, Berkeley¹, Beijing Genomics Institute², University of Southern Denmark³, Health Science University⁴, University of Copenhagen⁵

01 Jul 2010-Genetic Epidemiology

TL;DR: These results provide guidelines for researchers who are developing association mapping studies based on next‐generation sequencing and suggest that with a fixed cost, sequencing many individuals at a more shallow depth with larger pool size achieves higher power than sequencing a small number of individuals in higher depth with smaller pool size, even in the presence of high error rates.

Abstract: Most common hereditary diseases in humans are complex and multifactorial. Large-scale genome-wide association studies based on SNP genotyping have only identified a small fraction of the heritable variation of these diseases. One explanation may be that many rare variants (a minor allele frequency, MAF <5%), which are not included in the common genotyping platforms, may contribute substantially to the genetic variation of these diseases. Next-generation sequencing, which would allow the analysis of rare variants, is now becoming so cheap that it provides a viable alternative to SNP genotyping. In this paper, we present cost-effective protocols for using next-generation sequencing in association mapping studies based on pooled and un-pooled samples, and identify optimal designs with respect to total number of individuals, number of individuals per pool, and the sequencing coverage. We perform a small empirical study to evaluate the pooling variance in a realistic setting where pooling is combined with exon-capturing. To test for associations, we develop a likelihood ratio statistic that accounts for the high error rate of next-generation sequencing data. We also perform extensive simulations to determine the power and accuracy of this method. Overall, our findings suggest that with a fixed cost, sequencing many individuals at a more shallow depth with larger pool size achieves higher power than sequencing a small number of individuals in higher depth with smaller pool size, even in the presence of high error rates. Our results provide guidelines for researchers who are developing association mapping studies based on next-generation sequencing.

Journal Article•DOI•

State of the art de novo assembly of human genomes from massively parallel sequencing data.

Yingrui Li, Yujie Hu¹, Lars Bolund, Jun Wang¹•Institutions (1)

Chinese Academy of Sciences¹

01 Apr 2010-Human Genomics

TL;DR: Two public short-read de novo assembly applications that can handle human genomes, ABySS and SOAPdenovo, are described.

Abstract: Recent studies in human genomes have demonstrated the use of de novo assemblies to identify genetic variations that are difficult for mapping-based approaches. Construction of multiple human genome assemblies is enabled by massively parallel sequencing, but a conventional bioinformatics solution is costly and slow, creating bottle-necks in the process. This review describes two public short-read de novo assembly applications that can handle human genomes, ABySS and SOAPdenovo. It also discusses the technical aspects and future challenges of human genome de novo assembly by short reads.

Patent•

Method and system for detecting polymorphism locus of genome target region

Yingrui Li, Chang Yu, Ruibang Luo, Fan Zhang

15 Dec 2010

TL;DR: A method and a system for detecting a polymorphism locus of a genome target region are described. The method comprises the steps of obtaining an exon sequencing result, removing redundancy and sequencing, carrying out statistical analysis I, detecting an SNP (single-nucleotide polymorphism) locus, filtering the SNP locus, carrying out statistical analysis II and annotating the SNP.

Abstract: The invention discloses a method and a system for detecting a polymorphism locus of a genome target region. The method comprises the steps of: obtaining an exon sequencing result, removing redundancy and sequencing, carrying out statistical analysis I, detecting an SNP (single-nucleotide polymorphism) locus, filtering the SNP locus, carrying out statistical analysis II and annotating the SNP. The SNP analysis can be carried out by sequencing a specific region of the genome. The invention offers high SNP detection accuracy, high speed and low cost, and can automate the whole process: high-quality SNP loci are generated automatically from the original sequencing data, and the SNP loci can be annotated and classified.

Journal Article•DOI•

The sequence and de novo assembly of the giant panda genome [Correction]

Ruiqiang Li¹, Ruiqiang Li², Wei Fan², Geng Tian², Geng Tian³, Zhu Hongmei², Lin He⁴, Lin He⁵, Jing Cai⁶, Jing Cai³, Quanfei Huang², Qingle Cai², Qingle Cai⁷, Bo Li², Yinqi Bai², Zhihe Zhang⁸, Ya-Ping Zhang⁶, Wen Wang⁶, Jun Li², Fuwen Wei, Heng Li⁹, Min Jian², Jianwen Li², Zhaolei Zhang¹⁰, Rasmus Nielsen¹¹, Dawei Li², Wanjun Gu¹², Zhentao Yang², Zhaoling Xuan², Oliver A. Ryder, Frederick C. Leung¹³, Yan Zhou², Jianjun Cao², Xiao Sun¹², Yonggui Fu¹⁴, Xiaodong Fang², Xiaosen Guo², Bo Wang², Rong Hou⁸, Fujun Shen⁸, Bo Mu², Peixiang Ni², Runmao Lin², Wubin Qian², Guo-Dong Wang⁶, Guo-Dong Wang³, Chang Yu², Wenhui Nie⁶, Jinhuan Wang⁶, Zhigang Wu², Huiqing Liang², Jiumeng Min⁷, Jiumeng Min², Qi Wu, Shifeng Cheng⁷, Shifeng Cheng², Jue Ruan³, Jue Ruan², Mingwei Wang², Zhongbin Shi², Ming Wen², Binghang Liu², Xiaoli Ren², Huisong Zheng², Dong Dong¹⁰, Kathleen Cook¹⁰, Gao Shan², Hao Zhang², Carolin Kosiol¹⁵, Xueying Xie¹², Zuhong Lu¹², Hancheng Zheng², Yingrui Li³, Yingrui Li², Cynthia C. Steiner, Tommy Tsan-Yuk Lam¹³, Siyuan Lin², Qinghui Zhang², Guoqing Li², Jing Tian², Timing Gong², Hongde Liu¹², Dejin Zhang¹², Lin Fang², Chen Ye², Juanbin Zhang², Wenbo Hu¹⁴, Anlong Xu¹⁴, Yuanyuan Ren², Guojie Zhang³, Guojie Zhang⁶, Guojie Zhang², Michael William Bruford¹⁶, Qibin Li³, Qibin Li², Lijia Ma², Lijia Ma³, Yiran Guo³, Yiran Guo², Na An², Yujie Hu³, Yujie Hu², Yang Zheng², Yang Zheng³, Yongyong Shi⁴, Zhiqiang Li⁴, Qing Liu², Yanling Chen², Jing Zhao², Ning Qu⁷, Ning Qu², Shancen Zhao², Feng Tian², Xiaoling Wang², Haiyin Wang², Lizhi Xu², Xiao Liu², Tomas Vinar¹⁷, Yajun Wang¹⁸, Tak-Wah Lam¹³, Siu-Ming Yiu¹³, Shiping Liu¹⁹, Hemin Zhang, Desheng Li, Yan Huang, Xia Wang², Guohua Yang², Zhi Jiang², Junyi Wang², Nan Qin², Li Li², Jingxiang Li², Lars Bolund², Karsten Kristiansen², Karsten Kristiansen¹, Gane Ka-Shu Wong², Gane Ka-Shu Wong²⁰, Maynard V. Olson²¹, Xiuqing Zhang², Songgang Li², Huanming Yang², Jian Wang², Jun Wang², Jun Wang¹ - Show less +140 more•Institutions (21)

25 Feb 2010-Nature

TL;DR: This correction notes that the Latin species name of the giant panda was written incorrectly as Ailuropoda melanoleura in the original Article; the correct name is Ailuropoda melanoleuca.

Abstract: Nature 463, 311–317 (2010) In this Article, the Latin species name of the giant panda was written incorrectly as Ailuropoda melanoleura. The correct name is Ailuropoda melanoleuca.

Building the sequence map of the human pan-genome

[...]

01 Jan 2010

TL;DR: Here, the de novo assembly of an Asian and an African genome with the NCBI reference human genome is integrated, as a step toward constructing the human pan-genome.

Abstract: Here we integrate the de novo assembly of an Asian and an African genome with the NCBI reference human genome, as a step toward constructing the human pan-genome. We identified ~5 Mb of novel sequences not present in the reference genome in each of these assemblies. Most novel sequences are individual or population specific, as revealed by their comparison to all available human DNA sequence and by PCR validation using the human genome diversity cell line panel. We found novel sequences present in patterns consistent with known human migration paths. Cross-species conservation analysis of predicted genes indicated that the novel sequences contain potentially functional coding regions. We estimate that a complete human pan-genome would contain ~19–40 Mb of novel sequence not present in the extant reference genome. The extensive amount of novel sequence contributing to the genetic variation of the pan-genome indicates the importance of using complete genome sequencing and de novo assembly.

The Human Genome Project [1] established the foundation for human genomics studies. Subsequent analyses unveiled genetic variations and identified their effects on phenotypic diversity and differences in disease susceptibility [2]. Guided by the National Center for Biotechnology Information (NCBI) reference genome, initial studies of human genetic variation focused largely on identifying [3] and cataloging [4,5] single-nucleotide polymorphisms (SNPs) and studying their association to human diseases [6]. Structural variation (which is thought to contribute more variant sequences than SNPs) has also been extensively identified and analyzed in the human genome [7–10]. The availability of a number of individual human genomes [11–15] has provided an unprecedented opportunity to investigate detailed genetic differences at the individual level.
Preliminary analyses have revealed that these genomes contain sequences that could not be mapped onto the human reference genome (novel sequences), resulting in the proposal that the majority of these sequences likely belong to the gap regions in the current version of the human genome assembly [12]. When fosmid clones from HapMap samples were sequenced, 525 sequences were identified that mapped instead to highly poly

Journal Article•DOI•

Archaeology Augments Tibet's Genetic History--Response

[...]

Xin Yi¹, Yu Liang¹, Emilia Huerta-Sanchez², Xin Jin³, Zha Xi Ping Cuo¹, John E. Pool², John E. Pool⁴, Xun Xu, Hui Jiang, Nicolas Vinckenbosch², Thorfinn Sand Korneliussen⁵, Hancheng Zheng³, Tao Liu, Weiming He³, Kui Li¹, Ruibang Luo³, Xifang Nie, Honglong Wu⁶, Meiru Zhao, Hongzhi Cao⁶, Jing Zou, Ying Shan³, Shuzheng Li, Qi Yang, Asan¹, Peixiang Ni, Geng Tian¹, Junming Xu, Xiao Liu, Tao Jiang⁶, Renhua Wu, Guangyu Zhou, Meifang Tang, Junjie Qin, Tong Wang, Shuijian Feng, Guohong Li, Huasang, Jiangbai Luosang, Wei Wang, Fang Chen, Yading Wang, Xiaoguang Zheng¹, Zhuo Li, Zhuoma Bianba, Ge Yang, Xinping Wang, Shuhui Tang, Guoyi Gao, Yong Chen, Zhen Luo, Lamu Gusang, Zheng Cao, Qinghui Zhang, Wei-Han OuYang, Xiaoli Ren, Huiqing Liang, Huisong Zheng, Yebo Huang, Jingxiang Li, Lars Bolund, Karsten Kristiansen⁵, Yingrui Li, Yong Zhang, Xiuqing Zhang, Ruiqiang Li⁵, Songgang Li, Huanming Yang, Rasmus Nielsen⁵, Rasmus Nielsen², Jun Wang⁵, Jing Wang - Show less +68 more•Institutions (6)

Chinese Academy of Sciences¹, University of California, Berkeley², South China University of Technology³, University of California, Davis⁴, University of Copenhagen⁵, Shenzhen University⁶

17 Sep 2010-Science

TL;DR: The understanding that the majority of the current population of the Tibetan plateau may trace their genetic ancestry back to quite recent immigrants into Tibet, even though humans have lived in Tibet for a much longer time (possibly with some continuity of culture), is important for understanding the difference between inferences based on archaeology and inferences based on genetics.

Abstract: We thank Brantingham et al. for their interest in our study; we agree that both molecular and archaeological evidence should be used to understand the demographic history of the Tibetan people. Our Report focused not on the demographic history of the Tibetan population, but rather the selection acting on specific putatively adaptive mutations segregating in the Tibetan population. We included some limited demographic analyses because they helped illuminate our results regarding natural selection. The real demographic model is clearly likely to be more complex than the simple models of two populations diverging from each other. For example, Zhao et al. (1) used mitochondrial DNA to argue that late settlers of the Tibetan plateau may not have entirely replaced the original population but that a small proportion of them carry mitochondrial DNA lineages tracing back to Late Paleolithic inhabitants on the plateau. If this is the case, even if the EPAS1 variant was present in the early inhabitants of Tibet, strong selection would be needed to increase its frequency in the modern Tibetan gene pool. The understanding that the majority of the current population of the Tibetan plateau may trace their genetic ancestry back to quite recent immigrants into Tibet, even though humans have lived in Tibet for a much longer time—possibly with some continuity of culture—is important for understanding the difference between inferences based on archaeology and inferences based on genetics.

1. M. Zhao et al., Proc. Natl. Acad. Sci. U.S.A. 106, 21230 (2009).
