Home
/
Authors
/
Ning Li

Author

Ning Li

Other affiliations: Beijing Institute of Genomics, University of Science and Technology of China, Yunnan Agricultural University ...read more

Bio: Ning Li is an academic researcher from University of Minnesota. The author has contributed to research in topics: Gene & Population. The author has an hindex of 51, co-authored 449 publications receiving 14228 citations. Previous affiliations of Ning Li include Beijing Institute of Genomics & University of Science and Technology of China.

Topics: Gene, Population, Medicine, Transgene, Locus (genetics) ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1998
1995

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Whole-genome analyses resolve early branches in the tree of life of modern birds

[...]

Erich D. Jarvis¹, Siavash Mirarab², Andre J. Aberer³, Bo Li⁴, Bo Li⁵, Bo Li⁶, Peter Houde⁷, Cai Li⁵, Cai Li⁴, Simon Y. W. Ho⁸, Brant C. Faircloth⁹, Benoit Nabholz, Jason T. Howard¹, Alexander Suh¹⁰, Claudia C. Weber¹⁰, Rute R. da Fonseca¹¹, Jianwen Li, Fang Zhang Zhang, Hui Li, Long Zhou, Nitish Narula⁷, Nitish Narula¹², Liang Liu¹³, Ganesh Ganapathy¹, Bastien Boussau, Shamsuzzoha Bayzid², Volodymyr Zavidovych¹, Sankar Subramanian¹⁴, Toni Gabaldón¹⁵, Salvador Capella-Gutierrez, Jaime Huerta-Cepas, Bhanu Rekepalli¹⁶, Bhanu Rekepalli¹⁷, Kasper Munch¹⁸, Mikkel H. Schierup¹⁸, Bent E. K. Lindow¹¹, Wesley C. Warren¹⁹, David A. Ray, Richard E. Green²⁰, Michael William Bruford²¹, Xiangjiang Zhan²¹, Xiangjiang Zhan²², Andrew Dixon, Shengbin Li⁶, Ning Li²³, Yinhua Huang²³, Elizabeth P. Derryberry²⁴, Elizabeth P. Derryberry²⁵, Mads F. Bertelsen²⁶, Frederick H. Sheldon²⁵, Robb T. Brumfield²⁵, Claudio V. Mello²⁷, Claudio V. Mello²⁸, Peter V. Lovell²⁷, Morgan Wirthlin²⁷, Maria Paula Cruz Schneider²⁸, Francisco Prosdocimi²⁸, José Alfredo Samaniego¹¹, Amhed Missael Vargas Velazquez¹¹, Alonzo Alfaro-Núñez¹¹, Paula F. Campos¹¹, Bent O. Petersen²⁹, Thomas Sicheritz-Pontén²⁹, An Pas, Thomas L. Bailey, R. Paul Scofield³⁰, Michael Bunce³¹, David M. Lambert¹⁴, Qi Zhou, Polina L. Perelman³², Amy C. Driskell³³, Beth Shapiro²⁰, Zijun Xiong, Yongli Zeng, Shiping Liu, Zhenyu Li, Binghang Liu, Kui Wu, Jin Xiao, Xiong Yinqi, Quiemei Zheng, Yong Zhang, Huanming Yang, Jian Wang, Linnéa Smeds¹⁰, Frank E. Rheindt³⁴, Michael J. Braun³⁵, Jon Fjeldså¹¹, Ludovic Orlando¹¹, F. Keith Barker⁴, Knud A. Jønsson⁴, Warren E. Johnson³³, Klaus-Peter Koepfli³³, Stephen J. O'Brien³⁶, David Haussler, Oliver A. Ryder, Carsten Rahbek⁴, Eske Willerslev¹¹, Gary R. Graves³³, Gary R. Graves⁴, Travis C. Glenn¹³, John E. McCormack³⁷, Dave Burt³⁸, Hans Ellegren¹⁰, Per Alström, Scott V. Edwards³⁹, Alexandros Stamatakis³, David P. Mindell⁴⁰, Joel Cracraft⁴, Edward L. Braun⁴¹, Tandy Warnow⁴², Tandy Warnow², Wang Jun, M. Thomas P. Gilbert³¹, M. Thomas P. Gilbert⁴, Guojie Zhang¹¹, Guojie Zhang⁵ - Show less +113 more•Institutions (42)

Duke University¹, University of Texas at Austin², Heidelberg Institute for Theoretical Studies³, American Museum of Natural History⁴, Beijing Genomics Institute⁵, Xi'an Jiaotong University⁶, New Mexico State University⁷, University of Sydney⁸, University of California⁹, Uppsala University¹⁰, University of Copenhagen¹¹, Okinawa Institute of Science and Technology¹², University of Georgia¹³, Griffith University¹⁴, Catalan Institution for Research and Advanced Studies¹⁵, Oak Ridge National Laboratory¹⁶, Joint Institute for Nuclear Research¹⁷, Aarhus University¹⁸, Washington University in St. Louis¹⁹, University of California, Santa Cruz²⁰, Cardiff University²¹, Kunming Institute of Zoology²², China Agricultural University²³, Tulane University²⁴, Louisiana State University²⁵, Copenhagen Zoo²⁶, Oregon Health & Science University²⁷, Federal University of Pará²⁸, Technical University of Denmark²⁹, Canterbury Museum³⁰, Curtin University³¹, Novosibirsk State University³², Smithsonian Institution³³, National University of Singapore³⁴, National Museum of Natural History³⁵, Nova Southeastern University³⁶, Occidental College³⁷, University of Edinburgh³⁸, Harvard University³⁹, University of California, San Francisco⁴⁰, University of Florida⁴¹, University of Illinois at Urbana–Champaign⁴²

12 Dec 2014-Science

TL;DR: A genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves recovered a highly resolved tree that confirms previously controversial sister or close relationships and identifies the first divergence in Neoaves, two groups the authors named Passerea and Columbea.

...read moreread less

Abstract: To better determine the history of modern birds, we performed a genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves using phylogenomic methods created to handle genome-scale data. We recovered a highly resolved tree that confirms previously controversial sister or close relationships. We identified the first divergence in Neoaves, two groups we named Passerea and Columbea, representing independent lineages of diverse and convergently evolved land and water bird species. Among Passerea, we infer the common ancestor of core landbirds to have been an apex predator and confirm independent gains of vocal learning. Among Columbea, we identify pigeons and flamingoes as belonging to sister clades. Even with whole genomes, some of the earliest branches in Neoaves proved challenging to resolve, which was best explained by massive protein-coding sequence convergence and high levels of incomplete lineage sorting that occurred during a rapid radiation after the Cretaceous-Paleogene mass extinction event about 66 million years ago.

...read moreread less

1,624 citations

Journal Article•DOI•

The diploid genome sequence of an Asian individual.

[...]

Jun Wang, Wei Wang¹, Ruiqiang Li¹, Ruiqiang Li², Yingrui Li³, Yingrui Li⁴, Yingrui Li¹, Geng Tian⁵, Geng Tian¹, Laurie Goodman¹, Wei Fan¹, Junqing Zhang¹, Jun Li¹, Juanbin Zhang¹, Yiran Guo⁵, Yiran Guo¹, Binxiao Feng¹, Heng Li¹, Heng Li⁶, Yao Lu¹, Xiaodong Fang¹, Huiqing Liang¹, Zhenglin Du¹, Dong Li¹, Yiqing Zhao¹, Yiqing Zhao⁵, Yujie Hu¹, Yujie Hu⁵, Zhenzhen Yang¹, Hancheng Zheng¹, Ines Hellmann⁷, Michael Inouye⁶, John E. Pool⁷, Xin Yi¹, Xin Yi⁵, Jing Zhao¹, Jinjie Duan¹, Yan Zhou¹, Junjie Qin¹, Junjie Qin⁵, Lijia Ma⁵, Lijia Ma¹, Guoqing Li¹, Zhentao Yang¹, Guojie Zhang⁵, Guojie Zhang¹, Bin Yang¹, Chang Yu¹, Fang Liang¹, Fang Liang⁵, Wenjie Li¹, Shaochuan Li¹, Dawei Li¹, Peixiang Ni¹, Jue Ruan¹, Jue Ruan⁵, Qibin Li¹, Qibin Li⁵, Hongmei Zhu¹, Dongyuan Liu¹, Zhike Lu¹, Ning Li⁵, Ning Li¹, Guangwu Guo⁵, Guangwu Guo¹, Jianguo Zhang¹, Jia Ye¹, Lin Fang¹, Qin Hao⁵, Qin Hao¹, Quan Chen⁴, Quan Chen¹, Yu Liang¹, Yu Liang⁵, Yeyang Su⁵, Yeyang Su¹, A. san¹, A. san⁵, Cuo Ping⁵, Cuo Ping¹, Shuang Yang¹, Fang Chen⁵, Fang Chen¹, Li Li¹, Ke Zhou¹, Hongkun Zheng², Hongkun Zheng¹, Yuanyuan Ren¹, Ling Yang¹, Yang Gao³, Yang Gao¹, Guohua Yang¹, Guohua Yang⁸, Zhuo Li¹, Xiaoli Feng¹, Karsten Kristiansen², Gane Ka-Shu Wong⁹, Gane Ka-Shu Wong¹, Rasmus Nielsen⁷, Richard Durbin⁶, Lars Bolund¹⁰, Lars Bolund¹, Xiuqing Zhang³, Xiuqing Zhang¹, Songgang Li⁸, Songgang Li¹, Songgang Li⁴, Huanming Yang¹, Huanming Yang⁸, Jian Wang⁸, Jian Wang¹ - Show less +107 more•Institutions (10)

Beijing Genomics Institute¹, University of Southern Denmark², Beijing Institute of Genomics³, Peking University⁴, Chinese Academy of Sciences⁵, Wellcome Trust Sanger Institute⁶, University of California, Berkeley⁷, Shenzhen University⁸, University of Alberta⁹, Aarhus University¹⁰

06 Nov 2008-Nature

TL;DR: Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly, and the potential usefulness of next-generation sequencing technologies for personal genomics.

...read moreread less

Abstract: Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics.

...read moreread less

963 citations

Journal Article•DOI•

Comparative genomics reveals insights into avian genome evolution and adaptation.

[...]

Guojie Zhang¹, Guojie Zhang², Cai Li², Qiye Li², Bo Li², Denis M. Larkin³, Chul Hee Lee⁴, Jay F. Storz⁵, Agostinho Antunes⁶, Matthew J. Greenwold⁷, Robert W. Meredith⁸, Anders Ödeen⁹, Jie Cui¹⁰, Qi Zhou¹¹, Luohao Xu², Hailin Pan², Zongji Wang¹², Lijun Jin², Pei Zhang², Haofu Hu², Wei Yang², Jiang Hu², Jin Xiao², Zhikai Yang², Yang Liu², Qiaolin Xie², Hao Yu², Jinmin Lian², Ping Wen², Fang Zhang², Hui Li², Yongli Zeng², Zijun Xiong², Shiping Liu¹², Long Zhou², Zhiyong Huang², Na An², Jie Wang¹³, Qiumei Zheng², Yingqi Xiong², Guangbiao Wang², Bo Wang², Jingjing Wang², Yu Fan¹⁴, Rute R. da Fonseca¹, Alonzo Alfaro-Núñez¹, Mikkel Schubert¹, Ludovic Orlando¹, Tobias Mourier¹, Jason T. Howard¹⁵, Ganeshkumar Ganapathy¹⁵, Andreas R. Pfenning¹⁵, Osceola Whitney¹⁵, Miriam V. Rivas¹⁵, Erina Hara¹⁵, Julia Smith¹⁵, Marta Farré³, Jitendra Narayan¹⁶, Gancho T. Slavov¹⁶, Michael N Romanov¹⁷, Rui Borges⁶, João Paulo Machado⁶, Imran Khan⁶, Mark S. Springer¹⁸, John Gatesy¹⁸, Federico G. Hoffmann¹⁹, Juan C. Opazo²⁰, Olle Håstad²¹, Roger H. Sawyer⁷, Heebal Kim⁴, Kyu-Won Kim⁴, Hyeon Jeong Kim⁴, Seoae Cho⁴, Ning Li²², Yinhua Huang²², Michael William Bruford²³, Xiangjiang Zhan¹³, Andrew Dixon, Mads F. Bertelsen²⁴, Elizabeth P. Derryberry²⁵, Wesley C. Warren²⁶, Richard K. Wilson²⁶, Shengbin Li²⁷, David A. Ray¹⁹, Richard E. Green²⁸, Stephen J. O'Brien²⁹, Darren K. Griffin¹⁷, Warren E. Johnson³⁰, David Haussler²⁸, Oliver A. Ryder, Eske Willerslev¹, Gary R. Graves³¹, Per Alström²¹, Jon Fjeldså³², David P. Mindell³³, Scott V. Edwards³⁴, Edward L. Braun³⁵, Carsten Rahbek³², David W. Burt³⁶, Peter Houde³⁷, Yong Zhang², Huanming Yang³⁸, Jian Wang², Erich D. Jarvis¹⁵, M. Thomas P. Gilbert¹, M. Thomas P. Gilbert³⁹, Jun Wang - Show less +103 more•Institutions (39)

University of Copenhagen¹, Beijing Genomics Institute², Royal Veterinary College³, Seoul National University⁴, University of Nebraska–Lincoln⁵, University of Porto⁶, University of South Carolina⁷, Montclair State University⁸, Uppsala University⁹, National University of Singapore¹⁰, University of California, Berkeley¹¹, South China University of Technology¹², Chinese Academy of Sciences¹³, Kunming Institute of Zoology¹⁴, Howard Hughes Medical Institute¹⁵, Aberystwyth University¹⁶, University of Kent¹⁷, University of California, Riverside¹⁸, Mississippi State University¹⁹, Austral University of Chile²⁰, Swedish University of Agricultural Sciences²¹, China Agricultural University²², Cardiff University²³, Copenhagen Zoo²⁴, Louisiana State University²⁵, Washington University in St. Louis²⁶, Xi'an Jiaotong University²⁷, University of California, Santa Cruz²⁸, Nova Southeastern University Oceanographic Center²⁹, Smithsonian Conservation Biology Institute³⁰, National Museum of Natural History³¹, Natural History Museum³², University of California, San Francisco³³, Harvard University³⁴, University of Florida³⁵, University of Edinburgh³⁶, New Mexico State University³⁷, Macau University of Science and Technology³⁸, Curtin University³⁹

12 Dec 2014-Science

TL;DR: This work explored bird macroevolution using full genomes from 48 avian species representing all major extant clades to reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits.

...read moreread less

Abstract: Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits.

...read moreread less

872 citations

Journal Article•DOI•

Genome sequence of foxtail millet ( Setaria italica ) provides insights into grass evolution and biofuel potential

[...]

Gengyun Zhang¹, Xin Liu, Zhiwu Quan¹, Shifeng Cheng, Xun Xu¹, Shengkai Pan, Min Xie, Peng Zeng, Zhen Yue, Wenliang Wang, Ye Tao, Chao Bian, Changlei Han, Qiuju Xia¹, Xiaohua Peng¹, Rui Cao, Xinhua Yang, Dongliang Zhan, Jingchu Hu, Yinxin Zhang¹, Henan Li¹, Li Hua¹, Ning Li¹, Junyi Wang, Chanchan Wang¹, Renyi Wang¹, Tao Guo¹, Cai Yanjie¹, Chengzhang Liu¹, Haitao Xiang¹, Qiuxiang Shi¹, Huang Ping¹, Qingchun Chen¹, Yingrui Li, Jun Wang², Zhao Zhihai, Jian Wang¹ - Show less +33 more•Institutions (2)

Chinese Ministry of Agriculture¹, University of Copenhagen²

01 Jun 2012-Nature Biotechnology

TL;DR: A draft genome anchored onto nine chromosomes and annotated 38,801 genes was produced and key chromosome reshuffling events were detected through collinearity identification between foxtail millet, rice and sorghum.

...read moreread less

Abstract: Completion of genome sequences for the diploid Setaria italica reveals features of C4 photosynthesis that could enable improvement of the polyploid biofuel crop switchgrass (Panicum virgatum). The genetic basis of biotechnologically relevant traits, including drought tolerance, photosynthetic efficiency and flowering control, is also highlighted. Foxtail millet (Setaria italica), a member of the Poaceae grass family, is an important food and fodder crop in arid regions and has potential for use as a C4 biofuel. It is a model system for other biofuel grasses, including switchgrass and pearl millet. We produced a draft genome (∼423 Mb) anchored onto nine chromosomes and annotated 38,801 genes. Key chromosome reshuffling events were detected through collinearity identification between foxtail millet, rice and sorghum including two reshuffling events fusing rice chromosomes 7 and 9, 3 and 10 to foxtail millet chromosomes 2 and 9, respectively, that occurred after the divergence of foxtail millet and rice, and a single reshuffling event fusing rice chromosome 5 and 12 to foxtail millet chromosome 3 that occurred after the divergence of millet and sorghum. Rearrangements in the C4 photosynthesis pathway were also identified.

...read moreread less

553 citations

Journal Article•DOI•

Genomic analyses identify distinct patterns of selection in domesticated pigs and Tibetan wild boars

[...]

Mingzhou Li¹, Shilin Tian, Long Jin¹, Guangyu Zhou, Ying Li¹, Yuan Zhang, Tao Wang¹, Carol K.L. Yeung, Lei Chen, Jideng Ma¹, Jinbo Zhang, Anan Jiang¹, Ji Li, Chaowei Zhou¹, Jie Zhang¹, Yingkai Liu¹, Xiaoqing Sun, Hongwei Zhao, Zexiong Niu, Pinger Lou¹, Lingjin Xian¹, Xiaoyong Shen, Shaoqing Liu, Shunhua Zhang¹, Mingwang Zhang¹, Li Zhu¹, Surong Shuai¹, Lin Bai¹, Guoqing Tang¹, Haifeng Liu¹, Yanzhi Jiang¹, Miaomiao Mai¹, Jian Xiao¹, Xun Wang¹, Qi Zhou, Zhiquan Wang², Paul Stothard², Ming Xue, Xiaolian Gao³, Zonggang Luo⁴, Yiren Gu, Hongmei Zhu, Xiaoxiang Hu⁵, Yaofeng Zhao⁵, Graham Plastow², Jinyong Wang, Zhi Jiang, Kui Li, Ning Li⁵, Xuewei Li¹, Ruiqiang Li⁶ - Show less +47 more•Institutions (6)

Sichuan Agricultural University¹, University of Alberta², University of Houston³, Southwest University⁴, University of Minnesota⁵, Peking University⁶

01 Dec 2013-Nature Genetics

TL;DR: Comparing the genome of Tibetan wild boars with those of neighboring Chinese domestic pigs further showed the impact of thousands of years of artificial selection and different signatures of selection in wild boar and domestic pig.

...read moreread less

Abstract: We report the sequencing at 131× coverage, de novo assembly and analyses of the genome of a female Tibetan wild boar. We also resequenced the whole genomes of 30 Tibetan wild boars from six major distributed locations and 18 geographically related pigs in China. We characterized genetic diversity, population structure and patterns of evolution. We searched for genomic regions under selection, which includes genes that are involved in hypoxia, olfaction, energy metabolism and drug response. Comparing the genome of Tibetan wild boar with those of neighboring Chinese domestic pigs further showed the impact of thousands of years of artificial selection and different signatures of selection in wild boar and domestic pig. We also report genetic adaptations in Tibetan wild boar that are associated with high altitudes and characterize the genetic basis of increased salivation in domestic pig.

...read moreread less

412 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•

다중혈관 관상동맥 환자에서 y-문합을 이용하여 양쪽 내흉동맥만을 사용한 우회술의 조기 성적

[...]

성기익, 이영탁, 박계현, 전태국, 박표원, 한일용, 장윤희 - Show less +3 more

01 Mar 2003-The Korean Journal of Thoracic and Cardiovascular Surgery

28,685 citations

Journal Article•DOI•

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

[...]

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo - Show less +7 more•Institutions (1)

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

20,557 citations

Journal Article•DOI•

Ultrafast and memory-efficient alignment of short DNA sequences to the human genome

[...]

Ben Langmead¹, Cole Trapnell¹, Mihai Pop¹, Steven L. Salzberg¹•Institutions (1)

University of Maryland, College Park¹

04 Mar 2009-Genome Biology

TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches and can be used simultaneously to achieve even greater alignment speeds.

...read moreread less

Abstract: Bowtie is an ultrafast, memory-efficient alignment program for aligning short DNA sequence reads to large genomes. For the human genome, Burrows-Wheeler indexing allows Bowtie to align more than 25 million reads per CPU hour with a memory footprint of approximately 1.3 gigabytes. Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches. Multiple processor cores can be used simultaneously to achieve even greater alignment speeds. Bowtie is open source http://bowtie.cbcb.umd.edu.

...read moreread less

20,335 citations

Journal Article•

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

[...]

Fumio Tajima¹•Institutions (1)

Kyushu University¹

30 Oct 1989-Genomics

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

...read moreread less

11,521 citations

Journal Article•DOI•

A Map of Human Genome Variation From Population-Scale Sequencing

[...]

Gonçalo R. Abecasis¹, David Altshuler², David Altshuler³, Adam Auton⁴, Lisa D Brooks⁵, Richard Durbin⁶, Richard A. Gibbs⁷, Matthew E. Hurles⁶, Gil McVean⁴ - Show less +5 more•Institutions (7)

University of Michigan¹, Broad Institute², Harvard University³, University of Oxford⁴, Johns Hopkins University⁵, Wellcome Trust Sanger Institute⁶, Baylor College of Medicine⁷

28 Oct 2010-Nature

TL;DR: The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype as mentioned in this paper, and the results of the pilot phase of the project, designed to develop and compare different strategies for genomewide sequencing with high-throughput platforms.

...read moreread less

Abstract: The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother-father-child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10(-8) per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research.

...read moreread less

7,538 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse