Home
/
Authors
/
Richard Durbin

Author

Richard Durbin

Other affiliations: Wellcome Trust Sanger Institute, University of Manchester, Wellcome Trust ...read more

Bio: Richard Durbin is an academic researcher from University of Cambridge. The author has contributed to research in topics: Genome & Population. The author has an hindex of 125, co-authored 319 publications receiving 207192 citations. Previous affiliations of Richard Durbin include Wellcome Trust Sanger Institute & University of Manchester.

Topics: Genome, Population, Genomics, Gene, Sequence assembly ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1992
1990
1989
1988
1987
1986
1985
1960
1959

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The diploid genome sequence of an Asian individual.

[...]

Jun Wang, Wei Wang¹, Ruiqiang Li², Ruiqiang Li¹, Yingrui Li³, Yingrui Li⁴, Yingrui Li¹, Geng Tian⁵, Geng Tian¹, Laurie Goodman¹, Wei Fan¹, Junqing Zhang¹, Jun Li¹, Juanbin Zhang¹, Yiran Guo⁵, Yiran Guo¹, Binxiao Feng¹, Heng Li⁶, Heng Li¹, Yao Lu¹, Xiaodong Fang¹, Huiqing Liang¹, Zhenglin Du¹, Dong Li¹, Yiqing Zhao⁵, Yiqing Zhao¹, Yujie Hu⁵, Yujie Hu¹, Zhenzhen Yang¹, Hancheng Zheng¹, Ines Hellmann⁷, Michael Inouye⁶, John E. Pool⁷, Xin Yi⁵, Xin Yi¹, Jing Zhao¹, Jinjie Duan¹, Yan Zhou¹, Junjie Qin¹, Junjie Qin⁵, Lijia Ma⁵, Lijia Ma¹, Guoqing Li¹, Zhentao Yang¹, Guojie Zhang⁵, Guojie Zhang¹, Bin Yang¹, Chang Yu¹, Fang Liang¹, Fang Liang⁵, Wenjie Li¹, Shaochuan Li¹, Dawei Li¹, Peixiang Ni¹, Jue Ruan¹, Jue Ruan⁵, Qibin Li¹, Qibin Li⁵, Hongmei Zhu¹, Dongyuan Liu¹, Zhike Lu¹, Ning Li⁵, Ning Li¹, Guangwu Guo¹, Guangwu Guo⁵, Jianguo Zhang¹, Jia Ye¹, Lin Fang¹, Qin Hao⁵, Qin Hao¹, Quan Chen³, Quan Chen¹, Yu Liang⁵, Yu Liang¹, Yeyang Su⁵, Yeyang Su¹, A. san⁵, A. san¹, Cuo Ping¹, Cuo Ping⁵, Shuang Yang¹, Fang Chen⁵, Fang Chen¹, Li Li¹, Ke Zhou¹, Hongkun Zheng², Hongkun Zheng¹, Yuanyuan Ren¹, Ling Yang¹, Yang Gao⁴, Yang Gao¹, Guohua Yang¹, Guohua Yang⁸, Zhuo Li¹, Xiaoli Feng¹, Karsten Kristiansen², Gane Ka-Shu Wong⁹, Gane Ka-Shu Wong¹, Rasmus Nielsen⁷, Richard Durbin⁶, Lars Bolund¹⁰, Lars Bolund¹, Xiuqing Zhang¹, Xiuqing Zhang⁴, Songgang Li¹, Songgang Li³, Songgang Li⁸, Huanming Yang¹, Huanming Yang⁸, Jian Wang⁸, Jian Wang¹ - Show less +107 more•Institutions (10)

Beijing Genomics Institute¹, University of Southern Denmark², Peking University³, Beijing Institute of Genomics⁴, Chinese Academy of Sciences⁵, Wellcome Trust Sanger Institute⁶, University of California, Berkeley⁷, Shenzhen University⁸, University of Alberta⁹, Aarhus University¹⁰

06 Nov 2008-Nature

TL;DR: Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly, and the potential usefulness of next-generation sequencing technologies for personal genomics.

...read moreread less

Abstract: Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics.

...read moreread less

963 citations

Journal Article•DOI•

The genome sequence of Caenorhabditis briggsae: A platform for comparative genomics

[...]

Lincoln Stein¹, Zhirong Bao², Zhirong Bao³, Darin Blasiar², Thomas Blumenthal⁴, Michael R. Brent², Nansheng Chen¹, Asif T. Chinwalla², Laura Clarke⁵, Chris Clee⁵, Avril Coghlan⁶, Alan Coulson⁵, Alan Coulson⁷, Peter D'Eustachio¹, Peter D'Eustachio⁸, David H. A. Fitch⁸, Lucinda Fulton², Robert E Fulton², Sam Griffiths-Jones⁵, Todd W. Harris¹, LaDeana W. Hillier², LaDeana W. Hillier³, Ravi Kamath⁵, Patricia E. Kuwabara⁵, Elaine R. Mardis², Marco A. Marra², Marco A. Marra⁹, Tracie L. Miner², Patrick Minx², James C. Mullikin¹⁰, James C. Mullikin⁵, Robert W. Plumb⁵, Jane Rogers⁵, Jacqueline E. Schein⁹, Jacqueline E. Schein², Marc Sohrmann⁵, John Spieth², Jason E. Stajich¹¹, Chaochun Wei², David Willey⁵, Richard K. Wilson², Richard Durbin⁵, Robert H. Waterston², Robert H. Waterston³ - Show less +40 more•Institutions (11)

Cold Spring Harbor Laboratory¹, Washington University in St. Louis², University of Washington³, University of Colorado Denver⁴, Wellcome Trust Sanger Institute⁵, Trinity College, Dublin⁶, Laboratory of Molecular Biology⁷, New York University⁸, BC Cancer Agency⁹, National Institutes of Health¹⁰, Duke University¹¹

17 Nov 2003-PLOS Biology

TL;DR: Comparisons of the two genomes exhibit extensive colinearity, and the rate of divergence appears to be higher in the chromosomal arms than in the centers, which will help to understand the evolutionary forces that mold nematode genomes.

...read moreread less

Abstract: The soil nematodes Caenorhabditis briggsae and Caenorhabditis elegans diverged from a common ancestor roughly 100 million years ago and yet are almost indistinguishable by eye. They have the same chromosome number and genome sizes, and they occupy the same ecological niche. To explore the basis for this striking conservation of structure and function, we have sequenced the C. briggsae genome to a high-quality draft stage and compared it to the finished C. elegans sequence. We predict approximately 19,500 protein-coding genes in the C. briggsae genome, roughly the same as in C. elegans. Of these, 12,200 have clear C. elegans orthologs, a further 6,500 have one or more clearly detectable C. elegans homologs, and approximately 800 C. briggsae genes have no detectable matches in C. elegans. Almost all of the noncoding RNAs (ncRNAs) known are shared between the two species. The two genomes exhibit extensive colinearity, and the rate of divergence appears to be higher in the chromosomal arms than in the centers. Operons, a distinctive feature of C. elegans, are highly conserved in C. briggsae, with the arrangement of genes being preserved in 96% of cases. The difference in size between the C. briggsae (estimated at approximately 104 Mbp) and C. elegans (100.3 Mbp) genomes is almost entirely due to repetitive sequence, which accounts for 22.4% of the C. briggsae genome in contrast to 16.5% of the C. elegans genome. Few, if any, repeat families are shared, suggesting that most were acquired after the two species diverged or are undergoing rapid evolution. Coclustering the C. elegans and C. briggsae proteins reveals 2,169 protein families of two or more members. Most of these are shared between the two species, but some appear to be expanding or contracting, and there seem to be as many as several hundred novel C. briggsae gene families. The C. briggsae draft sequence will greatly improve the annotation of the C. elegans genome. Based on similarity to C. briggsae, we found strong evidence for 1,300 new C. elegans genes. In addition, comparisons of the two genomes will help to understand the evolutionary forces that mold nematode genomes.

...read moreread less

954 citations

Journal Article•DOI•

The UK10K project identifies rare variants in health and disease

[...]

Klaudia Walter¹, J L Min², Jie Huang¹, Lucy Crooks, Yasin Memari³, Shane A. McCarthy³, Perry Jrb.⁴, ChangJiang Xu⁴, Marta Futema⁵, Daniel Lawson², Valentina Iotchkova, Stephan Schiffels³, Audrey E. Hendricks⁶, Petr Danecek³, R Li¹, James A B Floyd⁷, Louise V. Wain², Louise V. Wain⁸, Inês Barroso³, Steve E. Humphries⁵, Matthew E. Hurles³, Eleftheria Zeggini³, Jeffrey C. Barrett³, Vincent Plagnol⁵, J. B. Richards⁴, Greenwood Cmt.², Nicholas J. Timpson², Richard Durbin³, Nicole Soranzo⁹ - Show less +25 more•Institutions (9)

Max Planck Society¹, University of Bristol², Wellcome Trust Sanger Institute³, McGill University⁴, University College London⁵, University of Colorado Denver⁶, Queen Mary University of London⁷, University of Leicester⁸, University of Cambridge⁹

01 Oct 2015-Nature

TL;DR: In extensively phenotyped cohorts, insights from sequencing whole genomes or exomes of nearly 10,000 individuals from population-based and disease collections are described and population structure and functional annotation of rare and low-frequency variants are described.

...read moreread less

Abstract: The contribution of rare and low-frequency variants to human traits is largely unexplored. Here we describe insights from sequencing whole genomes (low read depth, 7×) or exomes (high read depth, 80×) of nearly 10,000 individuals from population-based and disease collections. In extensively phenotyped cohorts we characterize over 24 million novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with levels of triglycerides (APOB), adiponectin (ADIPOQ) and low-density lipoprotein cholesterol (LDLR and RGAG1) from single-marker and rare variant aggregation tests. We describe population structure and functional annotation of rare and low-frequency variants, use the data to estimate the benefits of sequencing for association studies, and summarize lessons from disease-specific collections. Finally, we make available an extensive resource, including individual-level genetic and phenotypic data and web-based tools to facilitate the exploration of association results.

...read moreread less

948 citations

Journal Article•DOI•

Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing

[...]

Peter J. Campbell¹, Philip J. Stephens¹, Erin Pleasance¹, Sarah O’Meara¹, Heng Li¹, Thomas Santarius², Thomas Santarius¹, Lucy Stebbings¹, Catherine Leroy¹, Sarah Edkins¹, Claire Hardy¹, Jon W. Teague¹, Andrew Menzies¹, Ian Goodhead¹, Daniel J. Turner¹, C M Clee¹, Michael A. Quail¹, Antony V. Cox¹, Clive Gavin Brown¹, Richard Durbin¹, Matthew E. Hurles¹, Paul A.W. Edwards², Graham R. Bignell¹, Michael R. Stratton¹, P. Andrew Futreal¹ - Show less +21 more•Institutions (2)

Wellcome Trust Sanger Institute¹, University of Cambridge²

01 Jun 2008-Nature Genetics

TL;DR: The results demonstrate the feasibility of systematic, genome-wide characterization of rearrangements in complex human cancer genomes, raising the prospect of a new harvest of genes associated with cancer using this strategy.

...read moreread less

Abstract: Human cancers often carry many somatically acquired genomic rearrangements, some of which may be implicated in cancer development. However, conventional strategies for characterizing rearrangements are laborious and low-throughput and have low sensitivity or poor resolution. We used massively parallel sequencing to generate sequence reads from both ends of short DNA fragments derived from the genomes of two individuals with lung cancer. By investigating read pairs that did not align correctly with respect to each other on the reference human genome, we characterized 306 germline structural variants and 103 somatic rearrangements to the base-pair level of resolution. The patterns of germline and somatic rearrangement were markedly different. Many somatic rearrangements were from amplicons, although rearrangements outside these regions, notably including tandem duplications, were also observed. Some somatic rearrangements led to abnormal transcripts, including two from internal tandem duplications and two fusion transcripts created by interchromosomal rearrangements. Germline variants were predominantly mediated by retrotransposition, often involving AluY and LINE elements. The results demonstrate the feasibility of systematic, genome-wide characterization of rearrangements in complex human cancer genomes, raising the prospect of a new harvest of genes associated with cancer using this strategy.

...read moreread less

899 citations

Journal Article•DOI•

Inferring human population size and separation history from multiple genome sequences

[...]

Stephan Schiffels¹, Richard Durbin¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Aug 2014-Nature Genetics

TL;DR: Results from applying multiple sequentially Markovian coalescent (MSMC) to genome sequences from nine populations across the world suggest that the genetic separation of non-African ancestors from African Yoruban ancestors started long before 50,000 years ago and give information about human population history as recent as 2,000 Years ago.

...read moreread less

Abstract: The availability of complete human genome sequences from populations across the world has given rise to new population genetic inference methods that explicitly model ancestral relationships under recombination and mutation. So far, application of these methods to evolutionary history more recent than 20,000-30,000 years ago and to population separations has been limited. Here we present a new method that overcomes these shortcomings. The multiple sequentially Markovian coalescent (MSMC) analyzes the observed pattern of mutations in multiple individuals, focusing on the first coalescence between any two individuals. Results from applying MSMC to genome sequences from nine populations across the world suggest that the genetic separation of non-African ancestors from African Yoruban ancestors started long before 50,000 years ago and give information about human population history as recent as 2,000 years ago, including the bottleneck in the peopling of the Americas and separations within Africa, East Asia and Europe.

...read moreread less

866 citations

1
2
3
…
4
5
6
7
8
9
10
…
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

[...]

Stephen F. Altschul¹, Thomas L. Madden, Alejandro A. Schäffer¹, Jinghui Zhang, Zheng Zhang², Webb Miller², David J. Lipman - Show less +3 more•Institutions (2)

National Institutes of Health¹, Pennsylvania State University²

01 Sep 1997-Nucleic Acids Research

TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.

...read moreread less

Abstract: The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSIBLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.

...read moreread less

70,111 citations

Journal Article•DOI•

The Sequence Alignment/Map format and SAMtools

[...]

Heng Li¹, Bob Handsaker², Alec Wysoker², T. J. Fennell², Jue Ruan³, Nils Homer², Gabor T. Marth⁴, Gonçalo R. Abecasis², Richard Durbin¹ - Show less +5 more•Institutions (4)

Wellcome Trust Sanger Institute¹, University of California, Los Angeles², Chinese Academy of Sciences³, Boston College⁴

01 Aug 2009-Bioinformatics

TL;DR: SAMtools as discussed by the authors implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.

...read moreread less

Abstract: Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: [email protected]

...read moreread less

45,957 citations

Journal Article•DOI•

Fast and accurate short read alignment with Burrows–Wheeler transform

[...]

Heng Li¹, Richard Durbin¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jul 2009-Bioinformatics

TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.

...read moreread less

Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

...read moreread less

43,862 citations

Journal Article•DOI•

Fiji: an open-source platform for biological-image analysis

[...]

Johannes Schindelin¹, Ignacio Arganda-Carreras², Erwin Frise³, Verena Kaynig⁴, Mark Longair⁴, Tobias Pietzsch¹, Stephan Preibisch¹, Curtis Rueden⁵, Stephan Saalfeld¹, Benjamin Schmid¹, Jean-Yves Tinevez¹, Daniel J. White¹, Volker Hartenstein¹, Kevin W. Eliceiri⁵, Pavel Tomancak¹, Albert Cardona¹ - Show less +12 more•Institutions (5)

Max Planck Society¹, Massachusetts Institute of Technology², Lawrence Berkeley National Laboratory³, ETH Zurich⁴, University of Wisconsin-Madison⁵

01 Jul 2012-Nature Methods

TL;DR: Fiji is a distribution of the popular open-source software ImageJ focused on biological-image analysis that facilitates the transformation of new algorithms into ImageJ plugins that can be shared with end users through an integrated update system.

...read moreread less

Abstract: Fiji is a distribution of the popular open-source software ImageJ focused on biological-image analysis. Fiji uses modern software engineering practices to combine powerful software libraries with a broad range of scripting languages to enable rapid prototyping of image-processing algorithms. Fiji facilitates the transformation of new algorithms into ImageJ plugins that can be shared with end users through an integrated update system. We propose Fiji as a platform for productive collaboration between computer science and biology research communities.

...read moreread less

43,540 citations

Journal Article•DOI•

Trimmomatic: a flexible trimmer for Illumina sequence data

[...]

Anthony Bolger¹, Marc Lohse¹, Bjoern Usadel¹•Institutions (1)

Max Planck Society¹

01 Aug 2014-Bioinformatics

TL;DR: Timmomatic is developed as a more flexible and efficient preprocessing tool, which could correctly handle paired-end data and is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools, in all scenarios tested.

...read moreread less

Abstract: Motivation: Although many next-generation sequencing (NGS) read preprocessing tools already existed, we could not find any tool or combination of tools that met our requirements in terms of flexibility, correct handling of paired-end data and high performance. We have developed Trimmomatic as a more flexible and efficient preprocessing tool, which could correctly handle paired-end data. Results: The value of NGS read preprocessing is demonstrated for both reference-based and reference-free tasks. Trimmomatic is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools, in all scenarios tested. Availability and implementation: Trimmomatic is licensed under GPL V3. It is cross-platform (Java 1.5+ required) and available at http://www.usadellab.org/cms/index.php?page=trimmomatic Contact: ed.nehcaa-htwr.1oib@ledasu Supplementary information: Supplementary data are available at Bioinformatics online.

...read moreread less

39,291 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse