Richard Durbin

Other affiliations: Wellcome Trust Sanger Institute, University of Manchester, Wellcome Trust ...

Bio: Richard Durbin is an academic researcher at the University of Cambridge. He has contributed to research on the topics Genome and Population, has an h-index of 125, and has co-authored 319 publications that have received 207,192 citations. Previous affiliations include the Wellcome Trust Sanger Institute and the University of Manchester.

Topics: Genome, Population, Genomics, Gene, Sequence assembly ...

Papers published on a yearly basis (chart; years listed span 1959 to 2023, per-year counts not shown)

Papers

Journal Article

TCTEX1D2 mutations underlie Jeune asphyxiating thoracic dystrophy with impaired retrograde intraflagellar transport

Miriam Schmidts¹, Yuqing Hou², Claudio Cortes³, Dorus A. Mans⁴ +183 more•Institutions (37)

05 Jun 2015-Nature Communications

TL;DR: TCTEX1D2 mutations causing Jeune asphyxiating thoracic dystrophy with partially penetrant inheritance are identified and defined as an integral component of the evolutionarily conserved retrograde IFT machinery.

Abstract: The analysis of individuals with ciliary chondrodysplasias can shed light on sensitive mechanisms controlling ciliogenesis and cell signalling that are essential to embryonic development and survival. Here we identify TCTEX1D2 mutations causing Jeune asphyxiating thoracic dystrophy with partially penetrant inheritance. Loss of TCTEX1D2 impairs retrograde intraflagellar transport (IFT) in humans and the protist Chlamydomonas, accompanied by destabilization of the retrograde IFT dynein motor. We thus define TCTEX1D2 as an integral component of the evolutionarily conserved retrograde IFT machinery. In complex with several IFT dynein light chains, it is required for correct vertebrate skeletal formation but may be functionally redundant under certain conditions.

53 citations

Journal Article

Tree-based maximal likelihood substitution matrices and hidden Markov models

Graeme Mitchison¹, Richard Durbin¹•Institutions (1)

Laboratory of Molecular Biology¹

01 Dec 1995-Journal of Molecular Evolution

TL;DR: The concept of a hidden Markov model (HMM) is extended to evolutionary trees, giving a tree-HMM that allows what may be loosely regarded as learnable affine-type gap penalties for alignments. An alignment algorithm is defined that is guaranteed to increase the likelihood of the model but fails to find global optima for realistic sequence sets.

Abstract: There has been considerable interest in the problem of making maximum likelihood (ML) evolutionary trees which allow insertions and deletions. This problem is partly one of formulation: how does one define a probabilistic model for such trees which treats insertion and deletion in a biologically plausible manner? A possible answer to this question is proposed here by extending the concept of a hidden Markov model (HMM) to evolutionary trees. The model, called a tree-HMM, allows what may be loosely regarded as learnable affine-type gap penalties for alignments. These penalties are expressed in HMMs as probabilities of transitions between states. In the tree-HMM, this idea is given an evolutionary embodiment by defining trees of transitions. Just as the probability of a tree composed of ungapped sequences is computed, by Felsenstein's method, using matrices representing the probabilities of substitutions of residues along the edges of the tree, so the probabilities in a tree-HMM are computed by substitution matrices for both residues and transitions. How to define these matrices by a ML procedure using an algorithm that learns from a database of protein sequences is shown here. Given these matrices, one can define a tree-HMM likelihood for a set of sequences, assuming a particular tree topology and an alignment of the sequences to the model. If one could efficiently find the alignment which maximizes (or comes close to maximizing) this likelihood, then one could search for the optimal tree topology for the sequences. An alignment algorithm is defined here which, given a particular tree topology, is guaranteed to increase the likelihood of the model. Unfortunately, it fails to find global optima for realistic sequence sets. Thus further research is needed to turn the tree-HMM into a practical phylogenetic tool.

53 citations

Journal Article

Alfresco—A Workbench for Comparative Genomic Sequence Analysis

Niclas Jareborg¹, Richard Durbin•Institutions (1)

Wellcome Trust¹

01 Aug 2000-Genome Research

TL;DR: Using Java, this work has developed a new visualization tool that allows effective comparative genome sequence analysis and presents the analysis of two unannotated orthologous genomic sequences from human and mouse containing parts of the UTY locus.

Abstract: Comparative analysis of genomic sequences provides a powerful tool for identifying regions of potential biologic function; by comparing corresponding regions of genomes from suitable species, protein coding or regulatory regions can be identified by their homology. This requires the use of several specific types of computational analysis tools. Many programs exist for these types of analysis; not many exist for overall view/control of the results, which is necessary for large-scale genomic sequence analysis. Using Java, we have developed a new visualization tool that allows effective comparative genome sequence analysis. The program handles a pair of sequences from putatively homologous regions in different species. Results from various different existing external analysis programs, such as database searching, gene prediction, repeat masking, and alignment programs, are visualized and used to find corresponding functional sequence domains in the two sequences. The user interacts with the program through a graphic display of the genome regions, in which an independently scrollable and zoomable symbolic representation of the sequences is shown. As an example, the analysis of two unannotated orthologous genomic sequences from human and mouse containing parts of the UTY locus is presented.

52 citations

Journal Article

The physical maps for sequencing human chromosomes 1, 6, 9, 10, 13, 20 and X.

David R. Bentley¹, Panagiotis Deloukas¹, Andrew Dunham¹, Lisa French¹, Simon G. Gregory¹, Sean Humphray¹, Andrew J. Mungall¹, Mark T. Ross¹, Nigel P. Carter¹, Ian Dunham¹, Carol Scott¹, K. J. Ashcroft¹, A. L. Atkinson¹, K. Aubin¹, David Beare¹, Graeme Bethel¹, N. Brady¹, J. C. Brook¹, D. C. Burford¹, W. D. Burrill¹, C. Burrows¹, Adam Butler¹, C. Carder¹, J. J. Catanese², C M Clee¹, S. M. Clegg¹, V. Cobley¹, A. J. Coffey¹, Charlotte G. Cole¹, John E. Collins¹, J. S. Conquer¹, R. A. Cooper¹, K. M. Culley¹, Elisabeth Dawson¹, F. L. Dearden¹, Richard Durbin¹, P. J. De Jong², P. D. Dhami¹, M. E. Earthrowl¹, Carol A. Edwards¹, R Evans¹, Christopher J. Gillson¹, J. Ghori¹, L D Green¹, Rhian Gwilliam¹, K. S. Halls¹, S. Hammond¹, G. L. Harper¹, R. W. Heathcott¹, Jane L. Holden¹, E. Holloway¹, B. L. Hopkins¹, P. J. Howard¹, Gareth R. Howell¹, E. J. Huckle¹, Jaime Hughes¹, P. J. Hunt¹, Sarah E. Hunt¹, M. Izmajlowicz¹, C. A. Jones¹, Soumi Joseph¹, G. Laird¹, Cordelia Langford¹, M. H. Lehvaslaiho¹, M.A. Leversha¹, Owen T. McCann¹, Louise McDonald¹, Jennifer McDowall¹, G. L. Maslen¹, D. Mistry¹, Nicholas K. Moschonas³, Vassos Neocleous⁴, D. M. Pearson¹, K. J. Phillips¹, K. M. Porter¹, S. R. Prathalingam¹, Y. H. Ramsey¹, S. A. Ranby¹, C. M. Rice¹, Jane Rogers¹, L. J. Rogers¹, Theologia Sarafidou³, D. J. Scott¹, G. J. Sharp¹, C. J. Shaw-Smith¹, Luc J. Smink¹, Carol Soderlund¹, E. C. Sotheran¹, Helen E. Steingruber¹, John Sulston¹, A. Taylor¹, Rohan Taylor¹, A. A. Thorpe¹, E. J. Tinsley¹, Georgina Warry¹, Adam Whittaker¹, Pamela Whittaker¹, S. H. Williams¹, T. E. Wilmer¹, Richard Wooster¹, C. L. Wright¹•Institutions (4)

Wellcome Trust¹, Boston Children's Hospital², University of Crete³, The Cyprus Institute of Neurology and Genetics⁴

15 Feb 2001-Nature

TL;DR: By measuring the remaining gaps, this work can assess chromosome length and coverage in sequenced clones and establish the long-range organization of the maps early in the project.

Abstract: We constructed maps for eight chromosomes (1, 6, 9, 10, 13, 20, X and (previously) 22), representing one-third of the genome, by building landmark maps, isolating bacterial clones and assembling contigs. By this approach, we could establish the long-range organization of the maps early in the project, and all contig extension, gap closure and problem-solving was simplified by containment within local regions. The maps currently represent more than 94% of the euchromatic (gene-containing) regions of these chromosomes in 176 contigs, and contain 96% of the chromosome-specific markers in the human gene map. By measuring the remaining gaps, we can assess chromosome length and coverage in sequenced clones.

50 citations

Journal Article

Complete vertebrate mitogenomes reveal widespread repeats and gene duplications.

Giulio Formenti¹, Arang Rhie², Jennifer Balacco¹, Bettina Haase¹, Jacquelyn Mountcastle¹, Olivier Fedrigo¹, Samara Brown¹, Marco Rosario Capodiferro³, Farooq O. Al-Ajli⁴,⁵, Roberto Ambrosini⁶, Peter Houde⁷, Sergey Koren², Karen Oliver⁸, Michelle Smith⁸, Jason Skelton⁸, Emma Betteridge⁸, Jale Dolucan⁸, Craig Corton⁸, Iliana Bista⁸,⁹, James Torrance⁸, Alan Tracey⁸, Jonathan Wood⁸, Marcela Uliano-Silva⁸, Kerstin Howe⁸, Shane A. McCarthy⁸,⁹, Sylke Winkler¹⁰, Woori Kwak, Jonas Korlach¹¹, Arkarachai Fungtammasan, Daniel Fordham, Vania Costa, Simon Mayes, Matteo Chiara⁶, David S. Horner⁶, Eugene W. Myers¹⁰, Richard Durbin⁸,⁹, Alessandro Achilli³, Edward L. Braun¹², Adam M. Phillippy², Erich D. Jarvis¹•Institutions (12)

Rockefeller University¹, National Institutes of Health², University of Pavia³, Monash University Malaysia Campus⁴, Qatar Airways⁵, University of Milan⁶, New Mexico State University⁷, Wellcome Trust Sanger Institute⁸, University of Cambridge⁹, Max Planck Society¹⁰, Pacific Biosciences¹¹, University of Florida¹²

29 Apr 2021-Genome Biology

TL;DR: mitoVGP is a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes. It incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100–300 bp, Illumina) reads, and produced complete mitogenome assemblies for 100 vertebrate species of the VGP.

Abstract: Modern sequencing technologies should make the assembly of the relatively small mitochondrial genomes an easy undertaking. However, few tools exist that address mitochondrial assembly directly. As part of the Vertebrate Genomes Project (VGP) we develop mitoVGP, a fully automated pipeline for similarity-based identification of mitochondrial reads and de novo assembly of mitochondrial genomes that incorporates both long (> 10 kbp, PacBio or Nanopore) and short (100–300 bp, Illumina) reads. Our pipeline leads to successful complete mitogenome assemblies of 100 vertebrate species of the VGP. We observe that tissue type and library size selection have considerable impact on mitogenome sequencing and assembly. Comparing our assemblies to purportedly complete reference mitogenomes based on short-read sequencing, we identify errors, missing sequences, and incomplete genes in those references, particularly in repetitive regions. Our assemblies also identify novel gene region duplications. The presence of repeats and duplications in over half of the species herein assembled indicates that their occurrence is a principle of mitochondrial structure rather than an exception, shedding new light on mitochondrial genome evolution and organization. Our results indicate that even in the “simple” case of vertebrate mitogenomes the completeness of many currently available reference sequences can be further improved, and caution should be exercised before claiming the complete assembly of a mitogenome, particularly from short reads alone.

48 citations


Cited by

Journal Article

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Stephen F. Altschul¹, Thomas L. Madden, Alejandro A. Schäffer¹, Jinghui Zhang, Zheng Zhang², Webb Miller², David J. Lipman•Institutions (2)

National Institutes of Health¹, Pennsylvania State University²

01 Sep 1997-Nucleic Acids Research

TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.

Abstract: The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSI-BLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.

70,111 citations

Journal Article

The Sequence Alignment/Map format and SAMtools

Heng Li¹, Bob Handsaker², Alec Wysoker², T. J. Fennell², Jue Ruan³, Nils Homer², Gabor T. Marth⁴, Gonçalo R. Abecasis², Richard Durbin¹•Institutions (4)

Wellcome Trust Sanger Institute¹, University of California, Los Angeles², Chinese Academy of Sciences³, Boston College⁴

01 Aug 2009-Bioinformatics

TL;DR: SAMtools implements various utilities for post-processing alignments in the SAM format, such as an indexer, a variant caller and an alignment viewer, and thus provides universal tools for processing read alignments.

Abstract: Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net

45,957 citations

Journal Article

Fast and accurate short read alignment with Burrows–Wheeler transform

Heng Li¹, Richard Durbin¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jul 2009-Bioinformatics

TL;DR: The Burrows-Wheeler Alignment tool (BWA) is a new read alignment package based on backward search with the Burrows–Wheeler Transform (BWT). It efficiently aligns short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.

Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net

43,862 citations

Journal Article

Fiji: an open-source platform for biological-image analysis

Johannes Schindelin¹, Ignacio Arganda-Carreras², Erwin Frise³, Verena Kaynig⁴, Mark Longair⁴, Tobias Pietzsch¹, Stephan Preibisch¹, Curtis Rueden⁵, Stephan Saalfeld¹, Benjamin Schmid¹, Jean-Yves Tinevez¹, Daniel J. White¹, Volker Hartenstein¹, Kevin W. Eliceiri⁵, Pavel Tomancak¹, Albert Cardona¹•Institutions (5)

Max Planck Society¹, Massachusetts Institute of Technology², Lawrence Berkeley National Laboratory³, ETH Zurich⁴, University of Wisconsin-Madison⁵

01 Jul 2012-Nature Methods

TL;DR: Fiji is a distribution of the popular open-source software ImageJ focused on biological-image analysis that facilitates the transformation of new algorithms into ImageJ plugins that can be shared with end users through an integrated update system.

Abstract: Fiji is a distribution of the popular open-source software ImageJ focused on biological-image analysis. Fiji uses modern software engineering practices to combine powerful software libraries with a broad range of scripting languages to enable rapid prototyping of image-processing algorithms. Fiji facilitates the transformation of new algorithms into ImageJ plugins that can be shared with end users through an integrated update system. We propose Fiji as a platform for productive collaboration between computer science and biology research communities.

43,540 citations

Journal Article

Trimmomatic: a flexible trimmer for Illumina sequence data

Anthony Bolger¹, Marc Lohse¹, Bjoern Usadel¹•Institutions (1)

Max Planck Society¹

01 Aug 2014-Bioinformatics

TL;DR: Trimmomatic is developed as a more flexible and efficient preprocessing tool that correctly handles paired-end data, and is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools in all scenarios tested.

Abstract: Motivation: Although many next-generation sequencing (NGS) read preprocessing tools already existed, we could not find any tool or combination of tools that met our requirements in terms of flexibility, correct handling of paired-end data and high performance. We have developed Trimmomatic as a more flexible and efficient preprocessing tool, which could correctly handle paired-end data. Results: The value of NGS read preprocessing is demonstrated for both reference-based and reference-free tasks. Trimmomatic is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools, in all scenarios tested. Availability and implementation: Trimmomatic is licensed under GPL V3. It is cross-platform (Java 1.5+ required) and available at http://www.usadellab.org/cms/index.php?page=trimmomatic Contact: usadel@bio1.rwth-aachen.de Supplementary information: Supplementary data are available at Bioinformatics online.

39,291 citations
