Douglas B. Kell

Other affiliations: Max Planck Society, University of Wales, Heidelberg Institute for Theoretical Studies ...

Bio: Douglas B. Kell is an academic researcher at the University of Liverpool. He has contributed to research on topics including dielectrics and systems biology, has an h-index of 111, and has co-authored 634 publications that have received 50,335 citations. His previous affiliations include the Max Planck Society and the University of Wales.

Topics: Dielectric, Systems biology, Population, Metabolome, Fibrin ...

Papers published on a yearly basis (chart; years 1978–2023)

Papers

Journal Article•DOI•

A community-driven global reconstruction of human metabolism

Ines Thiele¹, Neil Swainston², Ronan M. T. Fleming¹, Andreas Hoppe³, Swagatika Sahoo¹, Maike K. Aurich¹, Hulda S. Haraldsdóttir¹, Monica L. Mo⁴, Ottar Rolfsson¹, Miranda D. Stobbe⁵,⁶, Stefan Gretar Thorleifsson¹, Rasmus Agren⁷, Christian Bölling³, Sergio Bordel⁷, Arvind K. Chavali⁸, Paul D. Dobson⁹, Warwick B. Dunn¹⁰,², Lukas Endler¹¹, David Hala¹², Michael Hucka¹³, Duncan Hull², Daniel Jameson², Neema Jamshidi⁴, Jon J. Jonsson¹, Nick Juty¹⁴, Sarah M. Keating¹⁴, Intawat Nookaew⁷, Nicolas Le Novère¹⁴,¹⁵, Naglis Malys²,¹⁶, Alexander Mazein¹⁷, Jason A. Papin⁸, Nathan D. Price¹⁸, Evgeni Selkov, Martin I. Sigurdsson¹, Evangelos Simeonidis¹⁸,¹⁹, Nikolaus Sonnenschein²⁰, Kieran Smallbone², Anatoly Sorokin²¹,¹⁷, Johannes H. G. M. van Beek²², Dieter Weichart², Igor Goryanin¹⁷, Jens Nielsen⁷, Hans V. Westerhoff, Douglas B. Kell², Pedro Mendes²,²³, Bernhard O. Palsson¹,⁴ • Institutions (23)

University of Iceland¹, University of Manchester², Charité³, University of California, San Diego⁴, Netherlands Bioinformatics Centre⁵, University of Amsterdam⁶, Chalmers University of Technology⁷, University of Virginia⁸, University of Sheffield⁹, Central Manchester University Hospitals NHS Foundation Trust¹⁰, University of Vienna¹¹, University of North Texas¹², California Institute of Technology¹³, European Bioinformatics Institute¹⁴, Babraham Institute¹⁵, University of Warwick¹⁶, University of Edinburgh¹⁷, Institute for Systems Biology¹⁸, University of Luxembourg¹⁹, Jacobs University Bremen²⁰, Russian Academy of Sciences²¹, VU University Amsterdam²², Virginia Bioinformatics Institute²³

01 May 2013-Nature Biotechnology

TL;DR: Recon 2, a community-driven, consensus 'metabolic reconstruction', is described, which is the most comprehensive representation of human metabolism that is applicable to computational modeling and has improved topological and functional features.

Abstract: Multiple models of human metabolism have been reconstructed, but each represents only a subset of our knowledge. Here we describe Recon 2, a community-driven, consensus 'metabolic reconstruction', which is the most comprehensive representation of human metabolism that is applicable to computational modeling. Compared with its predecessors, the reconstruction has improved topological and functional features, including ~2× more reactions and ~1.7× more unique metabolites. Using Recon 2 we predicted changes in metabolite biomarkers for 49 inborn errors of metabolism with 77% accuracy when compared to experimental data. Mapping metabolomic data and drug information onto Recon 2 demonstrates its potential for integrating and analyzing diverse data types. Using protein expression data, we automatically generated a compendium of 65 cell type–specific models, providing a basis for manual curation or investigation of cell-specific metabolic properties. Recon 2 will facilitate many future biomedical studies and is freely available at http://humanmetabolism.org/.

1,002 citations

Journal Article•DOI•

Computational cluster validation in post-genomic data analysis

Julia Handl¹, Joshua Knowles¹, Douglas B. Kell¹•Institutions (1)

University of Manchester¹

01 Aug 2005-Bioinformatics

TL;DR: This review surveys computational techniques for validating clustering results, with a particular focus on their application to post-genomic data analysis.

Abstract: Motivation: The discovery of novel biological knowledge from the ab initio analysis of post-genomic data relies upon the use of unsupervised processing methods, in particular clustering techniques. Much recent research in bioinformatics has therefore been focused on the transfer of clustering methods introduced in other scientific fields and on the development of novel algorithms specifically designed to tackle the challenges posed by post-genomic data. The partitions returned by a clustering algorithm are commonly validated using visual inspection and concordance with prior biological knowledge; whether the clusters actually correspond to the real structure in the data is somewhat less frequently considered. Suitable computational cluster validation techniques are available in the general data-mining literature, but have been given only a fraction of the same attention in bioinformatics. Results: This review paper aims to familiarize the reader with the battery of techniques available for the validation of clustering results, with a particular focus on their application to post-genomic data analysis. Synthetic and real biological datasets are used to demonstrate the benefits, and also some of the perils, of analytical cluster validation. Availability: The software used in the experiments is available at http://dbkweb.ch.umist.ac.uk/handl/clustervalidation/ Contact: J.Handl@postgrad.manchester.ac.uk Supplementary information: Enlarged colour plots are provided in the Supplementary Material, which is available at http://dbkweb.ch.umist.ac.uk/handl/clustervalidation/

884 citations

Journal Article•DOI•

The Systems Biology Graphical Notation

Nicolas Le Novère, Michael Hucka¹, Huaiyu Mi², Stuart L. Moodie³, Falk Schreiber⁴,⁵, Anatoly Sorokin³, Emek Demir⁶, Katja Wegner⁷, Mirit I. Aladjem⁸, Sarala M. Wimalaratne⁹, Frank T Bergman¹⁰, Ralph Gauges¹¹, Peter Ghazal³, Hideya Kawaji, Lu Li, Yukiko Matsuoka, Alice Villéger¹², Sarah Elizabeth Boyd¹³, Laurence Calzone¹⁴, Mélanie Courtot¹⁵, Ugur Dogrusoz¹⁶, Tom C. Freeman³, Akira Funahashi¹⁷, Samik Ghosh, Akiya Jouraku¹⁷, Sohoung Kim⁸, Fedor A. Kolpakov, Augustin Luna⁸, Sven Sahle¹¹, Esther Schmidt, Steven Watterson³,¹⁶, Guanming Wu¹⁸, Igor Goryanin³, Douglas B. Kell¹², Chris Sander⁶, Herbert M. Sauro¹⁰, Jacky L. Snoep¹⁹, Kurt W. Kohn⁸, Hiroaki Kitano²⁰ • Institutions (20)

California Institute of Technology¹, SRI International², University of Edinburgh³, Leibniz Association⁴, Martin Luther University of Halle-Wittenberg⁵, Memorial Sloan Kettering Cancer Center⁶, University of Hertfordshire⁷, National Institutes of Health⁸, University of Auckland⁹, University of Washington¹⁰, Heidelberg University¹¹, University of Manchester¹², Monash University¹³, Mines ParisTech¹⁴, University of British Columbia¹⁵, Bilkent University¹⁶, Keio University¹⁷, Ontario Institute for Cancer Research¹⁸, Stellenbosch University¹⁹, Okinawa Institute of Science and Technology²⁰

07 Aug 2009-Nature Biotechnology

TL;DR: The Systems Biology Graphical Notation (SBGN), a visual language developed by a community of biochemists, modelers and computer scientists, is presented; its authors expect it to foster efficient and accurate representation, visualization, storage, exchange and reuse of information on all kinds of biological knowledge.

Abstract: Circuit diagrams and Unified Modeling Language diagrams are just two examples of standard visual languages that help accelerate work by promoting regularity, removing ambiguity and enabling software tool support for communication of complex information. Ironically, despite having one of the highest ratios of graphical to textual information, biology still lacks standard graphical notations. The recent deluge of biological knowledge makes addressing this deficit a pressing concern. Toward this goal, we present the Systems Biology Graphical Notation (SBGN), a visual language developed by a community of biochemists, modelers and computer scientists. SBGN consists of three complementary languages: process diagram, entity relationship diagram and activity flow diagram. Together they enable scientists to represent networks of biochemical interactions in a standard, unambiguous way. We believe that SBGN will foster efficient and accurate representation, visualization, storage, exchange and reuse of information on all kinds of biological knowledge, from gene regulation, to metabolism, to cellular signaling.

880 citations

Journal Article•DOI•

Flow cytometry and cell sorting of heterogeneous microbial populations: the importance of single-cell analyses.

Hazel M. Davey¹, Douglas B. Kell¹•Institutions (1)

Aberystwyth University¹

01 Dec 1996-Microbiological Research

TL;DR: Flow cytometry is a technique that allows cells to be analyzed rapidly and individually. It permits the quantitative analysis of microbial heterogeneity and offers many advantages over conventional measurements for both routine and more exploratory analyses of microbial properties.

875 citations

Journal Article•DOI•

Statistical strategies for avoiding false discoveries in metabolomics and related experiments

David Broadhurst¹, Douglas B. Kell¹•Institutions (1)

University of Manchester¹

12 Jan 2007-Metabolomics

TL;DR: A list of some of the simpler checks that might improve one’s confidence that a candidate biomarker is not simply a statistical artefact is provided, and a series of preferred tests and visualisation tools that can assist readers and authors in assessing papers are suggested.

Abstract: Many metabolomics, and other high-content or high-throughput, experiments are set up such that the primary aim is the discovery of biomarker metabolites that can discriminate, with a certain level of certainty, between nominally matched ‘case’ and ‘control’ samples. However, it is unfortunately very easy to find markers that are apparently persuasive but that are in fact entirely spurious, and there are well-known examples in the proteomics literature. The main types of danger are not entirely independent of each other, but include bias, inadequate sample size (especially relative to the number of metabolite variables and to the required statistical power to prove that a biomarker is discriminant), excessive false discovery rate due to multiple hypothesis testing, inappropriate choice of particular numerical methods, and overfitting (generally caused by the failure to perform adequate validation and cross-validation). Many studies fail to take these into account, and thereby fail to discover anything of true significance (despite their claims). We summarise these problems, and provide pointers to a substantial existing literature that should assist in the improved design and evaluation of metabolomics experiments, thereby allowing robust scientific conclusions to be drawn from the available data. We provide a list of some of the simpler checks that might improve one’s confidence that a candidate biomarker is not simply a statistical artefact, and suggest a series of preferred tests and visualisation tools that can assist readers and authors in assessing papers. These tools can be applied to individual metabolites by using multiple univariate tests performed in parallel across all metabolite peaks. They may also be applied to the validation of multivariate models. We stress in particular that classical p-values such as “p < 0.05”, that are often used in biomedicine, are far too optimistic when multiple tests are done simultaneously (as in metabolomics). 
Ultimately it is desirable that all data and metadata are available electronically, as this allows the entire community to assess conclusions drawn from them. These analyses apply to all high-dimensional ‘omics’ datasets.
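
The abstract's point that "p < 0.05" is far too optimistic under multiple testing can be illustrated with a little arithmetic. The sketch below is not taken from the paper: it assumes independent tests, and the Bonferroni and Benjamini-Hochberg corrections are chosen here only as two widely used examples. All function names are hypothetical.

```python
def familywise_error_rate(alpha: float, n_tests: int) -> float:
    """Chance of at least one false positive among n independent tests.

    Assumes every null hypothesis is true and the tests are independent.
    """
    return 1.0 - (1.0 - alpha) ** n_tests


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test threshold that keeps the family-wise error rate at or below alpha."""
    return alpha / n_tests


def benjamini_hochberg(p_values, q=0.05):
    """Return the indices of hypotheses rejected at false discovery rate q.

    Step-up procedure: find the largest rank k whose sorted p-value is
    at most k * q / m, then reject the k smallest p-values.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    largest_k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * q / m:
            largest_k = rank
    return sorted(order[:largest_k])


if __name__ == "__main__":
    # With 100 metabolite peaks each tested at 0.05, a spurious "hit" is near-certain.
    print(familywise_error_rate(0.05, 100))
    print(bonferroni_threshold(0.05, 100))
    print(benjamini_hochberg([0.039, 0.001, 0.5, 0.008]))
```

Bonferroni controls the chance of any false positive and is conservative when there are many peaks; Benjamini-Hochberg instead controls the expected fraction of false discoveries, which is often the more practical target in exploratory screens.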

747 citations

Cited by

Changes in the switching rate of Plasmodium var genes lead to antigenic variation [English] / Paul H, Robert P, Christodoulou Z, et al // Proc Natl Acad Sci U S A

宁北芳, 朱淮民

28 Jul 2005

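TL;DR: PfPMP1 interacts with one or more receptors on infected erythrocytes, dendritic cells and the placenta, and plays a key role in adhesion and immune evasion.

Abstract: Antigenic variation allows many pathogenic microorganisms to evade the host immune response. Plasmodium falciparum erythrocyte surface protein 1 (PfPMP1), expressed on the surface of infected erythrocytes, interacts with one or more receptors on infected erythrocytes, endothelial cells, dendritic cells and the placenta, and plays a key role in adhesion and immune evasion. The var gene family encodes about 60 members per haploid genome, and initiating transcription of different var gene variants provides the molecular basis for antigenic variation.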

18,940 citations

Journal Article•DOI•

One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products

Kirill A. Datsenko¹, Barry L. Wanner•Institutions (1)

Purdue University¹

06 Jun 2000-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: A simple and highly efficient method to disrupt chromosomal genes in Escherichia coli in which PCR primers provide the homology to the targeted gene(s), which should be widely useful, especially in genome analysis of E. coli and other bacteria.

Abstract: We have developed a simple and highly efficient method to disrupt chromosomal genes in Escherichia coli in which PCR primers provide the homology to the targeted gene(s). In this procedure, recombination requires the phage lambda Red recombinase, which is synthesized under the control of an inducible promoter on an easily curable, low copy number plasmid. To demonstrate the utility of this approach, we generated PCR products by using primers with 36- to 50-nt extensions that are homologous to regions adjacent to the gene to be inactivated and template plasmids carrying antibiotic resistance genes that are flanked by FRT (FLP recognition target) sites. By using the respective PCR products, we made 13 different disruptions of chromosomal genes. Mutants of the arcB, cyaA, lacZYA, ompR-envZ, phnR, pstB, pstCA, pstS, pstSCAB-phoU, recA, and torSTRCAD genes or operons were isolated as antibiotic-resistant colonies after the introduction into bacteria carrying a Red expression plasmid of synthetic (PCR-generated) DNA. The resistance genes were then eliminated by using a helper plasmid encoding the FLP recombinase which is also easily curable. This procedure should be widely useful, especially in genome analysis of E. coli and other bacteria because the procedure can be done in wild-type cells.

14,389 citations

Journal Article•

The Design and Analysis of Experiments

Margaret J. Robertson

01 Jun 1953-Yale Journal of Biology and Medicine

TL;DR: This book by a teacher of statistics (as well as a consultant for "experimenters") is a comprehensive study of the philosophical background for the statistical design of experiment.

Abstract: THE DESIGN AND ANALYSIS OF EXPERIMENTS. By Oscar Kempthorne. New York, John Wiley and Sons, Inc., 1952. 631 pp. $8.50. This book by a teacher of statistics (as well as a consultant for "experimenters") is a comprehensive study of the philosophical background for the statistical design of experiment. It is necessary to have some facility with algebraic notation and manipulation to be able to use the volume intelligently. The problems are presented from the theoretical point of view, without such practical examples as would be helpful for those not acquainted with mathematics. The mathematical justification for the techniques is given. As a somewhat advanced treatment of the design and analysis of experiments, this volume will be interesting and helpful for many who approach statistics theoretically as well as practically. With emphasis on the "why," and with description given broadly, the author relates the subject matter to the general theory of statistics and to the general problem of experimental inference. MARGARET J. ROBERTSON

13,333 citations

Journal Article•DOI•

Machine learning

Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

01 Dec 1996-ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. 
Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

13,246 citations

Journal Article•DOI•

Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal

Jianjiong Gao¹, Bulent Arman Aksoy¹, Ugur Dogrusoz², Gideon Dresdner¹, Benjamin Gross¹, S. Onur Sumer¹, Yichao Sun¹, Anders Jacobsen¹, Rileen Sinha¹, Erik Larsson³, Ethan Cerami¹, Chris Sander¹, Nikolaus Schultz¹ • Institutions (3)

Memorial Sloan Kettering Cancer Center¹, Bilkent University², University of Gothenburg³

02 Apr 2013-Science Signaling

TL;DR: A practical guide to the analysis and visualization features of the cBioPortal for Cancer Genomics, which makes complex cancer genomics profiles accessible to researchers and clinicians without requiring bioinformatics expertise, thus facilitating biological discoveries.

Abstract: The cBioPortal for Cancer Genomics (http://cbioportal.org) provides a Web resource for exploring, visualizing, and analyzing multidimensional cancer genomics data. The portal reduces molecular profiling data from cancer tissues and cell lines into readily understandable genetic, epigenetic, gene expression, and proteomic events. The query interface combined with customized data storage enables researchers to interactively explore genetic alterations across samples, genes, and pathways and, when available in the underlying data, to link these to clinical outcomes. The portal provides graphical summaries of gene-level data from multiple platforms, network visualization and analysis, survival analysis, patient-centric queries, and software programmatic access. The intuitive Web interface of the portal makes complex cancer genomics profiles accessible to researchers and clinicians without requiring bioinformatics expertise, thus facilitating biological discoveries. Here, we provide a practical guide to the analysis and visualization features of the cBioPortal for Cancer Genomics.

10,947 citations
