scispace - formally typeset
Search or ask a question
Author

Douglas B. Kell

Bio: Douglas B. Kell is an academic researcher from University of Liverpool. The author has contributed to research in topics: Dielectric & Systems biology. The author has an hindex of 111, co-authored 634 publications receiving 50335 citations. Previous affiliations of Douglas B. Kell include Max Planck Society & University of Wales.


Papers
More filters
Journal ArticleDOI
TL;DR: This is the first demonstration, within microbiology, of KANNs as an unsupervised clustering technique which has the potential to group pyrolysis mass spectra both automatically and relatively objectively.
Abstract: Curie-point pyrolysis mass spectra were obtained from reference Propionibacterium strains and canine isolates. Artificial neural networks (ANNs) were trained by supervised learning (with the back-propagation algorithm) to recognize these strains from their pyrolysis mass spectra; all the strains isolated from dogs were identified as human wild type P. acnes. This is an important nosological discovery, and demonstrates that the combination of pyrolysis mass spectrometry and ANNs provides an objective, rapid and accurate identification technique. Bacteria isolated from different biopsy specimens from the same dog were found to be separate strains of P. acnes, demonstrating a within-animal variation in microflora. The classification of the canine isolates by Kohonen artificial neural networks (KANNs) was compared with the classical multivariate techniques of canonical variates analysis and hierarchical cluster analysis, and found to give similar results. This is the first demonstration, within microbiology, of KANNs as an unsupervised clustering technique which has the potential to group pyrolysis mass spectra both automatically and relatively objectively.

98 citations

Journal ArticleDOI
TL;DR: This work extended the closed-loop optimization system that was previously developed for one-dimensional GC-TOF-MS to comprehensive two-dimensional (GCxGC) chromatography, and improved the number of metabolites observable relative to those in 1D GC by some 3-fold.
Abstract: Metabolomics seeks to measure potentially all the metabolites in a biological sample, and consequently, we need to develop and optimize methods to increase significantly the number of metabolites we can detect. We extended the closed-loop (iterative, automated) optimization system that we had previously developed for one-dimensional GC-TOF-MS (O'Hagan, S.; Dunn, W. B.; Brown, M.; Knowles, J. D.; Kell, D. B. Anal. Chem. 2005, 77, 290-303) to comprehensive two-dimensional (GCxGC) chromatography. The heuristic approach used was a multiobjective version of the efficient global optimization algorithm. In just 300 automated runs, we improved the number of metabolites observable relative to those in 1D GC by some 3-fold. The optimized conditions allowed for the detection of over 4000 raw peaks, of which some 1800 were considered to be real metabolite peaks and not impurities or peaks with a signal/noise ratio of less than 5. A variety of computational methods served to explain the basis for the improvement. This closed-loop optimization strategy is a generic and powerful approach for the optimization of any analytical instrumentation.

98 citations

Journal ArticleDOI
TL;DR: Wavelet regression is used as an extension of the more traditional Fourier regression (where the modelling is performed in the frequency domain without taking into consideration any of the information in the time domain) and truncation of weight vectors in PLS was the most effective method for selecting variables.

98 citations

Journal ArticleDOI
TL;DR: Using competition experiments in continuous cultures grown in different nutrient environments, genes that show haploinsufficiency phenotypes or haploproficient phenotypes are identified and this chromosome determines a yeast's mating type, and the concentration of haplOinsufficient genes there may be a mechanism to prevent its loss.
Abstract: Using competition experiments in continuous cultures grown in different nutrient environments (glucose limited, ammonium limited, phosphate limited and white grape juice), we identified genes that show haploinsufficiency phenotypes (reduced growth rate when hemizygous) or haploproficiency phenotypes (increased growth rate when hemizygous). Haploproficient genes (815, 1,194, 733 and 654 in glucose-limited, ammonium-limited, phosphate-limited and white grape juice environments, respectively) frequently show that phenotype in a specific environmental context. For instance, genes encoding components of the ubiquitination pathway or the proteasome show haploproficiency in nitrogen-limited conditions where protein conservation may be beneficial. Haploinsufficiency is more likely to be observed in all environments, as is the case with genes determining polar growth of the cell. Haploproficient genes seem randomly distributed in the genome, whereas haploinsufficient genes (685, 765, 1,277 and 217 in glucose-limited, ammonium-limited, phosphate-limited and white grape juice environments, respectively) are over-represented on chromosome III. This chromosome determines a yeast's mating type, and the concentration of haploinsufficient genes there may be a mechanism to prevent its loss.

98 citations

Journal ArticleDOI
TL;DR: This is the first study to identify, in an unbiased manner, a series of small-molecular-weight metabolites that effectively detect preeclampsia in plasma and provides new insights into the pathology of this condition and raises the possibility of the development of a predictive test.
Abstract: In a previous study, the ability of gas chromatography time-of-flight mass spectrometry to detect potential metabolic biomarkers in preeclampsia was demonstrated. In this study, the authors sought to validate their preliminary findings in an entirely different patient cohort using a complementary, novel, and powerful combination of analytical tools (ultra performance liquid chromatography and LTQ Orbitrap mass spectrometry system). Eight metabolites that appeared in the authors' previous patient cohort were identified as being statistically significant (P < .01) as discriminatory biomarkers. The chemical identities of these 8 metabolites were established using authentic chemical standards. They included uric acid, 2-oxoglutarate, glutamate, and alanine. This is the first study to identify, in an unbiased manner, a series of small-molecular-weight metabolites that effectively detect preeclampsia in plasma. The identity of these metabolites provides new insights into the pathology of this condition and raises the possibility of the development of a predictive test.

97 citations


Cited by
More filters
28 Jul 2005
TL;DR: PfPMP1)与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作�ly.
Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1(PfPMP1)与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用,在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员,通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

18,940 citations

Journal ArticleDOI
TL;DR: A simple and highly efficient method to disrupt chromosomal genes in Escherichia coli in which PCR primers provide the homology to the targeted gene(s), which should be widely useful, especially in genome analysis of E. coli and other bacteria.
Abstract: We have developed a simple and highly efficient method to disrupt chromosomal genes in Escherichia coli in which PCR primers provide the homology to the targeted gene(s). In this procedure, recombination requires the phage lambda Red recombinase, which is synthesized under the control of an inducible promoter on an easily curable, low copy number plasmid. To demonstrate the utility of this approach, we generated PCR products by using primers with 36- to 50-nt extensions that are homologous to regions adjacent to the gene to be inactivated and template plasmids carrying antibiotic resistance genes that are flanked by FRT (FLP recognition target) sites. By using the respective PCR products, we made 13 different disruptions of chromosomal genes. Mutants of the arcB, cyaA, lacZYA, ompR-envZ, phnR, pstB, pstCA, pstS, pstSCAB-phoU, recA, and torSTRCAD genes or operons were isolated as antibiotic-resistant colonies after the introduction into bacteria carrying a Red expression plasmid of synthetic (PCR-generated) DNA. The resistance genes were then eliminated by using a helper plasmid encoding the FLP recombinase which is also easily curable. This procedure should be widely useful, especially in genome analysis of E. coli and other bacteria because the procedure can be done in wild-type cells.

14,389 citations

Journal Article
TL;DR: This book by a teacher of statistics (as well as a consultant for "experimenters") is a comprehensive study of the philosophical background for the statistical design of experiment.
Abstract: THE DESIGN AND ANALYSIS OF EXPERIMENTS. By Oscar Kempthorne. New York, John Wiley and Sons, Inc., 1952. 631 pp. $8.50. This book by a teacher of statistics (as well as a consultant for \"experimenters\") is a comprehensive study of the philosophical background for the statistical design of experiment. It is necessary to have some facility with algebraic notation and manipulation to be able to use the volume intelligently. The problems are presented from the theoretical point of view, without such practical examples as would be helpful for those not acquainted with mathematics. The mathematical justification for the techniques is given. As a somewhat advanced treatment of the design and analysis of experiments, this volume will be interesting and helpful for many who approach statistics theoretically as well as practically. With emphasis on the \"why,\" and with description given broadly, the author relates the subject matter to the general theory of statistics and to the general problem of experimental inference. MARGARET J. ROBERTSON

13,333 citations

Journal ArticleDOI
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

13,246 citations

Journal ArticleDOI
TL;DR: A practical guide to the analysis and visualization features of the cBioPortal for Cancer Genomics, which makes complex cancer genomics profiles accessible to researchers and clinicians without requiring bioinformatics expertise, thus facilitating biological discoveries.
Abstract: The cBioPortal for Cancer Genomics (http://cbioportal.org) provides a Web resource for exploring, visualizing, and analyzing multidimensional cancer genomics data. The portal reduces molecular profiling data from cancer tissues and cell lines into readily understandable genetic, epigenetic, gene expression, and proteomic events. The query interface combined with customized data storage enables researchers to interactively explore genetic alterations across samples, genes, and pathways and, when available in the underlying data, to link these to clinical outcomes. The portal provides graphical summaries of gene-level data from multiple platforms, network visualization and analysis, survival analysis, patient-centric queries, and software programmatic access. The intuitive Web interface of the portal makes complex cancer genomics profiles accessible to researchers and clinicians without requiring bioinformatics expertise, thus facilitating biological discoveries. Here, we provide a practical guide to the analysis and visualization features of the cBioPortal for Cancer Genomics.

10,947 citations