Author

Olivier Grisel

Bio: Olivier Grisel is an academic researcher from Université Paris-Saclay. The author has contributed to research in topics: Python (programming language) & Medicine. The author has an h-index of 11 and has co-authored 19 publications receiving 63,822 citations. Previous affiliations of Olivier Grisel include the French Institute for Research in Computer Science and Automation.

Papers
Posted Content · DOI
21 Apr 2018 · bioRxiv
TL;DR: In systematic data simulations and common medical datasets, the authors explore how statistical inference and pattern recognition can agree and diverge, applying the linear model both to identify significant contributing variables and to find the most predictive variable sets.
Abstract: In the 20th century many advances in biological knowledge and evidence-based medicine were supported by p-values and accompanying methods. In the early 21st century, ambitions towards precision medicine put a premium on detailed predictions for single individuals. The shift causes tension between traditional methods used to infer statistically significant group differences and burgeoning machine-learning tools suited to forecast an individual's future. This comparison applies the linear model for identifying significant contributing variables and for finding the most predictive variable sets. In systematic data simulations and common medical datasets, we explored how statistical inference and pattern recognition can agree and diverge. Across analysis scenarios, even small predictive performances typically coincided with finding underlying significant statistical relationships. However, even statistically strong findings with very low p-values shed little light on their value for achieving accurate prediction in the same dataset. More complete understanding of different ways to define "important" associations is a prerequisite for reproducible research findings that can serve to personalize clinical care.
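The tension the abstract describes can be illustrated with a small linear-model comparison. The sketch below uses simulated data (not the paper's datasets, and not its exact analysis): a weak but real effect produces a highly significant p-value while explaining only a small fraction of out-of-sample variance.

```python
import numpy as np
from scipy import stats
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=n)
# weak but real effect: the slope is nonzero, but noise dominates
y = 0.2 * x + rng.normal(size=n)

# classical inference: test whether the slope differs from zero
slope, intercept, rvalue, p_value, stderr = stats.linregress(x, y)

# pattern recognition: out-of-sample R^2 via cross-validation
r2_cv = cross_val_score(LinearRegression(), x.reshape(-1, 1), y,
                        cv=5, scoring="r2").mean()

print(f"p = {p_value:.1e}, cross-validated R^2 = {r2_cv:.3f}")
```

The p-value is far below conventional thresholds, yet the cross-validated R² stays small: statistical significance and predictive value answer different questions.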

16 citations

Journal Article · DOI
TL;DR: In this paper, the authors performed an observational multicenter retrospective cohort study to examine the association between psychiatric disorders and mortality among patients hospitalized for laboratory-confirmed COVID-19 at 36 Greater Paris University hospitals.
Abstract: Prior research suggests that psychiatric disorders could be linked to increased mortality among patients with COVID-19. However, whether all or specific psychiatric disorders are intrinsic risk factors of death in COVID-19, or whether these associations reflect the greater prevalence of medical risk factors in people with psychiatric disorders, has yet to be evaluated. We performed an observational multicenter retrospective cohort study to examine the association between psychiatric disorders and mortality among patients hospitalized for laboratory-confirmed COVID-19 at 36 Greater Paris University hospitals. Of 15,168 adult patients, 857 (5.7%) had an ICD-10 diagnosis of psychiatric disorder. Over a mean follow-up of 14.6 days (SD=17.9), death occurred in 326/857 (38.0%) patients with a diagnosis of psychiatric disorder versus 1,276/14,311 (8.9%) in patients without such a diagnosis (OR=6.27; 95%CI=5.40-7.28; p<0.01). When adjusting for age, sex, hospital, current smoking status, and medications according to compassionate use or as part of a clinical trial, this association remained significant (AOR=3.27; 95%CI=2.78-3.85; p<0.01). However, additional adjustments for obesity and number of medical conditions resulted in a non-significant association (AOR=1.02; 95%CI=0.84-1.23; p=0.86). Exploratory analyses following the same adjustments suggest that a diagnosis of mood disorders was significantly associated with reduced mortality, which might be explained by the use of antidepressants. These findings suggest that the increased risk of COVID-19-related mortality in individuals with psychiatric disorders hospitalized for COVID-19 might be explained by the greater number of medical conditions and the higher prevalence of obesity in this population, but not by the underlying psychiatric disease.
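The adjustment pattern in this abstract (a large crude odds ratio that collapses toward 1 after controlling for comorbidity burden) can be mimicked on simulated data. The sketch below is purely illustrative: the variable names, effect sizes, and sample size are assumptions, not the study's data or model.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(42)
n = 20000
# confounder: number of medical conditions
comorbidities = rng.poisson(1.0, size=n)
# exposure is more common among patients with more comorbidities
p_exposed = 1 / (1 + np.exp(-(-2.5 + 0.8 * comorbidities)))
exposed = rng.binomial(1, p_exposed)
# mortality depends on comorbidities only, not on the exposure itself
p_death = 1 / (1 + np.exp(-(-3.0 + 0.9 * comorbidities)))
death = rng.binomial(1, p_death)

def odds_ratio(X, y):
    # large C makes the fit close to unpenalized logistic regression
    model = LogisticRegression(C=1e6, max_iter=1000).fit(X, y)
    return np.exp(model.coef_[0][0])  # OR = exp(exposure coefficient)

crude = odds_ratio(exposed.reshape(-1, 1), death)
adjusted = odds_ratio(np.column_stack([exposed, comorbidities]), death)
print(f"crude OR = {crude:.2f}, adjusted OR = {adjusted:.2f}")
```

The crude odds ratio is well above 1 because of confounding, while the adjusted one sits near 1, mirroring the qualitative shape of the study's AOR results.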

12 citations

Posted Content · DOI
13 Aug 2018 · bioRxiv
TL;DR: The authors develop a bottom-up machine-learning strategy and provide a proof of principle in a multi-site clinical dataset, showing that meta-analytic cognitive priors can distinguish schizophrenia patients from controls using brain morphology and intrinsic functional connectivity.
Abstract: Schizophrenia is a devastating brain disorder that disturbs sensory perception, motor action, and abstract thought. Its clinical phenotype implies dysfunction of various mental domains, which has motivated a series of theories regarding the underlying pathophysiology. Aiming at a predictive benchmark of a catalogue of cognitive functions, we developed a bottom-up machine-learning strategy and provide a proof of principle in a multi-site clinical dataset (n=324). Existing neuroscientific knowledge on diverse cognitive domains was first condensed into neuro-topographical maps. We then examined how the ensuing meta-analytic cognitive priors can distinguish patients and controls using brain morphology and intrinsic functional connectivity. Some affected cognitive domains supported well-studied directions of research on auditory evaluation and social cognition. However, rarely suspected cognitive domains also emerged as disease-relevant, including self-oriented processing of bodily sensations in gustation and pain. Such algorithmic charting of the cognitive landscape can be used to make targeted recommendations for future mental health research.
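The pipeline described here (condense prior knowledge into topographical maps, project each subject's brain data onto them, then classify) can be sketched on synthetic data. Everything below is a toy stand-in: the map count, voxel count, and effect size are assumptions, not the paper's meta-analytic priors.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
n_subjects, n_voxels, n_domains = 324, 1000, 10

# stand-ins for meta-analytic cognitive priors: one map per domain
prior_maps = rng.normal(size=(n_domains, n_voxels))

# simulated brain data: patients differ along two of the domain maps
labels = rng.binomial(1, 0.5, size=n_subjects)
brain = rng.normal(size=(n_subjects, n_voxels))
brain += np.outer(labels, 0.05 * (prior_maps[0] + prior_maps[3]))

# project voxel data onto the domain maps, then classify
domain_scores = brain @ prior_maps.T / n_voxels
acc = cross_val_score(LogisticRegression(max_iter=1000),
                      domain_scores, labels, cv=5).mean()
print(f"cross-validated accuracy: {acc:.2f}")
```

Projecting onto domain maps reduces thousands of voxels to a handful of interpretable scores, so the classifier's weights can be read per cognitive domain — the spirit of "algorithmic charting of the cognitive landscape".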

8 citations

Journal Article · DOI
TL;DR: In this paper, the authors performed a retrospective multi-centre observational cohort study comprising 12,891 hospitalized patients aged 18 years or older with a diagnosis of SARS-CoV-2 infection confirmed by polymerase chain reaction between 1 January 2020 and 10 September 2020, and with at least one serum creatinine value recorded 1-365 days prior to admission.

8 citations

Book Chapter · DOI
14 Sep 2014
TL;DR: In this article, the authors estimate the amount of variance that is fit by a random effects subspace learned on other images, and show that a principal component regression estimator outperforms other regression models and that it fits a significant proportion (10% to 25%) of the between-subject variability.
Abstract: Inter-subject variability is a major hurdle for neuroimaging group-level inference, as it creates complex image patterns that are not captured by standard analysis models and jeopardizes the sensitivity of statistical procedures. A solution to this problem is to model random subject effects by using the redundant information conveyed by multiple imaging contrasts. In this paper, we introduce a novel analysis framework, where we estimate the amount of variance that is fit by a random effects subspace learned on other images; we show that a principal component regression estimator outperforms other regression models and that it fits a significant proportion (10% to 25%) of the between-subject variability. This proves for the first time that the accumulation of contrasts in each individual can provide the basis for more sensitive neuroimaging group analyses.
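The core estimator — learn a subspace on other subjects' images, then measure how much of a held-out image's variance that subspace fits — can be sketched as follows. The dimensions, noise level, and resulting variance fraction are illustrative assumptions, not the paper's (which reports 10% to 25%).

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(1)
n_subjects, n_voxels, n_latent = 40, 500, 5

# shared random-effects subspace plus subject-specific noise
basis = rng.normal(size=(n_latent, n_voxels))
loadings = rng.normal(size=(n_subjects, n_latent))
images = loadings @ basis + rng.normal(scale=2.0, size=(n_subjects, n_voxels))

# learn the subspace on all subjects but the last one
pca = PCA(n_components=n_latent).fit(images[:-1])
components = pca.components_          # shape (n_latent, n_voxels)

# regress the held-out image on the learned components, voxel-wise
target = images[-1]
reg = LinearRegression().fit(components.T, target)
explained = reg.score(components.T, target)  # proportion of variance fit
print(f"variance explained by the learned subspace: {explained:.2f}")
```

Because the subspace is estimated on other subjects, the explained variance is an honest measure of shared between-subject structure rather than overfitting to the held-out image.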

7 citations


Cited by
Proceedings Article · DOI
13 Aug 2016
TL;DR: XGBoost is a scalable end-to-end tree boosting system that introduces a sparsity-aware algorithm for sparse data and a weighted quantile sketch for approximate tree learning, and is widely used to achieve state-of-the-art results on many machine learning challenges.
Abstract: Tree boosting is a highly effective and widely used machine learning method. In this paper, we describe a scalable end-to-end tree boosting system called XGBoost, which is used widely by data scientists to achieve state-of-the-art results on many machine learning challenges. We propose a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning. More importantly, we provide insights on cache access patterns, data compression and sharding to build a scalable tree boosting system. By combining these insights, XGBoost scales beyond billions of examples using far fewer resources than existing systems.

14,872 citations

Proceedings Article · DOI
TL;DR: This paper proposes a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning and provides insights on cache access patterns, data compression and sharding to build a scalable tree boosting system called XGBoost.
Abstract: Tree boosting is a highly effective and widely used machine learning method. In this paper, we describe a scalable end-to-end tree boosting system called XGBoost, which is used widely by data scientists to achieve state-of-the-art results on many machine learning challenges. We propose a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning. More importantly, we provide insights on cache access patterns, data compression and sharding to build a scalable tree boosting system. By combining these insights, XGBoost scales beyond billions of examples using far fewer resources than existing systems.

13,333 citations

Journal Article · DOI
TL;DR: SciPy is an open-source scientific computing library for the Python programming language, with functionality spanning clustering, Fourier transforms, integration, interpolation, file I/O, linear algebra, image processing, orthogonal distance regression, minimization algorithms, signal processing, sparse matrix handling, computational geometry, and statistics.
Abstract: SciPy is an open source scientific computing library for the Python programming language. SciPy 1.0 was released in late 2017, about 16 years after the original version 0.1 release. SciPy has become a de facto standard for leveraging scientific algorithms in the Python programming language, with more than 600 unique code contributors, thousands of dependent packages, over 100,000 dependent repositories, and millions of downloads per year. This includes usage of SciPy in almost half of all machine learning projects on GitHub, and usage by high profile projects including LIGO gravitational wave analysis and creation of the first-ever image of a black hole (M87). The library includes functionality spanning clustering, Fourier transforms, integration, interpolation, file I/O, linear algebra, image processing, orthogonal distance regression, minimization algorithms, signal processing, sparse matrix handling, computational geometry, and statistics. In this work, we provide an overview of the capabilities and development practices of the SciPy library and highlight some recent technical developments.
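Two of the capability areas the abstract lists, integration and minimization, look like this in practice (a minimal example using SciPy's documented `integrate.quad` and `optimize.minimize` APIs):

```python
import numpy as np
from scipy import integrate, optimize

# numerical integration: the integral of sin(x) from 0 to pi equals 2
area, abserr = integrate.quad(np.sin, 0, np.pi)

# minimization: the Rosenbrock test function has its minimum at (1, 1)
result = optimize.minimize(optimize.rosen, x0=[0.0, 0.0])

print(f"integral = {area:.6f}, minimizer = {result.x}")
```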

12,774 citations

Proceedings Article · DOI
13 Aug 2016
TL;DR: In this paper, the authors propose LIME, a technique that explains the predictions of any classifier by learning an interpretable model locally around the prediction, together with a method for explaining models by presenting representative individual predictions and their explanations in a non-redundant way, framed as a submodular optimization problem.
Abstract: Despite widespread adoption, machine learning models remain mostly black boxes. Understanding the reasons behind predictions is, however, quite important in assessing trust, which is fundamental if one plans to take action based on a prediction, or when choosing whether to deploy a new model. Such understanding also provides insights into the model, which can be used to transform an untrustworthy model or prediction into a trustworthy one. In this work, we propose LIME, a novel explanation technique that explains the predictions of any classifier in an interpretable and faithful manner, by learning an interpretable model locally around the prediction. We also propose a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem. We demonstrate the flexibility of these methods by explaining different models for text (e.g. random forests) and image classification (e.g. neural networks). We show the utility of explanations via novel experiments, both simulated and with human subjects, on various scenarios that require trust: deciding if one should trust a prediction, choosing between models, improving an untrustworthy classifier, and identifying why a classifier should not be trusted.
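LIME's core idea — perturb around one instance, query the black box, and fit a proximity-weighted linear surrogate — can be implemented in a few lines. This is a minimal from-scratch sketch of that idea, not the `lime` package's API; the kernel width, perturbation scale, and data are illustrative assumptions.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 4))
y = (X[:, 0] + 0.1 * X[:, 1] > 0).astype(int)  # feature 0 dominates

black_box = RandomForestClassifier(random_state=0).fit(X, y)

def local_explanation(x, predict_proba, n_samples=2000, width=1.0):
    """Fit a weighted linear surrogate around one instance (LIME's core idea)."""
    perturbed = x + rng.normal(scale=0.5, size=(n_samples, x.size))
    preds = predict_proba(perturbed)[:, 1]
    # proximity kernel: perturbations close to x get more weight
    dists = np.linalg.norm(perturbed - x, axis=1)
    weights = np.exp(-(dists ** 2) / width ** 2)
    surrogate = Ridge(alpha=1.0).fit(perturbed, preds, sample_weight=weights)
    return surrogate.coef_  # local feature weights

coefs = local_explanation(np.zeros(4), black_box.predict_proba)
print("local feature weights:", np.round(coefs, 3))
```

Near the decision boundary the surrogate assigns by far the largest weight to feature 0, correctly recovering what drives the black box locally even though the forest itself is opaque.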

11,104 citations

Christopher M. Bishop
01 Jan 2006
TL;DR: This textbook covers probability distributions, linear models for regression and classification, neural networks, kernel methods, sparse kernel machines, graphical models, mixture models and EM, approximate inference, sampling methods, continuous latent variables, sequential data, and combining models in the context of machine learning.
Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations