Author

François Husson

Other affiliations: École nationale supérieure agronomique de Rennes, Centre national de la recherche scientifique

Bio: François Husson is an academic researcher from Agrocampus Ouest. The author has contributed to research on missing data and principal component analysis. The author has an h-index of 26 and has co-authored 85 publications, which have received 7559 citations. Previous affiliations of François Husson include École nationale supérieure agronomique de Rennes and Centre national de la recherche scientifique.

Papers published on a yearly basis

[Chart: papers per year; years shown: 1998, 1999, 2001, 2003–2020, 2022, 2023]

Papers

Journal Article

FactoMineR: An R Package for Multivariate Analysis

Sébastien Lê, Julie Josse¹, François Husson•Institutions (1)

Agrocampus Ouest¹

18 Mar 2008-Journal of Statistical Software

TL;DR: FactoMineR is an R package dedicated to multivariate data analysis that can take into account different types of variables (quantitative or categorical), different kinds of structure in the data, and supplementary information (supplementary individuals and variables).

Abstract: In this article, we present FactoMineR, an R package dedicated to multivariate data analysis. The main feature of this package is the possibility to take into account different types of variables (quantitative or categorical), different types of structure on the data (a partition on the variables, a hierarchy on the variables, a partition on the individuals) and finally supplementary information (supplementary individuals and variables). Moreover, the dimensions issued from the different exploratory data analyses can be automatically described by quantitative and/or categorical variables. Numerous graphics are also available with various options. Finally, a graphical user interface is implemented within the Rcmdr environment in order to provide a user-friendly package.

6,472 citations

Journal Article

missMDA: A Package for Handling Missing Values in Multivariate Data Analysis

Julie Josse, François Husson

04 Apr 2016-Journal of Statistical Software

TL;DR: The missMDA package performs principal component methods on incomplete data sets, aiming to obtain scores, loadings and graphical representations despite missing values. It can also perform single imputation to complete data involving continuous, categorical and mixed variables.

Abstract: We present the R package missMDA which performs principal component methods on incomplete data sets, aiming to obtain scores, loadings and graphical representations despite missing values. Package methods include principal component analysis for continuous variables, multiple correspondence analysis for categorical variables, factorial analysis on mixed data for both continuous and categorical variables, and multiple factor analysis for multi-table data. Furthermore, missMDA can be used to perform single imputation to complete data involving continuous, categorical and mixed variables. A multiple imputation method is also available. In the principal component analysis framework, variability across different imputations is represented by confidence areas around the row and column positions on the graphical outputs. This allows assessment of the credibility of results obtained from incomplete data sets.

758 citations

Book

Exploratory Multivariate Analysis by Example Using R

François Husson, Sébastien Lê, Jérôme Pagès¹•Institutions (1)

Agrocampus Ouest¹

15 Nov 2010

TL;DR: Table of contents (excerpt). Principal Component Analysis (PCA): Data - Notation - Examples; Objectives; Studying Individuals; Studying Variables; Relationships between the Two Representations NI and NK; Interpreting the Data; Implementation with FactoMineR; Additional Results.

Abstract: Table of contents.

Principal Component Analysis (PCA): Data - Notation - Examples; Objectives; Studying Individuals; Studying Variables; Relationships between the Two Representations NI and NK; Interpreting the Data; Implementation with FactoMineR; Additional Results; Example: The Decathlon Dataset; Example: The Temperature Dataset; Example of Genomic Data: The Chicken Dataset.

Correspondence Analysis (CA): Data - Notation - Examples; Objectives and the Independence Model; Fitting the Clouds; Interpreting the Data; Supplementary Elements (= Illustrative); Implementation with FactoMineR; CA and Textual Data Processing; Example: The Olympic Games Dataset; Example: The White Wines Dataset; Example: The Causes of Mortality Dataset.

Multiple Correspondence Analysis (MCA): Data - Notation - Examples; Objectives; Defining Distances between Individuals and Distances between Categories; CA on the Indicator Matrix; Interpreting the Data; Implementation with FactoMineR; Addendum; Example: The Survey on the Perception of Genetically Modified Organisms; Example: The Sorting Task Dataset.

Clustering: Data - Issues; Formalising the Notion of Similarity; Constructing an Indexed Hierarchy; Ward's Method; Direct Search for Partitions: K-means Algorithm; Partitioning and Hierarchical Clustering; Clustering and Principal Component Methods; Example: The Temperature Dataset; Example: The Tea Dataset; Dividing Quantitative Variables into Classes.

Appendix: Percentage of Inertia Explained by the First Component or by the First Plane; R Software.

Bibliography of Software Packages; Bibliography; Index.

454 citations

Journal Article

Testing the significance of the RV coefficient

Julie Josse¹, Jérôme Pagès¹, François Husson¹•Institutions (1)

Centre national de la recherche scientifique¹

01 Sep 2008-Computational Statistics & Data Analysis

TL;DR: The current approximations (normal approximation, a log-transformation and Pearson type III approximation) are discussed and a new one is described: an Edgeworth expansion.

210 citations

Handling missing values in exploratory multivariate data analysis methods

Julie Josse, François Husson

14 Dec 2012

TL;DR: A regularized iterative PCA algorithm to provide point estimates of the principal axes and components and to overcome the major issue of overfitting is described and implemented in the R package missMDA.

Abstract: This paper is a written version of the talk Julie Josse delivered at the 44th Journées de Statistique (Bruxelles, 2012), when she was awarded the Marie-Jeanne Laurent-Duhamel prize for her Ph.D. dissertation by the French Statistical Society. It gives an overview of some results from Julie Josse and François Husson's papers, as well as new challenges in the field of handling missing values in exploratory multivariate data analysis methods, especially in principal component analysis (PCA). First, we describe a regularized iterative PCA algorithm that provides point estimates of the principal axes and components and overcomes the major issue of overfitting. Then, we give insight into the variance of the parameters using a nonparametric multiple imputation procedure. Finally, we discuss the problem of choosing the number of dimensions and detail cross-validation approximation criteria. The proposed methodology is implemented in the R package missMDA.

210 citations

Cited by

Journal Article

Multiple Imputation for Nonresponse in Surveys

Roger A. Sugden¹•Institutions (1)

Goldsmiths, University of London¹

01 May 1988-Journal of The Royal Statistical Society Series A-statistics in Society

TL;DR: A review of the book Multiple Imputation for Nonresponse in Surveys by D. B. Rubin (Wiley, 1987).

Abstract: 25. Multiple Imputation for Nonresponse in Surveys. By D. B. Rubin. ISBN 0 471 08705 X. Wiley, Chichester, 1987. 258 pp. £30.25.

3,216 citations

Journal Article

Statistical Analysis with Missing Data

Martin G. Gibson

01 Mar 1989-The Statistician

3,152 citations

Journal Article

ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap

Tauno Metsalu¹, Jaak Vilo¹•Institutions (1)

University of Tartu¹

01 Jul 2015-Nucleic Acids Research

TL;DR: ClustVis is a web tool that aims to provide an intuitive user interface for creating Principal Component Analysis and heatmap plots. It is freely available at http://biit.cs.ut.ee/clustvis/.

Abstract: The Principal Component Analysis (PCA) is a widely used method of reducing the dimensionality of high-dimensional data, often followed by visualizing two of the components on the scatterplot. Although widely used, the method is lacking an easy-to-use web interface that scientists with little programming skills could use to make plots of their own data. The same applies to creating heatmaps: it is possible to add conditional formatting for Excel cells to show colored heatmaps, but for more advanced features such as clustering and experimental annotations, more sophisticated analysis tools have to be used. We present a web tool called ClustVis that aims to have an intuitive user interface. Users can upload data from a simple delimited text file that can be created in a spreadsheet program. It is possible to modify data processing methods and the final appearance of the PCA and heatmap plots by using drop-down menus, text boxes, sliders etc. Appropriate defaults are given to reduce the time needed by the user to specify input parameters. As an output, users can download PCA plot and heatmap in one of the preferred file formats. This web server is freely available at http://biit.cs.ut.ee/clustvis/.

2,293 citations

Journal Article

Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment

Xochitl C. Morgan¹, Timothy L. Tickle¹,², Harry Sokol¹,³, Dirk Gevers², Kathryn L. Devaney¹, Doyle V. Ward², Joshua A Reyes¹, Samir A. Shah⁴, Neal Leleiko⁴, Scott B. Snapper⁵, Athos Bousvaros⁵, Joshua R. Korzenik¹,⁵, Bruce E. Sands⁶, Ramnik J. Xavier¹,², Curtis Huttenhower¹,²•Institutions (6)

Harvard University¹, Massachusetts Institute of Technology², University of Paris³, Brown University⁴, Brigham and Women's Hospital⁵, Icahn School of Medicine at Mount Sinai⁶

26 Sep 2012-Genome Biology

TL;DR: The microbiome of ileal Crohn's disease was notable for increases in virulence and secretion pathways, and the first insights into community-wide microbial processes and pathways that underpin IBD pathogenesis are provided.

Abstract: Background: The inflammatory bowel diseases (IBD) Crohn’s disease and ulcerative colitis result from alterations in intestinal microbes and the immune system. However, the precise dysfunctions of microbial metabolism in the gastrointestinal microbiome during IBD remain unclear. We analyzed the microbiota of intestinal biopsies and stool samples from 231 IBD and healthy subjects by 16S gene pyrosequencing and followed up a subset using shotgun metagenomics. Gene and pathway composition were assessed, based on 16S data from phylogenetically-related reference genomes, and associated using sparse multivariate linear modeling with medications, environmental factors, and IBD status. Results: Firmicutes and Enterobacteriaceae abundances were associated with disease status as expected, but also with treatment and subject characteristics. Microbial function, though, was more consistently perturbed than composition, with 12% of analyzed pathways changed compared with 2% of genera. We identified major shifts in oxidative stress pathways, as well as decreased carbohydrate metabolism and amino acid biosynthesis in favor of nutrient transport and uptake. The microbiome of ileal Crohn’s disease was notable for increases in virulence and secretion pathways.

2,189 citations

Journal Article

The global burden of pathogens and pests on major food crops.

Serge Savary¹, Laetitia Willocquet¹, Sarah J. Pethybridge², Paul D. Esker³, Neil McRoberts⁴, Andrew Nelson⁵•Institutions (5)

University of Toulouse¹, Cornell University², Pennsylvania State University³, University of California, Davis⁴, University of Twente⁵

04 Feb 2019-Nature Ecology and Evolution

TL;DR: An expert elicitation survey estimates yield losses for the five major food crops worldwide, suggesting that the highest losses are associated with food-deficit regions with fast-growing populations and frequently with emerging or re-emerging pests and diseases.

Abstract: Crop pathogens and pests reduce the yield and quality of agricultural production. They cause substantial economic losses and reduce food security at household, national and global levels. Quantitative, standardized information on crop losses is difficult to compile and compare across crops, agroecosystems and regions. Here, we report on an expert-based assessment of crop health, and provide numerical estimates of yield losses on an individual pathogen and pest basis for five major crops globally and in food security hotspots. Our results document losses associated with 137 pathogens and pests associated with wheat, rice, maize, potato and soybean worldwide. Our yield loss (range) estimates at a global level and per hotspot for wheat (21.5% (10.1–28.1%)), rice (30.0% (24.6–40.9%)), maize (22.5% (19.5–41.1%)), potato (17.2% (8.1–21.0%)) and soybean (21.4% (11.0–32.4%)) suggest that the highest losses are associated with food-deficit regions with fast-growing populations, and frequently with emerging or re-emerging pests and diseases. Our assessment highlights differences in impacts among crop pathogens and pests and among food security hotspots. This analysis contributes critical information to prioritize crop health management to improve the sustainability of agroecosystems in delivering services to societies. An expert elicitation survey estimates yield losses for the five major food crops worldwide, suggesting that the highest losses are associated with food-deficit regions with fast-growing populations and frequently with emerging or re-emerging pests and diseases.

1,376 citations
