Author

Philippe Leray

Bio: Philippe Leray is an academic researcher from the University of Nantes. The author has contributed to research on topics including Bayesian networks and graphical models. The author has an h-index of 17 and has co-authored 151 publications receiving 1,253 citations. Previous affiliations of Philippe Leray include Duke University and the École polytechnique de l'université de Nantes.


Papers
Journal ArticleDOI
TL;DR: This paper first briefly introduces baseline statistical methods used in regression and classification, then describes families of methods developed specifically for neural networks, and compares representative methods on different test problems.
Abstract: The observed features of a given phenomenon are not all equally informative: some may be noisy, others correlated or irrelevant. The purpose of feature selection is to select a set of features pertinent to a given task. This is a complex process, but it is an important issue in many fields. In neural networks, feature selection has been studied for the last ten years, using conventional and original methods. This paper is a review of neural network approaches to feature selection. We first briefly introduce baseline statistical methods used in regression and classification. We then describe families of methods which have been developed specifically for neural networks. Representative methods are then compared on different test problems.
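To make the filter idea concrete, here is a minimal sketch of one baseline statistical approach such reviews cover: ranking features by absolute Pearson correlation with the target and keeping the top k. The function name and toy data are illustrative, not taken from the paper.

```python
import numpy as np

def correlation_filter(X, y, k):
    """Rank features by absolute Pearson correlation with the target
    and return the indices of the top k (a simple baseline filter)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    Xc = X - X.mean(axis=0)          # center each feature column
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    scores = np.abs(Xc.T @ yc) / np.where(denom == 0, 1.0, denom)
    return np.argsort(scores)[::-1][:k], scores

# Toy data: feature 0 is informative, feature 1 is pure noise,
# feature 2 is a slightly noisy copy of feature 0 (correlated).
rng = np.random.default_rng(0)
x0 = rng.normal(size=200)
X = np.column_stack([x0, rng.normal(size=200), x0 + 0.1 * rng.normal(size=200)])
y = 2.0 * x0 + 0.05 * rng.normal(size=200)
top, scores = correlation_filter(X, y, k=2)
```

A filter like this ignores feature interactions and redundancy (it happily keeps both correlated copies), which is exactly the limitation that motivates the wrapper and network-specific methods the review surveys.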

126 citations

01 Jan 2004
TL;DR: The Bayes Net Toolbox for Matlab, introduced by Murphy (2004), offers functions for both using and learning Bayesian networks, but this toolbox is not 'state of the art' as regards structural learning methods, so the SLP package is proposed.
Abstract: Bayesian networks are a formalism for probabilistic reasoning that has grown increasingly popular for tasks such as classification in data mining. In some situations, the structure of the Bayesian network can be given by an expert. If not, retrieving it automatically from a database of cases is an NP-hard problem, notably because of the complexity of the search space. In the last decade, numerous methods have been introduced to learn the network's structure automatically, by simplifying the search space or by using a heuristic in the search space. Most methods deal with completely observed data, but some can deal with incomplete data. The Bayes Net Toolbox for Matlab, introduced by Murphy (2004), offers functions for both using and learning Bayesian networks. But this toolbox is not 'state of the art' as regards structural learning methods. This is why we propose the SLP package.
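As a hedged illustration of the score-based family of structure learning methods the abstract alludes to (not the SLP or BNT API), here is a minimal BIC scorer for binary-variable networks; a search procedure would compare such scores across candidate parent sets.

```python
import itertools
import math
import numpy as np

def bic_score(data, parents_of):
    """BIC score of a discrete (binary) Bayesian network structure.
    data: (n, d) array of 0/1 values; parents_of: {node: tuple_of_parents}.
    Higher is better: log-likelihood minus a complexity penalty."""
    n, d = data.shape
    score = 0.0
    for v in range(d):
        ps = parents_of.get(v, ())
        # Group rows by parent configuration and count outcomes of v.
        for config in itertools.product([0, 1], repeat=len(ps)):
            mask = np.ones(n, dtype=bool)
            for p, val in zip(ps, config):
                mask &= data[:, p] == val
            m = int(mask.sum())
            if m == 0:
                continue
            for val in (0, 1):
                c = int((data[mask, v] == val).sum())
                if c:
                    score += c * math.log(c / m)
        # Penalty: one free parameter per parent configuration.
        score -= 0.5 * math.log(n) * (2 ** len(ps))
    return score

# Toy check: data generated with X0 -> X1 should favor the true edge.
rng = np.random.default_rng(1)
x0 = rng.integers(0, 2, size=500)
x1 = np.where(rng.random(500) < 0.9, x0, 1 - x0)  # X1 copies X0 90% of the time
data = np.column_stack([x0, x1])
s_edge = bic_score(data, {1: (0,)})
s_empty = bic_score(data, {})
```

A hill-climbing search would repeatedly apply the single edge addition, deletion, or reversal that most improves this score; the NP-hardness mentioned above comes from the super-exponential number of candidate structures.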

123 citations

Journal ArticleDOI
TL;DR: This review studies the latent tree model, a particular type of probabilistic graphical model that deserves attention because its simple tree structure allows simple and efficient inference, while its latent variables capture complex relationships.
Abstract: In data analysis, latent variables play a central role because they help provide powerful insights into a wide variety of phenomena, ranging from biological to human sciences. The latent tree model, a particular type of probabilistic graphical model, deserves attention. Its simple structure - a tree - allows simple and efficient inference, while its latent variables capture complex relationships. In the past decade, the latent tree model has been subject to significant theoretical and methodological developments. In this review, we propose a comprehensive study of this model. First we summarize key ideas underlying the model. Second we explain how it can be efficiently learned from data. Third we illustrate its use within three types of applications: latent structure discovery, multidimensional clustering, and probabilistic inference. Finally, we conclude and give promising directions for future research in this field.

94 citations

Journal ArticleDOI
TL;DR: An accurate model of dependencies between genetic markers is presented, based on a forest of hierarchical latent class models, a particular class of probabilistic graphical models that offers a well-suited framework for dealing with the fuzzy nature of linkage disequilibrium blocks.
Abstract: Background: Discovering the genetic basis of common genetic diseases in the human genome represents a public health issue. However, the dimensionality of the genetic data (up to 1 million genetic markers) and its complexity make the statistical analysis a challenging task.

87 citations

Journal Article
TL;DR: An algorithm is introduced that actively adds the results of experiments so that arcs can be directed during learning, and this approach is shown to learn a causal Bayesian network optimally with respect to a number of decision criteria.
Abstract: We discuss a decision-theoretic approach to learning causal Bayesian networks from observational data and experiments. We use the information in observational data to learn a completed partially directed acyclic graph using a structure learning technique, and we try to discover the directions of the remaining edges by means of experiments. We show that our approach learns a causal Bayesian network optimally with respect to a number of decision criteria. Our method makes it possible to assign costs to each experiment and each measurement. We introduce an algorithm that actively adds the results of experiments so that arcs can be directed during learning. A numerical example is given as a demonstration of the techniques.
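The core decision-theoretic idea, choosing the experiment that orients the most remaining undirected edges per unit cost, can be sketched as follows. This is a deliberate simplification of the paper's criteria: a real implementation would also propagate further orientations (e.g. via Meek's rules) after each experiment, which this toy ignores.

```python
def best_intervention(undirected_edges, costs):
    """Greedy experiment selection: an intervention on node v reveals the
    orientation of every undirected edge incident to v, so pick the node
    with the best oriented-edges-per-cost ratio. A simplified sketch; the
    paper's decision criteria are richer (e.g. per-measurement costs)."""
    gain = {}
    for u, v in undirected_edges:
        gain[u] = gain.get(u, 0) + 1
        gain[v] = gain.get(v, 0) + 1
    return max(gain, key=lambda node: gain[node] / costs.get(node, 1.0))

# Three mutually undirected edges; node 0 is cheap, node 2 is expensive.
choice = best_intervention([(0, 1), (0, 2), (1, 2)],
                           costs={0: 1.0, 1: 2.0, 2: 5.0})
```

Here every node would orient two edges, so the cheapest node wins; iterating this selection until no undirected edges remain gives a simple active-learning loop.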

73 citations


Cited by
Journal ArticleDOI
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. 
Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).
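The mail-filtering example can be made concrete with a tiny learned filter. The sketch below is a generic multinomial naive Bayes classifier on toy data, not any particular mail system's implementation:

```python
import math
from collections import Counter

def train_nb(messages):
    """Train a tiny multinomial naive Bayes model over word counts.
    messages: list of (text, label) pairs with label in {"spam", "ham"}."""
    counts = {"spam": Counter(), "ham": Counter()}
    docs = Counter()
    for text, label in messages:
        docs[label] += 1
        counts[label].update(text.lower().split())
    vocab = set(counts["spam"]) | set(counts["ham"])
    return counts, docs, vocab

def classify(text, model):
    """Pick the label with the highest log-posterior for the message."""
    counts, docs, vocab = model
    total = sum(docs.values())
    best, best_lp = None, -math.inf
    for label in ("spam", "ham"):
        lp = math.log(docs[label] / total)          # class prior
        n = sum(counts[label].values())
        for w in text.lower().split():
            # Laplace smoothing so unseen words don't zero the product.
            lp += math.log((counts[label][w] + 1) / (n + len(vocab)))
        if lp > best_lp:
            best, best_lp = label, lp
    return best

model = train_nb([
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("meeting at noon", "ham"),
    ("lunch at noon tomorrow", "ham"),
])
```

Retraining the model on each user's accepted and rejected messages is exactly the per-user customization the paragraph describes: the "rules" are the learned word statistics, maintained automatically.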

13,246 citations

Journal Article
TL;DR: New methods are proposed to evaluate variable subset relevance with a view to variable selection; the criterion based on the weight vector derivative achieves good results and performs consistently well over the datasets used.
Abstract: We propose new methods to evaluate variable subset relevance with a view to variable selection. Relevance criteria are derived from Support Vector Machines and are based on the weight vector ||w||^2 or on the sensitivity of generalization error bounds with respect to a variable. Experiments on linear and non-linear toy problems and on real-world datasets have been carried out to assess the effectiveness of these criteria. Results show that the criterion based on the weight vector derivative achieves good results and performs consistently well over the datasets we used.
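A minimal sketch of the weight-based relevance idea: train a linear SVM and rank variables by |w_i|, since a variable with a near-zero weight contributes little to ||w||^2 or to the decision function. This uses a hand-rolled Pegasos-style subgradient solver on toy data, not the authors' code:

```python
import numpy as np

def linear_svm_weights(X, y, lam=0.01, steps=60000, seed=0):
    """Linear SVM via stochastic subgradient descent on the hinge loss
    with L2 regularization (Pegasos-style). Returns the weight vector w;
    for variable selection, rank features by |w_i|."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for t in range(1, steps + 1):
        i = rng.integers(n)
        eta = 1.0 / (lam * t)              # decaying Pegasos step size
        if y[i] * (X[i] @ w) < 1:          # margin violated: hinge subgradient
            w = (1 - eta * lam) * w + eta * y[i] * X[i]
        else:
            w = (1 - eta * lam) * w        # regularization shrinkage only
    return w

# Toy data: the label depends only on feature 0; feature 1 is noise.
rng = np.random.default_rng(42)
X = rng.normal(size=(300, 2))
y = np.sign(X[:, 0])
w = linear_svm_weights(X, y)
```

After training, |w[0]| dominates |w[1]|, so a backward-elimination wrapper would discard feature 1 first; the paper's derivative-based criterion refines this basic ranking.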

699 citations

Journal ArticleDOI
TL;DR: It is found that non-causal feature selection methods cannot be interpreted causally even when they achieve excellent predictivity, so only local causal techniques should be used when insight into causal structure is sought.
Abstract: We present an algorithmic framework for learning local causal structure around target variables of interest in the form of direct causes/effects and Markov blankets applicable to very large data sets with relatively small samples. The selected feature sets can be used for causal discovery and classification. The framework (Generalized Local Learning, or GLL) can be instantiated in numerous ways, giving rise to both existing state-of-the-art as well as novel algorithms. The resulting algorithms are sound under well-defined sufficient conditions. In a first set of experiments we evaluate several algorithms derived from this framework in terms of predictivity and feature set parsimony and compare to other local causal discovery methods and to state-of-the-art non-causal feature selection methods using real data. A second set of experimental evaluations compares the algorithms in terms of ability to induce local causal neighborhoods using simulated and resimulated data and examines the relation of predictivity with causal induction performance. Our experiments demonstrate, consistently with causal feature selection theory, that local causal feature selection methods (under broad assumptions encompassing appropriate family of distributions, types of classifiers, and loss functions) exhibit strong feature set parsimony, high predictivity and local causal interpretability. Although non-causal feature selection methods are often used in practice to shed light on causal relationships, we find that they cannot be interpreted causally even when they achieve excellent predictivity. Therefore we conclude that only local causal techniques should be used when insight into causal structure is sought. In a companion paper we examine in depth the behavior of GLL algorithms, provide extensions, and show how local techniques can be used for scalable and accurate global causal graph learning.
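For readers unfamiliar with the central object here: in a Bayesian network, the Markov blanket of a target is its parents, its children, and its children's other parents, and it renders the target conditionally independent of all other variables. The sketch below reads the blanket off a known DAG; it is purely illustrative, since the GLL algorithms must discover the blanket from data via conditional independence tests.

```python
def markov_blanket(dag, target):
    """Markov blanket of `target` in a DAG encoded as {node: set_of_parents}:
    its parents, its children, and its children's other parents (spouses)."""
    parents = set(dag.get(target, ()))
    children = {v for v, ps in dag.items() if target in ps}
    spouses = set()
    for child in children:
        spouses |= set(dag[child]) - {target}
    return parents | children | spouses

# T has parent A, child C, and C's other parent B; D is outside the blanket.
dag = {"A": set(), "B": set(), "D": {"A"}, "T": {"A"}, "C": {"T", "B"}}
mb = markov_blanket(dag, "T")
```

Because the blanket shields the target from everything else, it is the theoretically optimal feature set for predicting the target, which is why the paper ties feature set parsimony to causal structure.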

521 citations

Journal ArticleDOI
TL;DR: There has been rapid growth in the application of data mining to manufacturing processes and enterprises over the last three years; this review of the literature charts the progressive applications and identifies existing gaps.
Abstract: In modern manufacturing environments, vast amounts of data are collected in database management systems and data warehouses from all involved areas, including product and process design, assembly, materials planning, quality control, scheduling, maintenance, fault detection etc. Data mining has emerged as an important tool for knowledge acquisition from the manufacturing databases. This paper reviews the literature dealing with knowledge discovery and data mining applications in the broad domain of manufacturing with a special emphasis on the type of functions to be performed on the data. The major data mining functions to be performed include characterization and description, association, classification, prediction, clustering and evolution analysis. The papers reviewed have therefore been categorized in these five categories. It has been shown that there is a rapid growth in the application of data mining in the context of manufacturing processes and enterprises in the last 3 years. This review reveals the progressive applications and existing gaps identified in the context of data mining in manufacturing. A novel text mining approach has also been used on the abstracts and keywords of 150 papers to identify the research gaps and find the linkages between knowledge area, knowledge type and the applied data mining tools and techniques.

450 citations