scispace - formally typeset
Author

Claudio Conversano

Bio: Claudio Conversano is an academic researcher at the University of Cagliari. His research focuses on topics including statistical modeling and sentiment analysis. He has an h-index of 11, has co-authored 65 publications, and has received 317 citations. His previous affiliations include the University of Cassino and the University of Naples Federico II.


Papers
Journal ArticleDOI
TL;DR: A new algorithm, the Simultaneous Threshold Interaction Modeling Algorithm (STIMA), is proposed to estimate a regression trunk model; it is more general and more efficient than the original regression trunk algorithm (RTA) and is implemented in the R package stima.
Abstract: Additive models and tree-based regression models are two main classes of statistical models used to predict the scores on a continuous response variable. It is known that additive models become very complex in the presence of higher order interaction effects, whereas some tree-based models, such as CART, have problems capturing linear main effects of continuous predictors. To overcome these drawbacks, the regression trunk model has been proposed: a multiple regression model with main effects and a parsimonious number of higher order interaction effects. The interaction effects can be represented by a small tree: a regression trunk. This article proposes a new algorithm, the Simultaneous Threshold Interaction Modeling Algorithm (STIMA), to estimate a regression trunk model; it is more general and more efficient than the original regression trunk algorithm (RTA) and is implemented in the R package stima. Results from a simulation study show that the performance of STIMA is satisfactory for sample sizes of 200 or higher. For sample sizes of 300 or higher, the 0.50 SE rule is the best pruning rule for a regression trunk in terms of power and Type I error. For sample sizes of 200, the 0.80 SE rule is recommended. Results from a comparative study of eight regression methods applied to ten benchmark datasets suggest that STIMA and GUIDE are the best performers in terms of cross-validated prediction error. STIMA appeared to be the best method for datasets containing many categorical variables. The characteristics of a regression trunk model are illustrated using the Boston house price dataset. Supplemental materials for this article, including the R package stima, are available online. © 2010 American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America.
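The regression trunk idea (linear main effects plus a small tree whose leaves act as threshold-interaction terms) can be caricatured with a two-stage fit. This sketch is illustrative only: STIMA itself estimates both parts simultaneously, and the actual implementation is the R package stima; the data below are synthetic and every parameter choice is arbitrary.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
n = 300
X = rng.normal(size=(n, 3))
# Linear main effects plus a threshold interaction: an extra bump when
# x0 > 0 and x1 > 0 simultaneously
y = (1.5 * X[:, 0] - 2.0 * X[:, 1]
     + 3.0 * (X[:, 0] > 0) * (X[:, 1] > 0)
     + rng.normal(scale=0.5, size=n))

# Stage 1: fit the linear main-effects part
lin = LinearRegression().fit(X, y)
resid = y - lin.predict(X)

# Stage 2: fit a shallow tree (the "trunk") on the residuals; its few
# leaves play the role of threshold-interaction indicator terms
trunk = DecisionTreeRegressor(max_depth=2, min_samples_leaf=20).fit(X, resid)

pred = lin.predict(X) + trunk.predict(X)
```

Unlike this two-stage caricature, STIMA re-estimates the regression coefficients and the trunk splits jointly, which is what makes it more efficient than sequential fitting.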

70 citations

Journal ArticleDOI
TL;DR: An incremental procedure based on the iterative use of tree-based methods is proposed, and a suitable Incremental Imputation Algorithm is introduced. The key idea is to define a lexicographic ordering of cases and variables so that conditional mean imputation via binary trees can be performed incrementally.
Abstract: In the framework of incomplete data analysis, this paper provides a nonparametric approach to missing data imputation based on Information Retrieval. In particular, an incremental procedure based on the iterative use of tree-based methods is proposed, and a suitable Incremental Imputation Algorithm is introduced. The key idea is to define a lexicographic ordering of cases and variables so that conditional mean imputation via binary trees can be performed incrementally. A simulation study and real data applications are carried out to assess its advantages and performance relative to standard approaches.

32 citations

Journal ArticleDOI
TL;DR: An integrated approach is proposed, which identifies a long list of key quality indicators (KQI), defines their properties, involves experts to elicit judgments for each KQI, evaluates the long list, and points out the most promising set.
Abstract: Recent interest in transit services has captured the attention of experts who monitor public transport quality. Previous research focused on relevant models and methods to monitor the quality of transit services and showed where and when different service quality levels occur. However, little attention has been paid to objectively identifying, from a large set, a pool of key quality indicators (KQI) to be monitored. This paper fills this gap by proposing an integrated approach that identifies a long list of KQI, defines their properties, involves experts to elicit judgments for each KQI, evaluates the long list, and points out the most promising set. The integrated approach is demonstrated with an application based on an international survey and a Monte Carlo simulation method. Moreover, a restricted and relevant set of 9 overlapping KQI is derived by linking these results with those obtained from two different approaches.
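The elicitation-plus-Monte-Carlo flavour of the approach can be illustrated with a toy stability analysis; everything here (panel size, scoring scale, the bootstrap scheme, the shortlist size of 9) is an assumption for illustration, not the paper's actual procedure.

```python
import numpy as np

rng = np.random.default_rng(2)
n_experts, n_kqi = 15, 30
# Hypothetical elicited judgments: each expert scores each candidate
# KQI on a 1..5 scale
scores = rng.integers(1, 6, size=(n_experts, n_kqi))

# Monte Carlo: bootstrap the expert panel and count how often each
# indicator lands in the top 9, mimicking a stability-based shortlist
top_k, n_sim = 9, 2000
counts = np.zeros(n_kqi)
for _ in range(n_sim):
    panel = rng.integers(0, n_experts, size=n_experts)  # resample experts
    mean_scores = scores[panel].mean(axis=0)
    counts[np.argsort(mean_scores)[-top_k:]] += 1

# The most consistently top-ranked KQIs form the restricted set
shortlist = np.argsort(counts)[-top_k:]
```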

28 citations

Journal ArticleDOI
TL;DR: The main methodological features and goals of pharmacoeconomic models, classified into three major categories (regression models, decision trees, and Markov models), are presented; decision makers are advised to interpret the results with extreme caution.
Abstract: We present an overview of the main methodological features and goals of pharmacoeconomic models, which we classify into three major categories: regression models, decision trees, and Markov models. In particular, we focus on Markov models and define a semi-Markov model for the cost-utility analysis of a vaccine for Dengue fever, discussing the key components of the model and the interpretation of its results. Next, we identify some criticalities of the decision rule arising from a possible incorrect interpretation of the model outcomes. Specifically, we focus on the difference between the median and mean ICER and on handling willingness-to-pay thresholds. We also show that the life span of the model and an incorrect hypothesis specification can lead to very different outcomes. Finally, we analyse the limits of Markov models when a large number of states is considered and focus on the implementation of tools that can bypass the memorylessness condition of Markov models. We conclude that decision makers should interpret the results of these models with extreme caution before deciding to fund any health care policy, and we give some recommendations about the appropriate use of these models.
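To make the Markov machinery concrete, here is a toy three-state cohort model computing an ICER. All transition probabilities, costs, utilities, and the vaccine price are invented for illustration; this bears no relation to the paper's actual Dengue model.

```python
import numpy as np

# Toy 3-state Markov cohort model (Healthy, Sick, Dead) comparing a
# hypothetical vaccine to no vaccine; every number is made up
P_novax = np.array([[0.90, 0.08, 0.02],    # rows sum to 1: transition
                    [0.00, 0.85, 0.15],    # probabilities out of each state
                    [0.00, 0.00, 1.00]])
P_vax = np.array([[0.95, 0.04, 0.01],
                  [0.00, 0.85, 0.15],
                  [0.00, 0.00, 1.00]])
cost = np.array([0.0, 2000.0, 0.0])        # annual cost per state
qaly = np.array([1.0, 0.6, 0.0])           # annual utility per state
vax_cost, horizon, disc = 150.0, 20, 0.03  # upfront cost, years, discount rate

def run(P, upfront):
    dist = np.array([1.0, 0.0, 0.0])       # whole cohort starts Healthy
    total_cost, total_qaly = upfront, 0.0
    for t in range(horizon):
        dist = dist @ P                    # one Markov cycle
        d = (1 + disc) ** -(t + 1)         # discount factor for this cycle
        total_cost += d * dist @ cost
        total_qaly += d * dist @ qaly
    return total_cost, total_qaly

c0, q0 = run(P_novax, 0.0)
c1, q1 = run(P_vax, vax_cost)
icer = (c1 - c0) / (q1 - q0)   # incremental cost per QALY gained
```

Note the memorylessness the abstract warns about: the transition out of "Sick" is the same regardless of how long the cohort has been sick, which is exactly the limitation that motivates semi-Markov extensions.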

20 citations

Book ChapterDOI
01 Jan 2009
TL;DR: Decision Tree Induction is a tool to induce a classification or regression model from (usually large) datasets characterized by n objects (records), each containing a set x of numerical or nominal attributes and a special feature y designated as its outcome.
Abstract: Decision Tree Induction (DTI) is a tool to induce a classification or regression model from (usually large) datasets characterized by n objects (records), each containing a set x of numerical or nominal attributes and a special feature y designated as its outcome. Statisticians use the term "predictors" for the attributes and "response variable" for the outcome. DTI builds a model that summarizes the underlying relationships between x and y. Two kinds of model can be estimated using decision trees: classification trees if y is nominal, and regression trees if y is numerical. Hereinafter we refer to classification trees to show the main features of DTI. For a detailed insight into the characteristics of regression trees see Hastie et al. (2001). As an example of classification tree, let us consider a sample of patients with prostate cancer on which data
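The nominal-outcome case described above (a classification tree) can be reproduced in a few lines. Since the chapter's prostate cancer data are not shown, a dataset bundled with scikit-learn stands in; the depth limit is an arbitrary choice for readability.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

# y is nominal (two classes), so DTI yields a classification tree
X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

clf = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_tr, y_tr)
acc = clf.score(X_te, y_te)     # held-out classification accuracy

# Human-readable rendering of the induced splits
rules = export_text(clf)
```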

18 citations


Cited by

Book
29 Mar 2012
TL;DR: Covers the problem of missing data; the concepts of MCAR, MAR, and MNAR; simple solutions that do not (always) work; multiple imputation in a nutshell; and some dangers, some do's and some don'ts.
Abstract (table of contents):
Basics. Introduction: The problem of missing data; Concepts of MCAR, MAR and MNAR; Simple solutions that do not (always) work; Multiple imputation in a nutshell; Goal of the book; What the book does not cover; Structure of the book; Exercises.
Multiple imputation: Historic overview; Incomplete data concepts; Why and when multiple imputation works; Statistical intervals and tests; Evaluation criteria; When to use multiple imputation; How many imputations?; Exercises.
Univariate missing data: How to generate multiple imputations; Imputation under the normal linear model; Imputation under non-normal distributions; Predictive mean matching; Categorical data; Other data types; Classification and regression trees; Multilevel data; Non-ignorable methods; Exercises.
Multivariate missing data: Missing data pattern; Issues in multivariate imputation; Monotone data imputation; Joint Modeling; Fully Conditional Specification; FCS and JM; Conclusion; Exercises.
Imputation in practice: Overview of modeling choices; Ignorable or non-ignorable?; Model form and predictors; Derived variables; Algorithmic options; Diagnostics; Conclusion; Exercises.
Analysis of imputed data: What to do with the imputed data?; Parameter pooling; Statistical tests for multiple imputation; Stepwise model selection; Conclusion; Exercises.
Case studies. Measurement issues: Too many columns; Sensitivity analysis; Correct prevalence estimates from self-reported data; Enhancing comparability; Exercises.
Selection issues: Correcting for selective drop-out; Correcting for non-response; Exercises.
Longitudinal data: Long and wide format; SE Fireworks Disaster Study; Time raster imputation; Conclusion; Exercises.
Extensions. Conclusion: Some dangers, some do's and some don'ts; Reporting; Other applications; Future developments; Exercises.
Appendices: Software (R; S-Plus; Stata; SAS; SPSS; Other software). References. Author Index. Subject Index.

2,156 citations

Journal ArticleDOI
TL;DR: This article surveys the developments and briefly reviews the key ideas behind some of the major regression tree algorithms.
Abstract: Fifty years have passed since the publication of the first regression tree algorithm. New techniques have added capabilities that far surpass those of the early methods. Modern classification trees can partition the data with linear splits on subsets of variables and fit nearest neighbor, kernel density, and other models in the partitions. Regression trees can fit almost every kind of traditional statistical model, including least-squares, quantile, logistic, Poisson, and proportional hazards models, as well as models for longitudinal and multiresponse data. Greater availability and affordability of software (much of which is free) have played a significant role in helping the techniques gain acceptance and popularity in the broader scientific community. This article surveys the developments and briefly reviews the key ideas behind some of the major algorithms.

437 citations

Journal ArticleDOI
TL;DR: The authors present a nonparametric approach for implementing multiple imputation via chained equations by using sequential regression trees as the conditional models and demonstrate that the method can result in more plausible imputations, and hence more reliable inferences, in complex settings than the naive application of standard sequential regression imputation techniques.
Abstract: Multiple imputation is particularly well suited to deal with missing data in large epidemiologic studies, because typically these studies support a wide range of analyses by many data users. Some of these analyses may involve complex modeling, including interactions and nonlinear relations. Identifying such relations and encoding them in imputation models, for example, in the conditional regressions for multiple imputation via chained equations, can be daunting tasks with large numbers of categorical and continuous variables. The authors present a nonparametric approach for implementing multiple imputation via chained equations by using sequential regression trees as the conditional models. This has the potential to capture complex relations with minimal tuning by the data imputer. Using simulations, the authors demonstrate that the method can result in more plausible imputations, and hence more reliable inferences, in complex settings than the naive application of standard sequential regression imputation techniques. They apply the approach to impute missing values in data on adverse birth outcomes with more than 100 clinical and survey variables. They evaluate the imputations using posterior predictive checks with several epidemiologic analyses of interest.
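A stripped-down sketch of tree-based chained-equations imputation on synthetic data: fit a tree on the observed cases, impute each missing case by drawing a donor value from its predicted leaf, and repeat for a few sweeps. Drawing a random donor from the leaf (rather than the leaf mean) preserves imputation variability, which is the point of the tree-based MICE idea; the single-variable setting and all hyperparameters here are simplifications, not the authors' actual procedure.

```python
import numpy as np
import pandas as pd
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(3)
n = 300
a = rng.normal(size=n)
# A nonlinear relation that a linear imputation model would miss
b = np.where(a > 0, a**2, -a) + rng.normal(scale=0.2, size=n)
df = pd.DataFrame({"a": a, "b": b})
df.loc[rng.choice(n, 60, replace=False), "b"] = np.nan

imp = df.copy()
imp["b"] = imp["b"].fillna(imp["b"].mean())   # crude initial fill

miss = df["b"].isna()                         # fixed missingness mask
for sweep in range(5):
    # Conditional model for b given a is a regression tree
    tree = DecisionTreeRegressor(min_samples_leaf=10, random_state=sweep)
    tree.fit(imp.loc[~miss, ["a"]], imp.loc[~miss, "b"])
    leaves_obs = tree.apply(imp.loc[~miss, ["a"]])
    leaves_mis = tree.apply(imp.loc[miss, ["a"]])
    donors = imp.loc[~miss, "b"].to_numpy()
    # Redraw each missing value from the observed cases in its leaf
    new_vals = [rng.choice(donors[leaves_obs == leaf]) for leaf in leaves_mis]
    imp.loc[miss, "b"] = new_vals
```

Repeating this whole procedure M times with different seeds would yield the M completed datasets that multiple imputation requires.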

247 citations