Author

Ioannis Kosmidis

Bio: Ioannis Kosmidis is an academic researcher at the University of Warwick. He has contributed to research on generalized linear models and estimators, has an h-index of 16, and has co-authored 61 publications receiving 1,231 citations. His previous affiliations include University College London and The Turing Institute.


Papers
Journal ArticleDOI
TL;DR: These extensions make beta regression not only “a better lemon squeezer” but a full-fledged modern juicer offering lemon-based drinks: shaken and stirred (bias correction and reduction), mixed (finite mixture model), or partitioned (tree model).
Abstract: Beta regression – an increasingly popular approach for modeling rates and proportions – is extended in various directions: (a) bias correction/reduction of the maximum likelihood estimator, (b) beta regression tree models by means of recursive partitioning, (c) latent class beta regression by means of finite mixture models. All three extensions may be of importance for enhancing the beta regression toolbox in practice to provide more reliable inference and capture both observed and unobserved/latent heterogeneity in the data. Using the analogy of Smithson and Verkuilen (2006), these extensions make beta regression not only “a better lemon squeezer” (compared to classical least squares regression) but a full-fledged modern juicer offering lemon-based drinks: shaken and stirred (bias correction and reduction), mixed (finite mixture model), or partitioned (tree model). All three extensions are provided in the R package betareg (version 2.4-0 or later), building on generic algorithms and implementations for bias correction/reduction, model-based recursive partitioning, and finite mixture models, respectively. Specifically, the new functions betatree() and betamix() reuse the object-oriented flexible implementation from the R packages party and flexmix, respectively.
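A minimal sketch of how the three extensions are invoked from R, assuming a data frame d with a response y in (0, 1), covariates x1 and x2, and a partitioning variable z (all names hypothetical):

```r
library(betareg)

## (a) Bias-corrected ("BC") or bias-reduced ("BR") estimation,
##     instead of plain maximum likelihood ("ML")
fit_br <- betareg(y ~ x1 + x2, data = d, type = "BR")

## (b) Beta regression tree: recursive partitioning over z
fit_tree <- betatree(y ~ x1 + x2, ~ z, data = d)

## (c) Latent-class beta regression: two-component finite mixture
fit_mix <- betamix(y ~ x1 + x2, data = d, k = 2)
```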

374 citations

Journal ArticleDOI
TL;DR: In this article, a more general family of bias-reducing adjustments is developed for a broad class of univariate and multivariate generalized nonlinear models, and a necessary and sufficient condition is given for the existence of a penalized likelihood interpretation of the method.
Abstract: In Firth (1993, Biometrika) it was shown how the leading term in the asymptotic bias of the maximum likelihood estimator is removed by adjusting the score vector, and that in canonical-link generalized linear models the method is equivalent to maximizing a penalized likelihood that is easily implemented via iterative adjustment of the data. Here a more general family of bias-reducing adjustments is developed for a broad class of univariate and multivariate generalized nonlinear models. The resulting formulae for the adjusted score vector are computationally convenient, and in univariate models they directly suggest implementation through an iterative scheme of data adjustment. For generalized linear models a necessary and sufficient condition is given for the existence of a penalized likelihood interpretation of the method. An illustrative application to the Goodman row-column association model shows how the computational simplicity and statistical benefits of bias reduction extend beyond generalized linear models.
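For canonical-link generalized linear models, the penalized likelihood referred to above has a well-known closed form (Firth, 1993): the adjusted score is the gradient of the log-likelihood plus a Jeffreys-prior penalty,

```latex
U^{*}(\beta) \;=\; U(\beta) + A(\beta) \;=\; \nabla_{\beta}\,\ell^{*}(\beta),
\qquad
\ell^{*}(\beta) \;=\; \ell(\beta) + \tfrac{1}{2}\log\det I(\beta),
```

where U(β) is the score vector and I(β) the Fisher information; solving U*(β) = 0 removes the O(n⁻¹) term in the asymptotic bias of the maximum likelihood estimator.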

196 citations

Journal ArticleDOI
TL;DR: In this article, a general iterative algorithm is developed for the computation of reduced-bias parameter estimates in regular statistical models through adjustments to the score function; the algorithm can usefully be viewed as a series of iterative bias corrections.
Abstract: A general iterative algorithm is developed for the computation of reduced-bias parameter estimates in regular statistical models through adjustments to the score function. The algorithm unifies and provides appealing new interpretation for iterative methods that have been published previously for some specific model classes. The new algorithm can usefully be viewed as a series of iterative bias corrections, thus facilitating the adjusted score approach to bias reduction in any model for which the first-order bias of the maximum likelihood estimator has already been derived. The method is tested by application to a logit-linear multiple regression model with beta-distributed responses; the results confirm the effectiveness of the new algorithm, and also reveal some important errors in the existing literature on beta regression.
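A compact sketch of the “series of iterative bias corrections” view: each iteration takes one quasi-Fisher-scoring step and then subtracts the first-order bias. The functions score(), info(), and bias1() below are hypothetical user-supplied ingredients (score vector, Fisher information, and first-order bias of the maximum likelihood estimator):

```r
## Iterative bias correction: quasi-Fisher-scoring step minus the
## first-order bias, iterated to a fixed point.
reduce_bias <- function(beta, score, info, bias1,
                        tol = 1e-8, maxit = 100) {
  for (it in seq_len(maxit)) {
    beta_new <- beta + solve(info(beta), score(beta)) - bias1(beta)
    if (max(abs(beta_new - beta)) < tol) return(beta_new)
    beta <- beta_new
  }
  warning("no convergence in ", maxit, " iterations")
  beta
}
```

Started at the maximum likelihood estimate, the first iteration reproduces the classical bias-corrected estimator; at a fixed point, the estimate solves the bias-reducing adjusted score equations.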

77 citations

16 Aug 2011
TL;DR: The brglm R package provides an alternative fitting method for the glm function that reduces the bias of the maximum likelihood estimator in generalized linear models (GLMs), based on the generic iteration developed in Kosmidis and Firth (2010a).
Abstract: The brglm R package provides an alternative fitting method for the glm function for reducing the bias of the maximum likelihood estimator in generalized linear models (GLMs). The fitting method is based on the generic iteration developed in Kosmidis and Firth (2010a) for solving the bias-reducing adjusted score equations (Firth, 1993). It relies on the implementation of the first-order term in the asymptotic expansion of the bias of the maximum likelihood estimator for GLMs, which was derived in Cordeiro and McCullagh (1991). The bias-corrected estimates derived in the latter study are by-products of the general fitting method.
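A hedged usage sketch: brglm() is essentially a drop-in replacement for glm() for binomial-response models (the data frame d and variable names are hypothetical):

```r
library(brglm)

## Maximum likelihood fit; estimates can be badly biased or even
## infinite under data separation
fit_ml <- glm(y ~ x1 + x2, family = binomial, data = d)

## Bias-reduced fit via the adjusted score equations
fit_br <- brglm(y ~ x1 + x2, family = binomial, data = d)

coef(fit_ml)
coef(fit_br)
```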

68 citations

Journal ArticleDOI
TL;DR: Copulas are used for the construction of flexible families of models for clustering applications and, depending on the mode of the data, more efficient procedures are provided that can fully exploit the copula structure.
Abstract: The majority of model-based clustering techniques is based on multivariate normal models and their variants. In this paper copulas are used for the construction of flexible families of models for clustering applications. The use of copulas in model-based clustering offers two direct advantages over current methods: (i) the appropriate choice of copulas provides the ability to obtain a range of exotic shapes for the clusters, and (ii) the explicit choice of marginal distributions for the clusters allows the modelling of multivariate data of various modes (either discrete or continuous) in a natural way. This paper introduces and studies the framework of copula-based finite mixture models for clustering applications. Estimation in the general case can be performed using standard EM, and, depending on the mode of the data, more efficient procedures are provided that can fully exploit the copula structure. The closure properties of the mixture models under marginalization are discussed, and for continuous, real-valued data parametric rotations in the sample space are introduced, with a parallel discussion on parameter identifiability depending on the choice of copulas for the components. The exposition of the methodology is accompanied and motivated by the analysis of real and artificial data.
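A minimal sketch of the basic ingredient, with components defined as copula-plus-margins joint densities, using the copula R package; all parameter values are illustrative:

```r
library(copula)

## Component 1: Gaussian copula with standard normal margins
cmp1 <- mvdc(normalCopula(0.7, dim = 2),
             margins = c("norm", "norm"),
             paramMargins = list(list(mean = 0, sd = 1),
                                 list(mean = 0, sd = 1)))

## Component 2: Clayton copula (asymmetric, lower-tail dependent)
## with shifted margins, giving a differently shaped cluster
cmp2 <- mvdc(claytonCopula(2, dim = 2),
             margins = c("norm", "norm"),
             paramMargins = list(list(mean = 3, sd = 1),
                                 list(mean = 3, sd = 1)))

## Two-component mixture density at the rows of matrix x
dmix <- function(x, pi1 = 0.5)
  pi1 * dMvdc(x, cmp1) + (1 - pi1) * dMvdc(x, cmp2)
```

Estimating the mixing weights and component parameters from data would then proceed by EM, as described in the paper.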

66 citations


Cited by
Christopher M. Bishop
01 Jan 2006
TL;DR: Probability distributions and linear models for regression and classification are covered in this book, along with a discussion of combining models in the context of machine learning.
Abstract: Probability Distributions; Linear Models for Regression; Linear Models for Classification; Neural Networks; Kernel Methods; Sparse Kernel Machines; Graphical Models; Mixture Models and EM; Approximate Inference; Sampling Methods; Continuous Latent Variables; Sequential Data; Combining Models.

10,141 citations

Journal ArticleDOI
TL;DR: It is shown how efficient high-dimensional proposal distributions can be built using sequential Monte Carlo methods, which makes it possible not only to improve on standard Markov chain Monte Carlo schemes but also to make Bayesian inference feasible for a large class of statistical models where it was not previously so.
Abstract: Markov chain Monte Carlo and sequential Monte Carlo methods have emerged as the two main tools to sample from high-dimensional probability distributions. Although asymptotic convergence of Markov chain Monte Carlo algorithms is ensured under weak assumptions, the performance of these algorithms is unreliable when the proposal distributions that are used to explore the space are poorly chosen and/or if highly correlated variables are updated independently. We show here how it is possible to build efficient high-dimensional proposal distributions by using sequential Monte Carlo methods. This allows us not only to improve over standard Markov chain Monte Carlo schemes but also to make Bayesian inference feasible for a large class of statistical models where this was not previously so. We demonstrate these algorithms on a non-linear state space model and a Lévy-driven stochastic volatility model.
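As one concrete instance of the idea, here is a toy particle marginal Metropolis-Hastings sketch for a linear-Gaussian state-space model x_t = phi * x_{t-1} + e_t, y_t = x_t + u_t, assuming a flat prior on phi; the model, settings, and names are illustrative, not the paper's implementation:

```r
## Bootstrap particle filter: estimate of the log marginal likelihood.
loglik_pf <- function(phi, y, N = 200) {
  x <- rnorm(N)
  ll <- 0
  for (t in seq_along(y)) {
    x <- phi * x + rnorm(N)                      # propagate particles
    w <- dnorm(y[t], mean = x)                   # observation weights
    ll <- ll + log(mean(w))
    x <- sample(x, N, replace = TRUE, prob = w)  # resample
  }
  ll
}

## Random-walk Metropolis on phi, with the particle-filter estimate
## standing in for the intractable likelihood (flat prior on phi).
pmmh <- function(y, n_iter = 2000, sd_prop = 0.1) {
  phi <- 0.5
  ll <- loglik_pf(phi, y)
  draws <- numeric(n_iter)
  for (i in seq_len(n_iter)) {
    phi_prop <- phi + rnorm(1, sd = sd_prop)
    ll_prop <- loglik_pf(phi_prop, y)
    if (log(runif(1)) < ll_prop - ll) {
      phi <- phi_prop
      ll <- ll_prop
    }
    draws[i] <- phi
  }
  draws
}
```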

1,869 citations

Journal ArticleDOI
TL;DR: This paper describes the betareg package, which provides the class of beta regression models in the R system for statistical computing; these models incorporate features such as heteroskedasticity or skewness that are commonly observed in data taking values in the standard unit interval, such as rates or proportions.
Abstract: The class of beta regression models is commonly used by practitioners to model variables that assume values in the standard unit interval (0, 1). It is based on the assumption that the dependent variable is beta-distributed and that its mean is related to a set of regressors through a linear predictor with unknown coefficients and a link function. The model also includes a precision parameter which may be constant or depend on a (potentially different) set of regressors through a link function as well. This approach naturally incorporates features such as heteroskedasticity or skewness which are commonly observed in data taking values in the standard unit interval, such as rates or proportions. This paper describes the betareg package which provides the class of beta regressions in the R system for statistical computing. The underlying theory is briefly outlined, the implementation discussed and illustrated in various replication exercises.
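A hedged sketch of the two-part formula interface described above: regressors before the | enter the mean submodel, those after it the precision submodel (data frame d and variable names hypothetical):

```r
library(betareg)

## Mean submodel on x1, x2 (logit link); precision submodel on z1 (log link)
fit <- betareg(y ~ x1 + x2 | z1, data = d,
               link = "logit", link.phi = "log")
summary(fit)
```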

1,706 citations

Journal ArticleDOI
TL;DR: In this paper, the authors propose a new prior distribution for logistic regression models, constructed by first scaling all nonbinary variables to have mean 0 and standard deviation 0.5 and then placing independent Student-t prior distributions on the coefficients, with the Cauchy distribution with center 0 and scale 2.5 recommended as a default.
Abstract: We propose a new prior distribution for classical (nonhierarchical) logistic regression models, constructed by first scaling all nonbinary variables to have mean 0 and standard deviation 0.5, and then placing independent Student-t prior distributions on the coefficients. As a default choice, we recommend the Cauchy distribution with center 0 and scale 2.5, which in the simplest setting is a longer-tailed version of the distribution attained by assuming one-half additional success and one-half additional failure in a logistic regression. Cross-validation on a corpus of datasets shows the Cauchy class of prior distributions to outperform existing implementations of Gaussian and Laplace priors. We recommend this prior distribution as a default choice for routine applied use. It has the advantage of always giving answers, even when there is complete separation in logistic regression (a common problem, even when the sample size is large and the number of predictors is small), and also automatically applying more shrinkage to higher-order interactions. This can be useful in routine data analysis as well as in automated procedures such as chained equations for missing-data imputation. We implement a procedure to fit generalized linear models in R with the Student-t prior distribution by incorporating an approximate EM algorithm into the usual iteratively weighted least squares. We illustrate with several applications, including a series of logistic regressions predicting voting preferences, a small bioassay experiment, and an imputation model for a public health data set.
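The fitting procedure described in the last sentence corresponds to bayesglm() in the arm R package; a hedged sketch with the recommended default (Student-t with 1 degree of freedom, i.e. Cauchy, center 0, scale 2.5), using a hypothetical data frame d:

```r
library(arm)

## Logistic regression with independent Cauchy(0, 2.5) priors on the
## (internally rescaled) coefficients; gives finite estimates even
## under complete separation.
fit <- bayesglm(y ~ x1 + x2, family = binomial, data = d,
                prior.mean = 0, prior.scale = 2.5, prior.df = 1)
display(fit)
```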

1,598 citations