Showing papers in "Computational Statistics & Data Analysis in 2017"

PDF

Open Access

Journal Article•DOI•

[...]

Daniel Kraus¹, Claudia Czado¹•Institutions (1)

01 Jun 2017-Computational Statistics & Data Analysis

TL;DR: In this paper, a new semiparametric quantile regression method is introduced based on sequentially fitting a likelihood optimal D-vine copula to given data resulting in highly flexible models with easily extractable conditional quantiles.

...read moreread less

123 citations

Journal Article•DOI•

RHSBoost: Improving classification performance in imbalance data

[...]

Joonho Gong¹, Hyunjoong Kim²•Institutions (2)

North Carolina State University¹, Yonsei University²

01 Jul 2017-Computational Statistics & Data Analysis

TL;DR: RHSBoost appears to be an attractive classification model for imbalance data and uses random undersampling and ROSE sampling under a boosting scheme to address the imbalance classification problem.

...read moreread less

85 citations

Journal Article•DOI•

Robust and sparse estimators for linear regression models

[...]

Ezequiel Smucler¹, Victor J. Yohai¹•Institutions (1)

University of Buenos Aires¹

01 Jul 2017-Computational Statistics & Data Analysis

TL;DR: In this paper, the robust and asymptotic properties of 1-penalized MM-estimators and MM estimators with an adaptive 1 penalty are studied for the case of a fixed number of covariates.

...read moreread less

63 citations

Journal Article•DOI•

A Bayesian adaptive design for clinical trials in rare diseases

[...]

Faye Williamson¹, Peter Jacko¹, Sofia S. Villar, Thomas Jaki¹•Institutions (1)

Lancaster University¹

01 Sep 2017-Computational Statistics & Data Analysis

TL;DR: A novel randomised response-adaptive design is proposed which maximises the total number of patient successes in the trial and penalises if a minimum number of patients are not recruited to each treatment arm.

...read moreread less

59 citations

Journal Article•DOI•

Bivariate copula additive models for location, scale and shape

[...]

Giampiero Marra¹, Rosalba Radice²•Institutions (2)

University College London¹, Birkbeck, University of London²

01 Aug 2017-Computational Statistics & Data Analysis

TL;DR: This article introduces a bivariate copula additive model with continuous margins for location, scale and shape that permits the copula dependence and marginal distribution parameters to be estimated simultaneously and, like in GAMLSS, each parameter to be modeled using an additive predictor.

...read moreread less

58 citations

Journal Article•DOI•

Extending approximate Bayesian computation methods to high dimensions via a Gaussian copula model

[...]

Jingjing Li¹, David J. Nott¹, Yanan Fan², Scott A. Sisson²•Institutions (2)

National University of Singapore¹, University of New South Wales²

01 Feb 2017-Computational Statistics & Data Analysis

TL;DR: The Gaussian Copula method as discussed by the authors uses a 2-dimensional Gaussian copula to estimate the bivariate posterior for each pair of parameters separately, and then combines these estimates together to obtain the joint posterior.

...read moreread less

50 citations

Journal Article•DOI•

A fast algorithm for two-dimensional KolmogorovSmirnov two sample tests

[...]

Yuanhui Xiao¹•Institutions (1)

Mississippi State University¹

01 Jan 2017-Computational Statistics & Data Analysis

TL;DR: A fast algorithm for computing the two-sample KolmogorovSmirnov test statistic is proposed, which is O(n) times more efficient than the brute force algorithm, where n is the sum of the two sample sizes.

...read moreread less

49 citations

Journal Article•DOI•

Gradient boosting for high-dimensional prediction of rare events

[...]

Rok Blagus¹, Lara Lusa¹•Institutions (1)

University of Ljubljana¹

01 Sep 2017-Computational Statistics & Data Analysis

TL;DR: It is demonstrated that the proposed corrections successfully remove the rare events bias and outperform the other ensemble classifiers that were considered and large flexibility and high interpretability of the proposed methods is also illustrated.

...read moreread less

46 citations

Journal Article•DOI•

The finite sample performance of semi- and non-parametric estimators for treatment effects and policy evaluation

[...]

Markus Frölich¹, Martin Huber², Manuel Wiesenfarth³•Institutions (3)

University of Mannheim¹, University of Fribourg², German Cancer Research Center³

01 Nov 2017-Computational Statistics & Data Analysis

TL;DR: Several nonparametric estimators outperform commonly used treatment estimators based on parametric propensity scores in terms of root mean squared error (RMSE), even though average RMSEs based on the 16 simulation designs considered are not statistically significantly different across the estimators investigated.

...read moreread less

43 citations

Journal Article•DOI•

Numerical implementation of the QuEST function

[...]

Olivier Ledoit¹, Michael Wolf¹•Institutions (1)

University of Zurich¹

01 Nov 2017-Computational Statistics & Data Analysis

TL;DR: In this paper, an estimator of the eigenvalues of the population covariance matrix has been proposed that is consistent according to a mean-squared criterion under large-dimensional asymptotics.

...read moreread less

43 citations

Journal Article•DOI•

Variable selection using shrinkage priors

[...]

Hanning Li¹, Debdeep Pati¹•Institutions (1)

Florida State University¹

01 Mar 2017-Computational Statistics & Data Analysis

TL;DR: In this paper, a general approach for variable selection with shrinkage priors is proposed, which can be used along with any shrinkage prior, and has good performance in a wide range of synthetic data examples and in a real data example on selecting genes affecting survival due to lymphoma.

...read moreread less

Journal Article•DOI•

Model selection for discrete regular vine copulas

[...]

Anastasios Panagiotelis¹, Claudia Czado², Harry Joe³, Jakob Stöber²•Institutions (3)

Monash University¹, Technische Universität München², University of British Columbia³

01 Feb 2017-Computational Statistics & Data Analysis

TL;DR: Two greedy algorithms for automatically selecting vine structures and component pair-copula building blocks are introduced and outperforms a Gaussian copula benchmark using both in-sample and out-of-sample criteria.

...read moreread less

Journal Article•DOI•

Nonparametric incidence estimation and bootstrap bandwidth selection in mixture cure models

[...]

Ana López-Cheda¹, Ricardo Cao¹, M. Amalia Jácome¹, Ingrid Van Keilegom²•Institutions (2)

University of A Coruña¹, Université catholique de Louvain²

01 Jan 2017-Computational Statistics & Data Analysis

TL;DR: Two nonparametric estimators, which are based on the Beran estimator of the conditional survival function, are proved to be the local maximum likelihood estimators for mixture cure models.

...read moreread less

Journal Article•DOI•

Density Estimation on Manifolds with Boundary

[...]

Tyrus Berry¹, Timothy Sauer¹•Institutions (1)

George Mason University¹

01 Mar 2017-Computational Statistics & Data Analysis

TL;DR: This work introduces statistics that provably estimate the distance and direction of the boundary, which allows for a cut-and-normalize boundary correction, and introduces a consistent kernel density estimator that has uniform bias, at interior and boundary points, on manifolds with boundary.

...read moreread less

Journal Article•DOI•

Correlation rank screening for ultrahigh-dimensional survival data

[...]

Jing Zhang¹, Yanyan Liu¹, Yuanshan Wu¹•Institutions (1)

Wuhan University¹

01 Apr 2017-Computational Statistics & Data Analysis

TL;DR: A novel feature screening procedure is proposed for ultrahigh-dimensional survival data which is invariant to the monotone transformation of the response and can be readily applied to ultra high-dimensional complete data when the censoring rate is zero.

...read moreread less

Journal Article•DOI•

Model-based time-varying clustering of multivariate longitudinal data with covariates and outliers

[...]

Antonello Maruotti¹, Antonio Punzo²•Institutions (2)

University of Southampton¹, University of Catania²

01 Sep 2017-Computational Statistics & Data Analysis

TL;DR: A class of multivariate linear models under the longitudinal setting, in which unobserved heterogeneity may evolve over time, is introduced, and a latent structure is considered to model heterogeneity, having a discrete support and following a first-order Markov chain.

...read moreread less

Journal Article•DOI•

Poisson mixed models for studying the poverty in small areas

[...]

Miguel Boubeta¹, María José Lombardía¹, Domingo Morales²•Institutions (2)

University of A Coruña¹, Universidad Miguel Hernández de Elche²

01 Mar 2017-Computational Statistics & Data Analysis

TL;DR: The developed methodology is applied to estimate the proportion of people under the poverty line by counties and sex in Galicia (a region in north-west of Spain).

...read moreread less

Journal Article•DOI•

Using contrastive divergence to seed Monte Carlo MLE for exponential-family random graph models

[...]

Pavel N. Krivitsky¹•Institutions (1)

University of Wollongong¹

01 Mar 2017-Computational Statistics & Data Analysis

TL;DR: This paper focuses on exponential-family models for dependent data, which have applications in a wide variety of areas, but the dependence often results in an intractable likelihood, requiring either analytic approximation or MCMC-based techniques to fit.

...read moreread less

Journal Article•DOI•

Robust and efficient estimation of multivariate scatter and location

[...]

Ricardo A. Maronna¹, Victor J. Yohai²•Institutions (2)

National University of La Plata¹, Facultad de Ciencias Exactas y Naturales²

01 May 2017-Computational Statistics & Data Analysis

TL;DR: Several equivariant estimators of multivariate location and scatter are studied, which are highly robust, have a controllable finite-sample efficiency and are computationally feasible in large dimensions.

...read moreread less

Journal Article•DOI•

Sparse vector Markov switching autoregressive models. Application to multivariate time series of temperature

[...]

Valérie Monbet¹, Pierre Ailliot²•Institutions (2)

University of Rennes¹, University of Western Brittany²

01 Apr 2017-Computational Statistics & Data Analysis

TL;DR: A Smoothly Clipped Absolute Deviation penalization of the likelihood is proposed to shrink the parameters towards zeros and regularize the inference problem which is generally ill-posed.

...read moreread less

Journal Article•DOI•

Model free feature screening for ultrahigh dimensional data with responses missing at random

[...]

Peng Lai¹, Yiming Liu², Zhi Liu³, Yi Wan³•Institutions (3)

Nanjing University of Information Science and Technology¹, Nanyang Technological University², University of Macau³

01 Jan 2017-Computational Statistics & Data Analysis

TL;DR: A model free feature screening procedure based on the inverse probability weighted methods has been proposed, where the Kolmogorov filter method is used to screen the important features under an unknown propensity score function.

...read moreread less

Journal Article•DOI•

FFT-based fast bandwidth selector for multivariate kernel density estimation

[...]

Artur Gramacki¹, J. Gramacki¹•Institutions (1)

University of Zielona Góra¹

01 Feb 2017-Computational Statistics & Data Analysis

TL;DR: In this article, a more general solution is presented where the above mentioned limitation is relaxed and the presented solution can be easily adopted also for the task of efficient computation of integrated density derivative functionals involving an arbitrary derivative order.

...read moreread less

Journal Article•DOI•

Bayesian quantile regression using random B-spline series prior

[...]

Priyam Das, Subhashis Ghosal¹•Institutions (1)

North Carolina State University¹

01 May 2017-Computational Statistics & Data Analysis

TL;DR: The proposed Bayesian method is extended to multidimensional predictors such that the quantile regression depends on the predictors through an unknown linear combination only.

...read moreread less

Journal Article•DOI•

An SVM-like approach for expectile regression

[...]

Muhammad Farooq¹, Ingo Steinwart¹•Institutions (1)

University of Stuttgart¹

01 May 2017-Computational Statistics & Data Analysis

TL;DR: In this paper, an efficient sequential-minimal-optimization-based solver is developed and its convergence derived for the underlying optimization problem, and the results are compared with the solver for quantile regression and the recent R-package ER-Boost.

...read moreread less

Journal Article•DOI•

Estimation of population proportion for judgment post-stratification

[...]

Ehsan Zamanzade¹, Xinlei Wang²•Institutions (2)

University of Isfahan¹, Southern Methodist University²

01 Aug 2017-Computational Statistics & Data Analysis

TL;DR: It is shown that the JPS scheme improves estimation of the population proportion in a very wide range of settings as compared to simple random sampling (SRS).

...read moreread less

Journal Article•DOI•

A wild bootstrap approach for nonparametric repeated measurements

[...]

Sarah Friedrich¹, Frank Konietschke², Markus Pauly¹•Institutions (2)

University of Ulm¹, University of Texas at Dallas²

01 Sep 2017-Computational Statistics & Data Analysis

TL;DR: It is shown that a specific wild bootstrap procedure inherits the large sample properties of the Wald- and ANOVA-type statistics while considerably improving their small sample behavior.

...read moreread less

Journal Article•DOI•

Lasso, fractional norm and structured sparse estimation using a Hadamard product parametrization

[...]

Peter D. Hoff¹•Institutions (1)

Duke University¹

01 Nov 2017-Computational Statistics & Data Analysis

TL;DR: It is shown that a subclass of Lq penalties with q less than or equal to one can be expressed as sums of L2 penalties, and it follows that the lasso and other norm-penalized regression estimates may be obtained using a very simple and intuitive alternating ridge regression algorithm.

...read moreread less

Journal Article•DOI•

Regression analysis of current status data in the presence of dependent censoring with applications to tumorigenicity experiments

[...]

Shuwei Li¹, Tao Hu², Peijie Wang¹, Jianguo Sun³•Institutions (3)

Jilin University¹, Capital Normal University², University of Missouri³

01 Jun 2017-Computational Statistics & Data Analysis

TL;DR: A frailty model-based maximum likelihood approach is proposed with the use of monotone splines to approximate the unknown baseline cumulative hazard function of the failure time and a novel EM algorithm, which is based on a three-stage data augmentation and can be easily implemented, is presented.

...read moreread less

Journal Article•DOI•

Application of imperialist competitive algorithm to find minimax and standardized maximin optimal designs

[...]

Ehsan Masoudi¹, Heinz Holling¹, Weng Kee Wong²•Institutions (2)

University of Münster¹, University of California, Los Angeles²

01 Sep 2017-Computational Statistics & Data Analysis

TL;DR: A population-based evolutionary algorithm called imperialist competitive algorithm (ICA) is applied to find minimax or nearly minimax D-optimal designs for nonlinear models and can hybridize with a local search to find optimal designs under a more complicated criterion, such as standardized maximin optimality.

...read moreread less

Journal Article•DOI•

Constrained center and range joint model for interval-valued symbolic data regression

[...]

Peng Hao¹, Junpeng Guo¹•Institutions (1)

College of Management and Economics¹

01 Dec 2017-Computational Statistics & Data Analysis

TL;DR: A constrained center and range joint model to fit linear regression to interval-valued symbolic data is introduced that has better fitness and avoids the negative value of the range of the predicted dependent interval variable by adding nonnegative constraints.

...read moreread less

Collapse