scispace - formally typeset
Search or ask a question

Showing papers in "arXiv: Applications in 2019"


Journal ArticleDOI
TL;DR: This study presents a novel prediction model for the Cc of fine-grained soils using gene expression programming (GEP), and the proposed model performed better in terms of R2, RMSE, and MAE compared to the other models.
Abstract: In construction projects, estimation of the settlement of fine-grained soils is of critical importance, and yet is a challenging task. The coefficient of consolidation for the compression index (Cc) is a key parameter in modeling the settlement of fine-grained soil layers. However, the estimation of this parameter is costly, time-consuming, and requires skilled technicians. To overcome these drawbacks, we aimed to predict Cc through other soil parameters, i.e., the liquid limit (LL), plastic limit (PL), and initial void ratio (e0). Using these parameters is more convenient and requires substantially less time and cost compared to the conventional tests to estimate Cc. This study presents a novel prediction model for the Cc of fine-grained soils using gene expression programming (GEP). A database consisting of 108 different data points was used to develop the model. A closed-form equation solution was derived to estimate Cc based on LL, PL, and e0. The performance of the developed GEP-based model was evaluated through the coefficient of determination (R2), the root mean squared error (RMSE), and the mean average error (MAE). The proposed model performed better in terms of R2, RMSE, and MAE compared to the other models.

112 citations


Posted Content
TL;DR: It is found that both greenery measures capture different aspects of natural environments and may contribute to people's wellbeing by means of different mechanisms, and both streetscape and satellite-derived greenery seem to be both directly correlated and indirectly mediated.
Abstract: Multiple mechanisms have been proposed to explain how greenery enhances their mental wellbeing. Mediation studies, however, focus on a limited number of mechanisms and rely on remotely sensed greenery measures, which do not accurately capture how neighborhood greenery is perceived on the ground. To examine: 1) how streetscape and remote sensing-based greenery affect people's mental wellbeing in Guangzhou, China; 2) whether and, if so, to what extent the associations are mediated by physical activity, stress, air quality and noise, and social cohesion; and 3) whether differences in the mediation across the streetscape greenery and NDVI exposure metrics occurred. Mental wellbeing was quantified by the WHO-5 wellbeing index. Greenery measures were extracted at the neighborhood level: 1) streetscape greenery from street view data via a convolutional neural network, and 2) the NDVI remote sensing images. Single and multiple mediation analyses with multilevel regressions were conducted. Streetscape and NDVI greenery were weakly and positively, but not significantly, correlated. Our regression results revealed that streetscape greenery and NDVI were, individually and jointly, positively associated with mental wellbeing. Significant partial mediators for the streetscape greenery were physical activity, stress, air quality and noise, and social cohesion; together, they explained 62% of the association. For NDVI, only physical activity and social cohesion were significant partial mediators, accounting for 22% of the association. Mental health and wellbeing and both streetscape and satellite-derived greenery seem to be both directly correlated and indirectly mediated. Our findings signify that both greenery measures capture different aspects of natural environments and may contribute to people's wellbeing by means of different mechanisms.

111 citations


Posted Content
TL;DR: The findings regarding recruitment and retention from eight remote digital health studies conducted between 2014–2019 that provided individual-level study-app usage data from more than 100,000 participants completing nearly 3.5 million remote health evaluations over cumulative participation of 850,000 days are reported.
Abstract: Digital technologies such as smartphones are transforming the way scientists conduct biomedical research using real-world data. Several remotely-conducted studies have recruited thousands of participants over a span of a few months. Unfortunately, these studies are hampered by substantial participant attrition, calling into question the representativeness of the collected data including generalizability of findings from these studies. We report the challenges in retention and recruitment in eight remote digital health studies comprising over 100,000 participants who participated for more than 850,000 days, completing close to 3.5 million remote health evaluations. Survival modeling surfaced several factors significantly associated(P < 1e-16) with increase in median retention time i) Clinician referral(increase of 40 days), ii) Effect of compensation (22 days), iii) Clinical conditions of interest to the study (7 days) and iv) Older adults(4 days). Additionally, four distinct patterns of daily app usage behavior that were also associated(P < 1e-10) with participant demographics were identified. Most studies were not able to recruit a representative sample, either demographically or regionally. Combined together these findings can help inform recruitment and retention strategies to enable equitable participation of populations in future digital health research.

80 citations


Journal ArticleDOI
TL;DR: In this paper, three categories of testing methods were evaluated, including weighted log-rank tests, Kaplan-Meier curve-based tests, and combination tests (including Breslow test, Lee's combo test, and MaxCombo test).
Abstract: The log-rank test is most powerful under proportional hazards (PH). In practice, non-PH patterns are often observed in clinical trials, such as in immuno-oncology; therefore, alternative methods are needed to restore the efficiency of statistical testing. Three categories of testing methods were evaluated, including weighted log-rank tests, Kaplan-Meier curve-based tests (including weighted Kaplan-Meier and Restricted Mean Survival Time, RMST), and combination tests (including Breslow test, Lee's combo test, and MaxCombo test). Nine scenarios representing the PH and various non-PH patterns were simulated. The power, type I error, and effect estimates of each method were compared. In general, all tests control type I error well. There is not a single most powerful test across all scenarios. In the absence of prior knowledge regarding the PH or non-PH patterns, the MaxCombo test is relatively robust across patterns. Since the treatment effect changes overtime under non-PH, the overall profile of the treatment effect may not be represented comprehensively based on a single measure. Thus, multiple measures of the treatment effect should be pre-specified as sensitivity analyses to evaluate the totality of the data.

55 citations


Posted Content
TL;DR: In this article, the authors compare commonly used graph metrics and distance measures, and demonstrate their ability to distinguish between common topological features found in both random graph models and empirical datasets.
Abstract: Comparison of graph structure is a ubiquitous task in data analysis and machine learning, with diverse applications in fields such as neuroscience, cyber security, social network analysis, and bioinformatics, among others. Discovery and comparison of structures such as modular communities, rich clubs, hubs, and trees in data in these fields yields insight into the generative mechanisms and functional properties of the graph. Often, two graphs are compared via a pairwise distance measure, with a small distance indicating structural similarity and vice versa. Common choices include spectral distances (also known as $\lambda$ distances) and distances based on node affinities. However, there has of yet been no comparative study of the efficacy of these distance measures in discerning between common graph topologies and different structural scales. In this work, we compare commonly used graph metrics and distance measures, and demonstrate their ability to discern between common topological features found in both random graph models and empirical datasets. We put forward a multi-scale picture of graph structure, in which the effect of global and local structure upon the distance measures is considered. We make recommendations on the applicability of different distance measures to empirical graph data problem based on this multi-scale view. Finally, we introduce the Python library NetComp which implements the graph distances used in this work.

48 citations


Journal ArticleDOI
TL;DR: It is concluded that despite challenges and limitations of current off-the-shelf wearables, the utilization of these devices offers novel opportunities for evaluating episodic changes in physiological signals as a marker for mental state during everyday activities including in outdoor environments.
Abstract: Advances in commercial wearable devices are increasingly facilitating the collection and analysis of everyday physiological data. This paper discusses the theoretical and practical aspects of using such ambulatory devices for the detection of episodic changes in physiological signals as a marker for mental state in outdoor environments. A pilot study was conducted to evaluate the feasibility of utilizing commercial wearables in combination with location tracking technologies. The study measured physiological signals for 15 participants, including heart rate, heart-rate variability, and skin conductance. Participants' signals were recorded during an outdoor walk that was tracked using a GPS logger. The walk was designed to pass through various types of environments including green, blue, and urban spaces as well as a more stressful road crossing. The data that was obtained was used to demonstrate how biosensors information can be contextualized and enriched using location information. Significant episodic changes in physiological signals under real-world conditions were detectable in the stressful road crossing, but not in the other types of environments. The article concludes that despite challenges and limitations of current off-the-shelf wearables, the utilization of these devices offers novel opportunities for evaluating episodic changes in physiological signals as a marker for mental state during everyday activities including in outdoor environments.

48 citations


Posted Content
TL;DR: The importance of thoughtful incorporation of covariates to address confounding bias in difference‐in‐difference studies is emphasized, and analysts should begin by postulating a causal model that relates covariates, both time‐varying and those with time‐ varying effects on the outcome, to treatment.
Abstract: Difference-in-differences (diff-in-diff) is a study design that compares outcomes of two groups (treated and comparison) at two time points (pre- and post-treatment) and is widely used in evaluating new policy implementations. For instance, diff-in-diff has been used to estimate the effect that increasing minimum wage has on employment rates and to assess the Affordable Care Act's effect on health outcomes. Although diff-in-diff appears simple, potential pitfalls lurk. In this paper, we discuss one such complication: time-varying confounding. We provide rigorous definitions for confounders in diff-in-diff studies and explore regression strategies to adjust for confounding. In simulations, we show how and when regression adjustment can ameliorate confounding for both time-invariant and time-varying covariates. We compare our regression approach to those models commonly fit in applied literature, which often fail to address the time-varying nature of confounding in diff-in-diff.

43 citations


Posted Content
TL;DR: In this paper, a discrete GM(1,1) model with Simpson formula was proposed to predict the Gross Domestic Product and the freightage of Lanzhou, and the results illustrate that this model provides accurate prediction.
Abstract: The classical GM(1,1) model is an efficient tool to {make accurate forecasts} with limited samples. But the accuracy of the GM(1,1) model still needs to be improved. This paper proposes a novel discrete GM(1,1) model, named ${\rm GM_{SD}}$(1,1) model, of which the background value is reconstructed using Simpson formula. The expression of the specific time response function is deduced, and the relationship between our model} and the continuous GM(1,1) model with Simpson formula called ${\rm GM_{SC} }$(1,1) model is systematically discussed. The proposed model is proved to be unbiased to simulate the homogeneous exponent sequence. Further, some numerical examples are given to validate the accuracy of the new ${\rm GM_{SD}}$(1,1) model. Finally, this model is used to predict the Gross Domestic Product and the freightage of Lanzhou, and the results illustrate the ${\rm GM_{SD}}$(1,1) model provides accurate prediction.

41 citations


Journal ArticleDOI
TL;DR: This article suggests that approaches to the problem that involve appropriate use of independent information have the potential to resolve the contention about believability of data from networks of low-cost measurement devices.
Abstract: Plausibility of data from networks of low-cost measurement devices is a growing and important contentious issue. Informal networks of low-cost devices have particularly come to prominence for air quality monitoring. The contentious point is the believability of data without regular on-site calibration since that is a specialist task and the costs very quickly become very much larger than the cost of installation in the first place. This article suggests that approaches to the problem that involve appropriate use of independent information have the potential to resolve the contention. Ideas are illustrated particularly with reference to low-cost sensor networks for air quality measurement.

37 citations


Posted Content
TL;DR: Design and analysis considerations based on a combination test under different non-proportional hazard types and a straw man proposal for practitioners are provided.
Abstract: Loss of power and clear description of treatment differences are key issues in designing and analyzing a clinical trial where non-proportional hazard is a possibility. A log-rank test may be very inefficient and interpretation of the hazard ratio estimated using Cox regression is potentially problematic. In this case, the current ICH E9 (R1) addendum would suggest designing a trial with a clinically relevant estimand, e.g., expected life gain. This approach considers appropriate analysis methods for supporting the chosen estimand. However, such an approach is case specific and may suffer lack of power for important choices of the underlying alternate hypothesis distribution. On the other hand, there may be a desire to have robust power under different deviations from proportional hazards. Also, we would contend that no single number adequately describes treatment effect under non-proportional hazards scenarios. The cross-pharma working group has proposed a combination test to provide robust power under a variety of alternative hypotheses. These can be specified for primary analysis at the design stage and methods appropriately accounting for combination test correlations are efficient for a variety of scenarios. We have provided design and analysis considerations based on a combination test under different non-proportional hazard types and present a straw man proposal for practitioners. The proposals are illustrated with real life example and simulation.

33 citations


Posted Content
TL;DR: A deep learning model designed for predicting future obesity patterns from generally available items on children’s medical history is presented and it outperforms a series of existing studies in the literature and outperforms their performance in most age ranges.
Abstract: Childhood obesity is a major public health challenge. Obesity in early childhood and adolescence can lead to obesity and other health problems in adulthood. Early prediction and identification of the children at a high risk of developing childhood obesity may help in engaging earlier and more effective interventions to prevent and manage this and other related health conditions. Existing predictive tools designed for childhood obesity primarily rely on traditional regression-type methods without exploiting longitudinal patterns of children's data (ignoring data temporality). In this paper, we present a machine learning model specifically designed for predicting future obesity patterns from generally available items on children's medical history. To do this, we have used a large unaugmented EHR (Electronic Health Record) dataset from a major pediatric health system in the US. We adopt a general LSTM (long short-term memory) network architecture for our model for training over dynamic (sequential) and static (demographic) EHR data. We have additionally included a set embedding and attention layers to compute the feature ranking of each timestamp and attention scores of each hidden layer corresponding to each input timestamp. These feature ranking and attention scores added interpretability at both the features and the timestamp-level.

Posted Content
TL;DR: Bayesian additive regression trees (BART) is a flexible prediction model/machine learning approach that has gained widespread popularity in recent years as mentioned in this paper, from what it is to why it works.
Abstract: Bayesian additive regression trees (BART) is a flexible prediction model/machine learning approach that has gained widespread popularity in recent years. As BART becomes more mainstream, there is an increased need for a paper that walks readers through the details of BART, from what it is to why it works. This tutorial is aimed at providing such a resource. In addition to explaining the different components of BART using simple examples, we also discuss a framework, the General BART model, that unifies some of the recent BART extensions, including semiparametric models, correlated outcomes, statistical matching problems in surveys, and models with weaker distributional assumptions. By showing how these models fit into a single framework, we hope to demonstrate a simple way of applying BART to research problems that go beyond the original independent continuous or binary outcomes framework.

Posted Content
TL;DR: Wang et al. as discussed by the authors used a novel bibliometric approach to estimate the stocks of overseas Chinese and returnees from the perspective of their publication activities, albeit with some limitations.
Abstract: The Chinese approach to developing a world-class science system includes a vigorous set of programmes to attract back Chinese researchers who have overseas training and work experience. No analysis is available to show the performance of these mobile researchers. This article attempts to close part of this gap. Using a novel bibliometric approach, we estimate the stocks of overseas Chinese and returnees from the perspective of their publication activities, albeit with some limitations. We show that the share of overseas Chinese scientists in the US is considerably larger than that in the EU. We also show that Chinese returnees publish higher impact work, and continue to publish more and at the international level than domestic counterparts. Returnees not only tend to publish more, but they are instrumental in linking China into the global network. Indeed, returnees actively co-publish with researchers in their former host system, showing the importance of scientific social capital. Future research will examine the impact of length of stay, among other factors, on such impact and integration.

Posted Content
TL;DR: The authors show that the difference-in-differences and lagged-dependent-variable regression estimates have a bracketing relationship, and extend the result to semiparametric estimation based on inverse probability weighting.
Abstract: Difference-in-differences is a widely-used evaluation strategy that draws causal inference from observational panel data. Its causal identification relies on the assumption of parallel trends, which is scale dependent and may be questionable in some applications. A common alternative is a regression model that adjusts for the lagged dependent variable, which rests on the assumption of ignorability conditional on past outcomes. In the context of linear models, \citet{APbook} show that the difference-in-differences and lagged-dependent-variable regression estimates have a bracketing relationship. Namely, for a true positive effect, if ignorability is correct, then mistakenly assuming parallel trends will overestimate the effect; in contrast, if the parallel trends assumption is correct, then mistakenly assuming ignorability will underestimate the effect. We show that the same bracketing relationship holds in general nonparametric (model-free) settings. We also extend the result to semiparametric estimation based on inverse probability weighting. We provide three examples to illustrate the theoretical results with replication files in \citet{ding2019bracketingData}.

Book ChapterDOI
TL;DR: In this chapter, space-time analysis of surveillance count data is considered and time series SIR (TSIR) models originally described by Finkenstadt and Grenfell and the epidemic/endemic models first proposed by Held, Hohle, and Hofmann are implemented.
Abstract: Author(s): Wakefield, Jon; Dong, Tracy Qi; Minin, Vladimir N | Abstract: In this chapter, we consider space-time analysis of surveillance count data. Such data are ubiquitous and a number of approaches have been proposed for their analysis. We first describe the aims of a surveillance endeavor, before reviewing and critiquing a number of common models. We focus on models in which time is discretized to the time scale of the latent and infectious periods of the disease under study. In particular, we focus on the time series SIR (TSIR) models originally described by Finkenstadt and Grenfell in their 2000 paper and the epidemic/endemic models first proposed by Held, Hohle, and Hofmann in their 2005 paper. We implement both of these models in the Stan software and illustrate their performance via analyses of measles data collected over a 2-year period in 17 regions in the Weser-Ems region of Lower Saxony, Germany.

Posted Content
TL;DR: In this paper, the authors evaluate different methods for wind power simulation on four spatial resolution levels from wind park to national level in Brazil and show that bias correction with the Global Wind Atlas (GWA) improves results on state, sub-system, and national level but not on wind park level.
Abstract: NASAs MERRA-2 reanalysis is a widely used dataset in renewable energy resource modelling. The Global Wind Atlas (GWA) has been used to bias-correct MERRA-2 data before. There is, however, a lack of an analysis of the performance of MERRA-2 with bias correction from GWA on different spatial levels - and for regions outside of Europe, China or the United States. This study therefore evaluates different methods for wind power simulation on four spatial resolution levels from wind park to national level in Brazil. In particular, spatial interpolation methods and spatial as well as spatiotemporal wind speed bias correction using local wind speed measurements and mean wind speeds from the GWA are assessed. By validating the resulting timeseries against observed generation it is assessed at which spatial levels the different methods improve results - and whether global information derived from the GWA can compete with locally measured wind speed data as a source of bias correction. Results show that (i) bias correction with the GWA improves results on state, sub-system, and national-level, but not on wind park level, that (ii) the GWA improves results comparably to local measurements, and that (iii) complex spatial interpolation methods do not contribute in improving quality of the simulation.

Journal ArticleDOI
TL;DR: This large scale study is a step forward toward assessing the development of a reliable, cost-effective, and practical clinical decision support tool for screening the population at large for PD using telephone-quality voice.
Abstract: Recent studies have demonstrated that analysis of laboratory-quality voice recordings can be used to accurately differentiate people diagnosed with Parkinson's disease (PD) from healthy controls (HC). These findings could help facilitate the development of remote screening and monitoring tools for PD. In this study, we analyzed 2759 telephone-quality voice recordings from 1483 PD and 15321 recordings from 8300 HC participants. To account for variations in phonetic backgrounds, we acquired data from seven countries. We developed a statistical framework for analyzing voice, whereby we computed 307 dysphonia measures that quantify different properties of voice impairment, such as, breathiness, roughness, monopitch, hoarse voice quality, and exaggerated vocal tremor. We used feature selection algorithms to identify robust parsimonious feature subsets, which were used in combination with a Random Forests (RF) classifier to accurately distinguish PD from HC. The best 10-fold cross-validation performance was obtained using Gram-Schmidt Orthogonalization (GSO) and RF, leading to mean sensitivity of 64.90% (standard deviation, SD 2.90%) and mean specificity of 67.96% (SD 2.90%). This large-scale study is a step forward towards assessing the development of a reliable, cost-effective and practical clinical decision support tool for screening the population at large for PD using telephone-quality voice.

Journal ArticleDOI
Fan Li1
TL;DR: This article considers the proportional decay correlation structure for a cohort stepped wedge design, and provides a matrix-adjusted quasi-least squares approach to accurately estimate the correlation parameters along with the marginal intervention effect, and develops the sample size and power procedures accounting for the correlation decay.
Abstract: A stepped wedge cluster randomized trial is a type of longitudinal cluster design that sequentially switches clusters to intervention over time until all clusters are treated. While the traditional posttest-only parallel design requires adjustment for a single intraclass correlation coefficient, the stepped wedge design allows multiple outcome measurements from the same cluster and so additional correlation parameters are necessary to characterize the within-cluster correlation structure. Although a number of studies have differentiated between the concepts of within-period and between-period correlations, only a few studies have allowed the between-period correlation to decay over time. In this article, we consider the proportional decay correlation structure for a cohort stepped wedge design, and provide a matrix-adjusted quasi-least squares (MAQLS) approach to accurately estimate the correlation parameters along with the marginal intervention effect. We further develop the sample size and power procedures accounting for the correlation decay, and investigate the accuracy of the power procedure with continuous outcomes in a simulation study. We show that the empirical power agrees well with the prediction even with as few as 9 clusters, when data are analyzed with MAQLS concurrently with a suitable bias-corrected sandwich variance. Two trial examples are provided to illustrate the new sample size procedure.

Journal ArticleDOI
TL;DR: An overview of a new Python package called semopy that was specifically developed to overcome limitations in Structural equation modeling and its performance in accuracy and execution time is compared to lavaan.
Abstract: Structural equation modelling (SEM) is a multivariate statistical technique for estimating complex relationships between observed and latent variables. Although numerous SEM packages exist, each of them has limitations. Some packages are not free or open-source; the most popular package not having this disadvantage is $\textbf{lavaan}$, but it is written in R language, which is behind current mainstream tendencies that make it harder to be incorporated into developmental pipelines (i.e. bioinformatical ones). Thus we developed the Python package $\textbf{semopy}$ to satisfy those criteria. The paper provides detailed examples of package usage and explains it's inner clockworks. Moreover, we developed the unique generator of SEM models to extensively test SEM packages and demonstrated that $\textbf{semopy}$ significantly outperforms $\textbf{lavaan}$ in execution time and accuracy.

Journal ArticleDOI
TL;DR: Three data sets collected in the field of modern slavery, together with a data set about the death toll in the Kosovo conflict, are used to investigate the stability and robustness of various multiple‐systems‐estimate methods.
Abstract: Multiple systems estimation is a key approach for quantifying hidden populations such as the number of victims of modern slavery. The UK Government published an estimate of 10,000 to 13,000 victims, constructed by the present author, as part of the strategy leading to the Modern Slavery Act 2015. This estimate was obtained by a stepwise multiple systems method based on six lists. Further investigation shows that a small proportion of the possible models give rather different answers, and that other model fitting approaches may choose one of these. Three data sets collected in the field of modern slavery, together with a data set about the death toll in the Kosovo conflict, are used to investigate the stability and robustness of various multiple systems estimate methods. The crucial aspect is the way that interactions between lists are modelled, because these can substantially affect the results. Model selection and Bayesian approaches are considered in detail, in particular to assess their stability and robustness when applied to real modern slavery data. A new Markov Chain Monte Carlo Bayesian approach is developed; overall, this gives robust and stable results at least for the examples considered. The software and datasets are freely and publicly available to facilitate wider implementation and further research.

Posted Content
TL;DR: Based on stock-market data spanning over thirty years, it is shown that estimating the covariance matrix under MTP_2 outperforms previous state-of-the-art methods including shrinkage estimators and factor models.
Abstract: Selecting the optimal Markowitz porfolio depends on estimating the covariance matrix of the returns of $N$ assets from $T$ periods of historical data. Problematically, $N$ is typically of the same order as $T$, which makes the sample covariance matrix estimator perform poorly, both empirically and theoretically. While various other general purpose covariance matrix estimators have been introduced in the financial economics and statistics literature for dealing with the high dimensionality of this problem, we here propose an estimator that exploits the fact that assets are typically positively dependent. This is achieved by imposing that the joint distribution of returns be multivariate totally positive of order 2 ($\text{MTP}_2$). This constraint on the covariance matrix not only enforces positive dependence among the assets, but also regularizes the covariance matrix, leading to desirable statistical properties such as sparsity. Based on stock-market data spanning over thirty years, we show that estimating the covariance matrix under $\text{MTP}_2$ outperforms previous state-of-the-art methods including shrinkage estimators and factor models.

Posted Content
Sebastian Weber1, Yue Li1, John W. Seaman1, Tomoyuki Kakizume1, Heinz Schmidli1 
TL;DR: The framework of robust Bayesian evidence synthesis in this setting is introduced and it is explained how RBesT facilitates the derivation and evaluation of an informative MAP prior from historical control data.
Abstract: Use of historical data in clinical trial design and analysis has shown various advantages such as reduction of within-study placebo-treated number of subjects and increase of study power. The meta-analytic-predictive (MAP) approach accounts with a hierarchical model for between-trial heterogeneity in order to derive an informative prior from historical (often control) data. In this paper, we introduce the package RBesT (R Bayesian Evidence Synthesis Tools) which implements the MAP approach with normal (known sampling standard deviation), binomial and Poisson endpoints. The hierarchical MAP model is evaluated by MCMC. The numerical MCMC samples representing the MAP prior are approximated with parametric mixture densities which are obtained with the expectation maximization algorithm. The parametric mixture density representation facilitates easy communication of the MAP prior and enables via fast and accurate analytical procedures to evaluate properties of trial designs with informative MAP priors. The paper first introduces the framework of robust Bayesian evidence synthesis in this setting and then explains how RBesT facilitates the derivation and evaluation of an informative MAP prior from historical control data. In addition we describe how the meta-analytic framework relates to further applications including probability of success calculations.

Posted Content
TL;DR: This paper presents the results of large-scale field tests conducted in Murchison Falls and Srepok Wildlife Sanctuary which confirm that the predictive power of PAWS extends promisingly to multiple parks, and applies the methodology to three national parks with diverse characteristics.
Abstract: Illegal wildlife poaching threatens ecosystems and drives endangered species toward extinction. However, efforts for wildlife protection are constrained by the limited resources of law enforcement agencies. To help combat poaching, the Protection Assistant for Wildlife Security (PAWS) is a machine learning pipeline that has been developed as a data-driven approach to identify areas at high risk of poaching throughout protected areas and compute optimal patrol routes. In this paper, we take an end-to-end approach to the data-to-deployment pipeline for anti-poaching. In doing so, we address challenges including extreme class imbalance (up to 1:200), bias, and uncertainty in wildlife poaching data to enhance PAWS, and we apply our methodology to three national parks with diverse characteristics. (i) We use Gaussian processes to quantify predictive uncertainty, which we exploit to improve robustness of our prescribed patrols and increase detection of snares by an average of 30%. We evaluate our approach on real-world historical poaching data from Murchison Falls and Queen Elizabeth National Parks in Uganda and, for the first time, Srepok Wildlife Sanctuary in Cambodia. (ii) We present the results of large-scale field tests conducted in Murchison Falls and Srepok Wildlife Sanctuary which confirm that the predictive power of PAWS extends promisingly to multiple parks. This paper is part of an effort to expand PAWS to 800 parks around the world through integration with SMART conservation software.

Posted Content
TL;DR: A novel meta-learning algorithm for time series forecasting using the efficient Bayesian multivariate surface regression approach to model forecast error as a function of features calculated from the time series.
Abstract: This paper introduces a novel meta-learning algorithm for time series forecast model performance prediction. We model the forecast error as a function of time series features calculated from the historical time series with an efficient Bayesian multivariate surface regression approach. The minimum predicted forecast error is then used to identify an individual model or a combination of models to produce the final forecasts. It is well-known that the performance of most meta-learning models depends on the representativeness of the reference dataset used for training. In such circumstances, we augment the reference dataset with a feature-based time series simulation approach, namely GRATIS, in generating a rich and representative time series collection. The proposed framework is tested using the M4 competition data and is compared against commonly used forecasting approaches. Our approach provides comparable performances to other model selection/combination approaches but at a lower computational cost and a higher degree of interpretability, which is important for supporting decisions. We also provide useful insights regarding which forecasting models are expected to work better for particular types of time series, the intrinsic mechanisms of the meta-learners and how the forecasting performances are affected by various factors.

Journal ArticleDOI
TL;DR: In this paper, a data-driven approach is proposed to understand and predict highway travel time using spatio-temporal features of those factors, all of which are acquired from multiple data sources.
Abstract: Travel time on a route varies substantially by time of day and from day to day. It is critical to understand to what extent this variation is correlated with various factors, such as weather, incidents, events or travel demand level in the context of dynamic networks. This helps a better decision making for infrastructure planning and real-time traffic operation. We propose a data-driven approach to understand and predict highway travel time using spatio-temporal features of those factors, all of which are acquired from multiple data sources. The prediction model holistically selects the most related features from a high-dimensional feature space by correlation analysis, principle component analysis and LASSO. We test and compare the performance of several regression models in predicting travel time 30 min in advance via two case studies: (1) a 6-mile highway corridor of I-270N in D.C. region, and (2) a 2.3-mile corridor of I-376E in Pittsburgh region. We found that some bottlenecks scattered in the network can imply congestion on those corridors at least 30 minutes in advance, including those on the alternative route to the corridors of study. In addition, real-time travel time is statistically related to incidents on some specific locations, morning/afternoon travel demand, visibility, precipitation, wind speed/gust and the weather type. All those spatio-temporal information together help improve prediction accuracy, comparing to using only speed data. In both case studies, random forest shows the most promise, reaching a root-mean-squared error of 16.6\% and 17.0\% respectively in afternoon peak hours for the entire year of 2014.

Posted Content
TL;DR: Researchers across the health and social sciences generally assume that observations are independent, even while relying on convenience samples that draw subjects from one or a small number of committal samples.
Abstract: Researchers across the health and social sciences generally assume that observations are independent, even while relying on convenience samples that draw subjects from one or a small number of communities, schools, hospitals, etc. A paradigmatic example of this is the Framingham Heart Study (FHS). Many of the limitations of such samples are well-known, but the issue of statistical dependence due to social network ties has not previously been addressed. We show that, along with anticonservative variance estimation, this can result in spurious associations due to network dependence. Using a statistical test that we adapted from one developed for spatial autocorrelation, we test for network dependence in several of the thousands of influential papers that have been published using FHS data. Results suggest that some of the many decades of research on coronary heart disease, other health outcomes, and peer influence using FHS data may suffer from spurious associations, error-prone point estimates, and anticonservative inference due to unacknowledged network dependence. These issues are not unique to the FHS; as researchers in psychology, medicine, and beyond grapple with replication failures, this unacknowledged source of invalid statistical inference should be part of the conversation.

Posted Content
TL;DR: This work applies a particle filter with three additional procedures (model reduction, tempering and jittering) to a damped and forced incompressible 2D Euler dynamics defined on a simply connected bounded domain and shows that using the combined algorithm, it is able to successfully assimilate data from a reference system state modelled by a highly resolved numerical solution of the flow.
Abstract: In this work, we combine a stochastic model reduction with a particle filter augmented with tempering and jittering, and apply the combined algorithm to a damped and forced incompressible 2D Euler dynamics defined on a simply connected bounded domain. We show that using the combined algorithm, we are able to assimilate data from a reference system state (the ``truth") modelled by a highly resolved numerical solution of the flow that has roughly $3.1\times10^6$ degrees of freedom, into a stochastic system having two orders of magnitude less degrees of freedom, which is able to approximate the true state reasonably accurately for $5$ large scale eddy turnover times, using modest computational hardware. The model reduction is performed through the introduction of a stochastic advection by Lie transport (SALT) model as the signal on a coarser resolution. The SALT approach was introduced as a general theory using a geometric mechanics framework from Holm, Proc. Roy. Soc. A (2015). This work follows on the numerical implementation for SALT presented by Cotter et al, SIAM Multiscale Model. Sim. (2019) for the flow in consideration. The model reduction is substantial: The reduced SALT model has $4.9\times 10^4$ degrees of freedom. Results from reliability tests on the assimilated system are also presented.

Posted Content
TL;DR: The endemic-epidemic framework is considered, an autoregressive model class for infectious disease surveillance counts, and the default autoregression on counts from the previous time period is replaced with more flexible weighting schemes inspired by discrete-time serial interval distributions.
Abstract: Multivariate count time series models are an important tool for the analysis and prediction of infectious disease spread. We consider the endemic-epidemic framework, an autoregressive model class for infectious disease surveillance counts, and replace the default autoregression on counts from the previous time period with more flexible weighting schemes inspired by discrete-time serial interval distributions. We employ three different parametric formulations, each with an additional unknown weighting parameter estimated via a profile likelihood approach, and compare them to an unrestricted nonparametric approach. The new methods are illustrated in a univariate analysis of dengue fever incidence in San Juan, Puerto Rico, and a spatio-temporal study of viral gastroenteritis in the twelve districts of Berlin. We assess the predictive performance of the suggested models and several reference models at various forecast horizons. In both applications, the performance of the endemic-epidemic models is considerably improved by the proposed weighting schemes.

Posted Content
TL;DR: In this paper, a method for forecasting satellite-based indicators of vegetation condition is presented, which can identify a deteriorating vegetation condition well and sufficiently in advance to help disaster risk managers act early to support vulnerable communities and limit the impact of a drought hazard.
Abstract: Droughts are a recurring hazard in sub-Saharan Africa, that can wreak huge socioeconomic costs.Acting early based on alerts provided by early warning systems (EWS) can potentially provide substantial mitigation, reducing the financial and human cost. However, existing EWS tend only to monitor current, rather than forecast future, environmental and socioeconomic indicators of drought, and hence are not always sufficiently timely to be effective in practice. Here we present a novel method for forecasting satellite-based indicators of vegetation condition. Specifically, we focused on the 3-month Vegetation Condition Index (VCI3M) over pastoral livelihood zones in Kenya, which is the indicator used by the Kenyan National Drought Management Authority(NDMA). Using data from MODIS and Landsat, we apply linear autoregression and Gaussian process modeling methods and demonstrate high forecasting skill several weeks ahead. As a benchmark we predicted the drought alert marker used by NDMA (VCI3M<35). Both of our models were able to predict this alert marker four weeks ahead with a hit rate of around 89% and a false alarm rate of around 4%, or 81% and 6% respectively six weeks ahead. The methods developed here can thus identify a deteriorating vegetation condition well and sufficiently in advance to help disaster risk managers act early to support vulnerable communities and limit the impact of a drought hazard.

Posted Content
TL;DR: In this paper, the authors present how statistical methods can contribute to choosing a tournament format that is in line with the above axiom, and they show that being a top team in the lowest ranked League D of the 2018/19 UEFA Nations League substantially increases the probability of qualifying compared to being a bottom team in a higher-ranked League C.
Abstract: The integrity of a sport can be seriously undermined if its rules punish winning as this creates incentives for strategic manipulation. Therefore, a sports tournament can be called unfair if the overall win probabilities are not ordered according to the teams' ranking based on their past performances. We present how statistical methods can contribute to choosing a tournament format that is in line with the above axiom. In particular, the qualification for the 2020 UEFA European Championship is shown to violate this requirement: being a top team in the lowest-ranked League D of the 2018/19 UEFA Nations League substantially increases the probability of qualifying compared to being a bottom team in the higher-ranked League C. The unfairness can be remarkably reduced or even eliminated with slightly changing the path formation policy of the UEFA Euro 2020 qualifying play-offs. The misaligned design has severely punished a team for winning a match years before. Since the deficiency is an inherent feature of the qualifying process, the Union of European Football Associations (UEFA) should reconsider the format of future tournaments to eliminate the unfair advantage enjoyed by certain teams.