Showing papers in "arXiv: Statistical Finance in 2016"


Posted Content
TL;DR: This paper analyzes multi-period mortgage risk at loan and pool levels using an unprecedented dataset of over 120 million prime and subprime mortgages originated across the United States between 1995 and 2014, which includes the individual characteristics of each loan, monthly updates on loan performance over the life of a loan, and a number of time-varying economic variables at the zip code level.
Abstract: We develop a deep learning model of multi-period mortgage risk and use it to analyze an unprecedented dataset of origination and monthly performance records for over 120 million mortgages originated across the US between 1995 and 2014. Our estimators of term structures of conditional probabilities of prepayment, foreclosure and various states of delinquency incorporate the dynamics of a large number of loan-specific as well as macroeconomic variables down to the zip-code level. The estimators uncover the highly nonlinear nature of the relationship between the variables and borrower behavior, especially prepayment. They also highlight the effects of local economic conditions on borrower behavior. State unemployment has the greatest explanatory power among all variables, offering strong evidence of the tight connection between housing finance markets and the macroeconomy. The sensitivity of a borrower to changes in unemployment strongly depends upon current unemployment. It also significantly varies across the entire borrower population, which highlights the interaction of unemployment and many other variables. These findings have important implications for mortgage-backed security investors, rating agencies, and housing finance policymakers.

126 citations


Journal Article
TL;DR: This work proposes a method for characterizing the joint multifractal nature of long-range cross correlations using wavelet leaders (MF-X-WL), and applies it to pairs of series from financial markets and online worlds, finding intriguing joint multifractal behavior.
Abstract: Mutually interacting components form complex systems and the outputs of these components are usually long-range cross-correlated. Using wavelet leaders, we propose a method of characterizing the joint multifractal nature of these long-range cross correlations, a method we call joint multifractal analysis based on wavelet leaders (MF-X-WL). We test the validity of the MF-X-WL method by performing extensive numerical experiments on the dual binomial measures with multifractal cross correlations and the bivariate fractional Brownian motions (bFBMs) with monofractal cross correlations. Both experiments indicate that MF-X-WL is capable of detecting the cross correlations in synthetic data with acceptable estimation errors. We also apply the MF-X-WL method to pairs of series from financial markets (returns and volatilities) and online worlds (online numbers of different genders and different societies) and find intriguing joint multifractal behavior.

41 citations


Posted Content
TL;DR: In this paper, the authors introduce a new class of continuous-time models of the stochastic volatility of asset prices, which can simultaneously incorporate roughness and slowly decaying autocorrelations, including proper long memory.
Abstract: We introduce a new class of continuous-time models of the stochastic volatility of asset prices. The models can simultaneously incorporate roughness and slowly decaying autocorrelations, including proper long memory, which are two stylized facts often found in volatility data. Our prime model is based on the so-called Brownian semistationary process and we derive a number of theoretical properties of this process, relevant to volatility modeling. Applying the models to realized volatility measures covering a vast panel of assets, we find evidence consistent with the hypothesis that time series of realized measures of volatility are both rough and very persistent. Lastly, we illustrate the utility of the models in an extensive forecasting study; we find that the models proposed in this paper outperform a wide array of benchmarks considerably, indicating that it pays off to exploit both roughness and persistence in volatility forecasting.

40 citations


Journal Article
TL;DR: This paper performs a comparative analysis of the Chinese stock market around the occurrence of the 2008 crisis based on the random matrix analysis of high-frequency stock returns of 1228 stocks listed on the Shanghai and Shenzhen stock exchanges.
Abstract: We perform a comparative analysis of the Chinese stock market around the occurrence of the 2008 crisis based on the random matrix analysis of high-frequency stock returns of 1228 stocks listed on the Shanghai and Shenzhen stock exchanges. Both the raw correlation matrix and the partial correlation matrix with respect to the market index are investigated in two time periods of one year each. We find that the Chinese stocks have stronger average correlation and partial correlation in 2008 than in 2007 and that the average partial correlation is significantly weaker than the average correlation in each period. Accordingly, the largest eigenvalue of the correlation matrix is remarkably greater than that of the partial correlation matrix in each period. Moreover, each largest eigenvalue and its eigenvector reflect an evident market effect, while other deviating eigenvalues do not. We find no evidence that deviating eigenvalues contain industrial sectorial information. Surprisingly, the eigenvectors of the second largest eigenvalues in 2007 and of the third largest eigenvalues in 2008 are able to distinguish the stocks from the two exchanges. We also find that the component magnitudes of some of the largest eigenvectors are proportional to the stocks' capitalizations.

37 citations
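
The partial correlation with respect to the market index mentioned in the abstract above can be illustrated with the standard first-order partial correlation formula. This is a minimal sketch, not the authors' code: the function name and the input values are hypothetical, and the abstract does not state which estimator the paper actually uses.

```python
import math


def partial_correlation(rho_ij: float, rho_im: float, rho_jm: float) -> float:
    """First-order partial correlation of stocks i and j given the index m.

    rho_ij: raw correlation between stocks i and j
    rho_im, rho_jm: correlation of each stock with the market index
    """
    denom = math.sqrt((1.0 - rho_im ** 2) * (1.0 - rho_jm ** 2))
    if denom == 0.0:
        raise ValueError("a stock is perfectly correlated with the index")
    return (rho_ij - rho_im * rho_jm) / denom


# Hypothetical correlations, not taken from the paper.
print(partial_correlation(0.5, 0.5, 0.5))
```

Removing the common dependence on the index lowers the pairwise correlation in this toy case, which is in line with the abstract's finding that the average partial correlation is weaker than the average raw correlation.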


Journal Article
TL;DR: In this paper, the authors reinterpreted popular mortality models such as the Lee-Carter class of models in a general state-space modelling methodology, which allows modelling, estimation and forecasting of mortality under a unified framework.
Abstract: This paper explores and develops alternative statistical representations and estimation approaches for dynamic mortality models. The framework we adopt is to reinterpret popular mortality models such as the Lee-Carter class of models in a general state-space modelling methodology, which allows modelling, estimation and forecasting of mortality under a unified framework. Furthermore, we propose an alternative class of model identification constraints which is more suited to statistical inference in filtering and parameter estimation settings based on maximization of the marginalized likelihood or in Bayesian inference. We then develop a novel class of Bayesian state-space models which incorporate a priori beliefs about the mortality model characteristics and allow for more flexible and appropriate assumptions relating to the heteroscedasticity that is present in observed mortality data. We show that multiple period and cohort effects can be cast under a state-space structure. To study long term mortality dynamics, we introduce stochastic volatility to the period effect. The estimation of the resulting stochastic volatility model of mortality is performed using a recent class of Monte Carlo procedures specifically designed for state and parameter estimation in Bayesian state-space models, known as the class of particle Markov chain Monte Carlo methods. We illustrate the framework we have developed using Danish male mortality data, and show that incorporating heteroscedasticity and stochastic volatility markedly improves model fit despite an increase in model complexity. Forecasting properties of the enhanced models are examined with long term and short term calibration periods on the reconstruction of life tables.

30 citations
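
For orientation, the basic Lee-Carter model referred to above is commonly written in state-space form as below. This is the textbook version, assumed here for illustration: Gaussian noise and a random walk with drift for the period effect. The abstract does not spell out the paper's exact specification, and the symbols are not taken from it.

```latex
\begin{aligned}
y_{x,t} &= \alpha_x + \beta_x \kappa_t + \varepsilon_{x,t},
  & \varepsilon_{x,t} &\sim N(0, \sigma_\varepsilon^2)
  && \text{(observation equation)} \\
\kappa_t &= \kappa_{t-1} + \theta + \omega_t,
  & \omega_t &\sim N(0, \sigma_\omega^2)
  && \text{(state equation)}
\end{aligned}
```

Here $y_{x,t}$ is the log death rate at age $x$ in year $t$, $\alpha_x$ and $\beta_x$ are age effects, $\kappa_t$ is the latent period effect and $\theta$ is its drift. The usual identification constraints are $\sum_x \beta_x = 1$ and $\sum_t \kappa_t = 0$; the abstract proposes an alternative class of constraints without stating them.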


Journal Article
TL;DR: In this article, Wang et al. studied the non-vanishing price responses across different stocks in correlated financial markets and evaluated the average cross-responses for a given stock with respect to the whole market or to different sectors.
Abstract: There are non-vanishing price responses across different stocks in correlated financial markets. We further study this issue by performing different averages, which identify active and passive cross-responses. The two average cross-responses show different characteristic dependences on the time lag. The passive cross-response exhibits a shorter response period with sizeable volatilities, while the corresponding period for the active cross-response is longer. The average cross-responses for a given stock are evaluated either with respect to the whole market or to different sectors. Using the response strength, the influences of individual stocks are identified and discussed. Moreover, the various cross-responses as well as the average cross-responses are compared with the self-responses. In contrast to the short memory of the trade sign cross-correlation for individual stock pairs, the sign cross-correlation has long memory when averaged over different pairs of stocks.

20 citations


Posted Content
TL;DR: In this article, the authors derive weaker conditions that can be used in practice to ensure the consistency of the maximum likelihood estimator for a wide class of observation-driven time series models and obtain an asymptotic test and confidence bounds for the unfeasible "true" invertibility region of the parameter space.
Abstract: Invertibility conditions for observation-driven time series models often fail to be guaranteed in empirical applications. As a result, the asymptotic theory of maximum likelihood and quasi-maximum likelihood estimators may be compromised. We derive considerably weaker conditions that can be used in practice to ensure the consistency of the maximum likelihood estimator for a wide class of observation-driven time series models. Our consistency results hold for both correctly specified and misspecified models. The practical relevance of the theory is highlighted in a set of empirical examples. We further obtain an asymptotic test and confidence bounds for the unfeasible "true" invertibility region of the parameter space.

20 citations


Posted Content
TL;DR: In this paper, a hybrid approach integrating the advantages of both a decomposition model (namely, the Maximal Overlap Discrete Wavelet Transform (MODWT)) and machine learning models (ANN and SVR) was proposed to predict the National Stock Exchange Fifty Index.
Abstract: Financial time series such as stock prices and exchange rates are often non-linear and non-stationary. The use of decomposition models has been found to improve the accuracy of predictive models. The paper proposes a hybrid approach integrating the advantages of both a decomposition model (namely, the Maximal Overlap Discrete Wavelet Transform (MODWT)) and machine learning models (ANN and SVR) to predict the National Stock Exchange Fifty Index. In the first phase, the data is decomposed into a smaller number of subseries using MODWT. In the next phase, each subseries is predicted using machine learning models (i.e., ANN and SVR). The predicted subseries are aggregated to obtain the final forecasts. In the final stage, the effectiveness of the proposed approach is evaluated using error measures and a statistical test. The proposed methods (MODWT-ANN and MODWT-SVR) are compared with the ANN and SVR models, and it was observed that the return on investment obtained from trading rules based on the predicted values of the MODWT-SVR model was higher than that of the buy-and-hold strategy.

19 citations


Posted Content
TL;DR: A new approach to estimate high-dimensional factor models, using the empirical spectral density of residuals, which simultaneously provides estimators of the number of factors and information about correlation structures in residuals and shows that the estimators capture essential aspects of market dynamics.
Abstract: In dealing with high-dimensional data sets, factor models are often useful for dimension reduction. The estimation of factor models has been actively studied in various fields. In the first part of this paper, we present a new approach to estimate high-dimensional factor models, using the empirical spectral density of residuals. The spectrum of covariance matrices from financial data typically exhibits two characteristic aspects: a few spikes and bulk. The former represent factors that mainly drive the features and the latter arises from idiosyncratic noise. Motivated by these two aspects, we consider a minimum distance between two spectrums; one from a covariance structure model and the other from real residuals of financial data that are obtained by subtracting principal components. Our method simultaneously provides estimators of the number of factors and information about correlation structures in residuals. Using free random variable techniques, the proposed algorithm can be implemented and controlled effectively. Monte Carlo simulations confirm that our method is robust to noise or the presence of weak factors. Furthermore, the application to financial time-series shows that our estimators capture essential aspects of market dynamics.

18 citations


Posted Content
TL;DR: This paper suggests a simpler measure of production complexity, the logarithm of product diversification, which has a natural foundation in information theory: it measures the information needed to encode the knowledge required to make a country's products.
Abstract: Researchers developed the Economic Complexity Index (ECI) as a measure of the overall sophistication of a country's products. They argued that this measure explains economic growth better than the conventional variables such as human capital. This paper suggests a simpler measure of production complexity, the logarithm of product diversification, which has a natural foundation in information theory: it measures the information needed to encode the knowledge required to make a country's products. This measure explains well the income differences between countries. It has a basic link with ECI that is strongly supported by the data.

17 citations
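
The measure described above is simple enough to sketch directly. The example below is illustrative only: it assumes diversification is the count of distinct products a country makes and uses base-2 logarithms so the result reads as bits. Both choices, as well as the function name and the toy data, are assumptions and not taken from the paper.

```python
import math


def log_diversification(products_by_country: dict[str, set[str]]) -> dict[str, float]:
    """Return log2(number of distinct products) for each country.

    Countries with no products are skipped because log(0) is undefined.
    """
    return {
        country: math.log2(len(products))
        for country, products in products_by_country.items()
        if products
    }


# Toy data, purely hypothetical.
toy = {
    "A": {"cars", "chips", "wine", "steel"},
    "B": {"wheat"},
}
print(log_diversification(toy))
```

In this reading, a country making four products scores two bits and a country making a single product scores zero.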


Posted Content
TL;DR: It is suggested that the significant causality of social media on stocks' returns is purely nonlinear in most cases and that social media dominates the directional coupling with the stock market, an effect not observable within linear modeling.
Abstract: Online social networks offer a new way to investigate financial markets' dynamics by enabling the large-scale analysis of investors' collective behavior. We provide empirical evidence that suggests social media and stock markets have a nonlinear causal relationship. We take advantage of an extensive data set composed of social media messages related to DJIA index components. By using information-theoretic measures to account for possible nonlinear causal coupling between social media and stock market systems, we point out stunning differences in the results with respect to linear coupling. Two main conclusions are drawn: first, the significant causality of social media on stocks' returns is purely nonlinear in most cases; second, social media dominates the directional coupling with the stock market, an effect not observable within linear modeling. Results also serve as empirical guidance on model adequacy in the investigation of sociotechnical and financial systems.

Journal Article
TL;DR: In this paper, three approaches to calculate the self-similarity exponent of a time series are compared in order to determine which one performs best to identify the transition from random efficient market behavior (EM) to herding behavior (HB) and hence, to find out the beginning of a market bubble.
Abstract: In this paper, three approaches to calculate the self-similarity exponent of a time series are compared in order to determine which one performs best to identify the transition from random efficient market behavior (EM) to herding behavior (HB) and hence, to find out the beginning of a market bubble. In particular, classical Detrended Fluctuation Analysis (DFA), Generalized Hurst Exponent (GHE) and GM2 (one of the Geometric Method-based algorithms) were applied for self-similarity exponent calculation purposes. Traditionally, researchers have focused on identifying the beginning of a crash. Instead, we are interested in identifying the beginning of the transition process from EM to a market bubble onset, which we consider could be more interesting. The relevance of the self-similarity index in such a context lies in the fact that it becomes a suitable indicator for identifying the rise of HB in financial markets. Overall, we could state that the greater the self-similarity exponent in financial series, the more likely the transition process to HB could start. This fact is illustrated through actual S&P 500 stocks.

Journal Article
TL;DR: The authors study the predictability of returns in the Chinese stock market by employing the wild bootstrap automatic variance ratio test and the generalized spectral test and find that return predictability varies over time.
Abstract: China's stock market is the largest emerging market in the world. It is widely accepted that the Chinese stock market is far from efficient and that it possesses possible linear and nonlinear dependence. We study the predictability of returns in the Chinese stock market by employing the wild bootstrap automatic variance ratio test and the generalized spectral test. We find that return predictability varies over time and that significant return predictability is observed around periods of market turmoil. Our findings are consistent with the Adaptive Markets Hypothesis and have practical implications for market participants.
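
As background for the variance ratio test named above, the sketch below computes a plain variance ratio statistic from overlapping multi-period returns. It is a simplified building block only: it does not implement the automatic lag selection or the wild bootstrap used in the paper, it omits small-sample bias corrections, and the function name is hypothetical.

```python
import random
import statistics


def variance_ratio(returns: list[float], q: int) -> float:
    """Variance of q-period returns divided by q times the 1-period variance.

    Values near 1 are consistent with uncorrelated returns; values above 1
    suggest positive autocorrelation and values below 1 suggest mean reversion.
    """
    n = len(returns)
    if q < 2 or n <= q:
        raise ValueError("need q >= 2 and more than q observations")
    mu = statistics.fmean(returns)
    var_1 = sum((r - mu) ** 2 for r in returns) / n
    # Overlapping sums of q consecutive returns.
    q_sums = [sum(returns[i:i + q]) for i in range(n - q + 1)]
    var_q = sum((s - q * mu) ** 2 for s in q_sums) / len(q_sums)
    return var_q / (q * var_1)


# Simulated i.i.d. returns (synthetic data, not market data).
rng = random.Random(0)
simulated = [rng.gauss(0.0, 0.01) for _ in range(5000)]
print(variance_ratio(simulated, 5))
```

Applying such a statistic over rolling windows is one simple way to look at predictability that varies over time, although a proper test needs the inference machinery the abstract refers to.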

Posted Content
TL;DR: In this article, a linear shrinkage estimator is proposed for the mean-variance portfolio in the high-dimensional case using the recent results from the theory of random matrices, which is distribution-free and is optimal in the sense of maximizing with probability $1$ the asymptotic out-of-sample expected utility.
Abstract: In this paper we estimate the mean-variance (MV) portfolio in the high-dimensional case using the recent results from the theory of random matrices. We construct a linear shrinkage estimator which is distribution-free and is optimal in the sense of maximizing with probability $1$ the asymptotic out-of-sample expected utility, i.e., the mean-variance objective function, for several values of the risk aversion coefficient, which in particular leads to the maximization of the out-of-sample expected utility, to the maximization of the out-of-sample Sharpe ratio, and to the minimization of the out-of-sample variance. Its asymptotic properties are investigated when the number of assets $p$ together with the sample size $n$ tend to infinity such that $p/n \rightarrow c\in (0,+\infty)$. The results are obtained under weak assumptions imposed on the distribution of the asset returns, namely only the existence of the fourth moments is required. Thereafter we perform numerical and empirical studies where the small- and large-sample behavior of the derived estimator is investigated. The suggested estimator shows significant improvements over naive diversification and is robust to deviations from normality.

Journal Article
TL;DR: In this article, the authors used the distribution of recurrence intervals to predict the occurrence of extreme returns in financial risk management and showed that these extreme returns are predictable in the short term.
Abstract: Being able to predict the occurrence of extreme returns is important in financial risk management. Using the distribution of recurrence intervals (the waiting times between consecutive extremes), we show that these extreme returns are predictable in the short term. Examining a range of different types of returns and thresholds, we find that recurrence intervals follow a $q$-exponential distribution, which we then use to theoretically derive the hazard probability $W(\Delta t |t)$. Maximizing the usefulness of extreme forecasts to define an optimized hazard threshold, we signal a financial extreme occurring within the next day when the hazard probability is greater than the optimized threshold. Both in-sample tests and out-of-sample predictions indicate that these forecasts are more accurate than a benchmark that ignores the predictive signals. This recurrence interval finding deepens our understanding of reoccurring extreme returns and can be applied to forecast extremes in risk management.
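
To make the hazard probability $W(\Delta t|t)$ concrete, the sketch below evaluates it in closed form under two assumptions that the abstract does not spell out: the recurrence-interval density is the normalized $q$-exponential $p(\tau) = (2-q)\lambda[1+(q-1)\lambda\tau]^{-1/(q-1)}$ with $1 < q < 2$, and the hazard is defined as the probability of an extreme within the next $\Delta t$ given that $t$ has elapsed since the last one. The function name, the parameter values and the threshold are hypothetical.

```python
def q_exp_hazard(t: float, dt: float, q: float, lam: float) -> float:
    """W(dt | t) for q-exponentially distributed recurrence intervals.

    Derived from W = 1 - S(t + dt) / S(t), where the survival function is
    S(t) = [1 + (q - 1) * lam * t] ** (1 - 1 / (q - 1)).
    """
    if not 1.0 < q < 2.0:
        raise ValueError("this closed form assumes 1 < q < 2")
    if t < 0.0 or dt < 0.0 or lam <= 0.0:
        raise ValueError("t and dt must be non-negative and lam positive")
    ratio = 1.0 + (q - 1.0) * lam * dt / (1.0 + (q - 1.0) * lam * t)
    return 1.0 - ratio ** (1.0 - 1.0 / (q - 1.0))


# Hypothetical parameters and threshold, for illustration only.
threshold = 0.05
for elapsed in (0.0, 5.0, 20.0):
    w = q_exp_hazard(elapsed, dt=1.0, q=1.5, lam=0.2)
    print(elapsed, round(w, 4), "alarm" if w > threshold else "no alarm")
```

Under this density the hazard falls as the elapsed time grows, so alarms cluster shortly after an extreme, which matches the short-term predictability claimed in the abstract.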

Posted Content
TL;DR: In this paper, the authors use visibility algorithms to quantify the time irreversibility of 35 financial indices, show that this metric is complementary to standard measures based on volatility, and exploit it both to classify periods of financial stress and to rank companies accordingly.
Abstract: The relation between time series irreversibility and entropy production has been recently investigated in thermodynamic systems operating away from equilibrium. In this work we explore this concept in the context of financial time series. We make use of visibility algorithms to quantify in graph-theoretical terms time irreversibility of 35 financial indices evolving over the period 1998-2012. We show that this metric is complementary to standard measures based on volatility and exploit it to both classify periods of financial stress and to rank companies accordingly. We then validate this approach by finding that a projection in principal components space of financial years based on time irreversibility features clusters together periods of financial stress from stable periods. Relations between irreversibility, efficiency and predictability are briefly discussed.

Posted Content
TL;DR: In this article, the existence of a Hawkes self-exciting point process with exponentially decreasing kernel and time-varying parameters was shown and the associated central limit theorem was established.
Abstract: We introduce and show the existence of a Hawkes self-exciting point process with an exponentially decreasing kernel and time-varying parameters. The quantity of interest is defined as the integrated parameter $T^{-1}\int_0^T\theta_t^*dt$, where $\theta_t^*$ is the time-varying parameter, and we consider the high-frequency asymptotics. To estimate it naively, we chop the data into several blocks, compute the maximum likelihood estimator (MLE) on each block, and take the average of the local estimates. The bias of this naive estimator explodes asymptotically, so we provide a non-naive estimator, constructed like the naive one but with a first-order bias reduction applied to the local MLE. We show the associated central limit theorem. Monte Carlo simulations show the importance of the bias correction and that the method performs well in finite samples, whereas the empirical study discusses the implementation in practice and documents the stochastic behavior of the parameters.
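
The naive block-averaging step described above can be sketched generically. The example is an illustration under stated assumptions: it takes any per-block estimator as a callable, uses equal-length blocks, and demonstrates the idea with a local variance estimate on toy increments instead of a Hawkes MLE. It does not include the first-order bias reduction that the abstract says is needed, and all names are hypothetical.

```python
from statistics import fmean
from typing import Callable, Sequence


def local_parametric_estimate(
    observations: Sequence[float],
    n_blocks: int,
    estimator: Callable[[Sequence[float]], float],
) -> float:
    """Average of per-block estimates over equal-length blocks.

    Trailing observations that do not fill a whole block are dropped.
    """
    n = len(observations)
    if n_blocks < 1 or n_blocks > n:
        raise ValueError("n_blocks must be between 1 and len(observations)")
    size = n // n_blocks
    local = [
        estimator(observations[k * size:(k + 1) * size])
        for k in range(n_blocks)
    ]
    return fmean(local)


def mean_square(block: Sequence[float]) -> float:
    """Local variance estimate for zero-mean increments."""
    return fmean(x * x for x in block)


# Toy increments whose scale changes halfway through (synthetic data).
increments = [0.1, -0.1] * 5 + [0.3, -0.3] * 5
print(local_parametric_estimate(increments, 2, mean_square))
```

Because the blocks have equal length, the plain average of the local estimates plays the role of the time average of the parameter path.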

Posted Content
TL;DR: In this article, a general time-varying parameter model is given, where the multidimensional parameter possibly includes jumps and the quantity of interest is defined as the integrated value over time of the parameter process.
Abstract: In this paper, we give a general time-varying parameter model, where the multidimensional parameter possibly includes jumps. The quantity of interest is defined as the integrated value over time of the parameter process $\Theta = T^{-1} \int_0^T \theta_t^* dt$. We provide a local parametric estimator (LPE) of $\Theta$ and conditions under which we can show the central limit theorem. Roughly speaking those conditions correspond to some uniform limit theory in the parametric version of the problem. The framework is restricted to the specific convergence rate $n^{1/2}$. Several examples of LPE are studied: estimation of volatility, powers of volatility, volatility when incorporating trading information and time-varying MA(1).

Journal Article
TL;DR: In this article, the immediate price impacts of market orders are estimated by two competitive models, the power-law model (PL model) and the logarithmic model (LG model).
Abstract: Based on the order flow data of a stock and its warrant, the immediate price impacts of market orders are estimated by two competitive models, the power-law model (PL model) and the logarithmic model (LG model). We find that the PL model is overwhelmingly superior to the LG model, regarding the robustness of the estimated parameters and the accuracy of out-of-sample forecasting. We also find that the price impacts of ask and bid orders are consistent with each other for filled trades, since significant positive correlations are observed between the model parameters of both types of orders. Our findings may provide valuable insights for optimal trade execution.
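
The two competing impact models can be fitted with ordinary least squares once their functional forms are fixed. The sketch below assumes the forms $r = A\,\omega^{\alpha}$ for the power-law model and $r = a + b\ln\omega$ for the logarithmic model, where $r$ is the immediate price impact and $\omega$ the order size. These forms, the function names and the synthetic data are assumptions for illustration, since the abstract does not give the exact specifications.

```python
import math


def ols(x: list[float], y: list[float]) -> tuple[float, float]:
    """Return (intercept, slope) of the least-squares line y = a + b * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


def fit_power_law(sizes: list[float], impacts: list[float]) -> tuple[float, float]:
    """Fit r = A * w**alpha by regressing ln r on ln w; returns (A, alpha)."""
    intercept, alpha = ols([math.log(w) for w in sizes],
                           [math.log(r) for r in impacts])
    return math.exp(intercept), alpha


def fit_log_model(sizes: list[float], impacts: list[float]) -> tuple[float, float]:
    """Fit r = a + b * ln w; returns (a, b)."""
    return ols([math.log(w) for w in sizes], impacts)


# Synthetic, noise-free data generated from a power law (not real order flow).
sizes = [1.0, 2.0, 4.0, 8.0, 16.0]
impacts = [2.0 * w ** 0.5 for w in sizes]
print(fit_power_law(sizes, impacts))
print(fit_log_model(sizes, impacts))
```

Comparing the two fits on held-out data would mirror the out-of-sample comparison mentioned in the abstract.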

Posted Content
TL;DR: In this paper, Multiple Wavelet Coherence is used to determine the time intervals and scales of dynamic correlation among metal prices, and a VARMA (Vector Autoregressive Moving Average) model is then used for forecasting, which results in reduced errors.
Abstract: The assessment of co-movement among metals is crucial to better understand the behavior of metal prices and the interactions that affect changes in prices. In this study, both Wavelet Analysis and VARMA (Vector Autoregressive Moving Average) models are utilized. First, Multiple Wavelet Coherence (MWC), which relies on Wavelet Analysis, is utilized to determine the time intervals and scales of dynamic correlation. VARMA is then used for forecasting, which results in reduced errors. The daily prices of steel, aluminium, copper and zinc between 10.05.2010 and 29.05.2014 are analyzed via wavelet analysis to highlight the interactions. Results uncover interesting dynamics between the mentioned metals in the time-frequency space. VARMA(1,1) model forecasting is carried out on the daily prices between 14.11.2011 and 16.11.2012, where the interactions are quite high, and prediction errors are found to be quite limited with respect to ARMA(1,1). It is shown that detecting dynamic co-movement via four-variable wavelet coherency analysis when determining the VARMA time interval improves on the forecasting power of ARMA by decreasing forecasting errors.

Posted Content
TL;DR: In this paper, a general time-varying parameter model is proposed, where the multidimensional parameter follows a continuous local martingale and the quantity of interest is defined as the integrated value over time of the parameter process.
Abstract: In this paper, we give a general time-varying parameter model, where the multidimensional parameter follows a continuous local martingale. As such, we call it the locally parametric model (LPM). The quantity of interest is defined as the integrated value over time of the parameter process $\Theta := T^{-1} \int_0^T \theta_t^* dt$. We provide a local parametric estimator of $\Theta$ based on the original (non-time-varying) parametric model estimator and conditions under which we can show consistency and the corresponding limit distribution. We show that the LPM class contains some models that come from popular problems in the high-frequency financial econometrics literature (estimating volatility, high-frequency covariance, integrated betas, leverage effect, volatility of volatility), as well as a new general asset-price diffusion model which allows for endogenous observations and time-varying noise which can be auto-correlated and correlated with the efficient price and the sampling times. Finally, as an example of how to apply the limit theory provided in this paper, we build a time-varying friction parameter extension of the (semiparametric) model with uncertainty zones (Robert and Rosenbaum (2012)), which is noisy and endogenous, and we show that we can verify the conditions for the estimation of integrated volatility.

Journal Article
TL;DR: In this paper, the authors investigated the presence of long memory in corporate bond and stock indices of six European Union countries from July 1998 to February 2015 using the DFA method and using a sliding window in order to measure long range dependence.
Abstract: This paper investigates the presence of long memory in corporate bond and stock indices of six European Union countries from July 1998 to February 2015. We compute the Hurst exponent by means of the DFA method and using a sliding window in order to measure long range dependence. We detect that Hurst exponents behave differently in the stock and bond markets, being smoother in the stock indices than in the bond indices. We verify that the level of informational efficiency is time-varying. Moreover we find an asymmetric impact of the 2008 financial crisis in the fixed income and the stock markets, affecting the former but not the latter. Similar results are obtained using the R/S method.
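
A sliding-window Hurst exponent via DFA, as described above, can be sketched in plain Python. This is a minimal first-order DFA on non-overlapping segments. The scales, window length and step are arbitrary illustrative choices, the input is simulated noise, and the paper's actual settings are not given in the abstract. The function assumes the series is not constant within a window.

```python
import math
import random


def _linfit(x, y):
    """Least-squares line through (x, y); returns (intercept, slope)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


def dfa_exponent(series, scales=(8, 16, 32, 64, 128)):
    """Scaling exponent of the DFA-1 fluctuation function F(s) ~ s**H."""
    n = len(series)
    mean = sum(series) / n
    profile, total = [], 0.0
    for value in series:  # cumulative sum of deviations from the mean
        total += value - mean
        profile.append(total)
    log_s, log_f = [], []
    for s in scales:
        n_seg = n // s
        if n_seg < 2:
            continue
        t = list(range(s))
        sq = 0.0
        for k in range(n_seg):  # remove a linear trend in each segment
            seg = profile[k * s:(k + 1) * s]
            a, b = _linfit(t, seg)
            sq += sum((seg[i] - (a + b * t[i])) ** 2 for i in range(s))
        log_s.append(math.log(s))
        log_f.append(0.5 * math.log(sq / (n_seg * s)))
    if len(log_s) < 2:
        raise ValueError("series too short for the chosen scales")
    return _linfit(log_s, log_f)[1]


def rolling_dfa(series, window, step, scales=(8, 16, 32, 64, 128)):
    """DFA exponent on each sliding window of the series."""
    return [dfa_exponent(series[i:i + window], scales)
            for i in range(0, len(series) - window + 1, step)]


# Simulated uncorrelated returns (synthetic data, not index data).
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(4096)]
print(rolling_dfa(noise, window=1024, step=512))
```

For uncorrelated returns the exponent should sit near 0.5, so persistent departures from that level across windows are what a time-varying efficiency analysis would look for.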

Posted Content
TL;DR: This article applied Markov-switching $R$-vine models to investigate the existence of different global dependence regimes, identifying times of "normal" and "abnormal" states within a data set consisting of North American, European and Asian indices.
Abstract: For nearly every major stock market there exist equity and implied volatility indices. These play important roles within finance, be it as a benchmark, a measure of general uncertainty or a way of investing or hedging. It is well known in the academic literature that correlations and higher moments between different indices tend to vary in time. However, to the best of our knowledge, no one has yet considered a global setup including both equity and implied volatility indices of various continents, and allowing for a changing dependence structure. We aim to close this gap by applying Markov-switching $R$-vine models to investigate the existence of different global dependence regimes. In particular, we identify times of "normal" and "abnormal" states within a data set consisting of North American, European and Asian indices. Our results confirm the existence of joint points in time at which global regime switching takes place.

Journal Article
TL;DR: In this article, a method to characterize the joint multifractal nature of long-range cross correlations based on wavelet analysis, termed Multifractal Cross Wavelet Analysis (MFXWT), was proposed.
Abstract: Complex systems are composed of mutually interacting components and the output values of these components are usually long-range cross-correlated. We propose a method to characterize the joint multifractal nature of such long-range cross correlations based on wavelet analysis, termed multifractal cross wavelet analysis (MFXWT). We assess the performance of the MFXWT method by performing extensive numerical experiments on the dual binomial measures with multifractal cross correlations and the bivariate fractional Brownian motions (bFBMs) with monofractal cross correlations. For binomial multifractal measures, the empirical joint multifractality of MFXWT is found to be in approximate agreement with the theoretical formula. For bFBMs, MFXWT may provide spurious multifractality because of the wide spanning range of the multifractal spectrum. We also apply the MFXWT method to stock market indexes and uncover intriguing joint multifractal nature in pairs of index returns and volatilities.

Journal Article
TL;DR: Two sparse Kalman filtering approaches to estimating the covariance matrix of asset returns from high frequency data are proposed, and each provides improved covariance estimation relative to the KEM method in a variety of settings where jumps occur.
Abstract: Estimation of the covariance matrix of asset returns from high frequency data is complicated by asynchronous returns, market microstructure noise and jumps. One technique for addressing both asynchronous returns and market microstructure is the Kalman-EM (KEM) algorithm. However, the KEM approach assumes log-normal prices and does not address jumps in the return process, which can corrupt estimation of the covariance matrix. In this paper we extend the KEM algorithm to price models that include jumps. We propose two sparse Kalman filtering approaches to this problem. In the first approach we develop a Kalman Expectation Conditional Maximization (KECM) algorithm to determine the unknown covariance as well as to detect the jumps. For this algorithm we consider the Laplace and the spike and slab jump models, both of which promote sparse estimates of the jumps. In the second method we take a Bayesian approach and use Gibbs sampling to sample from the posterior distribution of the covariance matrix under the spike and slab jump model. Numerical results using simulated data show that each of these approaches provides improved covariance estimation relative to the KEM method in a variety of settings where jumps occur.

Posted Content
TL;DR: This work describes a general pairs-trading algorithm which allows the user to define a rather arbitrary spread function which is used in a feedback context to modify the investment levels dynamically over time and proves that this algorithm results in positive expected growth in account value.
Abstract: Pairs trading is a market-neutral strategy that exploits historical correlation between stocks to achieve statistical arbitrage. Existing pairs-trading algorithms in the literature require rather restrictive assumptions on the underlying stochastic stock-price processes and the so-called spread function. In contrast to the existing literature, we consider an algorithm for pairs trading which requires less restrictive assumptions than heretofore considered. Since our point of view is control-theoretic in nature, the analysis and results are straightforward to follow for a non-expert in finance. To this end, we describe a general pairs-trading algorithm which allows the user to define a rather arbitrary spread function which is used in a feedback context to modify the investment levels dynamically over time. When this function, in combination with the price process, satisfies a certain mean-reversion condition, we deem the stocks to be a tradeable pair. For such a case, we prove that our control-inspired trading algorithm results in positive expected growth in account value. Finally, we describe tests of our algorithm on historical trading data by fitting stock price pairs to a popular spread function used in the literature. Simulation results from these tests demonstrate robust growth while avoiding huge drawdowns.

Posted Content
TL;DR: This article proposes a testing framework for predictive regressions that is resistant to small violations of ideal assumptions, is consistent with nearly integrated regressors, and is applicable to multi-predictor settings when the data may only approximately follow a predictive regression model.
Abstract: Testing procedures for predictive regressions with lagged autoregressive variables imply suboptimal inference in the presence of small violations of ideal assumptions. We propose a novel testing framework resistant to such violations, which is consistent with nearly integrated regressors and applicable to multi-predictor settings, when the data may only approximately follow a predictive regression model. The Monte Carlo evidence demonstrates large improvements from our approach, while the empirical analysis produces strong, robust evidence of market return predictability hidden by anomalous observations, both in- and out-of-sample, using predictive variables such as the dividend yield or the volatility risk premium.

Posted Content
TL;DR: In this article, the authors test whether the futures prices of some commodity and energy markets are determined by stochastic rules or exhibit nonlinear deterministic endogenous fluctuations using the maximal Lyapunov exponents (MLE) and a determinism test, both based on the reconstruction of the phase space.
Abstract: We test whether the futures prices of some commodity and energy markets are determined by stochastic rules or exhibit nonlinear deterministic endogenous fluctuations. As for the methodologies, we use the maximal Lyapunov exponents (MLE) and a determinism test, both based on the reconstruction of the phase space. In particular, employing a recent methodology, we estimate a coefficient $\kappa$ that describes the determinism rate of the analyzed time series. We find that the underlying system for futures prices shows a reliability level $\kappa$ near $1$, while the MLE is positive for all commodity futures series. Thus, the empirical evidence suggests that commodity and energy futures prices are the measured footprint of a nonlinear deterministic, rather than a stochastic, system.
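
Both tools named in this abstract start from a reconstruction of the phase space. A minimal sketch of the usual first step, time-delay embedding, is given here for orientation only. The function name `delay_embed`, the parameters `dim` and `tau`, and the logistic-map toy series are assumptions for illustration; none of them come from the paper, and the sketch estimates neither the MLE nor $\kappa$.

```python
def delay_embed(series, dim, tau):
    """Return delay vectors [x[t], x[t+tau], ..., x[t+(dim-1)*tau]].

    Hypothetical helper: dim is the embedding dimension and tau the delay,
    both of which would have to be chosen for real data.
    """
    if dim < 1 or tau < 1:
        raise ValueError("dim and tau must be positive integers")
    n_vectors = len(series) - (dim - 1) * tau
    if n_vectors <= 0:
        raise ValueError("series too short for this dim and tau")
    return [
        [series[t + k * tau] for k in range(dim)]
        for t in range(n_vectors)
    ]


# Toy input: a short logistic-map trajectory (not data from the paper).
x = [0.4]
for _ in range(9):
    x.append(4.0 * x[-1] * (1.0 - x[-1]))

# Each row is one reconstructed phase-space point.
vectors = delay_embed(x, dim=3, tau=2)
```

Estimators of the largest Lyapunov exponent typically track how quickly nearby rows of `vectors` separate over time; that step is omitted here.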

Journal ArticleDOI
TL;DR: In this paper, a parametric model for the simulation of limit order books is proposed, where limit orders, market orders and cancellations are submitted according to point processes with state-dependent intensities.
Abstract: We propose a parametric model for the simulation of limit order books. We assume that limit orders, market orders and cancellations are submitted according to point processes with state-dependent intensities. We propose new functional forms for these intensities, as well as new models for the placement of limit orders and cancellations. For cancellations, we introduce the concept of "priority index" to describe the selection of orders to be cancelled in the order book. Parameters of the model are estimated using likelihood maximization. We illustrate the performance of the model by providing extensive simulation results, with a comparison to empirical data and a standard Poisson reference.
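
As a rough illustration of order flow driven by state-dependent intensities, the sketch below samples the next event type and waiting time from competing exponential clocks. It assumes intensities stay constant between events, and the functional forms in `intensities` are invented for illustration; they are not the forms proposed in the paper, and the "priority index" for cancellations is not modelled.

```python
import random


def intensities(state):
    """Hypothetical intensities (events per unit time) given the book state."""
    return {
        "limit": 1.0,                                      # constant arrival rate
        "market": 0.5 if state["n_orders"] > 0 else 0.0,   # needs resting orders
        "cancel": 0.1 * state["n_orders"],                 # proportional to depth
    }


def next_event(state, rng):
    """Sample (waiting time, event type) for piecewise-constant intensities."""
    rates = intensities(state)
    total = sum(rates.values())
    dt = rng.expovariate(total)  # time to the first of the competing events
    kinds = list(rates)
    kind = rng.choices(kinds, weights=[rates[k] for k in kinds])[0]
    return dt, kind


def simulate(n_events, seed=0):
    """Run a toy book that only tracks the number of resting orders."""
    rng = random.Random(seed)
    state = {"n_orders": 0}
    t = 0.0
    log = []
    for _ in range(n_events):
        dt, kind = next_event(state, rng)
        t += dt
        if kind == "limit":
            state["n_orders"] += 1
        else:  # a market order or a cancellation removes one resting order
            state["n_orders"] -= 1
        log.append((t, kind, state["n_orders"]))
    return log


log = simulate(200, seed=1)
```

Replacing `intensities` with constant rates gives a plain Poisson order flow, which is the kind of reference model the abstract compares against.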

Posted Content
TL;DR: In this paper, the authors used ANNs to forecast the value of the Indian rupee vis-à-vis the US Dollar, considering political instability and the lack of a mechanism for enforcement of contracts, which can affect both direct foreign investment and portfolio investment.
Abstract: Any discussion on exchange rate movements and forecasting should include explanatory variables from both the current account and the capital account of the balance of payments. In this paper, we include such factors to forecast the value of the Indian rupee vis-à-vis the US Dollar. Further, factors reflecting political instability and the lack of a mechanism for enforcement of contracts, which can affect both direct foreign investment and portfolio investment, have been incorporated. The explanatory variables chosen are the 3-month Rupee-Dollar futures exchange rate (FX4), NIFTY returns (NIFTYR), Dow Jones Industrial Average returns (DJIAR), Hang Seng returns (HSR), DAX returns (DR), crude oil price (COP), CBOE VIX (CV) and India VIX (IV). To forecast the exchange rate, we have used two different classes of frameworks, namely Artificial Neural Network (ANN) based models and Time Series Econometric models. The Multilayer Feed Forward Neural Network (MLFFNN) and the Nonlinear Autoregressive model with Exogenous Input (NARX) Neural Network are the approaches that we have used as ANN models. Generalized Autoregressive Conditional Heteroskedastic (GARCH) and Exponential Generalized Autoregressive Conditional Heteroskedastic (EGARCH) techniques are the ones that we have used as Time Series Econometric methods. Within our framework, our results indicate that, although the two different approaches are quite efficient in forecasting the exchange rate, MLFFNN and NARX are the most efficient.
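
For readers unfamiliar with the econometric benchmark, the standard GARCH(1,1) conditional-variance recursion, $\sigma_t^2 = \omega + \alpha \varepsilon_{t-1}^2 + \beta \sigma_{t-1}^2$, can be sketched as below. The abstract does not state the model order or any parameter values, so the (1,1) order, the function name `garch11_variance` and the numbers in the example are all assumptions for illustration.

```python
def garch11_variance(residuals, omega, alpha, beta, initial_variance):
    """Conditional variances from the textbook GARCH(1,1) recursion.

    variances[0] is the supplied starting value; variances[t + 1] is the
    variance implied after observing residuals[t].
    """
    variances = [initial_variance]
    for eps in residuals:
        variances.append(omega + alpha * eps ** 2 + beta * variances[-1])
    return variances


# Illustrative parameters only; they are not estimates from the paper.
path = garch11_variance([1.0, 0.0], omega=0.1, alpha=0.2, beta=0.7,
                        initial_variance=1.0)
```

A large residual raises the next variance through `alpha`, and a quiet period lets it decay at rate `beta`. In practice the parameters would be fitted by maximum likelihood, which is not shown.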