Correlating financial time series with micro-blogging activity
Eduardo J. Ruiz,Vagelis Hristidis,Carlos Castillo,Aristides Gionis,Alejandro Jaimes +4 more
- pp 513-522
Reads0
Chats0
TLDR
The problem of correlating micro-blogging activity with stock-market events, defined as changes in the price and traded volume of stocks, is studied and it is shown that even relatively small correlations between price and micro- bloggers features can be exploited to drive a stock trading strategy that outperforms other baseline strategies.Abstract:
We study the problem of correlating micro-blogging activity with stock-market events, defined as changes in the price and traded volume of stocks. Specifically, we collect messages related to a number of companies, and we search for correlations between stock-market events for those companies and features extracted from the micro-blogging messages. The features we extract can be categorized in two groups. Features in the first group measure the overall activity in the micro-blogging platform, such as number of posts, number of re-posts, and so on. Features in the second group measure properties of an induced interaction graph, for instance, the number of connected components, statistics on the degree distribution, and other graph-based properties.We present detailed experimental results measuring the correlation of the stock market events with these features, using Twitter as a data source. Our results show that the most correlated features are the number of connected components and the number of nodes of the interaction graph. The correlation is stronger with the traded volume than with the price of the stock. However, by using a simulator we show that even relatively small correlations between price and micro-blogging features can be exploited to drive a stock trading strategy that outperforms other baseline strategies.read more
Citations
More filters
Proceedings ArticleDOI
k-Shape: Efficient and Accurate Clustering of Time Series
John Paparrizos,Luis Gravano +1 more
TL;DR: K-Shape as discussed by the authors uses a normalized version of the cross-correlation measure in order to consider the shapes of time series while comparing them, and develops a method to compute cluster centroids, which are used in every iteration to update the assignment of the time series to clusters.
Journal ArticleDOI
Computational Intelligence and Financial Markets
Rodolfo C. Cavalcante,Rodrigo C. Brasileiro,Victor L. F. Souza,Jarley P. Nobrega,Adriano L. I. Oliveira +4 more
TL;DR: An overview of the most important primary studies published from 2009 to 2015, which cover techniques for preprocessing and clustering of financial data, for forecasting future market movements, for mining financial text information, among others, are given.
Journal ArticleDOI
Natural language based financial forecasting: a survey
TL;DR: This review article clarifies the scope of NLFF research by ordering and structuring techniques and applications from related work, and aims to increase the understanding of progress and hotspots in NLFF, and bring about discussions across many different disciplines.
Proceedings ArticleDOI
Sentiment analysis of Twitter data for predicting stock market movements
TL;DR: A strong correlation exists between the rise and falls in stock prices with the public sentiments in tweets, and this work has applied sentiment analysis and supervised machine learning principles to the tweets extracted from Twitter and analyzed the correlation between stock market movements of a company and sentiments in tweet.
Proceedings Article
Exploiting Topic based Twitter Sentiment for Stock Prediction
TL;DR: This paper proposes a technique to leverage topic based sentiments from Twitter to help predict the stock market by utilizing a con- tinuous Dirichlet Process Mixture model to learn the daily topic set and regress the stock index and the Twitter sentiment time series to predict the market.
References
More filters
Journal ArticleDOI
The Structure and Function of Complex Networks
TL;DR: Developments in this field are reviewed, including such concepts as the small-world effect, degree distributions, clustering, network correlations, random graph models, models of network growth and preferential attachment, and dynamical processes taking place on networks.
Proceedings ArticleDOI
What is Twitter, a social network or a news media?
TL;DR: In this paper, the authors have crawled the entire Twittersphere and found a non-power-law follower distribution, a short effective diameter, and low reciprocity, which all mark a deviation from known characteristics of human social networks.
Journal ArticleDOI
Twitter mood predicts the stock market.
TL;DR: This work investigates whether measurements of collective mood states derived from large-scale Twitter feeds are correlated to the value of the Dow Jones Industrial Average (DJIA) over time and indicates that the accuracy of DJIA predictions can be significantly improved by the inclusion of specific public mood dimensions but not others.
Journal ArticleDOI
Detecting influenza epidemics using search engine query data
Jeremy Ginsberg,Matthew H. Mohebbi,Rajan Patel,Lynnette Brammer,Mark S. Smolinski,Lawrence B. Brilliant +5 more
TL;DR: A method of analysing large numbers of Google search queries to track influenza-like illness in a population and accurately estimate the current level of weekly influenza activity in each region of the United States with a reporting lag of about one day is presented.
Proceedings ArticleDOI
Why we twitter: understanding microblogging usage and communities
TL;DR: It is found that people use microblogging to talk about their daily activities and to seek or share information and the user intentions associated at a community level are analyzed to show how users with similar intentions connect with each other.
Related Papers (5)
Is All That Talk Just Noise? The Information Content of Internet Stock Message Boards
Werner Antweiler,Murray Z. Frank +1 more