scispace - formally typeset
Open AccessJournal ArticleDOI

Twitter data analysis: temporal and term frequency analysis with real-time event

Garima Yadav, +2 more
- Vol. 263, Iss: 4, pp 042081
Reads0
Chats0
TLDR
This document is performing time-series analysis and term frequency analysis using different techniques such as filtering, information extraction for text-mining that fulfils the objective of finding interesting moments for temporal data in the event and finding the ranking among the players or the teams based on popularity.
Abstract
From the past few years, World Wide Web (www) has become a prominent and huge source for user generated content and opinionative data. Among various social media, Twitter gained popularity as it offers a fast and effective way of sharing users' perspective towards various critical and other issues in different domain. As the data is hugely generated on cloud, it has opened doors for the researchers in the field of data science and analysis. There are various domains such as 'Political' domain, 'Entertainment' domain and 'Business' domain. Also there are various APIs that Twitter provides for developers 1) Search API, focus on the old tweets 2) Rest API, focuses on user details and allow to collect the user profile, friends and followers 3) Streaming API, which collects details like tweets, hashtags, geo locations. In our work we are accessing Streaming API in order to fetch real-time tweets for the dynamic happening event. For this we are focusing on 'Entertainment' domain especially 'Sports' as IPL-T20 is currently the trending on-going event. We are collecting these numerous amounts of tweets and storing them in MongoDB database where the tweets are stored in JSON document format. On this document we are performing time-series analysis and term frequency analysis using different techniques such as filtering, information extraction for text-mining that fulfils our objective of finding interesting moments for temporal data in the event and finding the ranking among the players or the teams based on popularity which helps people in understanding key influencers on the social media platform.

read more

Citations
More filters

Detecting real-time events using tweets

TL;DR: In this paper, a scheme is proposed which can detect what happens in real world in real time only by analyzing tweets as Big Data and let a user know the event by quantifying importance of words accurately and evaluating the quantified values dynamically.
Book ChapterDOI

A Study on Spatiotemporal Topical Analysis of Twitter Data

TL;DR: This study categorizes a large number of recent studies and articles in relevant area to get a summarized view of the state of the art in this field and provides a quick baseline for further research.
Journal ArticleDOI

Thematic context vector association based on event uncertainty for Twitter

TL;DR: In this paper , contextual keywords are extracted using thematic events with the help of data association and the thematic context for events is identified using the uncertainty principle in the proposed system.
References
More filters
Proceedings ArticleDOI

A text mining application of emotion classifications of Twitter's users using Naïve Bayes method

TL;DR: This research developed a text mining application to detect emotions of Twitter users that are classified into six emotions, namely happiness, sadness, anger, disgust, fear, and surprise, which is able to achieve 83% accuracy for 105 tweets.
Proceedings ArticleDOI

Using Social Sensing to Discover Trends in Public Emotion

TL;DR: The emotion analysis methodology is shown to present a fast and robust way of analyzing online stream of tweets and an online approach to predict emotion-intensive moments during real-life events.
Proceedings ArticleDOI

Extraction of Unexpected Rules from Twitter Hashtags and its Application to Sport Events

TL;DR: This paper analyzes tweets relating to the English FA Cup finals 2012 by applying a novel method named TRCM to extract association rules present in hash tag keywords of tweets in different time-slots, and maps the identified hash tags to event highlights of the game as reported in the ground truth of the main stream media.
Proceedings ArticleDOI

Finding Non-Redundant Multi-Word Events on Twitter

TL;DR: The model is used to analyze 24 million geotagged tweets that have been sent in the US from April 9 to April 22, 2013 - the time period of the Boston marathon bombing - and it is shown that the approach can create multi-word events that efficiently summarize real-world events.
Proceedings ArticleDOI

How Has Twitter Changed the Event Discussion Scenario? A Spatio-temporal Diffusion Analysis

TL;DR: The aim is to study diffusion dynamics of specific real world events, discussed on Twitter, with respect to location and time, and study the diffusion of the events using the user interaction graph formed by retweet/mention links.
Related Papers (5)