Proceedings ArticleDOI

TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams

TLDR
Crowdsourcing is used to evaluate TrioVecEvent, a method that leverages multimodal embeddings for accurate online local event detection and introduces discriminative features that characterize local events well.
Abstract
Detecting local events (e.g., protest, disaster) at their onsets is an important task for a wide spectrum of applications, ranging from disaster control to crime monitoring and place recommendation. Recent years have witnessed growing interest in leveraging geo-tagged tweet streams for online local event detection. Nevertheless, the accuracies of existing methods still remain unsatisfactory for building reliable local event detection systems. We propose TrioVecEvent, a method that leverages multimodal embeddings to achieve accurate online local event detection. The effectiveness of TrioVecEvent is underpinned by its two-step detection scheme. First, it ensures a high coverage of the underlying local events by dividing the tweets in the query window into coherent geo-topic clusters. To generate quality geo-topic clusters, we capture short-text semantics by learning multimodal embeddings of the location, time, and text, and then perform online clustering with a novel Bayesian mixture model. Second, TrioVecEvent considers the geo-topic clusters as candidate events and extracts a set of features for classifying the candidates. Leveraging the multimodal embeddings as background knowledge, we introduce discriminative features that can well characterize local events, which enables pinpointing true local events from the candidate pool with a small amount of training data. We have used crowdsourcing to evaluate TrioVecEvent, and found that it improves the performance of the state-of-the-art method by a large margin.
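The two-step scheme from the abstract can be sketched in code. This is a minimal illustration, not the paper's implementation: plain k-means over the joint location/text-embedding space stands in for the paper's novel Bayesian mixture model, the two cluster features (spatial spread, semantic coherence) are illustrative stand-ins for the paper's feature set, and hand-set thresholds replace the classifier trained on a small amount of labeled data. All function and parameter names are hypothetical.

```python
import numpy as np

def geo_topic_clusters(locs, txt_emb, k, iters=20, seed=0):
    # Step 1: partition tweets in the query window into geo-topic
    # clusters over the joint [location, text-embedding] space.
    # Plain k-means here; the paper uses a Bayesian mixture model
    # with online updates.
    X = np.hstack([locs, txt_emb])
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if (labels == j).any():
                centers[j] = X[labels == j].mean(axis=0)
    return labels

def candidate_features(locs, txt_emb, labels, k):
    # Step 2a: extract illustrative per-cluster features --
    # spatial spread (mean per-axis std of locations) and semantic
    # coherence (mean pairwise cosine similarity of embeddings).
    feats = []
    for j in range(k):
        idx = labels == j
        if not idx.any():
            feats.append((float("inf"), 0.0))
            continue
        spread = float(locs[idx].std(axis=0).mean())
        E = txt_emb[idx]
        E = E / (np.linalg.norm(E, axis=1, keepdims=True) + 1e-12)
        coherence = float((E @ E.T).mean())
        feats.append((spread, coherence))
    return feats

def classify_candidates(feats, max_spread=0.5, min_coherence=0.8):
    # Step 2b: keep clusters that are spatially tight and
    # semantically coherent. Hand-set thresholds stand in for the
    # classifier the paper trains with little supervision.
    return [bool(s < max_spread and c > min_coherence) for s, c in feats]
```

The intuition matches the abstract: step 1 over-generates coherent candidates to ensure high coverage of the underlying events, and step 2 uses features grounded in the embeddings to pinpoint true local events from the candidate pool.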


Citations
Proceedings ArticleDOI

Weakly-Supervised Neural Text Classification

TL;DR: In this article, a pseudo-document generator is used to generate pseudo-labeled documents for model pre-training, and a self-training module is used for model refinement.
Proceedings ArticleDOI

Easing Embedding Learning by Comprehensive Transcription of Heterogeneous Information Networks

TL;DR: The HEER algorithm proposes a comprehensive transcription of heterogeneous information networks (HINs), embedding HINs via edge representations that are further coupled with properly-learned heterogeneous metrics.
Proceedings ArticleDOI

MiST: A Multiview and Multimodal Spatial-Temporal Learning Framework for Citywide Abnormal Event Forecasting

TL;DR: A Multi-View and Multi-Modal Spatial-Temporal learning (MiST) framework that forecasts citywide abnormal events by promoting collaboration among different views (spatial, temporal, and semantic) and mapping the multi-modal units into the same latent space.