scispace - formally typeset
Proceedings ArticleDOI

GeoFolk: latent spatial semantics in web 2.0 social media

Sergej Sizov
- pp 281-290
Reads0
Chats0
TLDR
Experimental results show that the model-based framework GeoFolk outperforms baseline techniques that are based on one of the aspects alone, and the approach described in this contribution can also be used in other domains such as Geoweb retrieval.
Abstract
We describe an approach for multi-modal characterization of social media by combining text features (e.g. tags as a prominent example of short, unstructured text labels) with spatial knowledge (e.g. geotags and coordinates of images and videos). Our model-based framework GeoFolk combines these two aspects in order to construct better algorithms for content management, retrieval, and sharing. The approach is based on multi-modal Bayesian models which allow us to integrate spatial semantics of social media in a well-formed, probabilistic manner. We systematically evaluate the solution on a subset of Flickr data, in characteristic scenarios of tag recommendation, content classification, and clustering. Experimental results show that our method outperforms baseline techniques that are based on one of the aspects alone. The approach described in this contribution can also be used in other domains such as Geoweb retrieval.

read more

Citations
More filters
Journal ArticleDOI

Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey

TL;DR: In this article, the authors investigated highly scholarly articles (between 2003 to 2016) related to topic modeling based on LDA to discover the research development, current trends and intellectual structure of topic modeling.
Posted Content

Latent Dirichlet Allocation (LDA) and Topic modeling: models, applications, a survey

TL;DR: In this article, the authors investigated the research development, current trends and intellectual structure of topic modeling based on Latent Dirichlet Allocation (LDA), and summarized challenges and introduced famous tools and datasets in topic modelling based on LDA.
Proceedings ArticleDOI

Learning geographical preferences for point-of-interest recommendation

TL;DR: A novel geographical probabilistic factor analysis framework which strategically takes various factors into consideration and allows to capture the geographical influences on a user's check-in behavior and shows that the proposed recommendation method outperforms state-of-the-art latent factor models with a significant margin.
Proceedings ArticleDOI

Discovering geographical topics in the twitter stream

TL;DR: An algorithm is presented by modeling diversity in tweets based on topical diversity, geographical diversity, and an interest distribution of the user by exploiting sparse factorial coding of the attributes, thus allowing it to deal with a large and diverse set of covariates efficiently.
Proceedings ArticleDOI

Towards social user profiling: unified and discriminative influence model for inferring home locations

TL;DR: A unified discriminative influence model, named as UDI, is proposed to solve the problem of profiling users' home locations in the context of social network (Twitter), and develops local and global location prediction methods.
References
More filters
Journal ArticleDOI

Latent dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Proceedings Article

Latent Dirichlet Allocation

TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hof-mann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
Journal ArticleDOI

Indexing by Latent Semantic Analysis

TL;DR: A new method for automatic indexing and retrieval to take advantage of implicit higher-order structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries.
Book

Introduction to Information Retrieval

TL;DR: In this article, the authors present an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections.
Journal ArticleDOI

Divergence measures based on the Shannon entropy

TL;DR: A novel class of information-theoretic divergence measures based on the Shannon entropy is introduced, which do not require the condition of absolute continuity to be satisfied by the probability distributions involved and are established in terms of bounds.
Related Papers (5)