Proceedings Article
GeoFolk: latent spatial semantics in web 2.0 social media
Sergej Sizov
- pp 281-290
TL;DR: Experimental results show that the model-based framework GeoFolk outperforms baseline techniques that are based on one of the aspects alone, and the approach described in this contribution can also be used in other domains such as Geoweb retrieval.
Abstract:
We describe an approach for multi-modal characterization of social media by combining text features (e.g., tags as a prominent example of short, unstructured text labels) with spatial knowledge (e.g., geotags and coordinates of images and videos). Our model-based framework GeoFolk combines these two aspects in order to construct better algorithms for content management, retrieval, and sharing. The approach is based on multi-modal Bayesian models which allow us to integrate spatial semantics of social media in a well-formed, probabilistic manner. We systematically evaluate the solution on a subset of Flickr data, in characteristic scenarios of tag recommendation, content classification, and clustering. Experimental results show that our method outperforms baseline techniques that are based on one of the aspects alone. The approach described in this contribution can also be used in other domains such as Geoweb retrieval.
Citations
Journal Article
Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey
TL;DR: In this article, the authors investigated scholarly articles published between 2003 and 2016 on LDA-based topic modeling to discover the research development, current trends, and intellectual structure of topic modeling.
Posted Content
Latent Dirichlet Allocation (LDA) and Topic modeling: models, applications, a survey
TL;DR: In this article, the authors investigated the research development, current trends, and intellectual structure of topic modeling based on Latent Dirichlet Allocation (LDA), summarized challenges, and introduced well-known tools and datasets for LDA-based topic modeling.
Proceedings Article
Learning geographical preferences for point-of-interest recommendation
TL;DR: A novel geographical probabilistic factor analysis framework is proposed that takes various factors into consideration and captures the geographical influences on a user's check-in behavior; experiments show that the proposed recommendation method outperforms state-of-the-art latent factor models by a significant margin.
Proceedings Article
Discovering geographical topics in the twitter stream
TL;DR: An algorithm is presented that models diversity in tweets based on topical diversity, geographical diversity, and the user's interest distribution, exploiting sparse factorial coding of the attributes to deal with a large and diverse set of covariates efficiently.
Proceedings Article
Towards social user profiling: unified and discriminative influence model for inferring home locations
TL;DR: A unified discriminative influence model, named UDI, is proposed to profile users' home locations in the context of a social network (Twitter), and local and global location prediction methods are developed.
References
Journal Article
Latent Dirichlet allocation
TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Proceedings Article
Latent Dirichlet Allocation
TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
Journal Article
Indexing by Latent Semantic Analysis
TL;DR: A new method for automatic indexing and retrieval is described that takes advantage of implicit higher-order structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries.
Book
Introduction to Information Retrieval
TL;DR: In this article, the authors present an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections.
Journal Article
Divergence measures based on the Shannon entropy
TL;DR: A novel class of information-theoretic divergence measures based on the Shannon entropy is introduced; these measures do not require the condition of absolute continuity to be satisfied by the probability distributions involved, and their relationship with the variational distance and the probability of misclassification error is established in terms of bounds.
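The absolute-continuity point in the last entry can be illustrated with a small numeric example. The following is a minimal sketch, assuming the measure in question is the Jensen-Shannon divergence for discrete distributions; the function names `kl_divergence` and `js_divergence` are illustrative and do not come from the cited work or from GeoFolk.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) in nats.

    Returns infinity when q assigns zero mass to an outcome that p
    considers possible, i.e. when p is not absolutely continuous
    with respect to q.
    """
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0.0:
            continue  # the term 0 * log(0 / q) is taken to be 0
        if qi == 0.0:
            return math.inf
        total += pi * math.log(pi / qi)
    return total


def js_divergence(p, q):
    """Jensen-Shannon divergence: average KL divergence to the midpoint m.

    The midpoint m is positive wherever p or q is positive, so both
    KL terms are always finite.
    """
    m = [(pi + qi) / 2.0 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


# Two distributions with only partially overlapping support.
p = [0.5, 0.5, 0.0]
q = [0.0, 0.5, 0.5]

print(kl_divergence(p, q))  # infinite: q is zero where p is positive
print(js_divergence(p, q))  # finite, and never larger than log(2)
```

In this sketch the Kullback-Leibler divergence is undefined (infinite) for the pair above, while the Jensen-Shannon value stays finite and is bounded by log 2 even for distributions with disjoint support.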