Incorporating Metadata into Content-Based User Embeddings.

doi:10.18653/V1/W17-4406

Open AccessProceedings ArticleDOI

Incorporating Metadata into Content-Based User Embeddings.

Linzi Xing, +1 more

- pp 45-49

Chats0

TLDR

This work proposes a data augmentation method that allows novel feature types to be used within off-the-shelf embedding models, and shows that this approach can lead to substantial performance gains with the simple addition of network and geographic features.

Abstract:

Low-dimensional vector representations of social media users can benefit applications like recommendation systems and user attribute inference Recent work has shown that user embeddings can be improved by combining different types of information, such as text and network data We propose a data augmentation method that allows novel feature types to be used within off-the-shelf embedding models Experimenting with the task of friend recommendation on a dataset of 5,019 Twitter users, we show that our approach can lead to substantial performance gains with the simple addition of network and geographic features

Citations

PDF

Open Access

More filters

Book

Actes de la conférence Traitement Automatique de la Langue Naturelle, TALN 2018

Anne-Laure Ligozat, +5 more

TL;DR: This article presents an information extraction method which collects additional information on the web so as to enrich already existing information and then fill in a knowledge base using lexical and syntactical patterns.

...read moreread less

Proceedings Article

RP-DNN : a Tweet level propagation context based deep neural networks for early rumor detection in social media

Jie Gao, +3 more

TL;DR: The authors proposed a novel hybrid neural network architecture, which combines a task-specific character-based bidirectional language model and stacked Long Short-Term Memory (LSTM) networks to represent textual contents and social-temporal contexts of input source tweets for modeling propagation patterns of rumors in the early stages of their development.

...read moreread less

Proceedings ArticleDOI

Party Matters: Enhancing Legislative Embeddings with Author Attributes for Vote Prediction

Anastassia Kornilova, +2 more

TL;DR: This article proposed a novel neural method for encoding documents alongside additional metadata, achieving an average of a 4% boost in accuracy over the previous state-of-the-art state of the art.

...read moreread less

Proceedings ArticleDOI

Detecting Trending Terms in Cybersecurity Forum Discussions.

John Hughes, +4 more

TL;DR: This work presents a lightweight method for identifying currently trending terms in relation to a known prior of terms, using a weighted log-odds ratio with an informative prior, and finds this method outperforms TF-IDF on information retrieval.

...read moreread less

Posted Content

RP-DNN: A Tweet level propagation context based deep neural networks for early rumor detection in Social Media

Jie Gao, +3 more

- 28 Feb 2020 -

arXiv: Social and Information Networks

TL;DR: A novel hybrid neural network architecture is presented, which combines a task-specific character-based bidirectional language model and stacked Long Short-Term Memory networks to represent textual contents and social-temporal contexts of input source tweets, for modelling propagation patterns of rumors in the early stages of their development.

...read moreread less

References

PDF

Open Access

More filters

Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.

...read moreread less

Proceedings Article

Distributed Representations of Sentences and Documents

Quoc V. Le, +1 more

TL;DR: Paragraph Vector is an unsupervised algorithm that learns fixed-length feature representations from variable-length pieces of texts, such as sentences, paragraphs, and documents, and its construction gives the algorithm the potential to overcome the weaknesses of bag-of-words models.

...read moreread less

Journal IssueDOI

The link-prediction problem for social networks

David Liben-Nowell, +1 more

- 01 May 2007 -

Journal of the Association for Informati...

TL;DR: Experiments on large coauthorship networks suggest that information about future interactions can be extracted from network topology alone, and that fairly subtle measures for detecting node proximity can outperform more direct measures.

...read moreread less

Software Framework for Topic Modelling with Large Corpora

Radim Řehůřek, +1 more

TL;DR: This work describes a Natural Language Processing software framework which is based on the idea of document streaming, i.e. processing corpora document after document, in a memory independent fashion, and implements several popular algorithms for topical inference, including Latent Semantic Analysis and Latent Dirichlet Allocation in a way that makes them completely independent of the training corpus size.

...read moreread less

Posted Content

Distributed Representations of Sentences and Documents

Quoc V. Le, +1 more

- 16 May 2014 -

arXiv: Computation and Language

TL;DR: The authors proposed paragraph vector, an unsupervised algorithm that learns fixed-length feature representations from variable-length pieces of texts, such as sentences, paragraphs, and documents, and achieved new state-of-the-art results on several text classification and sentiment analysis tasks.

...read moreread less

Incorporating Metadata into Content-Based User Embeddings.

Citations

Actes de la conférence Traitement Automatique de la Langue Naturelle, TALN 2018

RP-DNN : a Tweet level propagation context based deep neural networks for early rumor detection in social media

Party Matters: Enhancing Legislative Embeddings with Author Attributes for Vote Prediction

Detecting Trending Terms in Cybersecurity Forum Discussions.

RP-DNN: A Tweet level propagation context based deep neural networks for early rumor detection in Social Media

References

Distributed Representations of Words and Phrases and their Compositionality

Distributed Representations of Sentences and Documents

The link-prediction problem for social networks

Software Framework for Topic Modelling with Large Corpora

Distributed Representations of Sentences and Documents

Related Papers (5)

A Multi-View Deep Learning Approach for Cross Domain User Modeling in Recommendation Systems

One Embedding To Do Them All

Recommendation of Points-of-Interest Using Graph Embeddings

Distributed Representations of Words and Phrases and their Compositionality

Multi-network User Identification via Graph-Aware Embedding.