
Showing papers on "Semantic similarity published in 2015"


Proceedings ArticleDOI
28 Feb 2015
TL;DR: The authors introduced the Tree-LSTM, a generalization of LSTMs to tree-structured network topologies, which outperformed all existing systems and strong LSTM baselines on two tasks: predicting the semantic relatedness of two sentences (SemEval 2014, Task 1) and sentiment classification (Stanford Sentiment Treebank).
Abstract: A Long Short-Term Memory (LSTM) network is a type of recurrent neural network architecture which has recently obtained strong results on a variety of sequence modeling tasks. The only underlying LSTM structure that has been explored so far is a linear chain. However, natural language exhibits syntactic properties that would naturally combine words to phrases. We introduce the Tree-LSTM, a generalization of LSTMs to tree-structured network topologies. Tree-LSTMs outperform all existing systems and strong LSTM baselines on two tasks: predicting the semantic relatedness of two sentences (SemEval 2014, Task 1) and sentiment classification (Stanford Sentiment Treebank).
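To make the idea concrete, here is a minimal numpy sketch of a Child-Sum Tree-LSTM cell following the standard gate equations described in the paper; the dimensions, random weights and toy children are illustrative only, not the authors' trained model.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def child_sum_tree_lstm_cell(x, child_h, child_c, W, U, b):
    """One Child-Sum Tree-LSTM step.
    x: node input (d_in,); child_h, child_c: lists of child hidden/cell states (d,).
    W: input weights (d, d_in); U: recurrent weights (d, d); b: biases (d,), one set per gate."""
    h_sum = np.sum(child_h, axis=0) if child_h else np.zeros_like(b["i"])
    i = sigmoid(W["i"] @ x + U["i"] @ h_sum + b["i"])              # input gate
    o = sigmoid(W["o"] @ x + U["o"] @ h_sum + b["o"])              # output gate
    u = np.tanh(W["u"] @ x + U["u"] @ h_sum + b["u"])              # candidate update
    # one forget gate per child, conditioned on that child's hidden state
    f = [sigmoid(W["f"] @ x + U["f"] @ h_k + b["f"]) for h_k in child_h]
    c = i * u + sum(f_k * c_k for f_k, c_k in zip(f, child_c))
    h = o * np.tanh(c)
    return h, c

# toy usage: a node with two children
d_in, d = 4, 3
rng = np.random.default_rng(0)
W = {g: rng.normal(size=(d, d_in)) for g in "iofu"}
U = {g: rng.normal(size=(d, d)) for g in "iofu"}
b = {g: np.zeros(d) for g in "iofu"}
children = [(rng.normal(size=d), rng.normal(size=d)) for _ in range(2)]
h, c = child_sum_tree_lstm_cell(rng.normal(size=d_in),
                                [hc[0] for hc in children],
                                [hc[1] for hc in children], W, U, b)
```

The per-child forget gates are the key difference from a chain LSTM: each child's memory can be kept or discarded independently when composing a phrase node.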

2,702 citations


Proceedings Article
07 Dec 2015
TL;DR: This article used the continuity of text from books to train an encoder-decoder model that tries to reconstruct the surrounding sentences of an encoded passage, which can produce highly generic sentence representations that are robust and perform well in practice.
Abstract: We describe an approach for unsupervised learning of a generic, distributed sentence encoder. Using the continuity of text from books, we train an encoder-decoder model that tries to reconstruct the surrounding sentences of an encoded passage. Sentences that share semantic and syntactic properties are thus mapped to similar vector representations. We next introduce a simple vocabulary expansion method to encode words that were not seen as part of training, allowing us to expand our vocabulary to a million words. After training our model, we extract and evaluate our vectors with linear models on 8 tasks: semantic relatedness, paraphrase detection, image-sentence ranking, question-type classification and 4 benchmark sentiment and subjectivity datasets. The end result is an off-the-shelf encoder that can produce highly generic sentence representations that are robust and perform well in practice.
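The vocabulary expansion step mentioned in the abstract can be sketched as a linear map, fit by least squares, from a large pre-trained word space (e.g. word2vec) into the encoder's word-embedding space using the words the two vocabularies share; the dimensions and random vectors below are placeholders, not the authors' data.

```python
import numpy as np

rng = np.random.default_rng(1)
d_w2v, d_rnn, n_shared = 300, 620, 5000

X_shared = rng.normal(size=(n_shared, d_w2v))   # word2vec vectors of shared words
Y_shared = rng.normal(size=(n_shared, d_rnn))   # encoder embeddings of the same words

# least-squares solution of X_shared @ W ≈ Y_shared
W, *_ = np.linalg.lstsq(X_shared, Y_shared, rcond=None)

def expand(word2vec_vector):
    """Map an out-of-vocabulary word into the encoder's embedding space."""
    return word2vec_vector @ W

oov_embedding = expand(rng.normal(size=d_w2v))
print(oov_embedding.shape)  # (620,)
```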

1,802 citations


Posted Content
TL;DR: The approach for unsupervised learning of a generic, distributed sentence encoder is described, using the continuity of text from books to train an encoder-decoder model that tries to reconstruct the surrounding sentences of an encoded passage.
Abstract: We describe an approach for unsupervised learning of a generic, distributed sentence encoder. Using the continuity of text from books, we train an encoder-decoder model that tries to reconstruct the surrounding sentences of an encoded passage. Sentences that share semantic and syntactic properties are thus mapped to similar vector representations. We next introduce a simple vocabulary expansion method to encode words that were not seen as part of training, allowing us to expand our vocabulary to a million words. After training our model, we extract and evaluate our vectors with linear models on 8 tasks: semantic relatedness, paraphrase detection, image-sentence ranking, question-type classification and 4 benchmark sentiment and subjectivity datasets. The end result is an off-the-shelf encoder that can produce highly generic sentence representations that are robust and perform well in practice. We will make our encoder publicly available.

1,115 citations


Posted Content
TL;DR: The Tree-LSTM is introduced, a generalization of LSTMs to tree-structured network topologies that outperform all existing systems and strong LSTM baselines on two tasks: predicting the semantic relatedness of two sentences and sentiment classification.
Abstract: Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have obtained strong results on a variety of sequence modeling tasks. The only underlying LSTM structure that has been explored so far is a linear chain. However, natural language exhibits syntactic properties that would naturally combine words to phrases. We introduce the Tree-LSTM, a generalization of LSTMs to tree-structured network topologies. Tree-LSTMs outperform all existing systems and strong LSTM baselines on two tasks: predicting the semantic relatedness of two sentences (SemEval 2014, Task 1) and sentiment classification (Stanford Sentiment Treebank).

759 citations


Posted Content
TL;DR: In this work, a deep convolutional neural network is incorporated into hash functions to jointly learn feature representations and mappings from them to hash codes, which avoids the limited semantic representation power of hand-crafted features.
Abstract: With the rapid growth of web images, hashing has received increasing interest in large scale image retrieval. Research efforts have been devoted to learning compact binary codes that preserve semantic similarity based on labels. However, most of these hashing methods are designed to handle simple binary similarity. The complex multilevel semantic structure of images associated with multiple labels has not yet been well explored. Here we propose a deep semantic ranking based method for learning hash functions that preserve multilevel semantic similarity between multi-label images. In our approach, a deep convolutional neural network is incorporated into the hash functions to jointly learn feature representations and mappings from them to hash codes, which avoids the limited semantic representation power of hand-crafted features. Meanwhile, a ranking list that encodes the multilevel similarity information is employed to guide the learning of such deep hash functions. An effective scheme based on surrogate loss is used to solve the intractable optimization problem of nonsmooth and multivariate ranking measures involved in the learning procedure. Experimental results show the superiority of our proposed approach over several state-of-the-art hashing methods in terms of ranking evaluation metrics when tested on multi-label image datasets.
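As a rough illustration only (not the paper's exact surrogate loss), multilevel similarity can be read off as the number of shared labels, and a triplet-style margin loss can push the relaxed Hamming distance of the more-similar pair below that of the less-similar pair; code lengths, margins and label sets below are toy values.

```python
import numpy as np

def shared_labels(a, b):
    """Multilevel similarity proxy: number of labels two images share."""
    return len(set(a) & set(b))

def surrogate_ranking_loss(h_q, h_pos, h_neg, margin=1.0):
    """h_*: continuous codes in [-1, 1] produced by a network before binarization."""
    d_pos = np.sum((h_q - h_pos) ** 2) / 4.0   # relaxed Hamming distance
    d_neg = np.sum((h_q - h_neg) ** 2) / 4.0
    return max(0.0, margin + d_pos - d_neg)

rng = np.random.default_rng(0)
codes = {k: np.tanh(rng.normal(size=48)) for k in "qab"}
labels = {"q": {"cat", "indoor"}, "a": {"cat", "indoor"}, "b": {"car"}}
# 'a' shares more labels with the query than 'b', so it should rank closer
assert shared_labels(labels["q"], labels["a"]) > shared_labels(labels["q"], labels["b"])
print(surrogate_ranking_loss(codes["q"], codes["a"], codes["b"]))
```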

520 citations


Proceedings ArticleDOI
07 Dec 2015
TL;DR: The authors consider a version of the zero-shot learning problem where seen-class source and target domain data are provided, and the goal at test time is to accurately predict the class label of an unseen target-domain instance based on revealed source-domain side information for unseen classes.
Abstract: In this paper we consider a version of the zero-shot learning problem where seen class source and target domain data are provided. The goal during test-time is to accurately predict the class label of an unseen target domain instance based on revealed source domain side information (e.g. attributes) for unseen classes. Our method is based on viewing each source or target data as a mixture of seen class proportions and we postulate that the mixture patterns have to be similar if the two instances belong to the same unseen class. This perspective leads us to learning source/target embedding functions that map an arbitrary source/target domain data into a same semantic space where similarity can be readily measured. We develop a max-margin framework to learn these similarity functions and jointly optimize parameters by means of cross validation. Our test results are compelling, leading to significant improvement in terms of accuracy on most benchmark datasets for zero-shot recognition.
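A heavily simplified sketch of the core idea: represent a target image and an unseen class's attribute vector each as a histogram of affinities to the seen classes, then match the two histograms. The softmax affinities and cosine matching below are my stand-ins for the paper's learned max-margin embedding functions.

```python
import numpy as np

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def mixture_over_seen(x, seen_prototypes):
    """Represent x as proportions of seen classes via softmax over cosine similarities
    (a simplification of the paper's learned embedding functions)."""
    sims = np.array([x @ p / (np.linalg.norm(x) * np.linalg.norm(p)) for p in seen_prototypes])
    return softmax(5.0 * sims)   # temperature 5.0 is arbitrary

rng = np.random.default_rng(2)
seen_img = [rng.normal(size=64) for _ in range(10)]   # seen-class prototypes, image space
seen_att = [rng.normal(size=16) for _ in range(10)]   # seen-class prototypes, attribute space

def predict_unseen(image_feat, unseen_attributes):
    """Pick the unseen class whose attribute-side mixture best matches the image-side mixture."""
    p_img = mixture_over_seen(image_feat, seen_img)
    scores = [p_img @ mixture_over_seen(a, seen_att) for a in unseen_attributes]
    return int(np.argmax(scores))

print(predict_unseen(rng.normal(size=64), [rng.normal(size=16) for _ in range(3)]))
```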

506 citations


Proceedings ArticleDOI
17 Oct 2015
TL;DR: This work proposes to go from word-level to text-level semantics by combining insights from methods based on external sources of semantic knowledge with word embeddings, and derives multiple types of meta-features from the comparison of the word vectors for short text pairs, and from the vector means of their respective word embeddings.
Abstract: Determining semantic similarity between texts is important in many tasks in information retrieval such as search, query suggestion, automatic summarization and image finding. Many approaches have been suggested, based on lexical matching, handcrafted patterns, syntactic parse trees, external sources of structured semantic knowledge and distributional semantics. However, lexical features, like string matching, do not capture semantic similarity beyond a trivial level. Furthermore, handcrafted patterns and external sources of structured semantic knowledge cannot be assumed to be available in all circumstances and for all domains. Lastly, approaches depending on parse trees are restricted to syntactically well-formed texts, typically of one sentence in length. We investigate whether determining short text similarity is possible using only semantic features---where by semantic we mean, pertaining to a representation of meaning---rather than relying on similarity in lexical or syntactic representations. We use word embeddings, vector representations of terms, computed from unlabelled data, that represent terms in a semantic space in which proximity of vectors can be interpreted as semantic similarity. We propose to go from word-level to text-level semantics by combining insights from methods based on external sources of semantic knowledge with word embeddings. A novel feature of our approach is that an arbitrary number of word embedding sets can be incorporated. We derive multiple types of meta-features from the comparison of the word vectors for short text pairs, and from the vector means of their respective word embeddings. The features representing labelled short text pairs are used to train a supervised learning algorithm. We use the trained model at testing time to predict the semantic similarity of new, unlabelled pairs of short texts. We show on a publicly available evaluation set commonly used for the task of semantic similarity that our method outperforms baseline methods that work under the same conditions.
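A minimal sketch of the pipeline described above, assuming a toy embedding set and invented gold scores: a few meta-features are derived from the mean vectors and from all word-pair cosines, then fed to an off-the-shelf supervised learner (a real system would combine several embedding sets and richer features).

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

def cos(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

def meta_features(text1, text2, emb):
    """Similarity of the mean vectors plus simple statistics over all word-pair cosines.
    'emb' is any word -> vector mapping."""
    v1 = [emb[w] for w in text1 if w in emb]
    v2 = [emb[w] for w in text2 if w in emb]
    pair = [cos(a, b) for a in v1 for b in v2]
    return [cos(np.mean(v1, axis=0), np.mean(v2, axis=0)),
            max(pair), min(pair), float(np.mean(pair))]

# toy embedding set and labelled pairs (scores are illustrative)
rng = np.random.default_rng(3)
emb = {w: rng.normal(size=50) for w in "dog cat pet animal car engine road".split()}
pairs = [(["dog", "pet"], ["cat", "animal"], 4.2),
         (["car", "engine"], ["dog", "cat"], 0.8)]
X = [meta_features(a, b, emb) for a, b, _ in pairs]
y = [s for *_, s in pairs]
model = RandomForestRegressor(n_estimators=10, random_state=0).fit(X, y)
print(model.predict([meta_features(["pet", "animal"], ["dog"], emb)]))
```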

426 citations


Posted Content
TL;DR: In this paper, a max-margin framework is developed to learn source/target embedding functions that map an arbitrary source and target domain data into a same semantic space where similarity can be readily measured.
Abstract: In this paper we consider a version of the zero-shot learning problem where seen class source and target domain data are provided. The goal during test-time is to accurately predict the class label of an unseen target domain instance based on revealed source domain side information (e.g. attributes) for unseen classes. Our method is based on viewing each source or target data as a mixture of seen class proportions and we postulate that the mixture patterns have to be similar if the two instances belong to the same unseen class. This perspective leads us to learning source/target embedding functions that map an arbitrary source/target domain data into a same semantic space where similarity can be readily measured. We develop a max-margin framework to learn these similarity functions and jointly optimize parameters by means of cross validation. Our test results are compelling, leading to significant improvement in terms of accuracy on most benchmark datasets for zero-shot recognition.

411 citations


Proceedings ArticleDOI
07 Jun 2015
TL;DR: A deep semantic ranking based method is proposed for learning hash functions that preserve multilevel semantic similarity between multi-label images, avoiding the limited semantic representation power of hand-crafted features.
Abstract: With the rapid growth of web images, hashing has received increasing interest in large scale image retrieval. Research efforts have been devoted to learning compact binary codes that preserve semantic similarity based on labels. However, most of these hashing methods are designed to handle simple binary similarity. The complex multi-level semantic structure of images associated with multiple labels has not yet been well explored. Here we propose a deep semantic ranking based method for learning hash functions that preserve multilevel semantic similarity between multi-label images. In our approach, a deep convolutional neural network is incorporated into the hash functions to jointly learn feature representations and mappings from them to hash codes, which avoids the limited semantic representation power of hand-crafted features. Meanwhile, a ranking list that encodes the multilevel similarity information is employed to guide the learning of such deep hash functions. An effective scheme based on surrogate loss is used to solve the intractable optimization problem of nonsmooth and multivariate ranking measures involved in the learning procedure. Experimental results show the superiority of our proposed approach over several state-of-the-art hashing methods in terms of ranking evaluation metrics when tested on multi-label image datasets.

377 citations


Journal ArticleDOI
TL;DR: An explicit analysis of the existing methods of semantic mapping is sought, and the various algorithms are categorized according to their primary characteristics, namely scalability, inference model, temporal coherence and topological map usage.

348 citations


Proceedings ArticleDOI
01 Jul 2015
TL;DR: This work proposes a multifaceted approach that transforms word embeddings to the sense level and leverages knowledge from a large semantic network for effective semantic similarity measurement.
Abstract: Word embeddings have recently gained considerable popularity for modeling words in different Natural Language Processing (NLP) tasks including semantic similarity measurement. However, notwithstanding their success, word embeddings are by their very nature unable to capture polysemy, as different meanings of a word are conflated into a single representation. In addition, their learning process usually relies on massive corpora only, preventing them from taking advantage of structured knowledge. We address both issues by proposing a multifaceted approach that transforms word embeddings to the sense level and leverages knowledge from a large semantic network for effective semantic similarity measurement. We evaluate our approach on word similarity and relational similarity frameworks, reporting state-of-the-art performance on multiple datasets.

Proceedings ArticleDOI
07 Jun 2015
TL;DR: The semantic manifold structure is used to redefine the distance metric in the semantic embedding space for more effective ZSL, and the proposed new model improves upon and seamlessly unifies various existing ZSL algorithms.
Abstract: Object recognition by zero-shot learning (ZSL) aims to recognise objects without seeing any visual examples by learning knowledge transfer between seen and unseen object classes. This is typically achieved by exploring a semantic embedding space such as attribute space or semantic word vector space. In such a space, both seen and unseen class labels, as well as image features can be embedded (projected), and the similarity between them can thus be measured directly. Existing works differ in what embedding space is used and how to project the visual data into the semantic embedding space. Yet, they all measure the similarity in the space using a conventional distance metric (e.g. cosine) that does not consider the rich intrinsic structure, i.e. semantic manifold, of the semantic categories in the embedding space. In this paper we propose to model the semantic manifold in an embedding space using a semantic class label graph. The semantic manifold structure is used to redefine the distance metric in the semantic embedding space for more effective ZSL. The proposed semantic manifold distance is computed using a novel absorbing Markov chain process (AMP), which has a very efficient closed-form solution. The proposed new model improves upon and seamlessly unifies various existing ZSL algorithms. Extensive experiments on both the large scale ImageNet dataset and the widely used Animals with Attributes (AwA) dataset show that our model significantly outperforms the state of the art.
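The closed-form quantity underlying an absorbing Markov chain is the absorption-probability matrix B = (I - Q)^{-1} R; the sketch below computes it for a toy transition matrix (the paper builds its chain from a semantic class-label graph, which is not reproduced here).

```python
import numpy as np

# Toy absorbing Markov chain: transient states would correspond to class-label graph
# nodes, absorbing states to candidate unseen classes. Transition values are illustrative.
P = np.array([
    [0.0, 0.5, 0.3, 0.2, 0.0],   # transient state 0
    [0.4, 0.0, 0.1, 0.0, 0.5],   # transient state 1
    [0.2, 0.2, 0.0, 0.3, 0.3],   # transient state 2
    [0.0, 0.0, 0.0, 1.0, 0.0],   # absorbing state 3
    [0.0, 0.0, 0.0, 0.0, 1.0],   # absorbing state 4
])
n_transient = 3
Q = P[:n_transient, :n_transient]                 # transient -> transient
R = P[:n_transient, n_transient:]                 # transient -> absorbing
B = np.linalg.inv(np.eye(n_transient) - Q) @ R    # B[i, j] = P(absorbed in j | start in i)
print(B)
print(B.sum(axis=1))   # each row sums to 1
```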

Journal ArticleDOI
Tingting Wei1, Yonghe Lu1, Huiyou Chang1, Qiang Zhou1, Xianyu Bao 
TL;DR: The proposed approach exploits ontology hierarchical structure and relations to provide a more accurate assessment of the similarity between terms for word sense disambiguation and introduces lexical chains to extract a set of semantically related words from texts, which can represent the semantic content of the texts.
Abstract: A modified WordNet-based similarity measure for word sense disambiguation. Lexical chains as a text representation that ideally covers the theme of texts. Extracted core semantics are sufficient to reduce the dimensionality of the feature set. The proposed scheme is able to correctly estimate the true number of clusters. The topic labels are a good indicator for recognizing and understanding the clusters. Traditional clustering algorithms do not consider the semantic relationships among words and so cannot accurately represent the meaning of documents. To overcome this problem, introducing semantic information from an ontology such as WordNet has been widely used to improve the quality of text clustering. However, there still exist several challenges, such as synonymy and polysemy, high dimensionality, extracting core semantics from texts, and assigning appropriate descriptions to the generated clusters. In this paper, we report our attempt towards integrating WordNet with lexical chains to alleviate these problems. The proposed approach exploits ontology hierarchical structure and relations to provide a more accurate assessment of the similarity between terms for word sense disambiguation. Furthermore, we introduce lexical chains to extract a set of semantically related words from texts, which can represent the semantic content of the texts. Although lexical chains have been extensively used in text summarization, their potential impact on the text clustering problem has not been fully investigated. Our integrated approach can identify the theme of documents based on the disambiguated core features extracted, and in parallel downsize the dimensions of the feature space. The experimental results using the proposed framework on Reuters-21578 show that clustering performance improves significantly compared to several classical methods.
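A rough sketch of the building blocks, assuming NLTK and its WordNet corpus are installed: words are linked into a chain when their best synset-to-synset Wu-Palmer similarity exceeds a threshold. This greedy grouping and the plain Wu-Palmer measure are simplifications of the paper's modified similarity measure and chain construction.

```python
from itertools import product
from nltk.corpus import wordnet as wn   # assumes nltk and its 'wordnet' corpus are installed

def best_similarity(w1, w2):
    """Best Wu-Palmer similarity over all noun-synset pairs (a stand-in for the paper's
    modified WordNet measure)."""
    scores = [s1.wup_similarity(s2) or 0.0
              for s1, s2 in product(wn.synsets(w1, pos=wn.NOUN), wn.synsets(w2, pos=wn.NOUN))]
    return max(scores, default=0.0)

def lexical_chains(words, threshold=0.8):
    """Greedily group words into chains of semantically related terms."""
    chains = []
    for w in words:
        for chain in chains:
            if any(best_similarity(w, c) >= threshold for c in chain):
                chain.append(w)
                break
        else:
            chains.append([w])
    return chains

print(lexical_chains(["car", "automobile", "engine", "banana", "fruit"]))
```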

Proceedings ArticleDOI
01 Sep 2015
TL;DR: A multisense embedding model based on Chinese Restaurant Processes is introduced that achieves state of the art performance on matching human word similarity judgments, and a pipelined architecture for incorporating multi-sense embeddings into language understanding is proposed.
Abstract: Learning a distinct representation for each sense of an ambiguous word could lead to more powerful and fine-grained models of vector-space representations. Yet while ‘multi-sense’ methods have been proposed and tested on artificial word-similarity tasks, we don’t know if they improve real natural language understanding tasks. In this paper we introduce a multi-sense embedding model based on Chinese Restaurant Processes that achieves state of the art performance on matching human word similarity judgments, and propose a pipelined architecture for incorporating multi-sense embeddings into language understanding. We then test the performance of our model on part-of-speech tagging, named entity recognition, sentiment analysis, semantic relation identification and semantic relatedness, controlling for embedding dimensionality. We find that multi-sense embeddings do improve performance on some tasks (part-of-speech tagging, semantic relation identification, semantic relatedness) but not on others (named entity recognition, various forms of sentiment analysis). We discuss how these differences may be caused by the different role of word sense information in each of the tasks. The results highlight the importance of testing embedding models in real applications.
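The Chinese-Restaurant-Process intuition can be sketched as follows: an occurrence of an ambiguous word joins an existing sense with probability proportional to how often that sense has been used and how well it fits the current context, or opens a new sense with probability proportional to a concentration parameter. The update rule and weighting below are my simplifications, not the paper's exact model.

```python
import numpy as np

def cos(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

def crp_assign_sense(context_vec, senses, gamma=1.0, rng=np.random.default_rng(0)):
    """senses: list of dicts with 'vec' (sense vector) and 'count' (times used).
    Returns the chosen sense index, creating a new sense if the CRP says so."""
    weights = [s["count"] * max(cos(context_vec, s["vec"]), 1e-3) for s in senses]
    weights.append(gamma)                      # probability mass for a brand-new sense
    p = np.array(weights) / np.sum(weights)
    k = rng.choice(len(weights), p=p)
    if k == len(senses):                       # open a new "table"
        senses.append({"vec": context_vec.copy(), "count": 0})
    senses[k]["count"] += 1
    senses[k]["vec"] = (senses[k]["vec"] + context_vec) / 2.0   # crude sense-vector update
    return k

rng = np.random.default_rng(4)
senses = []
for _ in range(10):                            # stream of contexts for one ambiguous word
    crp_assign_sense(rng.normal(size=25), senses, rng=rng)
print(len(senses), [s["count"] for s in senses])
```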

Journal ArticleDOI
TL;DR: Two novel lncRNA functional similarity calculation models (LNCSIM) are developed based on the assumption that functionally similar lncRNAs tend to be associated with similar diseases and it is anticipated that LNCSIM could be a useful and important biological tool for human disease diagnosis, treatment, and prevention.
Abstract: Increasing evidence has indicated that plenty of lncRNAs play important roles in many critical biological processes. Developing powerful computational models to construct lncRNA functional similarity networks based on heterogeneous biological datasets is one of the most important and popular topics in the fields of both lncRNAs and complex diseases. Functional similarity network construction could benefit the model development for both lncRNA function inference and lncRNA-disease association identification. However, little effort has been made to analyze and calculate lncRNA functional similarity on a large scale. In this study, based on the assumption that functionally similar lncRNAs tend to be associated with similar diseases, we developed two novel lncRNA functional similarity calculation models (LNCSIM). LNCSIM was evaluated by introducing similarity scores into the model of Laplacian Regularized Least Squares for LncRNA-Disease Association (LRLSLDA) for lncRNA-disease association prediction. As a result, the new predictive models improved the performance of LRLSLDA in the leave-one-out cross validation of various known lncRNA-disease association datasets. Furthermore, some of the predictive results for colorectal cancer and lung cancer were verified by independent biological experimental studies. It is anticipated that LNCSIM could be a useful and important biological tool for human disease diagnosis, treatment, and prevention.

Posted Content
TL;DR: This paper proposed a multi-sense embedding model based on Chinese Restaurant Processes that achieves state-of-the-art performance on matching human word similarity judgments, and proposed a pipelined architecture for incorporating multisense embeddings into language understanding.
Abstract: Learning a distinct representation for each sense of an ambiguous word could lead to more powerful and fine-grained models of vector-space representations. Yet while `multi-sense' methods have been proposed and tested on artificial word-similarity tasks, we don't know if they improve real natural language understanding tasks. In this paper we introduce a multi-sense embedding model based on Chinese Restaurant Processes that achieves state of the art performance on matching human word similarity judgments, and propose a pipelined architecture for incorporating multi-sense embeddings into language understanding. We then test the performance of our model on part-of-speech tagging, named entity recognition, sentiment analysis, semantic relation identification and semantic relatedness, controlling for embedding dimensionality. We find that multi-sense embeddings do improve performance on some tasks (part-of-speech tagging, semantic relation identification, semantic relatedness) but not on others (named entity recognition, various forms of sentiment analysis). We discuss how these differences may be caused by the different role of word sense information in each of the tasks. The results highlight the importance of testing embedding models in real applications.

Proceedings ArticleDOI
01 Jun 2015
TL;DR: This shared task presents evaluations of Paraphrase Identification (PI) and Semantic Textual Similarity (SS) systems on Twitter data and suggests the importance of bringing these two research areas together.
Abstract: In this shared task, we present evaluations on two related tasks, Paraphrase Identification (PI) and Semantic Textual Similarity (SS), for Twitter data. Given a pair of sentences, participants are asked to produce a binary yes/no judgement or a graded score to measure their semantic equivalence. The task features a newly constructed Twitter Paraphrase Corpus that contains 18,762 sentence pairs. A total of 19 teams participated, submitting 36 runs to the PI task and 26 runs to the SS task. The evaluation shows encouraging results and open challenges for future research. The best systems scored an F1-measure of 0.674 for the PI task and a Pearson correlation of 0.619 for the SS task respectively, compared to a strong logistic regression baseline of 0.589 F1 and 0.511 Pearson; the best SS systems can often reach >0.80 Pearson on well-formed text. This shared task also provides insights into the relation between the PI and SS tasks and suggests the importance of bringing these two research areas together. We make all the data, baseline systems and evaluation scripts publicly available.
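The baseline mentioned above is a logistic regression model; a minimal sketch with simple lexical-overlap features follows (the shared task's actual feature set is not specified here, so the features, sentence pairs and labels are invented for illustration).

```python
from sklearn.linear_model import LogisticRegression

def overlap_features(s1, s2):
    """Very simple lexical features; the shared task's real baseline features may differ."""
    t1, t2 = set(s1.lower().split()), set(s2.lower().split())
    inter = len(t1 & t2)
    return [inter / max(len(t1), 1), inter / max(len(t2), 1), abs(len(t1) - len(t2))]

# toy labelled tweet pairs: 1 = paraphrase, 0 = not
pairs = [("the crowd was huge tonight", "so many people at the show tonight", 1),
         ("i love this phone", "the weather is terrible", 0),
         ("that game was amazing", "what an amazing game", 1),
         ("new album drops friday", "my car broke down again", 0)]
X = [overlap_features(a, b) for a, b, _ in pairs]
y = [label for *_, label in pairs]
clf = LogisticRegression().fit(X, y)
print(clf.predict([overlap_features("great show tonight", "the show tonight was great")]))
```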

Posted Content
TL;DR: Two target-dependent long short-term memory (LSTM) models are developed, in which target information is automatically taken into account; they achieve state-of-the-art performance on a benchmark dataset from Twitter.
Abstract: Target-dependent sentiment classification remains a challenge: modeling the semantic relatedness of a target with its context words in a sentence. Different context words have different influences on determining the sentiment polarity of a sentence towards the target. Therefore, it is desirable to integrate the connections between target word and context words when building a learning system. In this paper, we develop two target dependent long short-term memory (LSTM) models, where target information is automatically taken into account. We evaluate our methods on a benchmark dataset from Twitter. Empirical results show that modeling sentence representation with standard LSTM does not perform well. Incorporating target information into LSTM can significantly boost the classification accuracy. The target-dependent LSTM models achieve state-of-the-art performances without using syntactic parser or external sentiment lexicons.
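In the spirit of the target-dependent LSTM described above, one LSTM reads the left context up to and including the target, another reads the (reversed) right context down to the target, and the two final hidden states are concatenated for classification. The sketch below is a toy numpy version; for brevity it shares parameters between the two directions, whereas the paper's models keep them separate.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_last_hidden(inputs, W, U, b, d):
    """Run a plain LSTM over a sequence of vectors and return the final hidden state.
    Gate order in the stacked weights: input, forget, output, candidate."""
    h, c = np.zeros(d), np.zeros(d)
    for x in inputs:
        z = W @ x + U @ h + b
        i, f, o = sigmoid(z[:d]), sigmoid(z[d:2*d]), sigmoid(z[2*d:3*d])
        g = np.tanh(z[3*d:])
        c = f * c + i * g
        h = o * np.tanh(c)
    return h

def td_lstm(left_with_target, right_with_target, params):
    """Concatenate the final hidden states of a left-to-target and a right-to-target LSTM."""
    h_l = lstm_last_hidden(left_with_target, *params)
    h_r = lstm_last_hidden(list(reversed(right_with_target)), *params)
    return np.concatenate([h_l, h_r])

d_in, d = 8, 6
rng = np.random.default_rng(5)
params = (rng.normal(size=(4*d, d_in)), rng.normal(size=(4*d, d)), np.zeros(4*d), d)
sent = [rng.normal(size=d_in) for _ in range(7)]       # toy word vectors
target_idx = 3
print(td_lstm(sent[:target_idx+1], sent[target_idx:], params).shape)   # (12,)
```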

Journal ArticleDOI
TL;DR: The evaluation results show the efficiency and effectiveness of the recommendation mechanism implemented by RecomMetz in both a cold-start scenario and a non-cold-start scenario.
Abstract: Recommender systems are used to provide filtered information from a large amount of elements. They provide personalized recommendations on products or services to users. The recommendations are intended to provide interesting elements to users. Recommender systems can be developed using different techniques and algorithms where the selection of these techniques depends on the area in which they will be applied. This paper proposes a recommender system in the leisure domain, specifically in the movie showtimes domain. The system proposed is called RecomMetz, and it is a context-aware mobile recommender system based on Semantic Web technologies. In detail, a domain ontology primarily serving a semantic similarity metric adjusted to the concept of “packages of single items” was developed in this research. In addition, location, crowd and time were considered as three different kinds of contextual information in RecomMetz. In a nutshell, RecomMetz has unique features: (1) the items to be recommended have a composite structure (movie theater + movie + showtime), (2) the integration of the time and crowd factors into a context-aware model, (3) the implementation of an ontology-based context modeling approach and (4) the development of a multi-platform native mobile user interface intended to leverage the hardware capabilities (sensors) of mobile devices. The evaluation results show the efficiency and effectiveness of the recommendation mechanism implemented by RecomMetz in both a cold-start scenario and a no cold-start scenario.

Proceedings ArticleDOI
01 Sep 2015
TL;DR: It is shown that using specialized spaces in NLP tasks and applications leads to clear improvements, for document classification and synonym selection, which rely on either similarity or relatedness but not both.
Abstract: We demonstrate the advantage of specializing semantic word embeddings for either similarity or relatedness. We compare two variants of retrofitting and a joint-learning approach, and find that all three yield specialized semantic spaces that capture human intuitions regarding similarity and relatedness better than unspecialized spaces. We also show that using specialized spaces in NLP tasks and applications leads to clear improvements, for document classification and synonym selection, which rely on either similarity or relatedness but not both.
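One family of approaches compared above is retrofitting; a minimal numpy sketch of the standard Faruqui-style update follows, in which each word vector is pulled toward its lexicon neighbours. The toy lexicons below stand in for the similarity-oriented and relatedness-oriented resources the paper specializes with; the exact variants used in the paper may differ.

```python
import numpy as np

def retrofit(vectors, lexicon, alpha=1.0, beta=1.0, iters=10):
    """Each word's new vector becomes a weighted average of its original vector and its
    lexicon neighbours; which relation the lexicon encodes (synonymy vs. broader
    relatedness) is what specializes the space."""
    new = {w: v.copy() for w, v in vectors.items()}
    for _ in range(iters):
        for w, neighbours in lexicon.items():
            nbrs = [new[n] for n in neighbours if n in new]
            if not nbrs:
                continue
            new[w] = (alpha * vectors[w] + beta * np.sum(nbrs, axis=0)) / (alpha + beta * len(nbrs))
    return new

rng = np.random.default_rng(6)
vectors = {w: rng.normal(size=10) for w in ["coffee", "tea", "cup", "beverage"]}
similarity_lexicon = {"coffee": ["beverage"], "tea": ["beverage"]}            # synonym-like links
relatedness_lexicon = {"coffee": ["cup", "beverage"], "tea": ["cup", "beverage"]}
spec_sim = retrofit(vectors, similarity_lexicon)
spec_rel = retrofit(vectors, relatedness_lexicon)
```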

Journal ArticleDOI
TL;DR: L-GRAAL's results are the first to show that topological information is more important than sequence information for uncovering functionally conserved interactions; L-GRAAL is compared with state-of-the-art network aligners on the largest available PPI networks from BioGRID.
Abstract: Motivation: Discovering and understanding patterns in networks of protein–protein interactions (PPIs) is a central problem in systems biology. Alignments between these networks aid functional understanding as they uncover important information, such as evolutionary conserved pathways, protein complexes and functional orthologs. A few methods have been proposed for global PPI network alignments, but because of the NP-completeness of the underlying sub-graph isomorphism problem, producing topologically and biologically accurate alignments remains a challenge. Results: We introduce a novel global network alignment tool, Lagrangian GRAphlet-based ALigner (L-GRAAL), which directly optimizes both the protein and the interaction functional conservations, using a novel alignment search heuristic based on integer programming and Lagrangian relaxation. We compare L-GRAAL with the state-of-the-art network aligners on the largest available PPI networks from BioGRID and observe that L-GRAAL uncovers the largest common sub-graphs between the networks, as measured by edge-correctness and symmetric sub-structures scores, which allow transferring more functional information across networks. We assess the biological quality of the protein mappings using the semantic similarity of their Gene Ontology annotations and observe that L-GRAAL best uncovers functionally conserved proteins. Furthermore, we introduce for the first time a measure of the semantic similarity of the mapped interactions and show that L-GRAAL also best uncovers functionally conserved interactions. In addition, we illustrate on the PPI networks of baker's yeast and human the ability of L-GRAAL to predict new PPIs. Finally, L-GRAAL's results are the first to show that topological information is more important than sequence information for uncovering functionally conserved interactions. Availability and implementation: L-GRAAL is coded in C++. Software is available at: http://bio-nets.doc.ic.ac.uk/L-GRAAL/. Contact: n.malod-dognin@imperial.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.

Proceedings ArticleDOI
25 Aug 2015
TL;DR: In this article, instead of measuring lexical overlaps, word embeddings are used to compute the semantic similarity of the words used in summaries, which achieves better correlations with human judgements when measured with the Spearman and Kendall rank coefficients.
Abstract: ROUGE is a widely adopted, automatic evaluation measure for text summarization. While it has been shown to correlate well with human judgements, it is biased towards surface lexical similarities. This makes it unsuitable for the evaluation of abstractive summarization, or summaries with substantial paraphrasing. We study the effectiveness of word embeddings to overcome this disadvantage of ROUGE. Specifically, instead of measuring lexical overlaps, word embeddings are used to compute the semantic similarity of the words used in summaries. Our experimental results show that our proposal is able to achieve better correlations with human judgements when measured with the Spearman and Kendall rank coefficients.
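A simplified stand-in for the embedding-based variant described above: instead of requiring an exact lexical match as ROUGE does, each reference word is credited with its best cosine match in the candidate summary. The toy random embeddings below are placeholders for real pre-trained vectors.

```python
import numpy as np

def cos(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

def embedding_recall(candidate, reference, emb):
    """Soft unigram recall: average best-match cosine of reference words against the candidate."""
    scores = [max(cos(emb[r], emb[c]) for c in candidate if c in emb)
              for r in reference if r in emb]
    return float(np.mean(scores)) if scores else 0.0

rng = np.random.default_rng(7)
emb = {w: rng.normal(size=50) for w in "the cat sat mat feline rested rug on".split()}
reference = "the cat sat on the mat".split()
candidate = "the feline rested on the rug".split()
print(embedding_recall(candidate, reference, emb))
```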

Proceedings ArticleDOI
01 Jun 2015
TL;DR: The authors describe a set of top-performing systems at the SemEval 2015 English Semantic Textual Similarity (STS) task, including a supervised system that uses word alignments and similarities between compositional sentence vectors as its features.
Abstract: We describe a set of top-performing systems at the SemEval 2015 English Semantic Textual Similarity (STS) task. Given two English sentences, each system outputs the degree of their semantic similarity. Our unsupervised system, which is based on word alignments across the two input sentences, ranked 5th among 73 submitted system runs with a mean correlation of 79.19% with human annotations. We also submitted two runs of a supervised system which uses word alignments and similarities between compositional sentence vectors as its features. Our best supervised run ranked 1st with a mean correlation of 80.15%.
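The unsupervised system scores a pair by the proportion of content words that can be aligned across the two sentences. The sketch below aligns words by lowercase identity only and uses an invented stopword list, whereas the actual aligner also uses paraphrase and contextual evidence.

```python
def alignment_similarity(s1, s2, stopwords=frozenset({"the", "a", "an", "of", "on", "in", "is"})):
    """STS score as the proportion of aligned content words across the two sentences."""
    c1 = [w.lower() for w in s1.split() if w.lower() not in stopwords]
    c2 = [w.lower() for w in s2.split() if w.lower() not in stopwords]
    if not c1 or not c2:
        return 0.0
    aligned1 = sum(1 for w in c1 if w in c2)   # content words of s1 aligned into s2
    aligned2 = sum(1 for w in c2 if w in c1)   # and vice versa
    return (aligned1 + aligned2) / (len(c1) + len(c2))

print(alignment_similarity("A man is playing a guitar", "The man plays a guitar"))
```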

Journal ArticleDOI
TL;DR: Semantic measures assess the similarity or relatedness of semantic entities by taking into account their semantics, i.e. their meaning; intuitively, the words tea and coffee, which both refer to stimulating beverages, will be estimated to be more semantically similar than the words toffee (a confection) and coffee, even though the latter pair has higher syntactic similarity.
Abstract: Artificial Intelligence federates numerous scientific fields with the aim of developing machines able to assist human operators performing complex treatments, most of which demand high cognitive skills (e.g. learning or decision processes). Central to this quest is to give machines the ability to estimate the likeness or similarity between things in the way human beings estimate the similarity between stimuli. In this context, this book focuses on semantic measures: approaches designed for comparing semantic entities such as units of language, e.g. words, sentences, or concepts and instances defined in knowledge bases. The aim of these measures is to assess the similarity or relatedness of such semantic entities by taking into account their semantics, i.e. their meaning; intuitively, the words tea and coffee, which both refer to stimulating beverages, will be estimated to be more semantically similar than the words toffee (a confection) and coffee, even though the latter pair has higher syntactic similarity. The two state-of-the-art approaches for estimating and quantifying the semantic similarity/relatedness of semantic entities are presented in detail: the first relies on corpora analysis and is based on Natural Language Processing techniques and semantic models, while the second is based on more or less formal, computer-readable and workable forms of knowledge such as semantic networks, thesauri or ontologies. Semantic measures are widely used today to compare units of language, concepts, instances or even resources indexed by them (e.g., documents, genes). They are central elements of a large variety of Natural Language Processing applications and knowledge-based treatments, and have therefore naturally been subject to intensive and interdisciplinary research efforts during recent decades. Beyond a simple inventory and categorization of existing measures, the aim of this monograph is to guide novices as well as researchers of these domains toward a better understanding of semantic similarity estimation and, more generally, semantic measures. To this end, we propose an in-depth characterization of existing proposals by discussing their features, the assumptions on which they are based and empirical results regarding their performance in particular applications. By answering these questions and by providing a detailed discussion on the foundations of semantic measures, our aim is to give the reader the key knowledge required to: (i) select the most relevant methods according to a particular usage context, (ii) understand the challenges offered to this field of study, (iii) identify room for improvement in state-of-the-art approaches and (iv) stimulate creativity toward the development of new approaches. To this end, several definitions, theoretical and practical details, as well as concrete applications are presented.

Posted Content
TL;DR: Two target dependent long short-term memory models, where target information is automatically taken into account, are developed, which achieve state-of-the-art performances without using syntactic parser or external sentiment lexicons.
Abstract: Target-dependent sentiment classification remains a challenge: modeling the semantic relatedness of a target with its context words in a sentence. Different context words have different influences on determining the sentiment polarity of a sentence towards the target. Therefore, it is desirable to integrate the connections between target word and context words when building a learning system. In this paper, we develop two target dependent long short-term memory (LSTM) models, where target information is automatically taken into account. We evaluate our methods on a benchmark dataset from Twitter. Empirical results show that modeling sentence representation with standard LSTM does not perform well. Incorporating target information into LSTM can significantly boost the classification accuracy. The target-dependent LSTM models achieve state-of-the-art performances without using syntactic parser or external sentiment lexicons.

Journal ArticleDOI
01 Apr 2015
TL;DR: The effectiveness of utilizing semantic knowledge of items to enhance the recommendation quality is presented and a new Inferential Ontology-based Semantic Similarity (IOBSS) measure is proposed to evaluate semantic similarity between items in a specific domain of interest.
Abstract: Recommender systems are effectively used as a personalized information filtering technology to automatically predict and identify a set of interesting items on behalf of users according to their personal needs and preferences. Collaborative Filtering (CF) approach is commonly used in the context of recommender systems; however, obtaining better prediction accuracy and overcoming the main limitations of the standard CF recommendation algorithms, such as sparsity and cold-start item problems, remain a significant challenge. Recent developments in personalization and recommendation techniques support the use of semantic enhanced hybrid recommender systems, which incorporate ontology-based semantic similarity measure with other recommendation approaches to improve the quality of recommendations. Consequently, this paper presents the effectiveness of utilizing semantic knowledge of items to enhance the recommendation quality. It proposes a new Inferential Ontology-based Semantic Similarity (IOBSS) measure to evaluate semantic similarity between items in a specific domain of interest by taking into account their explicit hierarchical relationships, shared attributes and implicit relationships. The paper further proposes a hybrid semantic enhanced recommendation approach by combining the new IOBSS measure and the standard item-based CF approach. A set of experiments with promising results validates the effectiveness of the proposed hybrid approach, using a case study of the Australian e-Government tourism services. A hybrid semantic enhanced recommendation approach. A new Inferential Ontology-based Semantic Similarity (IOBSS) measure between two ontological instances. A few new concepts: Association, Associate Network and Common Associate Pair Set. A case study of Australian e-Government tourism services.
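A minimal sketch of the hybrid idea, under the assumption that an ontology-based similarity (here a toy matrix standing in for IOBSS) is blended with an item-based CF similarity and a rating is predicted from the most similar rated items; the 50/50 weight and data are illustrative, not the paper's tuned setting.

```python
def hybrid_similarity(item_a, item_b, semantic_sim, cf_sim, w=0.5):
    """Blend ontology-based semantic similarity with item-based CF similarity."""
    return w * semantic_sim[item_a][item_b] + (1 - w) * cf_sim[item_a][item_b]

def predict_rating(user_ratings, target_item, items, semantic_sim, cf_sim, k=2):
    """Item-based prediction: similarity-weighted average over the k most similar rated items."""
    rated = [(i, hybrid_similarity(target_item, i, semantic_sim, cf_sim))
             for i in items if i in user_ratings and i != target_item]
    top = sorted(rated, key=lambda t: t[1], reverse=True)[:k]
    num = sum(sim * user_ratings[i] for i, sim in top)
    den = sum(sim for _, sim in top)
    return num / den if den else 0.0

items = ["zoo", "museum", "beach"]
semantic_sim = {"zoo": {"zoo": 1, "museum": 0.6, "beach": 0.2},
                "museum": {"zoo": 0.6, "museum": 1, "beach": 0.1},
                "beach": {"zoo": 0.2, "museum": 0.1, "beach": 1}}
cf_sim = {"zoo": {"zoo": 1, "museum": 0.4, "beach": 0.5},
          "museum": {"zoo": 0.4, "museum": 1, "beach": 0.3},
          "beach": {"zoo": 0.5, "museum": 0.3, "beach": 1}}
print(predict_rating({"museum": 4.0, "beach": 2.0}, "zoo", items, semantic_sim, cf_sim))
```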

Proceedings ArticleDOI
01 Jan 2015
TL;DR: A vector representation technique that combines the complementary knowledge of both lexicographic and encyclopedic resources, such as Wikipedia, and attains state-of-the-art performance on multiple datasets in two standard benchmarks: word similarity and sense clustering.
Abstract: The semantic representation of individual word senses and concepts is of fundamental importance to several applications in Natural Language Processing. To date, concept modeling techniques have in the main based their representation either on lexicographic resources, such as WordNet, or on encyclopedic resources, such as Wikipedia. We propose a vector representation technique that combines the complementary knowledge of both these types of resource. Thanks to its use of explicit semantics combined with a novel cluster-based dimensionality reduction and an effective weighting scheme, our representation attains state-of-the-art performance on multiple datasets in two standard benchmarks: word similarity and sense clustering. We are releasing our vector representations at http://lcl.uniroma1.it/nasari/.

Journal ArticleDOI
01 May 2015
TL;DR: Results indicate that the proposed approach performs better than other summarization systems, and that integrating a genetic algorithm with the SRL-based framework further improves abstractive summarization results.
Abstract: We have proposed a framework for multi-document abstractive summarization based on semantic role labeling (SRL). To the best of our knowledge, SRL has not previously been employed for abstractive summarization. The integration of a genetic algorithm with the SRL-based framework gives improved summarization results. The study focuses on these two highlights, and the discussion is based on them. We propose a framework for abstractive summarization of multi-documents, which aims to select contents of the summary not from the source document sentences but from the semantic representation of the source documents. In this framework, contents of the source documents are represented by predicate argument structures obtained through semantic role labeling. Content selection for the summary is made by ranking the predicate argument structures based on optimized features, and language generation is used for generating sentences from predicate argument structures. Our proposed framework differs from other abstractive summarization approaches in a few aspects. First, it employs semantic role labeling for semantic representation of text. Secondly, it analyzes the source text semantically by utilizing a semantic similarity measure in order to cluster semantically similar predicate argument structures across the text; and finally it ranks the predicate argument structures based on features weighted by a genetic algorithm (GA). The experiments in this study are carried out using DUC-2002, a standard corpus for text summarization. Results indicate that the proposed approach performs better than other summarization systems.
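The content-selection step can be sketched as scoring each predicate-argument structure (PAS) by a weighted sum of its features, where the weights would come from the genetic algorithm; the feature names, weights and PAS entries below are toy values for illustration only.

```python
import numpy as np

def rank_pas(pas_list, weights):
    """Score each PAS as a weighted sum of its features and return them best-first."""
    scores = [float(np.dot(weights, p["features"])) for p in pas_list]
    order = np.argsort(scores)[::-1]
    return [pas_list[i] for i in order]

# toy PAS entries; features might be e.g. [position, title overlap, length, frequency]
pas_list = [
    {"text": "[ARG0 the company] [V announced] [ARG1 record profits]", "features": [0.9, 0.8, 0.5, 0.7]},
    {"text": "[ARG0 analysts] [V expected] [ARG1 a smaller gain]",     "features": [0.4, 0.2, 0.6, 0.3]},
]
weights = np.array([0.3, 0.4, 0.1, 0.2])    # in the paper these would be optimized by the GA
for p in rank_pas(pas_list, weights)[:1]:   # top-ranked PAS goes to language generation
    print(p["text"])
```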

Journal ArticleDOI
TL;DR: The R package LSAfun enables a variety of functions and computations based on Vector Semantic Models such as Latent Semantic Analysis (LSA), which are procedures to obtain a high-dimensional vector representation for words (and documents) from a text corpus.
Abstract: In this article, the R package LSAfun is presented. This package enables a variety of functions and computations based on Vector Semantic Models such as Latent Semantic Analysis (LSA; Landauer, Foltz, & Laham, Discourse Processes 25:259–284, 1998), which are procedures to obtain a high-dimensional vector representation for words (and documents) from a text corpus. Such representations are thought to capture the semantic meaning of a word (or document) and allow for semantic similarity comparisons between words to be calculated as the cosine of the angle between their associated vectors. LSAfun uses pre-created LSA spaces and provides functions for (a) Similarity Computations between words, word lists, and documents; (b) Neighborhood Computations, such as obtaining a word’s or document’s most similar words, (c) plotting such a neighborhood, as well as similarity structures for any word lists, in a two- or three-dimensional approximation using Multidimensional Scaling, (d) Applied Functions, such as computing the coherence of a text, answering multiple choice questions and producing generic text summaries; and (e) Composition Methods for obtaining vector representations for two-word phrases. The purpose of this package is to allow convenient access to computations based on LSA.
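As a conceptual illustration only (LSAfun itself is an R package, and this is not its API), the two core operations it provides, cosine similarity in a pre-built LSA space and nearest-neighbour lookup, look roughly like this in Python; the toy random "space" stands in for a real LSA space.

```python
import numpy as np

rng = np.random.default_rng(8)
space = {w: rng.normal(size=300) for w in ["doctor", "nurse", "hospital", "banana", "guitar"]}

def cosine(w1, w2):
    """Cosine of the angle between two word vectors in the space."""
    a, b = space[w1], space[w2]
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def neighbours(word, n=3):
    """The word's n most similar words in the space."""
    return sorted((w for w in space if w != word), key=lambda w: cosine(word, w), reverse=True)[:n]

print(cosine("doctor", "nurse"))
print(neighbours("doctor"))
```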

Journal ArticleDOI
TL;DR: A systematic overview of how to link and transfer aspect knowledge across corpora written in different languages via the shared space of latent cross-lingual topics is provided, that is, how to effectively employ learned per-topic word distributions and per-document topic distributions of any multilingual probabilistic topic model in various cross-lingsual applications.
Abstract: Probabilistic topic models are unsupervised generative models which model document content as a two-step generation process, that is, documents are observed as mixtures of latent concepts or topics, while topics are probability distributions over vocabulary words. Recently, a significant research effort has been invested into transferring the probabilistic topic modeling concept from monolingual to multilingual settings. Novel topic models have been designed to work with parallel and comparable texts. We define multilingual probabilistic topic modeling (MuPTM) and present the first full overview of the current research, methodology, advantages and limitations in MuPTM. As a representative example, we choose a natural extension of the omnipresent LDA model to multilingual settings called bilingual LDA (BiLDA). We provide a thorough overview of this representative multilingual model from its high-level modeling assumptions down to its mathematical foundations. We demonstrate how to use the data representation by means of output sets of (i) per-topic word distributions and (ii) per-document topic distributions coming from a multilingual probabilistic topic model in various real-life cross-lingual tasks involving different languages, without any external language pair dependent translation resource: (1) cross-lingual event-centered news clustering, (2) cross-lingual document classification, (3) cross-lingual semantic similarity, and (4) cross-lingual information retrieval. We also briefly review several other applications present in the relevant literature, and introduce and illustrate two related modeling concepts: topic smoothing and topic pruning. In summary, this article encompasses the current research in multilingual probabilistic topic modeling. By presenting a series of potential applications, we reveal the importance of the language-independent and language pair independent data representations by means of MuPTM. We provide clear directions for future research in the field by providing a systematic overview of how to link and transfer aspect knowledge across corpora written in different languages via the shared space of latent cross-lingual topics, that is, how to effectively employ learned per-topic word distributions and per-document topic distributions of any multilingual probabilistic topic model in various cross-lingual applications.
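The cross-lingual semantic similarity application described above reduces to comparing per-document topic distributions, since the topic space is shared across languages. A minimal sketch follows, using one minus the Jensen-Shannon divergence as the similarity (that particular choice and the toy distributions are my assumptions, not prescribed by the article).

```python
import numpy as np

def jensen_shannon(p, q):
    """Symmetric divergence between two probability distributions (base-2, in [0, 1])."""
    p, q = np.asarray(p, float), np.asarray(q, float)
    m = 0.5 * (p + q)
    def kl(a, b):
        mask = a > 0
        return float(np.sum(a[mask] * np.log2(a[mask] / b[mask])))
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def cross_lingual_similarity(theta_doc_l1, theta_doc_l2):
    """Compare documents in different languages via their per-document topic distributions,
    which live in the shared cross-lingual topic space (e.g. learned by BiLDA)."""
    return 1.0 - jensen_shannon(theta_doc_l1, theta_doc_l2)

# toy per-document topic distributions over K = 5 shared topics
theta_en = [0.70, 0.10, 0.10, 0.05, 0.05]
theta_nl = [0.60, 0.15, 0.10, 0.10, 0.05]
print(cross_lingual_similarity(theta_en, theta_nl))
```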