
Showing papers on "Ranking (information retrieval) published in 2016"


Proceedings ArticleDOI
26 Jun 2016
TL;DR: A convolutional neural network architecture that is trainable in an end-to-end manner directly for the place recognition task is developed, together with an efficient training procedure that can be applied to very large-scale weakly labelled tasks.
Abstract: We tackle the problem of large scale visual place recognition, where the task is to quickly and accurately recognize the location of a given query photograph. We present the following three principal contributions. First, we develop a convolutional neural network (CNN) architecture that is trainable in an end-to-end manner directly for the place recognition task. The main component of this architecture, NetVLAD, is a new generalized VLAD layer, inspired by the "Vector of Locally Aggregated Descriptors" image representation commonly used in image retrieval. The layer is readily pluggable into any CNN architecture and amenable to training via backpropagation. Second, we develop a training procedure, based on a new weakly supervised ranking loss, to learn parameters of the architecture in an end-to-end manner from images depicting the same places over time downloaded from Google Street View Time Machine. Finally, we show that the proposed architecture significantly outperforms non-learnt image representations and off-the-shelf CNN descriptors on two challenging place recognition benchmarks, and improves over current state-of-the-art compact image representations on standard image retrieval benchmarks.
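
To make the aggregation concrete, here is a minimal numpy sketch of the soft-assignment VLAD aggregation a NetVLAD-style layer performs; the parameter names are illustrative, and in the paper this runs inside the CNN with all parameters trained by backpropagation:

```python
import numpy as np

def netvlad_aggregate(local_descriptors, centers, assign_w, assign_b):
    """Soft-assignment VLAD aggregation (illustrative sketch).

    local_descriptors: (N, D) local CNN features for one image
    centers:           (K, D) learned cluster centres
    assign_w, assign_b: soft-assignment parameters, (K, D) and (K,),
                        implemented in the paper as a 1x1 conv + softmax
    """
    # Soft assignment of each descriptor to each cluster (softmax over K)
    logits = local_descriptors @ assign_w.T + assign_b        # (N, K)
    logits -= logits.max(axis=1, keepdims=True)               # numerical stability
    a = np.exp(logits)
    a /= a.sum(axis=1, keepdims=True)                         # (N, K)

    # Weighted sum of residuals to each centre: V[k] = sum_i a[i,k] * (x_i - c_k)
    residuals = local_descriptors[:, None, :] - centers[None, :, :]  # (N, K, D)
    V = (a[:, :, None] * residuals).sum(axis=0)                      # (K, D)

    # Intra-normalize per cluster, then L2-normalize the flattened vector
    V /= np.linalg.norm(V, axis=1, keepdims=True) + 1e-12
    v = V.flatten()
    return v / (np.linalg.norm(v) + 1e-12)
```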

1,783 citations


Journal ArticleDOI
TL;DR: In this paper, the authors consider the issue of opacity as a problem for socially consequential mechanisms of classification and ranking, such as spam filters, credit card fraud detection, search engines, news, etc.
Abstract: This article considers the issue of opacity as a problem for socially consequential mechanisms of classification and ranking, such as spam filters, credit card fraud detection, search engines, news...

915 citations


Journal ArticleDOI
TL;DR: This paper proposes to eliminate the drawbacks of traditional salient band selection methods via manifold ranking, which places the band vectors in a more accurate manifold space and treats the saliency problem from a novel ranking perspective; these constitute the main contributions of the paper.
Abstract: Saliency detection has been a hot topic in recent years, and many efforts have been devoted to this area. Unfortunately, the results of saliency detection can hardly be utilized in general applications. The primary reason, we think, is the unspecific definition of salient objects, which prevents previously published methods from extending to practical applications. To solve this problem, we claim that saliency should be defined in a context, and salient band selection in hyperspectral images (HSIs) is introduced as an example. Unfortunately, traditional salient band selection methods suffer from inappropriate measurement of band difference. To tackle this problem, we propose to eliminate the drawbacks of traditional salient band selection methods by manifold ranking, which puts the band vectors in a more accurate manifold space and treats the saliency problem from a novel ranking perspective; we consider these the main contributions of this paper. To justify the effectiveness of the proposed method, experiments are conducted on three HSIs, and our method is compared with six existing competitors. Results show that the proposed method is very effective and achieves the best performance among the competitors.
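
For reference, classical manifold ranking propagates query labels over an item graph and has the closed-form solution f* = (I - alpha*S)^-1 y. A minimal numpy sketch under the simplifying assumption of a Gaussian-kernel affinity graph (the paper's exact graph construction over band vectors may differ):

```python
import numpy as np

def manifold_rank(X, query_mask, sigma=1.0, alpha=0.99):
    """Closed-form manifold ranking of the rows of X against query items.

    X:          (n, d) feature vectors (e.g., band vectors of an HSI)
    query_mask: (n,) boolean, True for query/seed items
    Returns one ranking score per item.
    """
    # Gaussian affinity and symmetric normalization S = D^-1/2 W D^-1/2
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    W = np.exp(-d2 / (2 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1) + 1e-12)
    S = W * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

    # f* = (I - alpha * S)^-1 y  (the standard closed-form solution)
    y = query_mask.astype(float)
    n = X.shape[0]
    return np.linalg.solve(np.eye(n) - alpha * S, y)
```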

444 citations


Proceedings ArticleDOI
12 Dec 2016
TL;DR: A deep triplet ranking model for instance-level SBIR is developed, with a novel data augmentation and staged pre-training strategy to alleviate the issue of insufficient fine-grained training data.
Abstract: We investigate the problem of fine-grained sketch-based image retrieval (SBIR), where free-hand human sketches are used as queries to perform instance-level retrieval of images. This is an extremely challenging task because (i) visual comparisons not only need to be fine-grained but also executed cross-domain, (ii) free-hand (finger) sketches are highly abstract, making fine-grained matching harder, and most importantly (iii) annotated cross-domain sketch-photo datasets required for training are scarce, challenging many state-of-the-art machine learning techniques. In this paper, for the first time, we address all these challenges, providing a step towards the capabilities that would underpin a commercial sketch-based image retrieval application. We introduce a new database of 1,432 sketch-photo pairs from two categories with 32,000 fine-grained triplet ranking annotations. We then develop a deep triplet ranking model for instance-level SBIR with a novel data augmentation and staged pre-training strategy to alleviate the issue of insufficient fine-grained training data. Extensive experiments are carried out to contribute a variety of insights into the challenges of data sufficiency and over-fitting avoidance when training deep networks for fine-grained cross-domain ranking tasks.
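
The triplet ranking objective at the heart of such a model fits in a few lines; a hedged numpy sketch, assuming sketch and photo embeddings have already been computed by the network branches (margin value illustrative):

```python
import numpy as np

def triplet_ranking_loss(anchor, positive, negative, margin=0.3):
    """Hinge triplet loss: the sketch (anchor) should lie closer to its
    matching photo (positive) than to a non-matching photo (negative)
    by at least `margin`.

    anchor, positive, negative: (B, D) L2-normalized embeddings.
    """
    d_pos = ((anchor - positive) ** 2).sum(axis=1)   # squared distances
    d_neg = ((anchor - negative) ** 2).sum(axis=1)
    return np.maximum(0.0, margin + d_pos - d_neg).mean()
```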

420 citations


Journal ArticleDOI
TL;DR: A Max-Relevance-Max-Distance (MRMD) feature ranking method that balances the accuracy and stability of feature ranking and prediction, and runs faster than other filter and wrapper methods such as mRMR and Information Gain.

344 citations


Posted Content
TL;DR: This work proposes to learn a deep convolutional neural network to rank photo aesthetics, in which the relative ranking of photo aesthetics is directly modeled in the loss function.
Abstract: Real-world applications could benefit from the ability to automatically generate a fine-grained ranking of photo aesthetics. However, previous methods for image aesthetics analysis have primarily focused on the coarse, binary categorization of images into high- or low-aesthetic categories. In this work, we propose to learn a deep convolutional neural network to rank photo aesthetics, in which the relative ranking of photo aesthetics is directly modeled in the loss function. Our model incorporates joint learning of meaningful photographic attributes and image content information which can help regularize the complicated photo aesthetics rating problem. To train and analyze this model, we have assembled a new aesthetics and attributes database (AADB) which contains aesthetic scores and meaningful attributes assigned to each image by multiple human raters. Anonymized rater identities are recorded across images, allowing us to exploit intra-rater consistency using a novel sampling strategy when computing the ranking loss of training image pairs. We show the proposed sampling strategy is very effective and robust in the face of subjective judgements of image aesthetics by individuals with different aesthetic tastes. Experiments demonstrate that our unified model can generate aesthetic rankings that are more consistent with human ratings. To further validate our model, we show that by simply thresholding the estimated aesthetic scores, we are able to achieve state-of-the-art classification performance on the existing AVA dataset benchmark.
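
The paper models relative ranking directly in the loss; a minimal sketch of a pairwise margin ranking term combined with regression to human ratings, assuming predicted scores and reference ratings are available (the exact combination, margin, and the intra-rater pair sampling are the paper's own and are simplified here):

```python
import numpy as np

def aesthetics_pair_loss(score_i, score_j, rating_i, rating_j,
                         margin=0.1, reg_weight=1.0):
    """Pairwise ranking + regression loss for photo aesthetics (sketch).

    score_*:  model-predicted aesthetic scores for an image pair
    rating_*: mean human ratings for the same images
    If image i is rated higher than j, its predicted score should
    exceed j's by at least `margin`.
    """
    sign = np.sign(rating_i - rating_j)
    rank_loss = max(0.0, margin - sign * (score_i - score_j))
    reg_loss = (score_i - rating_i) ** 2 + (score_j - rating_j) ** 2
    return rank_loss + reg_weight * reg_loss
```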

275 citations


Book ChapterDOI
08 Oct 2016
TL;DR: In this paper, a deep convolutional neural network is proposed to rank photo aesthetics, in which the relative ranking of photo aesthetics is directly modeled in the loss function, in contrast to previous methods that reduce image aesthetics analysis to binary classification.
Abstract: Real-world applications could benefit from the ability to automatically generate a fine-grained ranking of photo aesthetics. However, previous methods for image aesthetics analysis have primarily focused on the coarse, binary categorization of images into high- or low-aesthetic categories. In this work, we propose to learn a deep convolutional neural network to rank photo aesthetics, in which the relative ranking of photo aesthetics is directly modeled in the loss function. Our model incorporates joint learning of meaningful photographic attributes and image content information which can help regularize the complicated photo aesthetics rating problem.

231 citations


Journal ArticleDOI
TL;DR: A novel approach to person re-identification, a fundamental task in distributed multi-camera surveillance systems, is proposed that significantly outperforms all the state-of-the-art approaches, including both traditional and CNN-based methods on the challenging VIPeR, CUHK-01, and CAVIAR4REID datasets.
Abstract: This paper proposes a novel approach to person re-identification, a fundamental task in distributed multi-camera surveillance systems. Although a variety of powerful algorithms have been presented in the past few years, most of them usually focus on designing hand-crafted features and learning metrics either individually or sequentially. Different from previous works, we formulate a unified deep ranking framework that jointly tackles both of these key components to maximize their strengths. We start from the principle that the correct match of the probe image should be positioned in the top rank within the whole gallery set. An effective learning-to-rank algorithm is proposed to minimize the cost corresponding to the ranking disorders of the gallery. The ranking model is solved with a deep convolutional neural network (CNN) that builds the relation between input image pairs and their similarity scores through joint representation learning directly from raw image pixels. The proposed framework allows us to get rid of feature engineering and does not rely on any assumption. An extensive comparative evaluation is given, demonstrating that our approach significantly outperforms all the state-of-the-art approaches, including both traditional and CNN-based methods on the challenging VIPeR, CUHK-01, and CAVIAR4REID datasets. In addition, our approach has better ability to generalize across datasets without fine-tuning.

221 citations


Proceedings ArticleDOI
24 Oct 2016
TL;DR: This article proposes an attention-based neural matching model for ranking short answer text, which adopts a value-shared weighting scheme instead of a position-shared weighting scheme for combining different matching signals, and incorporates question term importance learning using a question attention network.
Abstract: As an alternative to question answering methods based on feature engineering, deep learning approaches such as convolutional neural networks (CNNs) and Long Short-Term Memory Models (LSTMs) have recently been proposed for semantic matching of questions and answers. To achieve good results, however, these models have been combined with additional features such as word overlap or BM25 scores. Without this combination, these models perform significantly worse than methods based on linguistic feature engineering. In this paper, we propose an attention based neural matching model for ranking short answer text. We adopt value-shared weighting scheme instead of position-shared weighting scheme for combining different matching signals and incorporate question term importance learning using question attention network. Using the popular benchmark TREC QA data, we show that the relatively simple aNMM model can significantly outperform other neural network models that have been used for the question answering task, and is competitive with models that are combined with additional features. When aNMM is combined with additional features, it outperforms all baselines.
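
The value-shared idea can be sketched as binning matching signals by value rather than by position; a minimal numpy illustration of our reading of the mechanism (bin edges are illustrative, and the paper's separate exact-match bin is folded into the last bin here):

```python
import numpy as np

def value_shared_features(sim_matrix, n_bins=10):
    """Group question-answer matching signals into bins by *value*
    (not position) and sum the signal mass per bin -- a sketch of the
    value-shared weighting idea in aNMM.

    sim_matrix: (n_question_terms, n_answer_terms) cosine similarities
                in [-1, 1].
    Returns (n_question_terms, n_bins) binned signals; a single learned
    weight vector over bins then scores every question term, and a
    question attention network mixes the per-term scores.
    """
    edges = np.linspace(-1.0, 1.0, n_bins + 1)
    return np.stack([
        np.histogram(row, bins=edges, weights=row)[0] for row in sim_matrix
    ])
```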

200 citations


Proceedings ArticleDOI
24 Oct 2016
TL;DR: A suite of query expansion methods based on word embeddings that use the CBOW embedding approach to select terms semantically related to the query, either expanding the original query with them or integrating them with the effective pseudo-feedback-based relevance model.
Abstract: We present a suite of query expansion methods that are based on word embeddings. Using Word2Vec's CBOW embedding approach, applied over the entire corpus on which search is performed, we select terms that are semantically related to the query. Our methods either use the terms to expand the original query or integrate them with the effective pseudo-feedback-based relevance model. In the former case, retrieval performance is significantly better than that of using only the query, and in the latter case the performance is significantly better than that of the relevance model.
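
A minimal sketch of this style of expansion using gensim's Word2Vec in CBOW mode; the library, parameters, and the `tokenized_docs` corpus variable are our assumptions, not the paper's code:

```python
from gensim.models import Word2Vec

# Train CBOW embeddings over the retrieval corpus (sg=0 selects CBOW).
# `tokenized_docs` is assumed: a list of token lists for the corpus.
model = Word2Vec(sentences=tokenized_docs, vector_size=300,
                 window=5, sg=0, min_count=5)

def expand_query(query_terms, k=10):
    """Return the k corpus terms most similar to the whole query,
    to be appended to the query or folded into a relevance model."""
    in_vocab = [t for t in query_terms if t in model.wv]
    if not in_vocab:
        return []
    return [term for term, _ in
            model.wv.most_similar(positive=in_vocab, topn=k)]
```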

194 citations


Journal ArticleDOI
Zeshui Xu, Na Zhao
TL;DR: An overview of the existing intuitionistic fuzzy decision making theories and methods from the perspective of information fusion, involving the determination of attribute weights, the aggregation of intuitionistic fuzzy information, and the ranking of alternatives, is presented.

Journal ArticleDOI
TL;DR: A ranking aggregation algorithm is proposed to enhance the detection of similarity and dissimilarity, based on the assumption that the true match should be similar to the probe across different baseline methods.
Abstract: Person reidentification is a key technique to match different persons observed in nonoverlapping camera views. Many researchers treat it as a special object-retrieval problem, where ranking optimization plays an important role. Existing ranking optimization methods mainly utilize the similarity relationship between the probe and gallery images to optimize the original ranking list, but seldom consider the important dissimilarity relationship. In this paper, we propose to use both similarity and dissimilarity cues in a ranking optimization framework for person reidentification. Its core idea is that the true match should not only be similar to those strongly similar galleries of the probe, but also be dissimilar to those strongly dissimilar galleries of the probe. Furthermore, motivated by the philosophy of multiview verification, a ranking aggregation algorithm is proposed to enhance the detection of similarity and dissimilarity based on the following assumption: the true match should be similar to the probe in different baseline methods. In other words, if a gallery image is strongly similar to the probe in one method, while simultaneously strongly dissimilar to the probe in another method, it will probably be a wrong match of the probe. Extensive experiments conducted on public benchmark datasets and comparisons with different baseline methods have shown the great superiority of the proposed ranking optimization method.

Proceedings ArticleDOI
11 Apr 2016
TL;DR: The experimental results show that a neural click model trained on the same data as traditional PGM-based click models has better performance on both the click prediction task and the relevance prediction task.
Abstract: Understanding user browsing behavior in web search is key to improving web search effectiveness. Many click models have been proposed to explain or predict user clicks on search engine results. They are based on the probabilistic graphical model (PGM) framework, in which user behavior is represented as a sequence of observable and hidden events. The PGM framework provides a mathematically solid way to reason about a set of events given some information about other events. But the structure of the dependencies between the events has to be set manually. Different click models use different hand-crafted sets of dependencies. We propose an alternative based on the idea of distributed representations: to represent the user's information need and the information available to the user with a vector state. The components of the vector state are learned to represent concepts that are useful for modeling user behavior. And user behavior is modeled as a sequence of vector states associated with a query session: the vector state is initialized with a query, and then iteratively updated based on information about interactions with the search engine results. This approach allows us to directly understand user browsing behavior from click-through data, i.e., without the need for a predefined set of rules as is customary for PGM-based click models. We illustrate our approach using a set of neural click models. Our experimental results show that the neural click model that uses the same training data as traditional PGM-based click models has better performance on the click prediction task (i.e., predicting user clicks on search engine results) and the relevance prediction task (i.e., ranking documents by their relevance to a query). An analysis of the best performing neural click model shows that it learns similar concepts to those used in traditional click models, and that it also learns other concepts that cannot be designed manually.
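
A toy sketch of the distributed-representation idea, assuming a plain RNN-style update (the paper evaluates several concrete architectures, e.g. LSTM-based; all parameter names here are illustrative):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def simulate_session(query_vec, result_vecs, params):
    """One query session in a minimal neural click model (sketch).

    The vector state s is initialized from the query and updated after
    examining each result; click probability is a readout of the state.
    params: dict with matrices W_q, W_s, W_r and readout vector w_o,
    all assumed trained by backpropagation on click logs.
    """
    s = np.tanh(params["W_q"] @ query_vec)           # init state from query
    click_probs = []
    for r in result_vecs:                            # iterate SERP positions
        p_click = sigmoid(params["w_o"] @ np.concatenate([s, r]))
        click_probs.append(p_click)
        # update the state with the examined result (simple RNN step)
        s = np.tanh(params["W_s"] @ s + params["W_r"] @ r)
    return click_probs
```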

Proceedings ArticleDOI
11 Apr 2016
TL;DR: This paper investigates the popular neural word embedding method Word2vec as a source of evidence in document ranking and proposes the Dual Embedding Space Model (DESM), which provides evidence that a document is about a query term.
Abstract: This paper investigates the popular neural word embedding method Word2vec as a source of evidence in document ranking. In contrast to NLP applications of word2vec, which tend to use only the input embeddings, we retain both the input and the output embeddings, allowing us to calculate a different word similarity that may be more suitable for document ranking. We map the query words into the input space and the document words into the output space, and compute a relevance score by aggregating the cosine similarities across all the query-document word pairs. We postulate that the proposed Dual Embedding Space Model (DESM) provides evidence that a document is about a query term, in addition to and complementing the traditional term frequency based approach.
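
The scoring rule is concrete enough to sketch; a hedged numpy version in which the document is represented by the centroid of its normalized OUT vectors and the score averages the cosine similarities of query IN vectors to that centroid:

```python
import numpy as np

def desm_score(query_terms, doc_terms, emb_in, emb_out):
    """Dual Embedding Space Model score (sketch of the paper's idea).

    emb_in / emb_out: dicts mapping terms to word2vec input / output
    vectors. Out-of-vocabulary terms are skipped.
    """
    doc_vecs = np.array([emb_out[t] for t in doc_terms if t in emb_out])
    if len(doc_vecs) == 0:
        return 0.0
    # Normalize document vectors, take their centroid, renormalize
    doc_vecs /= np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    centroid = doc_vecs.mean(axis=0)
    centroid /= np.linalg.norm(centroid) + 1e-12

    # Average cosine similarity of query IN vectors to the centroid
    sims = []
    for t in query_terms:
        if t in emb_in:
            q = emb_in[t] / (np.linalg.norm(emb_in[t]) + 1e-12)
            sims.append(q @ centroid)
    return float(np.mean(sims)) if sims else 0.0
```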

Journal ArticleDOI
TL;DR: Two benchmark datasets for structural similarity are created that can be used to guide the development of improved measures; the performance of the ECFP fingerprints significantly improves if the bit-vector length is increased from 1024 to 16,384, and the atom pair fingerprint outperforms the others tested when ranking very close analogues.
Abstract: The concept of molecular similarity is one of the central ideas in cheminformatics, despite the fact that it is ill-defined and rather difficult to assess objectively. Here we propose a practical definition of molecular similarity in the context of drug discovery: molecules A and B are similar if a medicinal chemist would be likely to synthesise and test them around the same time as part of the same medicinal chemistry program. The attraction of such a definition is that it matches one of the key uses of similarity measures in early-stage drug discovery. If we make the assumption that molecules in the same compound activity table in a medicinal chemistry paper were considered similar by the authors of the paper, we can create a dataset of similar molecules from the medicinal chemistry literature. Furthermore, molecules with decreasing levels of similarity to a reference can be found by either ordering molecules in an activity table by their activity, or by considering activity tables in different papers which have at least one molecule in common. Using this procedure with activity data from ChEMBL, we have created two benchmark datasets for structural similarity that can be used to guide the development of improved measures. Compared to similar results from a virtual screen, these benchmarks are an order of magnitude more sensitive to differences between fingerprints both because of their size and because they avoid loss of statistical power due to the use of mean scores or ranks. We measure the performance of 28 different fingerprints on the benchmark sets and compare the results to those from the Riniker and Landrum (J Cheminf 5:26, 2013, doi:10.1186/1758-2946-5-26) ligand-based virtual screening benchmark. Extended-connectivity fingerprints of diameter 4 and 6 are among the best performing fingerprints when ranking diverse structures by similarity, as is the topological torsion fingerprint. However, when ranking very close analogues, the atom pair fingerprint outperforms the others tested. When ranking diverse structures or carrying out a virtual screen, we find that the performance of the ECFP fingerprints significantly improves if the bit-vector length is increased from 1024 to 16,384. Graphical abstract: An example series from one of the benchmark datasets. Each fingerprint is assessed on its ability to reproduce a specific series order.
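
As a usage illustration (ours, not the paper's code): ranking candidates against a reference by Tanimoto similarity of Morgan fingerprints (RDKit's ECFP analogue), with the longer 16,384-bit vectors the paper recommends:

```python
from rdkit import Chem, DataStructs
from rdkit.Chem import AllChem

def rank_by_similarity(ref_smiles, candidate_smiles, n_bits=16384):
    """Rank candidate molecules by Tanimoto similarity of Morgan
    fingerprints (radius 2, i.e. ECFP4-like) to a reference molecule."""
    ref = AllChem.GetMorganFingerprintAsBitVect(
        Chem.MolFromSmiles(ref_smiles), 2, nBits=n_bits)
    scored = []
    for smi in candidate_smiles:
        mol = Chem.MolFromSmiles(smi)
        if mol is None:
            continue  # skip unparsable SMILES
        fp = AllChem.GetMorganFingerprintAsBitVect(mol, 2, nBits=n_bits)
        scored.append((smi, DataStructs.TanimotoSimilarity(ref, fp)))
    return sorted(scored, key=lambda x: x[1], reverse=True)

# Example: rank two candidates against aspirin
# rank_by_similarity("CC(=O)Oc1ccccc1C(=O)O", ["c1ccccc1O", "CCO"])
```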

Proceedings ArticleDOI
27 Jun 2016
TL;DR: This paper proposes to formulate high-order binary codes learning as a multi-label classification problem by explicitly separating learning into two interleaved stages; the original image is then mapped to compact binary codes via carefully designed deep convolutional neural networks, and the hashing function fitting is solved by training binary CNN classifiers.
Abstract: In this paper, we aim to learn a mapping (or embedding) from images to a compact binary space in which Hamming distances correspond to a ranking measure for the image retrieval task. We make use of a triplet loss because this has been shown to be most effective for ranking problems. However, training in previous works can be prohibitively expensive due to the fact that optimization is directly performed on the triplet space, where the number of possible triplets for training is cubic in the number of training examples. To address this issue, we propose to formulate high-order binary codes learning as a multi-label classification problem by explicitly separating learning into two interleaved stages. To solve the first stage, we design a large-scale high-order binary codes inference algorithm to reduce the high-order objective to a standard binary quadratic problem such that graph cuts can be used to efficiently infer the binary codes which serve as the labels of each training datum. In the second stage we propose to map the original image to compact binary codes via carefully designed deep convolutional neural networks (CNNs) and the hashing function fitting can be solved by training binary CNN classifiers. An incremental/interleaved optimization strategy is proffered to ensure that these two steps are interactive with each other during training for better accuracy. We conduct experiments on several benchmark datasets, which demonstrate both improved training time (by as much as two orders of magnitude) as well as producing state-of-the-art hashing for various retrieval tasks.

Proceedings ArticleDOI
26 Apr 2016
TL;DR: This research implements Term Frequency-Inverse Document Frequency (TF-IDF) weighting and cosine similarity to measure the degree of similarity between terms in documents, producing a ranking of documents by how closely they match the expert's document.
Abstract: Developments in technology in the educational field bring easier ways to facilitate the learning process, file sharing, assignment, and assessment. Automated Essay Scoring (AES) is a system for determining a score automatically from a source text document, facilitating correction and scoring through applications that run on the computer. AES helps lecturers score efficiently and effectively, and can also reduce the problem of subjectivity in scoring. However, the implementation of AES depends on many factors, such as the language and the mechanism of the scoring process, especially for essay scoring. A number of methods have been defined for weighting the terms of a document and for comparing the level of match between answer documents and the expert's document. In this research, we implement Term Frequency-Inverse Document Frequency (TF-IDF) weighting and cosine similarity, measuring the degree of similarity between terms in a document. Tests were carried out on a number of Indonesian text documents that had gone through a pre-processing stage for data extraction. The process results in a ranking of document weights by their level of match with the expert's document.
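
A minimal scikit-learn sketch of the described pipeline; the library choice is ours, not the paper's, and documents are assumed to be already pre-processed (e.g., stemmed Indonesian text, as in the paper):

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def score_essays(expert_doc, student_docs):
    """Rank student answers by TF-IDF cosine similarity to the
    expert's answer document."""
    vectorizer = TfidfVectorizer()
    # Fit on expert + student documents so both share one vocabulary
    matrix = vectorizer.fit_transform([expert_doc] + student_docs)
    sims = cosine_similarity(matrix[0], matrix[1:]).ravel()
    # Highest-similarity answers first
    return sorted(zip(student_docs, sims), key=lambda x: x[1], reverse=True)
```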

Proceedings ArticleDOI
12 Sep 2016
TL;DR: This paper proposes to use word embeddings to incorporate and weight terms that do not occur in the query, but are semantically related to the query terms, and develops an embedding-based relevance model, an extension of the effective and robust relevance model approach.
Abstract: Word embeddings, which are low-dimensional vector representations of vocabulary terms that capture the semantic similarity between them, have recently been shown to achieve impressive performance in many natural language processing tasks. The use of word embeddings in information retrieval, however, has only begun to be studied. In this paper, we explore the use of word embeddings to enhance the accuracy of query language models in the ad-hoc retrieval task. To this end, we propose to use word embeddings to incorporate and weight terms that do not occur in the query, but are semantically related to the query terms. We describe two embedding-based query expansion models with different assumptions. Since pseudo-relevance feedback methods that use the top retrieved documents to update the original query model are well-known to be effective, we also develop an embedding-based relevance model, an extension of the effective and robust relevance model approach. In these models, we transform the similarity values obtained by the widely-used cosine similarity with a sigmoid function to have more discriminative semantic similarity values. We evaluate our proposed methods using three TREC newswire and web collections. The experimental results demonstrate that the embedding-based methods significantly outperform competitive baselines in most cases. The embedding-based methods are also shown to be more robust than the baselines.

Posted Content
TL;DR: The proposed Dual Embedding Space Model (DESM) captures evidence on whether a document is about a query term, in addition to what is modelled by traditional term-frequency based approaches, and it is shown that the DESM can re-rank top documents returned by a commercial Web search engine, like Bing, better than a term-matching based signal like TF-IDF.
Abstract: A fundamental goal of search engines is to identify, given a query, documents that have relevant text. This is intrinsically difficult because the query and the document may use different vocabulary, or the document may contain query words without being relevant. We investigate neural word embeddings as a source of evidence in document ranking. We train a word2vec embedding model on a large unlabelled query corpus, but in contrast to how the model is commonly used, we retain both the input and the output projections, allowing us to leverage both the embedding spaces to derive richer distributional relationships. During ranking we map the query words into the input space and the document words into the output space, and compute a query-document relevance score by aggregating the cosine similarities across all the query-document word pairs. We postulate that the proposed Dual Embedding Space Model (DESM) captures evidence on whether a document is about a query term in addition to what is modelled by traditional term-frequency based approaches. Our experiments show that the DESM can re-rank top documents returned by a commercial Web search engine, like Bing, better than a term-matching based signal like TF-IDF. However, when ranking a larger set of candidate documents, we find the embeddings-based approach is prone to false positives, retrieving documents that are only loosely related to the query. We demonstrate that this problem can be solved effectively by ranking based on a linear mixture of the DESM and the word counting features.

Proceedings ArticleDOI
24 Oct 2016
TL;DR: The Noise-Contrastive Estimation approach is extended with a triplet ranking loss function to exploit interactions in triplet inputs over the question paired with positive and negative examples, achieving state-of-the-art effectiveness without the need for external knowledge sources or feature engineering.
Abstract: We study answer selection for question answering, in which given a question and a set of candidate answer sentences, the goal is to identify the subset that contains the answer. Unlike previous work which treats this task as a straightforward pointwise classification problem, we model this problem as a ranking task and propose a pairwise ranking approach that can directly exploit existing pointwise neural network models as base components. We extend the Noise-Contrastive Estimation approach with a triplet ranking loss function to exploit interactions in triplet inputs over the question paired with positive and negative examples. Experiments on TrecQA and WikiQA datasets show that our approach achieves state-of-the-art effectiveness without the need for external knowledge sources or feature engineering.

Proceedings ArticleDOI
13 Aug 2016
TL;DR: This paper introduces three key techniques for base relevance -- ranking functions, semantic matching features, and query rewriting -- and describes solutions for recency-sensitive relevance and location-sensitive relevance.
Abstract: Search engines play a crucial role in our daily lives. Relevance is the core problem of a commercial search engine. It has attracted thousands of researchers from both academia and industry and has been studied for decades. Relevance in a modern search engine has gone far beyond text matching, and now involves tremendous challenges. The semantic gap between queries and URLs is the main barrier for improving base relevance. Clicks help provide hints to improve relevance, but unfortunately for most tail queries, the click information is too sparse, noisy, or missing entirely. For comprehensive relevance, the recency and location sensitivity of results is also critical. In this paper, we give an overview of the solutions for relevance in the Yahoo search engine. We introduce three key techniques for base relevance -- ranking functions, semantic matching features and query rewriting. We also describe solutions for recency-sensitive relevance and location-sensitive relevance. This work builds upon 20 years of existing efforts on Yahoo search, summarizes the most recent advances and provides a series of practical relevance solutions. The performance reported is based on Yahoo's commercial search engine, where tens of billions of URLs are indexed and served by the ranking system.

Proceedings ArticleDOI
01 Aug 2016
TL;DR: The experiments show that surface textual features do not perform well compared to argumentation-based features, and that social-interaction-based features are effective, especially when more users participate in the discussion.
Abstract: In this paper we study how to identify persuasive posts in online forum discussions, using data from the Change My View sub-Reddit. Our analysis confirms that the users' voting score for a comment is highly correlated with its metadata information, such as published time and author reputation. In this work, we propose and evaluate other features to rank comments for their persuasive scores, including textual information in the comments and social-interaction-related features. Our experiments show that surface textual features do not perform well compared to argumentation-based features, and that social-interaction-based features are effective, especially when more users participate in the discussion.

Book ChapterDOI
20 Nov 2016
TL;DR: This work first trains two Convolutional Neural Networks to recognize the expressions of humans and stylized characters independently, and then utilizes a transfer learning technique to learn the mapping from humans to characters, creating a shared embedding feature space.
Abstract: We propose DeepExpr, a novel expression transfer approach from humans to multiple stylized characters. We first train two Convolutional Neural Networks to recognize the expression of humans and stylized characters independently. Then we utilize a transfer learning technique to learn the mapping from humans to characters to create a shared embedding feature space. This embedding also allows human expression-based image retrieval and character expression-based image retrieval. We use our perceptual model to retrieve character expressions corresponding to humans. We evaluate our method on a set of retrieval tasks on our collected stylized character dataset of expressions. We also show that the ranking order predicted by the proposed features is highly correlated with the ranking order provided by a facial expression expert and Mechanical Turk experiments.

Posted Content
TL;DR: It is found that the word2vec-based query expansion methods perform similarly with and without any feedback information, and that the proposed method fails to achieve comparable performance with statistical co-occurrence-based feedback methods such as RM3.
Abstract: In this paper a framework for Automatic Query Expansion (AQE) is proposed using the distributed neural language model word2vec. Using semantic and contextual relations in a distributed and unsupervised framework, word2vec learns a low-dimensional embedding for each vocabulary entry. Using such a framework, we devise a query expansion technique, where terms related to a query are obtained by a K-nearest neighbor approach. We explore the performance of the AQE methods, with and without feedback query expansion, and a variant of simple K-nearest neighbor in the proposed framework. Experiments on standard TREC ad-hoc data (Disks 4 and 5 with query sets 301-450, 601-700) and web data (WT10G data with query set 451-550) show significant improvement over standard term-overlapping based retrieval methods. However, the proposed method fails to achieve comparable performance with statistical co-occurrence based feedback methods such as RM3. We have also found that the word2vec based query expansion methods perform similarly with and without any feedback information.

Proceedings ArticleDOI
26 Jun 2016
TL;DR: This paper presents the preliminary design of CompassQL, a specification language for querying over the space of visualizations, which defines a partial specification that describes enumeration constraints, and methods for choosing, ranking, and grouping recommended visualizations.
Abstract: Creating effective visualizations requires domain familiarity as well as design and analysis expertise, and may impose a tedious specification process. To address these difficulties, many visualization tools complement manual specification with recommendations. However, designing interfaces, ranking metrics, and scalable recommender systems remain important research challenges. In this paper, we propose a common framework for facilitating the development of visualization recommender systems in the form of a specification language for querying over the space of visualizations. We present the preliminary design of CompassQL, which defines (1) a partial specification that describes enumeration constraints, and (2) methods for choosing, ranking, and grouping recommended visualizations. To demonstrate the expressivity of the language, we describe existing recommender systems in terms of CompassQL queries. Finally, we discuss the prospective benefits of a common language for future visualization recommender systems.

Posted Content
Nikolay Savinov, Akihito Seki, Lubor Ladicky, Torsten Sattler, Marc Pollefeys
TL;DR: This paper is the first to propose such a formulation: training a neural network to rank points in a transformation-invariant manner, and it shows that this unsupervised method performs better than or on par with baselines on two tasks.
Abstract: Several machine learning tasks require representing the data using only a sparse set of interest points. An ideal detector is able to find the corresponding interest points even if the data undergo a transformation typical for a given domain. Since the task is of high practical interest in computer vision, many hand-crafted solutions have been proposed. In this paper, we ask a fundamental question: can we learn such detectors from scratch? Since it is often unclear what points are "interesting", human labelling cannot be used to find a truly unbiased solution. Therefore, the task requires an unsupervised formulation. We are the first to propose such a formulation: training a neural network to rank points in a transformation-invariant manner. Interest points are then extracted from the top/bottom quantiles of this ranking. We validate our approach on two tasks: standard RGB image interest point detection and challenging cross-modal interest point detection between RGB and depth images. We quantitatively show that our unsupervised method performs better than or on par with baselines.

Journal ArticleDOI
TL;DR: It is proved that MRMF has good generalizability, and an efficient optimization algorithm based on the Nesterov method is developed; MRMF enforces similar saliency values on neighboring superpixels and ranks superpixels according to the learned coefficients.
Abstract: Saliency detection is used to identify the most important and informative area in a scene, and it is widely used in various vision tasks, including image quality assessment, image matching, and object recognition. Manifold ranking (MR) has been used to great effect for saliency detection, since it not only incorporates the local spatial information but also utilizes the labeling information from background queries. However, MR completely ignores the feature information extracted from each superpixel. In this paper, we propose an MR-based matrix factorization (MRMF) method to overcome this limitation. MRMF models the ranking problem in the matrix factorization framework and embeds query sample labels in the coefficients. By incorporating spatial information and embedding labels, MRMF enforces similar saliency values on neighboring superpixels and ranks superpixels according to the learned coefficients. We prove that the MRMF has good generalizability, and we develop an efficient optimization algorithm based on the Nesterov method. Experiments using popular benchmark data sets illustrate the promise of MRMF compared with the other state-of-the-art saliency detection methods.

Journal ArticleDOI
TL;DR: This paper introduces a new hybrid filter-wrapper method for clustering, which combines the spectral feature selection framework using the Laplacian Score ranking and a modified Calinski-Harabasz index.

Journal ArticleDOI
TL;DR: This paper presents a fast ranking method to evaluate the influence capability of nodes using a k-shell iteration factor and provides a more reasonable ranking list than previous methods.
Abstract: Identifying the influential nodes of complex networks is important for optimizing the network structure or efficiently disseminating information through networks. The k-shell method is a widely used node ranking method that has inherent advantages in performance and efficiency. However, the iteration information produced in k-shell decomposition has been neglected in node ranking. This paper presents a fast ranking method to evaluate the influence capability of nodes using a k-shell iteration factor. The experimental results with respect to monotonicity, correctness and efficiency have demonstrated that the proposed method can yield excellent performance on artificial and real world networks. It discriminates the influence capability of nodes more accurately and provides a more reasonable ranking list than previous methods.
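
A sketch of the bookkeeping such a method relies on: a k-shell decomposition that also records each node's pruning iteration within its shell, using networkx for graph handling. The paper's exact formula combining shell index and iteration factor is not reproduced here:

```python
import networkx as nx

def kshell_with_iterations(G):
    """k-shell decomposition that records, for each node, the pruning
    iteration within its shell -- the information a k-shell iteration
    factor builds on.
    Returns {node: (shell_index, iteration_within_shell)}."""
    H = G.copy()
    info, k = {}, 0
    while H.number_of_nodes() > 0:
        k += 1
        iteration = 0
        # repeatedly strip nodes of degree <= k until none remain
        while True:
            to_remove = [n for n, d in H.degree() if d <= k]
            if not to_remove:
                break
            iteration += 1
            for n in to_remove:
                info[n] = (k, iteration)
            H.remove_nodes_from(to_remove)
    return info
```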

Proceedings Article
12 Feb 2016
TL;DR: This paper proposes a more robust approach for scoring concepts in order to alleviate many of the brittleness and low precision problems of previous work, and proposes a novel pairwise order matrix approach for score aggregation.
Abstract: Vast quantities of videos are now being captured at astonishing rates, but the majority of these are not labelled. To cope with such data, we consider the task of content-based activity recognition in videos without any manually labelled examples, also known as zero-shot video recognition. To achieve this, videos are represented in terms of detected visual concepts, which are then scored as relevant or irrelevant according to their similarity with a given textual query. In this paper, we propose a more robust approach for scoring concepts in order to alleviate many of the brittleness and low precision problems of previous work. Not only do we jointly consider semantic relatedness, visual reliability, and discriminative power. To handle noise and non-linearities in the ranking scores of the selected concepts, we propose a novel pairwise order matrix approach for score aggregation. Extensive experiments on the large-scale TRECVID Multimedia Event Detection data show the superiority of our approach.