Topic

Ranking (information retrieval)

About: Ranking (information retrieval) is a research topic. Over its lifetime, 21,109 publications have been published within this topic, receiving 435,130 citations.


Papers
Proceedings ArticleDOI
21 Aug 2017
TL;DR: This work presents S3, a new repair synthesis engine that leverages programming-by-examples methodology to synthesize high-quality bug repairs and compares S3’s repair effectiveness with state-of-the-art synthesis engines Angelix, Enumerative, and CVC4.
Abstract: A notable class of techniques for automatic program repair is known as semantics-based. Such techniques, e.g., Angelix, infer semantic specifications via symbolic execution, and then use program synthesis to construct new code that satisfies those inferred specifications. However, the obtained specifications are naturally incomplete, leaving the synthesis engine with a difficult task of synthesizing a general solution from a sparse space of many possible solutions that are consistent with the provided specifications but that do not necessarily generalize. We present S3, a new repair synthesis engine that leverages programming-by-examples methodology to synthesize high-quality bug repairs. The novelty in S3 that allows it to tackle the sparse search space to create more general repairs is three-fold: (1) A systematic way to customize and constrain the syntactic search space via a domain-specific language, (2) An efficient enumeration-based search strategy over the constrained search space, and (3) A number of ranking features based on measures of the syntactic and semantic distances between candidate solutions and the original buggy program. We compare S3’s repair effectiveness with state-of-the-art synthesis engines Angelix, Enumerative, and CVC4. S3 can successfully and correctly fix at least three times more bugs than the best baseline on datasets of 52 bugs in small programs, and 100 bugs in real-world large programs.
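The ranking idea in point (3) can be illustrated with a minimal sketch: candidate repairs that are syntactically and semantically closer to the original buggy program are preferred among all candidates that satisfy the inferred specification. The distance measures and weights below are illustrative assumptions, not S3's actual features.

```python
import difflib

def syntactic_distance(original_src: str, candidate_src: str) -> float:
    """Token-level proxy for syntactic distance: 1 minus the similarity
    ratio of the two token sequences (0 = identical, 1 = unrelated)."""
    matcher = difflib.SequenceMatcher(None, original_src.split(), candidate_src.split())
    return 1.0 - matcher.ratio()

def semantic_distance(original_outputs, candidate_outputs) -> float:
    """Hypothetical semantic measure: fraction of probe inputs on which the
    candidate's observable behavior differs from the original program's."""
    diffs = sum(1 for a, b in zip(original_outputs, candidate_outputs) if a != b)
    return diffs / max(len(original_outputs), 1)

def rank_candidates(buggy_src, candidates, original_outputs, w_syn=0.5, w_sem=0.5):
    """Order candidate patches (all assumed to pass the given examples) by a
    weighted sum of the two distances; lower score ranks higher."""
    scored = []
    for cand_src, cand_outputs in candidates:
        score = (w_syn * syntactic_distance(buggy_src, cand_src)
                 + w_sem * semantic_distance(original_outputs, cand_outputs))
        scored.append((score, cand_src))
    return [src for _, src in sorted(scored, key=lambda t: t[0])]
```

In this sketch, `candidates` is a list of (source, probe outputs) pairs and `original_outputs` are the buggy program's outputs on the same probes; both the weighting and the behavioral probes are assumptions for illustration.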

198 citations

Book ChapterDOI
25 Mar 2002
TL;DR: This paper presents the XXL search engine that supports relevance ranking on XML data. XXL is particularly geared for path queries with wildcards that can span multiple XML collections and contain both exact-match as well as semantic-similarity search conditions.
Abstract: Query languages for XML such as XPath or XQuery support Boolean retrieval: a query result is a (possibly restructured) subset of XML elements or entire documents that satisfy the search conditions of the query. This search paradigm works for highly schematic XML data collections such as electronic catalogs. However, for searching information in open environments such as the Web or intranets of large corporations, ranked retrieval is more appropriate: a query result is a rank list of XML elements in descending order of (estimated) relevance. Web search engines, which are based on the ranked retrieval paradigm, do not, however, consider the additional information and rich annotations provided by the structure of XML documents and their element names. This paper presents the XXL search engine that supports relevance ranking on XML data. XXL is particularly geared for path queries with wildcards that can span multiple XML collections and contain both exact-match as well as semantic-similarity search conditions. In addition, ontological information and suitable index structures are used to improve the search efficiency and effectiveness. XXL is fully implemented as a suite of Java servlets. Experiments with a variety of structurally diverse XML data demonstrate the efficiency of the XXL search engine and underline its effectiveness for ranked retrieval.
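The combination of exact-match and semantic-similarity conditions along a path query can be sketched as follows. The similarity table, the wildcard symbol, and the multiplicative combination are assumptions for illustration; XXL's actual scoring model, ontology, and index structures are not reproduced here.

```python
# Toy ontology-style similarity lookup (assumed values, not XXL's ontology).
SIMILARITY = {
    ("car", "automobile"): 0.9,
    ("professor", "lecturer"): 0.7,
}

def term_score(query_term: str, element_tag: str) -> float:
    """Exact-match condition scores 1.0; otherwise fall back to a
    semantic-similarity score (0.0 if the pair is unknown)."""
    if query_term == element_tag:
        return 1.0
    return SIMILARITY.get((query_term, element_tag), 0.0)

def path_score(query_terms, element_path) -> float:
    """Score an XML element by combining per-condition scores along its tag
    path; the wildcard '#' matches any tag without affecting the score."""
    score = 1.0
    for q, tag in zip(query_terms, element_path):
        if q == "#":
            continue
        score *= term_score(q, tag)
    return score

# Rank candidate elements (given by their tag paths) for a query like car/#/price.
candidates = {"e1": ["automobile", "offer", "price"], "e2": ["bike", "offer", "price"]}
ranked = sorted(candidates.items(),
                key=lambda kv: path_score(["car", "#", "price"], kv[1]),
                reverse=True)
print(ranked)  # e1 outranks e2 because "automobile" is similar to "car"
```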

198 citations

Posted Content
TL;DR: The Deep Learning Track is a new track for TREC 2019, with the goal of studying ad hoc ranking in a large data regime, and is the first track with large human-labeled training sets, introducing two sets corresponding to two tasks, each with rigorous TREC-style blind evaluation and reusable test sets.
Abstract: The Deep Learning Track is a new track for TREC 2019, with the goal of studying ad hoc ranking in a large data regime. It is the first track with large human-labeled training sets, introducing two sets corresponding to two tasks, each with rigorous TREC-style blind evaluation and reusable test sets. The document retrieval task has a corpus of 3.2 million documents with 367 thousand training queries, for which we generate a reusable test set of 43 queries. The passage retrieval task has a corpus of 8.8 million passages with 503 thousand training queries, for which we generate a reusable test set of 43 queries. This year 15 groups submitted a total of 75 runs, using various combinations of deep learning, transfer learning and traditional IR ranking methods. Deep learning runs significantly outperformed traditional IR runs. Possible explanations for this result are that we introduced large training data and we included deep models trained on such data in our judging pools, whereas some past studies did not have such training data or pooling.
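To make the evaluation setup concrete, the sketch below computes NDCG@10 for a single query, a standard graded-relevance metric for ad hoc ranking tasks of this kind; whether it was this track's official metric is not stated in the abstract above, and the judgments and run shown are hypothetical.

```python
import math

def ndcg_at_k(ranked_doc_ids, qrels, k=10):
    """NDCG@k for one query: ranked_doc_ids is the system's ranking,
    qrels maps doc id -> graded relevance label (0 if unjudged)."""
    gains = [qrels.get(d, 0) for d in ranked_doc_ids[:k]]
    dcg = sum(g / math.log2(i + 2) for i, g in enumerate(gains))
    ideal = sorted(qrels.values(), reverse=True)[:k]
    idcg = sum(g / math.log2(i + 2) for i, g in enumerate(ideal))
    return dcg / idcg if idcg > 0 else 0.0

# Hypothetical relevance judgments and a single run for one query.
qrels = {"D1": 3, "D7": 2, "D9": 1}
run = ["D7", "D3", "D1", "D9", "D2"]
print(round(ndcg_at_k(run, qrels), 3))
```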

198 citations

Proceedings ArticleDOI
11 Jul 2002
TL;DR: In this article, the authors discuss manual and automatic evaluation of summaries using data from the Document Understanding Conference 2001 (DUC-2001) and show the instability of the manual evaluation.
Abstract: In this paper we discuss manual and automatic evaluations of summaries using data from the Document Understanding Conference 2001 (DUC-2001). We first show the instability of the manual evaluation. Specifically, the low inter-human agreement indicates that more reference summaries are needed. To investigate the feasibility of automated summary evaluation based on the recent BLEU method from machine translation, we use accumulative n-gram overlap scores between system and human summaries. The initial results provide encouraging correlations with human judgments, based on the Spearman rank-order correlation coefficient. However, relative ranking of systems needs to take into account the instability.
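The two ingredients of the automatic evaluation, accumulative n-gram overlap between a system summary and the reference summaries, and the Spearman rank-order correlation used to compare system rankings, can be sketched as below. The exact clipping and normalization used in the paper are not reproduced; this is a recall-oriented approximation, and the Spearman helper assumes no tied scores.

```python
from collections import Counter

def ngrams(tokens, n):
    """Multiset of n-grams in a token sequence."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def ngram_overlap(system_summary, reference_summaries, n=2):
    """Accumulative n-gram overlap: clipped matches against the references
    divided by the total reference n-gram count (recall-oriented)."""
    sys_counts = ngrams(system_summary.lower().split(), n)
    match, total = 0, 0
    for ref in reference_summaries:
        ref_counts = ngrams(ref.lower().split(), n)
        match += sum(min(c, sys_counts[g]) for g, c in ref_counts.items())
        total += sum(ref_counts.values())
    return match / total if total else 0.0

def spearman(scores_a, scores_b):
    """Spearman rank-order correlation between two score lists (no ties assumed)."""
    def ranks(values):
        order = sorted(range(len(values)), key=lambda i: values[i])
        r = [0] * len(values)
        for rank, i in enumerate(order, start=1):
            r[i] = rank
        return r
    ra, rb = ranks(scores_a), ranks(scores_b)
    n = len(scores_a)
    d2 = sum((a - b) ** 2 for a, b in zip(ra, rb))
    return 1 - 6 * d2 / (n * (n * n - 1))
```

Usage follows the paper's setup: score each system with `ngram_overlap`, score it with the manual protocol, then correlate the two system rankings with `spearman`.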

198 citations

Patent
07 Dec 2004
TL;DR: In this article, a traffic counter for a business location in a business database is updated whenever a mobile device's geo-position moves inside that location; the popularity ranking is adjusted for business size by dividing the traffic counter by the square footage of the business and sorting the result set by mobile-device visits, or repeat visits, per square foot.
Abstract: An Internet search engine ranks search results based on popularity with mobile-device users. Geo-position data from cell phones and other mobile devices are collected into a device geo-position database. The geo-position data is compared to locations of businesses in a business database. When a mobile device's geo-position moves inside a business location, a traffic counter for that business location in the business database is updated. When an Internet user performs a local search, the result set is sorted based on a rank that is at least partially determined by the traffic counters. The popularity-ranked search results indicate which businesses received the most mobile-device visits, an indication of the business's overall popularity. The popularity ranking may be adjusted for business size by dividing the traffic counter by the square footage of the business and sorting the result set based on the mobile-device visits, or repeat visits, per square foot.
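A minimal sketch of the size-adjusted popularity ranking described above is shown below; the record fields, helper names, and sample values are illustrative assumptions, not the patent's claims.

```python
# Illustrative sketch: rank local-search results by mobile-device visits per
# square foot, so large venues are not favored merely for their size.
def visits_per_sqft(business):
    """Size-adjusted popularity: traffic-counter visits per square foot."""
    return business["traffic_counter"] / business["square_feet"]

def rank_local_results(result_set):
    """Sort a local-search result set so the most-visited businesses,
    per square foot, appear first."""
    return sorted(result_set, key=visits_per_sqft, reverse=True)

results = [
    {"name": "Cafe A", "traffic_counter": 1200, "square_feet": 800},
    {"name": "Mall B", "traffic_counter": 5000, "square_feet": 40000},
]
print([b["name"] for b in rank_local_results(results)])  # Cafe A ranks first
```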

197 citations


Network Information
Related Topics (5)
Web page: 50.3K papers, 975.1K citations (83% related)
Ontology (information science): 57K papers, 869.1K citations (82% related)
Graph (abstract data type): 69.9K papers, 1.2M citations (82% related)
Feature learning: 15.5K papers, 684.7K citations (81% related)
Supervised learning: 20.8K papers, 710.5K citations (81% related)
Performance
Metrics
No. of papers in the topic in previous years:

Year    Papers
2024    1
2023    3,112
2022    6,541
2021    1,105
2020    1,082
2019    1,168