Institution
Yahoo!
Company•London, United Kingdom•
About: Yahoo! is a company organization based out in London, United Kingdom. It is known for research contribution in the topics: Population & Web search query. The organization has 26749 authors who have published 29915 publications receiving 732583 citations. The organization is also known as: Yahoo! Inc. & Maudwen-Yahoo! Inc.
Papers published on a yearly basis
Papers
More filters
••
01 Dec 2006TL;DR: This is the first publicly available Web spam collection that includes page contents and links, and that has been labelled by a large and diverse set of judges.
Abstract: We describe the WEBSPAM-UK2006 collection, a large set of Web pages that have been manually annotated with labels indicating if the hosts are include Web spam aspects or not. This is the first publicly available Web spam collection that includes page contents and links, and that has been labelled by a large and diverse set of judges.
227 citations
••
TL;DR: Geographic variation in embryonic development may reflect more complex interactions than previously recognized, and incubation periods of transferred eggs did not match host species and reflect intrinsic differences among species that may result from nest predation and other selection pressures.
Abstract: Theory predicts shorter embryonic periods in species with greater embryo mortality risk and smaller body size. Field studies of 80 passerine species on three continents yielded data that largely conflicted with theory; incubation (embryonic) periods were longer rather than shorter in smaller species, and egg (embryo) mortality risk explained some variation within regions, but did not explain larger differences in incubation periods among geographic regions. Incubation behavior of parents seems to explain these discrepancies. Bird embryos are effectively ectothermic and depend on warmth provided by parents sitting on the eggs to attain proper temperatures for development. Parents of smaller species, plus tropical and southern hemisphere species, commonly exhibited lower nest attentiveness (percent of time spent on the nest incubating) than larger and northern hemisphere species. Lower nest attentiveness produced cooler minimum and average embryonic temperatures that were correlated with longer incubation periods independent of nest predation risk or body size. We experimentally tested this correlation by swapping eggs of species with cool incubation temperatures with eggs of species with warm incubation temperatures and similar egg mass. Incubation periods changed (shortened or lengthened) as expected and verified the importance of egg temperature on development rate. Slower development resulting from cooler temperatures may simply be a cost imposed on embryos by parents and may not enhance offspring quality. At the same time, incubation periods of transferred eggs did not match host species and reflect intrinsic differences among species that may result from nest predation and other selection pressures. Thus, geographic variation in embryonic development may reflect more complex interactions than previously recognized.
226 citations
••
TL;DR: This work proposes an algorithm which aims at directly optimizing popular measures such as the Normalized Discounted Cumulative Gain and the Average Precision, to minimize a smooth approximation of these measures with gradient descent.
Abstract: Most ranking algorithms are based on the optimization of some loss functions, such as the pairwise loss. However, these loss functions are often different from the criteria that are adopted to measure the quality of the web page ranking results. To overcome this problem, we propose an algorithm which aims at directly optimizing popular measures such as the Normalized Discounted Cumulative Gain and the Average Precision. The basic idea is to minimize a smooth approximation of these measures with gradient descent. Crucial to this kind of approach is the choice of the smoothing factor. We provide various theoretical analysis on that choice and propose an annealing algorithm to iteratively minimize a less and less smoothed approximation of the measure of interest. Results on the Letor benchmark datasets show that the proposed algorithm achieves state-of-the-art performances.
226 citations
••
17 Mar 2018TL;DR: This article created the largest corpus for a particular stylistic transfer (formality) and show that techniques from the machine translation community can serve as strong baselines for future work, and discuss challenges of using automatic metrics.
Abstract: Style transfer is the task of automatically transforming a piece of text in one particular style into another. A major barrier to progress in this field has been a lack of training and evaluation datasets, as well as benchmarks and automatic metrics. In this work, we create the largest corpus for a particular stylistic transfer (formality) and show that techniques from the machine translation community can serve as strong baselines for future work. We also discuss challenges of using automatic metrics.
226 citations
•
15 Mar 2005TL;DR: In this paper, the authors describe a system where users can annotate content items found in a corpus such as the World Wide Web and search their annotations or limit searches to documents they have annotated.
Abstract: Computer systems and methods allow users to annotate content items found in a corpus such as the World Wide Web. Annotations, which can include any descriptive and/or evaluative metadata related to a document, are collected from a user and stored in association with that user. Users are able to annotate and view their annotations for any document they encounter while interacting with the corpus, including hits returned in a search of the corpus. Users are also able to search their annotations or to limit searches to documents they have annotated. Metadata from annotations can also be aggregated across users and aggregated metadata applied in generating search results.
226 citations
Authors
Showing all 26766 results
Name | H-index | Papers | Citations |
---|---|---|---|
Ashok Kumar | 151 | 5654 | 164086 |
Alexander J. Smola | 122 | 434 | 110222 |
Howard I. Maibach | 116 | 1821 | 60765 |
Sanjay Jain | 103 | 881 | 46880 |
Amirhossein Sahebkar | 100 | 1307 | 46132 |
Marc Davis | 99 | 412 | 50243 |
Wenjun Zhang | 96 | 976 | 38530 |
Jian Xu | 94 | 1366 | 52057 |
Fortunato Ciardiello | 94 | 695 | 47352 |
Tong Zhang | 93 | 414 | 36519 |
Michael E. J. Lean | 92 | 411 | 30939 |
Ashish K. Jha | 87 | 503 | 30020 |
Xin Zhang | 87 | 1714 | 40102 |
Theunis Piersma | 86 | 632 | 34201 |
George Varghese | 84 | 253 | 28598 |