Institution
Yahoo!
Company • London, United Kingdom
About: Yahoo! is a company based in London, United Kingdom. It is known for research contributions on the topics of Population and Web search query. The organization has 26749 authors who have published 29915 publications receiving 732583 citations. The organization is also known as Yahoo! Inc. and Maudwen-Yahoo! Inc.
Papers
12 Aug 2007
TL;DR: An unsupervised feature selection strategy with provable performance guarantees is proposed for text classification, and its performance is evaluated empirically on the TechTC-100, 20-Newsgroups, and Reuters-RCV2 data sets.
Abstract: We consider feature selection for text classification both theoretically and empirically. Our main result is an unsupervised feature selection strategy for which we give worst-case theoretical guarantees on the generalization power of the resultant classification function f̃ with respect to the classification function f obtained when keeping all the features. To the best of our knowledge, this is the first feature selection method with such guarantees. In addition, the analysis leads to insights as to when and why this feature selection strategy will perform well in practice. We then use the TechTC-100, 20-Newsgroups, and Reuters-RCV2 data sets to evaluate empirically the performance of this and two simpler but related feature selection strategies against two commonly-used strategies. Our empirical evaluation shows that the strategy with provable performance guarantees performs well in comparison with other commonly-used feature selection strategies. In addition, it performs better on certain datasets under very aggressive feature selection.
184 citations
14 Jun 1999
TL;DR: A method and apparatus are provided for determining when electronic documents stored in a large collection of documents are similar to one another.
Abstract: A method and apparatus are provided for determining when electronic documents stored in a large collection of documents are similar to one another. A plurality of similarity information is derived from the documents. The similarity information may be based on a variety of factors, including hyperlinks in the documents, text similarity, user click-through information, similarity in the titles of the documents or their location identifiers, and patterns of user viewing. The similarity information is fed to a combination function that synthesizes the various measures of similarity information into combined similarity information. Using the combined similarity information, an objective function is iteratively maximized in order to yield a generalized similarity value that expresses the similarity of particular pairs of documents. In an embodiment, the generalized similarity value is used to determine the proper category, among a taxonomy of categories in an index, cache or search system, into which certain documents belong.
184 citations
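The combination step described in this abstract can be illustrated with a minimal sketch. The abstract does not specify the combination function, so the weighted average below is an assumption, as are the signal names, the weights, and the function name `combined_similarity`. The iterative maximization of the objective function is not shown, because the abstract gives no detail about it.

```python
def combined_similarity(signals, weights):
    """Combine several pairwise similarity signals into one score.

    Hypothetical sketch: a weighted average is assumed here; the
    abstract does not state which combination function is used.
    signals: dict mapping signal name -> similarity in [0, 1]
    weights: dict mapping signal name -> non-negative weight
    """
    total_weight = sum(weights[name] for name in signals)
    if total_weight == 0:
        return 0.0
    weighted_sum = sum(weights[name] * value for name, value in signals.items())
    return weighted_sum / total_weight


# Illustrative values only; none of these numbers come from the source.
pair_signals = {"text": 0.8, "hyperlinks": 0.4, "click_through": 0.0}
signal_weights = {"text": 2.0, "hyperlinks": 1.0, "click_through": 1.0}
score = combined_similarity(pair_signals, signal_weights)
```

A weighted average keeps the combined score in the same range as its inputs, which makes it easy to compare pairs of documents, but any monotone combination would fit the description equally well.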
31 Aug 2009
TL;DR: An ad creation system uses data made available by the W4 COMN to facilitate the creation and placement of advertisements on a message delivery network, such as the W4 COMN itself.
Abstract: Methods, apparatuses and systems directed to creating contextually-targeted advertisements. In a particular implementation, advertisers may leverage a W4 COMN to deliver contextually-targeted and/or contextually-enhanced advertisements. In the implementations discussed below, an ad creation system utilizes data made available by the W4 COMN to facilitate the creation and placement of advertisements on a message delivery network, such as the W4 COMN itself.
183 citations
28 Apr 2008
TL;DR: A list of potential friends is provided to a user based on an analysis of friends' contact lists. The user may provide a subset of friends within a contact list for analysis, along with a degree of separation over which to perform the analysis, and/or a minimum threshold number of occurrences for identifying a candidate friend.
Abstract: Embodiments are directed towards providing a list of potential friends to a user based on an analysis of friends' contact lists. The user may provide a subset of friends within a contact list for analysis, along with a degree of separation over which to perform the analysis, and/or a minimum threshold number of occurrences for identifying a candidate friend. The subset of friends' contact lists may then be recursively traversed and merged, where common friends may be identified as members of a candidate set for suggesting friends to the user. In one embodiment, the candidate members may be retained within the candidate set if there is a commonality between the friends and the candidate that exceeds the minimum threshold. The candidate list may also be rank ordered using various approaches, including a weighted energy diffusion model based in part on a number of communications between the candidates.
183 citations
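The traversal described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented method: the function name `suggest_friends`, its parameters, and the sample data are hypothetical, a candidate is kept when its occurrence count meets or exceeds the threshold, and candidates are ranked by plain occurrence count rather than by the weighted energy diffusion model the abstract mentions.

```python
from collections import Counter


def suggest_friends(contacts, user, selected=None, max_depth=1, min_count=2):
    """Suggest candidate friends by traversing friends' contact lists.

    Hypothetical sketch. contacts maps each person to a set of contacts.
    selected is the optional subset of the user's friends to analyze,
    max_depth is the degree of separation to traverse, and min_count is
    the minimum number of occurrences needed to keep a candidate.
    """
    direct = contacts.get(user, set())
    frontier = set(direct if selected is None else selected)
    visited = set()
    counts = Counter()
    for _ in range(max_depth):
        next_frontier = set()
        for friend in frontier:
            if friend in visited:
                continue
            visited.add(friend)
            for person in contacts.get(friend, set()):
                # Count only people who are not already direct contacts.
                if person != user and person not in direct:
                    counts[person] += 1
                next_frontier.add(person)
        frontier = next_frontier - visited - {user}
    candidates = [(p, c) for p, c in counts.items() if c >= min_count]
    # Assumption: rank by occurrence count, ties broken alphabetically.
    return sorted(candidates, key=lambda pc: (-pc[1], pc[0]))


# Illustrative data only.
sample_contacts = {
    "ann": {"bob", "cat"},
    "bob": {"ann", "dan", "eve"},
    "cat": {"ann", "dan"},
    "dan": {"bob", "cat"},
    "eve": {"bob"},
}
suggestions = suggest_friends(sample_contacts, "ann", min_count=2)
```

In this sample, `dan` appears in the contact lists of both of `ann`'s friends and so passes a threshold of 2, while `eve` appears only once and is dropped.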
TL;DR: This study demonstrates that children with physical and neurological disabilities enjoy the same activities as those children without these disabilities.
Abstract: Background. Children with disabilities are at risk for limited daily occupational participation. This paper presents a phenomenological study describing the meaning of participation in activities outside of school to children with physical and neurological disabilities and their parents. Methods. Semi-structured interviews were conducted with eight children and their parents. Results. Nine themes, organized under three categories, describe what successful participation means, and the personal and environmental supports and barriers to participation. Successful participation is defined in terms of being with others and being able to perform tasks independently. Conclusions. This study demonstrates that children with physical and neurological disabilities enjoy the same activities as those children without these disabilities. Activities allowing children to experience enjoyment have the best chance of ensuring children's participation. The role of the parent in providing opportunities for participation, and...
183 citations
Authors
Name | H-index | Papers | Citations |
---|---|---|---|
Ashok Kumar | 151 | 5654 | 164086 |
Alexander J. Smola | 122 | 434 | 110222 |
Howard I. Maibach | 116 | 1821 | 60765 |
Sanjay Jain | 103 | 881 | 46880 |
Amirhossein Sahebkar | 100 | 1307 | 46132 |
Marc Davis | 99 | 412 | 50243 |
Wenjun Zhang | 96 | 976 | 38530 |
Jian Xu | 94 | 1366 | 52057 |
Fortunato Ciardiello | 94 | 695 | 47352 |
Tong Zhang | 93 | 414 | 36519 |
Michael E. J. Lean | 92 | 411 | 30939 |
Ashish K. Jha | 87 | 503 | 30020 |
Xin Zhang | 87 | 1714 | 40102 |
Theunis Piersma | 86 | 632 | 34201 |
George Varghese | 84 | 253 | 28598 |