scispace - formally typeset
Search or ask a question
Institution

Yahoo!

CompanyLondon, United Kingdom
About: Yahoo! is a company organization based out in London, United Kingdom. It is known for research contribution in the topics: Population & Web search query. The organization has 26749 authors who have published 29915 publications receiving 732583 citations. The organization is also known as: Yahoo! Inc. & Maudwen-Yahoo! Inc.


Papers
More filters
Proceedings ArticleDOI
12 Aug 2007
TL;DR: In this article, an unsupervised feature selection strategy with provable performance guarantees was proposed for text classification, and the performance of the strategy was evaluated empirically on the TechTC-100, 20-Newsgroups, and Reuters-RCV2 data sets.
Abstract: We consider feature selection for text classification both theoretically and empirically. Our main result is an unsupervised feature selection strategy for which we give worst-case theoretical guarantees on the generalization power of the resultant classification function f with respect to the classification function f obtained when keeping all the features. To the best of our knowledge, this is the first feature selection method with such guarantees. In addition, the analysis leads to insights as to when and why this feature selection strategy will perform well in practice. We then use the TechTC-100, 20-Newsgroups, and Reuters-RCV2 data sets to evaluate empirically the performance of this and two simpler but related feature selection strategies against two commonly-used strategies. Our empirical evaluation shows that the strategy with provable performance guarantees performs well in comparison with other commonly-used feature selection strategies. In addition, it performs better on certain datasets under very aggressive feature selection.

184 citations

Patent
14 Jun 1999
TL;DR: In this article, a method and apparatus for determining when electronic documents stored in a large collection of documents are similar to one another is provided for determining the similarity of documents stored on the same server.
Abstract: A method and apparatus are provided for determining when electronic documents stored in a large collection of documents are similar to one another. A plurality of similarity information is derived from the documents. The similarity information may be based on a variety of factors, including hyperlinks in the documents, text similarity, user click-through information, similarity in the titles of the documents or their location identifiers, and patterns of user viewing. The similarity information is fed to a combination function that synthesizes the various measures of similarity information into combined similarity information. Using the combined similarity information, an objective function is iteratively maximized in order to yield a generalized similarity value that expresses the similarity of particular pairs of documents. In an embodiment, the generalized similarity value is used to determine the proper category, among a taxonomy of categories in an index, cache or search system, into which certain documents belong.

184 citations

Patent
31 Aug 2009
TL;DR: In this article, an ad creation system utilizes data made available by the W4 COMN to facilitate the creation and placement of advertisements on a message delivery network, such as the Web4COMN itself.
Abstract: Methods, apparatuses and systems directed to creating contextually-targeted advertisements. In a particular implementation, advertisers may leverage a W4 COMN to deliver contextually-targeted and/or contextually-enhanced advertisements. In the implementations discussed below, an ad creation system utilizes data made available by the W4 COMN to facilitate the creation and placement of advertisements on a message delivery network, such as the W4 COMN itself.

183 citations

Patent
Sunil Jagadish1, Jignashu Parikh1
28 Apr 2008
TL;DR: In this article, a set of potential friends to a user based on an analysis of friends' contact lists is provided. But the user may provide a subset of friends within a contact list for analysis, along with a degree of separation over which to perform the analysis, and/or a minimum threshold number of occurrences for identifying a candidate friend.
Abstract: Embodiments are directed towards providing a list of potential friends to a user based on an analysis of friends' contact lists. The user may provide a subset of friends within a contact list for analysis, along with a degree of separation over which to perform the analysis, and/or a minimum threshold number of occurrences for identifying a candidate friend. The subset of friends' contact lists may then be recursively traversed and merged, where common friends may be identified as members of a candidate set for suggesting friends to the user. In one embodiment, the candidate members may be retained within the candidate set if there is a commonality between the friends and the candidate that exceeds the minimum threshold. The candidate list may also be rank order using various approaches, including a weighted energy diffusion model based in part on a number of communications between the candidates.

183 citations

Journal ArticleDOI
TL;DR: This study demonstrates that children with physical and neurological disabilities enjoy the same activities as those children without these disabilities.
Abstract: Background. Children with disabilities are at risk for limited daily occupational participation. This paper presents a phenomenological study describing the meaning of participation in activities outside of school to children with physical and neurological disabilities and their parents. Methods. Semi-structured interviews were conducted with eight children and their parents. Results. Nine themes, organized under three categories, describe what successful participation means, and the personal and environmental supports and barriers to participation. Successful participation is defined in terms of being with others and being able to perform tasks independently. Conclusions. This study demonstrates that children with physical and neurological disabilities enjoy the same activities as those children without these disabilities. Activities allowing children to experience enjoyment have the best chance of ensuring children's participation. The role of the parent in providing opportunities for participation, and...

183 citations


Authors

Showing all 26766 results

NameH-indexPapersCitations
Ashok Kumar1515654164086
Alexander J. Smola122434110222
Howard I. Maibach116182160765
Sanjay Jain10388146880
Amirhossein Sahebkar100130746132
Marc Davis9941250243
Wenjun Zhang9697638530
Jian Xu94136652057
Fortunato Ciardiello9469547352
Tong Zhang9341436519
Michael E. J. Lean9241130939
Ashish K. Jha8750330020
Xin Zhang87171440102
Theunis Piersma8663234201
George Varghese8425328598
Network Information
Related Institutions (5)
University of Toronto
294.9K papers, 13.5M citations

85% related

University of California, San Diego
204.5K papers, 12.3M citations

85% related

University College London
210.6K papers, 9.8M citations

84% related

Cornell University
235.5K papers, 12.2M citations

84% related

University of Washington
305.5K papers, 17.7M citations

84% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
20232
202247
20211,088
20201,074
20191,568
20181,352