scispace - formally typeset
Search or ask a question
Institution

Yahoo!

CompanyLondon, United Kingdom
About: Yahoo! is a company organization based out in London, United Kingdom. It is known for research contribution in the topics: Population & Web search query. The organization has 26749 authors who have published 29915 publications receiving 732583 citations. The organization is also known as: Yahoo! Inc. & Maudwen-Yahoo! Inc.


Papers
More filters
Proceedings ArticleDOI
04 Jun 2011
TL;DR: This paper presents new algorithms in the MapReduce framework for a variety of fundamental graph problems for sufficiently dense graphs and implements the maximal matching algorithm that lies at the core of the analysis and achieves a significant speedup over the sequential version.
Abstract: The MapReduce framework is currently the de facto standard used throughout both industry and academia for petabyte scale data analysis. As the input to a typical MapReduce computation is large, one of the key requirements of the framework is that the input cannot be stored on a single machine and must be processed in parallel. In this paper we describe a general algorithmic design technique in the MapReduce framework called filtering. The main idea behind filtering is to reduce the size of the input in a distributed fashion so that the resulting, much smaller, problem instance can be solved on a single machine. Using this approach we give new algorithms in the MapReduce framework for a variety of fundamental graph problems for sufficiently dense graphs. Specifically, we present algorithms for minimum spanning trees, maximal matchings, approximate weighted matchings, approximate vertex and edge covers and minimum cuts. In all of these cases, we parameterize our algorithms by the amount of memory available on the machines allowing us to show tradeoffs between the memory available and the number of MapReduce rounds. For each setting we will show that even if the machines are only given substantially sublinear memory, our algorithms run in a constant number of MapReduce rounds. To demonstrate the practical viability of our algorithms we implement the maximal matching algorithm that lies at the core of our analysis and show that it achieves a significant speedup over the sequential version.

283 citations

Proceedings ArticleDOI
26 Oct 2006
TL;DR: A framework for automatically selecting a summary set of photos from a large collection of geo-referenced photographs, based on spa-tial patterns in photo sets, as well as textual-topical patterns and user (photographer) identity cues, which can be expanded to support social, temporal, and other factors.
Abstract: We describe a framework for automatically selecting a summary set of photos from a large collection of geo-referenced photographs. Such large collections are inherently difficult to browse, and become excessively so as they grow in size, making summaries an important tool in rendering these collections accessible. Our summary algorithm is based on spa-tial patterns in photo sets, as well as textual-topical patterns and user (photographer) identity cues. The algorithm can be expanded to support social, temporal, and other factors. The summary can thus be biased by the content of the query, the user making the query, and the context in which the query is made.A modified version of our summarization algorithm serves as a basis for a new map-based visualization of large collections of geo-referenced photos, called Tag Maps. Tag Maps visualize the data by placing highly representative textual tags on relevant map locations in the viewed region, effectively providing a sense of the important concepts embodied in the collection.An initial evaluation of our implementation on a set of geo-referenced photos shows that our algorithm and visualization perform well, producing summaries and views that are highly rated by users.

282 citations

Proceedings Article
26 Oct 2008
TL;DR: The composition of a query plan for a group-by skyline query is examined and the missing cost model for the BBS algorithm is developed and Experimental results show that the techniques are able to devise the best query plans for a variety of group- by skyline queries.
Abstract: It is our great pleasure to welcome you to the 17th ACM Conference on Information and Knowledge Management -- CIKM'08. Since 1992, the ACM Conference on Information and Knowledge Management (CIKM) has been successfully bringing together leading researchers and developers from the database, information retrieval, and knowledge management communities. The purpose of the conference is to identify challenging problems facing the development of future knowledge and information systems, and to shape future research directions through the publication of high quality, applied and theoretical research findings. In CIKM 2008, we continued the tradition of promoting collaboration among the general areas of databases, information retrieval, and knowledge management. This year's call for papers attracted almost 800 submissions from Asia, Canada, Europe, Africa, and the United States. The program committee accepted 132 papers and 103 posters giving CIKM'08 an acceptance rate of 17%.

281 citations

Journal ArticleDOI
TL;DR: A green synthesis route for the production of silver nanoparticles using methanol extract from Solanum xanthocarpum berry (SXE) is reported in the present investigation, and AgNps under study were found to be equally efficient against the antibiotic-resistant and antibiotic-susceptible strains of H. pylori.
Abstract: A green synthesis route for the production of silver nanoparticles using methanol extract from Solanum xanthocarpum berry (SXE) is reported in the present investigation. Silver nanoparticles (AgNps), having a surface plasmon resonance (SPR) band centered at 406 nm, were synthesized by reacting SXE (as capping as well as reducing agent) with AgNO3 during a 25 min process at 45 °C. The synthesized AgNps were characterized using UV–Visible spectrophotometry, powdered X-ray diffraction, and transmission electron microscopy (TEM). The results showed that the time of reaction, temperature and volume ratio of SXE to AgNO3 could accelerate the reduction rate of Ag+ and affect the AgNps size and shape. The nanoparticles were found to be about 10 nm in size, mono-dispersed in nature, and spherical in shape. In vitro anti-Helicobacter pylori activity of synthesized AgNps was tested against 34 clinical isolates and two reference strains of Helicobacter pylori by the agar dilution method and compared with AgNO3 and four standard drugs, namely amoxicillin (AMX), clarithromycin (CLA), metronidazole (MNZ) and tetracycline (TET), being used in anti-H. pylori therapy. Typical AgNps sample (S1) effectively inhibited the growth of H. pylori, indicating a stronger anti-H. pylori activity than that of AgNO3 or MNZ, being almost equally potent to TET and less potent than AMX and CLA. AgNps under study were found to be equally efficient against the antibiotic-resistant and antibiotic-susceptible strains of H. pylori. Besides, in the H. pylori urease inhibitory assay, S1 also exhibited a significant inhibition. Lineweaver-Burk plots revealed that the mechanism of inhibition was noncompetitive.

281 citations

Journal ArticleDOI
Olivier Chapelle1, S. S. Keerthi1
TL;DR: Evaluation on the Letor benchmark datasets after complete training using new methods based on primal Newton method to speed up RankSVM training and show that they are 5 orders of magnitude faster than SVMLight.
Abstract: RankSVM (Herbrich et al. in Advances in large margin classifiers. MIT Press, Cambridge, MA, 2000; Joachims in Proceedings of the ACM conference on knowledge discovery and data mining (KDD), 2002) is a pairwise method for designing ranking models. SVMLight is the only publicly available software for RankSVM. It is slow and, due to incomplete training with it, previous evaluations show RankSVM to have inferior ranking performance. We propose new methods based on primal Newton method to speed up RankSVM training and show that they are 5 orders of magnitude faster than SVMLight. Evaluation on the Letor benchmark datasets after complete training using such methods shows that the performance of RankSVM is excellent.

280 citations


Authors

Showing all 26766 results

NameH-indexPapersCitations
Ashok Kumar1515654164086
Alexander J. Smola122434110222
Howard I. Maibach116182160765
Sanjay Jain10388146880
Amirhossein Sahebkar100130746132
Marc Davis9941250243
Wenjun Zhang9697638530
Jian Xu94136652057
Fortunato Ciardiello9469547352
Tong Zhang9341436519
Michael E. J. Lean9241130939
Ashish K. Jha8750330020
Xin Zhang87171440102
Theunis Piersma8663234201
George Varghese8425328598
Network Information
Related Institutions (5)
University of Toronto
294.9K papers, 13.5M citations

85% related

University of California, San Diego
204.5K papers, 12.3M citations

85% related

University College London
210.6K papers, 9.8M citations

84% related

Cornell University
235.5K papers, 12.2M citations

84% related

University of Washington
305.5K papers, 17.7M citations

84% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
20232
202247
20211,088
20201,074
20191,568
20181,352