Showing papers on "Skyline published in 2012"

PDF

Open Access

Journal Article•DOI•

Platform-independent and Label-free Quantitation of Proteomic Data Using MS1 Extracted Ion Chromatograms in Skyline APPLICATION TO PROTEIN ACETYLATION AND PHOSPHORYLATION

[...]

Birgit Schilling¹, Matthew J. Rardin¹, Brendan MacLean², Anna M. Zawadzka¹, Barbara Frewen², Michael P. Cusack¹, Dylan J. Sorensen¹, Michael S. Bereman², Enxuan Jing³, Christine C. Wu⁴, Eric Verdin, C. Ronald Kahn³, Michael J. MacCoss², Bradford W. Gibson¹ - Show less +10 more•Institutions (4)

Buck Institute for Research on Aging¹, University of Washington², Harvard University³, University of Pittsburgh⁴

01 May 2012-Molecular & Cellular Proteomics

TL;DR: The capabilities of Skyline were expanded to process ion intensity chromatograms of peptide analytes from full scan mass spectral data (MS1) acquired during HPLC MS/MS proteomic experiments and the utility of the MS1 filtering approach was examined.

...read moreread less

413 citations

Journal Article•DOI•

A survey of skyline processing in highly distributed environments

[...]

Katja Hose¹, Akrivi Vlachou¹•Institutions (1)

Max Planck Society¹

01 Jun 2012

TL;DR: This paper outlines the objectives and the main principles that any distributed skyline approach has to fulfill, leading to useful guidelines for developing algorithms for distributed skyline processing, and reviews in detail existing approaches that are applicable for highly distributed environments.

...read moreread less

Abstract: During the last decades, data management and storage have become increasingly distributed. Advanced query operators, such as skyline queries, are necessary in order to help users to handle the huge amount of available data by identifying a set of interesting data objects. Skyline query processing in highly distributed environments poses inherent challenges and demands and requires non-traditional techniques due to the distribution of content and the lack of global knowledge. This paper surveys this interesting and still evolving research area, so that readers can easily obtain an overview of the state-of-the-art. We outline the objectives and the main principles that any distributed skyline approach has to fulfill, leading to useful guidelines for developing algorithms for distributed skyline processing. We review in detail existing approaches that are applicable for highly distributed environments, clarify the assumptions of each approach, and provide a comparative performance analysis. Moreover, we study the skyline variants each approach supports. Our analysis leads to a taxonomy of existing approaches. Finally, we present interesting research topics on distributed skyline computation that have not yet been explored.

...read moreread less

132 citations

Journal Article•DOI•

Continuous monitoring of skylines over uncertain data streams

[...]

Xiaofeng Ding¹, Xiang Lian², Lei Chen², Hai Jin¹•Institutions (2)

Huazhong University of Science and Technology¹, Hong Kong University of Science and Technology²

01 Feb 2012-Information Sciences

TL;DR: This paper proposes a novel sliding window skyline model where an uncertain tuple may take the probability to be in the skyline at a certain timestamp t, and proposes an efficient and effective approach, namely the candidate list approach, which maintains lists of candidates that might become skylines in future sliding windows.

...read moreread less

59 citations

Journal Article•DOI•

Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data

[...]

Xiaofeng Ding¹, Hai Jin¹•Institutions (1)

Huazhong University of Science and Technology¹

01 Aug 2012-IEEE Transactions on Knowledge and Data Engineering

TL;DR: This paper proposes the notation of distributed skyline queries over uncertain data, and two communication- and computation-efficient algorithms are proposed to retrieve the qualified skylines from distributed local sites.

...read moreread less

Abstract: The skyline operator has received considerable attention from the database community, due to its importance in many applications including multicriteria decision making, preference answering, and so forth. In many applications where uncertain data are inherently exist, i.e., data collected from different sources in distributed locations are usually with imprecise measurements, and thus exhibit kind of uncertainty. Taking into account the network delay and economic cost associated with sharing and communicating large amounts of distributed data over an internet, an important problem in this scenario is to retrieve the global skyline tuples from all the distributed local sites with minimum communication cost. Based on the well-known notation of the probabilistic skyline query over centralized uncertain data, in this paper, we propose the notation of distributed skyline queries over uncertain data. Furthermore, two communication- and computation-efficient algorithms are proposed to retrieve the qualified skylines from distributed local sites. Extensive experiments have been conducted to verify the efficiency, the effectiveness and the progressiveness of our algorithms with both the synthetic and real data sets.

...read moreread less

55 citations

Proceedings Article•DOI•

Parallel skyline queries

[...]

Foto N. Afrati¹, Paraschos Koutris², Dan Suciu², Jeffrey D. Ullman³•Institutions (3)

National Technical University of Athens¹, University of Washington², Stanford University³

26 Mar 2012

TL;DR: This paper design and analyze parallel algorithms for skyline queries using the MP model and a variation of the model in (Afrati and Ullman, EDBT 2010), the GMP model, which demands weaker load balancing constraints, and presents a 1-step algorithm in theGMP model for any number of dimensions.

...read moreread less

Abstract: In this paper, we design and analyze parallel algorithms for skyline queries. The skyline of a multidimensional set consists of the points for which no other point exists that is at least as good along every dimension. As a framework for parallel computation, we use both the MP model proposed in (Koutris and Suciu, PODS 2011), which requires that the data is perfectly load-balanced, and a variation of the model in (Afrati and Ullman, EDBT 2010), the GMP model, which demands weaker load balancing constraints. In addition to load balancing, we want to minimize the number of blocking steps, where all processors must wait and synchronize. We propose a 2-step algorithm in the MP model for any dimension of the dataset, as well a 1-step algorithm for the case of 2 and 3 dimensions. Moreover, we present a 1-step algorithm in the GMP model for any number of dimensions.

...read moreread less

54 citations

Journal Article•DOI•

Energy-Efficient Reverse Skyline Query Processing over Wireless Sensor Networks

[...]

Guoren Wang¹, Junchang Xin¹, Lei Chen², Yunhao Liu²•Institutions (2)

Northeastern University (China)¹, Hong Kong University of Science and Technology²

01 Jul 2012-IEEE Transactions on Knowledge and Data Engineering

TL;DR: An energy-efficient approach is proposed to minimize the communication cost among sensor nodes of evaluating range reverse skyline query and optimization mechanisms to improve the performance of multiple reverse skylines are discussed.

...read moreread less

Abstract: Reverse skyline query plays an important role in many sensing applications, such as environmental monitoring, habitat monitoring, and battlefield monitoring. Due to the limited power supplies of wireless sensor nodes, the existing centralized approaches, which do not consider energy efficiency, cannot be directly applied to the distributed sensor environment. In this paper, we investigate how to process reverse skyline queries energy efficiently in wireless sensor networks. Initially, we theoretically analyzed the properties of reverse skyline query and proposed a skyband-based approach to tackle the problem of reverse skyline query answering over wireless sensor networks. Then, an energy-efficient approach is proposed to minimize the communication cost among sensor nodes of evaluating range reverse skyline query. Moreover, optimization mechanisms to improve the performance of multiple reverse skylines are also discussed. Extensive experiments on both real-world data and synthetic data have demonstrated the efficiency and effectiveness of our proposed approaches with various experimental settings.

...read moreread less

53 citations

Journal Article•DOI•

Continuous Top-k Dominating Queries

[...]

Maria Kontaki, Apostolos N. Papadopoulos, Yannis Manolopoulos

01 May 2012-IEEE Transactions on Knowledge and Data Engineering

TL;DR: This work contains the first study of continuous top-k dominating queries over data streams, and two approximate algorithms are proposed (AHBA and AMSA).

...read moreread less

Abstract: Top-k dominating queries use an intuitive scoring function which ranks multidimensional points with respect to their dominance power, i.e., the number of points that a point dominates. The k points with the best (e.g., highest) scores are returned to the user. Both top-k and skyline queries have been studied in a streaming environment, where changes to the data set are very frequent. In such an environment, continuous query processing techniques are required toward efficient monitoring of query results, since periodic query re-execution is computationally intensive, and therefore, prohibitive. This work contains the first study of continuous top-k dominating queries over data streams. In comparison to continuous top-k and skyline queries, continuous top-k dominating queries pose additional challenges. Three exact algorithms (BFA, EVA, ADA) are studied, and among them ADA, which is enhanced with additional optimization techniques, shows the best overall performance. In some cases, we are willing to trade accuracy for speed. Toward this direction, two approximate algorithms are proposed (AHBA and AMSA). AHBA offers probabilistic guarantees regarding the accuracy of the result based on the Hoeffding bound, whereas AMSA performs a more aggressive computation resulting in more efficient processing. Evaluation results, based on real-life and synthetic data sets, show the efficiency and scalability of our techniques.

...read moreread less

53 citations

Proceedings Article•DOI•

On skyline groups

[...]

Chengkai Li¹, Nan Zhang², Naeemul Hassan¹, Sundaresan Rajasekaran², Gautam Das³ - Show less +1 more•Institutions (3)

University of Texas at Austin¹, George Washington University², Qatar Computing Research Institute³

29 Oct 2012

TL;DR: Two anti-monotonic properties with varying degrees of applicability are identified: order-specific property which applies to SUM, MIN, and MAX as well as weak candidate-generation property which applied to MIN and MAX only.

...read moreread less

Abstract: We formulate and investigate the novel problem of finding the skyline k-tuple groups from an n-tuple dataset - i.e., groups of k tuples which are not dominated by any other group of equal size, based on aggregate-based group dominance relationship. The major technical challenge is to identify effective anti-monotonic properties for pruning the search space of skyline groups. To this end, we show that the anti-monotonic property in the well-known Apriori algorithm does not hold for skyline group pruning. We then identify order-specific property which applies to SUM, MIN, and MAX and weak candidate-generation property which applies to MIN and MAX only. Experimental results on both real and synthetic datasets verify that the proposed algorithms achieve orders of magnitude performance gain over a baseline method.

...read moreread less

51 citations

Journal Article•DOI•

Group skyline computation

[...]

Hyeonseung Im¹, Sungwoo Park¹•Institutions (1)

Pohang University of Science and Technology¹

01 Apr 2012-Information Sciences

TL;DR: A group skyline algorithm GDynamic is developed which is equivalent to a dynamic algorithm that fills a table of skyline groups that determines the dominance relation between two groups by comparing their aggregate values such as sums or averages of elements of individual dimensions.

...read moreread less

42 citations

Journal Article•DOI•

Continuous distance-based skyline queries in road networks

[...]

Yuan-Ko Huang¹, Chia-Heng Chang², Chiang Lee²•Institutions (2)

Kao Yuan University¹, National Cheng Kung University²

01 Nov 2012-Information Systems

TL;DR: This paper addresses the issue of efficiently processing continuous skyline queries in road networks by proposing two novel and important distance-based skyline queries, namely, the continuousd"@e-skylinequery (Cd" @e-SQ) and the continuous k nearest neighbor-Skyline query (Cknn-S Q).

...read moreread less

41 citations

Journal Article•DOI•

Tailoring a geomodel for analyzing an urban skyline

[...]

Caner Güney¹, Suzan Akdag Girginkaya¹, Gülen Çağdaş¹, Sinem Yavuz¹•Institutions (1)

Istanbul Technical University¹

30 Mar 2012-Landscape and Urban Planning

TL;DR: In this paper, the authors investigated the skyline of Istanbul and its transformation due to high-rise buildings and developed a geomodel that can be applied to many urban skylines and urban areas in Turkey and other cities of the world.

...read moreread less

Proceedings Article•DOI•

Selecting Skyline Web Services from Uncertain QoS

[...]

Karim Benouaret¹, Djamal Benslimane¹, Allel Hadjali•Institutions (1)

University of Lyon¹

24 Jun 2012

TL;DR: This work represents each QoS attribute of a Web service using a possibility distribution and introduces two skyline extensions on uncertain QoS called pos-Dominant skyline and nec-dominant skyline, and develops appropriate algorithms to efficiently compute both the pos-dominate skyline and deathly skyline.

...read moreread less

Abstract: Quality of service (QoS) has been considered as a significant criterion for selecting among functionally similar Web services. Recent approaches focus on computing the skyline over a set of QoS attributes. This can completely free users from assigning weights to QoS attributes. However, these approaches are not sufficient in a dynamic Web service environment where the delivered QoS by a Web service is inherently uncertain. In this paper, we tackle the problem of skyline on uncertain QoS. We represent each QoS attribute of a Web service using a possibility distribution and introduce two skyline extensions on uncertain QoS called pos-dominant skyline and nec-dominant skyline. We then develop appropriate algorithms to efficiently compute both the pos-dominant skyline and nec-dominant skyline. Finally, we present our experimental results that show both the effectiveness of the introduced skyline extensions and the efficiency of the proposed algorithms.

...read moreread less

Journal Article•DOI•

Stochastic skylines

[...]

Wenjie Zhang¹, Xuemin Lin¹, Ying Zhang¹, Muhammad Aamir Cheema¹, Qing Zhang² - Show less +1 more•Institutions (2)

University of New South Wales¹, Commonwealth Scientific and Industrial Research Organisation²

04 Jun 2012-ACM Transactions on Database Systems

TL;DR: A novel skyline operator, namely stochastic skylines, is proposed for efficiently and effectively retrieving lskyline and gskyline from a set of uncertain objects, respectively, together with efficient and effective filtering techniques.

...read moreread less

Abstract: In many applications involving multiple criteria optimal decision making, users may often want to make a personal trade-off among all optimal solutions for selecting one object that fits best their personal needs. As a key feature, the skyline in a multidimensional space provides the minimum set of candidates for such purposes by removing all points not preferred by any (monotonic) utility/scoring functions; that is, the skyline removes all objects not preferred by any user no matter how their preferences vary. Driven by many recent applications with uncertain data, the probabilistic skyline model is proposed to retrieve uncertain objects based on skyline probabilities. Nevertheless, skyline probabilities cannot capture the preferences of monotonic utility functions. Motivated by this, in this article we propose a novel skyline operator, namely stochastic skylines. In the light of the expected utility principle, stochastic skylines guarantee to provide the minimum set of candidates to optimal solutions over a family of utility functions. We first propose the lskyline operator based on the lower orthant orders. lskyline guarantees to provide the minimum set of candidates to the optimal solutions for the family of monotonic multiplicative utility functions. While lskyline works very effectively for the family of multiplicative functions, it may miss optimal solutions for other utility /scoring functions (e.g., linear functions). To resolve this, we also propose a general stochastic skyline operator, gskyline, based on the usual orders. gskyline provides the minimum candidate set to the optimal solutions for all monotonic functions. For the first time regarding the existing literature, we investigate the complexities of determining a stochastic order between two uncertain objects whose probability distributions are described discretely. We firstly show that determining the lower orthant order is NP-complete with respect to the dimensionality; consequently the problem of computing lskyline is NP-complete. We also show an interesting result as follows. While the usual order involves more complicated geometric forms than the lower orthant order, the usual order may be determined in polynomial time regarding all the inputs, including the dimensionality; this implies that gskyline can be computed in polynomial time. A general framework is developed for efficiently and effectively retrieving lskyline and gskyline from a set of uncertain objects, respectively, together with efficient and effective filtering techniques. Novel and efficient verification algorithms are developed to efficiently compute lskyline over multidimensional uncertain data, which run in polynomial time if the dimensionality is fixed, and to efficiently compute gskyline in polynomial time regarding all inputs. We also show, by theoretical analysis and experiments, that the sizes of lskyline and gskyline are both quite similar to that of conventional skyline over certain data. Comprehensive experiments demonstrate that our techniques are efficient and scalable regarding both CPU and IO costs.

...read moreread less

Proceedings Article•DOI•

3D City Modeling from Street-Level Data for Augmented Reality Applications

[...]

Timo Pylvänäinen¹, Jérôme Berclaz¹, Thommen Korah¹, Varsha Hedau¹, Mridul Aanjaneya², Radek Grzeszczuk¹ - Show less +2 more•Institutions (2)

Nokia¹, Stanford University²

13 Oct 2012

TL;DR: This work proposes a novel reconstruction technique that exploits architectural properties of urban environments to create an accurate 3D city model from incomplete data and shows that the reconstruction achieves higher accuracy than a commercial solution.

...read moreread less

Abstract: We present a method for automatically creating compact and accurate 3D city models needed for enhanced Augmented Reality applications. The input data are panorama images and LIDAR scans collected at street level and positioned using an IMU and a GPS. Our method corrects for the GPS error and the IMU drift to produce a globally consistent and well registered dataset for the whole city. We use structure from motion and skyline detection to complement the limited range of LIDAR data. Additionally, we propose a novel reconstruction technique that exploits architectural properties of urban environments to create an accurate 3D city model from incomplete data. Our method is able to process an entire city, or several terabytes of data, in a matter of days. We show that our reconstruction achieves higher accuracy than a commercial solution.

...read moreread less