scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Pattern Recognition and Machine Learning

01 Aug 2007-Technometrics (Taylor & Francis)-Vol. 49, Iss: 3, pp 366-366
TL;DR: This book covers a broad range of topics for regular factorial designs and presents all of the material in very mathematical fashion and will surely become an invaluable resource for researchers and graduate students doing research in the design of factorial experiments.
Abstract: (2007). Pattern Recognition and Machine Learning. Technometrics: Vol. 49, No. 3, pp. 366-366.
Citations
More filters
Journal ArticleDOI
TL;DR: Results show that both the objectives of small variance and spatial compactness can be achieved with this partitioning mechanism, and demonstrates the superiority of the proposed method in effectiveness and robustness compared with other clustering algorithms.
Abstract: It has been recently shown that a macroscopic fundamental diagram (MFD) linking space-mean network flow, density and speed exists in the urban transportation networks under some conditions. An MFD is further well defined if the network is homogeneous with links of similar properties. This collective behavior concept can also be utilized to introduce simple control strategies to improve mobility in homogeneous city centers without the need for details in individual links. However many real urban transportation networks are heterogeneous with different levels of congestion. In order to study the existence of MFD and the feasibility of simple control strategies to improve network performance in heterogeneously congested networks, this paper focuses on the clustering of transportation networks based on the spatial features of congestion during a specific time period. Insights are provided on how to extend this framework in the dynamic case. The objectives of partitioning are to obtain (i) small variance of link densities within a cluster which increases the network flow for the same average density and (ii) spatial compactness of each cluster which makes feasible the application of perimeter control strategies. Therefore, a partitioning mechanism which consists of three consecutive algorithms, is designed to minimize the variance of link densities while maintaining the spatial compactness of the clusters. Firstly, an over segmenting of the network is provided by a sophisticated algorithm (Normalized Cut). Secondly, a merging algorithm is developed based on initial segmenting and a rough partitioning of the network is obtained. Finally, a boundary adjustment algorithm is designed to further improve the quality of partitioning by decreasing the variance of link densities while keeping the spatial compactness of the clusters. In addition, both density variance and shape smoothness metrics are introduced to identify the desired number of clusters and evaluate the partitioning results. These results show that both the objectives of small variance and spatial compactness can be achieved with this partitioning mechanism. A simulation in a real urban transportation network further demonstrates the superiority of the proposed method in effectiveness and robustness compared with other clustering algorithms.

354 citations


Cites background from "Pattern Recognition and Machine Lea..."

  • ...There is a vast literature on studying clustering algorithms and they generally fall into two large categories: hierarchical and partitional (Jain, 2010; Bishop, 2007)....

    [...]

Journal ArticleDOI
TL;DR: A blind source separation method for convolutive mixtures of speech/audio sources that can be applied to an underdetermined case where there are fewer microphones than sources is presented.
Abstract: This paper presents a blind source separation method for convolutive mixtures of speech/audio sources. The method can even be applied to an underdetermined case where there are fewer microphones than sources. The separation operation is performed in the frequency domain and consists of two stages. In the first stage, frequency-domain mixture samples are clustered into each source by an expectation-maximization (EM) algorithm. Since the clustering is performed in a frequency bin-wise manner, the permutation ambiguities of the bin-wise clustered samples should be aligned. This is solved in the second stage by using the probability on how likely each sample belongs to the assigned class. This two-stage structure makes it possible to attain a good separation even under reverberant conditions. Experimental results for separating four speech signals with three microphones under reverberant conditions show the superiority of the new method over existing methods. We also report separation results for a benchmark data set and live recordings of speech mixtures.

354 citations

Proceedings Article
22 Jul 2012
TL;DR: This paper proposes an approach by which individual appliances can be iteratively separated from an aggregate load, and evaluates the accuracy of the approach using the REDD data set, and shows the disaggregation performance when using the training approach is comparable to when sub-metered training data is used.
Abstract: Non-intrusive appliance load monitoring is the process of disaggregating a household's total electricity consumption into its contributing appliances. In this paper we propose an approach by which individual appliances can be iteratively separated from an aggregate load. Unlike existing approaches, our approach does not require training data to be collected by sub-metering individual appliances, nor does it assume complete knowledge of the appliances present in the household. Instead, we propose an approach in which prior models of general appliance types are tuned to specific appliance instances using only signatures extracted from the aggregate load. The tuned appliance models are then used to estimate each appliance's load, which is subsequently subtracted from the aggregate load. This process is applied iteratively until all appliances for which prior behaviour models are known have been disaggregated. We evaluate the accuracy of our approach using the REDD data set, and show the disaggregation performance when using our training approach is comparable to when sub-metered training data is used. We also present a deployment of our system as a live application and demonstrate the potential for personalised energy saving feedback.

353 citations


Cites background from "Pattern Recognition and Machine Lea..."

  • ...Our prior state transition matrix is sparse as it contains mostly zeros, therefore restricting the range of behaviours that it can represent (Bishop 2006)....

    [...]

Journal ArticleDOI
TL;DR: An account of the state of the art in acoustic scene classification (ASC), the task of classifying environments from the sounds they produce, and a range of different algorithms submitted for a data challenge to provide a general and fair benchmark for ASC techniques.
Abstract: In this article, we present an account of the state of the art in acoustic scene classification (ASC), the task of classifying environments from the sounds they produce. Starting from a historical review of previous research in this area, we define a general framework for ASC and present different implementations of its components. We then describe a range of different algorithms submitted for a data challenge that was held to provide a general and fair benchmark for ASC techniques. The data set recorded for this purpose is presented along with the performance metrics that are used to evaluate the algorithms and statistical significance tests to compare the submitted methods.

352 citations


Additional excerpts

  • ...Different partition methods have been proposed in the literature for this purpose [7]....

    [...]

Journal ArticleDOI
TL;DR: A machine learning framework based on logistic regression that is specifically designed to tackle the specifics of display advertising and provides models with state-of-the-art accuracy.
Abstract: Clickthrough and conversation rates estimation are two core predictions tasks in display advertising We present in this article a machine learning framework based on logistic regression that is specifically designed to tackle the specifics of display advertising The resulting system has the following characteristics: It is easy to implement and deploy, it is highly scalable (we have trained it on terabytes of data), and it provides models with state-of-the-art accuracy

349 citations