scispace - formally typeset
Search or ask a question
Author

Costas Xydeas

Other affiliations: BT Group, University of Manchester, Bell Labs  ...read more
Bio: Costas Xydeas is an academic researcher from Lancaster University. The author has contributed to research in topics: Image fusion & Speech coding. The author has an hindex of 19, co-authored 77 publications receiving 3093 citations. Previous affiliations of Costas Xydeas include BT Group & University of Manchester.


Papers
More filters
Journal ArticleDOI
TL;DR: Experimental results clearly indicate that this metric reflects the quality of visual information obtained from the fusion of input images and can be used to compare the performance of different image fusion algorithms.
Abstract: A measure for objectively assessing the pixel level fusion performance is defined. The proposed metric reflects the quality of visual information obtained from the fusion of input images and can be used to compare the performance of different image fusion algorithms. Experimental results clearly indicate that this metric is perceptually meaningful.

1,446 citations

Journal ArticleDOI
TL;DR: A novel approach to multiresolution signal-level image fusion is presented for accurately transferring visual information from any number of input image signals, into a single fused image without loss of information or the introduction of distortion.
Abstract: A novel approach to multiresolution signal-level image fusion is presented for accurately transferring visual information from any number of input image signals, into a single fused image without loss of information or the introduction of distortion. The proposed system uses a "fuse-then-decompose" technique realized through a novel, fusion/decomposition system architecture. In particular, information fusion is performed on a multiresolution gradient map representation domain of image signal information. At each resolution, input images are represented as gradient maps and combined to produce new, fused gradient maps. Fused gradient map signals are processed, using gradient filters derived from high-pass quadrature mirror filters to yield a fused multiresolution pyramid representation. The fused output image is obtained by applying, on the fused pyramid, a reconstruction process that is analogous to that of conventional discrete wavelet transform. This new gradient fusion significantly reduces the amount of distortion artefacts and the loss of contrast information usually observed in fused images obtained from conventional multiresolution fusion schemes. This is because fusion in the gradient map domain significantly improves the reliability of the feature selection and information fusion processes. Fusion performance is evaluated through informal visual inspection and subjective psychometric preference tests, as well as objective fusion performance measurements. Results clearly demonstrate the superiority of this new approach when compared to conventional fusion systems.

536 citations

PatentDOI
TL;DR: In this paper, the LPC (linear preductive coding) filter was used to reduce the error between the input and regenerated speech signals, and the selection process involved derivation of an initial estimate followed by an iterative adjustment process in which pulses having a low energy contribution were tested in alternative positions and transferred to them if a reduced error results.
Abstract: Speech is coded such that it can be generated by a pulse excitation sequence filtered by an LPC (linear preductive coding) filter. The sequence contains, in each of successive frame periods, pulses whose positions and amplitudes may be varied. These variables are selected at the coding end to reduce the error between the input and regenerated speech signals. The selection process involves derivation of an initial estimate followed by an iterative adjustment process in which pulses having a low energy contribution are tested in alternative positions and transferred to them if a reduced error results.

184 citations

Proceedings ArticleDOI
03 Apr 2000
TL;DR: In this article, a pixel level image fusion performance metric is proposed to measure the accuracy with which visual information is transferred from the input images to the fused image, which is perceptually meaningful.
Abstract: This paper addresses the issue of objectively measuring the performance of pixel level image fusion systems. The proposed fusion performance metric models the accuracy with which visual information is transferred from the input images to the fused image. Experimental results clearly indicate that the metric is perceptually meaningful.

137 citations

Patent
05 Aug 1982
TL;DR: In this article, the authors proposed a means for simultaneous transmission of data and speech with only a minimal expansion of the bandwidth of the speech signal, where a Fourier transform is performed on the speech signals and a predetermined number of phase components are replaced with data (d(n)) in an appropriate form.
Abstract: The present invention relates to a means for achieving simultaneous transmission of data and speech with only a minimal expansion of the bandwidth of the speech signal. A Fourier transform (14) is performed on the speech signal and a predetermined number of phase components are replaced with data (d(n)) in an appropriate form. The number of phase components replaced with data is determined by approximately classifying the speech (16) as either "silence", no data inserted; "unvoiced" speech, M phase components convey data; and "voiced" speech, J phase components convey data; where J is less than M, and M is not greater than the number of phase components in the message band of the speech signal. An inverse Fourier transform (22) is subsequently performed on the combined data and speech signal. The combined message signal (G(t)) will comprise approximately the same bandwidth as the original speech signal, by virtue of the frequency domain insertion of the data into the speech. At the receiver the signal is inspected and a classifier (38) determines if data is embedded in the received signal. If data is deemed embedded, a Fourier transformation is performed, the data carrying phase components are inspected, and the data signal regenerated in an appropriate form. The phase components used for the conveyance of data are replaced by random phase components, and the inverse Fourier transformation performed. Median filtering is employed to mitigate the effects of end-of-block distortion and yield the recovered speech signal.

121 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: The Photobook system is described, which is a set of interactive tools for browsing and searching images and image sequences that make direct use of the image content rather than relying on text annotations to provide a sophisticated browsing and search capability.
Abstract: We describe the Photobook system, which is a set of interactive tools for browsing and searching images and image sequences. These query tools differ from those used in standard image databases in that they make direct use of the image content rather than relying on text annotations. Direct search on image content is made possible by use of semantics-preserving image compression, which reduces images to a small set of perceptually-significant coefficients. We discuss three types of Photobook descriptions in detail: one that allows search based on appearance, one that uses 2-D shape, and a third that allows search based on textural properties. These image content descriptions can be combined with each other and with text-based descriptions to provide a sophisticated browsing and search capability. In this paper we demonstrate Photobook on databases containing images of people, video keyframes, hand tools, fish, texture swatches, and 3-D medical data.

1,748 citations

Patent
11 Jan 2011
TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Abstract: An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionally powered by external services with which the system can interact.

1,462 citations

Journal ArticleDOI
TL;DR: Experimental results clearly indicate that this metric reflects the quality of visual information obtained from the fusion of input images and can be used to compare the performance of different image fusion algorithms.
Abstract: A measure for objectively assessing the pixel level fusion performance is defined. The proposed metric reflects the quality of visual information obtained from the fusion of input images and can be used to compare the performance of different image fusion algorithms. Experimental results clearly indicate that this metric is perceptually meaningful.

1,446 citations

Journal ArticleDOI
TL;DR: Experimental results demonstrate that the proposed method can obtain state-of-the-art performance for fusion of multispectral, multifocus, multimodal, and multiexposure images.
Abstract: A fast and effective image fusion method is proposed for creating a highly informative fused image through merging multiple images. The proposed method is based on a two-scale decomposition of an image into a base layer containing large scale variations in intensity, and a detail layer capturing small scale details. A novel guided filtering-based weighted average technique is proposed to make full use of spatial consistency for fusion of the base and detail layers. Experimental results demonstrate that the proposed method can obtain state-of-the-art performance for fusion of multispectral, multifocus, multimodal, and multiexposure images.

1,300 citations