A fractal dimension based framework for night vision fusion

doi:10.1109/JAS.2018.7511102

Home
/
Papers
/
A fractal dimension based framework for night vision fusion

Journal Article•DOI•

A fractal dimension based framework for night vision fusion

Gaurav Bhatnagar¹, Q. M. Jonathan Wu²•Institutions (2)

Indian Institute of Technology, Jodhpur¹, University of Windsor²

01 Jan 2019-IEEE/CAA Journal of Automatica Sinica (IEEE)-Vol. 6, Iss: 1, pp 220-227

TL;DR: A novel fusion framework is proposed for night-vision applications such as pedestrian recognition, vehicle navigation and surveillance that is consistently superior to the conventional image fusion methods in terms of visual and quantitative evaluations.

read less

Abstract: In this paper, a novel fusion framework is proposed for night-vision applications such as pedestrian recognition, vehicle navigation and surveillance. The underlying concept is to combine low-light visible and infrared imagery into a single output to enhance visual perception. The proposed framework is computationally simple since it is only realized in the spatial domain. The core idea is to obtain an initial fused image by averaging all the source images. The initial fused image is then enhanced by selecting the most salient features guided from the root mean square error ( RMSE ) and fractal dimension of the visual and infrared images to obtain the final fused image. Extensive experiments on different scene imaginary demonstrate that it is consistently superior to the conventional image fusion methods in terms of visual and quantitative evaluations.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

SwinFusion: Cross-domain Long-range Learning for General Image Fusion via Swin Transformer

[...]

Jiayi Ma, Linfeng Tang, Fan Fan, Jun Huang, Xiaoguang Mei, Yong Ma - Show less +2 more

01 Jul 2022-IEEE/CAA Journal of Automatica Sinica

TL;DR: An attention-guided cross-domain module is devised to achieve sufficient integration of complementary information and global interaction, and an elaborate loss function, consisting of SSIM loss, texture loss, and intensity loss, drives the network to preserve abundant texture details and structural information, as well as presenting optimal apparent intensity.

...read moreread less

Abstract: This study proposes a novel general image fusion framework based on cross-domain long-range learning and Swin Transformer, termed as SwinFusion. On the one hand, an attention-guided cross-domain module is devised to achieve sufficient integration of complementary information and global interaction. More specifically, the proposed method involves an intra-domain fusion unit based on self-attention and an inter-domain fusion unit based on cross-attention, which mine and integrate long dependencies within the same domain and across domains. Through long-range dependency modeling, the network is able to fully implement domain-specific information extraction and cross-domain complementary information integration as well as maintaining the appropriate apparent intensity from a global perspective. In particular, we introduce the shifted windows mechanism into the self-attention and cross-attention, which allows our model to receive images with arbitrary sizes. On the other hand, the multi-scene image fusion problems are generalized to a unified framework with structure maintenance, detail preservation, and proper intensity control. Moreover, an elaborate loss function, consisting of SSIM loss, texture loss, and intensity loss, drives the network to preserve abundant texture details and structural information, as well as presenting optimal apparent intensity. Extensive experiments on both multi-modal image fusion and digital photography image fusion demonstrate the superiority of our SwinFusion compared to the state-of-the-art unified image fusion algorithms and task-specific alternatives. Implementation code and pre-trained weights can be accessed at https://github.com/Linfeng-Tang/SwinFusion.

...read moreread less

112 citations

Journal Article•DOI•

SwinFusion: Cross-domain Long-range Learning for General Image Fusion via Swin Transformer

[...]

01 Jul 2022-IEEE/CAA Journal of Automatica Sinica

TL;DR: Tang et al. as mentioned in this paper proposed a cross-domain long-range learning and Swin Transformer (SwinFusion) framework for image fusion, which achieved sufficient integration of complementary information and global interaction.

...read moreread less

111 citations

Journal Article•DOI•

Efficient and High-quality Recommendations via Momentum-incorporated Parallel Stochastic Gradient Descent-Based Learning

[...]

Xin Luo¹, Wen Qin², Ani Dong¹, Khaled Sedraoui³, MengChu Zhou⁴ - Show less +1 more•Institutions (4)

Dongguan University of Technology¹, Chongqing University of Posts and Telecommunications², King Abdulaziz University³, New Jersey Institute of Technology⁴

01 Feb 2021-IEEE/CAA Journal of Automatica Sinica

TL;DR: In this paper, a momentum-incorporated parallel stochastic gradient descent (MPSGD) algorithm is proposed to accelerate the convergence rate by integrating momentum effects into its training process.

...read moreread less

Abstract: A recommender system (RS) relying on latent factor analysis usually adopts stochastic gradient descent (SGD) as its learning algorithm. However, owing to its serial mechanism, an SGD algorithm suffers from low efficiency and scalability when handling large-scale industrial problems. Aiming at addressing this issue, this study proposes a momentum-incorporated parallel stochastic gradient descent (MPSGD) algorithm, whose main idea is two-fold: a) implementing parallelization via a novel data-splitting strategy, and b) accelerating convergence rate by integrating momentum effects into its training process. With it, an MPSGD-based latent factor (MLF) model is achieved, which is capable of performing efficient and high-quality recommendations. Experimental results on four high-dimensional and sparse matrices generated by industrial RS indicate that owing to an MPSGD algorithm, an MLF model outperforms the existing state-of-the-art ones in both computational efficiency and scalability.

...read moreread less

108 citations

Journal Article•DOI•

Glioma Segmentation-Oriented Multi-Modal MR Image Fusion With Adversarial Learning

[...]

Yu Biao Liu, Yu Shi, Fuhao Mu, Quan Cheng, Xun Chen - Show less +1 more

01 Aug 2022-IEEE/CAA Journal of Automatica Sinica

TL;DR: Wang et al. as discussed by the authors proposed a glioma segmentation-oriented multi-modal magnetic resonance (MR) image fusion method using an adversarial learning framework, which adopts a segmentation network as the discriminator to achieve more meaningful fusion results.

...read moreread less

Abstract: Dear Editor, In recent years, multi-modal medical image fusion has received widespread attention in the image processing community. However, existing works on medical image fusion methods are mostly devoted to pursuing high performance on visual perception and objective fusion metrics, while ignoring the specific purpose in clinical applications. In this letter, we propose a glioma segmentation-oriented multi-modal magnetic resonance (MR) image fusion method using an adversarial learning framework, which adopts a segmentation network as the discriminator to achieve more meaningful fusion results from the perspective of the segmentation task. Experimental results demonstrate the advantage of the proposed method over some state-of-the-art medical image fusion methods.

...read moreread less

12 citations

Journal Article•DOI•

Cascading Scene and Viewpoint Feature Learning for Pedestrian Gender Recognition

[...]

Lei Cai¹, Huanqiang Zeng¹, Jianqing Zhu¹, Jiuwen Cao², Yongtao Wang³, Kai-Kuang Ma⁴ - Show less +2 more•Institutions (4)

Huaqiao University¹, Hangzhou Dianzi University², Peking University³, Nanyang Technological University⁴

15 Feb 2021-IEEE Internet of Things Journal

TL;DR: Extensive experiments conducted on the commonly used pedestrian attribute data sets have demonstrated that the proposed CSVFL approach outperforms multiple recently reported pedestrian gender recognition methods.

...read moreread less

Abstract: Pedestrian gender recognition plays an important role in smart city. To effectively improve the pedestrian gender recognition performance, a new method, called cascading scene and viewpoint feature learning (CSVFL), is proposed in this article. The novelty of the proposed CSVFL lies on the joint consideration of two crucial challenges in pedestrian gender recognition, namely, scene and viewpoint variation. For that, the proposed CSVFL starts with the scene transfer (ST) scheme, followed by the viewpoint adaptation (VA) scheme in a cascading manner. Specifically, the ST scheme exploits the key pedestrian segmentation network to extract the key pedestrian masks for the subsequent key pedestrian transfer generative adversarial network, with the goal of encouraging the input pedestrian image to have the similar style to the target scene while preserving the image details of the key pedestrian as much as possible. Afterward, the obtained scene-transferred pedestrian images are fed to train the deep feature learning network with the VA scheme, in which each neuron will be enabled/disabled for different viewpoints depending on whether it has contribution on the corresponding viewpoint. Extensive experiments conducted on the commonly used pedestrian attribute data sets have demonstrated that the proposed CSVFL approach outperforms multiple recently reported pedestrian gender recognition methods.

...read moreread less

9 citations

References

PDF

Open Access

More filters

Book•

The Fractal Geometry of Nature

[...]

Benoit B. Mandelbrot

01 Jan 1982

TL;DR: This book is a blend of erudition, popularization, and exposition, and the illustrations include many superb examples of computer graphics that are works of art in their own right.

...read moreread less

Abstract: "...a blend of erudition (fascinating and sometimes obscure historical minutiae abound), popularization (mathematical rigor is relegated to appendices) and exposition (the reader need have little knowledge of the fields involved) ...and the illustrations include many superb examples of computer graphics that are works of art in their own right." Nature

...read moreread less

24,199 citations

Journal Article•DOI•

The Fractal Geometry of Nature

[...]

Benoit B. Mandelbrot

01 Jul 1984

TL;DR: A blend of erudition (fascinating and sometimes obscure historical minutiae abound), popularization (mathematical rigor is relegated to appendices) and exposition (the reader need have little knowledge of the fields involved) is presented in this article.

...read moreread less

7,560 citations

"A fractal dimension based framework..." refers background in this paper

...In reference to the images, it provides variations in the features resulting from changes in the scale and hence acts as the texture masking function [16], [17]....
[...]

Journal Article•DOI•

Multisensor image fusion using the wavelet transform

[...]

Hui Li¹, B.S. Manjunath¹, Sanjit K. Mitra¹•Institutions (1)

University of California, Santa Barbara¹

01 May 1995-Graphical Models and Image Processing

TL;DR: In this article, an image fusion scheme based on the wavelet transform is presented, where wavelet transforms of the input images are appropriately combined, and the new image is obtained by taking the inverse wavelet transformation of the fused wavelet coefficients.

...read moreread less

1,532 citations

Journal Article•DOI•

Objective image fusion performance measure

[...]

Costas Xydeas¹, Vladimir Petrovic²•Institutions (2)

Lancaster University¹, University of Manchester²

17 Feb 2000-Electronics Letters

TL;DR: Experimental results clearly indicate that this metric reflects the quality of visual information obtained from the fusion of input images and can be used to compare the performance of different image fusion algorithms.

...read moreread less

Abstract: A measure for objectively assessing the pixel level fusion performance is defined. The proposed metric reflects the quality of visual information obtained from the fusion of input images and can be used to compare the performance of different image fusion algorithms. Experimental results clearly indicate that this metric is perceptually meaningful.

...read moreread less

1,446 citations

Journal Article•DOI•

Information measure for performance of image fusion

[...]

Guihong Qu¹, Dali Zhang¹, P. Yan¹•Institutions (1)

Tsinghua University¹

28 Mar 2002-Electronics Letters

TL;DR: The results show that the measure represents how much information is obtained from the input images and is meaningful and explicit.

...read moreread less

Abstract: Mutual information is proposed as an information measure for evaluating image fusion performance. The proposed measure represents how much information is obtained from the input images. No assumption is made regarding the nature of the relation between the intensities in both input modalities. The results show that the measure is meaningful and explicit.

...read moreread less

1,059 citations