Showing papers in "Journal of Visual Communication and Image Representation in 2019"

PDF

Open Access

Journal Article•DOI•

SphereReID: Deep hypersphere manifold embedding for person re-identification

[...]

Xing Fan¹, Wei Jiang¹, Hao Luo¹, Mengjuan Fei¹•Institutions (1)

01 Apr 2019-Journal of Visual Communication and Image Representation

TL;DR: A convolutional neural network called SphereReID is proposed adopting Sphere Softmax and training a single model end-to-end with a new warming-up learning rate schedule on four challenging datasets including Market-1501, DukeMTMC-reID, CHHK-03, and CUHK-SYSU.

...read moreread less

163 citations

Journal Article•DOI•

Interpretable convolutional neural networks via feedforward design

[...]

C.-C. Jay Kuo¹, Min Zhang¹, Siyang Li¹, Jiali Duan¹, Yueru Chen¹ - Show less +1 more•Institutions (1)

University of Southern California¹

01 Apr 2019-Journal of Visual Communication and Image Representation

TL;DR: In this paper, the authors proposed an interpretable feedforward (FF) design without any backpropagation, which adopts a data-centric approach to derive network parameters of the current layer based on data statistics from the output of the previous layer in a one-pass manner.

...read moreread less

105 citations

Journal Article•DOI•

A survey on image tampering and its detection in real-world photos

[...]

Lilei Zheng¹, Ying Zhang¹, Vrizlynn L. L. Thing¹•Institutions (1)

Institute for Infocomm Research Singapore¹

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: This survey provides an overview on typical image tampering types, released image tampering datasets and recent tampering detection approaches and encourages the research community to develop general tampering localization methods in the future instead of adhering to single-type tampering detection.

...read moreread less

103 citations

Journal Article•DOI•

Multi-camera transfer GAN for person re-identification

[...]

Shuren Zhou¹, Maolin Ke¹, Peng Luo¹•Institutions (1)

Changsha University of Science and Technology¹

01 Feb 2019-Journal of Visual Communication and Image Representation

TL;DR: A method of image-to-image translation, CTGAN (Multi-Camera Transfer GAN), which can be performed on multiple camera domains of pedestrian dataset by using one single model, and adopts the MSCDA (Mixed Selective Convolution Descriptor Aggregation) method, which can locate the main pedestrian objects in the image, filter out the background noise, and keep the useful depth descriptor.

...read moreread less

101 citations

Journal Article•DOI•

Dimension reduction of image deep feature using PCA

[...]

Ji Ma¹, Yuyu Yuan¹•Institutions (1)

Beijing University of Posts and Telecommunications¹

01 Aug 2019-Journal of Visual Communication and Image Representation

TL;DR: This paper optimizes the PCA algorithm for dimension reduction of image feature extraction by deep learning, aiming at the problem that it is difficult to process high-dimensional sparse big data based on PCA algorithms.

...read moreread less

88 citations

Journal Article•DOI•

AFIF4: Deep Gender Classification based on AdaBoost-based Fusion of Isolated Facial Features and Foggy Faces

[...]

Mahmoud Afifi¹, Mahmoud Afifi², Abdelrahman Abdelhamed¹, Abdelrahman Abdelhamed²•Institutions (2)

Assiut University¹, York University²

01 Jul 2019-Journal of Visual Communication and Image Representation

TL;DR: In this paper, the combination of isolated facial components and a contextual feature called foggy face is used to train deep convolutional neural networks followed by an AdaBoost-based score fusion to infer the final gender class.

...read moreread less

85 citations

Journal Article•DOI•

Face recognition based on genetic algorithm

[...]

Hui Zhi¹, Hui Zhi², Sanyang Liu¹•Institutions (2)

Xidian University¹, Xi'an University of Architecture and Technology²

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: An effective face recognition model based on principal component analysis, genetic algorithm and support vector machine is established, in which principal components analysis is used to reduce feature dimension, genetic algorithms are used to optimize search strategy, and support vectors machine isUsed to realize classification.

...read moreread less

81 citations

Journal Article•DOI•

Video facial emotion recognition based on local enhanced motion history image and CNN-CTSLSTM networks

[...]

Min Hu¹, Haowen Wang¹, Xiaohua Wang¹, Juan Yang¹, Ronggui Wang¹ - Show less +1 more•Institutions (1)

Hefei University of Technology¹

01 Feb 2019-Journal of Visual Communication and Image Representation

TL;DR: Experiments on the AFEW, CK+ and MMI datasets using subject-independent validation scheme demonstrate that the integrated framework of two networks achieves a better performance than using individual network separately, compared with state-of-the-arts methods.

...read moreread less

77 citations

Journal Article•DOI•

Scene graph captioner: Image captioning based on structural visual representation

[...]

Ning Xu¹, An-An Liu¹, Jing Liu¹, Weizhi Nie¹, Yuting Su¹ - Show less +1 more•Institutions (1)

Tianjin University¹

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: This work proposes a novel framework to embed a scene graph into the structural representation, which captures the semantic concepts and the graph topology and develops the scene-graph-driven method to generate the attention graph.

...read moreread less

75 citations

Journal Article•DOI•

An artificial intelligence based data-driven approach for design ideation

[...]

Liuqing Chen¹, Pan Wang¹, Hao Dong¹, Feng Shi¹, Ji Han², Yike Guo¹, Peter R.N. Childs¹, Jun Xiao³, Chao Wu³ - Show less +5 more•Institutions (3)

Imperial College London¹, University of Liverpool², Zhejiang University³

01 May 2019-Journal of Visual Communication and Image Representation

TL;DR: An integrated approach for enhancing design ideation by applying artificial intelligence and data mining techniques, which consists of two models, a semantic ideation network and a visual concepts combination model, which provide inspiration semantically and visually based on computational creativity theory.

...read moreread less

70 citations

Journal Article•DOI•

High-capacity reversible data hiding in encrypted images based on extended run-length coding and block-based MSB plane rearrangement

[...]

Kaimeng Chen¹, Chin-Chen Chang²•Institutions (2)

Jimei University¹, Feng Chia University²

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: The experimental results prove that the proposed novel reversible data hiding method can reach a high embedding rate and a high PSNR.

...read moreread less

Journal Article•DOI•

Hyperspectral remote sensing image change detection based on tensor and deep learning

[...]

Fenghua Huang, Ying Yu, Tinghao Feng¹•Institutions (1)

University of North Carolina at Charlotte¹

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: Experimental results demonstrate that TRS-DBN has higher change detection accuracy than similar methods and a good automation level.

...read moreread less

Journal Article•DOI•

Iterative fusion convolutional neural networks for classification of optical coherence tomography images

[...]

Leyuan Fang¹, Yuxuan Jin¹, Laifeng Huang¹, Siyu Guo¹, Guangzhe Zhao², Xiangdong Chen¹ - Show less +2 more•Institutions (2)

Hunan University¹, Beijing University of Civil Engineering and Architecture²

01 Feb 2019-Journal of Visual Communication and Image Representation

TL;DR: Experimental results on a real retinal OCT dataset and a musculoskeletal radiographs dataset demonstrate the superiority of the proposed convolutional neural network method over the traditional CNN and several well-known OCT classification methods.

...read moreread less

Journal Article•DOI•

A mix-pooling CNN architecture with FCRF for brain tumor segmentation

[...]

Jie Chang¹, Jie Chang², Luming Zhang³, Naijie Gu¹, Xiaoci Zhang¹, Minquan Ye², Rongzhang Yin², Qianqian Meng⁴ - Show less +4 more•Institutions (4)

University of Science and Technology of China¹, Wannan Medical College², Zhejiang University³, Capital Medical University⁴

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: A two-pathway model with average and max pooling layers in different paths is designed and combined with fully connected CRF(FCRF) as a mixture model to introduce the global context information to optimize prediction results.

...read moreread less

Journal Article•DOI•

Mesoscopic structure PFC∼2D model of soil rock mixture based on digital image

[...]

Pengfei Shan¹, Xingping Lai¹•Institutions (1)

Xi'an University of Science and Technology¹

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: The PFC∼2D numerical calculation model of soil-rock mixtures is established and the results show that when the stone content is 80%, the analysis should be caused by the large amount of rock, which leads to the large internal voids, and the sudden unloading between the rock and the rock during compaction and then the structural reorganization.

...read moreread less

Journal Article•DOI•

Ghost-free multi exposure image fusion technique using dense SIFT descriptor and guided filter

[...]

Naila Hayat, Muhammad Imran

01 Jul 2019-Journal of Visual Communication and Image Representation

TL;DR: Experimental results prove the superiority of the proposed technique over existing state-of-the-art methods in terms of both subjective and objective evaluation.

...read moreread less

Journal Article•DOI•

Learning spatiotemporal representations for human fall detection in surveillance video

[...]

Yongqiang Kong¹, Jianhui Huang², Shanshan Huang³, Zhengang Wei³, Shengke Wang³ - Show less +1 more•Institutions (3)

Beihang University¹, Shandong University of Science and Technology², Ocean University of China³

01 Feb 2019-Journal of Visual Communication and Image Representation

TL;DR: A computer vision based framework is proposed that detects falls from surveillance videos by employing background subtraction and rank pooling to model spatial and temporal representations in videos, and introducing a novel three-stream Convolutional Neural Networks as an event classifier.

...read moreread less

Journal Article•DOI•

Development of a N-type GM-PHD filter for multiple target, multiple type visual tracking

[...]

Nathanael L. Baisa¹, Andrew M. Wallace¹•Institutions (1)

Heriot-Watt University¹

01 Feb 2019-Journal of Visual Communication and Image Representation

TL;DR: Under Gaussianity and linearity assumptions, the existing Gaussian mixture implementation of the standard PHD filter is extended to create a N-type GM-PHD filter, and Munkres's variant of the Hungarian assignment algorithm is used to associate tracked target identities between frames.

...read moreread less

Journal Article•DOI•

Band reordering heuristics for lossless satellite image compression with 3D-CALIC and CCSDS

[...]

Masud Ibn Afjal¹, Md. Al Mamun¹, Md. Palash Uddin¹•Institutions (1)

Rajshahi University of Engineering & Technology¹

01 Feb 2019-Journal of Visual Communication and Image Representation

TL;DR: Three different methods namely Band Reordering based on Consecutive Continuity Breakdown Heuristics (BRCCBH), Band Re ordering based on Weighted-Correlation Heuristic (BRWCH) and Segmented BRCCBh have been proposed for the compression of multispectral, hyperspectral and hyperspectrals sounder data.

...read moreread less

Journal Article•DOI•

SRLibrary: Comparing different loss functions for super-resolution over various convolutional architectures

[...]

Yildiray Anagun¹, Sahin Isik¹, Erol Seke¹•Institutions (1)

Eskişehir Osmangazi University¹

01 May 2019-Journal of Visual Communication and Image Representation

TL;DR: Charbonnier and L1 loss functions are fastest ones when the computational time cost is examined during training stage, and both are sensitive to noise that misleads the learning process and consequently resulting in lower quality HR outcomes.

...read moreread less

Journal Article•DOI•

A method of processing color image watermarking based on the Haar wavelet

[...]

Jianyu Wang¹, Zhiguo Du¹•Institutions (1)

Southwest University¹

01 Oct 2019-Journal of Visual Communication and Image Representation

TL;DR: The algorithm presented in this article can well embed the color image in the carrier image, and has good resistance to attack operations such as loss compression and adding of noise.

...read moreread less

Journal Article•DOI•

Depth-aware saliency detection using convolutional neural networks

[...]

Ding Yu¹, Zhi Liu¹, Mengke Huang¹, Ran Shi², Xiangyang Wang¹ - Show less +1 more•Institutions (2)

Shanghai University¹, Nanjing University of Science and Technology²

01 May 2019-Journal of Visual Communication and Image Representation

TL;DR: A new end-to-end depth-aware saliency model using three convolutional neural networks including color saliency network, depth Saliency network and saliency fusion network, for saliency detection in RGBD images and stereoscopic images is proposed.

...read moreread less

Journal Article•DOI•

EVS-DK: Event video skimming using deep keyframe

[...]

Krishan Kumar

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: This work proposes an event summarization technique using Deep learning framework for monocular videos that outperforms the state-of-the-art models on Precision and F-measure and also cover the major contents of the original video.

...read moreread less

Journal Article•DOI•

A weighted edge-based level set method based on multi-local statistical information for noisy image segmentation

[...]

Cheng Liu¹, Weibin Liu¹, Weiwei Xing¹•Institutions (1)

Beijing Jiaotong University¹

01 Feb 2019-Journal of Visual Communication and Image Representation

TL;DR: A weighted edge-based level set method based on multi-local statistical information to better segment noisy images and provides higher segmentation accuracies and more accurate segmentation results, which demonstrate its effectiveness and robustness.

...read moreread less

Journal Article•DOI•

A novel framework for semantic segmentation with generative adversarial network

[...]

Xiaobin Zhu¹, Xiaobin Zhu², Xinming Zhang², Xiaoyu Zhang³, Ziyu Xue⁴, Lei Wang⁴ - Show less +2 more•Institutions (4)

University of Science and Technology Beijing¹, Beijing Technology and Business University², Chinese Academy of Sciences³, Information Technology Institute⁴

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: A novel post-processing method based on GAN (Generative Adversarial Network) is explored to reinforce spatial contiguity in the output label maps to get better performance and stability.

...read moreread less

Journal Article•DOI•

Influence of CT scanning parameters on rock and soil images

[...]

Pengfei Shan¹, Xingping Lai¹•Institutions (1)

Xi'an University of Science and Technology¹

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: The results show that the scanning voltage and filtering function have great influence on CT images and CT numbers of rock and soil samples, and with the help of reasonable CT scanning parameters, the quality of the geotechnical CT image can be improved and the relatively accurate geotehnical CT value can be obtained.

...read moreread less

Journal Article•DOI•

Sharing hand gesture and sketch cues in remote collaboration

[...]

Weidong Huang¹, Seungwon Kim², Mark Billinghurst², Leila Alem•Institutions (2)

Swinburne University of Technology¹, University of South Australia²

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: A user study comparing remote collaboration with an interface that combined hand gestures and sketching to one that only used hand gestures, when solving two tasks; Lego assembly and repairing a laptop found that adding sketch cues improved the task completion time, only with the repairing task.

...read moreread less

Journal Article•DOI•

A review on classifying abnormal behavior in crowd scene

[...]

A. A. Afiq¹, Mohd Azman Zakariya¹, Mohamad Naufal Mohamad Saad¹, A. A. Nurfarzana¹, M. H. Md Khir¹, A. F. Fadzil¹, A. Jale¹, W. Gunawan¹, Z. A. A. Izuddin¹, M. Faizari¹ - Show less +6 more•Institutions (1)

Universiti Teknologi Petronas¹

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: A review of crowd behavior analysis methods including Gaussian Mixture Model (GMM), Hidden Markov Model (HMM), Optical Flow method and Spatio-Temporal Technique (STT) to provide insight on several detection methods.

...read moreread less

Journal Article•DOI•

ImmerTai: Immersive Motion Learning in VR Environments

[...]

Xiaoming Chen¹, Zhibo Chen¹, Ye Li¹, Tianyu He¹, Junhui Hou², Sen Liu¹, Ying He³ - Show less +3 more•Institutions (3)

University of Science and Technology of China¹, City University of Hong Kong², Nanyang Technological University³

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: The results show that ImmerTai can accelerate the learning process of students noticeably compared to non-immersive learning with the conventional PC setup, and there is a substantial difference in the quality of the learnt motion between CAVE and HMD compared to PC.

...read moreread less

Journal Article•DOI•

Boosting content based image retrieval performance through integration of parametric & nonparametric approaches

[...]

Soumya Prakash Rana¹, Maitreyee Dey¹, Patrick Siarry²•Institutions (2)

London South Bank University¹, University of Paris-Est²

01 Jan 2019-Journal of Visual Communication and Image Representation

TL;DR: The research addresses that point for content based image retrieval (CBIR) by fusing parametric color and shape features with nonparametric texture feature to propose a robust and effective algorithm.

...read moreread less

Collapse