Showing papers in "Pattern Recognition in 2022"
TL;DR: Wang et al. as mentioned in this paper proposed edge computing based video pre-processing to eliminate redundant frames, migrating part or all of the video processing task to the edge, thereby reducing the computing, storage and network bandwidth requirements of the cloud center and enhancing the effectiveness of video analysis.
Abstract: In Internet of Things enabled intelligent transportation systems, a huge amount of vehicle video data is generated, and real-time, accurate video analysis is an important and challenging task, especially in complex street scenes. We therefore propose edge computing based video pre-processing to eliminate redundant frames, migrating part or all of the video processing task to the edge, thereby reducing the computing, storage and network bandwidth requirements of the cloud center and enhancing the effectiveness of video analysis. To eliminate redundancy in traffic video, we present motion-magnitude detection based on spatio-temporal interest points (STIP) and a multi-modal linear feature combination that splits a video into super-frame segments of interest. We then select key frames from these segments of interest in long videos through the design and detection of prominent regions. Finally, extensive numerical experiments show that our methods outperform previous algorithms at the various stages of redundancy elimination, video segmentation, key frame selection and vehicle detection.
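For intuition, here is a minimal Python sketch of the general pipeline the abstract describes: score each frame by motion magnitude, group contiguous high-motion frames into segments of interest, and pick one key frame per segment. The paper's actual method uses STIP-based motion magnitude, a multi-modal linear feature combination and prominent-region detection; the simple frame-difference score and thresholds below are illustrative stand-ins.

```python
# A minimal sketch (not the paper's implementation) of the general idea:
# score frames by motion, keep contiguous high-motion runs as segments of
# interest, and pick one key frame per segment.
import numpy as np

def motion_scores(frames: np.ndarray) -> np.ndarray:
    """frames: (T, H, W) grayscale video; returns a per-frame motion score."""
    diffs = np.abs(np.diff(frames.astype(np.float32), axis=0)).mean(axis=(1, 2))
    return np.concatenate([[0.0], diffs])          # first frame gets score 0

def segments_of_interest(scores, thresh):
    """Group consecutive frames whose score exceeds `thresh` into segments."""
    segs, start = [], None
    for t, s in enumerate(scores):
        if s >= thresh and start is None:
            start = t
        elif s < thresh and start is not None:
            segs.append((start, t)); start = None
    if start is not None:
        segs.append((start, len(scores)))
    return segs

def key_frames(scores, segs):
    """Pick the highest-scoring frame inside each segment as its key frame."""
    return [int(np.argmax(scores[a:b]) + a) for a, b in segs]

# toy usage with a random "video"
video = np.random.rand(100, 64, 64)
s = motion_scores(video)
segs = segments_of_interest(s, thresh=s.mean())
print(segs, key_frames(s, segs))
```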
91 citations
TL;DR: Zhang et al. as mentioned in this paper proposed a novel approach to estimate both aleatoric and epistemic uncertainties for stereo matching in an end-to-end way, where the uncertainty parameters are predicted for each potential disparity and then averaged via the guidance of matching probability distribution.
Abstract: Although deep learning-based stereo matching approaches have achieved excellent performance in recent years, it is still a non-trivial task to estimate the uncertainty of the produced disparity map. In this paper, we propose a novel approach to estimate both aleatoric and epistemic uncertainties for stereo matching in an end-to-end way. We introduce an evidential distribution, named the Normal Inverse-Gamma (NIG) distribution, whose parameters can be used to calculate the uncertainty. Instead of being directly regressed from aggregated features, the uncertainty parameters are predicted for each potential disparity and then averaged via the guidance of the matching probability distribution. Furthermore, considering the sparsity of ground truth in real scene datasets, we design two additional losses. The first tries to enlarge uncertainty on incorrect predictions, so uncertainty becomes more sensitive to erroneous regions. The second enforces the smoothness of the uncertainty in regions with smooth disparity. Most stereo matching models, such as PSM-Net, GA-Net, and AA-Net, can be easily integrated with our approach. Experiments on multiple benchmark datasets show that our method improves stereo matching results. We prove that both aleatoric and epistemic uncertainties are well calibrated with incorrect predictions. In particular, our method can capture increased epistemic uncertainty on out-of-distribution data, making it effective at preventing a system from potentially fatal consequences. Code is available at https://github.com/Dawnstar8411/StereoMatching-Uncertainty.
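As a rough illustration of how the NIG parameterization yields the two uncertainties, the numpy sketch below averages per-disparity NIG parameters (gamma, nu, alpha, beta) with a matching probability distribution and then reads off aleatoric and epistemic uncertainty using the standard evidential-regression formulas; the tensor names, shapes and exact aggregation are assumptions, not the authors' code.

```python
import numpy as np

def nig_uncertainties(gamma, nu, alpha, beta, match_prob):
    """
    gamma, nu, alpha, beta: (D, H, W) NIG parameters, one set per candidate disparity.
    match_prob:             (D, H, W) softmax matching probabilities over disparities.
    Returns per-pixel (aleatoric, epistemic) uncertainty maps of shape (H, W).
    """
    # probability-weighted average of the NIG parameters over the disparity axis
    g = (match_prob * gamma).sum(0)
    v = (match_prob * nu).sum(0)
    a = (match_prob * alpha).sum(0)
    b = (match_prob * beta).sum(0)
    aleatoric = b / (a - 1.0)          # E[sigma^2]
    epistemic = b / (v * (a - 1.0))    # Var[mu]
    return aleatoric, epistemic

# toy usage with random parameters (alpha > 1, nu > 0 so the formulas are defined)
D, H, W = 8, 4, 4
logits = np.random.rand(D, H, W)
p = np.exp(logits) / np.exp(logits).sum(0, keepdims=True)
al, ep = nig_uncertainties(np.random.rand(D, H, W), 1 + np.random.rand(D, H, W),
                           2 + np.random.rand(D, H, W), 1 + np.random.rand(D, H, W), p)
print(al.shape, ep.shape)
```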
74 citations
TL;DR: In this paper, a neighborhood linear discriminant analysis (nLDA) is proposed, in which the scatter matrices are defined on a neighborhood consisting of reverse nearest neighbors and the neighborhood can be naturally regarded as the smallest subclass.
Abstract: Linear Discriminant Analysis (LDA) assumes that all samples from the same class are independently and identically distributed (i.i.d.). LDA may fail in cases where this assumption does not hold. In particular, when a class contains several clusters (or subclasses), LDA cannot correctly depict the internal structure, because the scatter matrices that LDA relies on are defined at the class level. To mitigate this problem, this paper proposes a neighborhood linear discriminant analysis (nLDA) in which the scatter matrices are defined on a neighborhood consisting of reverse nearest neighbors. Thus, the new discriminator does not need the i.i.d. assumption. In addition, the neighborhood can be naturally regarded as the smallest subclass, which is easier to obtain than a subclass without resorting to any clustering algorithm. The projected directions are sought to make the within-neighborhood scatter as small as possible and the between-neighborhood scatter as large as possible, simultaneously. The experimental results show that nLDA performs significantly better than previous discriminators, such as LDA, LFDA, ccLDA, LM-NNDA, and ℓ2,1-RLDA.
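A compact numpy/scipy sketch of the core idea follows: build each sample's neighborhood from its reverse nearest neighbors, accumulate within- and between-neighborhood scatter matrices, and take the leading generalized eigenvectors as projection directions. The class-label handling, exact neighborhood definition and regularization details of nLDA are simplified here.

```python
# Illustrative sketch, not the authors' code.
import numpy as np
from scipy.linalg import eigh

def nlda_fit(X, k=5, dim=2):
    n, d = X.shape
    # pairwise distances and k-nearest-neighbor lists (excluding self)
    D = np.linalg.norm(X[:, None] - X[None, :], axis=-1)
    np.fill_diagonal(D, np.inf)
    knn = np.argsort(D, axis=1)[:, :k]
    # reverse nearest neighbors: j is in RNN(i) iff i is in kNN(j)
    rnn = [np.where((knn == i).any(axis=1))[0] for i in range(n)]
    Sw = np.zeros((d, d)); Sb = np.zeros((d, d)); mu = X.mean(0)
    for i in range(n):
        nbhd = np.concatenate([[i], rnn[i]])            # neighborhood = sample + its RNNs
        m_i = X[nbhd].mean(0)
        diff = X[nbhd] - m_i
        Sw += diff.T @ diff                             # within-neighborhood scatter
        Sb += len(nbhd) * np.outer(m_i - mu, m_i - mu)  # between-neighborhood scatter
    # generalized eigenproblem: directions maximising Sb relative to Sw
    w, V = eigh(Sb, Sw + 1e-6 * np.eye(d))
    return V[:, ::-1][:, :dim]                          # top-`dim` projection directions

W = nlda_fit(np.random.randn(60, 10))
print(W.shape)   # (10, 2)
```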
64 citations
TL;DR: A novel convolution autoencoder architecture that can dissociate the spatio-temporal representation to separately capture the spatial and the temporal information is explored, since abnormal events are usually different from the normality in appearance and/or motion behavior.
Abstract: Anomaly detection in videos remains a challenging task due to the ambiguous definition of anomaly and the complexity of visual scenes in real video data. Different from previous work, which utilizes reconstruction or prediction as an auxiliary task to learn temporal regularity, in this work we explore a novel convolution autoencoder architecture that can dissociate the spatio-temporal representation to capture the spatial and the temporal information separately, since abnormal events usually differ from normality in appearance and/or motion behavior. Specifically, the spatial autoencoder models normality in the appearance feature space by learning to reconstruct the input of the first individual frame (FIF), while the temporal part takes the first four consecutive frames as input and the RGB difference as output to simulate the motion of optical flow in an efficient way. Abnormal events, which are irregular in appearance or in motion behavior, lead to a large reconstruction error. To improve detection performance on fast-moving outliers, we exploit a variance-based attention module and insert it into the motion autoencoder to highlight large movement areas. In addition, we propose a deep K-means clustering strategy to force the spatial and the motion encoders to extract a compact representation. Extensive experiments on several publicly available datasets have demonstrated the effectiveness of our method, which achieves state-of-the-art performance. The code is publicly released.
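The PyTorch sketch below illustrates the dissociated two-stream design at a toy scale: a spatial autoencoder reconstructs the first individual frame, a motion autoencoder maps the first four frames to an RGB-difference target, and the anomaly score is the combined reconstruction error. The layer sizes, the RGB-difference definition, the variance-based attention module and the deep K-means term are omitted or simplified and are not the released architecture.

```python
import torch
import torch.nn as nn

def conv_ae(in_ch, out_ch):
    # tiny convolutional encoder-decoder, purely illustrative
    return nn.Sequential(
        nn.Conv2d(in_ch, 32, 3, stride=2, padding=1), nn.ReLU(),
        nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
        nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
        nn.ConvTranspose2d(32, out_ch, 4, stride=2, padding=1),
    )

spatial_ae = conv_ae(in_ch=3, out_ch=3)        # FIF -> reconstructed FIF
motion_ae  = conv_ae(in_ch=12, out_ch=3)       # 4 stacked RGB frames -> RGB difference

def anomaly_score(clip):                        # clip: (B, 5, 3, H, W), 5 consecutive frames
    fif = clip[:, 0]                            # first individual frame
    first4 = clip[:, :4].flatten(1, 2)          # (B, 12, H, W)
    rgb_diff = clip[:, 4] - clip[:, 3]          # crude stand-in for the RGB-difference target
    err_s = ((spatial_ae(fif) - fif) ** 2).mean(dim=(1, 2, 3))
    err_m = ((motion_ae(first4) - rgb_diff) ** 2).mean(dim=(1, 2, 3))
    return err_s + err_m                        # larger error -> more anomalous

print(anomaly_score(torch.rand(2, 5, 3, 64, 64)).shape)   # torch.Size([2])
```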
61 citations
TL;DR: In this article, a multi-scale visual transformer model, referred to as GasHis-Transformer, is proposed for Gastric Histopathological Image Detection (GHID), which enables the automatic global detection of gastric cancer images.
Abstract: In this paper, a multi-scale visual transformer model, referred to as GasHis-Transformer, is proposed for Gastric Histopathological Image Detection (GHID), which enables the automatic global detection of gastric cancer images. The GasHis-Transformer model consists of two key modules designed to extract global and local information, using a position-encoded transformer model and a convolutional neural network with local convolution, respectively. A publicly available hematoxylin and eosin (H&E) stained gastric histopathological image dataset is used in the experiments. Furthermore, a DropConnect based lightweight network is proposed to reduce the model size and training time of GasHis-Transformer for clinical applications with improved confidence. Moreover, a series of contrast and extended experiments verify the robustness, extensibility and stability of GasHis-Transformer. In conclusion, GasHis-Transformer demonstrates high global detection performance and shows significant potential in the GHID task.
61 citations
TL;DR: In this article, a multi-view deep autoencoder model is proposed to fuse the spectral and spatial features extracted from the hyperspectral image into a joint latent representation space.
Abstract: The classification of hyperspectral images is a challenging task due to the high-dimensional space, with a large number of spectral bands, and the low number of labeled training samples. To overcome these challenges, we propose a novel methodology for hyperspectral image classification based on multi-view deep neural networks that fuses both spectral and spatial features using only a small number of labeled samples. First, we process the initial hyperspectral image to extract a set of spectral and spatial features. Each spectral vector is the spectral signature of a pixel of the image. The spatial features are extracted using a simple deep autoencoder, which reduces the high dimensionality of the data while taking into account the neighborhood region of each pixel. Second, we propose a multi-view deep autoencoder model that fuses the spectral and spatial features extracted from the hyperspectral image into a joint latent representation space. Finally, a semi-supervised graph convolutional network is trained on the fused latent representation space to perform hyperspectral image classification. The main advantage of the proposed approach is that it allows the automatic extraction of relevant information while preserving the spatial and spectral features of the data, and improves the classification of hyperspectral images even when the number of labeled samples is low. Experiments are conducted on three real hyperspectral image datasets: Indian Pines, Salinas, and Pavia University. Results show that the proposed approach is competitive in classification performance with the state of the art.
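A schematic PyTorch sketch of the multi-view fusion step is given below: two encoders map the spectral vector and the spatial feature vector into a joint latent code from which both views are reconstructed; the joint code would then feed the semi-supervised GCN. Dimensions and the fusion rule are illustrative assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn

class MultiViewAE(nn.Module):
    def __init__(self, spec_dim=200, spat_dim=128, latent_dim=32):
        super().__init__()
        self.enc_spec = nn.Sequential(nn.Linear(spec_dim, 64), nn.ReLU(), nn.Linear(64, latent_dim))
        self.enc_spat = nn.Sequential(nn.Linear(spat_dim, 64), nn.ReLU(), nn.Linear(64, latent_dim))
        self.fuse     = nn.Linear(2 * latent_dim, latent_dim)      # joint latent representation
        self.dec_spec = nn.Sequential(nn.Linear(latent_dim, 64), nn.ReLU(), nn.Linear(64, spec_dim))
        self.dec_spat = nn.Sequential(nn.Linear(latent_dim, 64), nn.ReLU(), nn.Linear(64, spat_dim))

    def forward(self, spec, spat):
        z = self.fuse(torch.cat([self.enc_spec(spec), self.enc_spat(spat)], dim=1))
        return self.dec_spec(z), self.dec_spat(z), z               # z would feed the downstream GCN

# toy usage: reconstruct both views from the joint code
model = MultiViewAE()
spec, spat = torch.rand(8, 200), torch.rand(8, 128)
rec_spec, rec_spat, z = model(spec, spat)
loss = nn.functional.mse_loss(rec_spec, spec) + nn.functional.mse_loss(rec_spat, spat)
print(z.shape, float(loss))   # torch.Size([8, 32]) ...
```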
58 citations
TL;DR: SARS-Net as mentioned in this paper is a CADx system combining graph convolutional networks and Convolutional Neural Networks for detecting abnormalities in a patient's CXR images for presence of severe acute respiratory syndrome coronavirus.
Abstract: COVID-19 has emerged as one of the deadliest pandemics that has ever crept upon humanity. Screening tests are currently the most reliable and accurate steps in detecting severe acute respiratory syndrome coronavirus in a patient, and the most used is RT-PCR testing. Various researchers and early studies implied that visual indicators (abnormalities) in a patient's chest X-ray (CXR) or computed tomography (CT) imaging were a valuable characteristic of a COVID-19 patient that can be leveraged to detect the virus in a vast population. Motivated by various contributions from the open-source community to tackle the COVID-19 pandemic, we introduce SARS-Net, a CADx system combining Graph Convolutional Networks and Convolutional Neural Networks for detecting abnormalities in a patient's CXR images that indicate the presence of COVID-19 infection. In this paper, we introduce and evaluate the performance of a custom-made deep learning architecture, SARS-Net, to classify and detect chest X-ray images for COVID-19 diagnosis. Quantitative analysis shows that the proposed model achieves higher accuracy than previously reported state-of-the-art methods. Our proposed model achieved an accuracy of 97.60% and a sensitivity of 92.90% on the validation set.
53 citations
TL;DR: A novel encoder-decoder architecture, called contextual ensemble network (CENet), for semantic segmentation, where the contextual cues are aggregated via densely upsampling the convolutional features of deep layers to the shallow deconvolutional layers.
Abstract: Recently, exploring features from different layers in fully convolutional networks (FCNs) has gained substantial attention for capturing context information in semantic segmentation. This paper presents a novel encoder-decoder architecture, called contextual ensemble network (CENet), for semantic segmentation, where the contextual cues are aggregated via densely upsampling the convolutional features of deep layers to the shallow deconvolutional layers. The proposed CENet is trained for end-to-end segmentation to match the resolution of the input image, and allows us to fully explore contextual features through an ensemble of dense deconvolutions. We evaluate CENet on two widely used semantic segmentation datasets: PASCAL VOC 2012 and Cityscapes. The experimental results demonstrate that CENet achieves superior performance with respect to recent state-of-the-art results. Furthermore, we also evaluate CENet on the MS COCO dataset and the ISBI 2012 dataset for instance segmentation and biological segmentation, respectively. The experimental results show that CENet obtains promising results on these two datasets.
TL;DR: Action Transformer (AcT) as discussed by the authors exploits 2D pose representations over small temporal windows, providing a low-latency solution for accurate and effective real-time performance, and consistently outperforms more elaborate networks that mix convolutional, recurrent and attentive layers.
Abstract: Deep neural networks based purely on attention have been successful across several domains, relying on minimal architectural priors from the designer. In Human Action Recognition (HAR), attention mechanisms have been primarily adopted on top of standard convolutional or recurrent layers, improving the overall generalization capability. In this work, we introduce the Action Transformer (AcT), a simple, fully self-attentional architecture that consistently outperforms more elaborate networks that mix convolutional, recurrent, and attentive layers. In order to limit computational and energy requirements, building on previous human action recognition research, the proposed approach exploits 2D pose representations over small temporal windows, providing a low-latency solution for accurate and effective real-time performance. Moreover, we open-source MPOSE2021, a new large-scale dataset, as an attempt to build a formal training and evaluation benchmark for real-time, short-time HAR. The proposed methodology was extensively tested on MPOSE2021 and compared to several state-of-the-art architectures, proving the effectiveness of the AcT model and laying the foundations for future work on HAR.
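The following condensed PyTorch sketch conveys the flavor of a fully self-attentional classifier over short 2D-pose windows: each frame's keypoints become one token, a learned [CLS] token and positional embedding are added, and a Transformer encoder produces the action logits. The layer sizes and these specific design choices are illustrative, not the released AcT model.

```python
import torch
import torch.nn as nn

class PoseTransformer(nn.Module):
    def __init__(self, n_joints=17, window=30, d_model=64, n_classes=20):
        super().__init__()
        self.proj = nn.Linear(2 * n_joints, d_model)                 # one token per frame
        self.cls = nn.Parameter(torch.zeros(1, 1, d_model))          # learned [CLS] token
        self.pos = nn.Parameter(torch.zeros(1, window + 1, d_model)) # learned positional embedding
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, dim_feedforward=128, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=4)
        self.head = nn.Linear(d_model, n_classes)

    def forward(self, poses):                                        # poses: (B, window, n_joints, 2)
        x = self.proj(poses.flatten(2))                              # (B, window, d_model)
        x = torch.cat([self.cls.expand(x.size(0), -1, -1), x], 1) + self.pos
        return self.head(self.encoder(x)[:, 0])                      # classify from the [CLS] token

logits = PoseTransformer()(torch.rand(4, 30, 17, 2))
print(logits.shape)   # torch.Size([4, 20])
```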
TL;DR: In this article , a multi-modality graph neural network (MAGNN) is proposed to learn from these multimodal inputs for financial time series prediction, which provides investors with a profitable as well as interpretable option and enables them to make informed investment decisions.
Abstract: Financial time series analysis plays a central role in hedging market risks and optimizing investment decisions. This is a challenging task as the problems are always accompanied by multi-modality streams and lead-lag effects. For example, the price movements of stock are reflections of complicated market states in different diffusion speeds, including historical price series, media news, associated events, etc. Furthermore, the financial industry requires forecasting models to be interpretable and compliant. Therefore, in this paper, we propose a multi-modality graph neural network (MAGNN) to learn from these multimodal inputs for financial time series prediction. The heterogeneous graph network is constructed by the sources as nodes and relations in our financial knowledge graph as edges. To ensure the model interpretability, we leverage a two-phase attention mechanism for joint optimization, allowing end-users to investigate the importance of inner-modality and inter-modality sources. Extensive experiments on real-world datasets demonstrate the superior performance of MAGNN in financial market prediction. Our method provides investors with a profitable as well as interpretable option and enables them to make informed investment decisions.
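To make the two-phase attention concrete, the toy PyTorch sketch below first attends over the sources within each modality and then over the resulting per-modality summaries, so both sets of weights can be inspected for interpretability; it is a schematic stand-in, not the MAGNN implementation, which operates on a heterogeneous financial knowledge graph.

```python
import torch
import torch.nn as nn

class TwoPhaseAttention(nn.Module):
    def __init__(self, d=32):
        super().__init__()
        self.inner = nn.Linear(d, 1)     # scores sources within one modality
        self.inter = nn.Linear(d, 1)     # scores the per-modality summaries

    def forward(self, modalities):
        # modalities: list of (n_sources_m, d) tensors, one per modality (price, news, events, ...)
        summaries, inner_w = [], []
        for feats in modalities:
            w = torch.softmax(self.inner(feats), dim=0)              # inner-modality weights
            summaries.append((w * feats).sum(0)); inner_w.append(w)
        S = torch.stack(summaries)                                   # (n_modalities, d)
        w2 = torch.softmax(self.inter(S), dim=0)                     # inter-modality weights
        return (w2 * S).sum(0), inner_w, w2                          # fused market representation

fused, inner_w, inter_w = TwoPhaseAttention()([torch.rand(5, 32), torch.rand(3, 32), torch.rand(7, 32)])
print(fused.shape, inter_w.squeeze(-1))
```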
TL;DR: In this paper, an uncertainty-aware mean teacher is incorporated into the scribble-based segmentation method, encouraging the segmentation predictions to be consistent under different perturbations for an input image.
Abstract: Segmentation of infections from CT scans is important for accurate diagnosis and follow-up in tackling COVID-19. Although convolutional neural networks have great potential to automate the segmentation task, most existing deep learning-based infection segmentation methods require fully annotated ground-truth labels for training, which is time-consuming and labor-intensive. This paper proposes a novel weakly supervised segmentation method for COVID-19 infections in CT slices that only requires scribble supervision and is enhanced with uncertainty-aware self-ensembling and transformation-consistent techniques. Specifically, to deal with the difficulty caused by the shortage of supervision, an uncertainty-aware mean teacher is incorporated into the scribble-based segmentation method, encouraging the segmentation predictions to be consistent under different perturbations of an input image. This mean teacher model can guide the student model to be trained using information in images without requiring manual annotations. On the other hand, considering that the output of the mean teacher contains both correct and unreliable predictions, treating every prediction of the teacher model equally may degrade the performance of the student network. To alleviate this problem, a pixel-level uncertainty measure on the predictions of the teacher model is calculated, and the student model is guided only by reliable predictions from the teacher model. To further regularize the network, a transformation-consistent strategy is also incorporated, which requires the prediction to follow the same transformation if a transform is performed on an input image of the network. The proposed method has been evaluated on two public datasets and one local dataset. The experimental results demonstrate that the proposed method is more effective than other weakly supervised methods and achieves performance similar to fully supervised ones.
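A simplified PyTorch sketch of the training signals described above follows: partial cross-entropy on the scribbled pixels, an exponential-moving-average mean teacher, and a consistency loss that only trusts teacher predictions with low uncertainty. The entropy-based uncertainty, noise model and threshold are illustrative stand-ins, and the transformation-consistency term is omitted.

```python
import copy
import torch
import torch.nn.functional as F

def ema_update(teacher, student, alpha=0.99):
    # exponential moving average of student weights -> teacher weights
    for tp, sp in zip(teacher.parameters(), student.parameters()):
        tp.data.mul_(alpha).add_(sp.data, alpha=1 - alpha)

def weak_losses(student, teacher, image, scribble, ignore_index=255, tau=0.5):
    s_logits = student(image)
    with torch.no_grad():
        t_prob = torch.softmax(teacher(image + 0.05 * torch.randn_like(image)), dim=1)
        # per-pixel uncertainty as the entropy of the teacher's prediction
        uncertainty = -(t_prob * torch.log(t_prob + 1e-8)).sum(dim=1)
        reliable = (uncertainty < tau).float()                       # keep only confident pixels
    sup = F.cross_entropy(s_logits, scribble, ignore_index=ignore_index)   # scribbled pixels only
    cons = (reliable * ((torch.softmax(s_logits, 1) - t_prob) ** 2).mean(1)).mean()
    return sup + cons

# toy usage with a 1x1-conv "network" standing in for the segmentation model
net = torch.nn.Conv2d(3, 2, 1)
teacher = copy.deepcopy(net)
img = torch.rand(2, 3, 64, 64)
scrib = torch.full((2, 64, 64), 255, dtype=torch.long); scrib[:, 30:34, 30:34] = 1
loss = weak_losses(net, teacher, img, scrib)
loss.backward(); ema_update(teacher, net)
print(float(loss))
```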
TL;DR: In this article, a multi-modality graph neural network (MAGNN) is proposed to learn from these multimodal inputs for financial time series prediction, which provides investors with a profitable as well as interpretable option and enables them to make informed investment decisions.
Abstract: Financial time series analysis plays a central role in hedging market risks and optimizing investment decisions. This is a challenging task as the problems are always accompanied by multi-modality streams and lead-lag effects. For example, the price movements of stock are reflections of complicated market states in different diffusion speeds, including historical price series, media news, associated events, etc. Furthermore, the financial industry requires forecasting models to be interpretable and compliant. Therefore, in this paper, we propose a multi-modality graph neural network (MAGNN) to learn from these multimodal inputs for financial time series prediction. The heterogeneous graph network is constructed by the sources as nodes and relations in our financial knowledge graph as edges. To ensure the model interpretability, we leverage a two-phase attention mechanism for joint optimization, allowing end-users to investigate the importance of inner-modality and inter-modality sources. Extensive experiments on real-world datasets demonstrate the superior performance of MAGNN in financial market prediction. Our method provides investors with a profitable as well as interpretable option and enables them to make informed investment decisions.
TL;DR: In this paper , an uncertainty-aware mean teacher is incorporated into the scribble-based segmentation method, encouraging the segmentation predictions to be consistent under different perturbations for an input image.
Abstract: Segmentation of infections from CT scans is important for accurate diagnosis and follow-up in tackling the COVID-19. Although the convolutional neural network has great potential to automate the segmentation task, most existing deep learning-based infection segmentation methods require fully annotated ground-truth labels for training, which is time-consuming and labor-intensive. This paper proposed a novel weakly supervised segmentation method for COVID-19 infections in CT slices, which only requires scribble supervision and is enhanced with the uncertainty-aware self-ensembling and transformation-consistent techniques. Specifically, to deal with the difficulty caused by the shortage of supervision, an uncertainty-aware mean teacher is incorporated into the scribble-based segmentation method, encouraging the segmentation predictions to be consistent under different perturbations for an input image. This mean teacher model can guide the student model to be trained using information in images without requiring manual annotations. On the other hand, considering the output of the mean teacher contains both correct and unreliable predictions, equally treating each prediction in the teacher model may degrade the performance of the student network. To alleviate this problem, the pixel level uncertainty measure on the predictions of the teacher model is calculated, and then the student model is only guided by reliable predictions from the teacher model. To further regularize the network, a transformation-consistent strategy is also incorporated, which requires the prediction to follow the same transformation if a transform is performed on an input image of the network. The proposed method has been evaluated on two public datasets and one local dataset. The experimental results demonstrate that the proposed method is more effective than other weakly supervised methods and achieves similar performance as those fully supervised.
TL;DR: In this article, a polarization fusion network with geometric feature embedding (PFGFE-Net) was proposed to solve the two defects of traditional feature abandonment and insufficient utilization of traditional features.
Abstract: Current synthetic aperture radar (SAR) ship classifiers using convolutional neural networks (CNNs) offer state-of-the-art performance. Yet, they still have two defects potentially hindering accuracy progress – polarization insufficient utilization and traditional feature abandonment. Therefore, we propose a polarization fusion network with geometric feature embedding (PFGFE-Net) to solve them. PFGFE-Net achieves the polarization fusion (PF) from the input data, feature-level, and decision-level. Moreover, the geometric feature embedding (GFE) enriches expert experience. Results on OpenSARShip reveal PFGFE-Net's excellent performance.
TL;DR: Zhang et al. as discussed by the authors proposed an effective bidirectional pyramid architecture to enhance internal representations of features to cater to fine-grained image recognition task in the few-shot learning scenario.
Abstract: Few-shot fine-grained recognition (FS-FGR) aims to distinguish highly similar objects from different sub-categories with limited supervision. However, traditional few-shot learning solutions typically exploit image-level features and focus on capturing global silhouettes while overlooking local details, resulting in an inevitable loss of inconspicuous but distinguishable information. Thus, how to effectively address fine-grained recognition given limited samples remains a major challenge. In this article, we propose an effective bidirectional pyramid architecture that enhances internal feature representations to suit the fine-grained image recognition task in the few-shot learning scenario. Specifically, we deploy a multi-scale feature pyramid and a multi-level attention pyramid on the backbone network, and progressively aggregate features from different granular spaces through both of them. We further present an attention-guided refinement strategy, in collaboration with the multi-level attention pyramid, to reduce the uncertainty brought by backgrounds under the condition of limited samples. In addition, the proposed method is trained with the meta-learning framework in an end-to-end fashion without any extra supervision. Extensive experimental results on four challenging and widely used fine-grained benchmarks show that the proposed method performs favorably against the state of the art, especially in the one-shot scenario.
TL;DR: An underwater image enhancement method that does not require training on synthetic underwater images and eliminates the dependence on underwater ground-truth images is presented and the experimental results indicate that the proposed method produces visually satisfactory results.
Abstract: In recent years, underwater image enhancement methods based on deep learning have achieved remarkable results. Since the images obtained in complex underwater scenarios lack a ground truth, these algorithms mainly train models on underwater images synthesized from in-air images. Synthesized underwater images are different from real-world underwater images; this difference leads to the limited generalizability of the training model when enhancing real-world underwater images. In this work, we present an underwater image enhancement method that does not require training on synthetic underwater images and eliminates the dependence on underwater ground-truth images. Specifically, a novel domain adaptation framework for real-world underwater image enhancement inspired by transfer learning is presented; it transfers in-air image dehazing to real-world underwater image enhancement. The experimental results on different real-world underwater scenes indicate that the proposed method produces visually satisfactory results.
TL;DR: In this paper , a high-order coverage function (HCF) neuron model was proposed to replace the fully-connected (FC) layers, which utilizes weight coefficients and hyper-parameters to mine underlying geometry with arbitrary shapes in an n-dimensional space.
Abstract: Recent advances in deep neural networks (DNNs) have mainly focused on innovations in network architecture and loss function. In this paper, we introduce a flexible high-order coverage function (HCF) neuron model to replace the fully-connected (FC) layers. The approximation theorem and proof for the HCF are also presented to demonstrate its fitting ability. Unlike the FC layers, which cannot handle high-dimensional data well, the HCF utilizes weight coefficients and hyper-parameters to mine underlying geometries with arbitrary shapes in an n-dimensional space. To explore the power and potential of our HCF neuron model, a high-order coverage function neural network (HCFNN) is proposed, which incorporates the HCF neuron as the building block. Moreover, a novel adaptive optimization method for weights and hyper-parameters is designed to achieve effective network learning. Comprehensive experiments on nine datasets in several domains validate the effectiveness and generalizability of the HCF and HCFNN. The proposed method provides a new perspective for further developments in DNNs and ensures wide application in the field of image classification. The source code is available at https://github.com/Tough2011/HCFNet.git
TL;DR: In this paper , a comprehensive survey of 3D object detection for autonomous driving is presented, encompassing all the main concerns including sensors, datasets, performance metrics and the recent state-of-the-art detection methods, together with their pros and cons.
Abstract: Autonomous driving is regarded as one of the most promising remedies to shield human beings from severe crashes. To this end, 3D object detection serves as the core basis of the perception stack, especially for path planning, motion prediction, and collision avoidance. Taking a quick glance at the progress that has been made, we attribute the challenges to visual appearance recovery in the absence of depth information from images, representation learning from partially occluded unstructured point clouds, and semantic alignment over heterogeneous features from cross modalities. Despite existing efforts, 3D object detection for autonomous driving is still in its infancy. Recently, a large body of literature has been devoted to this 3D vision task. Nevertheless, few investigations have looked into collecting and structuring this growing knowledge. We therefore aim to fill this gap with a comprehensive survey, encompassing all the main concerns including sensors, datasets, performance metrics and the recent state-of-the-art detection methods, together with their pros and cons. Furthermore, we provide quantitative comparisons with the state of the art. A case study on fifteen selected representative methods is presented, covering runtime analysis, error analysis, and robustness analysis. Finally, we provide concluding remarks after an in-depth analysis of the surveyed works and identify promising directions for future work.
TL;DR: A comprehensive survey of image augmentation for deep learning using a novel informative taxonomy is presented in this article , where the algorithms are classified into three categories: model-free, model-based, and optimizing policy-based.
Abstract: Although deep learning has achieved satisfactory performance in computer vision, a large volume of images is required. However, collecting images is often expensive and challenging. Many image augmentation algorithms have been proposed to alleviate this issue. Understanding existing algorithms is therefore essential for selecting suitable methods and developing novel ones for a given task. In this study, we perform a comprehensive survey of image augmentation for deep learning using a novel informative taxonomy. To examine the basic objective of image augmentation, we introduce challenges in computer vision tasks and vicinity distribution. The algorithms are then classified into three categories: model-free, model-based, and optimizing policy-based. The model-free category employs methods from image processing, whereas the model-based approach leverages image generation models to synthesize images. In contrast, the optimizing policy-based approach aims to find an optimal combination of operations. Based on this analysis, we believe that our survey enhances the understanding necessary for choosing suitable methods and designing novel algorithms.
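As a small example of the model-free category, the snippet below composes classical image-processing operations into an augmentation pipeline with torchvision; the specific operations and magnitudes are arbitrary illustrative choices.

```python
import torch
from torchvision import transforms

# a model-free augmentation pipeline built from classical image operations
model_free_aug = transforms.Compose([
    transforms.RandomHorizontalFlip(p=0.5),
    transforms.RandomRotation(degrees=15),
    transforms.ColorJitter(brightness=0.2, contrast=0.2),
    transforms.RandomResizedCrop(size=224, scale=(0.8, 1.0)),
])

img = transforms.ToPILImage()(torch.rand(3, 256, 256))   # stand-in image
print(model_free_aug(img).size)                           # (224, 224)
```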
TL;DR: Wang et al. as discussed by the authors proposed the Embedding Unmasking Model (EUM), operated on top of existing face recognition models, which enables the EUM to produce embeddings similar to those of unmasked faces of the same identities.
Abstract: Using the face as a biometric identity trait is motivated by the contactless nature of the capture process and the high accuracy of the recognition algorithms. With the current COVID-19 pandemic, wearing a face mask has been imposed in public places to keep the pandemic under control. However, face occlusion due to wearing a mask presents an emerging challenge for face recognition systems. In this paper, we present a solution to improve masked face recognition performance. Specifically, we propose the Embedding Unmasking Model (EUM), operated on top of existing face recognition models. We also propose a novel loss function, the Self-restrained Triplet (SRT), which enables the EUM to produce embeddings similar to those of unmasked faces of the same identities. The evaluation results achieved on three face recognition models, two real masked datasets, and two synthetically generated masked face datasets prove that our proposed approach significantly improves performance in most experimental settings.
TL;DR: In this article , a taxonomy of X-ray security imaging algorithms is presented, with a particular focus on object classification, detection, segmentation and anomaly detection tasks, and a performance benchmark is provided based on the current and future trends in deep learning.
Abstract: X-ray security screening is widely used to maintain aviation/transport security, and its significance poses a particular interest in automated screening systems. This paper aims to review computerised X-ray security imaging algorithms by taxonomising the field into conventional machine learning and contemporary deep learning applications. The first part briefly discusses the classical machine learning approaches utilised within X-ray security imaging, while the latter part thoroughly investigates the use of modern deep learning algorithms. The proposed taxonomy sub-categorises the use of deep learning approaches into supervised and unsupervised learning, with a particular focus on object classification, detection, segmentation and anomaly detection tasks. The paper further explores well-established X-ray datasets and provides a performance benchmark. Based on the current and future trends in deep learning, the paper finally presents a discussion and future directions for X-ray security imagery.
TL;DR: Wang et al. as mentioned in this paper proposed a high frequency attention-guided Siamese network (HFA-Net), whose built-in high frequency attention block is composed of two main stages, spatial-wise attention (SA) and high frequency enhancement (HF).
Abstract: • We propose a new convolutional neural network-based Siamese architecture, the high frequency attention Siamese network, for finer recognition of changed building objects in very-high-resolution remotely sensed images. • With the supplementary high frequency information provided by the high frequency enhancement block, the proposed method acquires a better feature representation ability, bringing the expected performance improvement. • The enhancement of global high frequency information in deep neural networks is preliminarily confirmed to be beneficial in building change detection. • Comprehensive comparisons between recent change detection methods and the proposed method are given, indicating that our method achieves state-of-the-art performance in building change detection. Building change detection (BCD) can now be handled well thanks to the boom in deep-learning based computer vision techniques. However, segmentation and recognition of objects with sharp boundaries still suffer from poorly captured high frequency information, which can result in deteriorated annotation of building boundaries in BCD. To better capture the high frequency pattern within the deep learning pipeline, we propose a high frequency attention-guided Siamese network (HFA-Net) in which a novel built-in high frequency attention block (HFAB) is applied. HFA-Net is designed to enhance the high frequency information of buildings via the HFAB, which is composed of two main stages, i.e., spatial-wise attention (SA) and high frequency enhancement (HF). The SA first guides the model to search for and focus on buildings, and HF is then employed to highlight the high frequency information of the input feature maps. With the high frequency information of buildings enhanced by the HFAB, HFA-Net is able to better detect the edges of changed buildings, so as to improve the performance of BCD. Our proposed method is evaluated on three widely used public datasets, i.e., WHU-CD, LEVIR-CD, and the Google dataset. Remarkable experimental results on these datasets indicate that our proposed method better detects the edges of changed buildings and shows better performance. The source code will be released at: https://github.com/HaiXing-1998/HFANet.
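An illustrative PyTorch sketch of such a two-stage block is shown below: a spatial attention map re-weights the feature map, and a crude high-pass step (the feature minus its blurred version) then emphasizes high frequency content. The concrete operators are assumptions for illustration, not the released HFAB.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HFAttentionBlock(nn.Module):
    def __init__(self):
        super().__init__()
        self.sa = nn.Conv2d(2, 1, kernel_size=7, padding=3)   # spatial attention from avg/max maps

    def forward(self, x):
        # stage 1: spatial-wise attention (SA)
        pooled = torch.cat([x.mean(1, keepdim=True), x.amax(1, keepdim=True)], dim=1)
        x = x * torch.sigmoid(self.sa(pooled))
        # stage 2: high frequency enhancement (HF) via a crude high-pass filter
        low = F.avg_pool2d(x, kernel_size=3, stride=1, padding=1)
        return x + (x - low)                                   # boost edges / fine structure

feat = torch.rand(2, 64, 32, 32)
print(HFAttentionBlock()(feat).shape)   # torch.Size([2, 64, 32, 32])
```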
TL;DR: In this paper , a 2D convolutional model was proposed to predict the future trajectories of pedestrians, which achieved state-of-the-art results on the ETH and TrajNet datasets.
Abstract: Predicting the future trajectories of pedestrians is a challenging problem with a range of applications, from crowd surveillance to autonomous driving. In the literature, methods for pedestrian trajectory prediction have evolved, transitioning from physics-based models to data-driven models based on recurrent neural networks. In this work, we propose a new approach to pedestrian trajectory prediction with the introduction of a novel 2D convolutional model. This new model outperforms recurrent models and achieves state-of-the-art results on the ETH and TrajNet datasets. We also present an effective system to represent pedestrian positions and powerful data augmentation techniques, such as the addition of Gaussian noise and the use of random rotations, which can be applied to any model. As an additional exploratory analysis, we present experimental results on the inclusion of occupancy methods to model social information, which empirically show that these methods are ineffective in capturing social interaction.
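For reference, the short numpy sketch below applies the two augmentations mentioned in the abstract, random rotation and additive Gaussian noise, to a trajectory given as (T, 2) xy-coordinates; the noise scale and rotation range are illustrative values, not the paper's settings.

```python
import numpy as np

def augment_trajectory(traj, noise_std=0.05, rng=np.random.default_rng()):
    theta = rng.uniform(0, 2 * np.pi)                        # random rotation about the origin
    R = np.array([[np.cos(theta), -np.sin(theta)],
                  [np.sin(theta),  np.cos(theta)]])
    rotated = traj @ R.T
    return rotated + rng.normal(0.0, noise_std, traj.shape)  # additive Gaussian noise

traj = np.cumsum(np.random.randn(12, 2) * 0.1, axis=0)       # toy pedestrian track
print(augment_trajectory(traj).shape)                        # (12, 2)
```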