scispace - formally typeset
Topic

Upsampling

About: Upsampling is the process of increasing the sampling rate or spatial resolution of a signal or image, typically by interpolation or by learned operators. Over the lifetime of this topic, 2426 publications have been published, receiving 57613 citations.
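The two parameter-free upsampling schemes that recur throughout the papers below are nearest-neighbor repetition and bilinear interpolation. A minimal NumPy sketch of both for single-channel images (align-corners sampling; the function names are illustrative, not from any paper):

```python
import numpy as np

def upsample_nearest(x: np.ndarray, factor: int) -> np.ndarray:
    """Nearest-neighbor upsampling: repeat each pixel `factor` times per axis."""
    return x.repeat(factor, axis=0).repeat(factor, axis=1)

def upsample_bilinear(x: np.ndarray, factor: int) -> np.ndarray:
    """Bilinear upsampling: linear interpolation along each axis in turn."""
    h, w = x.shape
    # Sampling positions in the input grid for each output pixel (align_corners=True).
    rows = np.linspace(0, h - 1, h * factor)
    cols = np.linspace(0, w - 1, w * factor)
    # Interpolate down the columns first, then across the rows.
    tmp = np.stack([np.interp(rows, np.arange(h), x[:, j]) for j in range(w)], axis=1)
    return np.stack([np.interp(cols, np.arange(w), tmp[i]) for i in range(tmp.shape[0])], axis=0)

img = np.array([[0.0, 1.0],
                [2.0, 3.0]])
print(upsample_nearest(img, 2).shape)  # (4, 4)
```

Nearest-neighbor preserves sharp edges but produces blocky results; bilinear is smooth but blurs boundaries, which is the problem several of the papers below address with learned upsampling.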


Papers
Posted Content
TL;DR: The problem of segmenting an object given a natural language expression that describes it is addressed. A novel method is proposed that integrates linguistic and visual information in the channel dimension and exploits the intermediate information generated when downsampling the image, so that detailed segmentations can be obtained.
Abstract: We address the problem of segmenting an object given a natural language expression that describes it. Current techniques tackle this task either by (i) directly or recursively merging linguistic and visual information in the channel dimension and then performing convolutions; or by (ii) mapping the expression to a space in which it can be thought of as a filter, whose response is directly related to the presence of the object at a given spatial coordinate in the image, so that a convolution can be applied to look for the object. We propose a novel method that integrates these two insights in order to fully exploit the recursive nature of language. Additionally, during the upsampling process, we take advantage of the intermediate information generated when downsampling the image, so that detailed segmentations can be obtained. We compare our method against state-of-the-art approaches on four standard datasets, where it surpasses all previous methods on six of the eight splits for this task.

34 citations
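Reusing intermediate downsampling information during upsampling is essentially a skip connection, as in U-Net-style decoders. A toy NumPy sketch of the idea (not the authors' implementation; the fusion here is a plain average, whereas a real model learns the combination):

```python
import numpy as np

def downsample(x: np.ndarray) -> np.ndarray:
    """2x average-pool downsampling; the input is kept as a skip feature."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def upsample(x: np.ndarray) -> np.ndarray:
    """2x nearest-neighbor upsampling."""
    return x.repeat(2, axis=0).repeat(2, axis=1)

def decode_with_skip(coarse: np.ndarray, skip: np.ndarray) -> np.ndarray:
    """Fuse the upsampled coarse map with the feature saved while downsampling.
    The skip feature restores the high-frequency detail the coarse map lost."""
    return 0.5 * (upsample(coarse) + skip)

x = np.arange(16, dtype=float).reshape(4, 4)
skip = x                    # intermediate feature saved on the way down
coarse = downsample(x)      # low-resolution representation
out = decode_with_skip(coarse, skip)
```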

Posted Content
TL;DR: This work proposes a principled convolutional neural pyramid (CNP) framework for general low-level vision and image processing tasks based on the essential finding that many applications require large receptive fields for structure understanding.
Abstract: We propose a principled convolutional neural pyramid (CNP) framework for general low-level vision and image processing tasks. It is based on the essential finding that many applications require large receptive fields for structure understanding. But corresponding neural networks for regression either stack many layers or apply large kernels to achieve this, which is computationally very costly. Our pyramid structure can greatly enlarge the receptive field without sacrificing computational efficiency. Extra benefits include adaptive network depth and progressive upsampling for quasi-realtime testing on VGA-size input. Our method benefits a broad set of applications, such as depth/RGB image restoration, completion, noise/artifact removal, edge refinement, image filtering, image enhancement, and colorization.

34 citations
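The pyramid trick can be illustrated with a toy: filtering at a coarse level and progressively upsampling back makes a small kernel cover an exponentially larger patch of the original image. A NumPy sketch under simplifying assumptions (a 3x3 box filter stands in for the learned conv blocks, and merging is a plain average; none of this is the paper's actual architecture):

```python
import numpy as np

def avg_pool2(x: np.ndarray) -> np.ndarray:
    """2x average pooling."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def up2(x: np.ndarray) -> np.ndarray:
    """2x nearest-neighbor upsampling."""
    return x.repeat(2, axis=0).repeat(2, axis=1)

def smooth3(x: np.ndarray) -> np.ndarray:
    """3x3 box filter (edge-padded), standing in for a small conv block."""
    p = np.pad(x, 1, mode="edge")
    h, w = x.shape
    return sum(p[i:i + h, j:j + w] for i in range(3) for j in range(3)) / 9.0

def pyramid_process(x: np.ndarray, levels: int = 3) -> np.ndarray:
    """Build a pyramid, filter each level, then progressively upsample and merge.
    A 3x3 filter applied at level k covers roughly a (3 * 2**k)-pixel patch of the
    original image, so the effective receptive field grows exponentially with
    depth at little extra cost."""
    pyr = [x]
    for _ in range(levels - 1):
        pyr.append(avg_pool2(pyr[-1]))
    out = smooth3(pyr[-1])                    # coarsest level first
    for lvl in reversed(pyr[:-1]):            # progressive upsampling
        out = smooth3(up2(out) + lvl) * 0.5
    return out

out = pyramid_process(np.arange(64.0).reshape(8, 8))
```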

Journal ArticleDOI
TL;DR: Wang et al. proposed a multichannel feature fusion lozenge network (MLNet), a three-sided network composed of three branches: one branch uses different levels of feature indexes to sample and maintain the integrity of high-frequency information; one branch focuses on contextual information and strengthens the compatibility of information within and between classes; and the last branch uses feature integration to filter redundant information based on multiresolution segmentation to extract key features.
Abstract: The use of remote sensing images for land cover analysis has broad prospects. At present, the resolution of aerial remote sensing images is getting higher and higher, and the span of time and space is getting larger and larger, so segmenting target objects encounters great difficulties. Convolutional neural networks are widely used in many image semantic segmentation tasks, but existing models often use a simple accumulation of various convolutional layers or the direct stacking of interfeature reuse of up- and downsampling, which makes the network very heavy. To improve the accuracy of land cover segmentation, we propose a multichannel feature fusion lozenge network (MLNet), a three-sided network composed of three branches: one branch uses different levels of feature indexes to sample and maintain the integrity of high-frequency information; one branch focuses on contextual information and strengthens the compatibility of information within and between classes; and the last branch uses feature integration to filter redundant information based on multiresolution segmentation to extract key features. Compared with FCN, UNet, PSP, and other serial single-path computing models, the MLNet, which performs feature fusion after a three-way parallel structure, can significantly improve accuracy with only a small increase in complexity. Experimental results show that an average accuracy of 85.30% is obtained on the land cover data set, much higher than the 82.98% of FCN, 81.87% of UNet, 77.52% of SegNet, and 83.09% of EspNet, which proves the effectiveness of the model.

33 citations
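The three-branch-then-fuse layout can be caricatured in a few lines: three parallel transforms of the same input (detail-preserving, context-aggregating, key-feature-selecting) are computed independently and fused afterwards, rather than stacked serially. Every function below is an illustrative stand-in, not MLNet's actual blocks:

```python
import numpy as np

def branch_detail(x: np.ndarray) -> np.ndarray:
    """Stand-in for the high-frequency branch (identity: keep detail intact)."""
    return x

def branch_context(x: np.ndarray) -> np.ndarray:
    """Stand-in for the context branch: aggressive smoothing via a global mean."""
    return np.full_like(x, x.mean())

def branch_select(x: np.ndarray) -> np.ndarray:
    """Stand-in for the key-feature branch: keep only strong responses."""
    return np.where(np.abs(x) > np.abs(x).mean(), x, 0.0)

def fuse(x: np.ndarray) -> np.ndarray:
    """Run the three branches in parallel and fuse afterwards (a plain mean
    here; MLNet learns the fusion). Contrast with a serial single-path stack,
    where each stage can only see the previous stage's output."""
    return (branch_detail(x) + branch_context(x) + branch_select(x)) / 3.0

y = fuse(np.arange(9.0).reshape(3, 3))
```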

Posted Content
TL;DR: This work designs a self-guided upsample module to tackle the interpolation blur problem caused by bilinear upsampling between pyramid levels, and proposes a pyramid distillation loss to add supervision for intermediate levels via distilling the finest flow as pseudo labels.
Abstract: We present an unsupervised learning approach for optical flow estimation by improving the upsampling and learning of pyramid networks. We design a self-guided upsample module to tackle the interpolation blur problem caused by bilinear upsampling between pyramid levels. Moreover, we propose a pyramid distillation loss to add supervision for intermediate levels by distilling the finest flow as pseudo labels. By integrating these two components, our method achieves the best performance for unsupervised optical flow learning on multiple leading benchmarks, including MPI-Sintel, KITTI 2012, and KITTI 2015. In particular, we achieve EPE=1.4 on KITTI 2012 and F1=9.38% on KITTI 2015, which outperform the previous state-of-the-art methods by 22.2% and 15.7%, respectively.

33 citations
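The pyramid distillation loss is simple to sketch: downsample the finest predicted flow to each intermediate resolution (halving the displacement magnitudes along with the spatial size) and penalize each intermediate prediction's deviation from that pseudo label. A NumPy sketch under those assumptions (function names are illustrative; in practice no gradient flows into the pseudo label):

```python
import numpy as np

def down2(flow: np.ndarray) -> np.ndarray:
    """Average-pool a flow field of shape (H, W, 2) by 2 and halve its values,
    since pixel displacements shrink with spatial resolution."""
    h, w, c = flow.shape
    return flow.reshape(h // 2, 2, w // 2, 2, c).mean(axis=(1, 3)) / 2.0

def pyramid_distillation_loss(finest_flow, intermediate_flows):
    """L1 loss between each intermediate-level flow and the finest flow
    downsampled to that level, used as a pseudo label."""
    loss = 0.0
    label = finest_flow
    for flow in intermediate_flows:      # ordered from fine to coarse
        label = down2(label)
        loss += np.abs(flow - label).mean()
    return loss

finest = np.ones((8, 8, 2))
mids = [np.full((4, 4, 2), 0.5), np.full((2, 2, 2), 0.25)]
loss = pyramid_distillation_loss(finest, mids)
```

When the intermediate flows agree with the scaled-down finest flow, as in the toy inputs above, the loss is zero.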

Journal ArticleDOI
TL;DR: The recognition results and the comparison with the other target detectors demonstrate the effectiveness of the proposed YOLOv4 structure and the method of data preprocessing.
Abstract: The YOLOv4 neural network is employed for underwater target recognition. To improve the accuracy and speed of recognition, the structure of YOLOv4 is modified by replacing the upsampling module with a deconvolution module and by incorporating depthwise separable convolution into the network. Moreover, the training set used in the YOLO network is preprocessed with a modified mosaic augmentation, in which the gray world algorithm is used to derive two images when performing mosaic augmentation. The recognition results and the comparison with other target detectors demonstrate the effectiveness of the proposed YOLOv4 structure and the data preprocessing method. According to both subjective and objective evaluation, the proposed target recognition strategy can effectively improve the accuracy and speed of underwater target recognition while also reducing hardware performance requirements.

33 citations
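A deconvolution (transposed convolution) replaces fixed interpolation with a learned upsampling kernel. A 1-D NumPy sketch shows the mechanics; note that with a fixed triangular kernel it reproduces linear interpolation, exactly the special case a learned kernel generalizes:

```python
import numpy as np

def transposed_conv1d(x: np.ndarray, kernel: np.ndarray, stride: int = 2) -> np.ndarray:
    """1-D transposed convolution: each input value stamps a scaled copy of the
    kernel into the output, spaced `stride` apart, and overlaps are summed.
    With learned kernels this is the 'deconvolution module' that can replace
    fixed bilinear/nearest upsampling."""
    k = len(kernel)
    out = np.zeros(stride * (len(x) - 1) + k)
    for i, v in enumerate(x):
        out[i * stride : i * stride + k] += v * kernel
    return out

x = np.array([1.0, 2.0, 3.0])
kernel = np.array([0.5, 1.0, 0.5])   # triangular kernel -> linear interpolation
y = transposed_conv1d(x, kernel)
# y = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 1.5]: the interior is exactly the
# linearly interpolated signal; a trained kernel need not be triangular.
```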


Network Information
Related Topics (5)
Convolutional neural network: 74.7K papers, 2M citations (90% related)
Image segmentation: 79.6K papers, 1.8M citations (90% related)
Feature extraction: 111.8K papers, 2.1M citations (89% related)
Deep learning: 79.8K papers, 2.1M citations (88% related)
Feature (computer vision): 128.2K papers, 1.7M citations (87% related)
Performance
Metrics
No. of papers in the topic in previous years
Year  Papers
2023  469
2022  859
2021  330
2020  322
2019  298
2018  236