Author

York Kitajima

Bio: York Kitajima is an academic researcher from Middlebury College. The author has contributed to research in topics: Ground truth & Pixel. The author has an hindex of 1, co-authored 1 publications receiving 802 citations.

Topics: Ground truth, Pixel, Bundle adjustment, Projector, Subpixel rendering ...read more

Papers

PDF

Open Access

More filters

Book Chapter•DOI•

High-Resolution Stereo Datasets with Subpixel-Accurate Ground Truth

[...]

Daniel Scharstein¹, Heiko Hirschmüller², York Kitajima¹, Greg Krathwohl¹, Nera Nešić³, Xi Wang¹, Porter Westling - Show less +3 more•Institutions (3)

Middlebury College¹, German Aerospace Center², Reykjavík University³

02 Sep 2014

TL;DR: A structured lighting system for creating high-resolution stereo datasets of static indoor scenes with highly accurate ground-truth disparities using novel techniques for efficient 2D subpixel correspondence search and self-calibration of cameras and projectors with modeling of lens distortion is presented.

...read moreread less

Abstract: We present a structured lighting system for creating high-resolution stereo datasets of static indoor scenes with highly accurate ground-truth disparities. The system includes novel techniques for efficient 2D subpixel correspondence search and self-calibration of cameras and projectors with modeling of lens distortion. Combining disparity estimates from multiple projector positions we are able to achieve a disparity accuracy of 0.2 pixels on most observed surfaces, including in half-occluded regions. We contribute 33 new 6-megapixel datasets obtained with our system and demonstrate that they present new challenges for the next generation of stereo algorithms.

...read moreread less

1,071 citations

Cited by

PDF

Open Access

More filters

Proceedings Article•DOI•

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

[...]

Nikolaus Mayer¹, Eddy Ilg¹, Philip Häusser², Philipp Fischer¹, Daniel Cremers², Alexey Dosovitskiy¹, Thomas Brox¹ - Show less +3 more•Institutions (2)

University of Freiburg¹, Technische Universität München²

07 Dec 2015-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, a large-scale synthetic stereo video dataset is proposed to enable training and evaluation of optical flow estimation with a convolutional network and disparity estimation with CNNs.

...read moreread less

Abstract: Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we propose three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large networks. Our datasets are the first large-scale datasets to enable training and evaluating scene flow methods. Besides the datasets, we present a convolutional network for real-time disparity estimation that provides state-of-the-art results. By combining a flow and disparity estimation network and training it jointly, we demonstrate the first scene flow estimation with a convolutional network.

...read moreread less

1,759 citations

Journal Article•DOI•

A Fast Single Image Haze Removal Algorithm Using Color Attenuation Prior

[...]

Qingsong Zhu¹, Jiaming Mai¹, Ling Shao²•Institutions (2)

Chinese Academy of Sciences¹, Northumbria University²

18 Jun 2015-IEEE Transactions on Image Processing

TL;DR: A simple but powerful color attenuation prior for haze removal from a single input hazy image is proposed and outperforms state-of-the-art haze removal algorithms in terms of both efficiency and the dehazing effect.

...read moreread less

Abstract: Single image haze removal has been a challenging problem due to its ill-posed nature. In this paper, we propose a simple but powerful color attenuation prior for haze removal from a single input hazy image. By creating a linear model for modeling the scene depth of the hazy image under this novel prior and learning the parameters of the model with a supervised learning method, the depth information can be well recovered. With the depth map of the hazy image, we can easily estimate the transmission and restore the scene radiance via the atmospheric scattering model, and thus effectively remove the haze from a single image. Experimental results show that the proposed approach outperforms state-of-the-art haze removal algorithms in terms of both efficiency and the dehazing effect.

...read moreread less

1,495 citations

Proceedings Article•DOI•

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

[...]

Nikolaus Mayer¹, Eddy Ilg¹, Philip Häusser², Philipp Fischer¹, Daniel Cremers², Alexey Dosovitskiy¹, Thomas Brox¹ - Show less +3 more•Institutions (2)

University of Freiburg¹, Technische Universität München²

27 Jun 2016

TL;DR: This paper proposes three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large networks and presents a convolutional network for real-time disparity estimation that provides state-of-the-art results.

...read moreread less

Abstract: Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we propose three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large networks. Our datasets are the first large-scale datasets to enable training and evaluation of scene flow methods. Besides the datasets, we present a convolutional network for real-time disparity estimation that provides state-of-the-art results. By combining a flow and disparity estimation network and training it jointly, we demonstrate the first scene flow estimation with a convolutional network.

...read moreread less

1,184 citations

Journal Article•

Stereo matching by training a convolutional neural network to compare image patches

[...]

Jure Žbontar¹, Yann LeCun²•Institutions (2)

University of Ljubljana¹, Courant Institute of Mathematical Sciences²

01 Jan 2016-Journal of Machine Learning Research

TL;DR: In this paper, the first stage of many stereo algorithms, matching cost computation, is addressed by learning a similarity measure on small image patches using a convolutional neural network, and then a series of post-processing steps follow: cross-based cost aggregation, semiglobal matching, left-right consistency check, subpixel enhancement, a median filter, and a bilateral filter.

...read moreread less

Abstract: We present a method for extracting depth information from a rectified image pair. Our approach focuses on the first stage of many stereo algorithms: the matching cost computation. We approach the problem by learning a similarity measure on small image patches using a convolutional neural network. Training is carried out in a supervised manner by constructing a binary classification data set with examples of similar and dissimilar pairs of patches. We examine two network architectures for this task: one tuned for speed, the other for accuracy. The output of the convolutional neural network is used to initialize the stereo matching cost. A series of post-processing steps follow: cross-based cost aggregation, semiglobal matching, a left-right consistency check, subpixel enhancement, a median filter, and a bilateral filter. We evaluate our method on the KITTI 2012, KITTI 2015, and Middlebury stereo data sets and show that it outperforms other approaches on all three data sets.

...read moreread less

860 citations

Proceedings Article•DOI•

Densely Connected Pyramid Dehazing Network

[...]

He Zhang¹, Vishal M. Patel¹•Institutions (1)

Rutgers University¹

18 Jun 2018

TL;DR: Zhang et al. as discussed by the authors proposed a Densely Connected Pyramid Dehazing Network (DCPDN), which can jointly learn the transmission map, atmospheric light and dehazing all together.

...read moreread less

Abstract: We propose a new end-to-end single image dehazing method, called Densely Connected Pyramid Dehazing Network (DCPDN), which can jointly learn the transmission map, atmospheric light and dehazing all together. The end-to-end learning is achieved by directly embedding the atmospheric scattering model into the network, thereby ensuring that the proposed method strictly follows the physics-driven scattering model for dehazing. Inspired by the dense network that can maximize the information flow along features from different levels, we propose a new edge-preserving densely connected encoder-decoder structure with multi-level pyramid pooling module for estimating the transmission map. This network is optimized using a newly introduced edge-preserving loss function. To further incorporate the mutual structural information between the estimated transmission map and the dehazed result, we propose a joint-discriminator based on generative adversarial network framework to decide whether the corresponding dehazed image and the estimated transmission map are real or fake. An ablation study is conducted to demonstrate the effectiveness of each module evaluated at both estimated transmission map and dehazed result. Extensive experiments demonstrate that the proposed method achieves significant improvements over the state-of-the-art methods. Code and dataset is made available at: https://github.com/hezhangsprinter/DCPDN

...read moreread less

708 citations

Collapse