Showing papers by "Kaiming He published in 2013"

PDF

Open Access

Journal Article•DOI•

[...]

Kaiming He¹, Jian Sun¹, Xiaoou Tang²•Institutions (2)

Microsoft¹, The Chinese University of Hong Kong²

01 Jun 2013-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: The guided filter is a novel explicit image filter derived from a local linear model that can be used as an edge-preserving smoothing operator like the popular bilateral filter, but it has better behaviors near edges.

...read moreread less

Abstract: In this paper, we propose a novel explicit image filter called guided filter. Derived from a local linear model, the guided filter computes the filtering output by considering the content of a guidance image, which can be the input image itself or another different image. The guided filter can be used as an edge-preserving smoothing operator like the popular bilateral filter [1], but it has better behaviors near edges. The guided filter is also a more generic concept beyond smoothing: It can transfer the structures of the guidance image to the filtering output, enabling new filtering applications like dehazing and guided feathering. Moreover, the guided filter naturally has a fast and nonapproximate linear time algorithm, regardless of the kernel size and the intensity range. Currently, it is one of the fastest edge-preserving filters. Experiments show that the guided filter is both effective and efficient in a great variety of computer vision and computer graphics applications, including edge-aware smoothing, detail enhancement, HDR compression, image matting/feathering, dehazing, joint upsampling, etc.

...read moreread less

4,730 citations

Proceedings Article•DOI•

K-Means Hashing: An Affinity-Preserving Quantization Method for Learning Binary Compact Codes

[...]

Kaiming He¹, Fang Wen¹, Jian Sun¹•Institutions (1)

Microsoft¹

23 Jun 2013

TL;DR: A novel Affinity-Preserving K-means algorithm which simultaneously performs k-mean clustering and learns the binary indices of the quantized cells and outperforms various state-of-the-art hashing encoding methods.

...read moreread less

Abstract: In computer vision there has been increasing interest in learning hashing codes whose Hamming distance approximates the data similarity. The hashing functions play roles in both quantizing the vector space and generating similarity-preserving codes. Most existing hashing methods use hyper-planes (or kernelized hyper-planes) to quantize and encode. In this paper, we present a hashing method adopting the k-means quantization. We propose a novel Affinity-Preserving K-means algorithm which simultaneously performs k-means clustering and learns the binary indices of the quantized cells. The distance between the cells is approximated by the Hamming distance of the cell indices. We further generalize our algorithm to a product space for learning longer codes. Experiments show our method, named as K-means Hashing (KMH), outperforms various state-of-the-art hashing encoding methods.

...read moreread less

437 citations

Proceedings Article•DOI•

Optimized Product Quantization for Approximate Nearest Neighbor Search

[...]

Tiezheng Ge¹, Kaiming He², Qifa Ke², Jian Sun²•Institutions (2)

University of Science and Technology of China¹, Microsoft²

23 Jun 2013

TL;DR: This paper optimization product quantization by minimizing quantization distortions w.r.t. the space decomposition and the quantization codebooks and presents two novel methods for optimization: a non-parametric method that alternatively solves two smaller sub-problems, and a parametric method guarantees the optimal solution if the input data follows some Gaussian distribution.

...read moreread less

Abstract: Product quantization is an effective vector quantization approach to compactly encode high-dimensional vectors for fast approximate nearest neighbor (ANN) search. The essence of product quantization is to decompose the original high-dimensional space into the Cartesian product of a finite number of low-dimensional subspaces that are then quantized separately. Optimal space decomposition is important for the performance of ANN search, but still remains unaddressed. In this paper, we optimize product quantization by minimizing quantization distortions w.r.t. the space decomposition and the quantization codebooks. We present two novel methods for optimization: a non-parametric method that alternatively solves two smaller sub-problems, and a parametric method that is guaranteed to achieve the optimal solution if the input data follows some Gaussian distribution. We show by experiments that our optimized approach substantially improves the accuracy of product quantization for ANN search.

...read moreread less

396 citations

Proceedings Article•DOI•

Constant Time Weighted Median Filtering for Stereo Matching and Beyond

[...]

Ziyang Ma¹, Kaiming He², Yichen Wei², Jian Sun³, Enhua Wu¹ - Show less +1 more•Institutions (3)

Chinese Academy of Sciences¹, Microsoft², Xi'an Jiaotong University³

01 Dec 2013

TL;DR: It is discovered that with this refinement, even the simple box filter aggregation achieves comparable accuracy with various sophisticated aggregation methods (with the same refinement), revealing that the previously overlooked refinement can be at least as crucial as aggregation.

...read moreread less

Abstract: Despite the continuous advances in local stereo matching for years, most efforts are on developing robust cost computation and aggregation methods. Little attention has been seriously paid to the disparity refinement. In this work, we study weighted median filtering for disparity refinement. We discover that with this refinement, even the simple box filter aggregation achieves comparable accuracy with various sophisticated aggregation methods (with the same refinement). This is due to the nice weighted median filtering properties of removing outlier error while respecting edges/structures. This reveals that the previously overlooked refinement can be at least as crucial as aggregation. We also develop the first constant time algorithm for the previously time-consuming weighted median filter. This makes the simple combination ``box aggregation + weighted median'' an attractive solution in practice for both speed and accuracy. As a byproduct, the fast weighted median filtering unleashes its potential in other applications that were hampered by high complexities. We show its superiority in various applications such as depth up sampling, clip-art JPEG artifact removal, and image stylization.

...read moreread less

295 citations

Journal Article•DOI•

Rectangling panoramic images via warping

[...]

Kaiming He¹, Huiwen Chang², Jian Sun¹•Institutions (2)

Microsoft¹, Tsinghua University²

21 Jul 2013

TL;DR: This paper presents a content-aware warping algorithm that generates rectangular images from stitched panoramic images, and demonstrates that the results are often visually plausible, and the introduced distortion is often unnoticeable.

...read moreread less

Abstract: Stitched panoramic images mostly have irregular boundaries. Artists and common users generally prefer rectangular boundaries, which can be obtained through cropping or image completion techniques. In this paper, we present a content-aware warping algorithm that generates rectangular images from stitched panoramic images. Our algorithm consists of two steps. The first local step is mesh-free and preliminarily warps the image into a rectangle. With a grid mesh placed on this rectangle, the second global step optimizes the mesh to preserve shapes and straight lines. In various experiments we demonstrate that the results of our approach are often visually plausible, and the introduced distortion is often unnoticeable.

...read moreread less

76 citations

Proceedings Article•DOI•

Joint Inverted Indexing

[...]

Yan Xia¹, Kaiming He², Fang Wen², Jian Sun³•Institutions (3)

University of Science and Technology of China¹, Microsoft², Tsinghua University³

01 Dec 2013

TL;DR: This paper presents a method that jointly optimizes all code words in all quantizers and creates them jointly, which is faster and more accurate than a recent state-of-the-art inverted indexing method.

...read moreread less

Abstract: Inverted indexing is a popular non-exhaustive solution to large scale search. An inverted file is built by a quantizer such as k-means or a tree structure. It has been found that multiple inverted files, obtained by multiple independent random quantizers, are able to achieve practically good recall and speed. Instead of computing the multiple quantizers independently, we present a method that creates them jointly. Our method jointly optimizes all code words in all quantizers. Then it assigns these code words to the quantizers. In experiments this method shows significant improvement over various existing methods that use multiple independent quantizers. On the one-billion set of SIFT vectors, our method is faster and more accurate than a recent state-of-the-art inverted indexing method.

...read moreread less

66 citations

Proceedings Article•DOI•

Content-Aware Rotation

[...]

Kaiming He¹, Huiwen Chang¹, Jian Sun²•Institutions (2)

Microsoft¹, Xi'an Jiaotong University²

01 Dec 2013

TL;DR: This work designs an optimization-based method that preserves the rotation of horizontal/vertical lines, maintains the completeness of the image content, and reduces the warping distortion.

...read moreread less

Abstract: We present an image editing tool called Content-Aware Rotation. Casually shot photos can appear tilted, and are often corrected by rotation and cropping. This trivial solution may remove desired content and hurt image integrity. Instead of doing rigid rotation, we propose a warping method that creates the perception of rotation and avoids cropping. Human vision studies suggest that the perception of rotation is mainly due to horizontal/vertical lines. We design an optimization-based method that preserves the rotation of horizontal/vertical lines, maintains the completeness of the image content, and reduces the warping distortion. An efficient algorithm is developed to address the challenging optimization. We demonstrate our content-aware rotation method on a variety of practical cases.

...read moreread less

25 citations

Patent•

Creation of Rectangular Images from Input Images

[...]

Kaiming He¹, Huiwen Chang¹, Jian Sun¹•Institutions (1)

Microsoft¹

13 Nov 2013

TL;DR: In this article, a global optimization is applied to the image by finding an energy minimum, or reduced energy below a threshold, for a function that gives the image a rectangular shape while preserving shapes and preserving straight lines.

...read moreread less

Abstract: Stitched images generated from combinations of multiple separate images mostly have irregular boundaries. Users generally prefer rectangular boundaries. Techniques for warping an image with irregular boundaries to give the image rectangular boundaries are disclosed herein. Preliminary warping of the image into the rectangle provides a rectangular shape on which to overlay a mesh. The image is reverted to its original shape with irregular boundaries and the mesh is warped accordingly. Global optimization is applied to the image by finding an energy minimum, or reduced energy below a threshold, for a function that gives the image a rectangular shape while preserving shapes and preserving straight lines. The mesh is warped according to the solution of the function and the image is stretched and/or compressed along with the mesh. This approach generates results that are qualitatively more visually attractive than other contemporary techniques.

...read moreread less

23 citations