Showing papers on "Markov random field published in 2015"

PDF

Open Access

Proceedings Article•DOI•

Semantic Image Segmentation via Deep Parsing Network

[...]

Ziwei Liu¹, Xiaoxiao Li¹, Ping Luo¹, Chen Change Loy¹, Xiaoou Tang¹ - Show less +1 more•Institutions (1)

07 Dec 2015

TL;DR: Deep Parsing Network (DPN) as mentioned in this paper proposes a convolutional neural network (CNN) to model unary terms and additional layers are carefully devised to approximate the mean field algorithm (MF) for pairwise terms.

...read moreread less

Abstract: This paper addresses semantic image segmentation by incorporating rich information into Markov Random Field (MRF), including high-order relations and mixture of label contexts. Unlike previous works that optimized MRFs using iterative algorithm, we solve MRF by proposing a Convolutional Neural Network (CNN), namely Deep Parsing Network (DPN), which enables deterministic end-to-end computation in a single forward pass. Specifically, DPN extends a contemporary CNN architecture to model unary terms and additional layers are carefully devised to approximate the mean field algorithm (MF) for pairwise terms. It has several appealing properties. First, different from the recent works that combined CNN and MRF, where many iterations of MF were required for each training image during back-propagation, DPN is able to achieve high performance by approximating one iteration of MF. Second, DPN represents various types of pairwise terms, making many existing works as its special cases. Third, DPN makes MF easier to be parallelized and speeded up in Graphical Processing Unit (GPU). DPN is thoroughly evaluated on the PASCAL VOC 2012 dataset, where a single DPN model yields a new state-of-the-art segmentation accuracy of 77.5%.

...read moreread less

693 citations

Proceedings Article•DOI•

Learning a convolutional neural network for non-uniform motion blur removal

[...]

Jian Sun¹, Wenfei Cao¹, Zongben Xu¹, Jean Ponce²•Institutions (2)

Xi'an Jiaotong University¹, École Normale Supérieure²

07 Jun 2015

TL;DR: In this article, a deep learning approach is proposed to predict the probabilistic distribution of motion blur at the patch level using a convolutional neural network (CNN) and further extend the candidate set of motion kernels predicted by the CNN using carefully designed image rotations.

...read moreread less

Abstract: In this paper, we address the problem of estimating and removing non-uniform motion blur from a single blurry image. We propose a deep learning approach to predicting the probabilistic distribution of motion blur at the patch level using a convolutional neural network (CNN). We further extend the candidate set of motion kernels predicted by the CNN using carefully designed image rotations. A Markov random field model is then used to infer a dense non-uniform motion blur field enforcing motion smoothness. Finally, motion blur is removed by a non-uniform deblurring model using patch-level image prior. Experimental evaluations show that our approach can effectively estimate and remove complex non-uniform motion blur that is not handled well by previous approaches.

...read moreread less

678 citations

Journal Article•DOI•

Multi-focus image fusion using dictionary-based sparse representation

[...]

Mansour Nejati¹, Shadrokh Samavi², Shahram Shirani²•Institutions (2)

Isfahan University of Technology¹, McMaster University²

01 Sep 2015-Information Fusion

TL;DR: This paper presents a novel multi-focus image fusion method in spatial domain that utilizes a dictionary which is learned from local patches of source images and outperforms existing state-of-the-art methods, in terms of visual and quantitative evaluations.

...read moreread less

343 citations

Posted Content•

Learning a Convolutional Neural Network for Non-uniform Motion Blur Removal

[...]

Jian Sun¹, Wenfei Cao¹, Zongben Xu¹, Jean Ponce²•Institutions (2)

Xi'an Jiaotong University¹, École Normale Supérieure²

02 Mar 2015-arXiv: Computer Vision and Pattern Recognition

TL;DR: A deep learning approach to predicting the probabilistic distribution of motion blur at the patch level using a convolutional neural network (CNN) is proposed and the candidate set of motion kernels predicted by the CNN are extended using carefully designed image rotations.

...read moreread less

255 citations

Journal Article•DOI•

Optimal Symmetric Multimodal Templates and Concatenated Random Forests for Supervised Brain Tumor Segmentation (Simplified) with ANTsR

[...]

Nicholas J. Tustison¹, K. L. Shrinidhi², Max Wintermark¹, Christopher R. Durst¹, Benjamin M. Kandel², James C. Gee², Murray Grossman², Brian B. Avants² - Show less +4 more•Institutions (2)

University of Virginia¹, University of Pennsylvania²

01 Apr 2015-Neuroinformatics

TL;DR: This work introduces a framework for supervised segmentation based on multiple modality intensity, geometry, and asymmetry feature sets that interface the supervised learning capabilities of the random forest model with regularized probabilistic segmentation using the recently developed ANTsR package.

...read moreread less

Abstract: Segmenting and quantifying gliomas from MRI is an important task for diagnosis, planning intervention, and for tracking tumor changes over time. However, this task is complicated by the lack of prior knowledge concerning tumor location, spatial extent, shape, possible displacement of normal tissue, and intensity signature. To accommodate such complications, we introduce a framework for supervised segmentation based on multiple modality intensity, geometry, and asymmetry feature sets. These features drive a supervised whole-brain and tumor segmentation approach based on random forest-derived probabilities. The asymmetry-related features (based on optimal symmetric multimodal templates) demonstrate excellent discriminative properties within this framework. We also gain performance by generating probability maps from random forest models and using these maps for a refining Markov random field regularized probabilistic segmentation. This strategy allows us to interface the supervised learning capabilities of the random forest model with regularized probabilistic segmentation using the recently developed ANTsR package--a comprehensive statistical and visualization interface between the popular Advanced Normalization Tools (ANTs) and the R statistical project. The reported algorithmic framework was the top-performing entry in the MICCAI 2013 Multimodal Brain Tumor Segmentation challenge. The challenge data were widely varying consisting of both high-grade and low-grade glioma tumor four-modality MRI from five different institutions. Average Dice overlap measures for the final algorithmic assessment were 0.87, 0.78, and 0.74 for "complete", "core", and "enhanced" tumor components, respectively.

...read moreread less

245 citations

Journal Article•DOI•

Supervised Spectral–Spatial Hyperspectral Image Classification With Weighted Markov Random Fields

[...]

Le Sun¹, Zebin Wu¹, Jianjun Liu¹, Liang Xiao¹, Zhihui Wei¹ - Show less +1 more•Institutions (1)

Nanjing University of Science and Technology¹

01 Mar 2015-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: This model takes full advantage of exploiting the spatial and contextual information present in the hyperspectral image, and outperforms many state-of-the-art methods in terms of the overall accuracy, average accuracy, and kappa (k) statistic.

...read moreread less

Abstract: This paper presents a new approach for hyperspectral image classification exploiting spectral-spatial information. Under the maximum a posteriori framework, we propose a supervised classification model which includes a spectral data fidelity term and a spatially adaptive Markov random field (MRF) prior in the hidden field. The data fidelity term adopted in this paper is learned from the sparse multinomial logistic regression (SMLR) classifier, while the spatially adaptive MRF prior is modeled by a spatially adaptive total variation (SpATV) regularization to enforce a spatially smooth classifier. To further improve the classification accuracy, the true labels of training samples are fixed as an additional constraint in the proposed model. Thus, our model takes full advantage of exploiting the spatial and contextual information present in the hyperspectral image. An efficient hyperspectral image classification algorithm, named SMLR-SpATV, is then developed to solve the final proposed model using the alternating direction method of multipliers. Experimental results on real hyperspectral data sets demonstrate that the proposed approach outperforms many state-of-the-art methods in terms of the overall accuracy, average accuracy, and kappa (k) statistic.

...read moreread less

210 citations

Journal Article•DOI•

Bayesian Inference of Multiple Gaussian Graphical Models

[...]

Christine B. Peterson¹, Francesco C. Stingo², Marina Vannucci³•Institutions (3)

Stanford University¹, University of Texas MD Anderson Cancer Center², Rice University³

01 Mar 2015-Journal of the American Statistical Association

TL;DR: This article addresses the problem of inferring multiple undirected networks in situations where some of the networks may be unrelated, while others share common features, and proposes a Bayesian approach to inference on multiple Gaussian graphical models.

...read moreread less

Abstract: In this article, we propose a Bayesian approach to inference on multiple Gaussian graphical models. Specifically, we address the problem of inferring multiple undirected networks in situations where some of the networks may be unrelated, while others share common features. We link the estimation of the graph structures via a Markov random field (MRF) prior, which encourages common edges. We learn which sample groups have a shared graph structure by placing a spike-and-slab prior on the parameters that measure network relatedness. This approach allows us to share information between sample groups, when appropriate, as well as to obtain a measure of relative network similarity across groups. Our modeling framework incorporates relevant prior knowledge through an edge-specific informative prior and can encourage similarity to an established network. Through simulations, we demonstrate the utility of our method in summarizing relative network similarity and compare its performance against related methods. We ...

...read moreread less

189 citations

Proceedings Article•DOI•

Monocular Object Instance Segmentation and Depth Ordering with CNNs

[...]

Ziyu Zhang¹, Alexander G. Schwing¹, Sanja Fidler¹, Raquel Urtasun¹•Institutions (1)

University of Toronto¹

07 Dec 2015

TL;DR: In this article, a Markov Random Field (MRF) is proposed to predict instance-level segmentation and depth ordering from a single monocular image, where the instance ID encodes the depth ordering within image patches.

...read moreread less

Abstract: In this paper we tackle the problem of instance-level segmentation and depth ordering from a single monocular image. Towards this goal, we take advantage of convolutional neural nets and train them to directly predict instance-level segmentations where the instance ID encodes the depth ordering within image patches. To provide a coherent single explanation of an image we develop a Markov random field which takes as input the predictions of convolutional neural nets applied at overlapping patches of different resolutions, as well as the output of a connected component algorithm. It aims to predict accurate instance-level segmentation and depth ordering. We demonstrate the effectiveness of our approach on the challenging KITTI benchmark and show good performance on both tasks.

...read moreread less

173 citations

Journal Article•DOI•

LOD Generation for Urban Scenes

[...]

Yannick Verdie¹, Florent Lafarge¹, Pierre Alliez¹•Institutions (1)

French Institute for Research in Computer Science and Automation¹

08 May 2015-ACM Transactions on Graphics

TL;DR: This work introduces a novel approach that reconstructs 3D urban scenes in the form of levels of detail (LODs) by combining semantic segmentation and abstraction and outperforms general mesh approximation approaches at preserving urban structures.

...read moreread less

Abstract: We introduce a novel approach that reconstructs 3D urban scenes in the form of levels of detail (LODs). Starting from raw datasets such as surface meshes generated by multiview stereo systems, our algorithm proceeds in three main steps: classification, abstraction, and reconstruction. From geometric attributes and a set of semantic rules combined with a Markov random field, we classify the scene into four meaningful classes. The abstraction step detects and regularizes planar structures on buildings, fits icons on trees, roofs, and facades, and performs filtering and simplification for LOD generation. The abstracted data are then provided as input to the reconstruction step which generates watertight buildings through a min-cut formulation on a set of 3D arrangements. Our experiments on complex buildings and large-scale urban scenes show that our approach generates meaningful LODs while being robust and scalable. By combining semantic segmentation and abstraction, it also outperforms general mesh approximation approaches at preserving urban structures.

...read moreread less

164 citations

Journal Article•DOI•

Context-Aware Patch-Based Image Inpainting Using Markov Random Field Modeling

[...]

Tijana Ruzic¹, Aleksandra Pizurica¹•Institutions (1)

Ghent University¹

01 Jan 2015-IEEE Transactions on Image Processing

TL;DR: A general approach is introduced for context-aware patch-based image inpainting, where textural descriptors are used to guide and accelerate the search for well-matching (candidate) patches and the resulting optimization problem is solved with an efficient low-complexity inference method.

...read moreread less

Abstract: In this paper, we first introduce a general approach for context-aware patch-based image inpainting, where textural descriptors are used to guide and accelerate the search for well-matching (candidate) patches. A novel top-down splitting procedure divides the image into variable size blocks according to their context, constraining thereby the search for candidate patches to nonlocal image regions with matching context. This approach can be employed to improve the speed and performance of virtually any (patch-based) inpainting method. We apply this approach to the so-called global image inpainting with the Markov random field (MRF) prior, where MRF encodes a priori knowledge about consistency of neighboring image patches. We solve the resulting optimization problem with an efficient low-complexity inference method. Experimental results demonstrate the potential of the proposed approach in inpainting applications like scratch, text, and object removal. Improvement and significant acceleration of a related global MRF-based inpainting method is also evident.

...read moreread less

158 citations

Posted Content•

Semantic Image Segmentation via Deep Parsing Network

[...]

Ziwei Liu¹, Xiaoxiao Li¹, Ping Luo¹, Chen Change Loy¹, Xiaoou Tang¹ - Show less +1 more•Institutions (1)

The Chinese University of Hong Kong¹

09 Sep 2015-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper addresses semantic image segmentation by incorporating rich information into Markov Random Field (MRF), including high-order relations and mixture of label contexts by proposing a Convolutional Neural Network (CNN), namely Deep Parsing Network (DPN), which enables deterministic end-to-end computation in a single forward pass.

...read moreread less

Journal Article•DOI•

Accurate Stereo Matching by Two-Step Energy Minimization

[...]

Mikhail G. Mozerov¹, Joost van de Weijer¹•Institutions (1)

Autonomous University of Barcelona¹

22 Jan 2015-IEEE Transactions on Image Processing

TL;DR: It is shown that a global optimization with a fully connected model can be solved by cost-filtering methods, and a two-step energy-minimization algorithm is proposed that achieves the state-of-the-arts results.

...read moreread less

Abstract: In stereo matching, cost-filtering methods and energy-minimization algorithms are considered as two different techniques. Due to their global extent, energy-minimization methods obtain good stereo matching results. However, they tend to fail in occluded regions, in which cost-filtering approaches obtain better results. In this paper, we intend to combine both the approaches with the aim to improve overall stereo matching results. We show that a global optimization with a fully connected model can be solved by cost-filtering methods. Based on this observation, we propose to perform stereo matching as a two-step energy-minimization algorithm. We consider two Markov random field (MRF) models: 1) a fully connected model defined on the complete set of pixels in an image and 2) a conventional locally connected model. We solve the energy-minimization problem for the fully connected model, after which the marginal function of the solution is used as the unary potential in the locally connected MRF model. Experiments on the Middlebury stereo data sets show that the proposed method achieves the state-of-the-arts results.

...read moreread less

Proceedings Article•DOI•

Real-time coarse-to-fine topologically preserving segmentation

[...]

Jian Yao¹, Marko Boben², Sanja Fidler¹, Raquel Urtasun¹•Institutions (2)

University of Toronto¹, University of Ljubljana²

07 Jun 2015

TL;DR: This paper proposes a coarse to fine optimization technique that speeds up inference in terms of the number of updates by an order of magnitude and significantly outperforms the baselines in the segmentation metrics and achieves the lowest error on the stereo task.

...read moreread less

Abstract: In this paper, we tackle the problem of unsupervised segmentation in the form of superpixels. Our main emphasis is on speed and accuracy. We build on [31] to define the problem as a boundary and topology preserving Markov random field. We propose a coarse to fine optimization technique that speeds up inference in terms of the number of updates by an order of magnitude. Our approach is shown to outperform [31] while employing a single iteration. We evaluate and compare our approach to state-of-the-art superpixel algorithms on the BSD and KITTI benchmarks. Our approach significantly outperforms the baselines in the segmentation metrics and achieves the lowest error on the stereo task.

...read moreread less

Proceedings Article•DOI•

segDeepM: Exploiting segmentation and context in deep neural networks for object detection

[...]

Yukun Zhu¹, Raquel Urtasun¹, Ruslan Salakhutdinov¹, Sanja Fidler¹•Institutions (1)

University of Toronto¹

07 Jun 2015

TL;DR: In this paper, the authors propose an approach that exploits object segmentation in order to improve the accuracy of object detection, and frame the problem as inference in a Markov Random Field, in which each detection hypothesis scores object appearance as well as contextual information using Convolutional Neural Networks.

...read moreread less

Abstract: In this paper, we propose an approach that exploits object segmentation in order to improve the accuracy of object detection. We frame the problem as inference in a Markov Random Field, in which each detection hypothesis scores object appearance as well as contextual information using Convolutional Neural Networks, and allows the hypothesis to choose and score a segment out of a large pool of accurate object segmentation proposals. This enables the detector to incorporate additional evidence when it is available and thus results in more accurate detections. Our experiments show an improvement of 4.1% in mAP over the R-CNN baseline on PASCAL VOC 2010, and 3.4% over the current state-of-the-art, demonstrating the power of our approach.

...read moreread less

Proceedings Article•DOI•

Enhancing Road Maps by Parsing Aerial Images Around the World

[...]

Gellert Mattyus, Shenlong Wang¹, Sanja Fidler¹, Raquel Urtasun¹•Institutions (1)

University of Toronto¹

07 Dec 2015

TL;DR: This paper proposes to exploit aerial images in order to enhance freely available world maps using OpenStreetMap using a Markov random field parameterized in terms of the location of the road-segment centerlines as well as their width to enable very efficient inference and returns only topologically correct roads.

...read moreread less

Abstract: In recent years, contextual models that exploit maps have been shown to be very effective for many recognition and localization tasks. In this paper we propose to exploit aerial images in order to enhance freely available world maps. Towards this goal, we make use of OpenStreetMap and formulate the problem as the one of inference in a Markov random field parameterized in terms of the location of the road-segment centerlines as well as their width. This parameterization enables very efficient inference and returns only topologically correct roads. In particular, we can segment all OSM roads in the whole world in a single day using a small cluster of 10 computers. Importantly, our approach generalizes very well, it can be trained using only 1.5 km2 aerial imagery and produce very accurate results in any location across the globe. We demonstrate the effectiveness of our approach outperforming the state-of-the-art in two new benchmarks that we collect. We then show how our enhanced maps are beneficial for semantic segmentation of ground images.

...read moreread less

Journal Article•DOI•

An Efficient MRF Embedded Level Set Method for Image Segmentation

[...]

Xi Yang¹, Xinbo Gao¹, Dacheng Tao², Xuelong Li³, Jie Li¹ - Show less +1 more•Institutions (3)

Xidian University¹, University of Technology, Sydney², Chinese Academy of Sciences³

01 Jan 2015-IEEE Transactions on Image Processing

TL;DR: The proposed fast and robust level set method is robust against various kinds of noises and can segment an image of size 500 × 500 within 3 s on MATLAB R2010b installed in a computer with 3.30-GHz CPU and 4-GB memory.

...read moreread less

Abstract: This paper presents a fast and robust level set method for image segmentation. To enhance the robustness against noise, we embed a Markov random field (MRF) energy function to the conventional level set energy function. This MRF energy function builds the correlation of a pixel with its neighbors and encourages them to fall into the same region. To obtain a fast implementation of the MRF embedded level set model, we explore algebraic multigrid (AMG) and sparse field method (SFM) to increase the time step and decrease the computation domain, respectively. Both AMG and SFM can be conducted in a parallel fashion, which facilitates the processing of our method for big image databases. By comparing the proposed fast and robust level set method with the standard level set method and its popular variants on noisy synthetic images, synthetic aperture radar (SAR) images, medical images, and natural images, we comprehensively demonstrate the new method is robust against various kinds of noises. In particular, the new level set method can segment an image of size 500 × 500 within 3 s on MATLAB R2010b installed in a computer with 3.30-GHz CPU and 4-GB memory.

...read moreread less

Proceedings Article•DOI•

Robust Nonrigid Registration by Convex Optimization

[...]

Qifeng Chen¹, Vladlen Koltun²•Institutions (2)

Stanford University¹, Intel²

07 Dec 2015

TL;DR: This work casts isometric embedding as MRF optimization and applies efficient global optimization algorithms based on linear programming relaxations to solve the challenge of nonrigid registration of 3D surfaces.

...read moreread less

Abstract: We present an approach to nonrigid registration of 3D surfaces. We cast isometric embedding as MRF optimization and apply efficient global optimization algorithms based on linear programming relaxations. The Markov random field perspective suggests a natural connection with robust statistics and motivates robust forms of the intrinsic distortion functional. Our approach outperforms a large body of prior work by a significant margin, increasing registration precision on real data by a factor of 3.

...read moreread less

Proceedings Article•DOI•

Efficiently Learning Ising Models on Arbitrary Graphs

[...]

Guy Bresler¹•Institutions (1)

Massachusetts Institute of Technology¹

14 Jun 2015

TL;DR: A simple greedy procedure allows to learn the structure of an Ising model on an arbitrary bounded-degree graph in time on the order of p2, and it is shown that for any node there exists at least one neighbor with which it has a high mutual information.

...read moreread less

Abstract: graph underlying an Ising model from i.i.d. samples. Over the last fifteen years this problem has been of significant interest in the statistics, machine learning, and statistical physics communities, and much of the effort has been directed towards finding algorithms with low computational cost for various restricted classes of models. Nevertheless, for learning Ising models on general graphs with p nodes of degree at most d, it is not known whether or not it is possible to improve upon the pd computation needed to exhaustively search over all possible neighborhoods for each node.In this paper we show that a simple greedy procedure allows to learn the structure of an Ising model on an arbitrary bounded-degree graph in time on the order of p2. We make no assumptions on the parameters except what is necessary for identifiability of the model, and in particular the results hold at low-temperatures as well as for highly non-uniform models. The proof rests on a new structural property of Ising models: we show that for any node there exists at least one neighbor with which it has a high mutual information.

...read moreread less

Proceedings Article•DOI•

How many bits does it take for a stimulus to be salient

[...]

Sayed Hossein Khatoonabadi¹, Nuno Vasconcelos², Ivan V. Bajic¹, Yufeng Shan³•Institutions (3)

Simon Fraser University¹, University of California, San Diego², Cisco Systems, Inc.³

07 Jun 2015

TL;DR: A direct measure of saliency, namely the number of bits required by an optimal video compressor to encode a given video patch, is proposed, and it is shown that features derived from this measure are highly predictive of eye fixations.

...read moreread less

Abstract: Visual saliency has been shown to depend on the unpredictability of the visual stimulus given its surround. Various previous works have advocated the equivalence between stimulus saliency and uncompressibility. We propose a direct measure of this quantity, namely the number of bits required by an optimal video compressor to encode a given video patch, and show that features derived from this measure are highly predictive of eye fixations. To account for global saliency effects, these are embedded in a Markov random field model. The resulting saliency measure is shown to achieve state-of-the-art accuracy for the prediction of fixations, at a very low computational cost. Since most modern cameras incorporate video encoders, this paves the way for in-camera saliency estimation, which could be useful in a variety of computer vision applications.

...read moreread less

Proceedings Article•DOI•

Incorporating Word Correlation Knowledge into Topic Modeling

[...]

Pengtao Xie¹, Diyi Yang¹, Eric P. Xing¹•Institutions (1)

Carnegie Mellon University¹

01 Jan 2015

TL;DR: A Markov Random Field regularized Latent Dirichlet Allocation model, which defines a MRF on the latent topic layer of LDA to encourage words labeled as similar to share the same topic label, and can accommodate the subtlety that whether two words are similar depends on which topic they appear in.

...read moreread less

Abstract: This paper studies how to incorporate the external word correlation knowledge to improve the coherence of topic modeling. Existing topic models assume words are generated independently and lack the mechanism to utilize the rich similarity relationships among words to learn coherent topics. To solve this problem, we build a Markov Random Field (MRF) regularized Latent Dirichlet Allocation (LDA) model, which defines a MRF on the latent topic layer of LDA to encourage words labeled as similar to share the same topic label. Under our model, the topic assignment of each word is not independent, but rather affected by the topic labels of its correlated words. Similar words have better chance to be put into the same topic due to the regularization of MRF, hence the coherence of topics can be boosted. In addition, our model can accommodate the subtlety that whether two words are similar depends on which topic they appear in, which allows word with multiple senses to be put into different topics properly. We derive a variational inference method to infer the posterior probabilities and learn model parameters and present techniques to deal with the hardto-compute partition function in MRF. Experiments on two datasets demonstrate the effectiveness of our model.

...read moreread less

Journal Article•DOI•

Estimation of positive definite M-matrices and structure learning for attractive Gaussian Markov Random fields

[...]

Martin Slawski¹, Matthias Hein¹•Institutions (1)

Saarland University¹

15 May 2015-Linear Algebra and its Applications

TL;DR: This work studies estimation of M-matrices taking the role of inverse second moment or precision matrices using sign-constrained log-determinant divergence minimization, and proposes an algorithm based on block coordinate descent in which each sub-problem can be recast as non-negative least squares problem.

...read moreread less

Journal Article•DOI•

Random heterogeneous materials via texture synthesis

[...]

Xingchen Liu¹, Vadim Shapiro¹•Institutions (1)

University of Wisconsin-Madison¹

01 Mar 2015-Computational Materials Science

TL;DR: It is shown that formulating material reconstruction as a Markov Random Field (MRF) texture synthesis leads to a number of advantages over the traditional optimization-based approaches, including improved computational efficiency, preservation of many material descriptors, including correlation functions and Minkowski functionals, ability to reconstruct anisotropic materials, and direct use of the gray-scale material images and two-dimensional cross-sections.

...read moreread less

Journal Article•DOI•

Lidar waveform based analysis of depth images constructed using sparse single-photon data

[...]

Yoann Altmann¹, Ximing Ren¹, Aongus McCarthy¹, Gerald S. Buller¹, Steve McLaughlin¹ - Show less +1 more•Institutions (1)

Heriot-Watt University¹

09 Jul 2015-arXiv: Applications

TL;DR: A new Bayesian model and algorithm used for depth and reflectivity profiling using full waveforms from the time-correlated single-photon counting measurement in the limit of very low photon counts is presented.

...read moreread less

Abstract: This paper presents a new Bayesian model and algorithm used for depth and intensity profiling using full waveforms from the time-correlated single photon counting (TCSPC) measurement in the limit of very low photon counts. The model proposed represents each Lidar waveform as a combination of a known impulse response, weighted by the target intensity, and an unknown constant background, corrupted by Poisson noise. Prior knowledge about the problem is embedded in a hierarchical model that describes the dependence structure between the model parameters and their constraints. In particular, a gamma Markov random field (MRF) is used to model the joint distribution of the target intensity, and a second MRF is used to model the distribution of the target depth, which are both expected to exhibit significant spatial correlations. An adaptive Markov chain Monte Carlo algorithm is then proposed to compute the Bayesian estimates of interest and perform Bayesian inference. This algorithm is equipped with a stochastic optimization adaptation mechanism that automatically adjusts the parameters of the MRFs by maximum marginal likelihood estimation. Finally, the benefits of the proposed methodology are demonstrated through a serie of experiments using real data.

...read moreread less

Journal Article•DOI•

Online control of simulated humanoids using particle belief propagation

[...]

Perttu Hämäläinen¹, Joose Rajamäki¹, C. Karen Liu²•Institutions (2)

Aalto University¹, Georgia Institute of Technology²

27 Jul 2015

TL;DR: A novel, general-purpose Model-Predictive Control algorithm that combines multimodal, gradient-free sampling and a Markov Random Field factorization to effectively perform simultaneous path finding and smoothing in high-dimensional spaces is presented.

...read moreread less

Abstract: We present a novel, general-purpose Model-Predictive Control (MPC) algorithm that we call Control Particle Belief Propagation (C-PBP) C-PBP combines multimodal, gradient-free sampling and a Markov Random Field factorization to effectively perform simultaneous path finding and smoothing in high-dimensional spaces We demonstrate the method in online synthesis of interactive and physically valid humanoid movements, including balancing, recovery from both small and extreme disturbances, reaching, balancing on a ball, juggling a ball, and fully steerable locomotion in an environment with obstacles Such a large repertoire of movements has not been demonstrated before at interactive frame rates, especially considering that all our movement emerges from simple cost functions Furthermore, we abstain from using any precomputation to train a control policy offline, reference data such as motion capture clips, or state machines that break the movements down into more manageable subtasks Operating under these conditions enables rapid and convenient iteration when designing the cost functions

...read moreread less

Posted Content•

Fast image-based obstacle detection from unmanned surface vehicles

[...]

Matej Kristan, Vildana Sulic, Stanislav Kovačič, Janez Perš

06 Mar 2015-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, a Markov random field framework is adopted and a highly efficient algorithm for simultaneous optimization of model parameters and segmentation mask estimation is derived, which does not require computationally intensive extraction of texture features and comfortably runs in realtime.

...read moreread less

Abstract: Obstacle detection plays an important role in unmanned surface vehicles (USV). The USVs operate in highly diverse environments in which an obstacle may be a floating piece of wood, a scuba diver, a pier, or a part of a shoreline, which presents a significant challenge to continuous detection from images taken onboard. This paper addresses the problem of online detection by constrained unsupervised segmentation. To this end, a new graphical model is proposed that affords a fast and continuous obstacle image-map estimation from a single video stream captured onboard a USV. The model accounts for the semantic structure of marine environment as observed from USV by imposing weak structural constraints. A Markov random field framework is adopted and a highly efficient algorithm for simultaneous optimization of model parameters and segmentation mask estimation is derived. Our approach does not require computationally intensive extraction of texture features and comfortably runs in real-time. The algorithm is tested on a new, challenging, dataset for segmentation and obstacle detection in marine environments, which is the largest annotated dataset of its kind. Results on this dataset show that our model outperforms the related approaches, while requiring a fraction of computational effort.

...read moreread less

Proceedings Article•DOI•

Towards Probabilistic Volumetric Reconstruction Using Ray Potentials

[...]

Ali Osman Ulusoy¹, Andreas Geiger¹, Michael J. Black¹•Institutions (1)

Max Planck Society¹

19 Oct 2015

TL;DR: This paper presents a novel probabilistic foundation for volumetric 3D reconstruction as inference in a Markov random field, which accurately captures the dependencies between the occupancy and appearance of each voxel, given all input images.

...read moreread less

Abstract: This paper presents a novel probabilistic foundation for volumetric 3D reconstruction. We formulate the problem as inference in a Markov random field, which accurately captures the dependencies between the occupancy and appearance of each voxel, given all input images. Our main contribution is an approximate highly parallelized discrete-continuous inference algorithm to compute the marginal distributions of each voxel's occupancy and appearance. In contrast to the MAP solution, marginals encode the underlying uncertainty and ambiguity in the reconstruction. Moreover, the proposed algorithm allows for a Bayes optimal prediction with respect to a natural reconstruction loss. We compare our method to two state-of-the-art volumetric reconstruction algorithms on three challenging aerial datasets with LIDAR ground truth. Our experiments demonstrate that the proposed algorithm compares favorably in terms of reconstruction accuracy and the ability to expose reconstruction uncertainty.

...read moreread less

Journal Article•DOI•

GraSP: geodesic Graph-based Segmentation with Shape Priors for the functional parcellation of the cortex.

[...]

Nicolas Honnorat¹, Harini Eavani¹, Theodore D. Satterthwaite¹, Raquel E. Gur¹, Ruben C. Gur¹, Christos Davatzikos¹ - Show less +2 more•Institutions (1)

University of Pennsylvania¹

01 Feb 2015-NeuroImage

TL;DR: A novel graph-based parcellation method that relies on a discrete Markov Random Field framework that is initialization-free and rapidly segments the cortex in a single optimization, and provides superior reproducibility for a similar data fit.

...read moreread less

Journal Article•DOI•

An Automatic ${\cal U}$ -Distribution and Markov Random Field Segmentation Algorithm for PolSAR Images

[...]

Anthony P. Doulgeris¹•Institutions (1)

University of Tromsø¹

01 Apr 2015-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: The reasons behind a of the cluster evaluation from the contextual smoothing and a modified rationale for the adaptive number of classes are explained, which have simplified the overall algorithm while maintaining good visual results.

...read moreread less

Abstract: We have recently presented a novel unsupervised, non-Gaussian, and contextual clustering algorithm for segmentation of polarimetric synthetic aperture radar (PolSAR) images. This represents one of the most advanced PolSAR unsupervised statistical segmentation algorithms and uses the doubly flexible two-parameter ${\cal U}$ -distribution model for the PolSAR statistics and includes a Markov random field (MRF) approach for contextual smoothing. A goodness-of-fit testing stage adds a statistically rigorous approach to determine the significant number of classes. The fully automatic algorithm was demonstrated with good results for both simulated and real data sets. This paper discusses a rethinking of the overall strategy and leads to some simplifications. The primary issue was that the MRF optimization depends on the number of classes and did not behave well under the split-and-merge environment. We explain the reasons behind a separation of the cluster evaluation from the contextual smoothing and a modified rationale for the adaptive number of classes. Both aspects have simplified the overall algorithm while maintaining good visual results.

...read moreread less

Posted Content•

segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection

[...]

Yukun Zhu¹, Raquel Urtasun¹, Ruslan Salakhutdinov¹, Sanja Fidler¹•Institutions (1)

University of Toronto¹

15 Feb 2015-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper frames the problem as inference in a Markov Random Field, in which each detection hypothesis scores object appearance as well as contextual information using Convolutional Neural Networks, and allows the hypothesis to choose and score a segment out of a large pool of accurate object segmentation proposals.

...read moreread less

Proceedings Article•DOI•

Discrete optimization of ray potentials for semantic 3D reconstruction

[...]

Nikolay Savinov¹, Lubor Ladicky¹, Christian Häne¹, Marc Pollefeys¹•Institutions (1)

ETH Zurich¹

07 Jun 2015

TL;DR: This work proposes to formulate an optimization problem that directly optimizes the reprojection error of the 3D model with respect to the image estimates, which corresponds to the optimization over rays, where the cost function depends on the semantic class and depth of the first occupied voxel along the ray.

...read moreread less

Abstract: Dense semantic 3D reconstruction is typically formulated as a discrete or continuous problem over label assignments in a voxel grid, combining semantic and depth likelihoods in a Markov Random Field framework. The depth and semantic information is incorporated as a unary potential, smoothed by a pairwise regularizer. However, modelling likelihoods as a unary potential does not model the problem correctly leading to various undesirable visibility artifacts. We propose to formulate an optimization problem that directly optimizes the reprojection error of the 3D model with respect to the image estimates, which corresponds to the optimization over rays, where the cost function depends on the semantic class and depth of the first occupied voxel along the ray. The 2-label formulation is made feasible by transforming it into a graph-representable form under QPBO relaxation, solvable using graph cut. The multi-label problem is solved by applying α-expansion using the same relaxation in each expansion move. Our method was indeed shown to be feasible in practice, running comparably fast to the competing methods, while not suffering from ray potential approximation artifacts.

...read moreread less

Collapse