scispace - formally typeset
Open AccessProceedings ArticleDOI

A probabilistic image jigsaw puzzle solver

TLDR
In this article, the problem of reconstructing an image from a bag of square, non-overlapping image patches, the jigsaw puzzle problem, is considered and a graphical model is developed to solve it.
Abstract
We explore the problem of reconstructing an image from a bag of square, non-overlapping image patches, the jigsaw puzzle problem. Completing jigsaw puzzles is challenging and requires expertise even for humans, and is known to be NP-complete. We depart from previous methods that treat the problem as a constraint satisfaction problem and develop a graphical model to solve it. Each patch location is a node and each patch is a label at nodes in the graph. A graphical model requires a pairwise compatibility term, which measures an affinity between two neighboring patches, and a local evidence term, which we lack. This paper discusses ways to obtain these terms for the jigsaw puzzle problem. We evaluate several patch compatibility metrics, including the natural image statistics measure, and experimentally show that the dissimilarity-based compatibility – measuring the sum-of-squared color difference along the abutting boundary – gives the best results. We compare two forms of local evidence for the graphical model: a sparse-and-accurate evidence and a dense-and-noisy evidence. We show that the sparse-and-accurate evidence, fixing as few as 4 – 6 patches at their correct locations, is enough to reconstruct images consisting of over 400 patches. To the best of our knowledge, this is the largest puzzle solved in the literature. We also show that one can coarsely estimate the low resolution image from a bag of patches, suggesting that a bag of image patches encodes some geometric information about the original image.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Domain Generalization by Solving Jigsaw Puzzles

TL;DR: This model learns the semantic labels in a supervised fashion, and broadens its understanding of the data by learning from self-supervised signals how to solve a jigsaw puzzle on the same images, which helps the network to learn the concepts of spatial correlation while acting as a regularizer for the classification task.
Proceedings ArticleDOI

Jigsaw puzzles with pieces of unknown orientation

TL;DR: A tree-based reassembly that greedily merges components while respecting the geometric constraints of the puzzle problem is proposed and has state-of-the-art performance for puzzle assembly, whether or not the orientation of the pieces is known.
Book ChapterDOI

Fine-Grained Visual Classification via Progressive Multi-granularity Training of Jigsaw Patches

TL;DR: PMG-Progressive multi-granularity training as mentioned in this paper proposes a progressive training strategy that effectively fuses features from different granularities, and a random jigsaw patch generator that encourages the network to learn features at specific granularity.
Posted Content

Near-Optimal Joint Object Matching via Convex Relaxation

TL;DR: MatchLift as mentioned in this paper uses a spectral method to estimate the total number of distinct elements to be matched, and then uses a convex program to recover the ground truth maps via a parameter-free convex function.
Proceedings ArticleDOI

Data-driven shape analysis and processing

TL;DR: Data-driven methods serve an increasingly important role in discovering geometric, structural, and semantic relationships between shapes as discussed by the authors, in contrast to traditional approaches that process shapes in isolation of each other.
References
More filters
Book

Pattern Recognition and Machine Learning

TL;DR: Probability Distributions, linear models for Regression, Linear Models for Classification, Neural Networks, Graphical Models, Mixture Models and EM, Sampling Methods, Continuous Latent Variables, Sequential Data are studied.
Journal ArticleDOI

Pattern Recognition and Machine Learning

Radford M. Neal
- 01 Aug 2007 - 
TL;DR: This book covers a broad range of topics for regular factorial designs and presents all of the material in very mathematical fashion and will surely become an invaluable resource for researchers and graduate students doing research in the design of factorial experiments.
Journal ArticleDOI

Additive Logistic Regression : A Statistical View of Boosting

TL;DR: This work shows that this seemingly mysterious phenomenon of boosting can be understood in terms of well-known statistical principles, namely additive modeling and maximum likelihood, and develops more direct approximations and shows that they exhibit nearly identical results to boosting.
Journal ArticleDOI

LabelMe: A Database and Web-Based Tool for Image Annotation

TL;DR: In this article, a large collection of images with ground truth labels is built to be used for object detection and recognition research, such data is useful for supervised learning and quantitative evaluation.
Proceedings Article

Using the triangle inequality to accelerate k-means

TL;DR: The accelerated k-means algorithm is shown how to accelerate dramatically, while still always computing exactly the same result as the standard algorithm, and is effective for datasets with up to 1000 dimensions, and becomes more and more effective as the number k of clusters increases.
Related Papers (5)