Author

Chad Carson

Other affiliations: University of California

Bio: Chad Carson is an academic researcher from the University of California, Berkeley. The author has contributed to research on the topics of image retrieval and automatic image annotation. The author has an h-index of 9 and has co-authored 13 publications receiving 3694 citations. Previous affiliations of Chad Carson include the University of California.

Papers

Journal Article•DOI•

Blobworld: image segmentation using expectation-maximization and its application to image querying

Chad Carson¹, Serge Belongie², Hayit Greenspan³, Jitendra Malik¹•Institutions (3)

University of California, Berkeley¹, University of California, San Diego², Tel Aviv University³

01 Aug 2002-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: Results indicating that querying for images using Blobworld produces higher precision than does querying using color and texture histograms of the entire image in cases where the image contains distinctive objects are presented.

Abstract: Retrieving images from large and varied collections using image content as a key is a challenging and important problem. We present a new image representation that provides a transformation from the raw pixel data to a small set of image regions that are coherent in color and texture. This "Blobworld" representation is created by clustering pixels in a joint color-texture-position feature space. The segmentation algorithm is fully automatic and has been run on a collection of 10,000 natural images. We describe a system that uses the Blobworld representation to retrieve images from this collection. An important aspect of the system is that the user is allowed to view the internal representation of the submitted image and the query results. Similar systems do not offer the user this view into the workings of the system; consequently, query results from these systems can be inexplicable, despite the availability of knobs for adjusting the similarity metrics. By finding image regions that roughly correspond to objects, we allow querying at the level of objects rather than global image properties. We present results indicating that querying for images using Blobworld produces higher precision than does querying using color and texture histograms of the entire image in cases where the image contains distinctive objects.

1,574 citations

Book Chapter•DOI•

Blobworld: A System for Region-Based Image Indexing and Retrieval

Chad Carson¹, Megan C. Thomas¹, Serge Belongie¹, Joseph M. Hellerstein¹, Jitendra Malik¹•Institutions (1)

University of California¹

02 Jun 1999-Lecture Notes in Computer Science

TL;DR: This work indexes the blob descriptions using a lower-rank approximation to the high-dimensional distance to make large-scale retrieval feasible, and shows encouraging results for both querying and indexing.

Abstract: Blobworld is a system for image retrieval based on finding coherent image regions which roughly correspond to objects. Each image is automatically segmented into regions ("blobs") with associated color and texture descriptors. Querying is based on the attributes of one or two regions of interest, rather than a description of the entire image. In order to make large-scale retrieval feasible, we index the blob descriptions using a tree. Because indexing in the high-dimensional feature space is computationally prohibitive, we use a lower-rank approximation to the high-dimensional distance. Experiments show encouraging results for both querying and indexing.

896 citations

Proceedings Article•DOI•

Color- and texture-based image segmentation using EM and its application to content-based image retrieval

Serge Belongie¹, Chad Carson¹, Hayit Greenspan¹, Jitendra Malik¹•Institutions (1)

University of California, Berkeley¹

04 Jan 1998

TL;DR: A new image representation is presented which provides a transformation from the raw pixel data to a small set of image regions which are coherent in color and texture space based on segmentation using the expectation-maximization algorithm on combined color and texture features.

Abstract: Retrieving images from large and varied collections using image content as a key is a challenging and important problem. In this paper we present a new image representation which provides a transformation from the raw pixel data to a small set of image regions which are coherent in color and texture space. This so-called "blobworld" representation is based on segmentation using the expectation-maximization algorithm on combined color and texture features. The texture features we use for the segmentation arise from a new approach to texture description and scale selection. We describe a system that uses the blobworld representation to retrieve images. An important and unique aspect of the system is that, in the context of similarity-based querying, the user is allowed to view the internal representation of the submitted image and the query results. Similar systems do not offer the user this view into the workings of the system; consequently, the outcome of many queries on these systems can be quite inexplicable, despite the availability of knobs for adjusting the similarity metric.

548 citations

Proceedings Article•DOI•

Region-based image querying

Chad Carson¹, Serge Belongie¹, Hayit Greenspan¹, Jitendra Malik¹•Institutions (1)

University of California, Berkeley¹

20 Jun 1997

TL;DR: A new image representation is presented which provides a transformation from the raw pixel data to a small set of localized coherent regions in color and texture space based on segmentation using the expectation-maximization algorithm on combined color and texture features.

Abstract: Retrieving images from large and varied collections using image content as a key is a challenging and important problem. In this paper, we present a new image representation which provides a transformation from the raw pixel data to a small set of localized coherent regions in color and texture space. This so-called "blobworld" representation is based on segmentation using the expectation-maximization algorithm on combined color and texture features. The texture features we use for the segmentation arise from a new approach to texture description and scale selection. We describe a system that uses the blobworld representation to retrieve images. An important and unique aspect of the system is that, in the context of similarity-based querying, the user is allowed to view the internal representation of the submitted image and the query results. Similar systems do not offer the user this view into the workings of the system; consequently, the outcome of many queries on these systems can be quite inexplicable, despite the availability of knobs for adjusting the similarity metric.

318 citations

Book Chapter•DOI•

Finding Pictures of Objects in Large Collections of Images

David Forsyth¹, Jitendra Malik¹, Margaret M. Fleck², Hayit Greenspan³, Hayit Greenspan¹, Thomas Leung¹, Serge Belongie¹, Chad Carson¹, Chris Bregler¹•Institutions (3)

University of California, Berkeley¹, University of Iowa², California Institute of Technology³

13 Apr 1996

TL;DR: The approach to object recognition is described, which is structured around a sequence of increasingly specialized grouping activities that assemble coherent regions of image that can be shown to satisfy increasingly stringent constraints.

Abstract: Retrieving images from very large collections, using image content as a key, is becoming an important problem. Users prefer to ask for pictures using notions of content that are strongly oriented to the presence of abstractly defined objects. Computer programs that implement these queries automatically are desirable, but are hard to build because conventional object recognition techniques from computer vision cannot recognize very general objects in very general contexts. This paper describes our approach to object recognition, which is structured around a sequence of increasingly specialized grouping activities that assemble coherent regions of image that can be shown to satisfy increasingly stringent constraints. The constraints that are satisfied provide a form of object classification in quite general contexts. This view of recognition is distinguished by: far richer involvement of early visual primitives, including color and texture; hierarchical grouping and learning strategies in the classification process; the ability to deal with rather general objects in uncontrolled configurations and contexts. We illustrate these properties with four case-studies: one demonstrating the use of color and texture descriptors; one showing how trees can be described by fusing texture and geometric properties; one learning scenery concepts using grouped features; and one showing how this view of recognition yields a program that can tell, quite accurately, whether a picture contains naked people or not.

219 citations

Cited by

Journal Article•DOI•

Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope

Aude Oliva¹, Antonio Torralba²•Institutions (2)

Brigham and Women's Hospital¹, Carleton College²

01 May 2001-International Journal of Computer Vision

TL;DR: The performance of the spatial envelope model shows that specific information about object shape or identity is not a requirement for scene categorization and that modeling a holistic representation of the scene informs about its probable semantic category.

Abstract: In this paper, we propose a computational model of the recognition of real world scenes that bypasses the segmentation and the processing of individual objects or regions. The procedure is based on a very low dimensional representation of the scene, that we term the Spatial Envelope. We propose a set of perceptual dimensions (naturalness, openness, roughness, expansion, ruggedness) that represent the dominant spatial structure of a scene. Then, we show that these dimensions may be reliably estimated using spectral and coarsely localized information. The model generates a multidimensional space in which scenes sharing membership in semantic categories (e.g., streets, highways, coasts) are projected closed together. The performance of the spatial envelope model shows that specific information about object shape or identity is not a requirement for scene categorization and that modeling a holistic representation of the scene informs about its probable semantic category.

6,882 citations

Proceedings Article•DOI•

A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics

David Martin¹, Charless C. Fowlkes¹, D. Tal¹, Jitendra Malik¹•Institutions (1)

University of California, Berkeley¹

07 Jul 2001

TL;DR: In this paper, the authors present a database containing ground truth segmentations produced by humans for images of a wide variety of natural scenes, and define an error measure which quantifies the consistency between segmentations of differing granularities.

Abstract: This paper presents a database containing 'ground truth' segmentations produced by humans for images of a wide variety of natural scenes. We define an error measure which quantifies the consistency between segmentations of differing granularities and find that different human segmentations of the same image are highly consistent. Use of this dataset is demonstrated in two applications: (1) evaluating the performance of segmentation algorithms and (2) measuring probability distributions associated with Gestalt grouping factors as well as statistics of image region properties.

6,505 citations

Journal Article•DOI•

Content-based image retrieval at the end of the early years

Arnold W. M. Smeulders¹, Marcel Worring¹, Simone Santini², Amarnath Gupta², Ramesh Jain•Institutions (2)

University of Amsterdam¹, University of California, San Diego²

01 Dec 2000-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: The working conditions of content-based retrieval: patterns of use, types of pictures, the role of semantics, and the sensory gap are discussed, as well as aspects of system engineering: databases, system architecture, and evaluation.

Abstract: Presents a review of 200 references in content-based image retrieval. The paper starts with discussing the working conditions of content-based retrieval: patterns of use, types of pictures, the role of semantics, and the sensory gap. Subsequent sections discuss computational steps for image retrieval systems. Step one of the review is image processing for retrieval sorted by color, texture, and local geometry. Features for retrieval are discussed next, sorted by: accumulative and global features, salient points, object and shape features, signs, and structural combinations thereof. Similarity of pictures and objects in pictures is reviewed for each of the feature types, in close connection to the types and means of feedback the user of the systems is capable of giving by interaction. We briefly discuss aspects of system engineering: databases, system architecture, and evaluation. In the concluding section, we present our view on: the driving force of the field, the heritage from computer vision, the influence on computer vision, the role of similarity and of interaction, the need for databases, the problem of evaluation, and the role of the semantic gap.

6,447 citations

Journal Article•DOI•

Contour Detection and Hierarchical Image Segmentation

Pablo Arbeláez¹, Michael Maire², Charless C. Fowlkes³, Jitendra Malik¹•Institutions (3)

University of California, Berkeley¹, California Institute of Technology², University of California, Irvine³

01 May 2011-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This paper investigates two fundamental problems in computer vision: contour detection and image segmentation and presents state-of-the-art algorithms for both of these tasks.

Abstract: This paper investigates two fundamental problems in computer vision: contour detection and image segmentation. We present state-of-the-art algorithms for both of these tasks. Our contour detector combines multiple local cues into a globalization framework based on spectral clustering. Our segmentation algorithm consists of generic machinery for transforming the output of any contour detector into a hierarchical region tree. In this manner, we reduce the problem of image segmentation to that of contour detection. Extensive experimental evaluation demonstrates that both our contour detection and segmentation methods significantly outperform competing algorithms. The automatically generated hierarchical segmentations can be interactively refined by user-specified annotations. Computation at multiple image resolutions provides a means of coupling our system to recognition applications.

5,068 citations

Journal Article•DOI•

The Earth Mover's Distance as a Metric for Image Retrieval

Yossi Rubner¹, Carlo Tomasi¹, Leonidas J. Guibas¹•Institutions (1)

Stanford University¹

01 Nov 2000-International Journal of Computer Vision

TL;DR: This paper investigates the properties of a metric between two distributions, the Earth Mover's Distance (EMD), for content-based image retrieval, and compares the retrieval performance of the EMD with that of other distances.

Abstract: We investigate the properties of a metric between two distributions, the Earth Mover's Distance (EMD), for content-based image retrieval. The EMD is based on the minimal cost that must be paid to transform one distribution into the other, in a precise sense, and was first proposed for certain vision problems by Peleg, Werman, and Rom. For image retrieval, we combine this idea with a representation scheme for distributions that is based on vector quantization. This combination leads to an image comparison framework that often accounts for perceptual similarity better than other previously proposed methods. The EMD is based on a solution to the transportation problem from linear optimization, for which efficient algorithms are available, and also allows naturally for partial matching. It is more robust than histogram matching techniques, in that it can operate on variable-length representations of the distributions that avoid quantization and other binning problems typical of histograms. When used to compare distributions with the same overall mass, the EMD is a true metric. In this paper we focus on applications to color and texture, and we compare the retrieval performance of the EMD with that of other distances.

4,593 citations
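
As an illustration of the "minimal cost to transform one distribution into the other" idea in the abstract above, here is a minimal sketch for a narrow special case: two histograms over the same ordered 1D bins, with unit ground distance between adjacent bins and both histograms normalized to total mass 1. Under those assumptions the EMD reduces to the sum of absolute differences between the two cumulative distributions. The function name `emd_1d` is hypothetical, and this is not the paper's general formulation, which solves a transportation problem over variable-length signatures and also allows partial matching.

```python
from itertools import accumulate


def emd_1d(p, q):
    """Earth Mover's Distance between two 1D histograms on the same bins.

    Assumptions (not from the paper): bins are ordered, adjacent bins are
    one unit apart, and each histogram is normalized to total mass 1.
    """
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    total_p, total_q = sum(p), sum(q)
    if total_p <= 0 or total_q <= 0:
        raise ValueError("histograms must have positive total mass")

    # Normalize so both distributions carry the same overall mass.
    p_norm = [x / total_p for x in p]
    q_norm = [x / total_q for x in q]

    # The mass that must cross the boundary after bin i is the difference
    # between the two cumulative sums at i; each crossing costs one unit.
    cdf_p = accumulate(p_norm)
    cdf_q = accumulate(q_norm)
    return sum(abs(a - b) for a, b in zip(cdf_p, cdf_q))


if __name__ == "__main__":
    # All mass moves two bins to the right.
    print(emd_1d([1, 0, 0], [0, 0, 1]))
    # Identical histograms need no work.
    print(emd_1d([2, 1, 1], [2, 1, 1]))
```

Unlike a bin-by-bin histogram difference, this value grows with how far the mass has to move, which is the property the abstract credits for the EMD's better agreement with perceptual similarity.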
