scispace - formally typeset
Open AccessPosted Content

Deep Learning Representation using Autoencoder for 3D Shape Retrieval

TLDR
This work projects 3D shapes into 2D space and uses autoencoder for feature learning on the 2D images and shows the proposed deep learning feature is complementary to conventional local image descriptors, which can obtain the state-of-the-art performance on 3D shape retrieval benchmarks.
Abstract
We study the problem of how to build a deep learning representation for 3D shape. Deep learning has shown to be very effective in variety of visual applications, such as image classification and object detection. However, it has not been successfully applied to 3D shape recognition. This is because 3D shape has complex structure in 3D space and there are limited number of 3D shapes for feature learning. To address these problems, we project 3D shapes into 2D space and use autoencoder for feature learning on the 2D images. High accuracy 3D shape retrieval performance is obtained by aggregating the features learned on 2D images. In addition, we show the proposed deep learning feature is complementary to conventional local image descriptors. By combing the global deep learning representation and the local descriptor representation, our method can obtain the state-of-the-art performance on 3D shape retrieval benchmarks.

read more

Citations
More filters
Journal ArticleDOI

Projective Feature Learning for 3D Shapes with Multi-View Depth Images

TL;DR: The multi‐view depth image representation is adopted and the Multi‐View Deep Extreme Learning Machine (MVD‐ELM) is proposed to achieve fast and quality projective feature learning for 3D shapes to lead to a more accurate 3D feature learning.
Journal ArticleDOI

Stacked auto-encoder based tagging with deep features for content-based medical image retrieval

TL;DR: A four-step and effective hash code generation technique is presented to reduce the semantic gap between low-level features and high-level semantics for unbalanced medical image datasets and is more successful than other state-of-the-art methods.
Journal ArticleDOI

Multi-Task Network Representation Learning.

TL;DR: A novel multi-task network representation learning (MTNRL) framework is proposed, which is end-to-end and more effective for underlying tasks, and suitable for all network embedding methods.
Proceedings ArticleDOI

Object recognition with and without objects

TL;DR: Zhang et al. as discussed by the authors trained deep neural networks on the foreground (object) and background (context) regions of images respectively, and showed that useful visual hints can be explicitly learned separately and then combined to achieve higher performance.
Book ChapterDOI

Multi-scale Coarse-to-Fine Segmentation for Screening Pancreatic Ductal Adenocarcinoma

TL;DR: An intuitive approach of detecting pancreatic ductal adenocarcinoma (PDAC), the most common type of pancreatic cancer, by checking abdominal CT scans is proposed, and multi-scale segmentation-for-classification, which classifies volumes by checking if at least a sufficient number of voxels is segmented as tumors, is named.
References
More filters
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.
Journal ArticleDOI

Neural networks and physical systems with emergent collective computational abilities

TL;DR: A model of a system having a large number of simple equivalent components, based on aspects of neurobiology but readily adapted to integrated circuits, produces a content-addressable memory which correctly yields an entire memory from any subpart of sufficient size.
Journal ArticleDOI

A fast learning algorithm for deep belief nets

TL;DR: A fast, greedy algorithm is derived that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory.
Related Papers (5)