Open Access · Journal Article · DOI

SUGAMAN: describing floor plans for visually impaired by annotation learning and proximity-based grammar

TLDR
In this paper, the authors propose SUGAMAN (Supervised and Unified framework using Grammar and Annotation Model for Access and Navigation), a framework that describes a floor plan and gives directions for obstacle-free movement within a building.
Abstract
In this study, the authors propose the framework SUGAMAN (Supervised and Unified framework using Grammar and Annotation Model for Access and Navigation). SUGAMAN is a Hindi word meaning ‘easy passage from one place to another’. SUGAMAN synthesises a textual description from a given floor plan image, usable by visually impaired people to navigate by understanding the arrangement of rooms and furniture. It is the first framework for describing a floor plan and giving directions for obstacle-free movement within a building. The model learns five room categories from 1355 room image samples under a supervised learning paradigm. These learned annotations are fed into a description synthesis framework to yield a holistic description of a floor plan image. The authors demonstrate the performance of various supervised classifiers on room learning and provide a comparative analysis of system-generated and human-written descriptions. The contributions of this study include a novel framework for description generation from document images containing graphics, a new feature representation for floor plans, text annotations for a publicly available data set, and an algorithm for door-to-door obstacle-avoidance navigation. This work can be applied to areas such as understanding the floor plans and designs of historical monuments, and floor plan retrieval.
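The pipeline sketched in the abstract (supervised room-category learning followed by description synthesis from the learned annotations) can be illustrated with a minimal toy example. This is not the paper's implementation: the feature vectors, the nearest-centroid classifier, the class names, and the description template are all illustrative assumptions standing in for the paper's actual features, classifiers, and grammar.

```python
# A toy sketch of the two-stage idea: (1) supervised classification of rooms
# into categories, (2) templated text synthesis from the resulting annotations.
# All names and data here are hypothetical placeholders, not the paper's method.
from collections import defaultdict
import math

ROOM_CLASSES = ["bedroom", "bathroom", "kitchen", "hall", "entry"]  # five categories, as in the paper

def centroid(vectors):
    """Mean of a list of equal-length feature vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def train(samples):
    """samples: list of (feature_vector, label) pairs. Returns per-class centroids."""
    by_label = defaultdict(list)
    for vec, label in samples:
        by_label[label].append(vec)
    return {label: centroid(vecs) for label, vecs in by_label.items()}

def classify(model, vec):
    """Assign the label whose class centroid is nearest in Euclidean distance."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return min(model, key=lambda label: dist(model[label], vec))

def describe(annotations):
    """Turn (room_label, furniture_list) annotations into a simple walkthrough."""
    parts = [f"a {label} containing {', '.join(items) if items else 'no furniture'}"
             for label, items in annotations]
    return "The floor plan has " + "; ".join(parts) + "."

# Toy training data: 2-D vectors standing in for real floor-plan descriptors.
training = [([0.9, 0.1], "bedroom"), ([0.8, 0.2], "bedroom"),
            ([0.1, 0.9], "kitchen"), ([0.2, 0.8], "kitchen")]
model = train(training)
label = classify(model, [0.85, 0.15])  # nearest centroid is the bedroom class
print(describe([(label, ["bed", "wardrobe"]), ("kitchen", ["stove"])]))
```

A nearest-centroid rule is used here purely for self-containment; the paper evaluates several standard supervised classifiers on its learned floor-plan features.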


Citations
Journal ArticleDOI

Efficient Multi-Object Detection and Smart Navigation Using Artificial Intelligence for Visually Impaired People

TL;DR: An artificial-intelligence-based, fully automatic assistive technology recognizes different objects and provides auditory feedback to the user in real time, giving visually impaired people a better understanding of their surroundings.
Proceedings ArticleDOI

Travelling more independently: A Requirements Analysis for Accessible Journeys to Unknown Buildings for People with Visual Impairments

TL;DR: A survey of 106 people with visual impairments is presented, examining the strategies they use to prepare for journeys to unknown buildings, how they orient themselves in unfamiliar buildings, and what materials they use.
Journal ArticleDOI

Traveling More Independently: A Study on the Diverse Needs and Challenges of People with Visual or Mobility Impairments in Unfamiliar Indoor Environments

TL;DR: In this article, the authors present a survey of 125 participants with blindness, low vision, or mobility impairments, investigating how mobile they are, what strategies they use to prepare for a journey to an unknown building, how they orient themselves there, and what materials they use.
Journal ArticleDOI

Knowledge-driven description synthesis for floor plan interpretation

TL;DR: In this paper, the authors proposed two models, description synthesis from image cue (DSIC) and transformer-based description generation (TBDG), for text generation from floor plan images.
Book ChapterDOI

Semantic Segmentation and Topological Mapping of Floor Plans

TL;DR: In this article, a topological mapping method based on deep-learning semantic segmentation of floor plans is proposed for assistive blind navigation in unknown indoor environments; disturbances such as image rotation, color transformation, and Gaussian noise are introduced during training to enhance robustness.
References
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI

Rapid object detection using a boosted cascade of simple features

TL;DR: A machine learning approach for visual object detection which is capable of processing images extremely rapidly and achieving high detection rates and the introduction of a new image representation called the "integral image" which allows the features used by the detector to be computed very quickly.
Journal ArticleDOI

Multiresolution gray-scale and rotation invariant texture classification with local binary patterns

TL;DR: A generalized gray-scale and rotation-invariant operator is presented that allows detection of "uniform" patterns for any quantization of the angular space and any spatial resolution, together with a method for combining multiple operators for multiresolution analysis.
Book ChapterDOI

SURF: speeded up robust features

TL;DR: A novel scale- and rotation-invariant interest point detector and descriptor, coined SURF (Speeded Up Robust Features), which approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster.
Book ChapterDOI

Text Categorization with Support Vector Machines: Learning with Many Relevant Features

TL;DR: This paper explores the use of Support Vector Machines for learning text classifiers from examples and analyzes the particular properties of learning with text data and identifies why SVMs are appropriate for this task.