Proceedings Article

Plan2Text: A framework for describing building floor plan images from first person perspective

TLDR
It is demonstrated that the proposed end-to-end framework for first person vision based textual description synthesis of building floor plans gives state of the art performance on challenging, real-world floor plan images.
Abstract
We focus on synthesizing a textual description of a given building floor plan image from a first-person perspective. Tasks such as symbol spotting, wall and decor segmentation, and semantic and perceptual segmentation have been performed on floor plans in the past. Here, for the first time, we propose an end-to-end framework for first-person-vision-based textual description synthesis of building floor plans. We demonstrate, qualitatively and quantitatively, that the proposed framework gives state-of-the-art performance on challenging, real-world floor plan images. Potential applications of this work include floor plan understanding, stability analysis of buildings, and retrieval.


Citations
Proceedings Article

BRIDGE: Building Plan Repository for Image Description Generation, and Evaluation

TL;DR: An extensive experimental study is presented for tasks like furniture localization in a floor plan, caption and description generation, on the proposed dataset showing the utility of BRIDGE.
Journal Article

SUGAMAN: describing floor plans for visually impaired by annotation learning and proximity-based grammar

TL;DR: In this paper, the authors propose a framework called Sugaman (Supervised and Unified framework using Grammar and Annotation Model for Access and Navigation) for describing a floor plan and giving direction for obstacle-free movement within a building.
Proceedings Article

ASYSST: A Framework for Synopsis Synthesis Empowering Visually Impaired

TL;DR: This work proposes an end-to-end framework (ASYSST) for textual description synthesis from digitized building floor plans and introduces a novel Bag of Decor feature to learn 5 classes of a room from 1355 samples under a supervised learning paradigm.
Journal Article

Knowledge-driven description synthesis for floor plan interpretation

TL;DR: In this paper, the authors proposed two models, description synthesis from image cue (DSIC) and transformer-based description generation (TBDG), for text generation from floor plan images.
Journal Article

Mask-Aware Semi-Supervised Object Detection in Floor Plans

TL;DR: A Mask R-CNN-based semi-supervised approach is presented that provides pixel-to-pixel alignment to generate individual annotation masks for each class and mines inter-class similarity in order to detect objects more accurately with less labeled data.
References
Journal Article

Segmentation and Recognition of Dimensioning Text from Engineering Drawings

TL;DR: Recognition of dimensioning text in engineering drawings is an essential part of the drawing understanding process, as this text provides the exact dimensions and tolerances of the object described in the drawing.
Proceedings Article

A unified framework for semantic matching of architectural floorplans

TL;DR: A framework for the matching and retrieval of similar architectural floorplans under the query by example paradigm is proposed and a novel graph spectral embedding feature is proposed to uniquely represent the layout of the architectural floorplan.
Proceedings Article

Text Extraction in Document Images: Highlight on Using Corner Points

TL;DR: A very simple technique based on FAST keypoints shows that accurate text extraction can be achieved without a complex approach, and that it can easily be improved to be more precise, robust, and useful for more complex layout analysis.
Proceedings Article

Automatic image segmentation of old topographic maps and floor plans

TL;DR: A new algorithm for image segmentation of ancient maps and floor plans is introduced that aims to remove most non-textual elements, leaving just the text, which allows further automatic identification of the map or plan through automatic character recognition techniques.