Topic
Optical character recognition
About: Optical character recognition is a research topic. Over the lifetime, 7342 publications have been published within this topic receiving 158193 citations. The topic is also known as: OCR & optical character reader.
Papers published on a yearly basis
Papers
More filters
••
30 Aug 1992TL;DR: Three techniques are described for noise reduction from binary document pages to improve page appearance and subsequent optical character recognition and compression, and for subsampling the text image to fit on the computer screen white maintaining readability.
Abstract: Describes some of the document processing techniques used in the RightPages electronic library system. Since the system deals with scanned images of document pages, these techniques are critical to the use and appearance of the system. The author describes three techniques: (1) for noise reduction from binary document pages to improve page appearance and subsequent optical character recognition and compression; (2) for subsampling the text image to fit on the computer screen white maintaining readability; and (3) a document layout analysis technique to determine text blocks. >
95 citations
•
94 citations
••
TL;DR: A method is introduced to combine and jointly optimize recognition and image normalization in optical character recognition algorithms based on pseudo two-dimensional (2D) hidden Markov models (HMMs) that provides a maximum likelihood estimate of the transformation parameters that can be used by higher level modules in an intelligent document recognition system as an aid in the recognition process.
94 citations
••
TL;DR: A simple method is presented for automatically identifying regions in envelope images which are candidates for being the destination address and the success of the texture-based segmentation algorithm for identifying address blocks is demonstrated.
93 citations
•
TL;DR: In this paper, a systematic literature review (SLR) is presented to summarize research that has been conducted on character recognition of handwritten documents and to provide research directions, which serve the purpose of presenting state of the art results and techniques on OCR.
Abstract: Given the ubiquity of handwritten documents in human transactions, Optical Character Recognition (OCR) of documents have invaluable practical worth. Optical character recognition is a science that enables to translate various types of documents or images into analyzable, editable and searchable data. During last decade, researchers have used artificial intelligence / machine learning tools to automatically analyze handwritten and printed documents in order to convert them into electronic format. The objective of this review paper is to summarize research that has been conducted on character recognition of handwritten documents and to provide research directions. In this Systematic Literature Review (SLR) we collected, synthesized and analyzed research articles on the topic of handwritten OCR (and closely related topics) which were published between year 2000 to 2018. We followed widely used electronic databases by following pre-defined review protocol. Articles were searched using keywords, forward reference searching and backward reference searching in order to search all the articles related to the topic. After carefully following study selection process 142 articles were selected for this SLR. This review article serves the purpose of presenting state of the art results and techniques on OCR and also provide research directions by highlighting research gaps.
93 citations