scispace - formally typeset
Search or ask a question
Proceedings ArticleDOI

Development of a character recognition software to solve a Sudoku puzzle

01 Oct 2016-pp 1-5
TL;DR: This work has strived to develop a system which can extract texts from a digital image of a Sudoku puzzle, solve the puzzle, and then provide a solution.
Abstract: An Image is a visual representation of any object, place, person etc. In the field of Computer Science a Digital Image is a numeric representation of a two-dimensional image. Often images may contain texts, which are a sequence of human-readable characters. Our general conundrum is extraction of texts from an image for processing and editing. In an image texts are defined by set of pixels just like any other object and thus cannot be processed or edited. In this work we have strived to develop a system which can extract texts from a digital image of a Sudoku puzzle, solve the puzzle, and then provide a solution. Our approach is specific but its application are varied.
Citations
More filters
Proceedings ArticleDOI
29 Mar 2019
TL;DR: This paper attempts to explore the solving of Sudoku puzzles (as commonly found in newspapers and mobile games) using image processing, machine learning algorithms for OCR, and an efficient solving algorithm to compute the correct answer.
Abstract: This paper attempts to explore the solving of Sudoku puzzles (as commonly found in newspapers and mobile games) using image processing, machine learning algorithms for OCR, and an efficient solving algorithm to compute the correct answer. We use various image processing techniques, such as Image Thresholding, Erosion and Dilation, etc. to convert a high-resolution and colored camera-generated image of the physical Sudoku puzzle into a format that can be digitally operated upon, in order to isolate the contents of the 9 × 9 puzzle grid correctly. We then use our custom Optical Character Reader (OCR), which is based on the k-NN machine learning algorithm, in order to correctly identify the digits contained in the grid and place them in the respective positions in a digital copy of the Sudoku grid, which is then processed by an efficient Sudoku solving algorithm, to compute the correct solution.

Cites methods from "Development of a character recognit..."

  • ...[5] is a very good approach to an automated Sudoku solver....

    [...]

References
More filters
Proceedings ArticleDOI
Ray Smith1
23 Sep 2007
TL;DR: The Tesseract OCR engine, as was the HP Research Prototype in the UNLV Fourth Annual Test of OCR Accuracy, is described in a comprehensive overview.
Abstract: The Tesseract OCR engine, as was the HP Research Prototype in the UNLV Fourth Annual Test of OCR Accuracy, is described in a comprehensive overview. Emphasis is placed on aspects that are novel or at least unusual in an OCR engine, including in particular the line finding, features/classification methods, and the adaptive classifier.

1,530 citations

Proceedings ArticleDOI
Ray Smith1
26 Jul 2009
TL;DR: A new hybrid page layout analysis algorithm is proposed, which uses bottom-up methods to form an initial data-type hypothesis and locate the tab-stops that were used when the page was formatted.
Abstract: A new hybrid page layout analysis algorithm is proposed, which uses bottom-up methods to form an initial data-type hypothesis and locate the tab-stops that were used when the page was formatted. The detected tab-stops, are used to deduce the column layout of the page. The column layout is then applied in a top-down manner to impose structure and reading-order on the detected regions. The complete C++ source code implementation is available as part of the Tesseract open source OCR engine at http://code.google.com/p/tesseract-ocr.

104 citations

Dissertation
01 Jan 1987
TL;DR: It is shown that highly accurate and fast recognition can be achieved using a remarkably small number of carefully chosen features, and that after training on only seven quite similar fonts, the recognition algorithm provides greater than 95% accuracy on fonts different to the training set.
Abstract: Almost all the current commercial OCR machines employ matrix matching, resulting in high speed and accuracy, but a severely restrictive range of recognized fonts. Published algorithms conversely, concentrate on feature extraction for font independence, yet they have previously been too slow for commercial use. Current algorithms also fail to distinguish between text and non-text images. This thesis presents a new approach to the automatic extraction of text from multimedia printed documents. An edge detection algorithm, which is capable of extracting the outlines of text from a grey level image, is used to obtain a high level of discrimination between text and non-text. An additional benefit is that text of any colour can be read from almost any background, provided that the contrast is reasonable. The outlines are approximated by polygons using a fast two-stage algorithm. A feature extraction approach to font independent character recognition is described, which uses these outline polygons. It is shown that highly accurate and fast recognition can be achieved using a remarkably small number of carefully chosen features. The results show that after training on only seven quite similar fonts, the recognition algorithm provides greater than 95% accuracy on fonts different to the training set. A more complex edge extraction algorithm is also described. This is capable of extracting text and line graphics from an arbitrary page. Although not essential for character recognition, this algorithm is useful for the interpretation of engineering drawings. As a further contribution to this problem, a thinning algorithm is defined, which is non-iterative and uses the polygonal approximated outlines from the edge extractor.

35 citations