scispace - formally typeset
Proceedings ArticleDOI: 10.1109/IEMCON.2016.7746076

Development of a character recognition software to solve a Sudoku puzzle

01 Oct 2016-pp 1-5
Abstract: An Image is a visual representation of any object, place, person etc. In the field of Computer Science a Digital Image is a numeric representation of a two-dimensional image. Often images may contain texts, which are a sequence of human-readable characters. Our general conundrum is extraction of texts from an image for processing and editing. In an image texts are defined by set of pixels just like any other object and thus cannot be processed or edited. In this work we have strived to develop a system which can extract texts from a digital image of a Sudoku puzzle, solve the puzzle, and then provide a solution. Our approach is specific but its application are varied. more

Topics: Digital image (56%)

Proceedings ArticleDOI: 10.1109/I2CT45611.2019.9033860
29 Mar 2019-
Abstract: This paper attempts to explore the solving of Sudoku puzzles (as commonly found in newspapers and mobile games) using image processing, machine learning algorithms for OCR, and an efficient solving algorithm to compute the correct answer. We use various image processing techniques, such as Image Thresholding, Erosion and Dilation, etc. to convert a high-resolution and colored camera-generated image of the physical Sudoku puzzle into a format that can be digitally operated upon, in order to isolate the contents of the 9 × 9 puzzle grid correctly. We then use our custom Optical Character Reader (OCR), which is based on the k-NN machine learning algorithm, in order to correctly identify the digits contained in the grid and place them in the respective positions in a digital copy of the Sudoku grid, which is then processed by an efficient Sudoku solving algorithm, to compute the correct solution. more

Topics: Image processing (54%), Thresholding (50%)

Open accessProceedings ArticleDOI: 10.1109/ICDAR.2007.4376991
Ray Smith1Institutions (1)
23 Sep 2007-
Abstract: The Tesseract OCR engine, as was the HP Research Prototype in the UNLV Fourth Annual Test of OCR Accuracy, is described in a comprehensive overview. Emphasis is placed on aspects that are novel or at least unusual in an OCR engine, including in particular the line finding, features/classification methods, and the adaptive classifier. more

1,178 Citations

Open accessProceedings ArticleDOI: 10.1109/ICDAR.2009.257
Ray Smith1Institutions (1)
26 Jul 2009-
Abstract: A new hybrid page layout analysis algorithm is proposed, which uses bottom-up methods to form an initial data-type hypothesis and locate the tab-stops that were used when the page was formatted. The detected tab-stops, are used to deduce the column layout of the page. The column layout is then applied in a top-down manner to impose structure and reading-order on the detected regions. The complete C++ source code implementation is available as part of the Tesseract open source OCR engine at more

Topics: Column (typography) (61%), Tesseract (51%), Source code (51%)

96 Citations

Open accessDissertation
01 Jan 1987-
Abstract: Almost all the current commercial OCR machines employ matrix matching, resulting in high speed and accuracy, but a severely restrictive range of recognized fonts. Published algorithms conversely, concentrate on feature extraction for font independence, yet they have previously been too slow for commercial use. Current algorithms also fail to distinguish between text and non-text images. This thesis presents a new approach to the automatic extraction of text from multimedia printed documents. An edge detection algorithm, which is capable of extracting the outlines of text from a grey level image, is used to obtain a high level of discrimination between text and non-text. An additional benefit is that text of any colour can be read from almost any background, provided that the contrast is reasonable. The outlines are approximated by polygons using a fast two-stage algorithm. A feature extraction approach to font independent character recognition is described, which uses these outline polygons. It is shown that highly accurate and fast recognition can be achieved using a remarkably small number of carefully chosen features. The results show that after training on only seven quite similar fonts, the recognition algorithm provides greater than 95% accuracy on fonts different to the training set. A more complex edge extraction algorithm is also described. This is capable of extracting text and line graphics from an arbitrary page. Although not essential for character recognition, this algorithm is useful for the interpretation of engineering drawings. As a further contribution to this problem, a thinning algorithm is defined, which is non-iterative and uses the polygonal approximated outlines from the edge extractor. more

35 Citations

No. of citations received by the Paper in previous years
Network Information
Related Papers (5)
05 Nov 2008

I. Marinovic, Igor Fürstner

01 Jan 2015

Astha Gautam, Anjana Kumari +1 more

03 Nov 2014

Sho Inaba, Asako Kanezaki +1 more

01 Jan 1995

Jeffrey Johnson

01 Jan 2008

Nadav Ben-Haim

View PDF