scispace - formally typeset
Open Access

Om: One tool for many (Indian) languages

Ganapathiraju, +4 more
- Vol. 6, Iss: 11, pp 1348-1353
TLDR
The development of a transliteration scheme Om which uses ASCII characters to represent Indian language alphabets, and thus can be read directly in English, by a large number of users who cannot read script in other Indian languages than their mother tongue.
Abstract
Many different languages are spoken in India, each language being the mother tongue of tens of millions of people.While the languages and scripts are distinct from each other, the grammar and the alphabet are similar to a large extent. One common feature is that all the Indian languages are phonetic in nature. In this paper we describe the development of a translit eration scheme Om which exploits this phonetic nature of the alphabet. Om uses ASCⅡ characters to represent Indian language alphabets, and thus can be read directly in English, by a large number of users who cannot read script in other Indian languages than their mother tongue. It is also useful in computer applications where local language tools such as email and chat are not yet available. Another significant contribution presented in this paper is the development of a text editor for Indian languages that integrates the Om input for many Indian languages into a word processor such as Microsoft WinWord(R). The text editor is also developed on Java(R) platform that can run on Unix machines as well. We propose this transliteration scheme as a possible standard for Indian language transliteration and keyboard entry.

read more

Citations
More filters

Text processing for text-to-speech systems in Indian languages.

TL;DR: The efforts in addressing the issues of Font-to-Akshara mapping, pronunciation rules for Aksharas, text normalization in the context of building text- to-speech systems in Indian languages are discussed.
Proceedings Article

A More Discerning and Adaptable Multilingual Transliteration Mechanism for Indian Languages.

TL;DR: A more discerning method which applies different techniques based on the word origin in transliteration, which does not require training data on the target side, while it uses more sophisticated techniques on the source side.
Journal ArticleDOI

A simple approach for building transliteration editors for Indian languages

TL;DR: The usefulness of the Unicode based approach to build transliteration editors for Indian languages using Unicode and by taking advantage of its rendering engine is demonstrated and its advantages needing little maintenance and few entries in the mapping table, and ease of adding new features such as adding letters, to the transliterations scheme are reported.
Journal ArticleDOI

Hindi and marathi to english machine transliteration using svm

TL;DR: This paper has proposed the name d entity transliteration for Hindi to English and Mar athi to English language pairs using Support Vector Machine (SVM), and uses phonetic of the source language and nas two features for translator.
Journal ArticleDOI

Hindi to English Machine Transliteration of Named Entities using Conditional Random Fields

TL;DR: This paper focuses on Hindi to English machine transliteration of Indian named entities such as proper nouns, place names and organization names using conditional random fields (CRF).