Om： One tool for many （Indian） languages

Open Access

Om： One tool for many （Indian） languages

- Vol. 6, Iss: 11, pp 1348-1353

TLDR

The development of a transliteration scheme Om which uses ASCII characters to represent Indian language alphabets, and thus can be read directly in English, by a large number of users who cannot read script in other Indian languages than their mother tongue.

Abstract:

Many different languages are spoken in India, each language being the mother tongue of tens of millions of people.While the languages and scripts are distinct from each other, the grammar and the alphabet are similar to a large extent. One common feature is that all the Indian languages are phonetic in nature. In this paper we describe the development of a translit eration scheme Om which exploits this phonetic nature of the alphabet. Om uses ASCⅡ characters to represent Indian language alphabets, and thus can be read directly in English, by a large number of users who cannot read script in other Indian languages than their mother tongue. It is also useful in computer applications where local language tools such as email and chat are not yet available. Another significant contribution presented in this paper is the development of a text editor for Indian languages that integrates the Om input for many Indian languages into a word processor such as Microsoft WinWord(R). The text editor is also developed on Java(R) platform that can run on Unix machines as well. We propose this transliteration scheme as a possible standard for Indian language transliteration and keyboard entry.

Citations

PDF

Open Access

More filters

Text processing for text-to-speech systems in Indian languages.

Anand Arokia Raj, +6 more

TL;DR: The efforts in addressing the issues of Font-to-Akshara mapping, pronunciation rules for Aksharas, text normalization in the context of building text- to-speech systems in Indian languages are discussed.

...read moreread less

Proceedings Article

A More Discerning and Adaptable Multilingual Transliteration Mechanism for Indian Languages.

Harshit Surana, +1 more

TL;DR: A more discerning method which applies different techniques based on the word origin in transliteration, which does not require training data on the target side, while it uses more sophisticated techniques on the source side.

...read moreread less

Journal ArticleDOI

A simple approach for building transliteration editors for Indian languages

Prahallad Lavanya, +2 more

- 01 Nov 2005 -

Journal of Zhejiang University Science

TL;DR: The usefulness of the Unicode based approach to build transliteration editors for Indian languages using Unicode and by taking advantage of its rendering engine is demonstrated and its advantages needing little maintenance and few entries in the mapping table, and ease of adding new features such as adding letters, to the transliterations scheme are reported.

...read moreread less

Journal ArticleDOI

Hindi and marathi to english machine transliteration using svm

P. H. Rathod, +2 more

TL;DR: This paper has proposed the name d entity transliteration for Hindi to English and Mar athi to English language pairs using Support Vector Machine (SVM), and uses phonetic of the source language and nas two features for translator.

...read moreread less

Journal ArticleDOI

Hindi to English Machine Transliteration of Named Entities using Conditional Random Fields

M. L. Dhore, +2 more

- 07 Mar 2012 -

International Journal of Computer Applic...

TL;DR: This paper focuses on Hindi to English machine transliteration of Indian named entities such as proper nouns, place names and organization names using conditional random fields (CRF).

...read moreread less

Om： One tool for many （Indian） languages

Citations

Text processing for text-to-speech systems in Indian languages.

A More Discerning and Adaptable Multilingual Transliteration Mechanism for Indian Languages.

A simple approach for building transliteration editors for Indian languages

Hindi and marathi to english machine transliteration using svm

Hindi to English Machine Transliteration of Named Entities using Conditional Random Fields

Related Papers (5)

Om: one tool for many (Indian) languages

Machine Transliteration for Indian Languages: A Literature Survey

System and method for writing Indian languages using English alphabet

Punjabi Machine Transliteration

Text Based Language Identification System for Indian Languages Following Devanagiri Script