scispace - formally typeset
Proceedings ArticleDOI

Anusaaraka: An expert system based machine translation system

Reads0
Chats0
TLDR
Anusaaraka is a Language Accessor cum Machine Translation system based on the fundamental premise of sharing the load producing good enough results according to the needs of the reader, and is so designed that a user can contribute to it and participate in improving its quality.
Abstract
Most research in Machine translation is about having the computers completely bear the load of translating one human language into another. This paper looks at the machine translation problem afresh and observes that there is a need to share the load between man and machine, distinguish reliable knowledge from the heuristics, provide a spectrum of outputs to serve different strata of people, and finally make use of existing resources instead of reinventing the wheel. This paper describes a unique approach to develop machine translation system based on the insights of information dynamics from Paninian Grammar Formalism. Anusaaraka is a Language Accessor cum Machine Translation system based on the fundamental premise of sharing the load producing good enough results according to the needs of the reader. The system promises to give faithful representation of the translated text, no loss of information while translating and graceful degradation (robustness) in case of failure. The layered output provides an access to all the stages of translation making the whole process transparent. Thus, Anusaaraka differs from the Machine Translation systems in two respects: (1) its commitment to faithfulness and thereby providing a layer of 100% faithful output so that a user with some training can “access the source text” faithfully. (2) The system is so designed that a user can contribute to it and participate in improving its quality. Further Anusaaraka provides an eclectic combination of the Apertium architecture with the forward chaining expert system, allowing use of both the deep parser and shallow parser outputs to analyze the SL text. Existing language resources (parsers, taggers, chunkers) available under GPL are used instead of rewriting it again. Language data and linguistic rules are independent from the core programme, making it easy for linguists to modify and experiment with different language phenomena to improve the system. Users can become contributors by contributing new word sense disambiguation (WSD) rules of the ambiguous words through a web-interface available over internet. The system uses forward chaining of expert system to infer new language facts from the existing language data. It helps to solve the complex behavior of language translation by applying specific knowledge rather than specific technique creating a vast language knowledge base in electronic form. Or in other words, the expert system facilitates the transformation of subject matter expert's (SME) knowledge available with humans into a computer processable knowledge base.

read more

Citations
More filters
Journal ArticleDOI

Neural Machine Translation System for English to Indian Language Translation Using MTIL Parallel Corpus

TL;DR: A neural machine translation system for four language pairs, designed with long short-term memory (LSTM) networks and bi-directional recurrent neural networks (Bi-RNN) and able to perceive long-term contexts in the sentences.
Journal ArticleDOI

OpenLogos machine translation: philosophy, model, resources and customization

TL;DR: The paper illustrates how the intelligence inherent in SAL contributes to translation quality, presenting examples of OpenLogos output of a kind that non-linguistic systems would likely have difficulty emulating.
Proceedings Article

Hindi to English Machine Translation: Using Effective Selection in Multi-Model SMT

TL;DR: This paper describes a Hindi to English statistical machine translation system that improves over the baseline using multiple translation models and enhanced over both these baselines using a regression model.
Proceedings ArticleDOI

Developing a system for machine translation from Hindi language to English language

TL;DR: A machine translation system based on ruled-based theory is proposed, which takes Hindi as source language and English as target language.
References
More filters
Book

Natural language processing : a Paninian perspective

TL;DR: This book is presented with a paninian perspective and introduces three western grammar frameworks using examples from English.
Posted Content

Anusaaraka: Overcoming the Language Barrier in India

TL;DR: The anusaaraka system makes text in one Indian language accessible in another Indian language by presenting an image of the source text in a language close to the target language, and can function as a human assisted translation system.

Design and Architecture of 'Anusaaraka'- An Approach to Machine Translation

Amba Kulkarni
TL;DR: There is a need to share the load between man and machine, distinguish ‘reliable’ knowledge from the ‘heuristics’, provide a spectrum of outputs to serve different strata of people, and finally make use of existing resources instead of reinventing the wheel.
Posted Content

LERIL : Collaborative Effort for Creating Lexical Resources

TL;DR: The paper reports on efforts taken to create lexical resources pertaining to Indian languages, using the collaborative model, to transfer lexicon and grammar from English to several Indian languages.