Hybrid framework for information extraction for geographical terms in Hindi language texts

doi:10.1109/NLPKE.2005.1598803

Proceedings ArticleDOI

Hybrid framework for information extraction for geographical terms in Hindi language texts

- pp 577-581

TLDR

A hybrid information extraction (IE) framework based on geographical term detection approach has been developed to extract geographical information from an unrestricted Hindi text and the relationship between geographical entities extracted with the adjacent text is shown graphically so that information about these entities can be related.

Abstract:

A hybrid information extraction (IE) framework based on geographical term detection approach has been developed to extract geographical information from an unrestricted Hindi text The relationship between geographical entities extracted with the adjacent text is shown graphically so that information about these entities can be related The system, a combination of statistically and linguistically motivated techniques, identifies single geographical names and multiple geographical names as well The method is applied on Hindi language text, but the approach can be adapted for other languages also The paper presents some experiments illustrating the accuracy of the method The system being developed is in a prototype stage and will be extended to include relation mark-up as well

Hybrid framework for information extraction for geographical terms in Hindi language texts

Citations

Anaphora Resolution in Hindi: Issues and Challenges

A Practical Approach to Extracting Names of Geographical Entities and Their Relations from the Web

References

Information extraction

Automatic recognition of multi-word terms:. the C-value/NC-value method

Information Extraction

Termight: Identifying and Translating Technical Terminology

The interaction of knowledge sources in word sense disambiguation

Related Papers (5)

Using electronic texts for an annotated corpus building

Extracting Semantic Knowledge from Unstructured Text Using Embedded Controlled Language

Information extraction from mathematical texts by means of natural language processing techniques

Natural Language Processing and Information Systems: 15th International Conference on Applications of Natural Language to Information Systems, Cardiff, UK, June 2010, proceedings

Tokenization and proper noun recognition for information retrieval