Translingual information retrieval: learning from bilingual corpora

doi:10.1016/S0004-3702(98)00063-0

Open AccessJournal ArticleDOI

Translingual information retrieval: learning from bilingual corpora

Yiming Yang, +3 more

- 01 Aug 1998 -

Artificial Intelligence

- Vol. 103, Iss: 1, pp 323-345

Chats0

TLDR

The results show that using bilingual corpora for automated extraction of term equivalences in context outperforms dictionarybased methods and is comparable to that of other statistical corpus-based methods.

About:

This article is published in Artificial Intelligence.The article was published on 1998-08-01 and is currently open access. It has received 107 citations till now. The article focuses on the topics: Relevance (information retrieval) & Generalized vector space model.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Ontology learning and its application to automated terminology translation

Roberto Navigli, +2 more

- 01 Jan 2003 -

IEEE Intelligent Systems

TL;DR: The OntoLearn system is an infrastructure for automated ontology learning from domain text that uses natural language processing and machine learning techniques, and is part of a more general ontology engineering architecture.

...read moreread less

Proceedings ArticleDOI

Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web

Jian-Yun Nie, +3 more

TL;DR: It is shown that using a probabilistic model, it is able to obtain performances close to those using an MT system, and the possibility of automatically gather parallel texts from the Web in an attempt to construct a reasonable training corpus is investigated.

...read moreread less

Journal ArticleDOI

Cross-language plagiarism detection

Martin Potthast, +3 more

TL;DR: The results of the evaluation indicate that CL-CNG, despite its simple approach, is the best choice to rank and compare texts across languages if they are syntactically related.

...read moreread less

Automatic Cross-Language Retrieval Using Latent Semantic Indexing

Susan T. Dumais, +3 more

TL;DR: A method for fully automated cross-language document retrieval in which no query translation is required and this automatic method performs comparably to a retrieval method based on machine translation (MT-LSI).

...read moreread less

Proceedings ArticleDOI

An empirical study of required dimensionality for large-scale latent semantic indexing applications

Roger B. Bradford

TL;DR: The results suggest that there is something of an 'island of stability' in the k = 300 to 500 range, and indicate thatthere is relatively little room to employ k values outside of this range without incurring significant distortions in at least some term-term correlations.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Indexing by Latent Semantic Analysis

Scott Deerwester, +4 more

- 01 Sep 1990 -

Journal of the Association for Informati...

TL;DR: A new method for automatic indexing and retrieval to take advantage of implicit higher-order structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries.

...read moreread less

Book

Automatic text processing: the transformation, analysis, and retrieval of information by computer

Gerard Salton

Journal ArticleDOI

Improving Retrieval Performance by Relevance Feedback

Gerard Salton, +1 more

- 01 Dec 1997 -

Journal of the Association for Informati...

TL;DR: Relevance feedback is an automatic process, introduced over 20 years ago, designed to produce query formulations following an initial retrieval operation to demonstrate the effectiveness of the various methods.

...read moreread less

Proceedings ArticleDOI

OHSUMED: an interactive retrieval evaluation and new large test collection for research

William R. Hersh, +3 more

TL;DR: A series of information retrieval experiments was carried out with a computer installed in a medical practice setting for relatively inexperienced physician end-users using a commercial MEDLINE product based on the vector space model, finding that these physicians searched just as effectively as more experienced searchers using Boolean searching.

...read moreread less

Proceedings Article

Automatic Query expansion using SMART : TREC 3

Chris Buckley, +3 more

TL;DR: This work continues the work in TREC 3, performing runs in the routing, ad-hoc, and foreign language environments, with a major focus on massive query expansion, adding from 300 to 530 terms to each query.

...read moreread less

Collapse

Translingual information retrieval: learning from bilingual corpora

Citations

Ontology learning and its application to automated terminology translation

Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web

Cross-language plagiarism detection

Automatic Cross-Language Retrieval Using Latent Semantic Indexing

An empirical study of required dimensionality for large-scale latent semantic indexing applications

References

Indexing by Latent Semantic Analysis

Automatic text processing: the transformation, analysis, and retrieval of information by computer

Improving Retrieval Performance by Relevance Feedback

OHSUMED: an interactive retrieval evaluation and new large test collection for research

Automatic Query expansion using SMART : TREC 3

Related Papers (5)

Indexing by Latent Semantic Analysis

Phrasal translation and query expansion techniques for cross-language information retrieval

The mathematics of statistical machine translation: parameter estimation

Cross-Language Information Retrieval

Modern Information Retrieval