M
Martin Rajman
Researcher at École Polytechnique Fédérale de Lausanne
Publications - 144
Citations - 2023
Martin Rajman is an academic researcher from École Polytechnique Fédérale de Lausanne. The author has contributed to research in topics: Parsing & Ranking (information retrieval). The author has an hindex of 23, co-authored 144 publications receiving 2000 citations. Previous affiliations of Martin Rajman include University of Geneva & École Polytechnique.
Papers
More filters
Book ChapterDOI
Text Mining at the Term Level
Ronen Feldman,Moshe Fresko,Yakkov Kinar,Yehuda Lindell,Orly Liphstat,Martin Rajman,Yonatan Schler,Oren Zamir +7 more
TL;DR: This paper describes the Term Extraction module of the Document Explorer system, and provides experimental evaluation performed on a set of 52,000 documents published by Reuters in the years 1995–1996.
A generalized CYK algorithm for parsing stochastic CFG
TL;DR: A bottom-up parsing algorithm for stochastic context-free grammars that is able to deal with multiple interpretations of sentences containing compound words, and to extract N-most probable parses in O(n 3 ) and compute at the same time all possible parses of any portion of the input sequence with their probabilities.
Book ChapterDOI
Text Mining: Natural Language techniques and Text Mining applications
Martin Rajman,Romaric Besançon +1 more
TL;DR: This paper presents two examples of Text Mining tasks, association extraction and prototypical document extraction, along with several related NLP techniques.
Proceedings Article
Knowledge Management: A Text Mining Approach
TL;DR: Document Explorer is described, a tool that implements text mining at the term level, in which knowledge discovery takes place on a more focused collection of words and phrases that are extracted from and label each document.
Book ChapterDOI
Text Mining, knowledge extraction from unstructured textual data
Martin Rajman,Romaric Besançon +1 more
TL;DR: This paper presented two examples of information that can be automatically extracted from text collections: probabilistic associations of key-words and prototypical document instances, and the Natural Language Processing (NLP) tools necessary for such extractions are also presented.