Classifying Semantic Relations in Bioscience Texts

doi:10.3115/1218955.1219010

Open AccessProceedings ArticleDOI

Classifying Semantic Relations in Bioscience Texts

- pp 430-437

TLDR

This work examines the problem of distinguishing among seven relation types that can occur between the entities "treatment" and "disease" in bioscience text, and finds that the latter help achieve high classification accuracy.

Abstract:

A crucial step toward the goal of automatic extraction of propositional information from natural language text is the identification of semantic relations between constituents in sentences. We examine the problem of distinguishing among seven relation types that can occur between the entities "treatment" and "disease" in bioscience text, and the problem of identifying such entities. We compare five generative graphical models and a neural network, using lexical, syntactic, and semantic features, finding that the latter help achieve high classification accuracy.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Opinion observer: analyzing and comparing opinions on the Web

Bing Liu, +2 more

TL;DR: A novel framework for analyzing and comparing consumer opinions of competing products is proposed, and a new technique based on language pattern mining is proposed to extract product features from Pros and Cons in a particular type of reviews.

...read moreread less

Proceedings Article

Open information extraction from the web

Michele Banko, +4 more

TL;DR: Open Information Extraction (OIE) as mentioned in this paper is a new extraction paradigm where the system makes a single data-driven pass over its corpus and extracts a large set of relational tuples without requiring any human input.

...read moreread less

Journal ArticleDOI

Measures of semantic similarity and relatedness in the biomedical domain

Ted Pedersen, +3 more

- 01 Jun 2007 -

Journal of Biomedical Informatics

TL;DR: There is a role both for more flexible measures of relatedness based on information derived from corpora, as well as for measures that rely on existing ontological structures.

...read moreread less

Journal ArticleDOI

Open information extraction from the web

Oren Etzioni, +3 more

- 01 Dec 2008 -

Communications of The ACM

TL;DR: In this paper, a self-supervised learner employs a parser and heuristics to determine criteria that will be used by an extraction classifier (or other ranking model) for evaluating the trustworthiness of candidate tuples that have been extracted from the corpus of text.

...read moreread less

Proceedings ArticleDOI

BANNER: an executable survey of advances in biomedical named entity recognition.

Robert Leaman, +1 more

TL;DR: BANNER is an open-source, executable survey of advances in biomedical named entity recognition, intended to serve as a benchmark for the field and is designed to maximize domain independence by not employing brittle semantic features or rule-based processing steps.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes

Andrew Y. Ng, +1 more

TL;DR: It is shown, contrary to a widely-held belief that discriminative classifiers are almost always to be preferred, that there can often be two distinct regimes of performance as the training set size is increased, one in which each algorithm does better.

...read moreread less

Journal ArticleDOI

A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval

ChengXiang Zhai, +1 more

TL;DR: This paper examines the sensitivity of retrieval performance to the smoothing parameters and compares several popular smoothing methods on different test collection.

...read moreread less

Journal Article

Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging

Eric D. Brill

- 01 Dec 1995 -

Computational Linguistics

TL;DR: Injection molding wherein a pair of separable mold plates are initially urged together and fluid plastic is injected into a mold cavity formed between the mold plates to form an article.

...read moreread less

Proceedings ArticleDOI

Snowball: extracting relations from large plain-text collections

Eugene Agichtein, +1 more

TL;DR: This paper develops a scalable evaluation methodology and metrics for the task, and presents a thorough experimental evaluation of Snowball and comparable techniques over a collection of more than 300,000 newspaper documents.

...read moreread less

Journal ArticleDOI

An Algorithm that Learns What‘s in a Name

Daniel M. Bikel, +2 more

- 01 Feb 1999 -

Machine Learning

TL;DR: IdentiFinderTM, a hidden Markov model that learns to recognize and classify names, dates, times, and numerical quantities, is evaluated and is competitive with approaches based on handcrafted rules on mixed case text and superior on text where case information is not available.

...read moreread less

Classifying Semantic Relations in Bioscience Texts

Citations

Opinion observer: analyzing and comparing opinions on the Web

Open information extraction from the web

Measures of semantic similarity and relatedness in the biomedical domain

Open information extraction from the web

BANNER: an executable survey of advances in biomedical named entity recognition.

References

On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes

A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval

Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging

Snowball: extracting relations from large plain-text collections

An Algorithm that Learns What‘s in a Name

Related Papers (5)

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

GENIA corpus—a semantically annotated corpus for bio-textmining

Constructing Biological Knowledge Bases by Extracting Information from Text Sources

LIBSVM: A library for support vector machines

Snowball: extracting relations from large plain-text collections