scispace - formally typeset
Open AccessProceedings ArticleDOI

Classifying Semantic Relations in Bioscience Texts

TLDR
This work examines the problem of distinguishing among seven relation types that can occur between the entities "treatment" and "disease" in bioscience text, and finds that the latter help achieve high classification accuracy.
Abstract
A crucial step toward the goal of automatic extraction of propositional information from natural language text is the identification of semantic relations between constituents in sentences. We examine the problem of distinguishing among seven relation types that can occur between the entities "treatment" and "disease" in bioscience text, and the problem of identifying such entities. We compare five generative graphical models and a neural network, using lexical, syntactic, and semantic features, finding that the latter help achieve high classification accuracy.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Opinion observer: analyzing and comparing opinions on the Web

TL;DR: A novel framework for analyzing and comparing consumer opinions of competing products is proposed, and a new technique based on language pattern mining is proposed to extract product features from Pros and Cons in a particular type of reviews.
Proceedings Article

Open information extraction from the web

TL;DR: Open Information Extraction (OIE) as mentioned in this paper is a new extraction paradigm where the system makes a single data-driven pass over its corpus and extracts a large set of relational tuples without requiring any human input.
Journal ArticleDOI

Measures of semantic similarity and relatedness in the biomedical domain

TL;DR: There is a role both for more flexible measures of relatedness based on information derived from corpora, as well as for measures that rely on existing ontological structures.
Journal ArticleDOI

Open information extraction from the web

TL;DR: In this paper, a self-supervised learner employs a parser and heuristics to determine criteria that will be used by an extraction classifier (or other ranking model) for evaluating the trustworthiness of candidate tuples that have been extracted from the corpus of text.
Proceedings ArticleDOI

BANNER: an executable survey of advances in biomedical named entity recognition.

TL;DR: BANNER is an open-source, executable survey of advances in biomedical named entity recognition, intended to serve as a benchmark for the field and is designed to maximize domain independence by not employing brittle semantic features or rule-based processing steps.
References
More filters
Proceedings Article

On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes

TL;DR: It is shown, contrary to a widely-held belief that discriminative classifiers are almost always to be preferred, that there can often be two distinct regimes of performance as the training set size is increased, one in which each algorithm does better.
Journal ArticleDOI

A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval

TL;DR: This paper examines the sensitivity of retrieval performance to the smoothing parameters and compares several popular smoothing methods on different test collection.
Journal Article

Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging

TL;DR: Injection molding wherein a pair of separable mold plates are initially urged together and fluid plastic is injected into a mold cavity formed between the mold plates to form an article.
Proceedings ArticleDOI

Snowball: extracting relations from large plain-text collections

TL;DR: This paper develops a scalable evaluation methodology and metrics for the task, and presents a thorough experimental evaluation of Snowball and comparable techniques over a collection of more than 300,000 newspaper documents.
Journal ArticleDOI

An Algorithm that Learns What‘s in a Name

TL;DR: IdentiFinderTM, a hidden Markov model that learns to recognize and classify names, dates, times, and numerical quantities, is evaluated and is competitive with approaches based on handcrafted rules on mixed case text and superior on text where case information is not available.
Related Papers (5)