scispace - formally typeset
Open Access

Discourse-level argumentation in scientific articles: human and automatic annotation

Simone Teufel, +1 more
TLDR
A rhetorically defined annotation scheme which is part of the authors' corpus-based method for the summarisation of scientific articles, and it is shown that this kind of resource can be used to train a system to automate the annotation work.
Abstract
In this paper we present a rhetorically defined annotation scheme which is part of our corpus-based method for the summarisation of scientific articles. The annotation scheme consists of seven non-hierarchical labels which model prototypical academic argumentation and expected intentional 'moves'. In a large-scale experiments with three expert coders, we found the scheme stable and reproducible. We have built a resource consisting of 80 papers annotated by the scheme, and we show that this kind of resource can be used to train a system to automate the annotation work.

read more

Citations
More filters
Journal ArticleDOI

Extractive summarisation of legal texts

TL;DR: Results are encouraging as they achieve state-of-the-art accuracy using robust, automatically generated cue phrase information and the utility of the rhetorical annotation scheme as a model of legal discourse, which provides a clear means for structuring summaries and tailoring them to different types of users.
Journal ArticleDOI

Survey about citation context analysis: Tasks, techniques, and resources

TL;DR: An overview of general concepts and contributions to the solutions to problems related to bibliometric calculations are presented, with the purpose of identifying trends and suggesting possible future research directions.
Journal ArticleDOI

Combining information extraction with genetic algorithms for text mining

TL;DR: This work has brought together the benefits of GAs for data mining and IE technology to propose a new approach for high-level knowledge discovery that doesn't rely on external resources or conceptual descriptions and performs the discovery using only information from the original corpus of text documents and from training data computed from them.
Proceedings ArticleDOI

Multidimensional text analysis for eRulemaking

TL;DR: Techniques to automatically analyze large number of public comments on proposed regulations, performed on comments submitted to the Environmental Protection Agency in response to their proposed rule for mercury regulation, are developed.
Book

The Scientific Article in the Age of Digitization

TL;DR: The author examines the development of scientific communication through digitization, the impact of digitization on scientific communication, and the dynamics of change in the period 1987-2004.
References
More filters
Book

Nonparametric statistics for the behavioral sciences

Sidney Siegel
TL;DR: This is the revision of the classic text in the field, adding two new chapters and thoroughly updating all others as discussed by the authors, and the original structure is retained, and the book continues to serve as a combined text/reference.
Book

Content analysis: an introduction to its methodology

TL;DR: History Conceptual Foundations Uses and Kinds of Inference The Logic of Content Analysis Designs Unitizing Sampling Recording Data Languages Constructs for Inference Analytical Techniques The Use of Computers Reliability Validity A Practical Guide
Book

Genre Analysis: English in Academic and Research Settings

TL;DR: The authors provides a survey of approaches to various genres of language, and considers these in relation to communication and task-based language learning, as well as examples of different genres and how they can be made accessible through genre analysis.
Journal ArticleDOI

The automatic creation of literature abstracts

TL;DR: In the exploratory research described, the complete text of an article in machine-readable form is scanned by an IBM 704 data-processing machine and analyzed in accordance with a standard program.
Related Papers (5)