Book Chapter

Improving Short Answer Grading Using Transformer-Based Pre-training.

TL;DR

This work experiments with fine-tuning a pre-trained self-attention language model, Bidirectional Encoder Representations from Transformers (BERT), for short answer grading, and shows that it produces superior results across multiple domains.
Abstract
Dialogue-based tutoring platforms have shown great promise in helping individual students improve mastery. Short answer grading is a crucial component of such platforms. However, generative short answer grading using the same platform for diverse disciplines and titles is a significant challenge due to data distribution variations across domains and the frequent occurrence of non-sentential answers. Recent NLP research has introduced novel deep learning architectures such as the Transformer, which relies solely on self-attention mechanisms. Pre-trained models based on the Transformer architecture have produced impressive results across a range of NLP tasks. In this work, we experiment with fine-tuning a pre-trained self-attention language model, namely Bidirectional Encoder Representations from Transformers (BERT), applying it to short answer grading, and show that it produces superior results across multiple domains. On the SemEval-2013 benchmark dataset, we report up to 10% absolute improvement in macro-average F1 over state-of-the-art results. On our two psychology-domain datasets, the fine-tuned model yields classification performance nearly at human-agreement levels. Moreover, we study the effectiveness of fine-tuning as a function of the size of task-specific labeled data, the number of training epochs, and its generalizability to cross-domain and joint-domain scenarios.
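The chapter does not include code, but the fine-tuning recipe it describes amounts to sentence-pair classification over (reference answer, student answer) pairs. Below is a minimal sketch, assuming the Hugging Face transformers library and an illustrative three-way label scheme; neither detail is specified by the authors.

```python
# Hypothetical sketch: fine-tuning BERT for short answer grading as
# sentence-pair classification (reference answer, student answer) -> label.
# Library choice (Hugging Face transformers) and the label set are
# assumptions, not details taken from the chapter.
import torch
from torch.optim import AdamW
from transformers import BertTokenizer, BertForSequenceClassification

LABELS = ["correct", "incorrect", "contradictory"]  # assumed label scheme

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=len(LABELS)
)
optimizer = AdamW(model.parameters(), lr=2e-5)

# Toy example pair; real training would iterate over a labeled dataset.
reference = "The bulb does not light because the circuit is open."
student = "There is a gap in the circuit so no current flows."
label = torch.tensor([LABELS.index("correct")])

inputs = tokenizer(reference, student, return_tensors="pt",
                   truncation=True, padding=True)
outputs = model(**inputs, labels=label)
outputs.loss.backward()
optimizer.step()
optimizer.zero_grad()
```

Framing grading as a pair-classification task is what lets the same pre-trained encoder be reused across domains; only the lightweight classification head and the fine-tuned weights change.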


Citations
Proceedings Article

Pre-Training BERT on Domain Resources for Short Answer Grading.

TL;DR: It is shown that the pre-trained BERT model can be improved by augmenting it with data from domain-specific resources such as textbooks, and a new approach is proposed that uses labeled short answer grading data to further enhance the language model.
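The domain pre-training step referred to here corresponds to continuing BERT's masked-language-model objective on domain text (e.g. textbook passages) before task fine-tuning. The sketch below is a hedged illustration of that step; the file name, hyperparameters, and use of the Hugging Face Trainer are placeholders, not details from the cited paper.

```python
# Illustrative sketch of continued masked-LM pre-training on domain text
# before grading fine-tuning. Paths and hyperparameters are placeholders.
from transformers import (BertTokenizerFast, BertForMaskedLM,
                          DataCollatorForLanguageModeling,
                          Trainer, TrainingArguments)
from datasets import load_dataset

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")

corpus = load_dataset("text", data_files={"train": "domain_textbook.txt"})
tokenized = corpus.map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=128),
    batched=True, remove_columns=["text"])

collator = DataCollatorForLanguageModeling(tokenizer, mlm_probability=0.15)
trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="bert-domain", num_train_epochs=1,
                           per_device_train_batch_size=16),
    train_dataset=tokenized["train"],
    data_collator=collator,
)
trainer.train()
model.save_pretrained("bert-domain")  # later loaded for grading fine-tuning
```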
Book Chapter

Investigating Transformers for Automatic Short Answer Grading

TL;DR: This work trains the newest and, according to the GLUE benchmark, most powerful transformers on the SemEval-2013 dataset, and shows that models trained with knowledge distillation are feasible for use in short answer grading.
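The distillation setup is not detailed in this summary; the sketch below shows only the standard knowledge-distillation loss, mixing a softened teacher/student KL term with the hard-label cross-entropy. The temperature and mixing weight are illustrative choices, not values from the paper.

```python
# Generic knowledge-distillation loss: a small student transformer mimics a
# large teacher's softened output distribution while also fitting gold labels.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      T: float = 2.0, alpha: float = 0.5):
    # KL divergence between temperature-softened distributions,
    # scaled by T^2 as is customary.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Standard cross-entropy on the gold grading labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```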
Proceedings Article

Neural Automated Essay Scoring Incorporating Handcrafted Features

TL;DR: This method concatenates handcrafted essay-level features with a distributed essay representation vector obtained from an intermediate layer of a DNN-AES model, which significantly improves scoring accuracy.
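A minimal sketch of the concatenation idea follows, with illustrative dimensions and a simple regression head standing in for the cited DNN-AES architecture.

```python
# Sketch of the hybrid idea: concatenate handcrafted essay-level features
# (e.g. length, spelling-error counts) with a learned essay vector taken
# from an intermediate layer, then regress a score. Dimensions are
# illustrative, not taken from the paper.
import torch
import torch.nn as nn

class HybridScorer(nn.Module):
    def __init__(self, essay_dim: int = 256, handcrafted_dim: int = 12):
        super().__init__()
        self.head = nn.Sequential(
            nn.Linear(essay_dim + handcrafted_dim, 64),
            nn.ReLU(),
            nn.Linear(64, 1),  # scalar essay score
        )

    def forward(self, essay_vec: torch.Tensor, feats: torch.Tensor):
        return self.head(torch.cat([essay_vec, feats], dim=-1))

scorer = HybridScorer()
score = scorer(torch.randn(8, 256), torch.randn(8, 12))  # batch of 8 essays
```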
Journal Article

A review of deep-neural automated essay scoring models

TL;DR: A comprehensive survey of deep neural network AES models is presented, describing the main idea and detailed architecture of each model and organizing existing DNN-AES models according to a proposed classification.
Book Chapter

Robust Neural Automated Essay Scoring Using Item Response Theory

TL;DR: A new DNN-AES framework is proposed that integrates item response theory (IRT) models to handle rater bias within training data, a first attempt at addressing this crucial but overlooked problem.
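The chapter's exact IRT formulation is not given here; a generic many-facet Rasch-style model that this kind of framework can build on expresses the probability of a positive rating in terms of examinee ability, item difficulty, and rater severity (notation is illustrative, not taken from the chapter).

```latex
% Generic many-facet Rasch-style model (illustrative notation):
% \theta_j: ability of examinee j, \beta_i: difficulty of item i,
% \gamma_r: severity of rater r.
P(x_{ijr} = 1 \mid \theta_j, \beta_i, \gamma_r)
  = \frac{\exp(\theta_j - \beta_i - \gamma_r)}
         {1 + \exp(\theta_j - \beta_i - \gamma_r)}
```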
References
Proceedings Article

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

TL;DR: BERT pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers; the pre-trained model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
Journal Article

A Survey on Transfer Learning

TL;DR: The relationship between transfer learning and related machine learning techniques such as domain adaptation, multitask learning, sample selection bias, and covariate shift is discussed.
Posted Content

Attention Is All You Need

TL;DR: A new, simple network architecture, the Transformer, based solely on attention mechanisms and dispensing with recurrence and convolutions entirely, is proposed; it generalizes well to other tasks, as shown by its successful application to English constituency parsing with both large and limited training data.
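At the core of the Transformer, and hence of the BERT fine-tuning discussed above, is the scaled dot-product attention defined in this paper over queries Q, keys K, and values V with key dimension d_k:

```latex
\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{Q K^{\top}}{\sqrt{d_k}}\right) V
```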
Posted Content

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation

TL;DR: GNMT, Google's Neural Machine Translation system, is presented; it addresses many of the weaknesses of conventional phrase-based translation systems and provides a good balance between the flexibility of "character"-delimited models and the efficiency of "word"-delimited models.
Proceedings Article

Self-taught learning: transfer learning from unlabeled data

TL;DR: An approach to self-taught learning is presented that uses sparse coding to construct higher-level features from unlabeled data, forming a succinct input representation that significantly improves classification performance.
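The following is a hedged sketch of the self-taught-learning recipe using scikit-learn's DictionaryLearning as the sparse coder; the data, dimensions, and downstream classifier are placeholders rather than details from the paper.

```python
# Illustrative self-taught learning: learn a sparse-coding dictionary on
# unlabeled data, encode the (smaller) labeled set with it, and train a
# classifier on the sparse codes. Data here is random placeholder input.
import numpy as np
from sklearn.decomposition import DictionaryLearning
from sklearn.linear_model import LogisticRegression

rng = np.random.RandomState(0)
unlabeled = rng.randn(500, 64)          # plentiful unlabeled examples
labeled_X = rng.randn(60, 64)           # scarce labeled examples
labeled_y = rng.randint(0, 2, size=60)

# Learn sparse basis vectors from the unlabeled data only.
dico = DictionaryLearning(n_components=32, alpha=1.0, max_iter=200,
                          random_state=0)
dico.fit(unlabeled)

# Re-represent labeled data as sparse activations over the learned basis.
codes = dico.transform(labeled_X)
clf = LogisticRegression(max_iter=1000).fit(codes, labeled_y)
```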