Open Access · Proceedings Article

Cross-Lingual Transfer Learning for POS Tagging without Cross-Lingual Resources

TL;DR
Evaluating on POS datasets from 14 languages in the Universal Dependencies corpus, the proposed transfer learning model is shown to improve the POS tagging performance of the target languages without exploiting any linguistic knowledge about the relation between the source language and the target language.
Abstract
Training a POS tagging model with cross-lingual transfer learning usually requires linguistic knowledge and resources about the relation between the source language and the target language. In this paper, we introduce a cross-lingual transfer learning model for POS tagging without ancillary resources such as parallel corpora. The proposed cross-lingual model utilizes a common BLSTM that enables knowledge transfer from other languages, and private BLSTMs for language-specific representations. The cross-lingual model is trained with language-adversarial training and bidirectional language modeling as auxiliary objectives to better represent language-general information without losing the information about a specific target language. Evaluating on POS datasets from 14 languages in the Universal Dependencies corpus, we show that the proposed transfer learning model improves the POS tagging performance of the target languages without exploiting any linguistic knowledge between the source language and the target language.
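The architecture described in the abstract lends itself to a compact sketch. Below is a minimal PyTorch rendering of a shared/private BLSTM tagger with a gradient-reversal language adversary and bidirectional language-model heads; the module names, dimensions, and the mean-pooling choice for the adversary are illustrative assumptions, not the authors' released implementation.

```python
# Minimal sketch of the shared/private BLSTM tagger described in the abstract.
# Names, sizes, and the gradient-reversal adversary are illustrative assumptions.
import torch
import torch.nn as nn


class GradReverse(torch.autograd.Function):
    """Identity in the forward pass; reverses gradients so the shared BLSTM
    learns language-general features that fool the language discriminator."""

    @staticmethod
    def forward(ctx, x, lamb):
        ctx.lamb = lamb
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lamb * grad_output, None


class CrossLingualTagger(nn.Module):
    def __init__(self, vocab_size, n_tags, n_langs, emb=128, hid=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb)
        # One BLSTM shared across languages, one private BLSTM per language.
        self.shared = nn.LSTM(emb, hid, bidirectional=True, batch_first=True)
        self.private = nn.ModuleList(
            [nn.LSTM(emb, hid, bidirectional=True, batch_first=True)
             for _ in range(n_langs)])
        self.tagger = nn.Linear(4 * hid, n_tags)      # POS head
        self.lang_clf = nn.Linear(2 * hid, n_langs)   # language adversary
        self.fwd_lm = nn.Linear(2 * hid, vocab_size)  # bidirectional LM
        self.bwd_lm = nn.Linear(2 * hid, vocab_size)  #   auxiliary heads

    def forward(self, tokens, lang_id, lamb=1.0):
        x = self.embed(tokens)                        # (batch, seq_len, emb)
        shared_out, _ = self.shared(x)                # language-general
        private_out, _ = self.private[lang_id](x)     # language-specific
        tag_logits = self.tagger(torch.cat([shared_out, private_out], dim=-1))
        # Adversarial language prediction from the gradient-reversed,
        # mean-pooled shared representation.
        pooled = GradReverse.apply(shared_out.mean(dim=1), lamb)
        lang_logits = self.lang_clf(pooled)
        # Auxiliary bidirectional language-model predictions.
        fwd_logits = self.fwd_lm(shared_out)
        bwd_logits = self.bwd_lm(shared_out)
        return tag_logits, lang_logits, fwd_logits, bwd_logits
```

In training, the POS cross-entropy, the adversarial language-classification loss (acting through the reversed gradients), and the forward/backward language-model losses would be summed, with the weights of the auxiliary terms treated as hyperparameters.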



Citations
Proceedings Article

Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT

TL;DR: This paper explores the broader cross-lingual potential of multilingual BERT as a zero-shot language transfer model on 5 NLP tasks covering a total of 39 languages from various language families: NLI, document classification, NER, POS tagging, and dependency parsing.
Posted Content

Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT.

TL;DR: This paper explores the broader cross-lingual potential of multilingual BERT (mBERT) as a zero-shot language transfer model on 5 NLP tasks covering a total of 39 languages from various language families: NLI, document classification, NER, POS tagging, and dependency parsing.
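For readers who want to see the zero-shot setting these two entries describe in code, a minimal sketch with the Hugging Face transformers library follows; the label count and the target-language sentence are placeholders, and the source-language fine-tuning step is elided.

```python
# Zero-shot cross-lingual transfer sketch: fine-tune multilingual BERT on
# source-language POS data, then run it unchanged on a target-language sentence.
# num_labels and the example sentence are placeholders.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

name = "bert-base-multilingual-cased"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForTokenClassification.from_pretrained(name, num_labels=17)

# ... fine-tune `model` on source-language (e.g. English) POS data here ...

sentence = "Das ist ein Beispiel ."          # target language unseen in fine-tuning
inputs = tokenizer(sentence, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits          # (1, num_subwords, num_labels)
pred_tags = logits.argmax(dim=-1)            # subword-level tag ids
```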
Journal Article

Deep convolutional neural networks with ensemble learning and transfer learning for capacity estimation of lithium-ion batteries

TL;DR: The verification and comparison results demonstrate that the proposed DCNN-ETL method achieves higher accuracy and robustness than the other data-driven methods in estimating the capacities of Li-ion cells in the target task.
Proceedings Article

Adversarial Transfer Learning for Chinese Named Entity Recognition with Self-Attention Mechanism

TL;DR: This paper proposes a novel adversarial transfer learning framework that makes full use of the word-boundary information shared between the tasks while filtering out the task-specific features of Chinese word segmentation (CWS), and exploits self-attention to explicitly capture long-range dependencies between two tokens.
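The self-attention component referred to above is standard scaled dot-product attention; a single-head PyTorch sketch with illustrative dimensions is shown below.

```python
# Generic single-head scaled dot-product self-attention, of the kind used to
# capture long-range dependencies between tokens. Dimensions are illustrative.
import math
import torch
import torch.nn as nn


class SelfAttention(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.q = nn.Linear(dim, dim)
        self.k = nn.Linear(dim, dim)
        self.v = nn.Linear(dim, dim)

    def forward(self, x):                         # x: (batch, seq_len, dim)
        q, k, v = self.q(x), self.k(x), self.v(x)
        scores = q @ k.transpose(-2, -1) / math.sqrt(x.size(-1))
        weights = torch.softmax(scores, dim=-1)   # each token attends to all tokens
        return weights @ v
```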
Proceedings Article

XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation

TL;DR: The recent cross-lingual pre-trained model Unicoder is extended to cover both understanding and generation tasks and is evaluated on XGLUE as a strong baseline; the base versions of Multilingual BERT, XLM, and XLM-R are evaluated for comparison.
References
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
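The Adam update rule itself is compact; the following NumPy sketch applies one step to a single parameter array, using the commonly cited default hyperparameters.

```python
# Hand-rolled Adam update for one parameter array (t counts steps from 1).
# Hyperparameter values are the commonly used defaults.
import numpy as np


def adam_step(theta, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    m = beta1 * m + (1 - beta1) * grad           # first-moment estimate
    v = beta2 * v + (1 - beta2) * grad ** 2      # second-moment estimate
    m_hat = m / (1 - beta1 ** t)                 # bias correction
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```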
Journal Article

A Survey on Transfer Learning

TL;DR: The relationship between transfer learning and other related machine learning techniques, such as domain adaptation, multitask learning, sample selection bias, and covariate shift, is discussed.
Proceedings Article

Convolutional Neural Networks for Sentence Classification

TL;DR: The CNN models discussed herein improve upon the state of the art on 4 out of 7 tasks, which include sentiment analysis and question classification, and a simple modification to the architecture is proposed to allow for the use of both task-specific and static vectors.
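The architecture summarized here (parallel convolutions of several filter widths over word embeddings, max-over-time pooling, then a linear classifier) can be sketched in a few lines of PyTorch; the sizes below are illustrative, not the paper's exact configuration.

```python
# Compact text-CNN sketch: parallel convolutions over word embeddings with
# several filter widths, max-over-time pooling, then a linear classifier.
import torch
import torch.nn as nn


class TextCNN(nn.Module):
    def __init__(self, vocab_size, n_classes, emb=300, n_filters=100, widths=(3, 4, 5)):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb)
        self.convs = nn.ModuleList(
            [nn.Conv1d(emb, n_filters, w) for w in widths])
        self.fc = nn.Linear(n_filters * len(widths), n_classes)

    def forward(self, tokens):                    # tokens: (batch, seq_len)
        x = self.embed(tokens).transpose(1, 2)    # (batch, emb, seq_len)
        feats = [conv(x).relu().max(dim=2).values for conv in self.convs]
        return self.fc(torch.cat(feats, dim=1))
```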
Proceedings Article

Understanding the difficulty of training deep feedforward neural networks

TL;DR: The objective is to better understand why standard gradient descent from random initialization performs so poorly with deep neural networks, in order to better understand recent relative successes and help design better algorithms in the future.
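The initialization scheme proposed in that paper (Glorot/Xavier initialization) scales the weight variance by fan-in and fan-out so that activations and gradients keep comparable magnitude across layers; a small NumPy sketch of the uniform variant follows.

```python
# Glorot/Xavier uniform initialization: limit = sqrt(6 / (fan_in + fan_out)).
import numpy as np


def xavier_uniform(fan_in, fan_out, rng=np.random.default_rng()):
    limit = np.sqrt(6.0 / (fan_in + fan_out))
    return rng.uniform(-limit, limit, size=(fan_in, fan_out))
```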
Posted Content

Convolutional Neural Networks for Sentence Classification

TL;DR: In this article, CNNs are trained on top of pre-trained word vectors for sentence-level classification tasks and a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks.