scispace - formally typeset
M

Mingda Chen

Researcher at Toyota Technological Institute at Chicago

Publications -  33
Citations -  5024

Mingda Chen is an academic researcher from Toyota Technological Institute at Chicago. The author has contributed to research in topics: Sentence & Computer science. The author has an hindex of 10, co-authored 28 publications receiving 2984 citations.

Papers
More filters
Proceedings Article

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

TL;DR: This work presents two parameter-reduction techniques to lower memory consumption and increase the training speed of BERT, and uses a self-supervised loss that focuses on modeling inter-sentence coherence.
Posted Content

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

TL;DR: The authors proposed a self-supervised loss that focuses on modeling inter-sentence coherence, and showed it consistently helps downstream tasks with multientence inputs, achieving state-of-the-art results on the GLUE, RACE, and \squad benchmarks.
Proceedings ArticleDOI

A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations

TL;DR: A generative model for a sentence that uses two latent variables, with one intended to represent the syntax of the sentence and the other to represent its semantics, is proposed, which shows it can achieve better disentanglement between semantic and syntactic representations by training with multiple losses.
Proceedings ArticleDOI

Controllable Paraphrase Generation with a Syntactic Exemplar

TL;DR: The authors proposed a controllable text generation task, where the syntax of a generated sentence is controlled rather by a sentential exemplar, and developed a variational model with a neural module designed for capturing syntactic knowledge and several multitask training objectives to promote disentangled representation learning.
Proceedings ArticleDOI

Variational Sequential Labelers for Semi-Supervised Learning

TL;DR: A family of multitask variational methods for semi-supervised sequence labeling that combines a latent-variable generative model and a discriminative labeler, and explores several latent variable configurations, including ones with hierarchical structure.