scispace - formally typeset
C

Chi Hu

Publications -  12
Citations -  123

Chi Hu is an academic researcher. The author has contributed to research in topics: Machine translation & Transformer (machine learning model). The author has an hindex of 5, co-authored 10 publications receiving 85 citations.

Papers
More filters
Proceedings ArticleDOI

Improved Differentiable Architecture Search for Language Modeling and Named Entity Recognition.

TL;DR: This paper improves differentiable architecture search by removing the softmax-local constraint and applies differentiable NAS to named entity recognition (NER), the first time that differentiableNAS methods are adopted in NLP tasks other than language modeling.
Proceedings ArticleDOI

Learning Architectures from an Extended Search Space for Language Modeling

TL;DR: This paper presents a general approach to learn both intra-cell and inter-cell architectures to extend the search space of NAS and implements it in a differentiable architecture search system.
Proceedings Article

The NiuTrans System for the WMT20 Quality Estimation Shared Task

TL;DR: The NiuTrans Team as discussed by the authors used a combination of transfer learning, multi-task learning and model ensemble for the WMT 2020 Quality Estimation shared task, which achieved remarkable results in multiple level tasks, e.g., their system achieved the best results on all tracks in the sentence-level Direct Assessment task.
Proceedings ArticleDOI

The NiuTrans System for WNGT 2020 Efficiency Task.

TL;DR: The NiuTrans Team explored the combination of deep encoder and shallow decoder in Transformer models via model compression and knowledge distillation, and the neural machine translation decoding also benefits from FP16 inference, attention caching, dynamic batching, and batch pruning.
Posted Content

Learning Architectures from an Extended Search Space for Language Modeling

TL;DR: In this article, a joint learning method is proposed to perform intra-cell and inter-cell NAS simultaneously, which achieves state-of-the-art results on the CoNLL and WNUT named entity recognition (NER) tasks and Co-NLL chunking task.