Daxin Jiang

Researcher at Microsoft

Publications - 193
Citations - 8482

Daxin Jiang is an academic researcher at Microsoft who has contributed to research in topics including computer science and question answering. The author has an h-index of 32 and has co-authored 159 publications receiving 5330 citations. Previous affiliations of Daxin Jiang include Peking University and the University of Electronic Science and Technology of China.

Papers
Journal Article

Cluster analysis for gene expression data: a survey

TL;DR: This survey divides cluster analysis for gene expression data into three categories, presents the specific challenges pertinent to each category, introduces several representative approaches, and suggests promising trends in the field.
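As a concrete illustration of one representative family of approaches the survey covers, here is a minimal sketch that clusters a synthetic gene-expression matrix (genes x conditions) with k-means via scikit-learn; the data, cluster count, and preprocessing are illustrative assumptions, not details taken from the paper.

```python
# Minimal clustering sketch: k-means on a synthetic gene-expression matrix.
# Rows are genes, columns are experimental conditions; all values are synthetic.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
expression = rng.normal(size=(300, 12))              # 300 genes x 12 conditions (synthetic)
scaled = StandardScaler().fit_transform(expression)  # normalize each condition

kmeans = KMeans(n_clusters=5, n_init=10, random_state=0).fit(scaled)
for cluster_id in range(5):
    members = np.where(kmeans.labels_ == cluster_id)[0]
    print(f"cluster {cluster_id}: {len(members)} genes")
```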
Posted Content

CodeBERT: A Pre-Trained Model for Programming and Natural Languages

TL;DR: This work develops CodeBERT with a Transformer-based neural architecture and trains it with a hybrid objective function that incorporates the pre-training task of replaced token detection, i.e., detecting plausible alternative tokens sampled from generators.
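For a quick hands-on feel, the sketch below embeds a natural-language/code pair with the publicly released microsoft/codebert-base checkpoint through Hugging Face transformers; taking the first-position hidden state as the pair representation is a common convention here, not something prescribed by this summary.

```python
# Hedged sketch: embed an NL/code pair with the released CodeBERT checkpoint.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("microsoft/codebert-base")
model = AutoModel.from_pretrained("microsoft/codebert-base")

nl = "return the maximum of two numbers"
code = "def max_of_two(a, b): return a if a > b else b"

# NL and code are encoded as one bimodal sequence separated by special tokens.
inputs = tokenizer(nl, code, return_tensors="pt", truncation=True)
with torch.no_grad():
    outputs = model(**inputs)
pair_embedding = outputs.last_hidden_state[:, 0]  # first-position vector as pair representation
print(pair_embedding.shape)  # torch.Size([1, 768])
```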
Journal Article

Unicoder-VL: A Universal Encoder for Vision and Language by Cross-Modal Pre-Training.

TL;DR: After pre-training on large-scale image-caption pairs, Unicoder-VL is transferred to caption-based image-text retrieval and visual commonsense reasoning with just one additional output layer, demonstrating the power of cross-modal pre-training.
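The "one additional output layer" idea can be sketched generically: a single linear head scores whether a jointly encoded image-caption pair matches. The encoder output below is a random stand-in, not the actual Unicoder-VL weights or API.

```python
# Generic sketch of a retrieval head on top of a cross-modal encoder's pooled output.
import torch
import torch.nn as nn

class RetrievalHead(nn.Module):
    def __init__(self, hidden_size: int = 768):
        super().__init__()
        self.score = nn.Linear(hidden_size, 1)  # the single added output layer

    def forward(self, joint_embedding: torch.Tensor) -> torch.Tensor:
        # joint_embedding: (batch, hidden) pooled output of the cross-modal encoder
        return self.score(joint_embedding).squeeze(-1)

head = RetrievalHead()
fake_joint = torch.randn(4, 768)   # stand-in for real encoder output
print(head(fake_joint).shape)      # torch.Size([4]) image-caption matching scores
```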
Proceedings Article

Context-aware query suggestion by mining click-through and session data

TL;DR: This paper proposes a novel two-step context-aware query suggestion approach that mines click-through and session data, and outperforms two baseline methods in both coverage and quality of suggestions.
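A much-simplified sketch of the idea follows: it conditions suggestions only on the immediately preceding query in a session, whereas the paper clusters queries into concepts via the click-through bipartite graph and mines a concept-sequence suffix tree; the session log here is hypothetical.

```python
# Toy context-aware suggestion: rank follow-up queries by how often they
# followed the current query in past sessions (a simplification of the paper's method).
from collections import Counter, defaultdict

sessions = [  # hypothetical session logs
    ["python tutorial", "python list comprehension", "python lambda"],
    ["python tutorial", "python list comprehension"],
    ["java tutorial", "java streams"],
]

transitions = defaultdict(Counter)
for session in sessions:
    for prev_query, next_query in zip(session, session[1:]):
        transitions[prev_query][next_query] += 1

def suggest(context_query, k=3):
    """Return up to k candidate suggestions, most frequent first."""
    return [q for q, _ in transitions[context_query].most_common(k)]

print(suggest("python tutorial"))  # ['python list comprehension']
```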
Posted Content

GraphCodeBERT: Pre-training Code Representations with Data Flow

TL;DR: Results show that code structure and the newly introduced pre-training tasks improve GraphCodeBERT, which achieves state-of-the-art performance on the four downstream tasks; the model is also shown to prefer structure-level attention over token-level attention in the task of code search.
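The sketch below encodes a code snippet with the public microsoft/graphcodebert-base checkpoint in a code-search style; the paper's data-flow (variable dependency) graph input is omitted, so treat this as a token-only approximation rather than the full method.

```python
# Hedged sketch: token-only encoding with the released GraphCodeBERT checkpoint.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("microsoft/graphcodebert-base")
model = AutoModel.from_pretrained("microsoft/graphcodebert-base")

code = "def add(a, b):\n    return a + b"
inputs = tokenizer(code, return_tensors="pt", truncation=True)
with torch.no_grad():
    vec = model(**inputs).last_hidden_state[:, 0]  # first-position vector for retrieval
print(vec.shape)  # torch.Size([1, 768])
```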