Institution

Alibaba Group

Company•Hangzhou, China•

About: Alibaba Group is a company organization based out in Hangzhou, China. It is known for research contribution in the topics: Computer science & Terminal (electronics). The organization has 6810 authors who have published 7389 publications receiving 55653 citations. The organization is also known as: Alibaba Group Holding Limited & Alibaba Group (Cayman Islands).

...read moreread less

Topics: Computer science, Terminal (electronics), Graph (abstract data type), Node (networking), Deep learning ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Journal Article•DOI•

scRMD: imputation for single cell RNA-seq data via robust matrix decomposition.

[...]

Chong Chen¹, Chong Chen², Changjing Wu¹, Linjie Wu¹, Xiaochen Wang¹, Minghua Deng¹, Ruibin Xi¹ - Show less +3 more•Institutions (2)

Peking University¹, Alibaba Group²

01 May 2020-Bioinformatics

TL;DR: This model has minimal assumptions and allows for a computational efficient imputation method called scRMD to be developed, which can accurately recover the dropout values and help to improve downstream analysis such as differential expression analysis and clustering analysis.

...read moreread less

Abstract: Motivation Single cell RNA-sequencing (scRNA-seq) technology enables whole transcriptome profiling at single cell resolution and holds great promises in many biological and medical applications. Nevertheless, scRNA-seq often fails to capture expressed genes, leading to the prominent dropout problem. These dropouts cause many problems in down-stream analysis, such as significant increase of noises, power loss in differential expression analysis and obscuring of gene-to-gene or cell-to-cell relationship. Imputation of these dropout values can be beneficial in scRNA-seq data analysis. Results In this article, we model the dropout imputation problem as robust matrix decomposition. This model has minimal assumptions and allows us to develop a computational efficient imputation method called scRMD. Extensive data analysis shows that scRMD can accurately recover the dropout values and help to improve downstream analysis such as differential expression analysis and clustering analysis. Availability and implementation The R package scRMD is available at https://github.com/XiDsLab/scRMD. Supplementary information Supplementary data are available at Bioinformatics online.

...read moreread less

40 citations

Proceedings Article•DOI•

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones

[...]

Chong Liu¹, Yuqi Zhang², Hao Luo², Jiasheng Tang², Weihua Chen², Xianzhe Xu², Fan Wang², Hao Li², Yi-Dong Shen¹ - Show less +5 more•Institutions (2)

Chinese Academy of Sciences¹, Alibaba Group²

01 Jun 2021

TL;DR: In this paper, a multi-target multi-camera vehicle tracking framework guided by the crossroad zones is proposed, which obtained an IDF1 score of 0.8095, ranking first on the leaderboard.

...read moreread less

Abstract: Multi-Target Multi-Camera Tracking has a wide range of applications and is the basis for many advanced inferences and predictions. This paper describes our solution to the Track 3 multi-camera vehicle tracking task in 2021 AI City Challenge (AICITY21). This paper proposes a multi-target multi-camera vehicle tracking framework guided by the crossroad zones. The framework includes: (1) Use mature detection and vehicle re-identification models to extract targets and appearance features. (2) Use modified JDE-Tracker (without detection module) to track single-camera vehicles and generate single-camera tracklets. (3) According to the characteristics of the crossroad, the Tracklet Filter Strategy and the Direction Based Temporal Mask are proposed. (4) Propose Sub-clustering in Adjacent Cameras for multi-camera tracklets matching. Through the above techniques, our method obtained an IDF1 score of 0.8095, ranking first on the leaderboard1. The code will be released later.

...read moreread less

40 citations

Proceedings Article•

ControlVAE: Controllable Variational Autoencoder

[...]

Huajie Shao¹, Shuochao Yao¹, Dachun Sun¹, Aston Zhang², Shengzhong Liu¹, Dongxin Liu¹, Jun Wang³, Tarek Abdelzaher¹ - Show less +4 more•Institutions (3)

University of Illinois at Urbana–Champaign¹, Amazon.com², Alibaba Group³

12 Jul 2020

TL;DR: A new non-linear PI controller is designed, a variant of the proportional-integral-derivative (PID) control, to automatically tune the hyperparameter (weight) added in the VAE objective using the output KL-divergence as feedback during model training.

...read moreread less

Abstract: Variational Autoencoders (VAE) and their variants have been widely used in a variety of applications, such as dialog generation, image generation and disentangled representation learning. However, the existing VAE models have some limitations in different applications. For example, a VAE easily suffers from KL vanishing in language modeling and low reconstruction quality for disentangling. To address these issues, we propose a novel controllable variational autoencoder framework, ControlVAE, that combines a controller, inspired by automatic control theory, with the basic VAE to improve the performance of resulting generative models. Specifically, we design a new non-linear PI controller, a variant of the proportional-integral-derivative (PID) control, to automatically tune the hyperparameter (weight) added in the VAE objective using the output KL-divergence as feedback during model training. The framework is evaluated using three applications; namely, language modeling, disentangled representation learning, and image generation. The results show that ControlVAE can achieve better disentangling and reconstruction quality than the existing methods. For language modelling, it not only averts the KL-vanishing, but also improves the diversity of generated text. Finally, we also demonstrate that ControlVAE improves the reconstruction quality of generated images compared to the original VAE.

...read moreread less

40 citations

Proceedings Article•DOI•

eventAI at SemEval-2019 Task 7: Rumor Detection on Social Media by Exploiting Content, User Credibility and Propagation Information.

[...]

Quanzhi Li¹, Qiong Zhang¹, Luo Si¹•Institutions (1)

Alibaba Group¹

01 Jun 2019

TL;DR: This paper describes the system for SemEval 2019 RumorEval: Determining rumor veracity and support for rumors, which includes neural network models as well as the traditional classification algorithms.

...read moreread less

Abstract: This paper describes our system for SemEval 2019 RumorEval: Determining rumor veracity and support for rumors (SemEval 2019 Task 7). This track has two tasks: Task A is to determine a user’s stance towards the source rumor, and Task B is to detect the veracity of the rumor: true, false or unverified. For stance classification, a neural network model with language features is utilized. For rumor verification, our approach exploits information from different dimensions: rumor content, source credibility, user credibility, user stance, event propagation path, etc. We use an ensemble approach in both tasks, which includes neural network models as well as the traditional classification algorithms. Our system is ranked 1st place in the rumor verification task by both the macro F1 measure and the RMSE measure.

...read moreread less

40 citations

Journal Article•DOI•

Acquiring Knowledge from Pre-trained Model to Neural Machine Translation

[...]

Rongxiang Weng¹, Heng Yu¹, Shujian Huang², Shanbo Cheng¹, Weihua Luo¹ - Show less +1 more•Institutions (2)

Alibaba Group¹, Nanjing University²

03 Apr 2020

TL;DR: This article proposed an Apt framework for acquiring knowledge from pre-trained model to NMT. But, the problem that the training objective of the bilingual task is far different from the monolingual pretrained model is not addressed.

...read moreread less

Abstract: Pre-training and fine-tuning have achieved great success in natural language process field. The standard paradigm of exploiting them includes two steps: first, pre-training a model, e.g. BERT, with a large scale unlabeled monolingual data. Then, fine-tuning the pre-trained model with labeled data from downstream tasks. However, in neural machine translation (NMT), we address the problem that the training objective of the bilingual task is far different from the monolingual pre-trained model. This gap leads that only using fine-tuning in NMT can not fully utilize prior language knowledge. In this paper, we propose an Apt framework for acquiring knowledge from pre-trained model to NMT. The proposed approach includes two modules: 1). a dynamic fusion mechanism to fuse task-specific features adapted from general knowledge into NMT network, 2). a knowledge distillation paradigm to learn language knowledge continuously during the NMT training process. The proposed approach could integrate suitable knowledge from pre-trained models to improve the NMT. Experimental results on WMT English to German, German to English and Chinese to English machine translation tasks show that our model outperforms strong baselines and the fine-tuning counterparts.

...read moreread less

40 citations

Collapse

Authors

Showing all 6829 results

Name	H-index	Papers	Citations
Philip S. Yu	148	1914	107374
Lei Zhang	130	2312	86950
Jian Xu	94	1366	52057
Wei Chu	80	670	28771
Le Song	76	345	21382
Yuan Xie	76	739	24155
Narendra Ahuja	76	474	29517
Rong Jin	75	449	19456
Beng Chin Ooi	73	408	19174
Wotao Yin	72	303	27233
Deng Cai	70	326	24524
Xiaofei He	70	260	28215
Irwin King	67	476	19056
Gang Wang	65	373	21579
Xiaodan Liang	61	318	14121

Network Information

Related Institutions (5)

Microsoft

86.9K papers, 4.1M citations

94% related

Google

39.8K papers, 2.1M citations

94% related

Facebook

10.9K papers, 570.1K citations

93% related

AT&T Labs

5.5K papers, 483.1K citations

38.6K papers, 1.3M citations

87% related

Performance

Metrics

7,410

Papers

106,380

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	5
2022	30
2021	1,352
2020	1,671
2019	1,459
2018	863