Institution

Alibaba Group

Company•Hangzhou, China•

About: Alibaba Group is a company organization based out in Hangzhou, China. It is known for research contribution in the topics: Computer science & Terminal (electronics). The organization has 6810 authors who have published 7389 publications receiving 55653 citations. The organization is also known as: Alibaba Group Holding Limited & Alibaba Group (Cayman Islands).

...read moreread less

Topics: Computer science, Terminal (electronics), Graph (abstract data type), Node (networking), Deep learning ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Attention Focusing for Neural Machine Translation by Bridging Source and Target Embeddings

[...]

Shaohui Kuang¹, Junhui Li², António Branco³, Weihua Luo¹, Deyi Xiong² - Show less +1 more•Institutions (3)

Alibaba Group¹, Soochow University (Suzhou)², University of Lisbon³

01 Jul 2018

TL;DR: Experiments and analysis presented in this paper demonstrate that the proposed bridging models are able to significantly improve quality of both sentence translation, in general, and alignment and translation of individual source words with target words, in particular.

...read moreread less

Abstract: In neural machine translation, a source sequence of words is encoded into a vector from which a target sequence is generated in the decoding phase. Differently from statistical machine translation, the associations between source words and their possible target counterparts are not explicitly stored. Source and target words are at the two ends of a long information processing procedure, mediated by hidden states at both the source encoding and the target decoding phases. This makes it possible that a source word is incorrectly translated into a target word that is not any of its admissible equivalent counterparts in the target language. In this paper, we seek to somewhat shorten the distance between source and target words in that procedure, and thus strengthen their association, by means of a method we term bridging source and target word embeddings. We experiment with three strategies: (1) a source-side bridging model, where source word embeddings are moved one step closer to the output target sequence; (2) a target-side bridging model, which explores the more relevant source word embeddings for the prediction of the target sequence; and (3) a direct bridging model, which directly connects source and target word embeddings seeking to minimize errors in the translation of ones by the others. Experiments and analysis presented in this paper demonstrate that the proposed bridging models are able to significantly improve quality of both sentence translation, in general, and alignment and translation of individual source words with target words, in particular.

...read moreread less

29 citations

Posted Content•

Tracklets Predicting Based Adaptive Graph Tracking

[...]

Chaobing Shan¹, Chunbo Wei², Bing Deng², Jianqiang Huang², Xian-Sheng Hua, Xiaoliang Cheng¹, Kewei Liang¹ - Show less +3 more•Institutions (2)

Zhejiang University¹, Alibaba Group²

18 Oct 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work presents an accurate and end-to-end learning framework for multi-object tracking, namely TPAGT, which re-extracts the features of the tracklets in the current frame based on motion predicting, which is the key to solve the problem of features inconsistent.

...read moreread less

Abstract: Most of the existing tracking methods link the detected boxes to the tracklets using a linear combination of feature cosine distances and box overlap. But the problem of inconsistent features of an object in two different frames still exists. In addition, when extracting features, only appearance information is utilized, neither the location relationship nor the information of the tracklets is considered. We present an accurate and end-to-end learning framework for multi-object tracking, namely \textbf{TPAGT}. It re-extracts the features of the tracklets in the current frame based on motion predicting, which is the key to solve the problem of features inconsistent. The adaptive graph neural network in TPAGT is adopted to fuse locations, appearance, and historical information, and plays an important role in distinguishing different objects. In the training phase, we propose the balanced MSE LOSS to successfully overcome the unbalanced samples. Experiments show that our method reaches state-of-the-art performance. It achieves 76.5\% MOTA on the MOT16 challenge and 76.2\% MOTA on the MOT17 challenge.

...read moreread less

29 citations

Proceedings Article•DOI•

Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search.

[...]

Yougen Yuan¹, Cheung-Chi Leung², Lei Xie¹, Hongjie Chen¹, Bin Ma³, Haizhou Li⁴ - Show less +2 more•Institutions (4)

Northwestern Polytechnical University¹, Institute for Infocomm Research Singapore², Alibaba Group³, National University of Singapore⁴

02 Sep 2018

TL;DR: The experiments show that the proposed acoustic word embeddings learned with temporal context for query-by-example (QbE) speech search outperform the state-of-the-art frame-level feature representations and reduce run-time computation since no dynamic time warping is required in QbE speech search.

...read moreread less

Abstract: We propose to learn acoustic word embeddings with temporal context for query-by-example (QbE) speech search. The temporal context includes the leading and trailing word sequences of a word. We assume that there exist spoken word pairs in the training database. We pad the word pairs with their original temporal context to form fixed-length speech segment pairs. We obtain the acoustic word embeddings through a deep convolutional neural network (CNN) which is trained on the speech segment pairs with a triplet loss. Shifting a fixed-length analysis window through the search content, we obtain a running sequence of embeddings. In this way, searching for the spoken query is equivalent to the matching of acoustic word embeddings. The experiments show that our proposed acoustic word embeddings learned with temporal context are effective in QbE speech search. They outperform the state-of-the-art frame-level feature representations and reduce run-time computation since no dynamic time warping is required in QbE speech search. We also find that it is important to have sufficient speech segment pairs to train the deep CNN for effective acoustic word embeddings.

...read moreread less

29 citations

Proceedings Article•DOI•

OpenUE: An Open Toolkit of Universal Extraction from Text

[...]

Ningyu Zhang¹, Shumin Deng¹, Zhen Bi¹, Haiyang Yu¹, Jiacheng Yang, Mosha Chen¹, Fei Huang², Wei Zhang³, Huajun Chen² - Show less +5 more•Institutions (3)

Zhejiang University¹, Alibaba Group², Nanjing Agricultural University³

01 Oct 2020

TL;DR: A prototype model is introduced and an open-source and extensible toolkit called OpenUE for various extraction tasks, which allows developers to train custom models to extract information from the text and supports quick model validation for researchers.

...read moreread less

Abstract: Natural language processing covers a wide variety of tasks with token-level or sentence-level understandings. In this paper, we provide a simple insight that most tasks can be represented in a single universal extraction format. We introduce a prototype model and provide an open-source and extensible toolkit called OpenUE for various extraction tasks. OpenUE allows developers to train custom models to extract information from the text and supports quick model validation for researchers. Besides, OpenUE provides various functional modules to maintain sufficient modularity and extensibility. Except for the toolkit, we also deploy an online demo with restful APIs to support real-time extraction without training and deploying. Additionally, the online system can extract information in various tasks, including relational triple extraction, slot & intent detection, event extraction, and so on. We release the source code, datasets, and pre-trained models to promote future researches in http://github.com/zjunlp/openue.

...read moreread less

29 citations

Proceedings Article•DOI•

Label-Aware Graph Convolutional Networks

[...]

Hao Chen¹, Yue Xu², Feiran Huang³, Zengde Deng⁴, Wenbing Huang⁵, Senzhang Wang⁶, Peng He⁷, Zhoujun Li¹ - Show less +4 more•Institutions (7)

Beihang University¹, Alibaba Group², Jinan University³, The Chinese University of Hong Kong⁴, Tsinghua University⁵, Nanjing University of Aeronautics and Astronautics⁶, Tencent⁷

19 Oct 2020

TL;DR: Zhang et al. as discussed by the authors proposed a label-aware edge classifier that can filter distracting neighbors and add valuable neighbors for each node to refine the original graph into a label aware (LA) graph.

...read moreread less

Abstract: Recent advances in Graph Convolutional Networks (GCNs) have led to state-of-the-art performance on various graph-related tasks. However, most existing GCN models do not explicitly identify whether all the aggregated neighbors are valuable to the learning tasks, which may harm the learning performance. In this paper, we consider the problem of node classification and propose the Label-Aware Graph Convolutional Network (LAGCN) framework which can directly identify valuable neighbors to enhance the performance of existing GCN models. Our contribution is three-fold. First, we propose a label-aware edge classifier that can filter distracting neighbors and add valuable neighbors for each node to refine the original graph into a label-aware (LA) graph. Existing GCN models can directly learn from the LA graph to improve the performance without changing their model architectures. Second, we introduce the concept of positive ratio to evaluate the density of valuable neighbors in the LA graph. Theoretical analysis reveals that using the edge classifier to increase the positive ratio can improve the learning performance of existing GCN models. Third, we conduct extensive node classification experiments on benchmark datasets. The results verify that LAGCN can improve the performance of existing GCN models considerably, in terms of node classification.

...read moreread less

29 citations

Collapse

Authors

Showing all 6829 results

Name	H-index	Papers	Citations
Philip S. Yu	148	1914	107374
Lei Zhang	130	2312	86950
Jian Xu	94	1366	52057
Wei Chu	80	670	28771
Le Song	76	345	21382
Yuan Xie	76	739	24155
Narendra Ahuja	76	474	29517
Rong Jin	75	449	19456
Beng Chin Ooi	73	408	19174
Wotao Yin	72	303	27233
Deng Cai	70	326	24524
Xiaofei He	70	260	28215
Irwin King	67	476	19056
Gang Wang	65	373	21579
Xiaodan Liang	61	318	14121

Network Information

Related Institutions (5)

Microsoft

86.9K papers, 4.1M citations

94% related

Google

39.8K papers, 2.1M citations

94% related

Facebook

10.9K papers, 570.1K citations

93% related

AT&T Labs

5.5K papers, 483.1K citations

38.6K papers, 1.3M citations

87% related

Performance

Metrics

7,410

Papers

106,380

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	5
2022	30
2021	1,352
2020	1,671
2019	1,459
2018	863