Institution

Alibaba Group

Company•Hangzhou, China•

About: Alibaba Group is a company organization based out in Hangzhou, China. It is known for research contribution in the topics: Computer science & Terminal (electronics). The organization has 6810 authors who have published 7389 publications receiving 55653 citations. The organization is also known as: Alibaba Group Holding Limited & Alibaba Group (Cayman Islands).

...read moreread less

Topics: Computer science, Terminal (electronics), Graph (abstract data type), Node (networking), Deep learning ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•

Bilingual Methods for Adaptive Training Data Selection for Machine Translation

[...]

Boxing Chen¹, Roland Kuhn², George Foster², Colin Cherry³, Fei Huang⁴ - Show less +1 more•Institutions (4)

Alibaba Group¹, National Research Council², University of Alberta³, Facebook⁴

01 Jan 2016

TL;DR: This paper proposed a new data selection method which uses semi-supervised convolutional neural networks based on bitokens (Bi-SSCNNs) for training machine translation systems from a large bilingual corpus.

...read moreread less

Abstract: In this paper, we propose a new data selection method which uses semi-supervised convolutional neural networks based on bitokens (Bi-SSCNNs) for training machine translation systems from a large bilingual corpus. In earlier work, we devised a data selection method based on semi-supervised convolutional neural networks (SSCNNs). The new method, Bi-SSCNN, is based on bitokens, which use bilingual information. When the new methods are tested on two translation tasks (Chinese-to-English and Arabic-to-English), they significantly outperform the other three data selection methods in the experiments. We also show that the BiSSCNN method is much more effective than other methods in preventing noisy sentence pairs from being chosen for training. More interestingly, this method only needs a tiny amount of in-domain data to train the selection model, which makes fine-grained topic-dependent translation adaptation possible. In the follow-up experiments, we find that neural machine translation (NMT) is more sensitive to noisy data than statistical machine translation (SMT). Therefore, Bi-SSCNN which can effectively screen out noisy sentence pairs, can benefit NMT much more than SMT.We observed a BLEU improvement over 3 points on an English-to-French WMT task when Bi-SSCNNs were used.

...read moreread less

27 citations

Proceedings Article•DOI•

Aspect Sentiment Classification Towards Question-Answering with Reinforced Bidirectional Attention Network

[...]

Jingjing Wang¹, Changlong Sun², Shoushan Li¹, Xiaozhong Liu³, Luo Si², Min Zhang⁴, Guodong Zhou¹ - Show less +3 more•Institutions (4)

Soochow University (Suzhou)¹, Alibaba Group², Indiana University³, Shanghai Jiao Tong University⁴

01 Jul 2019

TL;DR: A Reinforced Bidirectional Attention Network (RBAN) approach is proposed to address two inherent challenges in ASC-QA, i.e., semantic matching between question and answer, and data noise.

...read moreread less

Abstract: In the literature, existing studies on aspect sentiment classification (ASC) focus on individual non-interactive reviews. This paper extends the research to interactive reviews and proposes a new research task, namely Aspect Sentiment Classification towards Question-Answering (ASC-QA), for real-world applications. This new task aims to predict sentiment polarities for specific aspects from interactive QA style reviews. In particular, a high-quality annotated corpus is constructed for ASC-QA to facilitate corresponding research. On this basis, a Reinforced Bidirectional Attention Network (RBAN) approach is proposed to address two inherent challenges in ASC-QA, i.e., semantic matching between question and answer, and data noise. Experimental results demonstrate the great advantage of the proposed approach to ASC-QA against several state-of-the-art baselines.

...read moreread less

27 citations

Journal Article•DOI•

PlotThread: Creating Expressive Storyline Visualizations using Reinforcement Learning

[...]

Tan Tang¹, Renzhong Li¹, Xinke Wu¹, Shuhan Liu¹, Johannes Knittel, Steffen Koch, Lingyun Yu², Peiran Ren³, Thomas Ertl, Yingcai Wu¹ - Show less +6 more•Institutions (3)

Zhejiang University¹, University of Liverpool², Alibaba Group³

28 Jan 2021-IEEE Transactions on Visualization and Computer Graphics

TL;DR: In this article, a reinforcement learning framework is proposed to train an AI agent that assists users in exploring the design space efficiently and generating well-optimized storylines, and an authoring tool that integrates a set of flexible interactions to support easy customization of storyline visualizations.

...read moreread less

Abstract: Storyline visualizations are an effective means to present the evolution of plots and reveal the scenic interactions among characters. However, the design of storyline visualizations is a difficult task as users need to balance between aesthetic goals and narrative constraints. Despite that the optimization-based methods have been improved significantly in terms of producing aesthetic and legible layouts, the existing (semi-) automatic methods are still limited regarding 1) efficient exploration of the storyline design space and 2) flexible customization of storyline layouts. In this work, we propose a reinforcement learning framework to train an AI agent that assists users in exploring the design space efficiently and generating well-optimized storylines. Based on the framework, we introduce PlotThread, an authoring tool that integrates a set of flexible interactions to support easy customization of storyline visualizations. To seamlessly integrate the AI agent into the authoring process, we employ a mixed-initiative approach where both the agent and designers work on the same canvas to boost the collaborative design of storylines. We evaluate the reinforcement learning model through qualitative and quantitative experiments and demonstrate the usage of PlotThread using a collection of use cases.

...read moreread less

27 citations

Journal Article•DOI•

Modeling Data, Information and Knowledge for Security Protection of Hybrid IoT and Edge Resources

[...]

Yucong Duan¹, Xiaobing Sun², Haoyang Che, Chunjie Cao¹, Zhao Li³, Xiaoxian Yang⁴ - Show less +2 more•Institutions (4)

Hainan University¹, Yangzhou University², Alibaba Group³, Shanghai Second Polytechnic University⁴

25 Jul 2019-IEEE Access

TL;DR: This work proposes to cognitively formalize the semantics of the key elements of the DIKW in a conceptual process and shows the initial case for using this formalization to construct security protection solutions for edge computing scenarios centering on type conversions among typed resources formalized through the proposed formalization of theDIKW.

...read moreread less

Abstract: Currently, with the growth of the Internet of Things devices and the emergence of massive edge resources, security protection content has not only empowered IoT devices with the accumulation of networked computing and storage as a flexible whole but also enabled storing, transferring and processing DIKW (data, information, knowledge, and wisdom) content at the edge of the network from multiple devices in a mobile manner. However, understanding various DIKW content or resources poses a conceptual challenge in unifying the semantics of the core concepts as a starting point. Through building metamodels of the DIKW framework, we propose to cognitively formalize the semantics of the key elements of the DIKW in a conceptual process. The formalization centers on modeling the perceived world only by relationships or semantics as the prime atomic comprising elements. Based on this cognitive world model, we reveal the difference between relationships and entities during the conceptualization process as a foundation for distinguishing data and information. Thereafter, we show the initial case for using this formalization to construct security protection solutions for edge computing scenarios centering on type conversions among typed resources formalized through our proposed formalization of the DIKW.

...read moreread less

27 citations

Proceedings Article•DOI•

Heterogeneous Embedding Propagation for Large-Scale E-Commerce User Alignment

[...]

Vincent W. Zheng¹, Mo Sha², Yuchen Li³, Hongxia Yang⁴, Yuan Fang³, Zhenjie Zhang¹, Kian-Lee Tan², Kevin Chen-Chuan Chang⁵ - Show less +4 more•Institutions (5)

Agency for Science, Technology and Research¹, National University of Singapore², Singapore Management University³, Alibaba Group⁴, University of Illinois at Urbana–Champaign⁵

01 Nov 2018

TL;DR: A novel Heterogeneous Embedding Propagation model is proposed, which is to iteratively reconstruct a node's embedding from its heterogeneous neighbors in a weighted manner, and meanwhile propagate its embedding updates from reconstruction loss and/or classification loss to its neighbors.

...read moreread less

Abstract: We study the important problem of user alignment in e-commerce: to predict whether two online user identities that access an e-commerce site from different devices belong to one real-world person. As input, we have a set of user activity logs from Taobao and some labeled user identity linkages. User activity logs can be modeled using a heterogeneous interaction graph (HIG), and subsequently the user alignment task can be formulated as a semi-supervised HIG embedding problem. HIG embedding is challenging for two reasons: its heterogeneous nature and the presence of edge features. To address the challenges, we propose a novel Heterogeneous Embedding Propagation (HEP) model. The core idea is to iteratively reconstruct a node's embedding from its heterogeneous neighbors in a weighted manner, and meanwhile propagate its embedding updates from reconstruction loss and/or classification loss to its neighbors. We conduct extensive experiments on large-scale datasets from Taobao, demonstrating that HEP significantly outperforms state-of-the-art baselines often by more than 10% in F-scores.

...read moreread less

27 citations

Collapse

Authors

Showing all 6829 results

Name	H-index	Papers	Citations
Philip S. Yu	148	1914	107374
Lei Zhang	130	2312	86950
Jian Xu	94	1366	52057
Wei Chu	80	670	28771
Le Song	76	345	21382
Yuan Xie	76	739	24155
Narendra Ahuja	76	474	29517
Rong Jin	75	449	19456
Beng Chin Ooi	73	408	19174
Wotao Yin	72	303	27233
Deng Cai	70	326	24524
Xiaofei He	70	260	28215
Irwin King	67	476	19056
Gang Wang	65	373	21579
Xiaodan Liang	61	318	14121

Network Information

Related Institutions (5)

Microsoft

86.9K papers, 4.1M citations

94% related

Google

39.8K papers, 2.1M citations

94% related

Facebook

10.9K papers, 570.1K citations

93% related

AT&T Labs

5.5K papers, 483.1K citations

38.6K papers, 1.3M citations

87% related

Performance

Metrics

7,410

Papers

106,380

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	5
2022	30
2021	1,352
2020	1,671
2019	1,459
2018	863