Institution

Alibaba Group

Company•Hangzhou, China•

About: Alibaba Group is a company organization based out in Hangzhou, China. It is known for research contribution in the topics: Computer science & Terminal (electronics). The organization has 6810 authors who have published 7389 publications receiving 55653 citations. The organization is also known as: Alibaba Group Holding Limited & Alibaba Group (Cayman Islands).

...read moreread less

Topics: Computer science, Terminal (electronics), Graph (abstract data type), Node (networking), Deep learning ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Posted Content•

A Hybrid Retrieval-Generation Neural Conversation Model

[...]

Liu Yang¹, Junjie Hu², Minghui Qiu³, Chen Qu¹, Jianfeng Gao⁴, W. Bruce Croft¹, Xiaodong Liu⁵, Yelong Shen⁶, Jingjing Liu⁵ - Show less +5 more•Institutions (6)

University of Massachusetts Amherst¹, Carnegie Mellon University², Alibaba Group³, Association for Computing Machinery⁴, Microsoft⁵, Tencent⁶

19 Apr 2019-arXiv: Information Retrieval

TL;DR: This paper proposes a hybrid neural conversation model that combines the merits of both response retrieval and generation methods and shows that the proposed model outperforms both retrieval- based methods and generation-based methods under both automatic evaluation metrics and human evaluation.

...read moreread less

Abstract: Intelligent personal assistant systems that are able to have multi-turn conversations with human users are becoming increasingly popular. Most previous research has been focused on using either retrieval-based or generation-based methods to develop such systems. Retrieval-based methods have the advantage of returning fluent and informative responses with great diversity. However, the performance of the methods is limited by the size of the response repository. On the other hand, generation-based methods can produce highly coherent responses on any topics. But the generated responses are often generic and not informative due to the lack of grounding knowledge. In this paper, we propose a hybrid neural conversation model that combines the merits of both response retrieval and generation methods. Experimental results on Twitter and Foursquare data show that the proposed model outperforms both retrieval-based methods and generation-based methods (including a recently proposed knowledge-grounded neural conversation model) under both automatic evaluation metrics and human evaluation. We hope that the findings in this study provide new insights on how to integrate text retrieval and text generation models for building conversation systems.

...read moreread less

56 citations

Posted Content•

Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems.

[...]

Chang Zhou¹, Jianxin Ma¹, Jianwei Zhang¹, Jingren Zhou¹, Hongxia Yang¹ - Show less +1 more•Institutions (1)

Alibaba Group¹

20 May 2020-arXiv: Information Retrieval

TL;DR: CLRec is designed, a contrastive learning method to improve DCG in terms of fairness, effectiveness and efficiency in recommender systems with extremely large candidate size, and improved upon CLRec and proposes Multi-CLRec, for accurate multi-intention aware bias reduction.

...read moreread less

Abstract: Deep candidate generation (DCG) that narrows down the collection of relevant items from billions to hundreds via representation learning is essential to large-scale recommender systems. Standard approaches approximate maximum likelihood estimation (MLE) through sampling for better scalability and address the problem of DCG in a way similar to language modeling. However, live recommender systems face severe unfairness of exposure with a vocabulary several orders of magnitude larger than that of natural language, implying that (1) MLE will preserve and even exacerbate the exposure bias in the long run in order to faithfully fit the observed samples, and (2) suboptimal sampling and inadequate use of item features can lead to inferior representations for the unfairly ignored items. In this paper, we introduce CLRec, a Contrastive Learning paradigm that has been successfully deployed in a real-world massive recommender system, to alleviate exposure bias in DCG. We theoretically prove that a popular choice of contrastive loss is equivalently reducing the exposure bias via inverse propensity scoring, which provides a new perspective on the effectiveness of contrastive learning. We further employ a fixed-size queue to store the items' representations computed in previously processed batches, and use the queue to serve as an effective sampler of negative examples. This queue-based design provides great efficiency in incorporating rich features of the thousand negative items per batch thanks to computation reuse. Extensive offline analyses and four-month online A/B tests in Mobile Taobao demonstrate substantial improvement, including a dramatic reduction in the Matthew effect.

...read moreread less

56 citations

Proceedings Article•DOI•

CATN: Cross-Domain Recommendation for Cold-Start Users via Aspect Transfer Network

[...]

Cheng Zhao¹, Chenliang Li¹, Rong Xiao², Hongbo Deng², Aixin Sun³ - Show less +1 more•Institutions (3)

Wuhan University¹, Alibaba Group², Nanyang Technological University³

25 Jul 2020

TL;DR: Wang et al. as mentioned in this paper proposed a cross-domain recommendation framework via aspect transfer network for cold-start users (named CATN), which extracts multiple aspects for each user and each item from their review documents, and learns aspect correlations across domains with an attention mechanism.

...read moreread less

Abstract: In a large recommender system, the products (or items) could be in many different categories or domains. Given two relevant domains (e.g., Book and Movie), users may have interactions with items in one domain but not in the other domain. To the latter, these users are considered as cold-start users. How to effectively transfer users' preferences based on their interactions from one domain to the other relevant domain, is the key issue in cross-domain recommendation. Inspired by the advances made in review-based recommendation, we propose to model user preference transfer at aspect-level derived from reviews. To this end, we propose a cross-domain recommendation framework via aspect transfer network for cold-start users (named CATN). CATN is devised to extract multiple aspects for each user and each item from their review documents, and learn aspect correlations across domains with an attention mechanism. In addition, we further exploit auxiliary reviews from like-minded users to enhance a user's aspect representations. Then, an end-to-end optimization framework is utilized to strengthen the robustness of our model. On real-world datasets, the proposed CATN outperforms SOTA models significantly in terms of rating prediction accuracy. Further analysis shows that our model is able to reveal user aspect connections across domains at a fine level of granularity, making the recommendation explainable.

...read moreread less

56 citations

Proceedings Article•DOI•

PD-GAN: Adversarial Learning for Personalized Diversity-Promoting Recommendation

[...]

Qiong Wu¹, Yong Liu¹, Chunyan Miao¹, Binqiang Zhao², Yin Zhao², Lu Guan² - Show less +2 more•Institutions (2)

Nanyang Technological University¹, Alibaba Group²

01 Aug 2019

TL;DR: Experimental results show that PD-GAN is superior to generate recommendations that are both diverse and relevant.

...read moreread less

Abstract: This paper proposes Personalized Diversity-promoting GAN (PD-GAN), a novel recommendation model to generate diverse, yet relevant recommendations. Specifically, for each user, a generator recommends a set of diverse and relevant items by sequentially sampling from a personalized Determinantal Point Process (DPP) kernel matrix. This kernel matrix is constructed by two learnable components: the general co-occurrence of diverse items and the user's personal preference to items. To learn the first component, we propose a novel pairwise learning paradigm using training pairs, and each training pair consists of a set of diverse items and a set of similar items randomly sampled from the observed data of all users. The second component is learnt through adversarial training against a discriminator which strives to distinguish between recommended items and the ground-truth sets randomly sampled from the observed data of the target user. Experimental results show that PD-GAN is superior to generate recommendations that are both diverse and relevant.

...read moreread less

55 citations

Proceedings Article•DOI•

Data Poisoning Attack against Knowledge Graph Embedding

[...]

Hengtong Zhang¹, Tianhang Zheng¹, Jing Gao¹, Chenglin Miao¹, Lu Su¹, Yaliang Li², Kui Ren³ - Show less +3 more•Institutions (3)

University at Buffalo¹, Alibaba Group², Zhejiang University³

01 Aug 2019

TL;DR: This work proposes a collection of data poisoning attack strategies, which can effectively manipulate the plausibility of arbitrary targeted facts in a knowledge graph by adding or deleting facts on the graph.

...read moreread less

Abstract: Knowledge graph embedding (KGE) is a technique for learning continuous embeddings for entities and relations in the knowledge graph.Due to its benefit to a variety of downstream tasks such as knowledge graph completion, question answering and recommendation, KGE has gained significant attention recently. Despite its effectiveness in a benign environment, KGE' robustness to adversarial attacks is not well-studied. Existing attack methods on graph data cannot be directly applied to attack the embeddings of knowledge graph due to its heterogeneity. To fill this gap, we propose a collection of data poisoning attack strategies, which can effectively manipulate the plausibility of arbitrary targeted facts in a knowledge graph by adding or deleting facts on the graph. The effectiveness and efficiency of the proposed attack strategies are verified by extensive evaluations on two widely-used benchmarks.

...read moreread less

55 citations

Collapse

Authors

Showing all 6829 results

Name	H-index	Papers	Citations
Philip S. Yu	148	1914	107374
Lei Zhang	130	2312	86950
Jian Xu	94	1366	52057
Wei Chu	80	670	28771
Le Song	76	345	21382
Yuan Xie	76	739	24155
Narendra Ahuja	76	474	29517
Rong Jin	75	449	19456
Beng Chin Ooi	73	408	19174
Wotao Yin	72	303	27233
Deng Cai	70	326	24524
Xiaofei He	70	260	28215
Irwin King	67	476	19056
Gang Wang	65	373	21579
Xiaodan Liang	61	318	14121

Network Information

Related Institutions (5)

Microsoft

86.9K papers, 4.1M citations

94% related

Google

39.8K papers, 2.1M citations

94% related

Facebook

10.9K papers, 570.1K citations

93% related

AT&T Labs

5.5K papers, 483.1K citations

38.6K papers, 1.3M citations

87% related

Performance

Metrics

7,410

Papers

106,380

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	5
2022	30
2021	1,352
2020	1,671
2019	1,459
2018	863