Institution

Alibaba Group

Company · Hangzhou, China
About: Alibaba Group is a company based in Hangzhou, China. It is known for research contributions in the topics of Computer science and Terminal (electronics). The organization has 6810 authors who have published 7389 publications receiving 55653 citations. It is also known as Alibaba Group Holding Limited and Alibaba Group (Cayman Islands).


Papers
Journal ArticleDOI
TL;DR: The segmentation network, named the enhanced fully convolutional network (EFCN) for its significantly enhanced structure over FCN, achieves state-of-the-art results on the ADE20K, Pascal Context, SUN-RGBD, and Pascal VOC 2012 segmentation datasets.
Abstract: In this paper, we address the challenging task of scene segmentation. We first discuss and compare two widely used approaches to retaining detailed spatial information from the pre-trained convolutional neural network (CNN): "dilation" and "skip". Then, we demonstrate that the parsing performance of the "skip" network can be noticeably improved by modifying the parameterization of the skip layers. Furthermore, we introduce a "dense skip" architecture to retain a rich set of low-level information from the pre-trained CNN, which is essential for improving low-level parsing performance. Meanwhile, we propose a convolutional context network (CCN) placed on top of the pre-trained CNN to aggregate contexts for high-level feature maps, so that robust high-level parsing can be achieved. We name our segmentation network the enhanced fully convolutional network (EFCN) because of its significantly enhanced structure over FCN. Extensive experimental studies justify each contribution separately. Without bells and whistles, EFCN achieves state-of-the-art results on the ADE20K, Pascal Context, SUN-RGBD, and Pascal VOC 2012 segmentation datasets.

39 citations
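The "dense skip" idea above lends itself to a compact illustration. Below is a minimal PyTorch sketch, assuming a generic pre-trained backbone that exposes feature maps at several strides; the channel sizes, the 1x1 skip projections, and the fuse-by-concatenation head are illustrative choices, not the authors' exact EFCN.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DenseSkipHead(nn.Module):
    """Fuse skip connections from every backbone stage (a "dense skip" sketch)."""
    def __init__(self, in_channels=(256, 512, 1024, 2048), num_classes=150):
        super().__init__()
        # One 1x1 "skip" projection per backbone stage; re-parameterizing
        # these layers is what the abstract reports as a key improvement.
        self.skips = nn.ModuleList(
            nn.Conv2d(c, 128, kernel_size=1) for c in in_channels
        )
        self.classifier = nn.Conv2d(128 * len(in_channels), num_classes, 1)

    def forward(self, features):
        # features: backbone maps ordered shallow -> deep (finest map first).
        target = features[0].shape[-2:]
        fused = [
            F.interpolate(skip(f), size=target, mode="bilinear", align_corners=False)
            for skip, f in zip(self.skips, features)
        ]
        # Dense skip: concatenate low- and high-level maps before classifying.
        return self.classifier(torch.cat(fused, dim=1))
```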

Proceedings ArticleDOI
09 Jun 2021
TL;DR: ResTune, as discussed by the authors, leverages tuning experience from historical tasks and transfers the accumulated knowledge to accelerate the tuning of new tasks, significantly reducing tuning time through a meta-learning based approach.
Abstract: Modern database management systems (DBMSs) contain tens to hundreds of critical performance tuning knobs that determine the system's runtime behavior. To reduce the total cost of ownership, cloud database providers invest substantial effort in automatically optimizing resource utilization by tuning these knobs. There are two challenges. First, the tuning system should always abide by the service level agreement (SLA) while optimizing resource utilization, which imposes strict constraints on the tuning process. Second, the tuning time should be reasonably short, since time-consuming tuning is not practical for production or online troubleshooting. In this paper, we design ResTune to automatically optimize resource utilization without violating SLA constraints on throughput and latency requirements. ResTune leverages the tuning experience from historical tasks and transfers the accumulated knowledge to accelerate the tuning of new tasks. The prior knowledge from historical tuning tasks is represented through an ensemble model that learns the similarity between the historical workloads and the target, which significantly reduces the tuning time via a meta-learning based approach. ResTune can efficiently handle different workloads and various hardware environments. We perform evaluations using benchmarks and real-world workloads on different types of resources. The results show that, compared with manually tuned configurations, ResTune reduces CPU utilization, I/O, and memory usage by 65%, 87%, and 39% on average, respectively. Compared with state-of-the-art methods, ResTune finds better configurations with up to ~18x speedups.

39 citations
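As a rough illustration of tuning under hard SLA constraints, the sketch below randomly samples knob configurations and keeps the cheapest one that meets the throughput and latency requirements. The knob names, ranges, and the `evaluate` stub are hypothetical placeholders; ResTune's meta-learned ensemble surrogate and benchmark harness are not reproduced here.

```python
import random

# Hypothetical knobs and ranges, for illustration only.
KNOB_RANGES = {
    "buffer_pool_mb": (128, 8192),
    "max_connections": (50, 2000),
}

def sample_config():
    return {k: random.randint(lo, hi) for k, (lo, hi) in KNOB_RANGES.items()}

def evaluate(config):
    # Placeholder for benchmarking the DBMS under `config`; returns
    # (cpu_utilization, throughput_tps, p99_latency_ms).
    return random.uniform(0.2, 0.9), random.uniform(800, 1200), random.uniform(5, 50)

def tune(sla_tps=900.0, sla_p99_ms=30.0, budget=50):
    best_cfg, best_cpu = None, float("inf")
    for _ in range(budget):
        cfg = sample_config()
        cpu, tps, p99 = evaluate(cfg)
        # Hard SLA constraints: a config only competes if it satisfies both.
        if tps >= sla_tps and p99 <= sla_p99_ms and cpu < best_cpu:
            best_cfg, best_cpu = cfg, cpu
    return best_cfg, best_cpu

print(tune())
```

A real tuner would replace the random sampler with a surrogate-guided search (e.g. constrained Bayesian optimization), which is where the paper's meta-learning across historical tasks pays off.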

Journal ArticleDOI
TL;DR: A visual data analytics framework is proposed to enhance social media research using deep learning models, including complexity, similarity, and consistency measures that can play important roles in the persuasiveness of social media content.
Abstract: This research methods article proposes a visual data analytics framework to enhance social media research using deep learning models. Drawing on the information systems and marketing literature, complemented with data-driven methods, we propose a number of visual and textual content features, including complexity, similarity, and consistency measures, that can play important roles in the persuasiveness of social media content. We then employ state-of-the-art machine learning approaches such as deep learning and text mining to operationalize these new content features in a scalable and systematic manner. We validate the newly developed features against human coders on Amazon Mechanical Turk. Furthermore, we conduct two case studies with a large social media dataset from Tumblr to show the effectiveness of the proposed content features. The first case study demonstrates that both theoretically motivated and data-driven features significantly improve the model's power to predict the popularity of a post, and the second highlights the relationships between content features and consumer evaluations of the corresponding posts. The proposed research framework illustrates how deep learning methods can enhance the analysis of unstructured visual and textual data for social media research.

39 citations
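To make the feature-engineering step concrete, here is a toy sketch of two proxy measures in the spirit of the article's complexity and consistency features. The compression-ratio complexity proxy and the Jaccard consistency score are stand-ins chosen for illustration; the paper operationalizes such features with deep models.

```python
import zlib

def visual_complexity(image_bytes: bytes) -> float:
    # Crude proxy: visually busier images compress less well, so the
    # compressed-to-raw size ratio rises with visual complexity.
    return len(zlib.compress(image_bytes)) / max(len(image_bytes), 1)

def text_consistency(caption: str, tags: list) -> float:
    # Jaccard overlap between caption words and tag words as a
    # consistency measure between two textual elements of a post.
    cap = set(caption.lower().split())
    tag = set(t.lower() for t in tags)
    return len(cap & tag) / max(len(cap | tag), 1)

print(text_consistency("sunset over the beach", ["beach", "sunset", "travel"]))  # 0.4
```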

Journal ArticleDOI
TL;DR: This paper addresses the problem of grouping data points sampled from a union of multiple subspaces in the presence of outliers and develops information theoretic subspace clustering methods via correntropy, which further improve the robustness of LRR subspace clustering and outperform other state-of-the-art subspace clustering methods.
Abstract: This paper addresses the problem of grouping data points sampled from a union of multiple subspaces in the presence of outliers. Information theoretic objective functions are proposed that combine structured low-rank representations (LRRs), which capture the global structure of the data, with information theoretic measures that handle outliers. In the theoretical part, we point out that group sparsity-induced measures (the $\ell_{2,1}$-norm, $\ell_{\alpha}$-norm, and correntropy) can be justified from the viewpoint of half-quadratic (HQ) optimization, which facilitates both convergence study and algorithmic development. In particular, a general formulation is proposed to unify HQ-based group sparsity methods into a common framework. In the algorithmic part, we develop information theoretic subspace clustering methods via correntropy. With the help of Parzen window estimation, correntropy is used to handle either outliers under arbitrary distributions or sample-specific errors in data. Pairwise link constraints are further treated as a prior structure on the LRRs. Based on the HQ framework, iterative algorithms are developed to solve the nonconvex information theoretic loss functions. Experimental results on three benchmark databases show that our methods further improve the robustness of LRR subspace clustering and outperform other state-of-the-art subspace clustering methods.

39 citations
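The half-quadratic view has a compact numeric illustration: with a Gaussian kernel $k_\sigma(e) = \exp(-e^2 / 2\sigma^2)$, maximizing correntropy amounts to iteratively reweighted least squares in which each sample is weighted by the kernel of its current residual, so outliers are progressively down-weighted. The sketch below applies this to robust mean estimation, a deliberately simplified stand-in for the paper's LRR formulation.

```python
import numpy as np

def hq_robust_mean(x, sigma=1.0, iters=20):
    # Alternate between the HQ auxiliary weights and a weighted LS update.
    mu = x.mean()
    for _ in range(iters):
        w = np.exp(-((x - mu) ** 2) / (2 * sigma ** 2))  # correntropy weights
        mu = (w * x).sum() / w.sum()                      # weighted least squares
    return mu

rng = np.random.default_rng(0)
data = np.concatenate([rng.normal(0.0, 0.1, 100), [10.0, 12.0]])  # two outliers
print(hq_robust_mean(data))  # close to 0 despite the outliers
```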

Posted Content
TL;DR: Wang et al., as mentioned in this paper, introduce a graph convolution based model to combine the textual and visual information present in VRDs; graph embeddings are trained to summarize the context of a text segment in the document and are further combined with text embeddings for entity extraction.
Abstract: Visually rich documents (VRDs) are ubiquitous in daily business and life; examples include purchase receipts, insurance policy documents, and customs declaration forms. In VRDs, visual and layout information is critical for document understanding, and the text in such documents cannot be serialized into a one-dimensional sequence without losing information. Classic information extraction models such as BiLSTM-CRF typically operate on text sequences and do not incorporate visual features. In this paper, we introduce a graph convolution based model to combine the textual and visual information present in VRDs. Graph embeddings are trained to summarize the context of a text segment in the document and are further combined with text embeddings for entity extraction. Extensive experiments show that our method outperforms BiLSTM-CRF baselines by significant margins on two real-world datasets. Additionally, ablation studies are performed to evaluate the effectiveness of each component of our model.

39 citations
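A minimal sketch of the graph-convolution idea follows: each text segment is a node carrying a text embedding, edges carry relative layout features between segment bounding boxes, and one message-passing step mixes neighbor context into each node. The dimensions and the edge featurization are assumptions made for illustration, not the paper's exact model.

```python
import torch
import torch.nn as nn

class SegmentGraphConv(nn.Module):
    """One message-passing step over a fully connected graph of text segments."""
    def __init__(self, text_dim=128, edge_dim=4, hidden=128):
        super().__init__()
        self.msg = nn.Linear(text_dim + edge_dim, hidden)  # neighbor message
        self.upd = nn.Linear(text_dim + hidden, text_dim)  # node update

    def forward(self, node_feats, edge_feats):
        # node_feats: (N, text_dim) text embeddings of N segments.
        # edge_feats: (N, N, edge_dim) relative layout, e.g. (dx, dy, w ratio, h ratio).
        n = node_feats.size(0)
        neighbors = node_feats.unsqueeze(0).expand(n, n, -1)  # [i, j] = segment j
        msgs = torch.relu(self.msg(torch.cat([neighbors, edge_feats], dim=-1)))
        agg = msgs.mean(dim=1)  # aggregate messages from all neighbors
        # Combine each node's own text embedding with its aggregated context,
        # yielding a context-aware segment representation for entity extraction.
        return torch.relu(self.upd(torch.cat([node_feats, agg], dim=-1)))
```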


Authors

Showing all 6829 results

Name               H-index   Papers   Citations
Philip S. Yu       148       1914     107374
Lei Zhang          130       2312     86950
Jian Xu            94        1366     52057
Wei Chu            80        670      28771
Le Song            76        345      21382
Yuan Xie           76        739      24155
Narendra Ahuja     76        474      29517
Rong Jin           75        449      19456
Beng Chin Ooi      73        408      19174
Wotao Yin          72        303      27233
Deng Cai           70        326      24524
Xiaofei He         70        260      28215
Irwin King         67        476      19056
Gang Wang          65        373      21579
Xiaodan Liang      61        318      14121

Network Information

Related Institutions (5)

Microsoft: 86.9K papers, 4.1M citations (94% related)
Google: 39.8K papers, 2.1M citations (94% related)
Facebook: 10.9K papers, 570.1K citations (93% related)
AT&T Labs: 5.5K papers, 483.1K citations (90% related)

Performance Metrics
No. of papers from the Institution in previous years
Year   Papers
2023   5
2022   30
2021   1,352
2020   1,671
2019   1,459
2018   863