Institution

Alibaba Group

Company · Hangzhou, China
About: Alibaba Group is a company based in Hangzhou, China. It is known for research contributions in the topics of Computer science and Terminal (electronics). The organization has 6810 authors who have published 7389 publications receiving 55653 citations. The organization is also known as Alibaba Group Holding Limited and Alibaba Group (Cayman Islands).


Papers
Proceedings Article
01 Jun 2021
TL;DR: Wang et al. propose a Generalized Pooling Operator (GPO) that automatically adapts itself to the best pooling strategy for different features, requiring no manual tuning while staying effective and efficient.
Abstract: Visual Semantic Embedding (VSE) is a dominant approach for vision-language retrieval, which aims at learning a deep embedding space such that visual data are embedded close to their semantic text labels or descriptions. Recent VSE models use complex methods to better contextualize and aggregate multi-modal features into holistic embeddings. However, we discover that surprisingly simple (but carefully selected) global pooling functions (e.g., max pooling) outperform those complex models, across different feature extractors. Despite its simplicity and effectiveness, seeking the best pooling function for different data modality and feature extractor is costly and tedious, especially when the size of features varies (e.g., text, video). Therefore, we propose a Generalized Pooling Operator (GPO), which learns to automatically adapt itself to the best pooling strategy for different features, requiring no manual tuning while staying effective and efficient. We extend the VSE model using this proposed GPO and denote it as VSE∞. Without bells and whistles, VSE∞ outperforms previous VSE methods significantly on image-text retrieval benchmarks across popular feature extractors. With a simple adaptation, variants of VSE∞ further demonstrate its strength by achieving the new state of the art on two video-text retrieval datasets. Comprehensive experiments and visualizations confirm that GPO always discovers the best pooling strategy and can be a plug-and-play feature aggregation module for standard VSE models. Code and pre-trained models are available at http://jcchen.me/vse_infty/
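The core idea is simple enough to sketch. Below is a minimal, illustrative pooling module in PyTorch that learns a weighted combination of sorted feature values, interpolating between mean and max pooling; the class name and the plain learnable weight vector (instead of the small sequence model the paper uses to generate weights for variable-length inputs) are assumptions for illustration, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleGeneralizedPooling(nn.Module):
    """Illustrative pooling that learns a weighted combination of
    sorted feature values, interpolating between mean- and max-pooling."""

    def __init__(self, max_seq_len: int):
        super().__init__()
        # One learnable score per (sorted) position; softmax turns them into
        # mixing weights. The actual GPO generates such weights with a small
        # sequence model so it can handle arbitrary sequence lengths.
        self.position_scores = nn.Parameter(torch.zeros(max_seq_len))

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        # features: (batch, seq_len, dim), e.g. region or token features
        batch, seq_len, dim = features.shape
        # Sort each dimension's values over the sequence, largest first.
        sorted_feats, _ = features.sort(dim=1, descending=True)
        weights = F.softmax(self.position_scores[:seq_len], dim=0)
        # Weighted sum over sorted positions -> (batch, dim) holistic embedding.
        return (sorted_feats * weights.view(1, seq_len, 1)).sum(dim=1)

# Uniform weights recover mean pooling; a one-hot weight on the first
# sorted position recovers max pooling.
pooled = SimpleGeneralizedPooling(max_seq_len=36)(torch.randn(2, 36, 1024))
```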

37 citations

Patent
21 Sep 2016
TL;DR: In this article, a device and method for automatically allocating computing resources is described: a task is received from a client together with a resource description manifest representing the resource needs of its instances, and an initial computing resource allocation of a cluster of machines is determined from the resource needs stated in that manifest.
Abstract: A device and method for automatically allocating computing resources is disclosed herein. The method includes receiving a task from a client, the task including a plurality of instances and a resource description manifest representing resource needs of the plurality of instances; determining an initial computing resource allocation of a cluster of machines based on the resource description manifest, wherein the initial computing resource allocation is determined based on the resource needs included in the resource description manifest; determining that the resource description manifest indicates a request to utilize an actual computing resource allocation in excess of the initial computing resource allocation; configuring a plurality of actual computing resources to process the plurality of instances, wherein the plurality of actual computing resources are configured to utilize resources in excess of the initial computing resource allocation; and executing the plurality of instances using the plurality of actual computing resources.
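As a rough illustration of the claimed workflow (not the patent's actual implementation), the sketch below walks through the four steps: read a resource description manifest, compute an initial allocation, detect a request to exceed it, and execute the instances with the configured resources. All field names, the 2x burst cap, and the helper run_instance are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class ResourceManifest:
    cpu_cores: int          # requested baseline CPU per instance
    memory_gb: int          # requested baseline memory per instance
    allow_burst: bool       # whether usage may exceed the initial allocation

@dataclass
class Task:
    instances: list         # opaque work items submitted by the client
    manifest: ResourceManifest

def run_instance(instance, cores: int) -> None:
    print(f"running {instance!r} with {cores} core(s)")

def allocate_and_run(task: Task, cluster_capacity_cores: int) -> None:
    # Step 1: initial allocation derived from the manifest's stated needs.
    initial_cores = task.manifest.cpu_cores * len(task.instances)

    # Step 2: check whether the manifest asks to exceed that allocation.
    if task.manifest.allow_burst:
        # Step 3: configure actual resources above the initial allocation,
        # capped by what the cluster can provide (2x is an arbitrary cap here).
        actual_cores = min(cluster_capacity_cores, initial_cores * 2)
    else:
        actual_cores = initial_cores

    # Step 4: execute the instances with the configured resources.
    for instance in task.instances:
        run_instance(instance, cores=actual_cores // len(task.instances))

# Example: three instances, each asking for 2 cores / 4 GB, burst allowed.
allocate_and_run(
    Task(instances=["i0", "i1", "i2"],
         manifest=ResourceManifest(cpu_cores=2, memory_gb=4, allow_burst=True)),
    cluster_capacity_cores=16)
```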

37 citations

Posted Content
Dai Rui, Shenkun Xu, Qian Gu, Chenguang Ji, Kaikui Liu
TL;DR: The Hybrid Spatio-Temporal Graph Convolutional Network (H-STGCN) "deduces" future travel time by exploiting upcoming traffic volume data, taking advantage of the piecewise-linear flow-density relationship.
Abstract: Traffic forecasting has recently attracted increasing interest due to the popularity of online navigation services, ridesharing and smart city projects. Owing to the non-stationary nature of road traffic, forecasting accuracy is fundamentally limited by the lack of contextual information. To address this issue, we propose the Hybrid Spatio-Temporal Graph Convolutional Network (H-STGCN), which is able to "deduce" future travel time by exploiting the data of upcoming traffic volume. Specifically, we propose an algorithm to acquire the upcoming traffic volume from an online navigation engine. Taking advantage of the piecewise-linear flow-density relationship, a novel transformer structure converts the upcoming volume into its equivalent in travel time. We combine this signal with the commonly-utilized travel-time signal, and then apply graph convolution to capture the spatial dependency. Particularly, we construct a compound adjacency matrix which reflects the innate traffic proximity. We conduct extensive experiments on real-world datasets. The results show that H-STGCN remarkably outperforms state-of-the-art methods in various metrics, especially for the prediction of non-recurring congestion.
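To make the two key ingredients concrete, here is a hedged sketch: a piecewise-linear mapping from forecast volume to an equivalent travel time, followed by one normalized graph-convolution step over road segments. The breakpoints, slopes, and toy adjacency are illustrative assumptions; the paper's transformer-based conversion and compound adjacency construction are more elaborate.

```python
import numpy as np

def volume_to_travel_time(volume, free_flow_tt, breakpoints, slopes):
    """Map forecast traffic volume to an equivalent travel time using a
    piecewise-linear relationship (illustrative breakpoints and slopes)."""
    tt = np.full_like(volume, free_flow_tt, dtype=float)
    for bp, slope in zip(breakpoints, slopes):
        tt += slope * np.clip(volume - bp, 0.0, None)  # add a kink at each breakpoint
    return tt

def graph_convolution(adjacency, features, weight):
    """One symmetric-normalised graph-convolution step over road segments."""
    deg = adjacency.sum(axis=1)
    norm = adjacency / np.sqrt(np.outer(deg, deg))
    return np.maximum(norm @ features @ weight, 0.0)   # ReLU

# Toy example: 3 road segments, upcoming volume per segment.
volume = np.array([20.0, 55.0, 90.0])
equiv_tt = volume_to_travel_time(volume, free_flow_tt=60.0,
                                 breakpoints=[40.0, 80.0], slopes=[1.5, 3.0])
adjacency = np.eye(3) + np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
# Stack the volume-derived signal with an observed travel-time signal.
signals = np.stack([equiv_tt, np.array([62.0, 95.0, 240.0])], axis=1)
hidden = graph_convolution(adjacency, signals, weight=np.random.randn(2, 4))
```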

36 citations

Posted Content
TL;DR: This paper designs a novel classifier determinacy disparity (CDD) metric, which formulates classifier discrepancy as the class relevance of distinct target predictions and implicitly introduces a constraint on the target feature discriminability.
Abstract: Unsupervised domain adaptation challenges the problem of transferring knowledge from a well-labelled source domain to an unlabelled target domain. Recently, adversarial learning with bi-classifier has been proven effective in pushing cross-domain distributions close. Prior approaches typically leverage the disagreement between bi-classifier to learn transferable representations, however, they often neglect the classifier determinacy in the target domain, which could result in a lack of feature discriminability. In this paper, we present a simple yet effective method, namely Bi-Classifier Determinacy Maximization (BCDM), to tackle this problem. Motivated by the observation that target samples cannot always be separated distinctly by the decision boundary, here in the proposed BCDM, we design a novel classifier determinacy disparity (CDD) metric, which formulates classifier discrepancy as the class relevance of distinct target predictions and implicitly introduces constraint on the target feature discriminability. To this end, the BCDM can generate discriminative representations by encouraging target predictive outputs to be consistent and determined, meanwhile, preserve the diversity of predictions in an adversarial manner. Furthermore, the properties of CDD as well as the theoretical guarantees of BCDM's generalization bound are both elaborated. Extensive experiments show that BCDM compares favorably against the existing state-of-the-art domain adaptation methods.
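A rough sketch of this kind of bi-classifier determinacy objective is given below: it penalizes the probability mass that the two classifier heads place on different classes (the off-diagonal mass of their per-sample joint prediction). The exact CDD formulation and the adversarial training schedule in the paper differ in detail; the function and variable names here are illustrative.

```python
import torch
import torch.nn.functional as F

def determinacy_disparity(logits_a: torch.Tensor, logits_b: torch.Tensor) -> torch.Tensor:
    """Rough sketch of a bi-classifier determinacy-style loss: penalise
    probability mass the two classifier heads place on *different* classes.

    logits_a, logits_b: (batch, num_classes) outputs of the two heads
    on unlabelled target samples.
    """
    p_a = F.softmax(logits_a, dim=1)
    p_b = F.softmax(logits_b, dim=1)
    # Per-sample outer product of the two predictive distributions.
    joint = torch.bmm(p_a.unsqueeze(2), p_b.unsqueeze(1))   # (batch, C, C)
    agreement = joint.diagonal(dim1=1, dim2=2).sum(dim=1)   # mass on the same class
    disparity = 1.0 - agreement                             # mass spread across classes
    return disparity.mean()

# Minimising this on target data pushes both heads toward consistent,
# confident (determined) predictions; the adversarial step in such methods
# instead maximises it w.r.t. the classifiers to probe ambiguous samples.
loss = determinacy_disparity(torch.randn(8, 31), torch.randn(8, 31))
```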

36 citations

Journal Article
TL;DR: A new integration framework of texture and color information for background modeling is proposed, in which the foreground decision equation includes three parts (one part for color information, one part for texture information, and the remaining part for the integration of color and texture information).
Abstract: The detection of moving objects in videos is very important in many video processing applications, and background modeling is often an indispensable process to achieve this goal. Most of the traditional background modeling methods utilize color or texture information. However, color information is sensitive to illumination variations and texture information cannot be utilized to separate smooth foreground from smooth background in most cases. Achieving good performance in terms of high foreground detection accuracy and low computational cost is also challenging. In this paper, we propose a new integration framework of texture and color information for background modeling, in which the foreground decision equation includes three parts (one part for color information, one part for texture information, and the remaining part for the integration of color and texture information). This framework is able to combine the advantages of texture and color features while inhibiting their disadvantages as well. Moreover, we propose a block-based method to accelerate the background modeling. In particular, in the texture information modeling process, a single histogram model is established for each block whose bins indicate the occurrence probabilities of different patterns, which is different from the traditional multihistogram model for block-based background modeling, and then dominant background patterns are selected to calculate the background likelihood of newly arriving blocks. Dynamic background and multimodal problems can be handled through this technique. To evaluate the foreground detection performance reasonably, a new quality measure is proposed. Extensive experiments on various challenging videos validate the effectiveness of the proposed method over state-of-the-art methods.
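The block-based texture model can be sketched as follows: each block keeps a single running histogram of texture-pattern occurrences, and the background likelihood of an incoming block is the fraction of its patterns that fall among the dominant (most frequent) ones. The toy 4-neighbour LBP descriptor, learning rate, and top-k selection below are illustrative assumptions rather than the paper's exact design.

```python
import numpy as np

def lbp_codes(gray_block: np.ndarray) -> np.ndarray:
    """Toy 4-neighbour local binary pattern codes for one image block
    (a stand-in for whichever texture descriptor the method uses)."""
    c = gray_block[1:-1, 1:-1]
    bits = [
        gray_block[:-2, 1:-1] >= c,   # up
        gray_block[2:, 1:-1] >= c,    # down
        gray_block[1:-1, :-2] >= c,   # left
        gray_block[1:-1, 2:] >= c,    # right
    ]
    codes = np.zeros_like(c, dtype=np.uint8)
    for i, b in enumerate(bits):
        codes |= (b.astype(np.uint8) << i)
    return codes

class BlockHistogramModel:
    """Single running histogram of pattern occurrences per block; the most
    frequent patterns are treated as background."""

    def __init__(self, num_patterns: int = 16, lr: float = 0.05, top_k: int = 4):
        self.hist = np.full(num_patterns, 1.0 / num_patterns)
        self.lr, self.top_k = lr, top_k

    def update(self, block: np.ndarray) -> None:
        counts = np.bincount(lbp_codes(block).ravel(), minlength=len(self.hist))
        self.hist = (1 - self.lr) * self.hist + self.lr * counts / counts.sum()

    def background_likelihood(self, block: np.ndarray) -> float:
        dominant = set(np.argsort(self.hist)[-self.top_k:])
        codes = lbp_codes(block).ravel()
        return float(np.isin(codes, list(dominant)).mean())
```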

36 citations


Authors


Name | H-index | Papers | Citations
Philip S. Yu | 148 | 1914 | 107374
Lei Zhang | 130 | 2312 | 86950
Jian Xu | 94 | 1366 | 52057
Wei Chu | 80 | 670 | 28771
Le Song | 76 | 345 | 21382
Yuan Xie | 76 | 739 | 24155
Narendra Ahuja | 76 | 474 | 29517
Rong Jin | 75 | 449 | 19456
Beng Chin Ooi | 73 | 408 | 19174
Wotao Yin | 72 | 303 | 27233
Deng Cai | 70 | 326 | 24524
Xiaofei He | 70 | 260 | 28215
Irwin King | 67 | 476 | 19056
Gang Wang | 65 | 373 | 21579
Xiaodan Liang | 61 | 318 | 14121
Network Information
Related Institutions (5)
Microsoft
86.9K papers, 4.1M citations

94% related

Google
39.8K papers, 2.1M citations

94% related

Facebook
10.9K papers, 570.1K citations

93% related

AT&T Labs
5.5K papers, 483.1K citations

90% related

Performance Metrics
No. of papers from the Institution in previous years
Year | Papers
2023 | 5
2022 | 30
2021 | 1,352
2020 | 1,671
2019 | 1,459
2018 | 863