Institution

Alibaba Group

Company · Hangzhou, China
About: Alibaba Group is a company based in Hangzhou, China. It is known for research contributions in the topics of Computer science and Terminal (electronics). The organization has 6,810 authors who have published 7,389 publications receiving 55,653 citations. It is also known as Alibaba Group Holding Limited and Alibaba Group (Cayman Islands).


Papers
Proceedings Article
03 Mar 2020
TL;DR: Proposes a Controllable Time-delay Transformer model that jointly performs punctuation prediction and disfluency detection in real time, freezing partial outputs with a controllable time delay to fulfill the real-time constraints in partial decoding required by subsequent applications.
Abstract: With the increased applications of automatic speech recognition (ASR) in recent years, it is essential to automatically insert punctuation marks and remove disfluencies in transcripts, to improve the readability of the transcripts as well as the performance of subsequent applications, such as machine translation, dialogue systems, and so forth. In this paper, we propose a Controllable Time-delay Transformer (CT-Transformer) model that jointly completes the punctuation prediction and disfluency detection tasks in real time. The CT-Transformer model facilitates freezing partial outputs with controllable time delay to fulfill the real-time constraints in partial decoding required by subsequent applications. We further propose a fast decoding strategy to minimize latency while maintaining competitive performance. Experimental results on the IWSLT2011 benchmark dataset and an in-house Chinese annotated dataset demonstrate that the proposed approach outperforms the previous state-of-the-art models on F-scores and achieves a competitive inference speed.
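
The "controllable time delay" can be pictured as incremental decoding in which labels older than a fixed window are frozen and emitted, while the newest positions remain revisable. Below is a minimal sketch of that freezing loop; the label_tokens callable is an assumed stand-in for a CT-Transformer-style tagger, not the paper's actual API.

    # Sketch of output freezing with a controllable delay of `delay` tokens.
    # `label_tokens(buffer)` is a hypothetical stand-in for a tagger that
    # returns one punctuation/disfluency label per buffered token.
    def stream_labels(token_stream, label_tokens, delay=3):
        buffer = []                          # tokens whose labels may still change
        for token in token_stream:
            buffer.append(token)
            labels = label_tokens(buffer)    # re-decode only the mutable window
            while len(buffer) > delay:
                # Positions older than `delay` are frozen and emitted, so
                # downstream consumers (e.g. an MT system) can start early.
                yield buffer.pop(0), labels.pop(0)
        for token, label in zip(buffer, label_tokens(buffer)):
            yield token, label               # flush the tail at end of stream

    # Toy labeler: attach a comma after the word "test", nothing elsewhere.
    toy = lambda toks: ["," if t == "test" else "" for t in toks]
    print([tok + mark for tok, mark in
           stream_labels("this is a test of streaming".split(), toy, delay=2)])
    # ['this', 'is', 'a', 'test,', 'of', 'streaming']

A larger delay lets the model revise more context before committing; a smaller one cuts latency. That trade-off is the knob the paper's fast decoding strategy tunes.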

26 citations

Journal Article
TL;DR: This work proposes a pruning method that optimizes the final accuracy of the pruned network and distills knowledge from the over-parameterized parent network's inner layers, together with a block grouping approach for complex structures such as convolutions with skip-links and depth-wise convolutions.
Abstract: Neural network pruning reduces the computational cost of an over-parameterized network to improve its efficiency. Popular methods vary from l1-norm sparsification to Neural Architecture Search (NAS). In this work, we propose a novel pruning method that optimizes the final accuracy of the pruned network and distills knowledge from the over-parameterized parent network's inner layers. To enable this approach, we formulate the network pruning as a Knapsack Problem which optimizes the trade-off between the importance of neurons and their associated computational cost. Then we prune the network channels while maintaining the high-level structure of the network. The pruned network is fine-tuned under the supervision of the parent network using its inner network knowledge, a technique we refer to as Inner Knowledge Distillation. Our method leads to state-of-the-art pruning results on ImageNet, CIFAR-10 and CIFAR-100 using ResNet backbones. To prune complex network structures such as convolutions with skip-links and depth-wise convolutions, we propose a block grouping approach. Through this we produce compact architectures with the same FLOPs as EfficientNet-B0 and MobileNetV3 but with higher accuracy, by 1% and 0.3% respectively on ImageNet, and faster runtime on GPU.
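
The knapsack formulation is concrete enough to sketch: each channel carries an importance score (the knapsack value) and a FLOP cost (the weight), and pruning keeps the subset of channels that maximizes total importance under a FLOP budget. A minimal 0/1-knapsack solver over made-up per-channel numbers, assuming integer FLOP costs; the paper's block grouping and Inner Knowledge Distillation fine-tuning are out of scope here.

    # Select the channels to KEEP: maximize total importance within a FLOP budget.
    def knapsack_prune(importance, flop_cost, flop_budget):
        # dp[b] = (best total importance, chosen channel indices) using <= b FLOPs
        dp = [(0.0, frozenset())] * (flop_budget + 1)
        for i in range(len(importance)):
            new_dp = list(dp)
            for b in range(flop_cost[i], flop_budget + 1):
                cand = dp[b - flop_cost[i]][0] + importance[i]
                if cand > new_dp[b][0]:
                    new_dp[b] = (cand, dp[b - flop_cost[i]][1] | {i})
            dp = new_dp                      # each channel is used at most once
        return sorted(dp[flop_budget][1])

    # Channels 0 and 2 together beat any other subset that fits in 5 FLOP units.
    print(knapsack_prune([0.9, 0.2, 0.5], [3, 2, 2], flop_budget=5))  # [0, 2]

In practice FLOP costs would be bucketed into coarse integer units to keep the table small; estimating the importance scores well is the part the method ties to the final accuracy of the pruned network.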

26 citations

Proceedings Article
01 Jan 2018
TL;DR: Presents KylinX, a dynamic library operating system for simplified and efficient cloud virtualization that provides the pVM (process-like VM) abstraction, allowing both page-level and library-level dynamic mapping.
Abstract: Unikernel specializes a minimalistic LibOS and a target application into a standalone single-purpose virtual machine (VM) running on a hypervisor, which is referred to as a (virtual) appliance. Compared to traditional VMs, Unikernel appliances have a smaller memory footprint and lower overhead while guaranteeing the same level of isolation. On the downside, Unikernel strips off the process abstraction from its monolithic appliance and thus sacrifices flexibility, efficiency, and applicability. This paper examines whether there is a balance embracing the best of both Unikernel appliances (strong isolation) and processes (high flexibility/efficiency). We present KylinX, a dynamic library operating system for simplified and efficient cloud virtualization by providing the pVM (process-like VM) abstraction. A pVM takes the hypervisor as an OS and the Unikernel appliance as a process, allowing both page-level and library-level dynamic mapping. At the page level, KylinX supports pVM fork plus a set of APIs for inter-pVM communication (IpC). At the library level, KylinX supports shared libraries to be linked to a Unikernel appliance at runtime. KylinX enforces mapping restrictions against potential threats. KylinX can fork a pVM in about 1.3 ms and link a library to a running pVM in a few ms, both comparable to process fork on Linux (about 1 ms). Latencies of KylinX IpCs are also comparable to those of UNIX IPCs.
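
The pVM analogy (hypervisor as OS, appliance as process) can be summarized as an interface sketch. Everything below is a hypothetical illustration of the operations the abstract describes; none of these names are KylinX's actual API.

    # Conceptual model only: hypothetical names, not the real KylinX interface.
    class PVM:
        def __init__(self, appliance_image, allowed_libs=()):
            self.image = appliance_image            # Unikernel appliance as "process"
            self.allowed_libs = set(allowed_libs)   # mapping restrictions vs. threats
            self.inbox = []                         # IpC endpoint

        def fork(self):
            # Page-level operation; the paper reports ~1.3 ms for pVM fork,
            # comparable to process fork on Linux (~1 ms).
            return PVM(self.image, self.allowed_libs)

        def link(self, library):
            # Library-level operation: map a shared library into a running pVM,
            # allowed only if it passes the mapping restrictions.
            if library not in self.allowed_libs:
                raise PermissionError(library + " violates mapping restrictions")

        def send(self, peer, message):
            # Inter-pVM communication (IpC), analogous to UNIX IPC.
            peer.inbox.append(message)

The point of the model is the split the paper argues for: isolation comes from the appliance boundary, while flexibility comes from process-like fork, runtime linking, and IpC.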

26 citations

Posted Content
TL;DR: Wang et al. propose a two-stage framework to extract triplets from the inputs, showing what the targeted aspects are, what their sentiment polarities are, and why they have such polarities (i.e., the opinion reasons).
Abstract: Target-based sentiment analysis or aspect-based sentiment analysis (ABSA) refers to addressing various sentiment analysis tasks at a fine-grained level, which includes but is not limited to aspect extraction, aspect sentiment classification, and opinion extraction. There exist many solvers of the above individual subtasks or a combination of two subtasks, and they can work together to tell a complete story, i.e. the discussed aspect, the sentiment on it, and the cause of the sentiment. However, no previous ABSA research tried to provide a complete solution in one shot. In this paper, we introduce a new subtask under ABSA, named aspect sentiment triplet extraction (ASTE). Particularly, a solver of this task needs to extract triplets (What, How, Why) from the inputs, which show WHAT the targeted aspects are, HOW their sentiment polarities are and WHY they have such polarities (i.e. opinion reasons). For instance, one triplet from "Waiters are very friendly and the pasta is simply average" could be ('Waiters', positive, 'friendly'). We propose a two-stage framework to address this task. The first stage predicts what, how and why in a unified model, and then the second stage pairs up the predicted what (how) and why from the first stage to output triplets. In the experiments, our framework has set a benchmark performance in this novel triplet extraction task. Meanwhile, it outperforms a few strong baselines adapted from state-of-the-art related methods.
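
The two-stage split is easy to picture: stage one tags the what/how/why candidates, and stage two pairs them into triplets. The sketch below hard-codes stage-one outputs and uses a nearest-span heuristic for the pairing; the actual framework learns the pairing with a model, so the heuristic (and the second, 'pasta' triplet with its polarity) is purely illustrative.

    # Stage 2 of a two-stage ASTE pipeline: pair each (what, how) with a why.
    def pair_triplets(aspects, opinions):
        # aspects : (aspect_text, polarity, char_offset) from stage 1
        # opinions: (opinion_text, char_offset) from stage 1
        # Nearest-offset pairing is an assumed heuristic, not the learned model.
        return [(a, pol, min(opinions, key=lambda o: abs(o[1] - pos))[0])
                for a, pol, pos in aspects]

    # "Waiters are very friendly and the pasta is simply average"
    aspects = [("Waiters", "positive", 0), ("pasta", "neutral", 34)]
    opinions = [("friendly", 17), ("average", 50)]
    print(pair_triplets(aspects, opinions))
    # [('Waiters', 'positive', 'friendly'), ('pasta', 'neutral', 'average')]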

26 citations

Patent
09 May 2007
TL;DR: Records the trigger frequency of a predetermined action with respect to the target resource, determines a coefficient of performance (CoP) from the respective reciprocal resource value and that trigger frequency, and allocates the target resource by ranking the CoPs.
Abstract: Interactive resource competition ranks resource users by coefficients of performance (CoP). Resource users compete for a target resource and each offer a quantifiable reciprocal resource in exchange for using the target resource. A trigger frequency of a predetermined action with respect to the target resource is recorded, and the CoP is determined from the respective reciprocal resource value and the trigger frequency. The target resource is then allocated based on the ranking of the CoPs. An intermediate and credit fund account is used both for making payments and for recording the trigger frequency. The competition is particularly useful for allocating an information display position in online electronic marketing.
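
In spirit, the CoP combines what a competitor offers (the reciprocal resource value, e.g. a bid) with how often the predetermined action (e.g. a click on the displayed listing) is triggered. The sketch below assumes a simple product of the two; the patent does not commit to this exact formula, so read it only as an illustration of the ranking and allocation step.

    # Toy CoP ranking for one display position; the multiplicative formula
    # is an assumed stand-in, not the patent's specified computation.
    def rank_by_cop(competitors):
        # competitors: (user, reciprocal_value, trigger_frequency) tuples
        return sorted(((user, value * freq) for user, value, freq in competitors),
                      key=lambda pair: pair[1], reverse=True)

    bids = [("seller_a", 1.20, 0.03),   # highest bid, lowest click frequency
            ("seller_b", 0.80, 0.06),   # lowest bid, clicks twice as often
            ("seller_c", 1.00, 0.04)]
    winner, cop = rank_by_cop(bids)[0]
    print(winner, round(cop, 3))        # seller_b 0.048, despite the lowest bid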

26 citations


Authors

Showing all 6829 results

Name              H-index   Papers   Citations
Philip S. Yu      148       1,914    107,374
Lei Zhang         130       2,312    86,950
Jian Xu           94        1,366    52,057
Wei Chu           80        670      28,771
Le Song           76        345      21,382
Yuan Xie          76        739      24,155
Narendra Ahuja    76        474      29,517
Rong Jin          75        449      19,456
Beng Chin Ooi     73        408      19,174
Wotao Yin         72        303      27,233
Deng Cai          70        326      24,524
Xiaofei He        70        260      28,215
Irwin King        67        476      19,056
Gang Wang         65        373      21,579
Xiaodan Liang     61        318      14,121
Network Information
Related Institutions (5)
Microsoft
86.9K papers, 4.1M citations

94% related

Google
39.8K papers, 2.1M citations

94% related

Facebook
10.9K papers, 570.1K citations

93% related

AT&T Labs
5.5K papers, 483.1K citations

90% related

Performance Metrics
No. of papers from the Institution in previous years
Year    Papers
2023    5
2022    30
2021    1,352
2020    1,671
2019    1,459
2018    863