Institution

SenseTime

About: SenseTime is an organization known for its research contributions in the topics Feature (computer vision) and Computer science. The organization has 637 authors who have published 1163 publications receiving 46320 citations. The organization is also known as: Sense Time.


Topics: Feature (computer vision), Computer science, Convolutional neural network, Object detection, Segmentation


Papers


Proceedings Article

Pyramid Scene Parsing Network

Hengshuang Zhao¹, Jianping Shi², Xiaojuan Qi¹, Xiaogang Wang¹, Jiaya Jia¹

The Chinese University of Hong Kong¹, SenseTime²

21 Jul 2017

TL;DR: This paper exploits the capability of global context information by different-region-based context aggregation through the pyramid pooling module together with the proposed pyramid scene parsing network (PSPNet) to produce good quality results on the scene parsing task.


Abstract: Scene parsing is challenging for unrestricted open vocabulary and diverse scenes. In this paper, we exploit the capability of global context information by different-region-based context aggregation through our pyramid pooling module together with the proposed pyramid scene parsing network (PSPNet). Our global prior representation is effective to produce good quality results on the scene parsing task, while PSPNet provides a superior framework for pixel-level prediction. The proposed approach achieves state-of-the-art performance on various datasets. It came first in ImageNet scene parsing challenge 2016, PASCAL VOC 2012 benchmark and Cityscapes benchmark. A single PSPNet yields the new record of mIoU accuracy 85.4% on PASCAL VOC 2012 and accuracy 80.2% on Cityscapes.
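The pyramid pooling idea in the abstract above can be illustrated with a minimal pure-Python sketch. It average-pools a single-channel feature map into grids of several resolutions, so the coarsest level summarises the whole image and finer levels summarise sub-regions. The bin sizes (1, 2, 3, 6), the helper names, and the toy feature map are assumptions made for illustration; they are not stated on this page. A full module would also need per-level convolution, upsampling, and concatenation with the input features, which this sketch omits.

```python
def adaptive_avg_pool(fmap, bins):
    """Average-pool a 2D feature map (list of rows) into a bins x bins grid."""
    h, w = len(fmap), len(fmap[0])
    pooled = []
    for i in range(bins):
        # Row range of the i-th bin: floor for the start, ceil for the end,
        # so that the bins together cover the whole map.
        r0, r1 = (i * h) // bins, -((-(i + 1) * h) // bins)
        row = []
        for j in range(bins):
            c0, c1 = (j * w) // bins, -((-(j + 1) * w) // bins)
            cells = [fmap[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            row.append(sum(cells) / len(cells))
        pooled.append(row)
    return pooled


def pyramid_pool(fmap, bin_sizes=(1, 2, 3, 6)):
    """Return one pooled grid per pyramid level, keyed by bin size."""
    # bin_sizes is an assumed default, not taken from this page.
    return {b: adaptive_avg_pool(fmap, b) for b in bin_sizes}


# Hypothetical 6x6 single-channel feature map with values 0..35.
feature_map = [[float(r * 6 + c) for c in range(6)] for r in range(6)]
levels = pyramid_pool(feature_map)
print(levels[1])  # global context: a single averaged value
print(levels[2])  # four sub-region averages
```

The 1x1 level acts as a global prior, while the finer grids keep coarse location information about which region each average came from.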


10,189 citations

Proceedings Article

Path Aggregation Network for Instance Segmentation

Shu Liu¹, Lu Qi¹, Haifang Qin², Jianping Shi³, Jiaya Jia¹

The Chinese University of Hong Kong¹, Peking University², SenseTime³

18 Jun 2018

TL;DR: PANet enhances the entire feature hierarchy with accurate localization signals in lower layers by bottom-up path augmentation, which shortens the information path between the lower layers and the topmost feature.


Abstract: The way that information propagates in neural networks is of great importance. In this paper, we propose Path Aggregation Network (PANet) aiming at boosting information flow in proposal-based instance segmentation framework. Specifically, we enhance the entire feature hierarchy with accurate localization signals in lower layers by bottom-up path augmentation, which shortens the information path between lower layers and topmost feature. We present adaptive feature pooling, which links feature grid and all feature levels to make useful information in each level propagate directly to following proposal subnetworks. A complementary branch capturing different views for each proposal is created to further improve mask prediction. These improvements are simple to implement, with subtle extra computational overhead. Yet they are useful and make our PANet reach the 1st place in the COCO 2017 Challenge Instance Segmentation task and the 2nd place in Object Detection task without large-batch training. PANet is also state-of-the-art on MVD and Cityscapes.
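As a rough illustration of the adaptive feature pooling step described above, the sketch below takes the grids pooled for one proposal from every feature level and fuses them element-wise, so that each level can contribute directly to the features passed on. The fusion operator (element-wise max), the function name, and the toy values are assumptions for illustration; the abstract does not say how the levels are combined.

```python
def fuse_levels(pooled_per_level, op=max):
    """Fuse same-shaped per-level grids element-wise with `op` (max by default)."""
    rows = len(pooled_per_level[0])
    cols = len(pooled_per_level[0][0])
    return [
        [op(level[r][c] for level in pooled_per_level) for c in range(cols)]
        for r in range(rows)
    ]


# Hypothetical 2x2 grids pooled for one proposal from three feature levels.
proposal_grids = [
    [[0.1, 0.9], [0.4, 0.2]],
    [[0.5, 0.3], [0.1, 0.8]],
    [[0.2, 0.2], [0.7, 0.6]],
]
fused = fuse_levels(proposal_grids)
print(fused)
```

Passing `op=sum` instead gives an additive fusion; which operator works better is an empirical question that this sketch does not address.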


3,784 citations

Proceedings Article

Residual Attention Network for Image Classification

Fei Wang¹, Mengqing Jiang², Chen Qian¹, Shuo Yang³, Cheng Li¹, Honggang Zhang⁴, Xiaogang Wang³, Xiaoou Tang³

SenseTime¹, Tsinghua University², The Chinese University of Hong Kong³, Beijing University of Posts and Telecommunications⁴

01 Jul 2017

TL;DR: Residual Attention Network is a convolutional neural network with an attention mechanism that can be combined with state-of-the-art feed-forward network architectures and trained end-to-end.


Abstract: In this work, we propose Residual Attention Network, a convolutional neural network using attention mechanism which can incorporate with state-of-art feed forward network architecture in an end-to-end training fashion. Our Residual Attention Network is built by stacking Attention Modules which generate attention-aware features. The attention-aware features from different modules change adaptively as layers going deeper. Inside each Attention Module, bottom-up top-down feedforward structure is used to unfold the feedforward and feedback attention process into a single feedforward process. Importantly, we propose attention residual learning to train very deep Residual Attention Networks which can be easily scaled up to hundreds of layers. Extensive analyses are conducted on CIFAR-10 and CIFAR-100 datasets to verify the effectiveness of every module mentioned above. Our Residual Attention Network achieves state-of-the-art object recognition performance on three benchmark datasets including CIFAR-10 (3.90% error), CIFAR-100 (20.45% error) and ImageNet (4.8% single model and single crop, top-5 error). Note that, our method achieves 0.6% top-1 accuracy improvement with 46% trunk depth and 69% forward FLOPs comparing to ResNet-200. The experiment also demonstrates that our network is robust against noisy labels.
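The attention residual learning mentioned above can be sketched in a few lines. This sketch assumes the formulation in which a soft mask M in (0, 1) modulates trunk features F as (1 + M) * F, so that a mask near zero leaves the trunk features unchanged instead of suppressing them. That formula, the sigmoid mask, and all names below are assumptions for illustration and are not spelled out in the abstract.

```python
import math


def sigmoid(x):
    """Squash a logit into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def attention_residual(trunk, mask_logits):
    """Combine trunk features with a soft mask as (1 + M) * F, element-wise."""
    return [(1.0 + sigmoid(m)) * f for f, m in zip(trunk, mask_logits)]


# Hypothetical trunk features and mask logits for three positions.
trunk_features = [2.0, -1.0, 0.5]
mask_logits = [0.0, -50.0, 50.0]
out = attention_residual(trunk_features, mask_logits)
print(out)  # the middle feature passes through almost unchanged
```

Under this assumption, stacking many modules does not repeatedly shrink the features, because the identity term keeps the trunk signal even where the mask is close to zero.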


2,625 citations

Proceedings Article

High Performance Visual Tracking with Siamese Region Proposal Network

Bo Li¹, Junjie Yan², Wei Wu³, Zheng Zhu⁴, Xiaolin Hu²

Beihang University¹, Tsinghua University², SenseTime³, Chinese Academy of Sciences⁴

18 Jun 2018

TL;DR: The Siamese region proposal network (Siamese-RPN) is proposed, which is trained end-to-end off-line with large-scale image pairs for visual object tracking; it consists of a Siamese subnetwork for feature extraction and a region proposal subnetwork with a classification branch and a regression branch.


Abstract: Visual object tracking has been a fundamental topic in recent years and many deep learning based trackers have achieved state-of-the-art performance on multiple benchmarks. However, most of these trackers can hardly get top performance with real-time speed. In this paper, we propose the Siamese region proposal network (Siamese-RPN) which is end-to-end trained off-line with large-scale image pairs. Specifically, it consists of Siamese subnetwork for feature extraction and region proposal subnetwork including the classification branch and regression branch. In the inference phase, the proposed framework is formulated as a local one-shot detection task. We can pre-compute the template branch of the Siamese subnetwork and formulate the correlation layers as trivial convolution layers to perform online tracking. Benefit from the proposal refinement, traditional multi-scale test and online fine-tuning can be discarded. The Siamese-RPN runs at 160 FPS while achieving leading performance in VOT2015, VOT2016 and VOT2017 real-time challenges.
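The remark above about pre-computing the template branch and treating correlation as convolution can be illustrated with a single-channel sketch: the template features are computed once and then slid over each new search region as a fixed kernel. The function name and the toy feature maps are hypothetical, and the real network applies this per channel with separate classification and regression outputs, which this sketch does not model.

```python
def cross_correlate(search, template):
    """Slide `template` over `search` (valid positions only) and sum products."""
    sh, sw = len(search), len(search[0])
    th, tw = len(template), len(template[0])
    response = []
    for i in range(sh - th + 1):
        row = []
        for j in range(sw - tw + 1):
            row.append(sum(
                search[i + di][j + dj] * template[di][dj]
                for di in range(th) for dj in range(tw)
            ))
        response.append(row)
    return response


# Hypothetical template features, computed once for the tracked object.
template_features = [[1.0, 0.0], [0.0, 1.0]]
# Hypothetical search-region features for the current frame.
search_features = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
response = cross_correlate(search_features, template_features)
# Location of the strongest response, as (value, (row, col)).
best = max((v, (i, j)) for i, row in enumerate(response) for j, v in enumerate(row))
print(response, best)
```

Because the template kernel is fixed after the first frame, each later frame only needs one feature extraction for the search region plus this sliding-window product.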


2,016 citations

Proceedings Article

DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations

Ziwei Liu¹, Ping Luo¹, Shi Qiu², Xiaogang Wang¹, Xiaoou Tang¹

The Chinese University of Hong Kong¹, SenseTime²

27 Jun 2016

TL;DR: This work introduces DeepFashion, a large-scale clothes dataset with comprehensive annotations, and proposes a new deep model, namely FashionNet, which learns clothing features by jointly predicting clothing attributes and landmarks.


Abstract: Recent advances in clothes recognition have been driven by the construction of clothes datasets. Existing datasets are limited in the amount of annotations and are difficult to cope with the various challenges in real-world applications. In this work, we introduce DeepFashion, a large-scale clothes dataset with comprehensive annotations. It contains over 800,000 images, which are richly annotated with massive attributes, clothing landmarks, and correspondence of images taken under different scenarios including store, street snapshot, and consumer. Such rich annotations enable the development of powerful algorithms in clothes recognition and facilitate future research. To demonstrate the advantages of DeepFashion, we propose a new deep model, namely FashionNet, which learns clothing features by jointly predicting clothing attributes and landmarks. The estimated landmarks are then employed to pool or gate the learned features. It is optimized in an iterative manner. Extensive experiments demonstrate the effectiveness of FashionNet and the usefulness of DeepFashion.


1,649 citations


Authors

Showing all 637 results

Name	H-index	Papers	Citations
Xiaoou Tang	132	553	94555
Xiaogang Wang	128	452	73740
Chen Change Loy	84	236	36263
Wangmeng Zuo	80	496	28053
Jiaya Jia	80	294	33545
Liang Lin	73	499	19904
Hongsheng Li	69	289	19582
Yu Qiao	69	484	29922
Dahua Lin	61	269	17717
Junjie Yan	59	247	15090
Ping Luo	57	247	21715
Yu-Wing Tai	54	213	10917
Jianping Shi	48	124	19523
Shaoting Zhang	44	233	8427
Yu Lu	43	232	6485

Network Information

Related Institutions (5)

Institution	Papers	Citations	Relatedness
Adobe Systems	8K	214.7K	90%
Facebook	10.9K	570.1K	90%
Google	39.8K	2.1M	90%
Microsoft	86.9K	4.1M	86%
Nvidia	7.2K	201.2K	84%

Performance

Metrics

Metric	Value
Papers	1,163
Citations	85,149

No. of papers from the Institution in previous years
Year	Papers
2022	5
2021	254
2020	350
2019	227
2018	210
2017	64