Home
/
Authors
/
Yue He

Author

Yue He

Bio: Yue He is an academic researcher from Tsinghua University. The author has contributed to research in topics: Generalization & Computer science. The author has an hindex of 6, co-authored 11 publications receiving 63 citations. Previous affiliations of Yue He include Beihang University.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Deep Stable Learning for Out-Of-Distribution Generalization

[...]

Xingxuan Zhang¹, Peng Cui¹, Renzhe Xu¹, Linjun Zhou¹, Yue He¹, Zheyan Shen¹ - Show less +2 more•Institutions (1)

Tsinghua University¹

01 Jun 2021

TL;DR: In this paper, the authors propose to remove the dependencies between features via learning weights for training samples, which helps deep models get rid of spurious correlations and, in turn, concentrate more on the true connection between discriminative features and labels.

...read moreread less

Abstract: Approaches based on deep neural networks have achieved striking performance when testing data and training data share similar distribution, but can significantly fail otherwise. Therefore, eliminating the impact of distribution shifts between training and testing data is crucial for building performance-promising deep models. Conventional methods assume either the known heterogeneity of training data (e.g. domain labels) or the approximately equal capacities of different domains. In this paper, we consider a more challenging case where neither of the above assumptions holds. We propose to address this problem by removing the dependencies between features via learning weights for training samples, which helps deep models get rid of spurious correlations and, in turn, concentrate more on the true connection between discriminative features and labels. Extensive experiments clearly demonstrate the effectiveness of our method on multiple distribution generalization benchmarks compared with state-of-the-art counterparts. Through extensive experiments on distribution generalization benchmarks including PACS, VLCS, MNIST-M, and NICO, we show the effectiveness of our method compared with state-of-the-art counterparts.

...read moreread less

113 citations

Posted Content•

Towards Non-I.I.D. Image Classification: A Dataset and Baselines.

[...]

Yue He¹, Zheyan Shen¹, Peng Cui¹•Institutions (1)

Tsinghua University¹

07 Jun 2019-arXiv: Computer Vision and Pattern Recognition

TL;DR: The experimental results demonstrate that NICO can well support the training of ConvNet model from scratch, and a batch balancing module can help ConvNets to perform better in Non-I.I.D.D., situations with sufficient flexibility.

...read moreread less

Abstract: I.I.D. hypothesis between training and testing data is the basis of numerous image classification methods. Such property can hardly be guaranteed in practice where the Non-IIDness is common, causing instable performances of these models. In literature, however, the Non-I.I.D. image classification problem is largely understudied. A key reason is lacking of a well-designed dataset to support related research. In this paper, we construct and release a Non-I.I.D. image dataset called NICO, which uses contexts to create Non-IIDness consciously. Compared to other datasets, extended analyses prove NICO can support various Non-I.I.D. situations with sufficient flexibility. Meanwhile, we propose a baseline model with ConvNet structure for General Non-I.I.D. image classification, where distribution of testing data is unknown but different from training data. The experimental results demonstrate that NICO can well support the training of ConvNet model from scratch, and a batch balancing module can help ConvNets to perform better in Non-I.I.D. settings.

...read moreread less

71 citations

Journal Article•DOI•

Towards Non-I.I.D. image classification: A dataset and baselines

[...]

Yue He¹, Zheyan Shen¹, Peng Cui¹•Institutions (1)

Tsinghua University¹

01 Feb 2021-Pattern Recognition

TL;DR: In this paper, a Non-I.I.D. image dataset called NICO 4, which uses contexts to create non-IIDness consciously, was constructed and released.

...read moreread less

44 citations

Proceedings Article•

Counterfactual Prediction for Bundle Treatment

[...]

Hao Zou¹, Peng Cui¹, Bo Li², Zheyan Shen¹, Jianxin Ma¹, Hongxia Yang³, Yue He¹ - Show less +3 more•Institutions (3)

Tsinghua University¹, Northwestern Polytechnical University², Alibaba Group³

01 Jan 2020

TL;DR: This work proposes a novel variational sample re-weighting (VSR) method to eliminate confounding bias by decorrelating the treatments and confounders and conducts extensive experiments to demonstrate that the predictive model trained on this re-weightsed dataset can achieve more accurate counterfactual outcome prediction.

...read moreread less

Abstract: Estimating counterfactual outcome of different treatments from observational data is an important problem to assist decision making in a variety of fields. Among the various forms of treatment specification, bundle treatment has been widely adopted in many scenarios, such as recommendation systems and online marketing. The bundle treatment usually can be abstracted as a high dimensional binary vector, which makes it more challenging for researchers to remove the confounding bias in observational data. In this work, we assume the existence of low dimensional latent structure underlying bundle treatment. Via the learned latent representations of treatments, we propose a novel variational sample re-weighting (VSR) method to eliminate confounding bias by decorrelating the treatments and confounders. Finally, we conduct extensive experiments to demonstrate that the predictive model trained on this re-weighted dataset can achieve more accurate counterfactual outcome prediction.

...read moreread less

32 citations

Proceedings Article•DOI•

StylizedNeRF: Consistent 3D Scene Stylization as Stylized NeRF via 2D-3D Mutual Learning

[...]

Yihua Huang, Yue He, Yu-Jie Yuan, Yu-Kun Lai, Lin Gao - Show less +1 more

24 May 2022

TL;DR: A novel mutual learning framework for 3D scene stylization that combines a 2D image stylization network and NeRF to fuse the stylization ability of 2D stylized network with the 3D consistency of NeRF is proposed.

...read moreread less

Abstract: 3D scene stylization aims at generating stylized images of the scene from arbitrary novel views following a given set of style examples, while ensuring consistency when rendered from different views. Directly applying methods for image or video stylization to 3D scenes cannot achieve such consistency. Thanks to recently proposed neural radiance fields (NeRF), we are able to represent a 3D scene in a consistent way. Consistent 3D scene stylization can be effectively achieved by stylizing the corresponding NeRF. However, there is a significant domain gap between style examples which are 2D images and NeRF which is an implicit volumetric representation. To address this problem, we propose a novel mutual learning framework for 3D scene stylization that combines a 2D image stylization network and NeRF to fuse the stylization ability of 2D stylization network with the 3D consistency of NeRF. We first pre-train a standard NeRF of the 3D scene to be stylized and replace its color prediction module with a style network to obtain a stylized NeRF. It is followed by distilling the prior knowledge of spatial consistency from NeRF to the 2D stylization network through an introduced consistency loss. We also introduce a mimic loss to supervise the mutual learning of the NeRF style module and fine-tune the 2D stylization decoder. In order to further make our model handle ambiguities of 2D stylization results, we introduce learnable latent codes that obey the probability distributions conditioned on the style. They are attached to training samples as conditional inputs to better learn the style module in our novel stylized NeRF. Experimental results demonstrate that our method is superior to existing approaches in both visual quality and long-range consistency.

...read moreread less

29 citations

Cited by

PDF

Open Access

More filters

Posted Content•

WILDS: A Benchmark of in-the-Wild Distribution Shifts

[...]

Pang Wei Koh¹, Shiori Sagawa¹, Henrik Marklund¹, Sang Michael Xie², Marvin Zhang¹, Akshay Balsubramani¹, Weihua Hu¹, Michihiro Yasunaga³, Richard Lanas Phillips¹, Irena Gao¹, Tony Lee¹, Etienne David⁴, Ian Stavness⁵, Wei Guo⁵, Berton A. Earnshaw, Imran S. Haque⁶, Sara Beery¹, Jure Leskovec¹, Anshul Kundaje⁷, Emma Pierson², Sergey Levine¹, Chelsea Finn¹, Percy Liang¹ - Show less +19 more•Institutions (7)

Stanford University¹, University of California, Berkeley², Cornell University³, University of Saskatchewan⁴, University of Tokyo⁵, California Institute of Technology⁶, Microsoft⁷

14 Dec 2020-arXiv: Learning

TL;DR: WILDS is presented, a benchmark of in-the-wild distribution shifts spanning diverse data modalities and applications, and is hoped to encourage the development of general-purpose methods that are anchored to real-world distribution shifts and that work well across different applications and problem settings.

...read moreread less

Abstract: Distribution shifts -- where the training distribution differs from the test distribution -- can substantially degrade the accuracy of machine learning (ML) systems deployed in the wild. Despite their ubiquity, these real-world distribution shifts are under-represented in the datasets widely used in the ML community today. To address this gap, we present WILDS, a curated collection of 8 benchmark datasets that reflect a diverse range of distribution shifts which naturally arise in real-world applications, such as shifts across hospitals for tumor identification; across camera traps for wildlife monitoring; and across time and location in satellite imaging and poverty mapping. On each dataset, we show that standard training results in substantially lower out-of-distribution than in-distribution performance, and that this gap remains even with models trained by existing methods for handling distribution shifts. This underscores the need for new training methods that produce models which are more robust to the types of distribution shifts that arise in practice. To facilitate method development, we provide an open-source package that automates dataset loading, contains default model architectures and hyperparameters, and standardizes evaluations. Code and leaderboards are available at this https URL.

...read moreread less

579 citations

Journal Article•DOI•

Perceptual-Sensitive GAN for Generating Adversarial Patches.

[...]

Aishan Liu¹, Xianglong Liu¹, Jiaxin Fan¹, Yuqing Ma¹, Anlan Zhang¹, Huiyuan Xie², Dacheng Tao³ - Show less +3 more•Institutions (3)

Beihang University¹, University of Cambridge², University of Sydney³

17 Jul 2019

TL;DR: This paper proposes a perceptual-sensitive generative adversarial network (PS-GAN) that can simultaneously enhance the visual fidelity and the attacking ability for the adversarial patch, and treats the patch generation as a patch-to-patch translation via an adversarial process.

...read moreread less

Abstract: Deep neural networks (DNNs) are vulnerable to adversarial examples where inputs with imperceptible perturbations mislead DNNs to incorrect results. Recently, adversarial patch, with noise confined to a small and localized patch, emerged for its easy accessibility in real-world. However, existing attack strategies are still far from generating visually natural patches with strong attacking ability, since they often ignore the perceptual sensitivity of the attacked network to the adversarial patch, including both the correlations with the image context and the visual attention. To address this problem, this paper proposes a perceptual-sensitive generative adversarial network (PS-GAN) that can simultaneously enhance the visual fidelity and the attacking ability for the adversarial patch. To improve the visual fidelity, we treat the patch generation as a patch-to-patch translation via an adversarial process, feeding any types of seed patch and outputting the similar adversarial patch with high perceptual correlation with the attacked image. To further enhance the attacking ability, an attention mechanism coupled with adversarial generation is introduced to predict the critical attacking areas for placing the patches, which can help producing more realistic and aggressive patches. Extensive experiments under semi-whitebox and black-box settings on two large-scale datasets GTSRB and ImageNet demonstrate that the proposed PS-GAN outperforms state-of-the-art adversarial patch attack methods.

...read moreread less

173 citations

Posted Content•

Continual Learning for Robotics: Definition, Framework, Learning Strategies, Opportunities and Challenges

[...]

Timothée Lesort¹, Vincenzo Lomonaco², Andrei Stoian, Davide Maltoni², David Filliat¹, Natalia Díaz-Rodríguez¹ - Show less +2 more•Institutions (2)

French Institute for Research in Computer Science and Automation¹, University of Bologna²

29 Jun 2019-arXiv: Learning

TL;DR: ContinContinual Learning (CL) is a particular machine learning paradigm where the data distribution and learning objective changes through time, or where all the training data and objective criteria are never available at once as mentioned in this paper.

...read moreread less

Abstract: Continual learning (CL) is a particular machine learning paradigm where the data distribution and learning objective changes through time, or where all the training data and objective criteria are never available at once. The evolution of the learning process is modeled by a sequence of learning experiences where the goal is to be able to learn new skills all along the sequence without forgetting what has been previously learned. Continual learning also aims at the same time at optimizing the memory, the computation power and the speed during the learning process. An important challenge for machine learning is not necessarily finding solutions that work in the real world but rather finding stable algorithms that can learn in real world. Hence, the ideal approach would be tackling the real world in a embodied platform: an autonomous agent. Continual learning would then be effective in an autonomous agent or robot, which would learn autonomously through time about the external world, and incrementally develop a set of complex skills and knowledge. Robotic agents have to learn to adapt and interact with their environment using a continuous stream of observations. Some recent approaches aim at tackling continual learning for robotics, but most recent papers on continual learning only experiment approaches in simulation or with static datasets. Unfortunately, the evaluation of those algorithms does not provide insights on whether their solutions may help continual learning in the context of robotics. This paper aims at reviewing the existing state of the art of continual learning, summarizing existing benchmarks and metrics, and proposing a framework for presenting and evaluating both robotics and non robotics approaches in a way that makes transfer between both fields easier.

...read moreread less

160 citations

Journal Article•DOI•

Collective Reconstructive Embeddings for Cross-Modal Hashing

[...]

Mengqiu Hu¹, Yang Yang¹, Fumin Shen¹, Ning Xie¹, Richang Hong², Heng Tao Shen¹ - Show less +2 more•Institutions (2)

University of Electronic Science and Technology of China¹, Hefei University of Technology²

01 Jun 2019-IEEE Transactions on Image Processing

TL;DR: This paper unify the projections of text and image to the Hamming space into a common reconstructive embedding through rigid mathematical reformulation, which not only reduces the optimization complexity significantly but also facilitates the inter-modal similarity preservation among different modalities.

...read moreread less

Abstract: In this paper, we study the problem of cross-modal retrieval by hashing-based approximate nearest neighbor search techniques. Most existing cross-modal hashing works mainly address the issue of multi-modal integration complexity using the same mapping and similarity calculation for data from different media types. Nonetheless, this may cause information loss during the mapping process due to overlooking the specifics of each individual modality. In this paper, we propose a simple yet effective cross-modal hashing approach, termed collective reconstructive embeddings (CRE), which can simultaneously solve the heterogeneity and integration complexity of multi-modal data. To address the heterogeneity challenge, we propose to process heterogeneous types of data using different modality-specific models. Specifically, we model textual data with cosine similarity-based reconstructive embedding to alleviate the data sparsity to the greatest extent, while for image data, we utilize the Euclidean distance to characterize the relationships of the projected hash codes. Meanwhile, we unify the projections of text and image to the Hamming space into a common reconstructive embedding through rigid mathematical reformulation, which not only reduces the optimization complexity significantly but also facilitates the inter-modal similarity preservation among different modalities. We further incorporate the code balance and uncorrelation criteria into the problem and devise an efficient iterative algorithm for optimization. Comprehensive experiments on four widely used multimodal benchmarks show that the proposed CRE can achieve a superior performance compared with the state of the art on several challenging cross-modal tasks.

...read moreread less

113 citations

Journal Article•DOI•

Explainable deep learning for efficient and robust pattern recognition: A survey of recent developments

[...]

Xiao Bai¹, Xiang Wang¹, Xianglong Liu¹, Qiang Liu², Jingkuan Song³, Niculae Sebe⁴, Been Kim⁵ - Show less +3 more•Institutions (5)

Beihang University¹, University of Texas at Austin², University of Electronic Science and Technology of China³, University of Trento⁴, Google⁵

01 Dec 2021-Pattern Recognition

TL;DR: In this article, explainable deep learning methods are grouped into three main categories: efficient deep learning via model compression and acceleration, as well as robustness and stability in deep learning.

...read moreread less

101 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66

Collapse