Home
/
Authors
/
Jianmin Wang

Author

Jianmin Wang

Other affiliations: Chinese Ministry of Education

Bio: Jianmin Wang is an academic researcher from Tsinghua University. The author has contributed to research in topics: Process mining & Computer science. The author has an hindex of 52, co-authored 324 publications receiving 16308 citations. Previous affiliations of Jianmin Wang include Chinese Ministry of Education.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
1996

Papers

PDF

Open Access

More filters

Posted Content•

Learning Transferable Features with Deep Adaptation Networks

[...]

Mingsheng Long¹, Mingsheng Long², Yue Cao¹, Jianmin Wang¹, Michael I. Jordan² - Show less +1 more•Institutions (2)

Tsinghua University¹, University of California, Berkeley²

10 Feb 2015-arXiv: Learning

TL;DR: A new Deep Adaptation Network (DAN) architecture is proposed, which generalizes deep convolutional neural network to the domain adaptation scenario and can learn transferable features with statistical guarantees, and can scale linearly by unbiased estimate of kernel embedding.

...read moreread less

Abstract: Recent studies reveal that a deep neural network can learn transferable features which generalize well to novel tasks for domain adaptation. However, as deep features eventually transition from general to specific along the network, the feature transferability drops significantly in higher layers with increasing domain discrepancy. Hence, it is important to formally reduce the dataset bias and enhance the transferability in task-specific layers. In this paper, we propose a new Deep Adaptation Network (DAN) architecture, which generalizes deep convolutional neural network to the domain adaptation scenario. In DAN, hidden representations of all task-specific layers are embedded in a reproducing kernel Hilbert space where the mean embeddings of different domain distributions can be explicitly matched. The domain discrepancy is further reduced using an optimal multi-kernel selection method for mean embedding matching. DAN can learn transferable features with statistical guarantees, and can scale linearly by unbiased estimate of kernel embedding. Extensive empirical evidence shows that the proposed architecture yields state-of-the-art image classification error rates on standard domain adaptation benchmarks.

...read moreread less

3,351 citations

Proceedings Article•DOI•

Transfer Feature Learning with Joint Distribution Adaptation

[...]

Mingsheng Long¹, Jianmin Wang¹, Guiguang Ding¹, Jiaguang Sun¹, Philip S. Yu² - Show less +1 more•Institutions (2)

Tsinghua University¹, University of Illinois at Chicago²

01 Dec 2013

TL;DR: JDA aims to jointly adapt both the marginal distribution and conditional distribution in a principled dimensionality reduction procedure, and construct new feature representation that is effective and robust for substantial distribution difference.

...read moreread less

Abstract: Transfer learning is established as an effective technology in computer vision for leveraging rich labeled data in the source domain to build an accurate classifier for the target domain. However, most prior methods have not simultaneously reduced the difference in both the marginal distribution and conditional distribution between domains. In this paper, we put forward a novel transfer learning approach, referred to as Joint Distribution Adaptation (JDA). Specifically, JDA aims to jointly adapt both the marginal distribution and conditional distribution in a principled dimensionality reduction procedure, and construct new feature representation that is effective and robust for substantial distribution difference. Extensive experiments verify that JDA can significantly outperform several state-of-the-art methods on four types of cross-domain image classification problems.

...read moreread less

1,542 citations

Proceedings Article•

Learning Transferable Features with Deep Adaptation Networks

[...]

Mingsheng Long¹, Mingsheng Long², Yue Cao², Jianmin Wang², Michael I. Jordan¹ - Show less +1 more•Institutions (2)

University of California, Berkeley¹, Tsinghua University²

06 Jul 2015

TL;DR: Deep Adaptation Network (DAN) as mentioned in this paper embeds hidden representations of all task-specific layers in a reproducing kernel Hilbert space where the mean embeddings of different domain distributions can be explicitly matched.

...read moreread less

Abstract: Recent studies reveal that a deep neural network can learn transferable features which generalize well to novel tasks for domain adaptation. However, as deep features eventually transition from general to specific along the network, the feature transferability drops significantly in higher layers with increasing domain discrepancy. Hence, it is important to formally reduce the dataset bias and enhance the transferability in task-specific layers. In this paper, we propose a new Deep Adaptation Network (DAN) architecture, which generalizes deep convolutional neural network to the domain adaptation scenario. In DAN, hidden representations of all task-specific layers are embedded in a reproducing kernel Hilbert space where the mean embeddings of different domain distributions can be explicitly matched. The domain discrepancy is further reduced using an optimal multikernel selection method for mean embedding matching. DAN can learn transferable features with statistical guarantees, and can scale linearly by unbiased estimate of kernel embedding. Extensive empirical evidence shows that the proposed architecture yields state-of-the-art image classification error rates on standard domain adaptation benchmarks.

...read moreread less

1,272 citations

Proceedings Article•

Unsupervised domain adaptation with residual transfer networks

[...]

Mingsheng Long¹, Han Zhu¹, Jianmin Wang¹, Michael I. Jordan²•Institutions (2)

Tsinghua University¹, University of California, Berkeley²

05 Dec 2016

TL;DR: Empirical evidence shows that the new approach to domain adaptation in deep networks that can jointly learn adaptive classifiers and transferable features from labeled data in the source domain and unlabeledData in the target domain outperforms state of the art methods on standard domain adaptation benchmarks.

...read moreread less

Abstract: The recent success of deep neural networks relies on massive amounts of labeled data. For a target task where labeled data is unavailable, domain adaptation can transfer a learner from a different source domain. In this paper, we propose a new approach to domain adaptation in deep networks that can jointly learn adaptive classifiers and transferable features from labeled data in the source domain and unlabeled data in the target domain. We relax a shared-classifier assumption made by previous methods and assume that the source classifier and target classifier differ by a residual function. We enable classifier adaptation by plugging several layers into deep network to explicitly learn the residual function with reference to the target classifier. We fuse features of multiple layers with tensor product and embed them into reproducing kernel Hilbert spaces to match distributions for feature adaptation. The adaptation can be achieved in most feed-forward models by extending them with new residual layers and loss functions, which can be trained efficiently via back-propagation. Empirical evidence shows that the new approach outperforms state of the art methods on standard domain adaptation benchmarks.

...read moreread less

1,229 citations

Book Chapter•DOI•

Process Mining Manifesto

[...]

Wil M. P. van der Aalst¹, Wil M. P. van der Aalst², A Arya Adriansyah¹, Ana Karla Alves de Medeiros³, Franco Arcieri⁴, Thomas Baier⁵, Tobias Blickle⁶, Jagadeesh Chandra Bose¹, Peter van den Brand, Ronald Brandtjen, Joos C. A. M. Buijs¹, Andrea Burattin⁷, Josep Carmona⁸, Malu Castellanos⁹, Jan Claes¹⁰, Jonathan Cook¹¹, Nicola Costantini, Francisco Curbera¹², Ernesto Damiani¹³, Massimiliano de Leoni¹, Pavlos Delias, Boudewijn F. van Dongen¹, Marlon Dumas¹⁴, Schahram Dustdar¹⁵, Dirk Fahland¹, Diogo R. Ferreira¹⁶, Walid Gaaloul¹⁷, Frank van Geffen¹⁸, Sukriti Goel¹⁹, CW Christian Günther, Antonella Guzzo²⁰, Paul Harmon, Arthur H. M. ter Hofstede¹, Arthur H. M. ter Hofstede², John Hoogland, Jon Espen Ingvaldsen, Koki Kato²¹, Rudolf Kuhn, Akhil Kumar²², Marcello La Rosa², Fabrizio Maria Maggi¹, Donato Malerba²³, RS Ronny Mans¹, Alberto Manuel, Martin McCreesh, Paola Mello²⁴, Jan Mendling²⁵, Marco Montali²⁶, Hamid Reza Motahari-Nezhad⁹, Michael zur Muehlen²⁷, Jorge Munoz-Gama⁸, Luigi Pontieri²⁸, Joel Ribeiro¹, A Anne Rozinat, Hugo Seguel Pérez, Ricardo Seguel Pérez, Marcos Sepúlveda²⁹, Jim Sinur, Pnina Soffer³⁰, Minseok Song³¹, Alessandro Sperduti⁷, Giovanni Stilo⁴, Casper Stoel, Keith D. Swenson²¹, Maurizio Talamo⁴, Wei Tan¹², Christopher Turner³², Jan Vanthienen³³, George Varvaressos, Eric Verbeek¹, Marc Verdonk³⁴, Roberto Vigo, Jianmin Wang³⁵, Barbara Weber³⁶, Matthias Weidlich³⁷, Ton Weijters¹, Lijie Wen³⁵, Michael Westergaard¹, Moe Thandar Wynn² - Show less +75 more•Institutions (37)

Eindhoven University of Technology¹, Queensland University of Technology², Capgemini³, University of Rome Tor Vergata⁴, Humboldt University of Berlin⁵, Software AG⁶, University of Padua⁷, Polytechnic University of Catalonia⁸, Hewlett-Packard⁹, Ghent University¹⁰, New Mexico State University¹¹, IBM¹², University of Milan¹³, University of Tartu¹⁴, University of Vienna¹⁵, Technical University of Lisbon¹⁶, Telecom SudParis¹⁷, Rabobank¹⁸, Infosys¹⁹, University of Calabria²⁰, Fujitsu²¹, Pennsylvania State University²², University of Bari²³, University of Bologna²⁴, Vienna University of Economics and Business²⁵, Free University of Bozen-Bolzano²⁶, Stevens Institute of Technology²⁷, Indian Council of Agricultural Research²⁸, Pontifical Catholic University of Chile²⁹, University of Haifa³⁰, Ulsan National Institute of Science and Technology³¹, Cranfield University³², Katholieke Universiteit Leuven³³, Deloitte³⁴, Tsinghua University³⁵, University of Innsbruck³⁶, Hasso Plattner Institute³⁷

01 Jan 2012

TL;DR: This manifesto hopes to serve as a guide for software developers, scientists, consultants, business managers, and end-users to increase the maturity of process mining as a new tool to improve the design, control, and support of operational business processes.

...read moreread less

Abstract: Process mining techniques are able to extract knowledge from event logs commonly available in today’s information systems. These techniques provide new means to discover, monitor, and improve processes in a variety of application domains. There are two main drivers for the growing interest in process mining. On the one hand, more and more events are being recorded, thus, providing detailed information about the history of processes. On the other hand, there is a need to improve and support business processes in competitive and rapidly changing environments. This manifesto is created by the IEEE Task Force on Process Mining and aims to promote the topic of process mining. Moreover, by defining a set of guiding principles and listing important challenges, this manifesto hopes to serve as a guide for software developers, scientists, consultants, business managers, and end-users. The goal is to increase the maturity of process mining as a new tool to improve the (re)design, control, and support of operational business processes.

...read moreread less

1,135 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72

Collapse

Cited by

PDF

Open Access

More filters

Pattern Recognition and Machine Learning

[...]

Christopher M. Bishop¹•Institutions (1)

Microsoft¹

01 Jan 2006

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

...read moreread less

10,141 citations

Data Mining - Concepts and Techniques.

[...]

Petra Perner

01 Jan 2002

9,314 citations

Book Chapter•DOI•

Domain-adversarial training of neural networks

[...]

Yaroslav Ganin¹, Evgeniya Ustinova¹, Hana Ajakan², Pascal Germain², Hugo Larochelle³, François Laviolette², Mario Marchand², Victor Lempitsky¹ - Show less +4 more•Institutions (3)

Skolkovo Institute of Science and Technology¹, Laval University², Université de Sherbrooke³

01 Jan 2016-Journal of Machine Learning Research

TL;DR: In this article, a new representation learning approach for domain adaptation is proposed, in which data at training and test time come from similar but different distributions, and features that cannot discriminate between the training (source) and test (target) domains are used to promote the emergence of features that are discriminative for the main learning task on the source domain.

...read moreread less

Abstract: We introduce a new representation learning approach for domain adaptation, in which data at training and test time come from similar but different distributions. Our approach is directly inspired by the theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on features that cannot discriminate between the training (source) and test (target) domains. The approach implements this idea in the context of neural network architectures that are trained on labeled data from the source domain and unlabeled data from the target domain (no labeled target-domain data is necessary). As the training progresses, the approach promotes the emergence of features that are (i) discriminative for the main learning task on the source domain and (ii) indiscriminate with respect to the shift between the domains. We show that this adaptation behaviour can be achieved in almost any feed-forward model by augmenting it with few standard layers and a new gradient reversal layer. The resulting augmented architecture can be trained using standard backpropagation and stochastic gradient descent, and can thus be implemented with little effort using any of the deep learning packages. We demonstrate the success of our approach for two distinct classification problems (document sentiment analysis and image classification), where state-of-the-art domain adaptation performance on standard benchmarks is achieved. We also validate the approach for descriptor learning task in the context of person re-identification application.

...read moreread less

4,862 citations

Proceedings Article•DOI•

Adversarial Discriminative Domain Adaptation

[...]

Eric Tzeng¹, Judy Hoffman², Kate Saenko³, Trevor Darrell¹•Institutions (3)

University of California, Berkeley¹, Stanford University², Boston University³

21 Jul 2017

TL;DR: Adversarial Discriminative Domain Adaptation (ADDA) as mentioned in this paper combines discriminative modeling, untied weight sharing, and a generative adversarial network (GAN) loss.

...read moreread less

Abstract: Adversarial learning methods are a promising approach to training robust deep networks, and can generate complex samples across diverse domains. They can also improve recognition despite the presence of domain shift or dataset bias: recent adversarial approaches to unsupervised domain adaptation reduce the difference between the training and test domain distributions and thus improve generalization performance. However, while generative adversarial networks (GANs) show compelling visualizations, they are not optimal on discriminative tasks and can be limited to smaller shifts. On the other hand, discriminative approaches can handle larger domain shifts, but impose tied weights on the model and do not exploit a GAN-based loss. In this work, we first outline a novel generalized framework for adversarial adaptation, which subsumes recent state-of-the-art approaches as special cases, and use this generalized view to better relate prior approaches. We then propose a previously unexplored instance of our general framework which combines discriminative modeling, untied weight sharing, and a GAN loss, which we call Adversarial Discriminative Domain Adaptation (ADDA). We show that ADDA is more effective yet considerably simpler than competing domain-adversarial methods, and demonstrate the promise of our approach by exceeding state-of-the-art unsupervised adaptation results on standard domain adaptation tasks as well as a difficult cross-modality object classification task.

...read moreread less

4,288 citations

Journal Article•

“Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告

[...]

杉山拓海

12 Sep 2017-Computers & Graphics

3,940 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse