Home
/
Authors
/
Toshiki Kataoka

Author

Toshiki Kataoka

Bio: Toshiki Kataoka is an academic researcher. The author has contributed to research in topics: Reinforcement learning & Normalization (statistics). The author has an hindex of 4, co-authored 4 publications receiving 3001 citations.

Papers

PDF

Open Access

More filters

Proceedings Article•

Spectral Normalization for Generative Adversarial Networks

[...]

Takeru Miyato¹, Toshiki Kataoka, Masanori Koyama², Yuichi Yoshida³•Institutions (3)

Kyoto University¹, Ritsumeikan University², National Institute of Informatics³

15 Feb 2018

TL;DR: In this paper, the authors proposed a novel weight normalization technique called spectral normalization to stabilize the training of the discriminator, which is computationally light and easy to incorporate into existing implementations.

...read moreread less

Abstract: One of the challenges in the study of generative adversarial networks is the instability of its training. In this paper, we propose a novel weight normalization technique called spectral normalization to stabilize the training of the discriminator. Our new normalization technique is computationally light and easy to incorporate into existing implementations. We tested the efficacy of spectral normalization on CIFAR10, STL-10, and ILSVRC2012 dataset, and we experimentally confirmed that spectrally normalized GANs (SN-GANs) is capable of generating images of better or equal quality relative to the previous training stabilization techniques.

...read moreread less

2,640 citations

Posted Content•

Spectral Normalization for Generative Adversarial Networks

[...]

Takeru Miyato¹, Toshiki Kataoka, Masanori Koyama², Yuichi Yoshida³•Institutions (3)

Kyoto University¹, Ritsumeikan University², National Institute of Informatics³

16 Feb 2018-arXiv: Learning

TL;DR: In this article, the authors proposed a novel weight normalization technique called spectral normalization to stabilize the training of the discriminator, which is computationally light and easy to incorporate into existing implementations.

...read moreread less

1,194 citations

Posted Content•

ChainerRL: A Deep Reinforcement Learning Library

[...]

Yasuhiro Fujita, Prabhat Nagarajan, Toshiki Kataoka, Takahiro Ishikawa¹•Institutions (1)

University of Tokyo¹

09 Dec 2019-arXiv: Learning

TL;DR: ChainerRL is an open-source Deep Reinforcement Learning library built using Python and the Chainer deep learning framework that implements a comprehensive set of DRL algorithms and techniques drawn from the state-of-the-art research in the field.

...read moreread less

Abstract: In this paper, we introduce ChainerRL, an open-source deep reinforcement learning (DRL) library built using Python and the Chainer deep learning framework. ChainerRL implements a comprehensive set of DRL algorithms and techniques drawn from state-of-the-art research in the field. To foster reproducible research, and for instructional purposes, ChainerRL provides scripts that closely replicate the original papers' experimental settings and reproduce published benchmark results for several algorithms. Lastly, ChainerRL offers a visualization tool that enables the qualitative inspection of trained agents. The ChainerRL source code can be found on GitHub: this https URL.

...read moreread less

56 citations

Journal Article•

ChainerRL: A Deep Reinforcement Learning Library

[...]

Yasuhiro Fujita, Prabhat Nagarajan, Toshiki Kataoka, Takahiro Ishikawa

01 Jan 2021-Journal of Machine Learning Research

TL;DR: ChainerRL as discussed by the authors is an open-source deep reinforcement learning (DRL) library built using Python and the Chainer deep learning framework, which implements a comprehensive set of DRL algorithms and techniques drawn from state-of-the-art research in the field.

...read moreread less

29 citations

Journal Article•DOI•

Molecular Design Method Using a Reversible Tree Representation of Chemical Compounds and Deep Reinforcement Learning

[...]

Ryuichiro Ishitani, Toshiki Kataoka, Kentaro Rikimaru

12 Aug 2022-Journal of Chemical Information and Modeling

TL;DR: A novel coarse-grained tree representation of molecules (Reversible Junction Tree) is designed, which is reversely convertible to the original molecule without external information, and formulated the molecular design and optimization problem as a tree-structure construction using deep reinforcement learning (“RJT-RL”).

...read moreread less

Abstract: Automatic design of molecules with specific chemical and biochemical properties is an important process in material informatics and computational drug discovery. In this study, we designed a novel coarse-grained tree representation of molecules (Reversible Junction Tree; “RJT”) for the aforementioned purposes, which is reversely convertible to the original molecule without external information. By leveraging this representation, we further formulated the molecular design and optimization problem as a tree-structure construction using deep reinforcement learning (“RJT-RL”). In this method, all of the intermediate and final states of reinforcement learning are convertible to valid molecules, which could efficiently guide the optimization process in simple benchmark tasks. We further examined the multiobjective optimization and fine-tuning of the reinforcement learning models using RJT-RL, demonstrating the applicability of our method to more realistic tasks in drug discovery.

...read moreread less

2 citations

Cited by

PDF

Open Access

More filters

Posted Content•

Denoising Diffusion Probabilistic Models

[...]

Jonathan Ho¹, Ajay Jain¹, Pieter Abbeel¹•Institutions (1)

University of California, Berkeley¹

19 Jun 2020-arXiv: Learning

TL;DR: High quality image synthesis results are presented using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics, which naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding.

...read moreread less

Abstract: We present high quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics. Our best results are obtained by training on a weighted variational bound designed according to a novel connection between diffusion probabilistic models and denoising score matching with Langevin dynamics, and our models naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding. On the unconditional CIFAR10 dataset, we obtain an Inception score of 9.46 and a state-of-the-art FID score of 3.17. On 256x256 LSUN, we obtain sample quality similar to ProgressiveGAN. Our implementation is available at this https URL

...read moreread less

2,704 citations

Posted Content•

Analyzing and Improving the Image Quality of StyleGAN

[...]

Tero Karras¹, Samuli Laine¹, Miika Aittala¹, Janne Hellsten¹, Jaakko Lehtinen², Timo Aila¹ - Show less +2 more•Institutions (2)

Nvidia¹, Aalto University²

03 Dec 2019-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work redesigns the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images, and thereby redefines the state of the art in unconditional image modeling.

...read moreread less

Abstract: The style-based GAN architecture (StyleGAN) yields state-of-the-art results in data-driven unconditional generative image modeling. We expose and analyze several of its characteristic artifacts, and propose changes in both model architecture and training methods to address them. In particular, we redesign the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images. In addition to improving image quality, this path length regularizer yields the additional benefit that the generator becomes significantly easier to invert. This makes it possible to reliably attribute a generated image to a particular network. We furthermore visualize how well the generator utilizes its output resolution, and identify a capacity problem, motivating us to train larger models for additional quality improvements. Overall, our improved model redefines the state of the art in unconditional image modeling, both in terms of existing distribution quality metrics as well as perceived image quality.

...read moreread less

2,411 citations

Proceedings Article•DOI•

Semantic Image Synthesis With Spatially-Adaptive Normalization

[...]

Taesung Park¹, Ming-Yu Liu², Ting-Chun Wang², Jun-Yan Zhu³•Institutions (3)

University of California, Berkeley¹, Nvidia², Massachusetts Institute of Technology³

18 Mar 2019

TL;DR: S spatially-adaptive normalization is proposed, a simple but effective layer for synthesizing photorealistic images given an input semantic layout that allows users to easily control the style and content of image synthesis results as well as create multi-modal results.

...read moreread less

Abstract: We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the network, forcing the network to memorize the information throughout all the layers. Instead, we propose using the input layout for modulating the activations in normalization layers through a spatially-adaptive, learned affine transformation. Experiments on several challenging datasets demonstrate the superiority of our method compared to existing approaches, regarding both visual fidelity and alignment with input layouts. Finally, our model allows users to easily control the style and content of image synthesis results as well as create multi-modal results. Code is available upon publication.

...read moreread less

2,159 citations

Posted Content•

Self-Attention Generative Adversarial Networks

[...]

Han Zhang¹, Ian Goodfellow¹, Dimitris N. Metaxas², Augustus Odena¹•Institutions (2)

Google¹, Rutgers University²

21 May 2018-arXiv: Machine Learning

TL;DR: Self-Attention Generative Adversarial Network (SAGAN) as mentioned in this paper uses attention-driven, long-range dependency modeling for image generation tasks and achieves state-of-the-art results.

...read moreread less

Abstract: In this paper, we propose the Self-Attention Generative Adversarial Network (SAGAN) which allows attention-driven, long-range dependency modeling for image generation tasks. Traditional convolutional GANs generate high-resolution details as a function of only spatially local points in lower-resolution feature maps. In SAGAN, details can be generated using cues from all feature locations. Moreover, the discriminator can check that highly detailed features in distant portions of the image are consistent with each other. Furthermore, recent work has shown that generator conditioning affects GAN performance. Leveraging this insight, we apply spectral normalization to the GAN generator and find that this improves training dynamics. The proposed SAGAN achieves the state-of-the-art results, boosting the best published Inception score from 36.8 to 52.52 and reducing Frechet Inception distance from 27.62 to 18.65 on the challenging ImageNet dataset. Visualization of the attention layers shows that the generator leverages neighborhoods that correspond to object shapes rather than local regions of fixed shape.

...read moreread less

2,106 citations

Proceedings Article•DOI•

Analyzing and Improving the Image Quality of StyleGAN

[...]

Tero Karras¹, Samuli Laine¹, Miika Aittala¹, Janne Hellsten¹, Jaakko Lehtinen², Timo Aila¹ - Show less +2 more•Institutions (2)

Nvidia¹, Aalto University²

14 Jun 2020

TL;DR: In this paper, the authors propose to redesign the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images.

...read moreread less

2,006 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse