scispace - formally typeset
Search or ask a question
Author

Sung-Wook Park

Bio: Sung-Wook Park is an academic researcher from Sunchon National University. The author has contributed to research in topics: Deep learning & Convolutional neural network. The author has an hindex of 4, co-authored 8 publications receiving 29 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: An overview of GAN and application solutions for computer vision and artificial intelligence healthcare field researchers was provided and the discussion tackled GAN’s problems and solutions, and the future research direction was finally proposed.
Abstract: The emergence of deep learning model GAN (Generative Adversarial Networks) is an important turning point in generative modeling. GAN is more powerful in feature and expression learning compared to machine learning-based generative model algorithms. Nowadays, it is also used to generate non-image data, such as voice and natural language. Typical technologies include BERT (Bidirectional Encoder Representations from Transformers), GPT-3 (Generative Pretrained Transformer-3), and MuseNet. GAN differs from the machine learning-based generative model and the objective function. Training is conducted by two networks: generator and discriminator. The generator converts random noise into a true-to-life image, whereas the discriminator distinguishes whether the input image is real or synthetic. As the training continues, the generator learns more sophisticated synthesis techniques, and the discriminator grows into a more accurate differentiator. GAN has problems, such as mode collapse, training instability, and lack of evaluation matrix, and many researchers have tried to solve these problems. For example, solutions such as one-sided label smoothing, instance normalization, and minibatch discrimination have been proposed. The field of application has also expanded. This paper provides an overview of GAN and application solutions for computer vision and artificial intelligence healthcare field researchers. The structure and principle of operation of GAN, the core models of GAN proposed to date, and the theory of GAN were analyzed. Application examples of GAN such as image classification and regression, image synthesis and inpainting, image-to-image translation, super-resolution and point registration were then presented. The discussion tackled GAN’s problems and solutions, and the future research direction was finally proposed.

32 citations

Journal ArticleDOI
01 Oct 2020
TL;DR: The linear estimation model developed in this study was validated using a single PV system and is possible to apply to other PV systems, even though the nature and error rates of the collected data may vary depending on the inverter manufacturer.
Abstract: The photovoltaic (PV) industry is an important part of the renewable energy industry. With the growing use of PV systems, interest in their operation and maintenance (O&M) is increasing. In this regard, analyses of power generation efficiency and inverter efficiency are very important. The first step in efficiency analysis is solar power estimation based on environment sensor data. In this study, solar power was estimated using a univariate linear regression model. The estimated solar power data were cross-validated with the actual solar power data obtained from the inverter. The results provide information on the power generation efficiency of the inverter. The linear estimation model developed in this study was validated using a single PV system. It is possible to apply the coefficients presented in this study to other PV systems, even though the nature and error rates of the collected data may vary depending on the inverter manufacturer. To apply the proposed model to PV systems with different power generation capacities, reconstructing the model according to the power generation capacity is necessary.

22 citations

Journal ArticleDOI
TL;DR: The proposed model did not cause mode collapse but converged to a better state than BEGAN-CS, which was improved in terms of loss function, but did not solve the mode collapse.
Abstract: In the field of deep learning, the generative model did not attract much attention until GANs (generative adversarial networks) appeared. In 2014, Google’s Ian Goodfellow proposed a generative model called GANs. GANs use different structures and objective functions from the existing generative model. For example, GANs use two neural networks: a generator that creates a realistic image, and a discriminator that distinguishes whether the input is real or synthetic. If there are no problems in the training process, GANs can generate images that are difficult even for experts to distinguish in terms of authenticity. Currently, GANs are the most researched subject in the field of computer vision, which deals with the technology of image style translation, synthesis, and generation, and various models have been unveiled. The issues raised are also improving one by one. In image synthesis, BEGAN (Boundary Equilibrium Generative Adversarial Network), which outperforms the previously announced GANs, learns the latent space of the image, while balancing the generator and discriminator. Nonetheless, BEGAN also has a mode collapse wherein the generator generates only a few images or a single one. Although BEGAN-CS (Boundary Equilibrium Generative Adversarial Network with Constrained Space), which was improved in terms of loss function, was introduced, it did not solve the mode collapse. The discriminator structure of BEGAN-CS is AE (AutoEncoder), which cannot create a particularly useful or structured latent space. Compression performance is not good either. In this paper, this characteristic of AE is considered to be related to the occurrence of mode collapse. Thus, we used VAE (Variational AutoEncoder), which added statistical techniques to AE. As a result of the experiment, the proposed model did not cause mode collapse but converged to a better state than BEGAN-CS.

19 citations

Proceedings ArticleDOI
01 Nov 2018
TL;DR: This paper has developed a classification application that can be used in mobile phones with high automation and portability to solve the above insect classification problems and proven that non-experts provide the appropriate performance to use.
Abstract: Insect identification has the disadvantage that it is difficult for non-experts to carry out due to the specificity of insects. Therefore, it is necessary for the general user to use auxiliary tools such as books to identify insects for education such as ecological learning. In recent years, researches using Deep Learning in fields such as object detection, behavior recognition, voice recognition as well as cancer detection in medical field have been actively conducted and show excellent results. In this paper, we developed a classification application that can be used in mobile phones with high automation and portability to solve the above insect classification problems. Experiments were conducted on 30 insect species selected for observable insects irrespective of environmental factors such as habitat and season, and the transform learning were applied to ResNet, which showed excellent performance in ILSVRC to classify forest insect. Our system achieved an average insect classification accuracy of 94%, an insect classification speed of 0.03 sec, and an insect photo transmission of 0.5 sec to output this information. This has proven that non-experts provide the appropriate performance to use.

17 citations

Journal ArticleDOI
TL;DR: It was concluded that ensemble in different models of high benchmarking scores is another way to get good results.
Abstract: In this paper, we compare and analyze the classification performance of deep learning algorithm Convolutional Neural Network(CNN) ac cording to ensemble generation and combining techniques. We used several CNN models(VGG16, VGG19, DenseNet121, DenseNet169, DenseNet201, ResNet18, ResNet34, ResNet50, ResNet101, ResNet152, GoogLeNet) to create 10 ensemble generation combinations and applied 6 combine techniques(average, weighted average, maximum, minimum, median, product) to the optimal combination. Experimental results, DenseNet169-VGG16-GoogLeNet combination in ensemble generation, and the product rule in ensemble combination showed the best performance. Based on this, it was concluded that ensemble in different models of high benchmarking scores is another way to get good results.

7 citations


Cited by
More filters
Christopher M. Bishop1
01 Jan 2006
TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

09 Mar 2012
TL;DR: Artificial neural networks (ANNs) constitute a class of flexible nonlinear models designed to mimic biological neural systems as mentioned in this paper, and they have been widely used in computer vision applications.
Abstract: Artificial neural networks (ANNs) constitute a class of flexible nonlinear models designed to mimic biological neural systems. In this entry, we introduce ANN using familiar econometric terminology and provide an overview of ANN modeling approach and its implementation methods. † Correspondence: Chung-Ming Kuan, Institute of Economics, Academia Sinica, 128 Academia Road, Sec. 2, Taipei 115, Taiwan; ckuan@econ.sinica.edu.tw. †† I would like to express my sincere gratitude to the editor, Professor Steven Durlauf, for his patience and constructive comments on early drafts of this entry. I also thank Shih-Hsun Hsu and Yu-Lieh Huang for very helpful suggestions. The remaining errors are all mine.

2,069 citations

01 Jan 2011
TL;DR: The method is suited to online forecasting in many applications and in this paper it is used to predict hourly values of solar power for horizons of up to 36 h, where the results indicate that for forecasts up to 2 h ahead the most important input is the available observations ofSolar power, while for longer horizons NWPs are theMost important input.
Abstract: This paper describes a new approach to online forecasting of power production from PV systems. The method is suited to online forecasting in many applications and in this paper it is used to predict hourly values of solar power for horizons of up to 36 h. The data used is 15-min observations of solar power from 21 PV systems located on rooftops in a small village in Denmark. The suggested method is a two-stage method where first a statistical normalization of the solar power is obtained using a clear sky model. The clear sky model is found using statistical smoothing techniques. Then forecasts of the normalized solar power are calculated using adaptive linear time series models. Both autoregressive (AR) and AR with exogenous input (ARX) models are evaluated, where the latter takes numerical weather predictions (NWPs) as input. The results indicate that for forecasts up to 2 h ahead the most important input is the available observations of solar power, while for longer horizons NWPs are the most important input. A root mean square error improvement of around 35% is achieved by the ARX model compared to a proposed reference model.

585 citations

Journal ArticleDOI
TL;DR: A crop pest recognition method that accurately recognizes ten common species of crop pests by applying several deep convolutional neural networks (CNNs) has the potential to be applied in real-world applications and further motivate research on crop disease identification.

114 citations