scispace - formally typeset
Search or ask a question
Author

Song Wang

Bio: Song Wang is an academic researcher from Fujitsu. The author has contributed to research in topics: Convolutional neural network & Artificial neural network. The author has an hindex of 7, co-authored 25 publications receiving 208 citations.

Papers
More filters
Proceedings ArticleDOI
Li Chen1, Song Wang1, Wei Fan1, Jun Sun1, Satoshi Naoi1 
01 Nov 2015
TL;DR: In the experiments, the proposed CNN-based handwritten character recognition framework performed even better than human on handwritten digit (MNIST) and Chinese character (CASIA) recognition.
Abstract: Because of the various appearance (different writers, writing styles, noise, etc.), the handwritten character recognition is one of the most challenging task in pattern recognition. Through decades of research, the traditional method has reached its limit while the emergence of deep learning provides a new way to break this limit. In this paper, a CNN-based handwritten character recognition framework is proposed. In this framework, proper sample generation, training scheme and CNN network structure are employed according to the properties of handwritten characters. In the experiments, the proposed framework performed even better than human on handwritten digit (MNIST) and Chinese character (CASIA) recognition. The advantage of this framework is proved by these experimental results.

117 citations

Proceedings ArticleDOI
Song Wang1, Li Chen1, Liang Xu1, Wei Fan1, Jun Sun1, Satoshi Naoi1 
01 Oct 2016
TL;DR: The experimental results showed that the proposed framework could achieve much better performance than the state-of-the-art methods and can also be applied to other time sequence problems, such as speech recognition and video analysis.
Abstract: It is well known that the handwritten Chinese text recognition is a difficult problem since there are a large number of classes. In order to solve this problem, we proposed a whole new framework for unconstrained handwritten Chinese text recognition. The core module of the framework is the heterogeneous CNN trained by deep knowledge. The experimental results showed that our proposed method could achieve much better performance than the state-of-the-art methods (96.28% vs. 91.39% of CR on CASIA test set). Moreover, since the proposed framework is general, it can also be applied to other time sequence problems, such as speech recognition and video analysis.

53 citations

Patent
Li Chen1, Song Wang1, Wei Fan1, Jun Sun1, Naoi Satoshi1 
02 Mar 2017
TL;DR: In this article, a training method and a training apparatus for a neutral network for image recognition is provided, where a sample image is represented as a point set in a high-dimensional space.
Abstract: A training method and a training apparatus for a neutral network for image recognition are provided. The method includes: representing a sample image as a point set in a high-dimensional space, a size of the high-dimensional space being a size of space domain of the sample image multiplied by a size of intensity domain of the sample image; generating a first random perturbation matrix having a same size as the high-dimensional space; smoothing the first random perturbation matrix; perturbing the point set in the high-dimensional space using the smoothed first random perturbation matrix to obtain a perturbed point set; and training the neutral network using the perturbed point set as a new sample. With the training method and the training apparatus, classification performance of a conventional convolutional neural network is improved, thereby generating more training samples, reducing influence of overfitting, and enhancing generalization performance of the convolutional neural network.

13 citations

Proceedings ArticleDOI
01 Jul 2017
TL;DR: An automated CNN recommendation system for image classification task that is able to evaluate the complexity of the classification task and the classification ability of the CNN model precisely and can recommend the optimal CNN model and which can match the task perfectly.
Abstract: Nowadays the CNN is widely used in practical applications for image classification task. However the design of the CNN model is very professional work and which is very difficult for ordinary users. Besides, even for experts of CNN, to select an optimal model for specific task may still need a lot of time (to train many different models). In order to solve this problem, we proposed an automated CNN recommendation system for image classification task. Our system is able to evaluate the complexity of the classification task and the classification ability of the CNN model precisely. By using the evaluation results, the system can recommend the optimal CNN model and which can match the task perfectly. The recommendation process of the system is very fast since we don't need any model training. The experiment results proved that the evaluation methods are very accurate and reliable.

10 citations

Patent
Song Wang1, Wei Fan1, Jun Sun1
27 Apr 2017
TL;DR: In this paper, a deep neural network is obtained by inputting training samples comprising positive samples and negative samples into an input layer of the DNN and training, and a judging unit is configured to judge that a sample to be recognized is a suspected abnormal sample when confidences of positive sample classes in a classification result outputted by an output layer of DNN are all less than a predefined threshold value.
Abstract: A recognition apparatus based on a deep neural network, a training apparatus and methods thereof. The deep neural network is obtained by inputting training samples comprising positive samples and negative samples into an input layer of the deep neural network and training. The apparatus includes: a judging unit configured to judge that a sample to be recognized is a suspected abnormal sample when confidences of positive sample classes in a classification result outputted by an output layer of the deep neural network are all less than a predefined threshold value. Hence, reliability of a confidence of a classification result outputted by the deep neural network may be efficiently improved.

9 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper proposed a framework by using the recurrent neural network (RNN) as both a discriminative model for recognizing Chinese characters and a generator model for drawing (generating) Chinese characters.
Abstract: Recent deep learning based approaches have achieved great success on handwriting recognition. Chinese characters are among the most widely adopted writing systems in the world. Previous research has mainly focused on recognizing handwritten Chinese characters. However, recognition is only one aspect for understanding a language, another challenging and interesting task is to teach a machine to automatically write (pictographic) Chinese characters. In this paper, we propose a framework by using the recurrent neural network (RNN) as both a discriminative model for recognizing Chinese characters and a generative model for drawing (generating) Chinese characters. To recognize Chinese characters, previous methods usually adopt the convolutional neural network (CNN) models which require transforming the online handwriting trajectory into image-like representations. Instead, our RNN based approach is an end-to-end system which directly deals with the sequential structure and does not require any domain-specific knowledge. With the RNN system (combining an LSTM and GRU), state-of-the-art performance can be achieved on the ICDAR-2013 competition database. Furthermore, under the RNN framework, a conditional generative model with character embedding is proposed for automatically drawing recognizable Chinese characters. The generated characters (in vector format) are human-readable and also can be recognized by the discriminative RNN model with high accuracy. Experimental results verify the effectiveness of using RNNs as both generative and discriminative models for the tasks of drawing and recognizing Chinese characters.

252 citations

Journal ArticleDOI
TL;DR: A novel CNN model called CoroDet for automatic detection of COVID-19 by using raw chest X-ray and CT scan images have been proposed and the experimental results indicate the superiority of Corodet over the existing state-of-the-art-methods.
Abstract: Background and Objective The Coronavirus 2019, or shortly COVID-19, is a viral disease that causes serious pneumonia and impacts our different body parts from mild to severe depending on patient’s immune system. This infection was first reported in Wuhan city of China in December 2019, and afterward, it became a global pandemic spreading rapidly around the world. As the virus spreads through human to human contact, it has affected our lives in a devastating way, including the vigorous pressure on the public health system, the world economy, education sector, workplaces, and shopping malls. Preventing viral spreading requires early detection of positive cases and to treat infected patients as quickly as possible. The need for COVID-19 testing kits has increased, and many of the developing countries in the world are facing a shortage of testing kits as new cases are increasing day by day. In this situation, the recent research using radiology imaging (such as X-ray and CT scan) techniques can be proven helpful to detect COVID-19 as X-ray and CT scan images provide important information about the disease caused by COVID-19 virus. The latest data mining and machine learning techniques such as Convolutional Neural Network (CNN) can be applied along with X-ray and CT scan images of the lungs for the accurate and rapid detection of the disease, assisting in mitigating the problem of scarcity of testing kits. Methods Hence a novel CNN model called CoroDet for automatic detection of COVID-19 by using raw chest X-ray and CT scan images have been proposed in this study. CoroDet is developed to serve as an accurate diagnostics for 2 class classification (COVID and Normal), 3 class classification (COVID, Normal, and non-COVID pneumonia), and 4 class classification (COVID, Normal, non-COVID viral pneumonia, and non-COVID bacterial pneumonia). Results The performance of our proposed model was compared with ten existing techniques for COVID detection in terms of accuracy. A classification accuracy of 99.1% for 2 class classification, 94.2% for 3 class classification, and 91.2% for 4 class classification was produced by our proposed model, which is obviously better than the state-of-the-art-methods used for COVID-19 detection to the best of our knowledge. Moreover, the dataset with x-ray images that we prepared for the evaluation of our method is the largest datasets for COVID detection as far as our knowledge goes. Conclusion The experimental results of our proposed method CoroDet indicate the superiority of CoroDet over the existing state-of-the-art-methods. CoroDet may assist clinicians in making appropriate decisions for COVID-19 detection and may also mitigate the problem of scarcity of testing kits.

251 citations

Journal ArticleDOI
TL;DR: In this article, a new adaptation layer is proposed to reduce the mismatch between training and test data on a particular source layer, and the adaptation process can be efficiently and effectively implemented in an unsupervised manner.

232 citations

Journal ArticleDOI
TL;DR: Evaluating comprehensively neural network language models (NNLMs) and hybrid NNLMs in handwritten Chinese text recognition and replacing the baseline character classifier, over-segmentation, and geometric context models with convolutional neural network (CNN) based models.

144 citations

Journal ArticleDOI
TL;DR: This paper proposed a new method for building fast and compact CNN model for large scale handwritten Chinese character recognition (HCCR) using Adaptive Drop Weight (ADW) for effectively pruning CNN parameters.

112 citations