scispace - formally typeset
Search or ask a question
Author

Jing Yan

Bio: Jing Yan is an academic researcher. The author has contributed to research in topics: Random forest & Overfitting. The author has an hindex of 2, co-authored 3 publications receiving 75 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: The validation test on UCI data sets demonstrates that for imbalanced medical data, the proposed method enhanced the overall performance of the classifier while producing high accuracy in identifying both majority and minority class.
Abstract: The classification in class imbalanced data has drawn significant interest in medical application. Most existing methods are prone to categorize the samples into the majority class, resulting in bias, in particular the insufficient identification of minority class. A kind of novel approach, class weights random forest is introduced to address the problem, by assigning individual weights for each class instead of a single weight. The validation test on UCI data sets demonstrates that for imbalanced medical data, the proposed method enhanced the overall performance of the classifier while producing high accuracy in identifying both majority and minority class.

128 citations

Journal ArticleDOI
Jing Xia1, Su Pan1, Molei Yan, Guolong Cai, Jing Yan, Gangmin Ning1 
TL;DR: The results of application on sepsis revealed that the transLSTM model of only 100 training samples had comparable mortality prediction performance to the traditional model of 250 training samples and has significant advantages with higher prediciton accuracy and faster training speed.

5 citations

Book ChapterDOI
Jing Xia1, Min Zhu1, Shengyu Zhang1, Molei Yan, Guolong Cai, Jing Yan, Gangmin Ning1 
01 Jan 2015
TL;DR: Preliminary results exhibited that the established model is potential to help improve the patients’ management by quickly stratifying the sepsis severity and is superior to the conventional APACHE scoring method.
Abstract: Sepsis is a kind of systemic inflammatory response syndrome caused by infection and it endangers the life of patients seriously due to its rapid development progression and high mortality rate. In clinic it is highly demanded to quantitatively stratify the severity of sepsis for individual management. This work aimed to build a quantitative model for sepsis patients which can stratify the disease severity in three levels. For this purpose, clinical data were collected and preprocessed, i.e. screening, normalization and data replenishing. Afterwards, sepsis sensitive parameters were tested and selected, which were utilized as the input of the stratification model. For the model, the algorithm of Support Vector Machine was applied. Eventually, the model was tested in total of 522 clinical cases and an accuracy of 67.5% in stratification was achieved. The performance of the established model is superior to the conventional APACHE scoring method. Preliminary results exhibited that the established model is potential to help improve the patients’ management by quickly stratifying the sepsis severity.

Cited by
More filters
Journal ArticleDOI
10 Jul 2018-Sensors
TL;DR: A deep neural network model that integrates the CNN and LSTM architectures is developed, and through historical data such as cumulated hours of rain, cumulated wind speed and PM2.5 concentration, the forecasting accuracy of the proposed CNN-LSTM model (APNet) is verified to be the highest in this paper.
Abstract: In modern society, air pollution is an important topic as this pollution exerts a critically bad influence on human health and the environment. Among air pollutants, Particulate Matter (PM2.5) consists of suspended particles with a diameter equal to or less than 2.5 μm. Sources of PM2.5 can be coal-fired power generation, smoke, or dusts. These suspended particles in the air can damage the respiratory and cardiovascular systems of the human body, which may further lead to other diseases such as asthma, lung cancer, or cardiovascular diseases. To monitor and estimate the PM2.5 concentration, Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) are combined and applied to the PM2.5 forecasting system. To compare the overall performance of each algorithm, four measurement indexes, Mean Absolute Error (MAE), Root Mean Square Error (RMSE) Pearson correlation coefficient and Index of Agreement (IA) are applied to the experiments in this paper. Compared with other machine learning methods, the experimental results showed that the forecasting accuracy of the proposed CNN-LSTM model (APNet) is verified to be the highest in this paper. For the CNN-LSTM model, its feasibility and practicability to forecast the PM2.5 concentration are also verified in this paper. The main contribution of this paper is to develop a deep neural network model that integrates the CNN and LSTM architectures, and through historical data such as cumulated hours of rain, cumulated wind speed and PM2.5 concentration. In the future, this study can also be applied to the prevention and control of PM2.5.

426 citations

Journal Article
TL;DR: Patients with vertebral, hip, distal radius, and proximal humerus fractures are most common among the osteoporosis-related fractures.
Abstract: Patients with vertebral, hip, distal radius, and proximal humerus fractures are most common among the osteoporosis-related fractures. The incidences of these fractures increase with age, however, the increase patterns differ between the fracture sites. The prevalence of vertebral fracture for Japanese is similar or slightly higher and the incidences of osteoporosis-related limb fractures are lower than those for Caucacians. A decrease in prevalence of vertebral fractures and an increase in the incidence of limb fractures are the secular trend in Japan. Previous fractures are significant risk factor for both vertebral and hip fractures. Greater physical activity increases the risk of distal radius fractures, and decreases the risk of proximal humerus fractures.

364 citations

Journal ArticleDOI
TL;DR: Experimental results show that compared with other traditional machine learning methods, the prediction performance of the estimating model proposed in this paper is proven to be the best and the feasibility and practicality of electricity price prediction is confirmed.
Abstract: Electricity price is a key influencer in the electricity market. Electricity market trades by each participant are based on electricity price. The electricity price adjusted with the change in supply and demand relationship can reflect the real value of electricity in the transaction process. However, for the power generating party, bidding strategy determines the level of profit, and the accurate prediction of electricity price could make it possible to determine a more accurate bidding price. This cannot only reduce transaction risk, but also seize opportunities in the electricity market. In order to effectively estimate electricity price, this paper proposes an electricity price forecasting system based on the combination of 2 deep neural networks, the Convolutional Neural Network (CNN) and the Long Short Term Memory (LSTM). In order to compare the overall performance of each algorithm, the Mean Absolute Error (MAE) and Root-Mean-Square error (RMSE) evaluating measures were applied in the experiments of this paper. Experiment results show that compared with other traditional machine learning methods, the prediction performance of the estimating model proposed in this paper is proven to be the best. By combining the CNN and LSTM models, the feasibility and practicality of electricity price prediction is also confirmed in this paper.

131 citations

Journal ArticleDOI
TL;DR: This study found that of the six medical tasks that exist, the diagnosis medical task was that most frequently researched, and that the experiment-based empirical type and evaluation-based research type were the most dominant approaches adopted in the selected studies.

128 citations

Journal ArticleDOI
TL;DR: This survey provides four deep learning model series, which includes CNN series, GAN series, ELM-RVFL series, and other series, for comprehensive understanding towards the analytical techniques of image processing field, clarify the most important advancements and shed some light on future studies.
Abstract: During the past decade, deep learning is one of the essential breakthroughs made in artificial intelligence. In particular, it has achieved great success in image processing. Correspondingly, various applications related to image processing are also promoting the rapid development of deep learning in all aspects of network structure, layer designing, and training tricks. However, the deeper structure makes the back-propagation algorithm more difficult. At the same time, the scale of training images without labels is also rapidly increasing, and class imbalance severely affects the performance of deep learning, these urgently require more novelty deep models and new parallel computing system to more effectively interpret the content of the image and form a suitable analysis mechanism. In this context, this survey provides four deep learning model series, which includes CNN series, GAN series, ELM-RVFL series, and other series, for comprehensive understanding towards the analytical techniques of image processing field, clarify the most important advancements and shed some light on future studies. By further studying the relationship between deep learning and image processing tasks, which can not only help us understand the reasons for the success of deep learning but also inspires new deep models and training methods. More importantly, this survey aims to improve or arouse other researchers to catch a glimpse of the state-of-the-art deep learning methods in the field of image processing and facilitate the applications of these deep learning technologies in their research tasks. Besides, we discuss the open issues and the promising directions of future research in image processing using the new generation of deep learning.

113 citations