scispace - formally typeset
Open AccessJournal ArticleDOI

Spatio-temporal estimation of the daily cases of COVID-19 in worldwide using random forest machine learning algorithm.

Cafer Mert Yeşilkanat
- 01 Nov 2020 - 
- Vol. 140, pp 110210-110210
TLDR
The results show that the random forest machine learning algorithm performs well in estimating the number of cases for the near future in case of an epidemic like Novel Coronavirus, which outbreaks suddenly and spreads rapidly.
Abstract
Novel Coronavirus pandemic, which negatively affected public health in social, psychological and economical terms, spread to the whole world in a short period of 6 months However, the rate of increase in cases was not equal for every country The measures implemented by the countries changed the daily spreading speed of the disease This was determined by changes in the number of daily cases In this study, the performance of the Random Forest (RF) machine learning algorithm was investigated in estimating the near future case numbers for 190 countries in the world and it is mapped in comparison with actual confirmed cases results The number of confirmed cases between 23/01/2020 - 17/06/2020 were divided into 3 main sub-datasets: training sub-data, testing sub-data (interpolation data) and estimating sub-data (extrapolation data) for the random forest model At the end of the study, it has been found that R2 values for testing sub-data of RF model estimates range between 0843 and 0995 (average R2= 0959), and RMSE values between 14176 and 52618 (mean RMSE = 25938); and that R2 values for estimating sub-data range between 0690 and 0968 (mean R2 = 0914), and RMSE values between 54973 and 250079 (mean RMSE = 90937) These results show that the random forest machine learning algorithm performs well in estimating the number of cases for the near future in case of an epidemic like Novel Coronavirus, which outbreaks suddenly and spreads rapidly

read more

Citations
More filters
Journal ArticleDOI

Data analysis of Covid-19 pandemic and short-term cumulative case forecasting using machine learning time series methods.

TL;DR: In this study, data of COVID-19 between 20/01/2020 and 18/09/2020 for USA, Germany and Global was obtained from World Health Organization and time series prediction model using machine learning was proposed to obtain the curve of disease and forecast the epidemic tendency.
Journal ArticleDOI

A systematic review on AI/ML approaches against COVID-19 outbreak

TL;DR: In this paper, the authors conducted a systematic literature review for framing the research questions, searching criteria and relevant data extraction, and 264 studies were taken into account after following inclusion and exclusion criteria.
Journal ArticleDOI

Comprehensive Survey of Using Machine Learning in the COVID-19 Pandemic.

TL;DR: In this article, the authors survey the decisive role of AI as a technology used to fight against the COVID-19 pandemic and highlight the open research challenges that could inspire the future application of AI in COVID19.
Journal ArticleDOI

Monitoring and Recognizing Enterprise Public Opinion from High-Risk Users Based on User Portrait and Random Forest Algorithm

TL;DR: This paper combines user portrait technology and a random forest algorithm to help enterprises identify high-risk users who have posted negative comments and thus may trigger negative public opinion.
Journal ArticleDOI

Integrating feature engineering, genetic algorithm and tree-based machine learning methods to predict the post-accident disability status of construction workers

TL;DR: A comprehensive framework to predict the post-accident disability status of construction workers is developed through four tree-based ensemble machine learning models, as well as a state-of-the-art optimization method for hyperparameter tuning, Genetic Algorithm.
References
More filters
Journal Article

R: A language and environment for statistical computing.

R Core Team
- 01 Jan 2014 - 
TL;DR: Copyright (©) 1999–2012 R Foundation for Statistical Computing; permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and permission notice are preserved on all copies.
Journal ArticleDOI

Random Forests

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Book

ggplot2: Elegant Graphics for Data Analysis

TL;DR: This book describes ggplot2, a new data visualization package for R that uses the insights from Leland Wilkisons Grammar of Graphics to create a powerful and flexible system for creating data graphics.
Journal ArticleDOI

Bagging predictors

Leo Breiman
TL;DR: Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy.

Classification and Regression by randomForest

TL;DR: random forests are proposed, which add an additional layer of randomness to bagging and are robust against overfitting, and the randomForest package provides an R interface to the Fortran programs by Breiman and Cutler.
Related Papers (5)