scispace - formally typeset
Search or ask a question
JournalISSN: 1673-7350

Frontiers of Computer Science 

Springer Nature
About: Frontiers of Computer Science is an academic journal. The journal publishes majorly in the area(s): Cloud computing & Cluster analysis. It has an ISSN identifier of 1673-7350. Over the lifetime, 1314 publications have been published receiving 11542 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: Challenges and possible research directions for each mainstream approach of ensemble learning are presented and an extra introduction is given for the combination of ensemblelearning with other machine learning hot spots such as deep learning, reinforcement learning, etc.
Abstract: Despite significant successes achieved in knowledge discovery, traditional machine learning methods may fail to obtain satisfactory performances when dealing with complex data, such as imbalanced, high-dimensional, noisy data, etc. The reason behind is that it is difficult for these methods to capture multiple characteristics and underlying structure of data. In this context, it becomes an important topic in the data mining field that how to effectively construct an efficient knowledge discovery and mining model. Ensemble learning, as one research hot spot, aims to integrate data fusion, data modeling, and data mining into a unified framework. Specifically, ensemble learning firstly extracts a set of features with a variety of transformations. Based on these learned features, multiple learning algorithms are utilized to produce weak predictive results. Finally, ensemble learning fuses the informative knowledge from the above results obtained to achieve knowledge discovery and better predictive performance via voting schemes in an adaptive way. In this paper, we review the research progress of the mainstream approaches of ensemble learning and classify them based on different characteristics. In addition, we present challenges and possible research directions for each mainstream approach of ensemble learning, and we also give an extra introduction for the combination of ensemble learning with other machine learning hot spots such as deep learning, reinforcement learning, etc.

649 citations

Journal ArticleDOI
TL;DR: This literature review can serve as a good reference for researchers in the areas of scene text detection and recognition and identify state-of-the-art algorithms, and predict potential research directions in the future.
Abstract: Text, as one of the most influential inventions of humanity, has played an important role in human life, so far from ancient times. The rich and precise information embodied in text is very useful in a wide range of vision-based applications, therefore text detection and recognition in natural scenes have become important and active research topics in computer vision and document analysis. Especially in recent years, the community has seen a surge of research efforts and substantial progresses in these fields, though a variety of challenges (e.g. noise, blur, distortion, occlusion and variation) still remain. The purposes of this survey are three-fold: 1) introduce up-to-date works, 2) identify state-of-the-art algorithms, and 3) predict potential research directions in the future. Moreover, this paper provides comprehensive links to publicly available resources, including benchmark datasets, source codes, and online demos. In summary, this literature review can serve as a good reference for researchers in the areas of scene text detection and recognition.

369 citations

Journal ArticleDOI
TL;DR: An improved ARIMA based prediction method to forecast the spatial-temporal variation of passengers in a hotspot and the application of the prediction approach to help drivers find their next passengers is proposed.
Abstract: This paper investigates human mobility patterns in an urban taxi transportation system. This work focuses on predicting humanmobility fromdiscovering patterns of in the number of passenger pick-ups quantity (PUQ) from urban hotspots. This paper proposes an improved ARIMA based prediction method to forecast the spatial-temporal variation of passengers in a hotspot. Evaluation with a large-scale realworld data set of 4 000 taxis' GPS traces over one year shows a prediction error of only 5.8%. We also explore the application of the prediction approach to help drivers find their next passengers. The simulation results using historical real-world data demonstrate that, with our guidance, drivers can reduce the time taken and distance travelled, to find their next passenger, by 37.1% and 6.4%, respectively.

278 citations

Journal ArticleDOI
TL;DR: This paper reviews big data challenges from a data management respective, and discusses big data diversity, big data reduction,big data integration and cleaning,Big data indexing and query, and finally big data analysis and mining.
Abstract: There is a trend that, virtually everyone, ranging from big Web companies to traditional enterprisers to physical science researchers to social scientists, is either already experiencing or anticipating unprecedented growth in the amount of data available in their world, as well as new opportunities and great untapped value. This paper reviews big data challenges from a data management respective. In particular, we discuss big data diversity, big data reduction, big data integration and cleaning, big data indexing and query, and finally big data analysis and mining. Our survey gives a brief overview about big-data-oriented research and problems.

278 citations

Journal ArticleDOI
TL;DR: This paper aims to review the state of the art of binary relevance from three perspectives, and some of the recent studies on binary relevance aimed at issues other than label correlation exploitation are introduced.
Abstract: Multi-label learning deals with problems where each example is represented by a single instance while being associated with multiple class labels simultaneously. Binary relevance is arguably the most intuitive solution for learning from multi-label examples. It works by decomposing the multi-label learning task into a number of independent binary learning tasks (one per class label). In view of its potential weakness in ignoring correlations between labels, many correlation-enabling extensions to binary relevance have been proposed in the past decade. In this paper, we aim to review the state of the art of binary relevance from three perspectives. First, basic settings for multi-label learning and binary relevance solutions are briefly summarized. Second, representative strategies to provide binary relevancewith label correlation exploitation abilities are discussed. Third, some of our recent studies on binary relevance aimed at issues other than label correlation exploitation are introduced. As a conclusion, we provide suggestions on future research directions.

257 citations

Network Information
Related Journals (5)
IEEE Transactions on Knowledge and Data Engineering
5.2K papers, 365.2K citations
84% related
Knowledge Based Systems
7.3K papers, 249.5K citations
84% related
Neurocomputing
18.1K papers, 518.9K citations
82% related
Future Generation Computer Systems
6.6K papers, 246.8K citations
82% related
IEEE Transactions on Parallel and Distributed Systems
5.2K papers, 237.8K citations
80% related
Performance
Metrics
No. of papers from the Journal in previous years
YearPapers
202257
2021208
2020157
2019109
201889
201780