Weka: A Tool for Data preprocessing, Classification, Ensemble, Clustering and Association Rule Mining
Reads0
Chats0
TLDR
The fundamentals of data mining steps like preprocessing the data, feature selection (to select the relevant features and removing the irrelevant and redundant features), classification and evaluation of different classifier models using WEKA tool are given.Abstract:
basic principle of data mining is to analyze the data from different perspectives, classify it and recapitulate it. Data mining has become very popular in each and every application. Though we have large amount of data but we don't have useful information in every field. There are many data mining tools and software to facilitate us the useful information. This paper gives the fundamentals of data mining steps like preprocessing the data (removing the noisy data, replacing the missing values etc.), feature selection (to select the relevant features and removing the irrelevant and redundant features), classification and evaluation of different classifier models using WEKA tool. The WEKA tool is not useful for only one type of application, though it can be used in various applications. This tool consists of various algorithms for feature selection, classification and clustering as well. Keywordsfeature selection, classification, clustering, evaluation of classifier models, evaluation of cluster models.read more
Citations
More filters
Journal ArticleDOI
A Review of the Advancement in Intrusion Detection Datasets
Ankit Thakkar,Ritika Lohiya +1 more
TL;DR: The recent advancement in the IDS datasets that can be used by various research communities as the manifesto for using the new IDS dataset for developing efficient and effective ML and DM based IDS.
Journal ArticleDOI
Tele-connecting China's future urban growth to impacts on ecosystem services under the shared socioeconomic pathways.
TL;DR: This work proposes to integrate an urban growth simulation model with the multi-region input-output (MRIO) model, thereby illustrating how urban land consumption in one region can cause ecosystem services' degradation in another under five shared socioeconomic pathway (SSP) scenarios.
Journal ArticleDOI
Usage and analysis of Twitter during 2015 Chennai flood towards disaster management
TL;DR: How people of Chennai used social media especially twitter, in response to the country’s worst flood that had occurred recently was studied, and Random Forests is the best algorithm that can be relied on, during a disaster.
Journal ArticleDOI
Simulating urban growth boundaries using a patch-based cellular automaton with economic and ecological constraints
TL;DR: An innovative method that is capable of simulating UGB alternatives with economic and ecological constraints is developed and indicates that increasing the shares of low energy consumption industries and tertiary industries can effectively reduce urban land demand.
Proceedings ArticleDOI
Identification of ayurvedic medicinal plants by image processing of leaf samples
TL;DR: In this paper, the authors used feature vectors from both the front and back side of a green leaf along with morphological features to arrive at a unique optimum combination of features that maximizes the identification rate.
References
More filters
Proceedings ArticleDOI
WEKA: a machine learning workbench
TL;DR: WEKA is a workbench for machine learning that is intended to aid in the application of machine learning techniques to a variety of real-world problems, in particular, those arising from agricultural and horticultural domains.
Survey of Classification Techniques in Data Mining
TL;DR: The goal of this survey is to provide a comprehensive review of different classification techniques in data mining, capable of processing a wider variety of data than regression and growing in popularity.
Journal ArticleDOI
Survey on Classification Techniques for Data Mining
TL;DR: The different classification methods and classifiers that can be used for classification of observations that are initially uncategorized are compared to demonstrate the different accuracies and usefulness of classifiers.
Journal ArticleDOI
Unsupervised Quick Reduct Algorithm Using Rough Set Theory
C. Velayutham,K. Thangavel +1 more
TL;DR: This paper proposes a new unsupervised quick reduct (QR) algorithm using rough set theory and the quality of the reduced data is measured by the classification performance and it is evaluated using WEKA classifier tool.
Journal ArticleDOI
Analysis of Parametric & Non Parametric Classifiers for Classification Technique using WEKA
Yugal Kumar,Gadadhar Sahoo +1 more
TL;DR: This paper identifies parametric & non parametric classifiers that are used in classification process and provides tree representation of these classifiers.