scispace - formally typeset
Open AccessJournal ArticleDOI

Weka: A Tool for Data preprocessing, Classification, Ensemble, Clustering and Association Rule Mining

Shweta Srivastava
- 14 Feb 2014 - 
- Vol. 88, Iss: 10, pp 26-29
Reads0
Chats0
TLDR
The fundamentals of data mining steps like preprocessing the data, feature selection (to select the relevant features and removing the irrelevant and redundant features), classification and evaluation of different classifier models using WEKA tool are given.
Abstract
basic principle of data mining is to analyze the data from different perspectives, classify it and recapitulate it. Data mining has become very popular in each and every application. Though we have large amount of data but we don't have useful information in every field. There are many data mining tools and software to facilitate us the useful information. This paper gives the fundamentals of data mining steps like preprocessing the data (removing the noisy data, replacing the missing values etc.), feature selection (to select the relevant features and removing the irrelevant and redundant features), classification and evaluation of different classifier models using WEKA tool. The WEKA tool is not useful for only one type of application, though it can be used in various applications. This tool consists of various algorithms for feature selection, classification and clustering as well. Keywordsfeature selection, classification, clustering, evaluation of classifier models, evaluation of cluster models.

read more

Citations
More filters
Journal ArticleDOI

A Review of the Advancement in Intrusion Detection Datasets

TL;DR: The recent advancement in the IDS datasets that can be used by various research communities as the manifesto for using the new IDS dataset for developing efficient and effective ML and DM based IDS.
Journal ArticleDOI

Tele-connecting China's future urban growth to impacts on ecosystem services under the shared socioeconomic pathways.

TL;DR: This work proposes to integrate an urban growth simulation model with the multi-region input-output (MRIO) model, thereby illustrating how urban land consumption in one region can cause ecosystem services' degradation in another under five shared socioeconomic pathway (SSP) scenarios.
Journal ArticleDOI

Usage and analysis of Twitter during 2015 Chennai flood towards disaster management

TL;DR: How people of Chennai used social media especially twitter, in response to the country’s worst flood that had occurred recently was studied, and Random Forests is the best algorithm that can be relied on, during a disaster.
Journal ArticleDOI

Simulating urban growth boundaries using a patch-based cellular automaton with economic and ecological constraints

TL;DR: An innovative method that is capable of simulating UGB alternatives with economic and ecological constraints is developed and indicates that increasing the shares of low energy consumption industries and tertiary industries can effectively reduce urban land demand.
Proceedings ArticleDOI

Identification of ayurvedic medicinal plants by image processing of leaf samples

TL;DR: In this paper, the authors used feature vectors from both the front and back side of a green leaf along with morphological features to arrive at a unique optimum combination of features that maximizes the identification rate.
References
More filters
Proceedings ArticleDOI

WEKA: a machine learning workbench

TL;DR: WEKA is a workbench for machine learning that is intended to aid in the application of machine learning techniques to a variety of real-world problems, in particular, those arising from agricultural and horticultural domains.

Survey of Classification Techniques in Data Mining

Thair Nu Phyu
TL;DR: The goal of this survey is to provide a comprehensive review of different classification techniques in data mining, capable of processing a wider variety of data than regression and growing in popularity.
Journal ArticleDOI

Survey on Classification Techniques for Data Mining

TL;DR: The different classification methods and classifiers that can be used for classification of observations that are initially uncategorized are compared to demonstrate the different accuracies and usefulness of classifiers.
Journal ArticleDOI

Unsupervised Quick Reduct Algorithm Using Rough Set Theory

TL;DR: This paper proposes a new unsupervised quick reduct (QR) algorithm using rough set theory and the quality of the reduced data is measured by the classification performance and it is evaluated using WEKA classifier tool.
Journal ArticleDOI

Analysis of Parametric & Non Parametric Classifiers for Classification Technique using WEKA

TL;DR: This paper identifies parametric & non parametric classifiers that are used in classification process and provides tree representation of these classifiers.
Related Papers (5)