scispace - formally typeset
Journal ArticleDOI

Determination of sample size using power analysis and optimum bin size of histogram features

Reads0
Chats0
TLDR
This paper provides a mathematical study to choose the bin size and the minimum sample size to train the classifier using power analysis with statistical stability and the results are compared with that of entropy based algorithm (J48) for determiningminimum sample size and bin size.
Abstract
Vibration signals are used in fault diagnosis of rotary machines as a source of information. Lots of work have been reported on identification of faults in roller bearing by using many techniques. Of late, application of machine learning approach in fault diagnosis is gaining momentum. Machine learning approach consists of chain of activities like, data acquisition, feature extraction, feature selection and feature classification. While histogram features are used, there are still a few questions to be answered such as how many histogram bins are to be used to extract features and how many samples to be used to train the classifier. This paper provides a mathematical study to choose the bin size and the minimum sample size to train the classifier using power analysis with statistical stability. A typical bearing fault diagnosis problem is taken as a case for illustration and the results are compared with that of entropy based algorithm (J48) for determining minimum sample size and bin size.

read more

Citations
More filters
Journal ArticleDOI

A confidence-prioritisation approach for learning noisy data

TL;DR: This work proposes a methodological framework for assigning confidence to individual data records and augmenting training with that information, and results indicate that applying and utilising confidence in training improves performance.
References
More filters
Journal ArticleDOI

Exact Sample Sizes for Use with the Fisher-Irwin Test for 2 x 2 Tables

J. K. Haseman
- 01 Mar 1978 - 
TL;DR: Gail and Gart as discussed by the authors presented tables showing the required sample size, n, for the Fisher-Irwin exact conditional test for 2 X 2 tables to achieve a power of at least 0.50, 0.80 and 0.90 against one-sided alternatives at nominal.05 and.01 significance levels.
Journal ArticleDOI

On the sample size for one-sided equivalence of sensitivities based upon McNemar's test.

TL;DR: Results of a Monte Carlo simulation study suggest that the midpoint conditional sample size is the best choice to obtain the desired power for the type of equivalence studies discussed in the paper.
Journal ArticleDOI

A comparison of sample size methods for the logrank statistic.

TL;DR: Several methods are available for sample size calculation for clinical trials when survival curves are to be compared using the logrank statistic, and simulation results under exponential, proportional hazards and non-proportional hazard situations are presented.
Journal ArticleDOI

On sample-size and power calculations for studies using confidence intervals

TL;DR: How expected confidence intervals, if not properly centered, can be misleading indicators of the discriminatory power of a study and be designed so that the confidence interval has a high probability of not containing at least one plausible but incorrect parameter value.
Journal ArticleDOI

Computing Sample Size for Receiver Operating Characteristic Studies

TL;DR: The method of Hanley and McNeil can lead to underestimation of the minimum sample size, and an alternative method of computing the standard error based on a binormal distribution provides more appropriate estimates of sample size.
Related Papers (5)