Journal ArticleDOI
Determination of sample size using power analysis and optimum bin size of histogram features
Reads0
Chats0
TLDR
This paper provides a mathematical study to choose the bin size and the minimum sample size to train the classifier using power analysis with statistical stability and the results are compared with that of entropy based algorithm (J48) for determiningminimum sample size and bin size.Abstract:
Vibration signals are used in fault diagnosis of rotary machines as a source of information. Lots of work have been reported on identification of faults in roller bearing by using many techniques. Of late, application of machine learning approach in fault diagnosis is gaining momentum. Machine learning approach consists of chain of activities like, data acquisition, feature extraction, feature selection and feature classification. While histogram features are used, there are still a few questions to be answered such as how many histogram bins are to be used to extract features and how many samples to be used to train the classifier. This paper provides a mathematical study to choose the bin size and the minimum sample size to train the classifier using power analysis with statistical stability. A typical bearing fault diagnosis problem is taken as a case for illustration and the results are compared with that of entropy based algorithm (J48) for determining minimum sample size and bin size.read more
Citations
More filters
Journal ArticleDOI
A confidence-prioritisation approach for learning noisy data
TL;DR: This work proposes a methodological framework for assigning confidence to individual data records and augmenting training with that information, and results indicate that applying and utilising confidence in training improves performance.
References
More filters
Journal ArticleDOI
Exact Sample Sizes for Use with the Fisher-Irwin Test for 2 x 2 Tables
TL;DR: Gail and Gart as discussed by the authors presented tables showing the required sample size, n, for the Fisher-Irwin exact conditional test for 2 X 2 tables to achieve a power of at least 0.50, 0.80 and 0.90 against one-sided alternatives at nominal.05 and.01 significance levels.
Journal ArticleDOI
On the sample size for one-sided equivalence of sensitivities based upon McNemar's test.
Ying Lu,Judy A. Bean +1 more
TL;DR: Results of a Monte Carlo simulation study suggest that the midpoint conditional sample size is the best choice to obtain the desired power for the type of equivalence studies discussed in the paper.
Journal ArticleDOI
A comparison of sample size methods for the logrank statistic.
Edward Lakatos,K. K. Gordon Lan +1 more
TL;DR: Several methods are available for sample size calculation for clinical trials when survival curves are to be compared using the logrank statistic, and simulation results under exponential, proportional hazards and non-proportional hazard situations are presented.
Journal ArticleDOI
On sample-size and power calculations for studies using confidence intervals
TL;DR: How expected confidence intervals, if not properly centered, can be misleading indicators of the discriminatory power of a study and be designed so that the confidence interval has a high probability of not containing at least one plausible but incorrect parameter value.
Journal ArticleDOI
Computing Sample Size for Receiver Operating Characteristic Studies
TL;DR: The method of Hanley and McNeil can lead to underestimation of the minimum sample size, and an alternative method of computing the standard error based on a binormal distribution provides more appropriate estimates of sample size.