scispace - formally typeset
Search or ask a question

Showing papers on "Linear discriminant analysis published in 2022"


Journal ArticleDOI
TL;DR: In this paper, a neighborhood linear discriminant analysis (nLDA) is proposed, in which the scatter matrices are defined on a neighborhood consisting of reverse nearest neighbors and the neighborhood can be naturally regarded as the smallest subclass.

64 citations


Journal ArticleDOI
TL;DR: In this article , a neighborhood linear discriminant analysis (nLDA) is proposed to address multimodality in LDA, in which the scatter matrices are defined on a neighborhood consisting of reverse nearest neighbors.

64 citations


Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper proposed a robust discriminant subspace (RDS) learning method for feature extraction, with an objective function formulated in a different form to guarantee the subspace to be robust and discriminative.
Abstract: Recently, there are many works on discriminant analysis, which promote the robustness of models against outliers by using L1- or L2,1-norm as the distance metric. However, both of their robustness and discriminant power are limited. In this article, we present a new robust discriminant subspace (RDS) learning method for feature extraction, with an objective function formulated in a different form. To guarantee the subspace to be robust and discriminative, we measure the within-class distances based on [Formula: see text]-norm and use [Formula: see text]-norm to measure the between-class distances. This also makes our method include rotational invariance. Since the proposed model involves both [Formula: see text]-norm maximization and [Formula: see text]-norm minimization, it is very challenging to solve. To address this problem, we present an efficient nongreedy iterative algorithm. Besides, motivated by trace ratio criterion, a mechanism of automatically balancing the contributions of different terms in our objective is found. RDS is very flexible, as it can be extended to other existing feature extraction techniques. An in-depth theoretical analysis of the algorithm's convergence is presented in this article. Experiments are conducted on several typical databases for image classification, and the promising results indicate the effectiveness of RDS.

44 citations


Journal ArticleDOI
TL;DR: In this article , a review of the recent works applying chemometric methods to plastic waste sorting is presented, which includes principal component analysis (PCA), linear discriminant analysis (LDA), partial least square (PLS), k-nearest neighbors (k-NN), support vector machines (SVM), random forests (RF), artificial neural networks (ANNs), convolutional neural network (CNNs), and K-means clustering.
Abstract: Mismanagement of plastic waste globally has resulted in a multitude of environmental issues, which could be tackled by boosting plastic recycling rates. Chemometrics has emerged as a useful tool for boosting plastic recycling rates by automating the plastic sorting and recycling process. This paper will comprehensively review the recent works applying chemometric methods to plastic waste sorting. The review begins by introducing spectroscopic methods and chemometric tools that are commonly used in the plastic chemometrics literature. The spectroscopic methods include near-infrared spectroscopy (NIR), mid-infrared spectroscopy (MIR), Raman spectroscopy and laser-induced breakdown spectroscopy (LIBS). The chemometric tools include principal component analysis (PCA), linear discriminant analysis (LDA), partial least square (PLS), k-nearest neighbors (k-NN), support vector machines (SVM), random forests (RF), artificial neural networks (ANNs), convolutional neural networks (CNNs) and K-means clustering. This review revealed four main findings. (1) The scope of plastic waste should be expanded in terms of types, contamination and degradation level to mirror the heterogeneous plastic waste received at recycling plants towards understanding potential application in the recycling industry. (2) The use of hybrid spectroscopic method could potentially overcome the limitations of each spectroscopic methods. (3) Develop an open-sourced standardized database of plastic waste spectra would help to further expand the field. (4) There is limited use of more novel machine learning tools such as deep learning for plastic sorting.

39 citations


Journal ArticleDOI
01 Jan 2022-Sensors
TL;DR: The main objective of the proposed research is to classify data and predict heart disease using medical data and medical images using the hybrid Faster R-CNN with SE-ResNeXt-101 transfer learning model.
Abstract: Internet of Things (IoT) technology has recently been applied in healthcare systems as an Internet of Medical Things (IoMT) to collect sensor information for the diagnosis and prognosis of heart disease. The main objective of the proposed research is to classify data and predict heart disease using medical data and medical images. The proposed model is a medical data classification and prediction model that operates in two stages. If the result from the first stage is efficient in predicting heart disease, there is no need for stage two. In the first stage, data gathered from medical sensors affixed to the patient’s body were classified; then, in stage two, echocardiogram image classification was performed for heart disease prediction. A hybrid linear discriminant analysis with the modified ant lion optimization (HLDA-MALO) technique was used for sensor data classification, while a hybrid Faster R-CNN with SE-ResNet-101 modelwass used for echocardiogram image classification. Both classification methods were carried out, and the classification findings were consolidated and validated to predict heart disease. The HLDA-MALO method obtained 96.85% accuracy in detecting normal sensor data, and 98.31% accuracy in detecting abnormal sensor data. The proposed hybrid Faster R-CNN with SE-ResNeXt-101 transfer learning model performed better in classifying echocardiogram images, with 98.06% precision, 98.95% recall, 96.32% specificity, a 99.02% F-score, and maximum accuracy of 99.15%.

31 citations


Journal ArticleDOI
TL;DR: In this paper , an enhanced text classification method using Bag-of-Words representation model with term frequency-inverse document frequency (tf-idf) and word embedding technique to find words with similar semantic meaning.
Abstract: One of the challenging tasks in text classification is to reduce the dimensional feature space. This paper discusses an enhanced text classification method using Bag-of-Words representation model with term frequency-inverse document frequency (tf-idf) and word embedding technique ‘GloVe’ to find words with similar semantic meaning. We select the word with the highest sum of tf-idf as the most representative word with similar meanings. The performance of the proposed method is compared with other methods such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Latent Semantic Indexing (LSI), a hybrid approach PCA+LDA using the Naïve Bayes classifier. Experimental results on three datasets, namely BBC, Classic4, and 20-newsgroup datasets, show that the proposed algorithm gives better classification results than existing dimension reduction techniques. Lastly, we defined a new performance evaluation metric to check the classifier's performance on the reduced features.

30 citations


Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper proposed a joint Bayesian framework based on partial least squares discriminant analysis (PLS-DA), which involves three major stages: robust feature description, discriminative feature mapping, and separative verification.
Abstract: Finger vein recognition has attracted considerable attention from the biometric identification technology community owing to its convenience and security. Unlike most previous works only pay attention to one part of finger vein recognition, we propose a joint Bayesian framework in this paper, which is based on partial least squares discriminant analysis (PLS-DA). It involves three major stages: 1) robust feature description, 2) discriminative feature mapping, and 3) separative verification. In stage 1), we extract line responses and orientation of finger veins using a bank of Gabor filter, and histograms are constructed in local patches as primitive features. Subsequently, in stage 2), a discriminant feature mapping based the PLS-DA (PLS-DA-FM) method is proposed to project these primitive features into low-dimensional forms in a supervised manner. Thus, highly compact and discriminative features are obtained in this stage. Finally, in stage 3), we directly build a Bayesian model based on the joint distribution of finger vein feature pairs to measure the similarity between the features. Extensive experiments on five finger vein datasets demonstrate the superior performance of the proposed method to most state-of-the-art finger vein recognition methods.

29 citations


Journal ArticleDOI
TL;DR: A Fisher’s discriminant ratio-based health indicator is proposed to fully consider the contributions of all spectrum amplitudes in the frequency domain to machine PDA and can directly realize data-level fusion, namely fusion of frequency amplitudes.

27 citations


Journal ArticleDOI
01 May 2022
TL;DR: The feasibility of machine learning techniques like DA in the field of TCM is confirmed and using Bayesian optimization algorithms to fine-tune the model is confirmed, making it industry ready.
Abstract: With the advent of Industry 4.0, which conceptualizes self-monitoring of rotating machine parts by adopting techniques like Artificial Intelligence (AI), Machine Learning (ML), Internet of Things (IoT), data analytics, cloud computing, etc. The significant research area in predictive maintenance is Tool Condition Monitoring (TCM) as the tool condition affects the overall machining process and its economics. Lately, machine learning techniques are being used to classify the tool's condition in operation. These techniques are cost-saving and help industries with adopting future-proof solutions for their operations. One such technique called Discriminant analysis (DA) must be examined particularly for TCM. Owing to its less expensive computation and shorter run times, using them in TCM will ensure effective use of the cutting tool and reduce maintenance times. This paper presents a Bayesian optimized discriminant analysis model to classify and monitor the tool condition into three user-defined classes. The data is collected using an in-house designed and developed Data Acquisition (DAQ) module set up on a Vertical Machining Center (VMC). The hyperparameter tuning has been incorporated using Bayesian optimization search, and the parameter which gives the best model was found out to be ‘Linear’, achieving an accuracy of 93.3%. This work confirms the feasibility of machine learning techniques like DA in the field of TCM and using Bayesian optimization algorithms to fine-tune the model, making it industry-ready.

25 citations


Journal ArticleDOI
TL;DR: In this paper , a comparative study of EEG-based multiclass motor imagery classifiers based on Kullback-Leiber regularised Riemann Mean and support vector machine, hybrid one versus one classifier, linear discriminant analysis, and convolutional neural network is presented.
Abstract: This paper presents a comparative study of EEG-based multiclass motor imagery classifiers based on Kullback-Leiber regularised Riemann Mean and support vector machine, hybrid one versus one classifier, linear discriminant analysis, and convolutional neural network. The paper is felt to be of inter- est to those researchers working in the motor imagery classification of EEG signals. The work presented in this paper helps to understand the basics of different multi-class motor imagery classifiers, their accuracy, and the number of channels involved.

23 citations



Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper used the spectra of reflectance (R), absorbance (A) and Kubelka-Munk (K-M) in Lingwu long jujubes to detect bruising.

Journal ArticleDOI
TL;DR: In this paper , the authors present a review of the use of a combination of instrumental and chemometric methods to determine the geographical origin of tea products, which helps to ensure authenticity and traceability.

Journal ArticleDOI
TL;DR: In this paper , the authors applied Raman spectroscopy coupled with multivariate analysis to develop a new and robust analytical method to comprehensively interrogate the spectral profiles of seven micro-plastic references and real microplastic samples post-exposure to environmental stresses.

Journal ArticleDOI
TL;DR: Li et al. as mentioned in this paper developed laser-induced breakdown spectroscopy (LIBS) technique for rapid and all-element analysis of milk powder, and the traditional machine learning methods and convolutional neural network (CNN) were adopted to realize the accurate identification of various adulterated milk powders.

Journal ArticleDOI
TL;DR: In this paper , the performance of adaptive signal decomposition methods and machine learning (ML) algorithms for binary and multiclass classification of imagined words using EEG signals has been evaluated and the results show that the employed algorithms achieve excellent accuracy for the available EEG datasets.

Journal ArticleDOI
15 Jan 2022-Talanta
TL;DR: In this article, a method based on Raman spectroscopy combined with generative adversarial network and multiclass support vector machine was proposed to classify foodborne pathogenic bacteria. But it is still a challenge to overcome the cumbersome culture process of bacteria and the need for a large number of samples, which hinder qualitative analysis, to obtain a high classification accuracy.

Journal ArticleDOI
TL;DR: Wang et al. as discussed by the authors proposed an efficient face recognition system based on Gabor filter bank and a deep learning method known as Sparse AutoEncoder (SAE), the main aim of the proposed system is to improve the features extracted by Gabor Filter Bank using SAE method.
Abstract: These days, face recognition systems are widely being employed in various daily applications such as smart phone unlocking, tracking school attendance, and secure online bank transactions, smarter border control, to name a few. In spite of the remarkable progress, face recognition systems still suffer from occlusions, light variations, camera types and their resolutions. Face recognition is still a dynamic research field. In this paper, we propose an efficient face recognition system based on Gabor filter bank and a deep learning method known as Sparse AutoEncoder (SAE). The main aim of the proposed system is to improve the features extracted by Gabor filter bank using SAE method. Then, these enhanced features are subjected to features reduction using principal component analysis and linear discriminant analysis (PCA + LDA) technique. Finally, the matching stage is accomplished via cosine Mahalanobis distance. Experiments on seven publicly available databases (i.e., JAFFE, AT&T, Yale, Georgia Tech, CASIA, Extended Yale, Essex) show that the proposed system can achieve promising results with the combination of Gabor and SAE, as well as outperform previously proposed methods.

Journal ArticleDOI
01 Mar 2022-Sensors
TL;DR: The enhanced performance of fNIRS-BCI can be achieved in terms of classification accuracy using DL approaches compared to conventional ML approaches and the control commands generated by these classifiers can be used to initiate and stop the gait cycle of the lower limb exoskeleton for gait rehabilitation.
Abstract: This research presents a brain-computer interface (BCI) framework for brain signal classification using deep learning (DL) and machine learning (ML) approaches on functional near-infrared spectroscopy (fNIRS) signals. fNIRS signals of motor execution for walking and rest tasks are acquired from the primary motor cortex in the brain’s left hemisphere for nine subjects. DL algorithms, including convolutional neural networks (CNNs), long short-term memory (LSTM), and bidirectional LSTM (Bi-LSTM) are used to achieve average classification accuracies of 88.50%, 84.24%, and 85.13%, respectively. For comparison purposes, three conventional ML algorithms, support vector machine (SVM), k-nearest neighbor (k-NN), and linear discriminant analysis (LDA) are also used for classification, resulting in average classification accuracies of 73.91%, 74.24%, and 65.85%, respectively. This study successfully demonstrates that the enhanced performance of fNIRS-BCI can be achieved in terms of classification accuracy using DL approaches compared to conventional ML approaches. Furthermore, the control commands generated by these classifiers can be used to initiate and stop the gait cycle of the lower limb exoskeleton for gait rehabilitation.

Journal ArticleDOI
15 Jan 2022-Talanta
TL;DR: In this article , a method based on Raman spectroscopy combined with generative adversarial network and multiclass support vector machine was proposed to classify foodborne pathogenic bacteria. But it is still a challenge to overcome the cumbersome culture process of bacteria and the need for a large number of samples, which hinder qualitative analysis, to obtain a high classification accuracy.

Journal ArticleDOI
01 Jan 2022-Talanta
TL;DR: In this paper, the application of an electronic nose and chemometric analysis to discriminate volatile organic compounds between patients with COVID-19, post-COVID syndrome and controls in exhaled breath samples was evaluated.

Journal ArticleDOI
TL;DR: In this paper , the authors compared the performance of NIR-HSI-based analytical methods to quantify the oil content and fatty acid in Brassica seeds and obtain the best prediction models based on interval selection for erucic acid, MUFAs and PUFAs.

Journal ArticleDOI
01 Jan 2022-Talanta
TL;DR: In this article , a paper microfluidic paper-based analytical device for detecting and discriminating 8 antibiotics was proposed, which is based on combination of paper micro-fluidics, sensor array concept (an array of metallochromic complexes, which provides an optical tongue, and chemometrics data analysis).

Journal ArticleDOI
TL;DR: In this paper, the authors investigated the aroma quality of 44 Dianhong black tea (DBT) infusions by rapid gas phase electronic nose (GC-E-Nose) combined with multivariate statistical analysis for the first time.
Abstract: Aroma is one of the most important quality attributes of black tea infusions. In this study, the aroma quality of 44 Dianhong black tea (DBT) infusions were investigated by rapid gas phase electronic nose (GC-E-Nose) combined with multivariate statistical analysis for the first time. Among 61 volatile compounds identified, the most abundant ones were aldehydes. Both partial least squares discriminant analysis (PLS-DA) and Fisher discriminant analysis (FDA) could effectively classify the aroma quality of DBT infusions. The prediction accuracy of FDA (95.2%) was higher than that of PLS-DA (78.6%). Moreover, stepwise multiple linear regression was shown to accurately predict the quality score of DBT infusions, and the Rp value of the sensory score of the predicted model was 0.94. Furan, methyl acetate, 2,3-pentanedione, limonene, and linalool were found positively correlated with the aroma quality of DBT infusions, while 3-ethylpentane, 1-pentene, (E)-2-hexene, and methyl eugenol were negatively correlated with the aroma quality of DBT infusions. These results contribute to understand that rapid GC-E-Nose technology is a supplement to the objective sensory evaluation, and provide a new technical method for the quality evaluation and control of tea infusions.

Journal ArticleDOI
22 Jul 2022-PLOS ONE
TL;DR: The findings show promise for employment of DL models on raw EEG signals in future MI-BCI systems, particularly for BCI inefficient users who are unable to produce desired sensorimotor patterns for conventional ML approaches.
Abstract: Motor Imagery Brain-Computer Interfaces (MI-BCIs) are AI-driven systems that capture brain activity patterns associated with mental imagination of movement and convert them into commands for external devices. Traditionally, MI-BCIs operate on Machine Learning (ML) algorithms, which require extensive signal processing and feature engineering to extract changes in sensorimotor rhythms (SMR). In recent years, Deep Learning (DL) models have gained popularity for EEG classification as they provide a solution for automatic extraction of spatio-temporal features in the signals. However, past BCI studies that employed DL models, only attempted them with a small group of participants, without investigating the effectiveness of this approach for different user groups such as inefficient users. BCI inefficiency is a known and unsolved problem within BCI literature, generally defined as the inability of the user to produce the desired SMR patterns for the BCI classifier. In this study, we evaluated the effectiveness of DL models in capturing MI features particularly in the inefficient users. EEG signals from 54 subjects who performed a MI task of left- or right-hand grasp were recorded to compare the performance of two classification approaches; a ML approach vs. a DL approach. In the ML approach, Common Spatial Patterns (CSP) was used for feature extraction and then Linear Discriminant Analysis (LDA) model was employed for binary classification of the MI task. In the DL approach, a Convolutional Neural Network (CNN) model was constructed on the raw EEG signals. Additionally, subjects were divided into high vs. low performers based on their online BCI accuracy and the difference between the two classifiers’ performance was compared between groups. Our results showed that the CNN model improved the classification accuracy for all subjects within the range of 2.37 to 28.28%, but more importantly, this improvement was significantly larger for low performers. Our findings show promise for employment of DL models on raw EEG signals in future MI-BCI systems, particularly for BCI inefficient users who are unable to produce desired sensorimotor patterns for conventional ML approaches.

Journal ArticleDOI
01 Jan 2022-Talanta
TL;DR: In this article , the application of an electronic nose and chemometric analysis to discriminate volatile organic compounds between patients with COVID-19, post-COVID syndrome and controls in exhaled breath samples was evaluated.

Journal ArticleDOI
TL;DR:
Abstract: There is growing attention toward closed biological genomes in the environment and in health. To explore and reveal the intergroup differences among different samples or environments, it is crucial to discover biomarkers with statistical differences among groups. The application of Linear discriminant analysis Effect Size (LEfSe) can help find good biomarkers. Based on the original genome data, quality control, and quantification of different sequences based on taxa or genes are carried out. First, the Kruskal-Wallis rank test was used to distinguish between specific differences among statistical and biological groups. Then, the Wilcoxon rank test was performed between the two groups obtained in the previous step to assess whether the differences were consistent. Finally, a linear discriminant analysis (LDA) was conducted to evaluate the influence of biomarkers on significantly different groups based on LDA scores. To sum up, LEfSe provided the convenience for identifying genomic biomarkers that characterize statistical differences among biological groups.

Journal ArticleDOI
TL;DR: A smart auxiliary framework based on machine learning (ML) is proposed; it can help medical practitioners in the identification of COVID-19-affected patients, among others with pneumonia and healthy individuals, and can help in monitoring the status of CO VID-19 cases using X-ray images.

Journal ArticleDOI
TL;DR: In this article , a Fisher's discriminant ratio-based health indicator is proposed to fully consider the contributions of all spectrum amplitudes in the frequency domain to machine performance degradation assessment (PDA).