scispace - formally typeset
Search or ask a question
Author

M Rajesh Kumar

Bio: M Rajesh Kumar is an academic researcher from VIT University. The author has contributed to research in topics: Feature extraction & Statistical classification. The author has an hindex of 4, co-authored 26 publications receiving 61 citations.

Papers
More filters
Proceedings ArticleDOI
23 Jun 2017
TL;DR: This paper performed sentiment analysis on speaker discriminated speech transcripts to detect the emotions of individual speakers involved in the conversation, and analyzed different techniques to perform speaker discrimination and sentiment analysis to find efficient algorithms to perform this task.
Abstract: Sentiment analysis has evolved over past few decades, most of the work in it revolved around textual sentiment analysis with text mining techniques. But audio sentiment analysis is still in a nascent stage in the research community. In this proposed research, we perform sentiment analysis on speaker discriminated speech transcripts to detect the emotions of the individual speakers involved in the conversation. We analyzed different techniques to perform speaker discrimination and sentiment analysis to find efficient algorithms to perform this task.

27 citations

Proceedings ArticleDOI
08 Apr 2021
TL;DR: In this article, a novel deep learning convolutional neural network (CNN) architecture was introduced to identify Anthracnose disease of mango, which is the most commonly occurring fungal disease that is infecting mango trees in India.
Abstract: Mango is the fruit of high economic and ecological importance in India, as it exports mangoes in large quantities. More than 1500 mango species are cultivated in India and more than 1000 of them are commercial varieties. Mangoes are highly affected by number of diseases, which hamper its appearance, taste and has huge impact on the economy. Amongst number of diseases, Anthracnose is the most commonly occurring fungal disease that is infecting mango trees in India. It is necessary to have an easy and appropriate method to diagnose this highly infectious fungal disease Anthracnose. It would be easier of mango cultivators to identify this disease beforehand and apply proper medication. This will help in preserving its quality and improving the production. Deep learning technologies, computer vision have dragged lot of research attention over past few years due to its high computation and accuracy to classify variety of fungal and bacterial diseases affecting Mango trees. This paper introduces a novel deep learning convolutional neural network (CNN) architecture to identify Anthracnose disease of mango. A real-time dataset captured in farms of Karnataka, Maharashtra and New Delhi is used for validation. It comprises of the images of mango tree leaves by including both healthy and diseased category. In comparison with other state-of-the-art approaches, the proposed algorithm gives higher classification accuracy of about 96.16%.

24 citations

Proceedings ArticleDOI
01 Jun 2017
TL;DR: In this paper, the authors explore various methods available in each block in the process of speaker recognition with the objective to identify best of techniques that could be used to get precise results.
Abstract: The most pressing challenge in the field of voice biometrics is selecting the most efficient technique of speaker recognition. Every individual's voice is peculiar, factors like physical differences in vocal organs, accent and pronunciation contributes to the problem's complexity. In this paper, we explore the various methods available in each block in the process of speaker recognition with the objective to identify best of techniques that could be used to get precise results. We study the results on text independent corpora. We use MFCC (Mel-frequency cepstral coefficient), LPCC (linear predictive cepstral coefficient) and PLP (perceptual linear prediction) algorithms for feature extraction, PCA (Principal Component Analysis) and t-SNE for dimensionality reduction and SVM (Support Vector Machine), feed forward, nearest neighbor and decision tree algorithms for classification block in speaker recognition system and comparatively analyze each block to determine the best technique.

11 citations

Book ChapterDOI
01 Jan 2019
TL;DR: A novel algorithm is proposed to tackle the mentioned issues through a unique edge detection algorithm which drastically increases the probability of tracing a vehicle over having manual database attached to each camera for identification purpose.
Abstract: Vehicles play a vital role in modern-day transportation systems. Number plate provides a standard means of identification for any vehicle. To serve this purpose, automatic license plate recognition system was developed. This consisted of four major steps: preprocessing of obtained image, extraction of license plate region, segmentation, and character recognition. In earlier research, direct application of Sobel edge detection algorithm or applying threshold was used as key steps to extract the license plate region, which do not produce efficient results when captured image is subjected to high intensity of light. The use of morphological operations causes deformity in the characters during segmentation. We propose a novel algorithm to tackle the mentioned issues through a unique edge detection algorithm. It is also a tedious task to create and update the database of required vehicles frequently. This problem is solved by the use of ‘Internet of things’ where an online database can be created and updated from any module instantly. Also, through IoT, we connect all the cameras in a geographical area to one server to create a ‘universal eye’ which drastically increases the probability of tracing a vehicle over having manual database attached to each camera for identification purpose.

11 citations

Proceedings ArticleDOI
01 Dec 2006
TL;DR: In this paper, a simple and effective method for optimum generation dispatch to minimise the fuel cost, environmental cost and security requirement of power networks is proposed, which is based on the bi-criterion global optimisation and particle swarm optimisation (PSO) technique.
Abstract: The advancement in power systems has led to the development of generation dispatch (GD) that is difficult to solve by classical optimisation method. The proposed paper work is to evolve simple and effective method for optimum generation dispatch to minimise the fuel cost, environmental cost and security requirement of power networks. The approach is based on the bi-criterion global optimisation and particle swarm optimisation (PSO) technique. The proposed technique is tested on 3-area interconnected and longitudinal system. The effectiveness of the proposed optimisation is verified in simulation studies using MATLAB software. The PSO based approach has been extended to evaluate the trade-off curve between the fuel cost of power production and the environmental cost according to the bi-criterion objective function.

10 citations


Cited by
More filters
Proceedings ArticleDOI
22 Jul 2012
TL;DR: In this article, the impacts of various pollutants and price penalty factors such as Max-Max, MinMax, Average, and Common are considered in the multi-objective dispatch problem and the simulation results are provided for IEEE 30 bus system.
Abstract: Thermal power plants play a major role in power production. The impacts of the various pollutants such as sulphur dioxide (SO2), Nitrogen oxide (NOx) and carbon dioxide (CO2) affects the environmental issues. The fuel cost of the generator in an economic dispatch problem can be presented by any order polynomial. The literature review reported that most of the papers consider the single pollutant using the second order polynomial function. The paper formulates the dispatch problem using a cubic function for both fuel cost and emission values. The impacts of various pollutants and price penalty factors such as Max-Max, MinMax, Average, and Common are considered in the multi-objective dispatch problem. The dispatch problem is solved using Lagrange's method and the simulation results are provided for IEEE 30 bus system. It concludes that Min-Max Price penalty factor provides minimum fuel cost and minimum emission values in comparison to the other price penalty factors.

48 citations

Journal ArticleDOI
TL;DR: This work aims to present a survey of recent developments in analyzing the multimodal sentiments (involving text, audio, and video/image) which involve human–machine interaction and challenges involved in analyzing them.
Abstract: The analysis of sentiments is essential in identifying and classifying opinions regarding a source material that is, a product or service. The analysis of these sentiments finds a variety of applications like product reviews, opinion polls, movie reviews on YouTube, news video analysis, and health care applications including stress and depression analysis. The traditional approach of sentiment analysis which is based on text involves the collection of large textual data and different algorithms to extract the sentiment information from it. But multimodal sentimental analysis provides methods to carry out opinion analysis based on the combination of video, audio, and text which goes a way beyond the conventional text‐based sentimental analysis in understanding human behaviors. The remarkable increase in the use of social media provides a large collection of multimodal data that reflects the user's sentiment on certain aspects. This multimodal sentimental analysis approach helps in classifying the polarity (positive, negative, and neutral) of the individual sentiments. Our work aims to present a survey of recent developments in analyzing the multimodal sentiments (involving text, audio, and video/image) which involve human–machine interaction and challenges involved in analyzing them. A detailed survey on sentimental dataset, feature extraction algorithms, data fusion methods, and efficiency of different classification techniques are presented in this work.

47 citations

Journal ArticleDOI
TL;DR: A novel and comprehensive framework for multimodal sentiment analysis in conversations is proposed, called a quantum-like multi-modal network (QMN), which leverages the mathematical formalism of quantum theory (QT) and a long short-term memory (LSTM) network.

46 citations

Journal ArticleDOI
TL;DR: A new conversational dataset is presented, named ScenarioSA, and an interactive long short-term memory network is proposed for conversational sentiment analysis to model interactions between speakers in a conversation, which outperforms a wide range of strong baselines and achieves competitive results with the state-of-art approaches.

42 citations

Journal ArticleDOI
TL;DR: Two types of neural networks, known as deep neural networks in its expansion form, a convolutional neural network (CNN) and an auto-encoder, are implemented and by using a new combination of CNN layers one can obtain improved results in classifying Farsi digits.
Abstract: Handwriting recognition remains a challenge in the machine vision field, especially in optical character recognition (OCR). The OCR has various applications such as the detection of handwritten Farsi digits and the diagnosis of biomedical science. In expanding and improving quality of the subject, this research focus on the recognition of Farsi Handwriting Digits and illustration applications in biomedical science. The detection of handwritten Farsi digits is being widely used in most contexts involving the collection of generic digital numerical information, such as reading checks or digits of postcodes. Selecting an appropriate classifier has become an issue highlighted in the recognition of handwritten digits. The paper aims at identifying handwritten Farsi digits written with different handwritten styles. Digits are classified using several traditional methods, including K-nearest neighbor, artificial neural network (ANN), and support vector machine (SVM) classifiers. New features of digits, namely, geometric and correlation-based features, have demonstrated to achieve better recognition performance. A noble class of methods, known as deep neural networks (DNNs), is also used to identify handwritten digits through machine vision. Here, two types of introduce its expansion form, a convolutional neural network (CNN) and an auto-encoder, are implemented. Moreover, by using a new combination of CNN layers one can obtain improved results in classifying Farsi digits. The performances of the DNN-based and traditional classifiers are compared to investigate the improvements in accuracy and calculation time. The SVM shows the best results among the traditional classifiers, whereas the CNN achieves the best results among the investigated techniques. The ANN offers better execution time than the SVM, but its accuracy is lower. The best accuracy among the traditional classifiers based on all investigated features is 99.3% accuracy obtained by the SVM, and the CNN achieves the best overall accuracy of 99.45%.

30 citations