Author

Kiran Bhowmick

Dwarkadas J. Sanghvi College of Engineering

Bio: Kiran Bhowmick is an academic researcher at Dwarkadas J. Sanghvi College of Engineering whose research covers statistical classification and support vector machines. The author has an h-index of 6 and has co-authored 18 publications, which have received 236 citations.

Papers

Journal Article•DOI•

A Survey of Opinion Mining and Sentiment Analysis

Vishakha Patel, Gayatri Prabhu, Kiran Bhowmick

17 Dec 2015-International Journal of Computer Applications

TL;DR: This survey covers the problem of sentiment analysis and the techniques and methods used to address it; the major challenge lies in analyzing sentiments and identifying the emotions expressed in text.

Abstract: A huge amount of online information and rich web resources is highly unstructured, and such natural language cannot be processed directly by machines. The increased demand to capture the opinions of the general public about social events, campaigns, and product sales has led to the study of opinion mining and sentiment analysis. Opinion mining refers to the extraction of lines in raw data that express an opinion. Sentiment analysis identifies the polarity of the extracted opinions. The major challenge lies in analyzing the sentiments and identifying the emotions expressed in texts. This paper presents a survey that covers the problem of sentiment analysis and the techniques and methods used to address it.

170 citations

Journal Article•DOI•

Virus Detection using Artificial Neural Networks

Shivani Shah, Himali Jani, Sathvik Shetty, Kiran Bhowmick

18 Dec 2013-International Journal of Computer Applications

TL;DR: A new technique is proposed for identifying virus-infected files: features are selected with Fisher Score and applied as input to a neural network.

Abstract: A virus is defined as a program that spreads or replicates by copying itself, and generally has malicious effects. The antivirus systems used today mainly detect malware on the basis of known virus patterns, making detection of a new virus very difficult. This deficiency can be overcome by training an artificial neural network on inputs from the Portable Executable (PE) structure of executable files: because such a network learns from the training data, it will be able to identify unknown virus patterns. The PE structure contains various fields by which one can distinguish virus-infected executable files from legitimate ones without executing them, and Fisher Score can be used to select the most relevant features (fields) to speed up the analysis. A new technique is proposed for identifying virus-infected files by selecting features with Fisher Score and applying them as input to the neural network. General Terms: Virus, Program, Patterns, Executable files
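
The feature-selection step mentioned in this abstract can be illustrated with a minimal sketch. It assumes the common textbook definition of the Fisher score (between-class scatter divided by within-class scatter, computed per feature); the abstract does not state which variant the authors used, and the function names `fisher_scores` and `top_k_features` are hypothetical. The neural network itself is not shown.

```python
from statistics import fmean


def fisher_scores(rows, labels):
    """Return one Fisher score per feature column (assumed textbook form).

    score_j = sum_c n_c * (mean_cj - mean_j)^2 / sum_c n_c * var_cj
    """
    classes = sorted(set(labels))
    scores = []
    for j in range(len(rows[0])):
        column = [row[j] for row in rows]
        overall_mean = fmean(column)
        between = 0.0  # scatter of class means around the overall mean
        within = 0.0   # scatter of samples around their own class mean
        for c in classes:
            values = [v for v, y in zip(column, labels) if y == c]
            class_mean = fmean(values)
            class_var = fmean([(v - class_mean) ** 2 for v in values])
            between += len(values) * (class_mean - overall_mean) ** 2
            within += len(values) * class_var
        if within > 0:
            scores.append(between / within)
        else:
            # No spread inside any class: perfect separator or constant feature.
            scores.append(float("inf") if between > 0 else 0.0)
    return scores


def top_k_features(rows, labels, k):
    """Indices of the k highest-scoring features, best first."""
    scores = fisher_scores(rows, labels)
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return ranked[:k]
```

In a pipeline like the one described, each row would hold numeric PE-header fields for one file, and only the columns returned by `top_k_features` would be passed on to the classifier.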

18 citations

Book Chapter•DOI•

Trajectory Outlier Detection for Traffic Events: A Survey

Kiran Bhowmick¹, Meera Narvekar¹•Institutions (1)

Dwarkadas J. Sanghvi College of Engineering¹

01 Jan 2018

TL;DR: A potential use of outlier detection to identify irregular events that cause traffic congestion is proposed and a future research direction is discussed.

Abstract: With the advent of the Global Positioning System (GPS) and the extensive use of smartphones, trajectory data for moving objects is easily available at a lower cost. Moreover, GPS devices in vehicles now make it possible to keep track of moving vehicles on the road. It is also possible to identify anomalous vehicle behavior with this trajectory data. In the field of trajectory mining, outlier detection has become one of the important topics and can be used to detect anomalies in trajectories. In this paper, certain existing issues and challenges of trajectory data are identified and a future research direction is discussed. This paper proposes a potential use of outlier detection to identify irregular events that cause traffic congestion.

13 citations

Proceedings Article•DOI•

To improve classification of imbalanced datasets

Pratyusha Shukla¹, Kiran Bhowmick¹•Institutions (1)

Dwarkadas J. Sanghvi College of Engineering¹

17 Mar 2017

TL;DR: Results show that K-means helps in balancing the data and that the accuracy and classification time for the balanced dataset are much better than when simply classifying the imbalanced dataset.

Abstract: In data mining, classification is the task of accurately predicting the target class for each case in the data. Classifying a balanced dataset is fairly simple, but it becomes difficult when the data is not balanced. The class imbalance problem arises in machine learning when the number of examples of one class (positive) is far smaller than the number of examples of another class (negative). In this paper, we use the K-means algorithm to balance the imbalanced dataset and then use an SVM to classify the balanced dataset. We compare the accuracy, precision, recall, and time taken in classifying balanced and imbalanced datasets; the results show that K-means helps in balancing the data and that the accuracy and classification time for the balanced dataset are much better than when simply classifying the imbalanced dataset.
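
The abstract does not say how K-means is used to balance the data. One common reading, assumed here purely for illustration, is cluster-based undersampling: the majority class is clustered into as many groups as there are minority examples, and the cluster centroids stand in for the original majority points. The sketch below uses a plain Lloyd's-algorithm K-means seeded with the first k points; the names `kmeans` and `balance_by_clustering` are hypothetical, and the SVM training step is left out.

```python
from math import dist
from statistics import fmean


def kmeans(points, k, iterations=20):
    """Plain Lloyd's algorithm; the first k points seed the centroids."""
    centroids = [list(p) for p in points[:k]]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            # Assign each point to its nearest centroid (ties go to the lowest index).
            nearest = min(range(k), key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        for i, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster empties out
                centroids[i] = [fmean(col) for col in zip(*members)]
    return centroids


def balance_by_clustering(majority, minority):
    """Shrink the majority class to len(minority) centroids (assumed strategy).

    Returns (features, labels) with label 0 for majority centroids
    and label 1 for the untouched minority examples.
    """
    reduced = kmeans(majority, k=len(minority))
    features = reduced + [list(p) for p in minority]
    labels = [0] * len(reduced) + [1] * len(minority)
    return features, labels
```

The balanced `features` and `labels` could then be handed to any SVM implementation. Fixed seeding keeps the sketch deterministic; a practical version would more likely use randomized or k-means++ initialization.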

10 citations

Journal Article•DOI•

Parallel Text Mining in Multicore Systems Using FP-tree Algorithm

Krishna Gadia, Kiran Bhowmick

01 Jan 2015-Procedia Computer Science

TL;DR: This paper attempts to parallelize the FP-Growth algorithm on multicore machines by partitioning the large database across the available cores and using the combined strength of all cores to achieve maximum performance.

9 citations

Cited by

Journal Article•DOI•

Text Classification Algorithms: A Survey

Kamran Kowsari¹, Kiana Jafari Meimandi¹, Mojtaba Heidarysafa¹, Sanjana Mendu¹, Laura E. Barnes¹, Donald E. Brown¹•Institutions (1)

University of Virginia¹

17 Apr 2019-arXiv: Learning

TL;DR: An overview of text classification algorithms is presented, covering different text feature extraction methods, dimensionality reduction methods, existing algorithms and techniques, and evaluation methods.

Abstract: In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine learning approaches have achieved surpassing results in natural language processing. The success of these learning algorithms relies on their capacity to understand complex models and non-linear relationships within data. However, finding suitable structures, architectures, and techniques for text classification is a challenge for researchers. In this paper, a brief overview of text classification algorithms is presented. This overview covers different text feature extraction methods, dimensionality reduction methods, existing algorithms and techniques, and evaluation methods. Finally, the limitations of each technique and their application to real-world problems are discussed.

612 citations

Posted Content•

Latent Dirichlet Allocation (LDA) and Topic modeling: models, applications, a survey

Hamed Jelodar¹, Yongli Wang¹, Chi Yuan¹, Xia Feng¹, Xiahui Jiang¹, Yanchao Li¹, Liang Zhao¹•Institutions (1)

Nanjing University of Science and Technology¹

12 Nov 2017-arXiv: Information Retrieval

TL;DR: In this article, the authors investigated the research development, current trends and intellectual structure of topic modeling based on Latent Dirichlet Allocation (LDA), and summarized challenges and introduced famous tools and datasets in topic modelling based on LDA.

Abstract: Topic modeling is one of the most powerful techniques in text mining for data mining, latent data discovery, and finding relationships among data and text documents. Researchers have published many articles in the field of topic modeling and applied it in various fields such as software engineering, political science, medical and linguistic science, etc. There are various methods for topic modeling, of which Latent Dirichlet Allocation (LDA) is one of the most popular. Researchers have proposed various models based on LDA in topic modeling. Building on previous work, this paper can be very useful and valuable for introducing LDA approaches in topic modeling. In this paper, we investigated scholarly articles (from 2003 to 2016) highly related to topic modeling based on LDA to discover the research development, current trends, and intellectual structure of topic modeling. We also summarize challenges and introduce famous tools and datasets in topic modeling based on LDA.

546 citations

Journal Article•DOI•

Towards Sustainable Energy: A Systematic Review of Renewable Energy Sources, Technologies, and Public Opinions

Atika Qazi¹, Fayaz Hussain¹, Nasrudin Abd Rahim¹, Glenn Hardaker², Daniyal M. Alghazzawi³, Khaled Bashir Shaban⁴, Khalid Haruna⁵•Institutions (5)

University of Malaya¹, Universiti Brunei Darussalam², King Abdulaziz University³, Qatar University⁴, Bayero University Kano⁵

23 May 2019-IEEE Access

TL;DR: The results of this study show that worldwide energy crises can be managed by integrating renewable energy sources in the power generation and the lack of public awareness is a major barrier to the acceptance of renewable energy technologies.

Abstract: The use of renewable energy resources, such as solar, wind, and biomass will not diminish their availability. Sunlight being a constant source of energy is used to meet the ever-increasing energy need. This review discusses the world's energy needs, renewable energy technologies for domestic use, and highlights public opinions on renewable energy. A systematic review of the literature was conducted from 2009 to 2018. During this process, more than 300 articles were classified and 42 papers were filtered for critical review. The literature analysis showed that despite serious efforts at all levels to reduce reliance on fossil fuels by promoting renewable energy as its alternative, fossil fuels continue to contribute 73.5% to the worldwide electricity production in 2017. Conversely, renewable sources contributed only 26.5%. Furthermore, this study highlights that the lack of public awareness is a major barrier to the acceptance of renewable energy technologies. The results of this study show that worldwide energy crises can be managed by integrating renewable energy sources in the power generation. Moreover, in order to facilitate the development of renewable energy technologies, this systematic review has highlighted the importance of public opinion and performed a real-time analysis of public tweets. This example of tweet analysis is a relatively novel initiative in a review study that will seek to direct the attention of future researchers and policymakers toward public opinion and recommend the implications to both academia and industries.

426 citations

Proceedings Article•DOI•

Unsupervised sentiment analysis with emotional signals

Xia Hu¹, Jiliang Tang¹, Huiji Gao¹, Huan Liu¹•Institutions (1)

Arizona State University¹

13 May 2013

TL;DR: This work investigates whether the signals in social media can potentially help sentiment analysis by providing a unified way to model two main categories of emotional signals, i.e., emotion indication and emotion correlation and incorporates the signals into an unsupervised learning framework for sentiment analysis.

Abstract: The explosion of social media services presents a great opportunity to understand the sentiment of the public via analyzing its large-scale and opinion-rich data. In social media, it is easy to amass vast quantities of unlabeled data, but very costly to obtain sentiment labels, which makes unsupervised sentiment analysis essential for various applications. It is challenging for traditional lexicon-based unsupervised methods due to the fact that expressions in social media are unstructured, informal, and fast-evolving. Emoticons and product ratings are examples of emotional signals that are associated with sentiments expressed in posts or words. Inspired by the wide availability of emotional signals in social media, we propose to study the problem of unsupervised sentiment analysis with emotional signals. In particular, we investigate whether the signals can potentially help sentiment analysis by providing a unified way to model two main categories of emotional signals, i.e., emotion indication and emotion correlation. We further incorporate the signals into an unsupervised learning framework for sentiment analysis. In the experiment, we compare the proposed framework with the state-of-the-art methods on two Twitter datasets and empirically evaluate our proposed framework to gain a deep understanding of the effects of emotional signals.

374 citations

Journal Article•DOI•

A survey of multimodal sentiment analysis

Mohammad Soleymani¹, David Garcia², Brendan Jou³, Björn Schuller¹,⁴,⁵, Shih-Fu Chang³, Maja Pantic⁴,⁶•Institutions (6)

University of Geneva¹, ETH Zurich², Columbia University³, Imperial College London⁴, University of Passau⁵, University of Twente⁶

01 Sep 2017-Image and Vision Computing

TL;DR: The thesis is that multimodal sentiment analysis holds a significant untapped potential with the arrival of complementary data streams for improving and going beyond text-based sentiment analysis.

357 citations
