Home
/
Authors
/
Asif Ekbal

Author

Asif Ekbal

Other affiliations: Jadavpur University, Indian Institutes of Technology, University of Trento ...read more

Bio: Asif Ekbal is an academic researcher from Indian Institute of Technology Patna. The author has contributed to research in topics: Computer science & Conditional random field. The author has an hindex of 35, co-authored 365 publications receiving 4579 citations. Previous affiliations of Asif Ekbal include Jadavpur University & Indian Institutes of Technology.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The CHEMDNER corpus of chemicals and drugs and its annotation principles.

[...]

Martin Krallinger, Obdulia Rabal¹, Florian Leitner², Miguel Vazquez, David Salgado, Zhiyong Lu³, Robert Leaman³, Yanan Lu⁴, Donghong Ji⁴, Daniel M. Lowe, Roger A. Sayle, Riza Theresa Batista-Navarro, Rafal Rak, Torsten Huber⁵, Tim Rocktäschel⁶, Sérgio Matos⁷, David Campos⁷, Buzhou Tang⁸, Hua Xu⁹, Tsendsuren Munkhdalai¹⁰, Keun Ho Ryu¹⁰, S. V. Ramanan¹¹, Senthil Nathan¹¹, Slavko Žitnik¹², Marko Bajec¹², Lutz Weber, Matthias Irmer, Saber A. Akhondi¹³, Jan A. Kors¹³, Shuo Xu, Xin An¹⁴, Utpal Kumar Sikdar¹⁵, Asif Ekbal¹⁵, Masaharu Yoshioka¹⁶, Thaer M. Dieb¹⁶, Miji Choi¹⁷, Karin Verspoor¹⁷, Madian Khabsa¹⁸, C. Lee Giles¹⁸, Hongfang Liu¹⁹, Komandur Elayavilli Ravikumar¹⁹, Andre Lamurias²⁰, Francisco M. Couto²⁰, Hong-Jie Dai²¹, Richard Tzong-Han Tsai²², Caglar Ata²³, Tolga Can²³, Anabel Usié, Rui Alves, Isabel Segura-Bedmar²⁴, Paloma Martínez²⁴, Julen Oyarzabal¹, Alfonso Valencia - Show less +49 more•Institutions (24)

University of Navarra¹, Technical University of Madrid², National Institutes of Health³, Wuhan University⁴, Humboldt University of Berlin⁵, University College London⁶, University of Aveiro⁷, Harbin Institute of Technology Shenzhen Graduate School⁸, University of Texas Health Science Center at Houston⁹, Chungbuk National University¹⁰, Indian Institute of Technology Madras¹¹, University of Ljubljana¹², Erasmus University Medical Center¹³, Beijing Forestry University¹⁴, Indian Institute of Technology Patna¹⁵, Hokkaido University¹⁶, University of Melbourne¹⁷, Pennsylvania State University¹⁸, University of Rochester¹⁹, University of Lisbon²⁰, Taipei Medical University²¹, National Central University²², Middle East Technical University²³, Charles III University of Madrid²⁴

19 Jan 2015-Journal of Cheminformatics

TL;DR: The CHEMDNER corpus is presented, a collection of 10,000 PubMed abstracts that contain a total of 84,355 chemical entity mentions labeled manually by expert chemistry literature curators, following annotation guidelines specifically defined for this task.

...read moreread less

Abstract: The automatic extraction of chemical information from text requires the recognition of chemical entity mentions as one of its key steps. When developing supervised named entity recognition (NER) systems, the availability of a large, manually annotated text corpus is desirable. Furthermore, large corpora permit the robust evaluation and comparison of different approaches that detect chemicals in documents. We present the CHEMDNER corpus, a collection of 10,000 PubMed abstracts that contain a total of 84,355 chemical entity mentions labeled manually by expert chemistry literature curators, following annotation guidelines specifically defined for this task. The abstracts of the CHEMDNER corpus were selected to be representative for all major chemical disciplines. Each of the chemical entity mentions was manually labeled according to its structure-associated chemical entity mention (SACEM) class: abbreviation, family, formula, identifier, multiple, systematic and trivial. The difficulty and consistency of tagging chemicals in text was measured using an agreement study between annotators, obtaining a percentage agreement of 91. For a subset of the CHEMDNER corpus (the test set of 3,000 abstracts) we provide not only the Gold Standard manual annotations, but also mentions automatically detected by the 26 teams that participated in the BioCreative IV CHEMDNER chemical mention recognition task. In addition, we release the CHEMDNER silver standard corpus of automatically extracted mentions from 17,000 randomly selected PubMed abstracts. A version of the CHEMDNER corpus in the BioC format has been generated as well. We propose a standard for required minimum information about entity annotations for the construction of domain specific corpora on chemical and drug entities. The CHEMDNER corpus and annotation guidelines are available at: http://www.biocreative.org/resources/biocreative-iv/chemdner-corpus/

...read moreread less

368 citations

Journal Article•DOI•

How Intense Are You? Predicting Intensities of Emotions and Sentiments using Stacked Ensemble [Application Notes]

[...]

Shad Akhtar¹, Asif Ekbal¹, Erik Cambria²•Institutions (2)

Indian Institute of Technology Patna¹, Nanyang Technological University²

10 Jan 2020-IEEE Computational Intelligence Magazine

TL;DR: A stacked ensemble method for predicting the degree of intensity for emotion and sentiment by combining the outputs obtained from several deep learning and classical feature-based models using a multi-layer perceptron network is proposed.

...read moreread less

Abstract: Emotions and sentiments are subjective in nature. They differ on a case-to-case basis. However, predicting only the emotion and sentiment does not always convey complete information. The degree or level of emotions and sentiments often plays a crucial role in understanding the exact feeling within a single class (e.g., `good' versus `awesome'). In this paper, we propose a stacked ensemble method for predicting the degree of intensity for emotion and sentiment by combining the outputs obtained from several deep learning and classical feature-based models using a multi-layer perceptron network. We develop three deep learning models based on convolutional neural network, long short-term memory and gated recurrent unit and one classical supervised model based on support vector regression. We evaluate our proposed technique for two problems, i.e., emotion analysis in the generic domain and sentiment analysis in the financial domain. The proposed model shows impressive results for both the problems. Comparisons show that our proposed model achieves improved performance over the existing state-of-the-art systems.

...read moreread less

184 citations

Book Chapter•DOI•

Fighting an Infodemic: COVID-19 Fake News Dataset.

[...]

Parth Patwa, Shivam Sharma, Srinivas Pykl, Vineeth Guptha, Gitanjali Kumari, Shad Akhtar, Asif Ekbal, Amitava Das, Tanmoy Chakraborty - Show less +5 more

06 Nov 2020-arXiv: Computation and Language

TL;DR: A manually annotated dataset of 10,700 social media posts and articles of real and fake news on COVID-19 is curate and released, and four machine learning baselines are benchmarked.

...read moreread less

Abstract: Along with COVID-19 pandemic we are also fighting an `infodemic'. Fake news and rumors are rampant on social media. Believing in rumors can cause significant harm. This is further exacerbated at the time of a pandemic. To tackle this, we curate and release a manually annotated dataset of 10,700 social media posts and articles of real and fake news on COVID-19. We benchmark the annotated dataset with four machine learning baselines - Decision Tree, Logistic Regression, Gradient Boost, and Support Vector Machine (SVM). We obtain the best performance of 93.46% F1-score with SVM. The data and code is available at: this https URL

...read moreread less

178 citations

Journal Article•DOI•

Feature selection and ensemble construction

[...]

Shad Akhtar¹, Deepak Gupta¹, Asif Ekbal¹, Pushpak Bhattacharyya¹•Institutions (1)

Indian Institute of Technology Patna¹

01 Jun 2017-Knowledge Based Systems

TL;DR: A cascaded framework of feature selection and classifier ensemble using particle swarm optimization (PSO) for aspect based sentiment analysis using three classifiers, namely Maximum Entropy, Conditional Random Field and Support Vector Machine are presented.

...read moreread less

Abstract: In this paper we present a cascaded framework of feature selection and classifier ensemble using particle swarm optimization (PSO) for aspect based sentiment analysis. Aspect based sentiment analysis is performed in two steps, viz. aspect term extraction and sentiment classification. The pruned, compact set of features performs better compared to the baseline model that makes use of the complete set of features for aspect term extraction and sentiment classification. We further construct an ensemble based on PSO, and put it in cascade after the feature selection module. We use the features that are identified based on the properties of different classifiers and domains. As base learning algorithms we use three classifiers, namely Maximum Entropy (ME), Conditional Random Field (CRF) and Support Vector Machine (SVM). Experiments for aspect term extraction and sentiment analysis on two different kinds of domains show the effectiveness of our proposed approach.

...read moreread less

138 citations

Proceedings Article•DOI•

Contextual Inter-modal Attention for Multi-modal Sentiment Analysis

[...]

Deepanway Ghosal¹, Shad Akhtar¹, Dushyant Singh Chauhan¹, Soujanya Poria², Asif Ekbal¹, Pushpak Bhattacharyya¹ - Show less +2 more•Institutions (2)

Indian Institute of Technology Patna¹, Agency for Science, Technology and Research²

01 Jan 2018

TL;DR: A recurrent neural network based multi-modal attention framework that leverages the contextual information for utterance-level sentiment prediction that applies attention on multi- modal multi-utterance representations and tries to learn the contributing features amongst them.

...read moreread less

Abstract: Multi-modal sentiment analysis offers various challenges, one being the effective combination of different input modalities, namely text, visual and acoustic. In this paper, we propose a recurrent neural network based multi-modal attention framework that leverages the contextual information for utterance-level sentiment prediction. The proposed approach applies attention on multi-modal multi-utterance representations and tries to learn the contributing features amongst them. We evaluate our proposed approach on two multi-modal sentiment analysis benchmark datasets, viz. CMU Multi-modal Opinion-level Sentiment Intensity (CMU-MOSI) corpus and the recently released CMU Multi-modal Opinion Sentiment and Emotion Intensity (CMU-MOSEI) corpus. Evaluation results show the effectiveness of our proposed approach with the accuracies of 82.31% and 79.80% for the MOSI and MOSEI datasets, respectively. These are approximately 2 and 1 points performance improvement over the state-of-the-art models for the datasets.

...read moreread less

119 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•

다중혈관 관상동맥 환자에서 y-문합을 이용하여 양쪽 내흉동맥만을 사용한 우회술의 조기 성적

[...]

성기익, 이영탁, 박계현, 전태국, 박표원, 한일용, 장윤희 - Show less +3 more

01 Mar 2003-The Korean Journal of Thoracic and Cardiovascular Surgery

28,685 citations

Journal Article•DOI•

Machine learning

[...]

Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

01 Dec 1996-ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

...read moreread less

13,246 citations

Data Mining - Concepts and Techniques.

[...]

Petra Perner

01 Jan 2002

9,314 citations

Journal Article•

Data Mining Practical Machine Learning Tools and Techniques

[...]

อนิรุธ สืบสิงห์

01 Jan 2014-Journal of management science