Conference

The Florida AI Research Society 

About: The Florida AI Research Society is an academic conference. The conference publishes mainly in the areas of Computer science and Context (language use). Over its lifetime, the conference has published 2,787 papers, which have received 24,900 citations.


Papers
Proceedings Article
01 Jan 2004
TL;DR: A sufficient condition for the optimality of naive Bayes is presented and proved, in which dependences between attributes do exist, providing evidence that dependences among attributes may cancel each other out.
Abstract: Naive Bayes is one of the most efficient and effective inductive learning algorithms for machine learning and data mining. Its competitive performance in classification is surprising, because the conditional independence assumption on which it is based is rarely true in real-world applications. An open question is: what is the true reason for the surprisingly good performance of naive Bayes in classification? In this paper, we propose a novel explanation for the superb classification performance of naive Bayes. We show that, essentially, the dependence distribution plays a crucial role: how the local dependence of a node is distributed in each class, evenly or unevenly, and how the local dependences of all nodes work together, consistently (supporting a certain classification) or inconsistently (canceling each other out). Therefore, no matter how strong the dependences among attributes are, naive Bayes can still be optimal if the dependences are distributed evenly across classes, or if the dependences cancel each other out. We propose and prove a sufficient and necessary condition for the optimality of naive Bayes. Further, we investigate the optimality of naive Bayes under the Gaussian distribution. We present and prove a sufficient condition for the optimality of naive Bayes in which dependences between attributes do exist. This provides evidence that dependences among attributes may cancel each other out. In addition, we explore when naive Bayes works well.

Naive Bayes and Augmented Naive Bayes. Classification is a fundamental issue in machine learning and data mining. In classification, the goal of a learning algorithm is to construct a classifier given a set of training examples with class labels. Typically, an example E is represented by a tuple of attribute values (x1, x2, · · · , xn), where xi is the value of attribute Xi. Let C represent the classification variable, and let c be the value of C.
In this paper, we assume that there are only two classes: + (the positive class) or − (the negative class). A classifier is a function that assigns a class label to an example. From the probability perspective, according to Bayes' rule, the probability of an example E = (x1, x2, · · · , xn) being of class c is

p(c|E) = p(E|c) p(c) / p(E).

E is classified as the class C = + if and only if

fb(E) = p(C = +|E) / p(C = −|E) ≥ 1,   (1)

where fb(E) is called a Bayesian classifier. Assume that all attributes are independent given the value of the class variable; that is,

p(E|c) = p(x1, x2, · · · , xn|c) = ∏ (i = 1 to n) p(xi|c).

(Copyright © 2004, American Association for Artificial Intelligence, www.aaai.org. All rights reserved.)

1,536 citations

Proceedings Article
01 May 1999
TL;DR: A new filter approach to feature selection that uses a correlation-based heuristic to evaluate the worth of feature subsets; when applied as a data preprocessing step for two common machine learning algorithms, it compares favourably with the wrapper approach.
Abstract: Feature selection is often an essential data processing step prior to applying a learning algorithm. The removal of irrelevant and redundant information often improves the performance of machine learning algorithms. There are two common approaches: a wrapper uses the intended learning algorithm itself to evaluate the usefulness of features, while a filter evaluates features according to heuristics based on general characteristics of the data. The wrapper approach is generally considered to produce better feature subsets but runs much more slowly than a filter. This paper describes a new filter approach to feature selection that uses a correlation-based heuristic to evaluate the worth of feature subsets. When applied as a data preprocessing step for two common machine learning algorithms, the new method compares favourably with the wrapper but requires much less computation.
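A correlation-based subset heuristic of the kind the abstract describes rewards subsets whose features correlate strongly with the class but weakly with each other. A minimal sketch of such a merit function (assuming precomputed average correlations; an illustration, not necessarily the paper's exact formula):

```python
from math import sqrt

def subset_merit(avg_feat_class_corr, avg_feat_feat_corr, k):
    """Heuristic merit of a k-feature subset: high average
    feature-class correlation raises the score, while high average
    feature-feature correlation (redundancy) lowers it."""
    return (k * avg_feat_class_corr) / sqrt(k + k * (k - 1) * avg_feat_feat_corr)
```

A filter would search over candidate subsets, scoring each with `subset_merit` instead of retraining the learning algorithm, which is why it runs much faster than a wrapper.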

547 citations

Proceedings Article
01 Jan 2003
TL;DR: A low-order polynomial algorithm and several variants that soundly induce the Markov Blanket under certain broad conditions in datasets with thousands of variables are introduced and compared to other state-of-the-art local and global methods with excellent results.
Abstract: This paper presents a number of new algorithms for discovering the Markov Blanket of a target variable T from training data. The Markov Blanket can be used for variable selection for classification, for causal discovery, and for Bayesian Network learning. We introduce a low-order polynomial algorithm and several variants that soundly induce the Markov Blanket under certain broad conditions in datasets with thousands of variables and compare them to other state-of-the-art local and global methods with excellent results.
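A common shape for such Markov Blanket discovery algorithms is a grow phase that greedily admits the variable most associated with the target, followed by a shrink phase that removes false positives via conditional-independence tests. A grow-shrink sketch under that assumption (hypothetical `assoc` and `cond_indep` oracles; not the paper's specific algorithms):

```python
def grow_shrink_mb(target, variables, assoc, cond_indep):
    """Sketch of Markov Blanket induction around `target`.
    assoc(v, t, mb): strength of association of v with t given mb.
    cond_indep(v, t, mb): True if v is conditionally independent of t given mb."""
    mb = []
    changed = True
    while changed:  # grow phase: admit strongly associated variables
        changed = False
        candidates = [v for v in variables if v != target and v not in mb]
        if not candidates:
            break
        best = max(candidates, key=lambda v: assoc(v, target, mb))
        if not cond_indep(best, target, mb):
            mb.append(best)
            changed = True
    for v in list(mb):  # shrink phase: drop variables the rest of mb explains
        rest = [u for u in mb if u != v]
        if cond_indep(v, target, rest):
            mb.remove(v)
    return mb
```

The conditional-independence tests condition only on the current blanket rather than on all variables, which is what keeps this family of methods tractable on datasets with thousands of variables.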

503 citations

Proceedings Article
16 May 2012
TL;DR: By providing the semantics and sentics associated with over 14,000 concepts, SenticNet 2 represents one of the most comprehensive semantic resources for the development of affect-sensitive applications in fields such as social data mining, multimodal affective HCI, and social media marketing.
Abstract: Web 2.0 has changed the ways people communicate, collaborate, and express their opinions and sentiments. But although social data on the Web are perfectly suitable for human consumption, they remain hardly accessible to machines. To bridge the cognitive and affective gap between word-level natural language data and the concept-level sentiments conveyed by them, we developed SenticNet 2, a publicly available semantic and affective resource for opinion mining and sentiment analysis. SenticNet 2 is built by means of sentic computing, a new paradigm that exploits both AI and Semantic Web techniques to better recognize, interpret, and process natural language opinions. By providing the semantics and sentics (that is, the cognitive and affective information) associated with over 14,000 concepts, SenticNet 2 represents one of the most comprehensive semantic resources for the development of affect-sensitive applications in fields such as social data mining, multimodal affective HCI, and social media marketing.

215 citations

Proceedings Article
14 May 2002
TL;DR: This paper presents a semi-automatic method of discovering generally applicable lexico-syntactic patterns that refer to the causal relation: the patterns are found automatically, but their validation is done semi-automatically.
Abstract: Given a semantic relation, the automatic extraction of linguistic patterns that express that relation is a rather difficult problem. This paper presents a semi-automatic method of discovering generally applicable lexico-syntactic patterns that refer to the causal relation. The patterns are found automatically, but their validation is done semi-automatically.
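A lexico-syntactic pattern for the causal relation can be as simple as a template like "<NP1> causes <NP2>" instantiated as a regular expression. A minimal sketch of the matching step (the pattern list here is illustrative, not the paper's validated set, and real patterns would operate over parsed text rather than raw strings):

```python
import re

# Hypothetical surface patterns expressing causation between two noun phrases.
CAUSAL_PATTERNS = [
    re.compile(r"(\w[\w ]*?) (?:causes|leads to|results in) (\w[\w ]*)"),
]

def extract_causal_pairs(sentence):
    """Return (cause, effect) pairs matched by any causal pattern."""
    pairs = []
    for pat in CAUSAL_PATTERNS:
        for m in pat.finditer(sentence.lower()):
            pairs.append((m.group(1).strip(), m.group(2).strip()))
    return pairs
```

In the paper's setting, candidate patterns like these are harvested automatically from corpora, and the semi-automatic step is a human judging which patterns reliably express causation.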

199 citations

Performance Metrics
No. of papers from the Conference in previous years
Year  Papers
2022  129
2021  111
2020  76
2019  73
2018  94
2017  122