Topic

Decision tree model

About: Decision tree model is a research topic. Over its lifetime, 2256 publications have been published within this topic, receiving 38142 citations.

Papers

Journal Article

Optimal decision trees and one-time-only branching programs for symmetric Boolean functions

Ingo Wegener¹

Goethe University Frankfurt¹

01 Aug 1984-Information & Computation

TL;DR: Efficient algorithms are presented for constructing optimal decision trees and optimal one-time-only branching programs for symmetric Boolean functions, and an exponential lower bound is shown on the decision tree complexity of a Boolean function that has linear formula size.

Abstract: Combinational complexity and depth are the most important complexity measures for Boolean functions. It has turned out to be very hard to prove good lower bounds on the combinational complexity or the depth of explicitly defined Boolean functions. Therefore one has restricted oneself to models where nontrivial lower bounds are easier to prove. Here decision trees, branching programs, and one-time-only branching programs are considered, where each variable may be tested on each path of computation only once. Efficient algorithms for the construction of optimal decision trees and optimal one-time-only branching programs for symmetric Boolean functions are presented. Furthermore, the following trade-off results are proved. An exponential lower bound is shown on the decision tree complexity of a Boolean function that has linear formula size and linear one-time-only branching program complexity. In addition, a quadratic lower bound is shown on the one-time-only branching program complexity of a Boolean function that has linear combinational complexity.
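
To make the notion of decision tree complexity concrete, the following is a small brute-force sketch. It is not the efficient construction from the paper. It assumes the complexity measure is the number of internal (query) nodes, which the abstract does not spell out, and the names min_tree_size, parity and majority are illustrative only. The search is exponential in n and is meant for tiny inputs.

```python
from functools import lru_cache
from itertools import product


def min_tree_size(f, n):
    """Fewest query nodes in any decision tree computing f on n bits (brute force)."""
    inputs = list(product((0, 1), repeat=n))

    @lru_cache(maxsize=None)
    def size(fixed):
        # fixed[i] is 0 or 1 if variable i was already queried on this path, else None.
        values = {
            f(x)
            for x in inputs
            if all(v is None or v == bit for v, bit in zip(fixed, x))
        }
        if len(values) == 1:
            return 0  # f is constant on the remaining inputs, so a leaf suffices
        # Try every unqueried variable as the next test (each is tested at most once per path).
        return min(
            1
            + size(fixed[:i] + (0,) + fixed[i + 1:])
            + size(fixed[:i] + (1,) + fixed[i + 1:])
            for i in range(n)
            if fixed[i] is None
        )

    return size((None,) * n)


def parity(x):
    # Symmetric: depends only on the number of ones.
    return sum(x) % 2


def majority(x):
    # Symmetric threshold function: 1 if more than half the bits are 1.
    return int(2 * sum(x) > len(x))
```

Under this node-count assumption, min_tree_size(parity, 3) is 7 (a complete tree, since every variable matters on every path) and min_tree_size(majority, 3) is 5.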

41 citations

Journal Article

The importance of the label hierarchy in hierarchical multi-label classification

Jurica Levatić¹, Dragi Kocev¹, Sašo Džeroski¹

Jožef Stefan Institute¹

01 Oct 2015

TL;DR: The results reveal that the hierarchy and the multiple labels do help to obtain a better single tree model, while this is not preserved for the ensemble models.

Abstract: We address the task of hierarchical multi-label classification (HMC). HMC is a task of structured output prediction where the classes are organized into a hierarchy and an instance may belong to multiple classes. In many problems, such as gene function prediction or prediction of ecological community structure, classes inherently follow these constraints. The potential for application of HMC was recognized by many researchers, and several such methods were proposed and demonstrated to achieve good predictive performance in the past. However, there is no clear understanding of when it is favorable to consider such relationships (hierarchical and multi-label) among classes, and when this presents an unnecessary burden for classification methods. To this end, we perform a detailed comparative study over 8 datasets that have HMC properties. We investigate two important influences in HMC: the multiple labels per example and the information about the hierarchy. More specifically, we consider four machine learning tasks: multi-label classification, hierarchical multi-label classification, single-label classification and hierarchical single-label classification. To construct the predictive models, we use predictive clustering trees (a generalized form of decision trees), which are able to tackle each of the modelling tasks listed. Moreover, we investigate whether the influence of the hierarchy and the multiple labels carries over to ensemble models. For each of the tasks, we construct a single tree and two ensembles (random forest and bagging). The results reveal that the hierarchy and the multiple labels do help to obtain a better single tree model, while this is not preserved for the ensemble models.
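
As a rough illustration of what an HMC target looks like, the sketch below expands an instance's labels to include their ancestors and encodes the result as a 0/1 vector. It assumes the usual hierarchy constraint (belonging to a class implies belonging to its ancestors), which the abstract alludes to but does not define. The class names, the PARENT map and both helper functions are hypothetical and are not taken from the paper or its datasets.

```python
# Hypothetical tree-shaped hierarchy: each class maps to its parent (None for top-level classes).
PARENT = {
    "metabolism": None,
    "energy": "metabolism",
    "glycolysis": "energy",
    "transport": None,
}
CLASSES = ["metabolism", "energy", "glycolysis", "transport"]


def with_ancestors(labels, parent):
    """Close a label set under the assumed constraint: a class implies all its ancestors."""
    closed = set()
    for label in labels:
        node = label
        # Stop early once we hit a class whose ancestors were already added.
        while node is not None and node not in closed:
            closed.add(node)
            node = parent.get(node)
    return closed


def to_indicator(labels, classes, parent):
    """Encode an instance's (possibly multiple) labels as a 0/1 vector over all classes."""
    closed = with_ancestors(labels, parent)
    return [1 if c in closed else 0 for c in classes]
```

With this encoding, to_indicator({"glycolysis"}, CLASSES, PARENT) yields [1, 1, 1, 0]. Dropping the ancestor expansion would give the flat multi-label view of the same instance, which is one way to picture the difference between the tasks the study compares.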

40 citations

Journal Article

Polynomial-fuzzy decision tree structures for classifying medical data

Ernest Muthomi Mugambi¹, Andrew Hunter², Giles Oatley¹, Lee Kennedy¹

University of Sunderland¹, Durham University²

01 May 2004-Knowledge Based Systems

TL;DR: By trading off comprehensibility against performance with a multi-objective genetic programming optimization algorithm, the paper induces polynomial-fuzzy decision trees (PFDT) that are smaller, more compact and of better performance than their linear decision tree (LDT) counterparts.

Abstract: Decision tree induction has been studied extensively in machine learning as a solution for classification problems. The way linear decision trees partition the search space is found to be comprehensible and hence appealing to data modelers. Comprehensibility is an important aspect of models used in medical data mining as it determines model credibility and even acceptability. In practice, though, inordinately long decision trees compounded by replication problems detract from comprehensibility. This demerit can be partially attributed to their rigid structure, which is unable to handle complex non-linear and/or continuous data. To address this issue we introduce a novel hybrid multivariate decision tree composed of polynomial, fuzzy and decision tree structures. The polynomial nature of these multivariate trees enables them to perform well in non-linear territory, while the fuzzy members are used to squash continuous variables. By trading off comprehensibility against performance using a multi-objective genetic programming optimization algorithm, we can induce polynomial-fuzzy decision trees (PFDT) that are smaller, more compact and of better performance than their linear decision tree (LDT) counterparts. In this paper we discuss the structural differences between PFDT and LDT (C4.5) and compare the size and performance of their models using medical data.

40 citations

Journal Article

Applications of Ramsey's theorem to decision tree complexity

Shlomo Moran¹, Marc Snir², Udi Manber³

Technion – Israel Institute of Technology¹, Hebrew University of Jerusalem², University of Wisconsin-Madison³

01 Oct 1985-Journal of the ACM

TL;DR: All existing lower bounds for comparison-based algorithms are valid for general k-bounded decision trees, where k is a constant, and are shown to hold for nondeterministic and probabilistic decision trees as well.

Abstract: Combinatorial techniques for extending lower bound results for decision trees to general types of queries are presented. Problems that are defined by simple inequalities between inputs, called order invariant problems, are considered. A decision tree is called k-bounded if each query depends on at most k variables. No further assumptions on the type of queries are made. It is proved that one can replace the queries of any k-bounded decision tree that solves an order-invariant problem over a large enough input domain with k-bounded queries whose outcome depends only on the relative order of the inputs. As a consequence, all existing lower bounds for comparison-based algorithms are valid for general k-bounded decision trees, where k is a constant. An O(n log n) lower bound for the element uniqueness problem and several other problems for any k-bounded decision tree, such that k = O(n^c) and c

40 citations
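
For orientation, the element uniqueness problem named in the abstract above can be solved with pairwise comparisons by sorting and then checking neighbours. The sketch below is a standard textbook approach, not anything taken from the paper. It counts each three-way comparison as one query on two inputs, which makes it a 2-bounded procedure in the abstract's terminology. The function name all_distinct and the counting scheme are assumptions made for illustration.

```python
from functools import cmp_to_key


def all_distinct(values):
    """Return (distinct, comparisons), inspecting inputs only via pairwise comparisons."""
    comparisons = 0

    def compare(a, b):
        # One query depending on exactly two inputs; returns -1, 0 or 1.
        nonlocal comparisons
        comparisons += 1
        return (a > b) - (a < b)

    # Sorting brings equal elements next to each other.
    ordered = sorted(values, key=cmp_to_key(compare))
    for left, right in zip(ordered, ordered[1:]):
        if compare(left, right) == 0:
            return False, comparisons
    return True, comparisons
```

Calling all_distinct([3, 1, 2]) reports that the elements are distinct, while all_distinct([1, 2, 1]) reports a duplicate. The second return value gives the number of comparison queries used on that particular input.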

Journal Article

A model of computation for VLSI with related complexity results

Bernard Chazelle¹, Louis Monier¹

Carnegie Mellon University¹

01 Jul 1985-Journal of the ACM

TL;DR: A new model of computation for VLSI, based on the assumption that time for propagating information is at least linear in the distance, is proposed, which is especially suited for deriving lower bounds and trade-offs.

Abstract: A new model of computation for VLSI, based on the assumption that time for propagating information is at least linear in the distance, is proposed. While accommodating basic laws of physics, the model is designed to be general and technology independent. Thus, from a complexity viewpoint, it is especially suited for deriving lower bounds and trade-offs. New results for a number of problems, including fan-in, transitive functions, matrix multiplication, and sorting, are presented. As regards upper bounds, it must be noted that, because of communication costs, the model clearly favors regular and pipelined architectures (e.g., systolic arrays).

40 citations

Metrics

Papers: 2,288
Citations: 43,502

No. of papers in the topic in previous years
Year	Papers
2023	10
2022	24
2021	101
2020	163
2019	158
2018	121
