Home
/
Topics
/
Decision tree model

Topic

Decision tree model

About: Decision tree model is a research topic. Over the lifetime, 2256 publications have been published within this topic receiving 38142 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970

Papers

PDF

Open Access

More filters

Proceedings Article•

Probabilistic models for query approximation with large sparse binary data sets

[...]

Dmitry Pavlov¹, Heikki Mannila², Padhraic Smyth¹•Institutions (2)

University of California, Irvine¹, Nokia²

30 Jun 2000

TL;DR: A Markov random field (MRF) approach based on frequent sets and maximum entropy is studied, and it is found that the MRF model provides substantially more accurate probability estimates than the other methods but is more expensive from a computational and memory viewpoint.

...read moreread less

Abstract: Large sparse sets of binary transaction data with millions of records and thousands of attributes occur in various domains: customers purchasing products, users visiting web pages, and documents containing words are just three typical examples. Real-time query selectivity estimation (the problem of estimating the number of rows in the data satisfying a given predicate) is an important practical problem for such databases. We investigate the application of probabilistic models to this problem. In particular, we study a Markov random field (MRF) approach based on frequent sets and maximum entropy, and compare it to the independence model and the Chow-Liu tree model. We find that the MRF model provides substantially more accurate probability estimates than the other methods but is more expensive from a computational and memory viewpoint. To alleviate the computational requirements we show how one can apply bucket elimination and clique tree approaches to take advantage of structure in the models and in the queries. We provide experimental results on two large real-world transaction datasets.

...read moreread less

30 citations

Journal Article•DOI•

Decision Trees for Geometric Models

[...]

Esther M. Arkin, Henk Meijer, Joseph S. B. Mitchell, David Rappaport, Steven Skiena - Show less +1 more

01 Jun 1998-International Journal of Computational Geometry and Applications

TL;DR: It is shown that a ⌈lg k⌉ height binary decision tree always exists for k polygonal models (in fixed position) and an efficient algorithm for constructing such decision tress is given when the models are given as a set of polygons in the plane.

...read moreread less

Abstract: A fundamental problem in model-based computer vision is that of identifying which of a given set of geometric models is present in an image. Considering a "probe" to be an oracle that tells us whether or not a model is present at a given point, we study the problem of computing efficient strategies ("decision trees") for probing an image, with the goal to minimize the number of probes necessary (in the worst case) to determine which single model is present. We show that a ⌈lg k⌉ height binary decision tree always exists for k polygonal models (in fixed position), provided (1) they are non-degenerate (do not share boundaries) and (2) they share a common point of intersection. Further, we give an efficient algorithm for constructing such decision tress when the models are given as a set of polygons in the plane. We show that constructing a minimum height tree is NP-complete if either of the two assumptions is omitted. We provide an efficient greedy heuristic strategy and show that, in the general case, it yields a decision tree whose height is at most ⌈lg k⌉ times that of an optimal tree. Finally, we discuss some restricted cases whose special structure allows for improved results.

...read moreread less

30 citations

On the Computational Complexity of Incremental Algorithms

[...]

Ganesan Ramalingam, Thomas Reps

01 Aug 1991

TL;DR: In this paper, the complexity hierarchy of P-time incremental problems, inherently Exp~ time incremental problems and non-incremental problems is investigated. But the results in this paper are restricted to locally persistent algorithms.

...read moreread less

Abstract: Our results, together with some previously known ones, shed light on the organization of the complexity hierarchy that exists when incremental-computation problems are classiﬁed according to their incremental complexity with respect to locally persistent algorithms. In particular, these results separate the classes of P-time incremental problems, inherently Exp~ time incremental problems, and non-incremental problems.

...read moreread less

30 citations

MB3-Miner: mining eMBedded subTREEs using Tree Model Guided candidate generation

[...]

Henry Tan, Tharam S. Dillon, Fedja Hadzic, Ling Feng, E. Chang - Show less +1 more

01 Jan 2005

TL;DR: This paper presents the mathematical model of a breadth-first-search Tree Model Guided (TMG) candidate generation approach, and proposes a novel and unique embedding list representation that is suitable for describing embedded subtrees.

...read moreread less

Abstract: Tree mining has many useful applications in areas such as Bioinformatics, XML mining, Web mining, etc. In general, most of the formally represented information in these domains is a tree structured form. In this paper we focus on mining frequent embedded subtrees from databases of rooted labeled ordered subtrees. We propose a novel and unique embedding list representation that is suitable for describing embedded subtrees. This representation is completely different from the string-like or conventional adjacency list representation previously utilized for trees. We present the mathematical model of a breadth-first-search Tree Model Guided (TMG) candidate generation approach previously introduced in [8]. The key characteristic of the TMG approach is that it enumerates fewer candidates by ensuring that only valid candidates that conform to the structural aspects of the data are generated as opposed to the join approach. Our experiments with both synthetic and real-life datasets provide comparisons against one of the state-of-the-art algorithms, TreeMiner [15], and they demonstrate the effectiveness and the efficiency of the technique.

...read moreread less

29 citations

Journal Article•DOI•

Improved disaggregation of conventional soil maps

[...]

Anders Bjørn Møller¹, Brendan P. Malone², Nathan P. Odgers³, Nathan P. Odgers², Amélie Beucher¹, Bo V. Iversen¹, Mogens Humlekrog Greve¹, Budiman Minasny² - Show less +4 more•Institutions (3)

Aarhus University¹, University of Sydney², Landcare Research³

01 May 2019-Geoderma

TL;DR: In this article, the DSMART algorithm was used to disaggregate conventional soil maps and to produce high-quality soil maps when point observations are not available, and the results demonstrated that a suitable approach can provide reliable soil maps at a national extent.

...read moreread less

29 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
…
53
54
55
56
57
58
59
…
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

2,288

Papers

43,502

Citations

No. of papers in the topic in previous years
Year	Papers
2023	10
2022	24
2021	101
2020	163
2019	158
2018	121

Decision tree model

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics