Home
/
Topics
/
Decision tree model

Topic

Decision tree model

About: Decision tree model is a research topic. Over the lifetime, 2256 publications have been published within this topic receiving 38142 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970

Papers

PDF

Open Access

More filters

Posted Content•

Efficient Metropolis-Hastings Proposal Mechanisms for Bayesian Regression Tree Models

[...]

Matthew T. Pratola

06 Dec 2013-arXiv: Computation

TL;DR: This paper develops novel proposal mechanisms for efficient sampling in the Bayesian Additive Regression Tree (BART) model and implements this sampling algorithm in the model and demonstrates its effectiveness on a prediction problem from computer experiments and a test function where structural tree variability is needed to fully explore the posterior.

...read moreread less

Abstract: Bayesian regression trees are flexible non-parametric models that are well suited to many modern statistical regression problems. Many such tree models have been proposed, from the simple single- tree model to more complex tree ensembles. Their non-parametric formulation allows for effective and efficient modeling of datasets exhibiting complex non-linear relationships between the model pre- dictors and observations. However, the mixing behavior of the Markov Chain Monte Carlo (MCMC) sampler is sometimes poor. This is because the proposals in the sampler are typically local alterations of the tree structure, such as the birth/death of leaf nodes, which does not allow for efficient traversal of the model space. This poor mixing can lead to inferential problems, such as under-representing uncertainty. In this paper, we develop novel proposal mechanisms for efficient sampling. The first is a rule perturbation proposal while the second we call tree rotation. The perturbation proposal can be seen as an efficient variation of the change proposal found in existing literature. The novel tree rotation proposal is simple to implement as it only requires local changes to the regression tree structure, yet it efficiently traverses disparate regions of the model space along contours of equal probability. When combined with the classical birth/death proposal, the resulting MCMC sampler exhibits good acceptance rates and properly represents model uncertainty in the posterior samples. We implement this sampling algorithm in the Bayesian Additive Regression Tree (BART) model and demonstrate its effectiveness on a prediction problem from computer experiments and a test function where structural tree variability is needed to fully explore the posterior.

...read moreread less

17 citations

Journal Article•DOI•

Application of a Digital Soil Mapping Method in Producing Soil Orders on Mountain Areas of Hong Kong Based on Legacy Soil Data

[...]

Xiao-Lin Sun¹, Xiao-Lin Sun², Yu-Guo Zhao¹, Yu-Guo Zhao², Gan-Lin Zhang², Gan-Lin Zhang¹, Sheng-Chun Wu², Yu Bon Man², Ming Hung Wong² - Show less +5 more•Institutions (2)

Chinese Academy of Sciences¹, Hong Kong Baptist University²

01 Jun 2011-Pedosphere

TL;DR: Based on legacy soil data from a soil survey conducted recently in the traditional manner in Hong Kong of China, a digital soil mapping method was applied to produce soil order information for mountain areas of Hong Kong as discussed by the authors.

...read moreread less

17 citations

Posted Content•DOI•

Mining Significant Features of Diabetes Mellitus Applying Decision Trees: A Case Study In Bangladesh

[...]

Koushik Chandra Howladar¹, Md. Shahriare Satu, Avijit Barua¹, Mohammad Ali Moni²•Institutions (2)

Noakhali Science and Technology University¹, University of Sydney²

30 Nov 2018-bioRxiv

TL;DR: CDT unpruned tree shows highest accuracy, precision, recall, f-measure, second highest AUROC and lowest RMSE than other models and plasma glucose, plasma glucose 2hr after glucose and HDL-cholesterol have been found as the most significant features to predict the severity of Diabetes Mellitus.

...read moreread less

Abstract: Diabetes is a chronic condition which is associated with an abnormally high level of sugar in the blood. It is a lifelong disease that causes harmful effects in human life. The goal of this research is to predict the severity of diabetes and find out significant features of it. In this work, we gathered diabetes patients records from Noakhali Diabetes Association, Noakhali, Bangladesh. Thus, We preprocessed our raw dataset by replacing and removing missing and wrong records respectively. Thus, CDT, J48, NBTree and REPtree decision tree based classification techniques were used to analyze this dataset. After this analysis, we evaluated classification outcomes of these decision tree classifiers and found the best decision tree model from them. In this work, CDT unpruned tree shows highest accuracy, precision, recall, f-measure, second highest AUROC and lowest RMSE than other models. Then, we extracted possible rules and significant features from this model and plasma glucose, plasma glucose 2hr after glucose and HDL-cholesterol have been found as the most significant features to predict the severity of Diabetes Mellitus. We hope this work will be beneficial to build a predictive system and complementary tool for diabetes treatment in future.

...read moreread less

17 citations

Journal Article•

A compatible growth-density stand model derived from a distance-dependent individual tree model

[...]

Guofan Shao, Herman H. Shugart

01 Jan 1997-Forest Science

17 citations

Book Chapter•DOI•

An Efficient PIR Construction Using Trusted Hardware

[...]

Yanjiang Yang¹, Xuhua Ding², Robert H. Deng², Feng Bao¹•Institutions (2)

Institute for Infocomm Research Singapore¹, Singapore Management University²

15 Sep 2008

TL;DR: Using the trusted hardware based model, the computation complexity of the scheme, including offline computation, is linear to the number of queries and is bounded by ${\mathrm{O}}(\sqrt{n})$ after optimization.

...read moreread less

Abstract: For a private information retrieval (PIR) scheme to be deployed in practice, low communication complexity and low computation complexity are two fundamental requirements it must meet. Most existing PIR schemes only focus on the communication complexity. The reduction on the computational complexity did not receive the due treatment mainly because of its O(n) lower bound. By using the trusted hardware based model, we design a novel scheme which breaks this barrier. With constant storage, the computation complexity of our scheme, including offline computation, is linear to the number of queries and is bounded by ${\mathrm{O}}(\sqrt{n})$ after optimization.

...read moreread less

17 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
…
86
87
88
89
90
91
92
…
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

2,288

Papers

43,502

Citations

No. of papers in the topic in previous years
Year	Papers
2023	10
2022	24
2021	101
2020	163
2019	158
2018	121

Decision tree model

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics