scispace - formally typeset

Decision tree model

About: Decision tree model is a research topic. Over its lifetime, 2,256 publications have been published within this topic, receiving 38,142 citations.
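For context, a decision tree model routes an example from the root to a leaf by testing one feature at each internal node; the leaf reached gives the prediction. A minimal illustrative sketch (the tree structure, features, and thresholds below are hypothetical, not taken from any of the papers on this page):

```python
# Minimal decision tree: internal nodes test a feature against a threshold,
# leaves hold a predicted class label. Data and splits are illustrative.
class Node:
    def __init__(self, feature=None, threshold=None, left=None, right=None, label=None):
        self.feature, self.threshold = feature, threshold
        self.left, self.right, self.label = left, right, label

def predict(node, x):
    """Route example x down the tree until a leaf (a node with a label) is reached."""
    while node.label is None:
        node = node.left if x[node.feature] <= node.threshold else node.right
    return node.label

# A hand-built tree: test feature 0 at the root, then feature 1 on the right branch.
tree = Node(feature=0, threshold=2.5,
            left=Node(label=0),
            right=Node(feature=1, threshold=1.7,
                       left=Node(label=1),
                       right=Node(label=2)))

print(predict(tree, [1.4, 0.2]))  # → 0 (1.4 <= 2.5, so the left leaf is reached)
```

Learning such a tree from data means choosing, at each node, the feature and threshold that best separate the classes, which is where the split criteria discussed in the papers below come in.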


Papers
01 Jan 2008
TL;DR: A probabilistic generative approach for constructing topographic maps of tree-structured data, in which local noise models are induced by a smooth mapping from a low-dimensional latent space; the smoothness of the mapping allows calculation of magnification factors, a useful tool for the detection of data clusters.
Abstract: In this paper, we present a probabilistic generative approach for constructing topographic maps of tree-structured data. Our model defines a low-dimensional manifold of local noise models, namely, (hidden) Markov tree models, induced by a smooth mapping from low-dimensional latent space. We contrast our approach with that of topographic map formation using recursive neural-based techniques, namely, the self-organizing map for structured data (SOMSD) (Hagenbuchner et al., 2003). The probabilistic nature of our model brings a number of benefits: 1) a naturally defined cost function that drives the model optimization; 2) principled model comparison and testing for overfitting; 3) a potential for transparent interpretation of the map by inspecting the underlying local noise models; 4) natural accommodation of alternative local noise models implicitly expressing different notions of structured data similarity. Furthermore, in contrast with the recursive neural-based approaches, the smooth nature of the mapping from the latent space to the local model space allows for calculation of magnification factors, a useful tool for the detection of data clusters. We demonstrate our approach on three data sets: a toy data set, an artificially generated data set, and a data set of images represented as quadtrees. Index Terms: Hidden Markov tree model (HMTM), structured data, topographic mapping.
Dissertation
01 Feb 2020
TL;DR: This research studies learning NAT model-based BNs from data by applying the Minimum Description Length principle and heuristic search, and advances BN structure learning with local models by focusing on inequality constraints.
Abstract: Learning Non-Impeding Noisy-AND Tree Model Based Bayesian Networks from Data. Qian Wang, University of Guelph, 2020. Advisor: Dr. Yang Xiang. Bayesian Networks (BNs) are a widely utilized formalism for representing knowledge in intelligent agents operating in partially observable and stochastic application environments. When conditional probability tables are used in BNs to quantify the strength of dependency between each variable and its parents, the space complexity is exponential in the number m of parents per variable. The time complexity of inference is also lower-bounded exponentially by m. Non-impeding noisy-AND Tree (NAT) model-based BNs can significantly improve both space and time complexity, rendering both measures linear in m, for a wide range of sparse BN structures. This research studies learning NAT model-based BNs from data by applying the Minimum Description Length principle and heuristic search. It advances BN structure learning with local models by focusing on inequality constraints. Practitioners can make tractable inferences using such BNs learned from data, especially when data admits high treewidth and low-density structures.
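The thesis' exact scoring function is not given in the abstract, but the Minimum Description Length principle it applies has a standard two-part form: total bits = bits to encode the model's parameters plus bits to encode the data under the model, so a more complex model must earn its extra parameters through a better data fit. A minimal, hypothetical Python sketch of that trade-off (the (k/2)·log2 n parameter cost is the usual asymptotic choice, not necessarily the thesis' coding scheme, and the coin data is invented for illustration):

```python
import math

def mdl_score(log2_likelihood, num_params, n):
    """Two-part MDL: (k/2)*log2(n) bits to encode k parameters, minus the
    data's log2-likelihood (= bits to encode the data under the model).
    Lower is better."""
    return (num_params / 2) * math.log2(n) - log2_likelihood

# Toy example: 100 coin flips, 55 heads. Compare a 0-parameter fair-coin
# model against a 1-parameter model using the MLE p = 0.55.
heads, n = 55, 100
ll_fair = n * math.log2(0.5)
p = heads / n
ll_mle = heads * math.log2(p) + (n - heads) * math.log2(1 - p)

# On near-fair data the extra parameter costs more bits than it saves,
# so MDL selects the simpler model.
print(mdl_score(ll_fair, 0, n), mdl_score(ll_mle, 1, n))
```

The same selection pressure, applied to candidate NAT models and BN structures rather than coin models, is what lets an MDL-guided heuristic search avoid overfitting the local models to the data.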
Proceedings ArticleDOI
01 Oct 2020
TL;DR: In this paper, a decision tree algorithm was used to classify relevant graduate employment data, and a balance coefficient was introduced to improve the C4.5 algorithm, giving the decision tree higher accuracy.
Abstract: In this paper, the concept of data mining, its algorithms, and the actual mining process are discussed in detail. Aiming at the large amount of data accumulated in the university employment information management system, and taking an actual employment case analysis as an example, the decision tree algorithm in data mining is used to classify the relevant graduate employment data. To improve C4.5, a balance coefficient is introduced into the algorithm so that the decision tree has higher accuracy. Then, a decision tree model of the employment information system of college graduates is established by mining the sample data. Finally, the model is used to analyze the data of graduates and predict their employment success rate. This paper also summarizes the advantages of the improved algorithm in mining accuracy, number of rules, and other respects, and illustrates the effectiveness of the improved algorithm.
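The paper's balance coefficient is not specified in the abstract, but the C4.5 criterion it modifies is the gain ratio: information gain normalized by the split information of the candidate attribute. An illustrative pure-Python sketch of that baseline criterion (the attribute names and toy employment-style records are hypothetical, not the paper's data):

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def gain_ratio(rows, labels, attr):
    """C4.5 split criterion: information gain divided by split information."""
    n = len(labels)
    subsets = {}
    for row, y in zip(rows, labels):
        subsets.setdefault(row[attr], []).append(y)
    cond = sum(len(ys) / n * entropy(ys) for ys in subsets.values())
    gain = entropy(labels) - cond
    split_info = entropy([row[attr] for row in rows])
    return gain / split_info if split_info > 0 else 0.0

# Toy graduate-employment-style records (hypothetical attributes).
rows = [{"major": "CS",  "gpa": "high"},
        {"major": "CS",  "gpa": "high"},
        {"major": "Art", "gpa": "high"},
        {"major": "Art", "gpa": "low"}]
labels = ["employed", "employed", "employed", "unemployed"]

# "gpa" separates the classes perfectly here, so it gets the higher ratio
# and would be chosen for the split.
print(gain_ratio(rows, labels, "gpa"), gain_ratio(rows, labels, "major"))
```

A modification such as the paper's balance coefficient would typically reweight this criterion, e.g. to counteract class imbalance in the training data, though the exact formulation is not given in the abstract.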
Posted ContentDOI
10 Jul 2020 - bioRxiv
TL;DR: Implemented in an easily installed package with a detailed vignette, treeheatr can be a useful teaching tool to enhance students’ understanding of a simple decision tree model before diving into more complex tree-based machine learning methods.
Abstract: Summary: treeheatr is an R package for creating interpretable decision tree visualizations with the data represented as a heatmap at the tree's leaf nodes. The integrated presentation of the tree structure along with an overview of the data efficiently illustrates how the tree nodes split up the feature space and how well the tree model performs. This visualization can also be examined in depth to uncover the correlation structure in the data and the importance of each feature in predicting the outcome. Implemented in an easily installed package with a detailed vignette, treeheatr can be a useful teaching tool to enhance students' understanding of a simple decision tree model before diving into more complex tree-based machine learning methods. Availability: The treeheatr package is freely available under the permissive MIT license at https://trang1618.github.io/treeheatr and https://cran.r-project.org/package=treeheatr. It comes with a detailed vignette that is automatically built with GitHub Actions continuous integration. Contact: ttle{at}pennmedicine.upenn.edu. Competing Interest Statement: The authors have declared no competing interest.

Network Information
Related Topics (5)
Cluster analysis: 146.5K papers, 2.9M citations (80% related)
Artificial neural network: 207K papers, 4.5M citations (78% related)
Fuzzy logic: 151.2K papers, 2.3M citations (77% related)
The Internet: 213.2K papers, 3.8M citations (77% related)
Deep learning: 79.8K papers, 2.1M citations (77% related)
Performance
Metrics
No. of papers in the topic in previous years:

Year  Papers
2023  10
2022  24
2021  101
2020  163
2019  158
2018  121