An Algorithm for Fast Recovery of Sparse Causal Graphs

doi:10.1177/089443939100900106

Home
/
Papers
/
An Algorithm for Fast Recovery of Sparse Causal Graphs

Journal Article•DOI•

An Algorithm for Fast Recovery of Sparse Causal Graphs

Peter Spirtes¹, Clark Glymour¹•Institutions (1)

Carnegie Mellon University¹

01 Apr 1991-Social Science Computer Review (SAGE Publications)-Vol. 9, Iss: 1, pp 62-72

TL;DR: An asymptotically correct algorithm whose complexity for fixed graph connectivity increases polynomially in the number of vertices, and may in practice recover sparse graphs with several hundred variables.

read less

Abstract: Previous asymptotically correct algorithms for recovering causal structure from sample probabilities have been limited even in sparse causal graphs to a few variables. We describe an asymptotically...

...read moreread less

Citations

PDF

Open Access

More filters

Book•

Causation, prediction, and search

[...]

Peter Spirtes, Clark Glymour, Richard Scheines

01 Jan 1993

TL;DR: The authors axiomatize the connection between causal structure and probabilistic independence, explore several varieties of causal indistinguishability, formulate a theory of manipulation, and develop asymptotically reliable procedures for searching over equivalence classes of causal models.

...read moreread less

Abstract: What assumptions and methods allow us to turn observations into causal knowledge, and how can even incomplete causal knowledge be used in planning and prediction to influence and control our environment? In this book Peter Spirtes, Clark Glymour, and Richard Scheines address these questions using the formalism of Bayes networks, with results that have been applied in diverse areas of research in the social, behavioral, and physical sciences. The authors show that although experimental and observational study designs may not always permit the same inferences, they are subject to uniform principles. They axiomatize the connection between causal structure and probabilistic independence, explore several varieties of causal indistinguishability, formulate a theory of manipulation, and develop asymptotically reliable procedures for searching over equivalence classes of causal models, including models of categorical data and structural equation models with and without latent variables. The authors show that the relationship between causality and probability can also help to clarify such diverse topics in statistics as the comparative power of experimentation versus observation, Simpson's paradox, errors in regression models, retrospective versus prospective sampling, and variable selection. The second edition contains a new introduction and an extensive survey of advances and applications that have appeared since the first edition was published in 1993.

...read moreread less

4,863 citations

Journal Article•DOI•

A Bayesian Method for the Induction of Probabilistic Networks from Data

[...]

Gregory F. Cooper¹, Edward H. Herskovits²•Institutions (2)

University of Pittsburgh¹, Stanford University²

01 Oct 1992-Machine Learning

TL;DR: This paper presents a Bayesian method for constructing probabilistic networks from databases, focusing on constructing Bayesian belief networks, and extends the basic method to handle missing data and hidden variables.

...read moreread less

Abstract: This paper presents a Bayesian method for constructing probabilistic networks from databases. In particular, we focus on constructing Bayesian belief networks. Potential applications include computer-assisted hypothesis testing, automated scientific discovery, and automated construction of probabilistic expert systems. We extend the basic method to handle missing data and hidden (latent) variables. We show how to perform probabilistic inference by averaging over the inferences of multiple belief networks. Results are presented of a preliminary evaluation of an algorithm for constructing a belief network from a database of cases. Finally, we relate the methods in this paper to previous work, and we discuss open problems.

...read moreread less

3,971 citations

Journal Article•DOI•

Network Analysis: An Integrative Approach to the Structure of Psychopathology

[...]

Denny Borsboom¹, Angélique O. J. Cramer¹•Institutions (1)

University of Amsterdam¹

28 Mar 2013-Annual Review of Clinical Psychology

TL;DR: An examines methodologies suited to identify such symptom networks and discusses network analysis techniques that may be used to extract clinically and scientifically useful information from such networks (e.g., which symptom is most central in a person's network).

...read moreread less

Abstract: In network approaches to psychopathology, disorders result from the causal interplay between symptoms (e.g., worry → insomnia → fatigue), possibly involving feedback loops (e.g., a person may engage in substance abuse to forget the problems that arose due to substance abuse). The present review examines methodologies suited to identify such symptom networks and discusses network analysis techniques that may be used to extract clinically and scientifically useful information from such networks (e.g., which symptom is most central in a person's network). The authors also show how network analysis techniques may be used to construct simulation models that mimic symptom dynamics. Network approaches naturally explain the limited success of traditional research strategies, which are typically based on the idea that symptoms are manifestations of some common underlying factor, while offering promising methodological alternatives. In addition, these techniques may offer possibilities to guide and evaluate therape...

...read moreread less

1,824 citations

Cites methods from "An Algorithm for Fast Recovery of S..."

...Figure 7 (see Table 1 for the accompanying legend) provides a graphical representation of the result of applying the PC algorithm (Spirtes & Glymour 1991) to the NCS-R depression data using the R-package PcAlg (Kalisch et al. 2012)....
[...]

Journal Article•DOI•

Learning Bayesian networks from data: an information-theory based approach

[...]

Jie Cheng¹, Russell Greiner¹, Jonathan Kelly¹, David A. Bell², Weiru Liu² - Show less +1 more•Institutions (2)

University of Alberta¹, Ulster University²

01 May 2002-Artificial Intelligence

TL;DR: Algorithms that use an information-theoretic analysis to learn Bayesian network structures from data, requiring only polynomial numbers of conditional independence tests in typical cases are provided.

...read moreread less

804 citations

Cites background or methods from "An Algorithm for Fast Recovery of S..."

...For example, when compared to the PC algorithm (which also uses dependency analysis to learn Bayesian net structures; [48], see Section 7), we found that TPDA typically uses fewer CI tests than PC uses, but these tests are often more complex....
[...]
...This basic “try each subset” algorithm is used by essentially all other dependency-analysis based algorithms, including the SGS algorithm [47], the Verma–Pearl algorithm [56] and the PC algorithm [48]....
[...]
...PC [48] General Bayesian nets No O(Nk+2) K is the maximum degree of any node in the true structure; Can orient edges; enhanced from SGS algorithm; efficient...
[...]
...) However, this method is quite popular among Bayesian network learning algorithms due to its efficiency and reliability [48,56]....
[...]

Proceedings Article•

A Theory of Inferred Causation.

[...]

Judea Pearl¹, Thomas Verma¹•Institutions (1)

University of California, Los Angeles¹

01 Jan 1991

TL;DR: In this paper, the theory of inferred causation is discussed, where causal ordering is defined as the ordering at which subsets of variables can be solved independently of others; in other systems, it follows the way a disturbance is propagated from one variable to others.

...read moreread less

Abstract: Publisher Summary This chapter discusses the theory of inferred causation. The study of causation is central to the understanding of human reasoning. Inferences involving changing environments require causal theories that make formal distinctions between beliefs based on passive observations and those reflecting intervening actions. In applications such as diagnosis, qualitative physics, and plan recognition, a central task is that of finding a satisfactory explanation to a given set of observations, and the meaning of explanation is intimately related to the notion of causation. In some systems, causal ordering is defined as the ordering at which subsets of variables can be solved independently of others; in other systems, it follows the way a disturbance is propagated from one variable to others. An empirical semantics for causation is important for several reasons. The notion of causation is often associated with those of necessity and functional dependence; causal expressions often tolerate exceptions, primarily because of missing variables and coarse description. Temporal precedence is normally assumed essential for defining causation, and it is one of the most important clues that people use to distinguish causal from other types of associations.

...read moreread less

767 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157

Collapse

References

PDF

Open Access

More filters

Proceedings Article•

Equivalence and synthesis of causal models

[...]

Thomas Verma, Judea Pearl

27 Jul 1990

TL;DR: The canonical representation presented here yields an efficient algorithm for determining when two embedded causal models reflect the same dependency information, which leads to a model theoretic definition of causation in terms of statistical dependencies.

...read moreread less

Abstract: Scientists often use directed acyclic graphs ( dags) to model the qualitative structure of causal theories, al lowing the parameters to be estimated from observa tional data. Two causal models are equivalent if there is no expirement which could distinguish one from the other. A canonical representation for causal models is presented which yields an efficient graphical crite rion for deciding equivalence, and provides a theoret ical basis for extracting causal structures from em pirical data. This representation is then extended to the more general case of an embedded causal model, that is, a dag in which only a subset of the vari[Verma and Pearl 90]. One problem that has arisen in the course of these studies is that of non uniqueness; it is quite common for two different causal models to be experimentally indistinguishable, hence, equally predictive. Formally, let a causal the ory be a pair T =< D, e >, where D is a dag, called the causal model ofT, and e a set of parameters com patible with D (i.e., sufficient for forming a probabil ity distribution for which D is a Bayesian network). We say that two causal models D1 and D2 are equiv alent if for every theory T1 =< D1, 81 > there is a theory T2 =< D2, 82 > such that T1 and T2 describe the same probability distribution, and vice versa. ables are observable. The canonical representation / b "-.. presented here yields an efficient algorithm for detera "-.. a � /c mining when two embedded causal models are equiv· "' alent, and leads to a model theoretic definition of b "-.. / b causation in terms of statistical dependencies. "' c a"' /

...read moreread less

1,286 citations

"An Algorithm for Fast Recovery of S..." refers background in this paper

...&dquo; Figure i ALARM network 67 Verma and Pearl (1990) have suggested an improvement on the sGs algorithm....
[...]
...Verma and Pearl (1990) subsequently proved the correctness of the algorithm and offered a variant that outputs a pattern rather than a collection of graphs....
[...]

Book•

Introduction to structural equation models

[...]

Otis Dudley Duncan

01 Jan 1975

1,235 citations

"An Algorithm for Fast Recovery of S..." refers background in this paper

...Methodology texts routinely recommend generating the set of admissible causal structures from &dquo;substantive theory&dquo; (see Joreskog & Sorbom, 1984; Duncan, 1975)....
[...]

Journal Article•DOI•

Introduction to Structural Equation Models.

[...]

Stephen E. Fienberg, Otis Dudley Duncan

01 Jun 1977-Journal of the American Statistical Association

814 citations

Journal Article•DOI•

An introduction to structural equation models

[...]

David J. Francis¹•Institutions (1)

University of Houston¹

01 Oct 1988-Journal of Clinical and Experimental Neuropsychology

TL;DR: An overview of structural equation models, and their potential for advancing neuropsychological theory and practice is provided to help researchers assess the relevance of these advanced statistical techniques to their own research, and begin the process of successful application.

...read moreread less

Abstract: This paper provides an overview of structural equation models, and their potential for advancing neuropsychological theory and practice. Four topics are covered: (1) an overview of the various classes of models, and an introduction to the terminology and diagrams used to describe them, (2) an outline of the steps involved in applying structural equation modeling to any research problem, (3) an overview of the information used in assessing model fit, and a discussion of the role of significance tests in structural models, and (4) an outline of the advantages and disadvantages of structural equation models, and their potential contribution to neuropsychology. The paper is intended to help researchers (1) assess the relevance of these advanced statistical techniques to their own research, and (2) begin the process of successful application.

...read moreread less

337 citations

Proceedings Article•

an entropy-driven system for construction of probabilistic expert systems from databases

[...]

Edward H. Herskovits, Gregory F. Cooper

27 Jul 1990

TL;DR: Kutato as mentioned in this paper is a system that takes as input a database of cases and produces a belief network that captures many of the dependence relations represented by those data, incorporating a module for determining the entropy of belief networks and a module based on entropy calculations.

...read moreread less

Abstract: Kutato is a system that takes as input a database of cases and produces a belief network that captures many of the dependence relations represented by those data. This system incorporates a module for determining the entropy of a belief network and a module for constructing belief networks based on entropy calculations. Kutato constructs an initial belief network in which all variables in the database are assumed to be marginally independent. The entropy of this belief network is calculated, and that arc is added that minimizes the entropy of the resulting belief network. Conditional probabilities for an arc are obtained directly from the database. This process continues until an entropy-based threshold is reached. We have tested the system by generating databases from networks using the probabilistic logic-sampling method, and then using those databases as input to Kutato. The system consistently reproduces the original belief networks with high fidelity.

...read moreread less

149 citations