Home
/
Authors
/
Anirban DasGupta

Author

Anirban DasGupta

Other affiliations: University of California, San Diego

Bio: Anirban DasGupta is an academic researcher from Purdue University. The author has contributed to research in topics: Central limit theorem & Random variable. The author has an hindex of 19, co-authored 98 publications receiving 4973 citations. Previous affiliations of Anirban DasGupta include University of California, San Diego.

Papers published on a yearly basis

2018
2014
2012
2011
2010
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1997
1995
1994
1993
1992
1991
1989
1988
1986

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Interval Estimation for a Binomial Proportion

[...]

Lawrence D. Brown, T. Tony Cai, Anirban DasGupta

01 May 2001-Statistical Science

TL;DR: In this paper, the problem of interval estimation of a binomial proportion is revisited, and a number of natural alternatives are presented, each with its motivation and con- text, each interval is examined for its coverage probability and its length.

...read moreread less

Abstract: We revisit the problem of interval estimation of a binomial proportion. The erratic behavior of the coverage probability of the stan- d ardWaldconfid ence interval has previously been remarkedon in the literature (Blyth andStill, Agresti andCoull, Santner andothers). We begin by showing that the chaotic coverage properties of the Waldinter- val are far more persistent than is appreciated. Furthermore, common textbook prescriptions regarding its safety are misleading and defective in several respects andcannot be trusted . This leads us to consideration of alternative intervals. A number of natural alternatives are presented, each with its motivation and con- text. Each interval is examinedfor its coverage probability andits length. Basedon this analysis, we recommendthe Wilson interval or the equal- tailedJeffreys prior interval for small n andthe interval suggestedin Agresti andCoull for larger n. We also provide an additional frequentist justification for use of the Jeffreys interval.

...read moreread less

2,893 citations

Book•

Asymptotic Theory of Statistics and Probability

[...]

Anirban DasGupta

12 Aug 2008

TL;DR: In this paper, a collection of Inequalities in Probability, Linear Algebra, and Analysis is presented. But they focus mainly on two-sample problems: Chi-square Tests for Goodness of Fit and Goodness-of-Fit with estimated parameters.

...read moreread less

Abstract: Basic Convergence Concepts and Theorems.- Metrics, Information Theory, Convergence, and Poisson Approximations.- More General Weak and Strong Laws and the Delta Theorem.- Transformations.- More General Central Limit Theorems.- Moment Convergence and Uniform Integrability.- Sample Percentiles and Order Statistics.- Sample Extremes.- Central Limit Theorems for Dependent Sequences.- Central Limit Theorem for Markov Chains.- Accuracy of Central Limit Theorems.- Invariance Principles.- Edgeworth Expansions and Cumulants.- Saddlepoint Approximations.- U-statistics.- Maximum Likelihood Estimates.- M Estimates.- The Trimmed Mean.- Multivariate Location Parameter and Multivariate Medians.- Bayes Procedures and Posterior Distributions.- Testing Problems.- Asymptotic Efficiency in Testing.- Some General Large-Deviation Results.- Classical Nonparametrics.- Two-Sample Problems.- Goodness of Fit.- Chi-square Tests for Goodness of Fit.- Goodness of Fit with Estimated Parameters.- The Bootstrap.- Jackknife.- Permutation Tests.- Density Estimation.- Mixture Models and Nonparametric Deconvolution.- High-Dimensional Inference and False Discovery.- A Collection of Inequalities in Probability, Linear Algebra, and Analysis.

...read moreread less

738 citations

Journal Article•DOI•

An overview of robust Bayesian analysis

[...]

James O. Berger¹, Elías Moreno², Luis R. Pericchi³, M. Jesús Bayarri⁴, José M. Bernardo⁴, Juan Antonio Cano⁵, Julián de la Horra⁶, Jacinto Martín⁷, David Rios-Insua⁷, Bruno Betrò, Anirban DasGupta¹, Paul Gustafson, Larry Wasserman, Joseph B. Kadane, Cid Srinivasan, Michael Lavine, Anthony O'Hagan⁸, Wolfgang Polasek⁹, Christian P. Robert¹⁰, Constantinos Goutis¹¹, Fabrizio Ruggeri, G. Salinetti¹², Siva Sivaganesan¹³ - Show less +19 more•Institutions (13)

Purdue University¹, University of Granada², Simón Bolívar University³, University of Valencia⁴, University of Murcia⁵, Autonomous University of Madrid⁶, Technical University of Madrid⁷, University of Nottingham⁸, University of Basel⁹, University of Rouen¹⁰, University College London¹¹, Sapienza University of Rome¹², University of Cincinnati¹³

01 Jun 1994-Test

TL;DR: An overview of the subject of robust Bayesian analysis is provided, one that is accessible to statisticians outside the field, and recent developments in the area are reviewed.

...read moreread less

Abstract: Robust Bayesian analysis is the study of the sensitivity of Bayesian answers to uncertain inputs. This paper seeks to provide an overview of the subject, one that is accessible to statisticians outside the field. Recent developments in the area are also reviewed, though with very uneven emphasis.

...read moreread less

587 citations

Journal Article•DOI•

Confidence Intervals for a binomial proportion and asymptotic expansions

[...]

Lawrence D. Brown, T. Tony Cai, Anirban DasGupta

01 Feb 2002-Annals of Statistics

TL;DR: Brown, Cai and DasGupta as mentioned in this paper compared the coverage properties of the standard Wald interval and four alternative interval methods by asymptotic expansions of their coverage probabilities and expected lengths.

...read moreread less

Abstract: We address the classic problem of interval estimation of a binomial proportion. The Wald interval $\hat{p}\pm z_{\alpha/2} n^{-1/2} (\hat{p} (1 - \hat{p}))^{1/2}$ is currently in near universal use. We first show that the coverage properties of the Wald interval are persistently poor and defy virtually all conventional wisdom. We then proceed to a theoretical comparison of the standard interval and four additional alternative intervals by asymptotic expansions of their coverage probabilities and expected lengths. The four additional interval methods we study in detail are the score-test interval (Wilson), the likelihood-ratio-test interval, a Jeffreys prior Bayesian interval and an interval suggested by Agresti and Coull. The asymptotic expansions for coverage show that the first three of these alternative methods have coverages that fluctuate about the nominal value, while the Agresti–Coull interval has a somewhat larger and more nearly conservative coverage function. For the five interval methods we also investigate asymptotically their average coverage relative to distributions for $p$ supported within (0 1) . In terms of expected length, asymptotic expansions show that the Agresti–Coull interval is always the longest of these. The remaining three are rather comparable and are shorter than the Wald interval except for $p$ near 0 or 1. These analytical calculations support and complement the findings and the recommendations in Brown, Cai and DasGupta (Statist. Sci. (2001) 16 101–133).

...read moreread less

299 citations

Journal Article•DOI•

The matching, birthday and the strong birthday problem: a contemporary review

[...]

Anirban DasGupta¹•Institutions (1)

Purdue University¹

01 Mar 2005-Journal of Statistical Planning and Inference

TL;DR: In this article, a contemporary exposition at a moderately quantitative level of the distribution theory associated with the matching and the birthday problems is provided to help a reader have a feeling for these questions at an intuitive level.

...read moreread less

75 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20

Collapse

Cited by

PDF

Open Access

More filters

Book•

Machine Learning : A Probabilistic Perspective

[...]

Kevin P. Murphy

24 Aug 2012

TL;DR: This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach, and is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.

...read moreread less

Abstract: Today's Web-enabled deluge of electronic data calls for automated methods of data analysis. Machine learning provides these, developing methods that can automatically detect patterns in data and then use the uncovered patterns to predict future data. This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach. The coverage combines breadth and depth, offering necessary background material on such topics as probability, optimization, and linear algebra as well as discussion of recent developments in the field, including conditional random fields, L1 regularization, and deep learning. The book is written in an informal, accessible style, complete with pseudo-code for the most important algorithms. All topics are copiously illustrated with color images and worked examples drawn from such application domains as biology, text processing, computer vision, and robotics. Rather than providing a cookbook of different heuristic methods, the book stresses a principled model-based approach, often using the language of graphical models to specify models in a concise and intuitive way. Almost all the models described have been implemented in a MATLAB software package--PMTK (probabilistic modeling toolkit)--that is freely available online. The book is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.

...read moreread less

8,059 citations

Journal Article•DOI•

Convergence of Probability Measures

[...]

J. F. C. Kingman¹•Institutions (1)

University of Sussex¹

01 Nov 1969-Journal of The Royal Statistical Society Series C-applied Statistics

TL;DR: Convergence of Probability Measures as mentioned in this paper is a well-known convergence of probability measures. But it does not consider the relationship between probability measures and the probability distribution of probabilities.

...read moreread less

Abstract: Convergence of Probability Measures. By P. Billingsley. Chichester, Sussex, Wiley, 1968. xii, 253 p. 9 1/4“. 117s.

...read moreread less

5,689 citations

Journal Article•DOI•

Interval Estimation for a Binomial Proportion

[...]

Lawrence D. Brown, T. Tony Cai, Anirban DasGupta

01 May 2001-Statistical Science

...read moreread less

2,893 citations

Journal Article•DOI•

The Danish National Patient Registry: a review of content, data quality, and research potential.

[...]

Morten Schmidt¹, Sigrún Alba Jóhannesdóttir Schmidt¹, Jakob Lynge Sandegaard, Vera Ehrenstein¹, Lars Pedersen¹, Henrik Toft Sørensen¹ - Show less +2 more•Institutions (1)

Aarhus University Hospital¹

17 Nov 2015-Clinical Epidemiology

TL;DR: The Danish National Patient Registry is a valuable tool for epidemiological research, however, both its strengths and limitations must be considered when interpreting research results, and continuous validation of its clinical data is essential.

...read moreread less

Abstract: Background The Danish National Patient Registry (DNPR) is one of the world’s oldest nationwide hospital registries and is used extensively for research. Many studies have validated algorithms for identifying health events in the DNPR, but the reports are fragmented and no overview exists.

...read moreread less

2,818 citations

The handbook of research synthesis and meta-analysis, 2nd ed.

[...]

Harris Cooper¹, Larry V. Hedges, Jeffrey C. Valentine²•Institutions (2)

Duke University¹, University of Louisville²

01 Dec 2009

2,243 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse