Author

Pranab Kumar Sen

University of North Carolina at Chapel Hill

Other affiliations: Indian Statistical Institute, Academia Sinica, University of Calcutta

Bio: Pranab Kumar Sen is an academic researcher from the University of North Carolina at Chapel Hill. The author has contributed to research on topics including estimators and nonparametric statistics, has an h-index of 51, and has co-authored 570 publications receiving 19,997 citations. Previous affiliations of Pranab Kumar Sen include the Indian Statistical Institute and Academia Sinica.

Papers published on a yearly basis (chart not reproduced; the years shown span 1967 to 2022)

Papers

Journal Article•DOI•

Some aspects of the statistical analysis of the 'mixed model'.

Gary G. Koch, Pranab Kumar Sen

01 Mar 1968-Biometrics

TL;DR: This paper deals with the statistical analysis (both parametric and non-parametric) of 'mixed model' experiments and illustrates, in two detailed numerical examples, the relative performances of the different test criteria when the null hypothesis is essentially true and when it is essentially false.

Abstract: This paper deals with the statistical analysis (both parametric and non-parametric) of 'mixed model' experiments. The general structure of such experiments involves n randomly chosen subjects who respond once to each of p distinct treatments. Thus the subject or block effects are random and treatment effects are fixed. The hypothesis of no treatment effects is considered under several different combinations of assumptions concerning the joint distribution of the observations corresponding to each of the particular subjects. For each situation, an appropriate test procedure is discussed and its properties studied. The different methods considered in the paper are illustrated in detail in two numerical examples. These examples have been chosen to illustrate the relative performances of the different test criteria for a situation in which the null hypothesis is essentially true (Example 1) and for a situation in which the null hypothesis is essentially false (Example 2). The reader may wish to begin by studying these examples for a better understanding of the theory. Finally, the section on examples contains algorithms for the efficient computation of the various test criteria. A computer program based on these algorithms has been written and can be made available to any interested persons.

47 citations

Journal Article•DOI•

Weak Convergence of Multidimensional Empirical Processes for Stationary $\phi$-Mixing Processes

Pranab Kumar Sen

01 Feb 1974-Annals of Probability

TL;DR: In this paper, weak convergence of the empirical process (in the $J_1$-topology on $D^p\lbrack 0, 1 \rbrack$) to an appropriate Gaussian process is established under a simple condition on the mixing constants.

Abstract: For a stationary $\phi$-mixing sequence of stochastic $p(\geqq 1)$-vectors, weak convergence of the empirical process (in the $J_1$-topology on $D^p\lbrack 0, 1 \rbrack$) to an appropriate Gaussian process is established under a simple condition on the mixing constants $\{\phi_n\}$. Weak convergence for a random number of stochastic vectors is also studied. Tail probability inequalities for Kolmogorov-Smirnov statistics are provided.

47 citations

Journal Article•DOI•

Estimating correlation by using a general linear mixed model: evaluation of the relationship between the concentration of HIV-1 RNA in blood and semen.

Hrishikesh Chakraborty¹, Ronald W. Helms², Pranab Kumar Sen², Myron S. Cohen²•Institutions (2)

Research Triangle Park¹, University of North Carolina at Chapel Hill²

15 May 2003-Statistics in Medicine

TL;DR: The findings confirm and extend the idea that the concentrations of HIV-1 in semen often differ from the HIV-1 concentration in blood, and indicate that antiretroviral therapy administered to subjects with low CD4 counts reduces HIV-1 in blood and semen concomitantly enough to improve the correlation between these compartments.

Abstract: Estimating the correlation coefficient between two outcome variables is one of the most important aspects of epidemiological and clinical research. A simple Pearson's correlation coefficient method is usually employed when there are complete independent data points for both outcome variables. However, researchers often deal with correlated observations in a longitudinal setting with missing values where a simple Pearson's correlation coefficient method cannot be used. General linear mixed model (GLMM) techniques were used to estimate correlation coefficients in a longitudinal data set with missing values. A random regression mixed model with unstructured covariance matrix was employed to estimate correlation coefficients between concentrations of HIV-1 RNA in blood and seminal plasma. The effects of CD4 count and antiretroviral therapy were also examined. We used data sets from three different centres (650 samples from 238 patients) where blood and seminal plasma HIV-1 RNA concentrations were collected from patients; 137 samples from 90 different patients without antiviral therapy and 513 samples from 148 patients receiving therapy were considered for analysis. We found no significant correlation between blood and semen HIV-1 RNA concentration in the absence of antiviral therapy. However, a moderate correlation between blood and semen HIV-1 RNA was observed among subjects with lower CD4 counts receiving therapy. Our findings confirm and extend the idea that the concentrations of HIV-1 in semen often differ from the HIV-1 concentration in blood. Antiretroviral therapy administered to subjects with low CD4 counts results in sufficient concomitant reduction of HIV-1 in blood and semen so as to improve the correlation between these compartments. These results have important implications for studies related to the sexual transmission of HIV, and development of HIV prevention strategies.

47 citations

Book•

From Finite Sample to Asymptotic Methods in Statistics

Pranab Kumar Sen¹, Julio M. Singer², Antonio Carlos Pedroso de Lima²•Institutions (2)

University of North Carolina at Chapel Hill¹, University of São Paulo²

30 Oct 2009

TL;DR: This book covers stochastic processes, asymptotic distributions, categorical data models, regression models, and weak convergence and Gaussian processes.

Abstract: Exact statistical inference may be employed in diverse fields of science and technology. As problems become more complex and sample sizes become larger, mathematical and computational difficulties can arise that require the use of approximate statistical methods. Such methods are justified by asymptotic arguments but are still based on the concepts and principles that underlie exact statistical inference. With this in perspective, this book presents a broad view of exact statistical inference and the development of asymptotic statistical inference, providing a justification for the use of asymptotic methods for large samples. Methodological results are developed on a concrete and yet rigorous mathematical level and are applied to a variety of problems that include categorical data, regression, and survival analyses. This book is designed as a textbook for advanced undergraduate or beginning graduate students in statistics, biostatistics, or applied statistics but may also be used as a reference for academic researchers.

47 citations

Journal Article•DOI•

Comparison of genomic sequences using the Hamming distance

Hildete Prisco Pinheiro¹, Aluísio Pinheiro¹, Pranab Kumar Sen²•Institutions (2)

State University of Campinas¹, University of North Carolina at Chapel Hill²

01 Mar 2005-Journal of Statistical Planning and Inference

TL;DR: In this paper, the problem of homogeneity among groups is addressed by comparison of genomic sequences. A one-sided hypothesis test is considered, and the classical ANOVA decomposition can be directly adapted to sample measures based on the Hamming distance, without necessarily going through their second moments.

45 citations

Cited by

Journal Article•DOI•

Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach.

Elizabeth R. DeLong¹, David M. DeLong¹, Daniel L. Clarke-Pearson•Institutions (1)

Quintiles¹

01 Sep 1988-Biometrics

TL;DR: A nonparametric approach to the analysis of areas under correlated ROC curves is presented, by using the theory on generalized U-statistics to generate an estimated covariance matrix.

Abstract: Methods of evaluating and comparing the performance of diagnostic tests are of increasing importance as new tests are developed and marketed. When a test is based on an observed variable that lies on a continuous or graded scale, an assessment of the overall value of the test can be made through the use of a receiver operating characteristic (ROC) curve. The curve is constructed by varying the cutpoint used to determine which values of the observed variable will be considered abnormal and then plotting the resulting sensitivities against the corresponding false positive rates. When two or more empirical curves are constructed based on tests performed on the same individuals, statistical analysis on differences between curves must take into account the correlated nature of the data. This paper presents a nonparametric approach to the analysis of areas under correlated ROC curves, by using the theory on generalized U-statistics to generate an estimated covariance matrix.

16,496 citations
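
The curve construction described in this abstract can be sketched in a few lines. The following is a minimal illustration only, assuming that larger values of the observed variable are the abnormal ones; the function names `roc_points`, `trapezoid_area`, and `empirical_auc` are hypothetical and do not come from the paper. It covers the area estimate (the trapezoidal area under the empirical curve, which coincides with the Mann-Whitney statistic) and does not attempt the estimated covariance matrix for correlated curves.

```python
def roc_points(diseased, healthy):
    """Return (false positive rate, sensitivity) pairs, one per cutpoint.

    Assumption: a value is called abnormal when it is >= the cutpoint.
    """
    cutpoints = sorted(set(diseased) | set(healthy), reverse=True)
    points = [(0.0, 0.0)]  # cutpoint above every observation
    for c in cutpoints:
        sensitivity = sum(x >= c for x in diseased) / len(diseased)
        false_positive_rate = sum(y >= c for y in healthy) / len(healthy)
        points.append((false_positive_rate, sensitivity))
    return points


def trapezoid_area(points):
    """Area under a piecewise-linear curve given as (x, y) points sorted by x."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


def empirical_auc(diseased, healthy):
    """Mann-Whitney estimate: share of (diseased, healthy) pairs ranked
    correctly, with ties counted as one half."""
    total = 0.0
    for x in diseased:
        for y in healthy:
            if x > y:
                total += 1.0
            elif x == y:
                total += 0.5
    return total / (len(diseased) * len(healthy))


# Made-up scores, for illustration only.
diseased_scores = [3, 4, 5, 5]
healthy_scores = [1, 2, 3, 5]
auc_from_curve = trapezoid_area(roc_points(diseased_scores, healthy_scores))
auc_from_pairs = empirical_auc(diseased_scores, healthy_scores)
```

Both routes give the same area for these scores, which is the link to U-statistics that the abstract relies on.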

Journal Article•

The Design and Analysis of Experiments

Margaret J. Robertson

01 Jun 1953-Yale Journal of Biology and Medicine

TL;DR: This book by a teacher of statistics (as well as a consultant for "experimenters") is a comprehensive study of the philosophical background for the statistical design of experiment.

Abstract: THE DESIGN AND ANALYSIS OF EXPERIMENTS. By Oscar Kempthorne. New York, John Wiley and Sons, Inc., 1952. 631 pp. $8.50. This book by a teacher of statistics (as well as a consultant for "experimenters") is a comprehensive study of the philosophical background for the statistical design of experiment. It is necessary to have some facility with algebraic notation and manipulation to be able to use the volume intelligently. The problems are presented from the theoretical point of view, without such practical examples as would be helpful for those not acquainted with mathematics. The mathematical justification for the techniques is given. As a somewhat advanced treatment of the design and analysis of experiments, this volume will be interesting and helpful for many who approach statistics theoretically as well as practically. With emphasis on the "why," and with description given broadly, the author relates the subject matter to the general theory of statistics and to the general problem of experimental inference. MARGARET J. ROBERTSON

13,333 citations

Book•

Experimental Design and Data Analysis for Biologists

Gerry P. Quinn¹, Michael J. Keough²•Institutions (2)

Monash University¹, University of Melbourne²

21 Mar 2002

TL;DR: An essential textbook for any student or researcher in biology needing to design experiments, sample programs or analyse the resulting data, covering both classical and Bayesian philosophies before advancing to the analysis of linear and generalized linear models. Topics covered include linear and logistic regression, simple and complex ANOVA models (for factorial, nested, block, split-plot and repeated measures and covariance designs), and log-linear models. Multivariate techniques, including classification and ordination, are then introduced.

Abstract: An essential textbook for any student or researcher in biology needing to design experiments, sample programs or analyse the resulting data. The text begins with a revision of estimation and hypothesis testing methods, covering both classical and Bayesian philosophies, before advancing to the analysis of linear and generalized linear models. Topics covered include linear and logistic regression, simple and complex ANOVA models (for factorial, nested, block, split-plot and repeated measures and covariance designs), and log-linear models. Multivariate techniques, including classification and ordination, are then introduced. Special emphasis is placed on checking assumptions, exploratory data analysis and presentation of results. The main analyses are illustrated with many examples from published papers and there is an extensive reference list to both the statistical and biological literature. The book is supported by a website that provides all data sets, questions for each chapter and links to software.

9,509 citations

Journal Article•DOI•

The control of the false discovery rate in multiple testing under dependency

Yoav Benjamini, Daniel Yekutieli

01 Aug 2001-Annals of Statistics

TL;DR: In this paper, it was shown that a simple FDR controlling procedure for independent test statistics can also control the false discovery rate when test statistics have positive regression dependency on each of the test statistics corresponding to the true null hypotheses.

Abstract: Benjamini and Hochberg suggest that the false discovery rate may be the appropriate error rate to control in many applied multiple testing problems. A simple procedure was given there as an FDR controlling procedure for independent test statistics and was shown to be much more powerful than comparable procedures which control the traditional familywise error rate. We prove that this same procedure also controls the false discovery rate when the test statistics have positive regression dependency on each of the test statistics corresponding to the true null hypotheses. This condition for positive dependency is general enough to cover many problems of practical interest, including the comparisons of many treatments with a single control, multivariate normal test statistics with positive correlation matrix and multivariate $t$. Furthermore, the test statistics may be discrete, and the tested hypotheses composite without posing special difficulties. For all other forms of dependency, a simple conservative modification of the procedure controls the false discovery rate. Thus the range of problems for which a procedure with proven FDR control can be offered is greatly increased.

9,335 citations
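
The abstract above refers to a simple step-up procedure and a conservative modification without spelling out either. The sketch below fills that in from the standard textbook statement of these procedures, not from the abstract itself: reject the hypotheses with the k smallest p-values, where k is the largest rank with p_(k) <= k q / m, and for arbitrary dependency divide q by the harmonic sum 1 + 1/2 + ... + 1/m. The function name `fdr_reject` and its parameters are hypothetical.

```python
def fdr_reject(p_values, q=0.05, arbitrary_dependency=False):
    """Step-up FDR procedure; returns one rejection flag per hypothesis.

    With arbitrary_dependency=True the level q is divided by
    sum(1/i for i = 1..m), the conservative modification.
    """
    m = len(p_values)
    if m == 0:
        return []
    level = q
    if arbitrary_dependency:
        level = q / sum(1.0 / i for i in range(1, m + 1))
    # Indices of the hypotheses, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    largest_rank = 0
    for rank, index in enumerate(order, start=1):
        if p_values[index] <= rank * level / m:
            largest_rank = rank  # keep the largest rank that passes
    rejected = [False] * m
    for index in order[:largest_rank]:
        rejected[index] = True
    return rejected


# Made-up p-values, for illustration only.
example_p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205]
under_positive_dependency = fdr_reject(example_p, q=0.05)
under_any_dependency = fdr_reject(example_p, q=0.05, arbitrary_dependency=True)
```

On these made-up values the modified procedure rejects fewer hypotheses than the unmodified one, which is the price paid for dropping the dependency assumption.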

Journal Article•DOI•

Estimates of the Regression Coefficient Based on Kendall's Tau

Pranab Kumar Sen¹•Institutions (1)

University of North Carolina at Chapel Hill¹

01 Dec 1968-Journal of the American Statistical Association

TL;DR: In this article, a simple and robust estimator of the regression coefficient β based on Kendall's rank correlation tau is studied, where the point estimator is the median of the set of slopes (Yj - Yi)/(tj - ti) joining pairs of points with ti ≠ tj.

Abstract: The least squares estimator of a regression coefficient β is vulnerable to gross errors and the associated confidence interval is, in addition, sensitive to non-normality of the parent distribution. In this paper, a simple and robust (point as well as interval) estimator of β based on Kendall's [6] rank correlation tau is studied. The point estimator is the median of the set of slopes (Yj - Yi)/(tj - ti) joining pairs of points with ti ≠ tj, and is unbiased. The confidence interval is also determined by two order statistics of this set of slopes. Various properties of these estimators are studied and compared with those of the least squares and some other nonparametric estimators.

8,409 citations
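
The point estimator described in this abstract is short enough to write out directly. The following is a minimal sketch of the median-of-pairwise-slopes idea only; the function name `sen_slope` is hypothetical, the data are made up, and the confidence interval based on order statistics of the slopes is not attempted.

```python
from itertools import combinations
from statistics import median


def sen_slope(t, y):
    """Median of the slopes (y_j - y_i) / (t_j - t_i) over all pairs
    of points with t_i != t_j."""
    if len(t) != len(y):
        raise ValueError("t and y must have the same length")
    slopes = [
        (y[j] - y[i]) / (t[j] - t[i])
        for i, j in combinations(range(len(t)), 2)
        if t[i] != t[j]  # pairs with equal t have no defined slope
    ]
    if not slopes:
        raise ValueError("need at least two distinct t values")
    return median(slopes)


# Made-up data: roughly y = 2t, with one gross error in the last point.
t_values = [1, 2, 3, 4, 5]
y_values = [2.0, 4.1, 5.9, 8.0, 100.0]
robust_slope = sen_slope(t_values, y_values)
```

The gross error in the last observation affects only four of the ten pairwise slopes, so the median stays near 2, whereas a least squares fit would be pulled strongly upward.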
