CoViD--19: An Automatic, Semiparametric Estimation Method for the Population Infected in Italy
Summary (2 min read)
1. Introduction
- Cases of COVID-19 break out in Italy where it is first attested a capillary spread of this disease in the European continent after the Asian one: the scenario that is developing in these days is creating an example that unfortunately will certainly be repeated in other states all over the world.
- This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice.
- The presented procedure is designed to overcome these problems.
2. The proposed method
- While the former does not pose problems in terms of DOF, the latter clearly does.
- DOF–saving strategy is also the driving force of the choice not to consider as an exogenous parameter the georeferencing of Regions or to include the regional population in a regression–like scheme but to implicitly assumed these variable embedded in the dynamic of the time series in question.
3. Data and contageon indicator
- The paper makes use of official data published by Italian Authorities, on the following two variables of interest 1. number of deaths from CoViD–19 (denoted by the Latin letter M) 2. number of currently positive cases recorded after the administration of the test (denoted by the Latin letter C).
- The total number of Italian regions considered is 20.
- Two different subsets are built from Ω i.e.
- Ω, w is the ratio between current positive cases (C) and number of deaths (M) (2), τ the average doubling time for the CoViD–19 (i.e. the average span of time needed for the virus to double the cases) and δ the average time for an infected person to die.
- These two constant terms have been kept fixed as estimated according the data so far available worldwide (see Pueyo (2020)).
4. The Resampling Method
- The bootstrap scheme adopted proved to be a real asset for the problem at hand.
- Given the pivotal role played it will be briefly presented.
- In essence, the choice of the most appropriate resampling method is far from being an easy task, especially when the identical and independent distribution iid assumption (Efron’s initial bootstrap method) is violated.
- This is especially true under the actual conditions (small sample sizes).
- Technically, MEB algorithm can be broken down, following Koutris et al. (2008), in 8 steps.
5. The application of the maximum entropy bootstrap
- Once the data become available, one has just to divide them according to the subsets Ω i.e. Ω Page 5 of 14 L. Fenga COVID-19 Estimation and the code will process the new data in an automatic way.
- The procedure is also very fast as the computing time needed for the generation of the bootstrap samples requires less than 2 minutes.
- Both code and data used for this Paper are freely made available for any researcher who would consider using it.
6. Empiricical evidences
- Note that sudden variations are due to the little number of test administrated (denominator of the variable CT (2)).
- That said, the main result of the paper is summarized by Table 2, where three estimates of the number of infected people are reported by region.
- The regions belonging to the set Ω◦ (i.e. no deaths) are in Italics (all the others belong to the set Ω).
- In the column “Mean” and Lower Bounds the bootstrap estimates computed according to Eqn 5 and 6 and the Lower Bounds the lower bootstrap CIs are respectively reported.
- The column denominated “Official Cases” accounts for the number of official cases released by the Italian Authorities whereas the column “Morbidity” expresses the percentage ratio between µ (5) or µ◦ (6) and the actual population of each region.
7. Conclusions
- It is widespread opinion in the scientific community that current official data on the diffusion of SARS-CoV-2, responsible of the correlated disease, COIVD-19,among population, are likely to suffer from a strong downward bias.
- The aim of this instant paper is twofold: fist, it can compute realistic figures on the effective number of people infected with SARS-CoV-2 in Italy; Page 6 of 14 L. Fenga COVID-19 Estimation second, it can provide a methodology, which improves current state of art and can be used to compute similar figures in other countries.
- The entire procedure has been written in the programming language R and uses official data as published by the Italian Government.
- To overcome the crisis, international solidarity together wit, strong and coordinated actions among countries will be crucial.
- Stay at home, and, if you can, do research on this topic, every contribution could be crucial.
Did you find this useful? Give us your feedback
Citations
42 citations
37 citations
19 citations
16 citations
15 citations
Additional excerpts
...Finally we ave dI dt = r 0 I(1 − I K ) + A t = March 23 , (5)...
[...]
References
733 citations
"CoViD--19: An Automatic, Semiparame..." refers methods in this paper
...Under dependence structures embedded in the data, simple sampling with replacement has been proved -see, for example Carlstein et al....
[...]
321 citations
"CoViD--19: An Automatic, Semiparame..." refers methods in this paper
...For each time series y • t and y • t the bootstrap procedure is applied so that B= 100 "bona fide" replications are available, i.e.ỹ The explanation of the T-percentile method goes beyond the scope of this paper, therefore the interested reader is referred to the excellent paper by Berkowitz and Kilian (2000) ....
[...]
...The explanation of the T–percentile method goes beyond the scope of this paper, therefore the interested reader is referred to the excellent paper by Berkowitz and Kilian (2000)....
[...]
306 citations
"CoViD--19: An Automatic, Semiparame..." refers methods in this paper
...Briefly, the series of interest are considered a realization of an ARMA (Autoregressive Moving Average) model (see, e.g. Makridakis and Hibon (1997)) so that, each of them can be expressed as an autoregressive model of infinite order, i.e. AR(∞) whose infinite sequence of AR parameters is α1, α2, .…...
[...]
...Briefly, the series of interest are considered a realization of an ARMA (Autoregressive Moving Average) model (see, e.g. Makridakis and Hibon (1997) ) so that, each of them can be expressed as an autoregressive model of infinite order, i.e. AR(∞) whose infinite sequence of AR parameters is α 1 , α 2 , ....
[...]
269 citations
"CoViD--19: An Automatic, Semiparame..." refers methods in this paper
...In addition to that, a distance measure -based on the theory of stochastic processes and proposed by Piccolo (1990) -has been employed to guarantee statistical coherence among all the Italian regions....
[...]
...In addition to that, a distance measure – based on the theory of stochastic processes and proposed by Piccolo (1990) – has been employed to guarantee statistical coherence among all the Italian regions....
[...]
172 citations
"CoViD--19: An Automatic, Semiparame..." refers methods in this paper
...As it will be detailed in the sequel, in order to reduce the impact of biasing components on the parameter estimations, a recent bootstrap scheme, called Maximum Entropy Bootstrap and proposed by Vinod et al....
[...]