Abstract
The official data for the time evolution of active cases of COVID-19 pandemics around the world are available online. For all countries, a peak has been either observed (China and South Korea) or is expected in the near future. The approximate dates and heights of those peaks have important epidemiological implications. Inspired by similar complex behavior of volumes of transactions of stocks at the NYSE and NASDAQ, we propose a q-statistical functional form that appears to describe satisfactorily the available data for all countries. Consistently, predictions of the dates and heights of those peaks in severely affected countries become possible unless efficient treatments or vaccines, or sensible modifications of the adopted epidemiological strategies, emerge.
It is possible to predict the thermostatistical properties of uncountable physical systems at thermal equilibrium through the one-body distribution p(ϵ) ∝ ω(ϵ)e−βϵ, where ω(ϵ) is the density of states as a function of the energy ϵ, multiplied by the celebrated Boltzmann factor e−βϵ, β being the inverse temperature. The function ω(ϵ) comes from mechanical considerations (classical, quantum, relativistic) related with the number of degrees of freedom and does not depend on the temperature; the exponential weight comes instead from standard statistical-mechanical considerations. In many cases it is, either exactly or approximatively, w(ϵ) ∝ ϵα (α ∈ R). For the thermostatistical properties of the stationary- or quasi-stationary-state of wide classes of complex systems, the Boltzmann factor is to be generalized into the q-exponential factor [1–3]. This procedure yielded quite satisfactory results for high-frequency stock-markets, such as the NYSE, NASDAQ, and others [4–6].
Let us focus now on the data available for the COVID-19 pandemics. Soon after the beginning of the pandemics, several studies analyzing the available data and employing different models and candidate functions started to appear in the literature [7–12]. Most of them are interested in the behavior of total cases and fatality curves. We will concentrate here on the analysis of the active cases and deaths per day. Inspection of the public data1 (updated on a daily basis) and, in particular, of the time evolution of the number N of active cases (surely a lower bound of the unknown actual numbers) showed a rather intriguing similarity with the distributions of volumes of stocks. Along this line, we adopt the following functional form for each country or region:
with C > 0, α > 0, β > 0, γ > 0, q > 1, and t0 ≥ 0. The constant t0 indicates the first day of appearance of the epidemic in that particular region; it is conventionally chosen to be zero for China; for the other countries, it is the number of days elapsed between the appearance of the first case in China and the first case in that country. The normalizing constant C reflects the total population of that particular country. For α = 0, if γ = 1, we recover the standard q-exponential expression; if γ = 2, it is currently referred to in the literature as q-Gaussian; for other values of γ, it is referred to as stretched q-exponential. Through the inspection of the roles played by the four non-trivial parameters, namely (α, β, γ, q), it became clear that (α, β) depend strongly on the epidemiological strategy implemented in that region in addition to the biological behavior of the coronavirus in that geographical climate. In contrast, the parameters (γ, q) appear to be more universal, mainly depending on the coronavirus. Therefore, we investigate several countries that have not reached their peaks yet, with the basic assumption that these two parameters would not change much from one country to another, and we fixed these values at the values that we determine for China, since this country has already had nearly the full evolution of the pandemic. This assumption seems to be working. For other countries whose peaks have already been reached, we use the same functional form (1) but adjusting all parameters for a better fit. The results for China and South Korea are given in Figure 1. It is evident that, although the functional form (1) does yield satisfactory results for both China and South Korea, the (γ, q) parameter values differ somewhat for each of these countries. On the other hand, as can easily be seen from Figure 2, our assumption is corroborated by several countries that we have numerically analyzed. In Table 1, we present the forecasted dates and heights of the peaks, as well as the values of the fitting parameters using the data accumulated until April 28, 2020.
Figure 1
Figure 2
Table 1
| γandqparameters are fixed at China's values if peak has not yet been achieved | ||||||||
|---|---|---|---|---|---|---|---|---|
| Country | t0 | C | α | β | γ | q | Prediction interval | Prediction interval |
| of peak height | of peak day | |||||||
| Brazil | 34 | 1.55 × 10−5 | 5.58 | 4.00 × 10−6 | 3 | 1.26 | [145112, 159580] | [132, 134] |
| 3.30 × 10−6 | 3.037 | 1.28 | [0.00069,0.00076] | [1 June 1, June 3] | ||||
| Turkey | 48 | 8.0 × 10−3 | 4.625 | 1.52 × 10−5 | 3.025 | 1.02 | Achieved at | Achieved on day 93 |
| 80808 | April 23 | |||||||
| Italy | 24 | 4.0 × 10−3 | 4.68 | 1.10 × 10−5 | 3.083 | 1.43 | Achieved at | Achieved on day 96 |
| 106100 | April 26 | |||||||
| USA | 24 | 3.4 × 10−9 | 8.60 | 1.65 × 10−5 | 3 | 1.26 | [1061270, 1178556] | [119, 126] |
| 1.46 × 10−5 | 3.037 | 1.28 | [0.0032,0.0036] | [May 19, May 26] | ||||
| France | 24 | 1.6 × 10−6 | 6.56 | 9.20 × 10−6 | 3 | 1.26 | Achieved at | Achieved on day 98 |
| 1.19 × 10−5 | 3.037 | 1.28 | 95365 | April 28 | ||||
| UK | 24 | 1.6 × 10−8 | 7.50 | 8.50 × 10−6 | 3 | 1.26 | [182659, 197775] | [118, 121] |
| 8.00 × 10−6 | 3.037 | 1.28 | [0.0027,0.0030] | [May 18, May 21] | ||||
The values of the parameters and predictions for the maximum number of active cases and for the day at which the maximum will be achieved.
The second line in the column for the prediction interval of the peak height of active cases shows the same values divided by the total population of the country.
We also test our formula (1) and assumption for the evolution of deaths per day. Again, we fixed the (γ, q) parameter values at China's results, which can be seen in Figure 3, and tried to fit the data of the same six countries. It is quite surprising to see that the results given in Figure 4 seem to suggest that it is also possible to fit the evolution of deaths per day without changing these two parameters.
Figure 3
Figure 4
We may summarize as follows. The death curve of South Korea is atypical in the sense that it sensibly differs from all of those that we analyzed in the present work. Because of that, we have not included it here. We remind the reader that the values for (γ, q) used for the South Korea evolution curve of active cases also differ from those used for all the other countries, which reinforces the fact that some sort of exceptionality exists there for reasons that are unknown to us. For all countries that have not reached their peak values yet, we have adopted the values of (γ, q) obtained from the inspection of the entire curve of China. In all cases, we have dismissed the form of the short initial transient after the appearance of the first active case. The extrapolation procedure is tested in various countries, as indicated in Figure 5. We have indicated, for four typical countries, how the predicted day and height of the peak evolves with time while gradually incorporating the newly available data (which not only add recent information but also modify old public information, even the day of first appearance of a Covid-19 case in a particular country). As we can see, the peak date is more robustly predicted than the peak height. Indeed, 1 and even 2 months before, it has been possible to correctly predict the date within the span of 1 week. The prediction for the height is more sensitive to the new information and can easily fluctuate between simple and double depending on the country and its pandemic health strategy (or lack of it), in particular as concerns population mobility.
Figure 5
We verify that the present work appears to belong to the realm of complex systems, which includes not only, as mentioned above, high-frequency financial transactions (with α > 0) [4, 5] but also anthropological issues, such as medieval trading networks and biotech intercorporate networks (with α < 0) [13], and relaxation in spin-glasses [14], as well as q-Weibull distribution-like systems [15–18], which correspond to the particular case α = γ − 1. It remains open as a highly desirable goal to formulate a model which, along lines somewhat similar to epidemiological models, such as the SIR one, would predict an evolution curve, such as the present Equation (1). To be more precise, a great variety of SIR-like epidemiological models have been proposed in the literature. They typically yield an increase before the peak quite that is similar to the decrease after the peak. The behavior in our present Equation (1) is at variance with such characteristics, since it provides a power-law increase (with a typically large positive exponent α as illustrated in the table) and a quite different power-law decrease (with negative exponent given by [α − γ/(q − 1)], the absolute value of which can independently be either larger or smaller than α) after the peak. We emphasize that such increase-decrease quantitative behaviors appear to satisfactorily conform to reality, in contrast with the logistic-like growth behavior typical of most SIR-like models. An important issue remains to be clarified, namely the conditions under which the values of (γ, q) could indeed be (strictly or nearly) universal and essentially determined by the biology of the infecting agent, such as the present COVID-19. Let us also mention that, quite obviously, there is nearly everywhere a severe under-notification of the publicly available data for active cases (and even deaths). The real number could easily be 10 times larger, depending on the particular region. However, the consequences of this lack of important information onto the real number of deaths are somewhat mitigated by making use of the case fatality rate, which is relatively stable throughout recent weeks for a given country and can be found at websites, such as2,3. Finally, the present prediction algorithm could, in principle, be included within an internet app, which could access the data publicly available at a given website and automatically update the predicted dates and heights of the disease peaks of epidemics, such as the present Covid-19 one. Any initiative along these lines would be highly welcome.
Statements
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found here: https://www.worldometers.info/coronavirus/#countries.
Author contributions
All authors listed have made a substantial, direct and intellectual contribution to the work, and approved it for publication.
Acknowledgments
We have benefitted from useful remarks by E. P. Borges, R. Dickman and P. M. C. Oliveira as well as from partial support by CNPq and Faperj (Brazilian Agencies).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Footnotes
1.^Available online at: https://www.worldometers.info/coronavirus/#countries (accessed May 08, 2020).
2.^Available online at: https://www.cebm.net/covid-19/global-covid-19-case-fatality-rates/ (accessed May 08, 2020).
3.^Available online at: https://coronavirus.jhu.edu/data/mortality (accessed May 08, 2020).
References
1.
TsallisC. Possible generalization of Boltzmann-Gibbs statistics. J Stat Phys. (1988) 52:479. 10.1007/BF01016429
2.
Gell-MannMTsallisC. Nonextensive Entropy–Interdisciplinary Applications. New York, NY: Oxford University Press (2004).
3.
TsallisC. Introduction to Nonextensive Statistical Mechanics–Approaching a Complex World. New York, NY: Springer (2009).
4.
TsallisCAnteneodoCBorlandLOsorioR. Nonextensive statistical mechanics and economics. Phys A. (2003) 324:89. 10.1016/S0378-4371(03)00042-6
5.
OsorioRBorlandLTsallisC. Distributions of high-frequency stock-market observables. In: Gell-MannMTsallisC, editors. Nonextensive Entropy–Interdisciplinary Applications. New York, NY: Oxford University Press (2004). p. 321.
6.
RuizGde MarcosAF. Evidence for criticality in financial data. Eur Phys J B. (2018) 91:1. 10.1140/epjb/e2017-80535-3
7.
ZlaticVBarjašićIKadovićAŠtefančićHGabrielliA. Bi-stability of SUDR+K model of epidemics and test kits applied to COVID-19. (2020) arXiv [Preprint]. arXiv:2003.08479.
8.
VasconcelosGLMacêdoAMSOspinaRAlmeidaFAGDuarte-FilhoGCBrumAAet al. Modelling fatality curves of COVID-19 and the effectiveness of intervention strategies. medRxiv. (2020). 10.1101/2020.04.02.20051557
9.
PiccolominiELZamaF. Preliminary analysis of COVID-19 spread in Italy with an adaptive SEIRD model. (2020) arXiv [Preprint]. arXiv:2003.09909.
10.
PercMGorišekMiksić NSlavinecMStožerA. Forecasting COVID-19. Front Phys. (2020) 8:127. 10.3389/fphy.2020.00127
11.
PluchinoABiondoAEGiuffridaNInturriGLatoraVLe MoliRet al. A novel methodology for epidemic risk assessment: the case of COVID-19 outbreak in Italy. (2020) arXiv [Preprint]. arXiv:2004.02739.
12.
BastosSBCajueiroDO. Modeling and forecasting the early evolution of the Covid-19 pandemic in Brazil. (2020) arXiv [Preprint]. arXiv:2003.14288v2.
13.
WhiteDRKejzarNTsallisCFarmerDWhiteS. A generative model for feedback network. Phys Rev E. (2006) 73:016119. 10.1103/PhysRevE.73.016119
14.
PickupRMCywinskiRPappasCFaragoBFouquetP. Generalized spin glass relaxation. Phys Rev Lett. (2009) 102:097202. 10.1103/PhysRevLett.102.097202
15.
PicoliSJrMendesRSMalacarneLC. q-exponential, Weibull, and q-Weibull distributions: an empirical analysis. Phys A. (2003) 324:678. 10.1016/S0378-4371(03)00071-2
16.
AssisEMBorgesEPVieira de MeloSAB. Generalized q-Weibull model and the bathtub curve. Int J Qual Reliab Manage. (2013) 30:720. 10.1108/IJQRM-Oct-2011-0143
17.
PodestaTSVRosembachTVDos SantosAAMartinsML. Anomalous diffusion and q-Weibull velocity distributions in epithelial cell migration. PLoS ONE. (2017) 12:e0180777. 10.1371/journal.pone.0180777
18.
XuM. Using the q-Weibull Distribution for Reliability Engineering Modeling and Applications. College Park, MD: University of Maryland (2019).
Summary
Keywords
COVID-19, pandemics, complex systems, non-extensive statistical mechanics, epidemiology
Citation
Tsallis C and Tirnakli U (2020) Predicting COVID-19 Peaks Around the World. Front. Phys. 8:217. doi: 10.3389/fphy.2020.00217
Received
24 April 2020
Accepted
20 May 2020
Published
29 May 2020
Volume
8 - 2020
Edited by
Matjaž Perc, University of Maribor, Slovenia
Reviewed by
Vinko Zlatic, Rudjer Boskovic Institute, Croatia; Luiz G. A. Alves, Northwestern University, United States
Updates
Copyright
© 2020 Tsallis and Tirnakli.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ugur Tirnakli ugur.tirnakli@ege.edu.tr
This article was submitted to Social Physics, a section of the journal Frontiers in Physics
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.