A comparative study on the three calculation methods for reproduction numbers of COVID-19

Abudunaibi, Buasiyamu; Liu, Weikang; Guo, Zhinan; Zhao, Zeyu; Rui, Jia; Song, Wentao; Wang, Yao; Chen, Qiuping; Frutos, Roger; Su, Chenghao; Chen, Tianmu

doi:10.3389/fmed.2022.1079842

ORIGINAL RESEARCH article

Front. Med. , 05 January 2023

Sec. Infectious Diseases: Pathogenesis and Therapy

Volume 9 - 2022 | https://doi.org/10.3389/fmed.2022.1079842

This article is part of the Research Topic COVID-19: Integrating Artificial Intelligence, Data Science, Mathematics, Medicine and Public Health, Epidemiology, Neuroscience, Neurorobotics, and Biomedical Science in Pandemic Management, volume II View all 92 articles

A comparative study on the three calculation methods for reproduction numbers of COVID-19

$\nBuasiyamu Abudunaibi&#x;$ Buasiyamu Abudunaibi¹^†

Weikang Liu¹^†

Zhinan Guo²^†

Zeyu Zhao^1,3

Jia Rui^1,3

Wentao Song¹

Yao Wang¹

Qiuping Chen³

Roger Frutos³

Chenghao Su⁴^*

Tianmu Chen¹^*

¹State Key Laboratory of Molecular Vaccinology and Molecular Diagnostics, School of Public Health, Xiamen University, Xiamen, Fujian, China
²Xiamen Center for Disease Control and Prevention, Xiamen, Fujian, China
³Cirad, UMR 17, Intertryp, Université de Montpellier, Montpellier, France
⁴Zhongshan Hospital, Fudan University (Xiamen Branch), Xiamen, Fujian, China

Objective: This study uses four COVID-19 outbreaks as examples to calculate and compare merits and demerits, as well as applicational scenarios, of three methods for calculating reproduction numbers.

Method: The epidemiological characteristics of the COVID-19 outbreaks are described. Through the definition method, the next-generation matrix-based method, and the epidemic curve and serial interval (SI)-based method, corresponding reproduction numbers were obtained and compared.

Results: Reproduction numbers (R_eff), obtained by the definition method of the four regions, are 1.20, 1.14, 1.66, and 1.12. Through the next generation matrix method, in region H R_eff = 4.30, 0.44; region P R_eff = 6.5, 1.39, 0; region X R_eff = 6.82, 1.39, 0; and region Z R_eff = 2.99, 0.65. Time-varying reproduction numbers (R_t), which are attained by SI of onset dates, are decreasing with time. Region H reached its highest R_t = 2.8 on July 29 and decreased to R_t < 1 after August 4; region P reached its highest R_t = 5.8 on September 9 and dropped to R_t < 1 by September 14; region X had a fluctuation in the R_t and R_t < 1 after September 22; R_t in region Z reached a maximum of 1.8 on September 15 and decreased continuously to R_t < 1 on September 19.

Conclusion: The reproduction number obtained by the definition method is optimal in the early stage of epidemics with a small number of cases that have clear transmission chains to predict the trend of epidemics accurately. The effective reproduction number R_eff, calculated by the next generation matrix, could assess the scale of the epidemic and be used to evaluate the effectiveness of prevention and control measures used in epidemics with a large number of cases. Time-varying reproduction number R_t, obtained via epidemic curve and SI, can give a clear picture of the change in transmissibility over time, but the conditions of use are more rigorous, requiring a greater sample size and clear transmission chains to perform the calculation. The rational use of the three methods for reproduction numbers plays a role in the further study of the transmissibility of COVID-19.

Introduction

Ever since the first confirmed case of COVID-19 was reported in December 2019, there have been more than 500 million infections around the world, with a total of over 6 million deaths (1, 2). This pandemic has been a great challenge for not only people's health and the global health-care system but also for the socio-economy. In the last 2 years, multiple mutant variants of SARS-CoV-2 have emerged, which affects the transmissibility and severity of the virus greatly (3, 4). In China, due to the implementation of non-pharmaceutical (NPIs, such as quarantine, nucleic-acid testing, and social distancing) and pharmaceutical interventions (PIs, such as medication as well as vaccination) during the prevention and control of COVID-19, there were some satisfying results (1, 5–7). At the current stage of the pandemic, applying the appropriate quantitative index to describe the transmission dynamics of the disease and evaluating its transmissibility to propose corresponding controlling strategies have been areas of great interest for researchers.

In general, researchers use the secondary attack rate (SAR), the number of susceptible contacts who develop the disease as a percentage of the total number of susceptible contacts between the minimum incubation period and the maximum incubation period of certain infectious diseases, or reproduction numbers (R), the number of cases of second-generation infection caused by an infected individual with an infectious disease in a fully susceptible population without any intervention, to illustrate the transmissibility of the disease. As the modeling studies for infectious diseases are more sophisticated, the application of reproduction numbers has become more compelling, among which, basic reproduction number (R₀), effective reproduction number (R_eff), and time-varying reproduction number (R_t) are commonly used (8). At the beginning of 2020, researchers calculated R₀ of COVID-19 in Wuhan, China with the definition method, to demonstrate the transmissibility of this emerging infectious disease (EID) for the first time (9). Recently, there have been some studies published about obtaining R_eff and R_t by applying the next-generation matrix and serial interval (SI) (4, 10, 11). Simultaneously, there are studies about R_eff of COVID-19 by establishing transmission dynamics models, such as SEIAR models (1, 12, 13). Now that reproduction numbers are playing an important role in predicting and preventing infectious diseases, especially in COVID-19, it is of significance that we use the optimum reproduction number in an outbreak scenario with appropriate calculation methods.

In most cases, the reproduction numbers are obtained from definition methods, next-generation matrices, or using SI and epidemic curves (14–17). However, there are not enough studies about the optimum scenarios for the above calculation methods or they are misused in some circumstances. Therefore, in this study, we used data from four outbreaks occurring in China during 2021–2022, which were denoted as regions H, P, X, and Z, for assessing the transmissibility of COVID-19, and used reproduction numbers, which are obtained by the definition method, the next-generation matrix-based method, and SI and epidemic curve-based method, to be the indicators for COVID-19 transmissibility (1, 18–20). The epidemiological significance and application considerations of the reproduction numbers calculated using these three ways were also compared to provide a reference for public health departments to use more accurate reproduction numbers to formulate prevention and control measures and quantitatively evaluate the effects of various interventions.

Methods

Study design

There are three major steps involved in this study (Figure 1). First, we collected data and methods for calculating reproduction numbers. Then, we described the epidemiological characteristics of the outbreaks in four regions, H, P, X, and Z. Finally, we obtained reproduction numbers of COVID-19 in those regions through three different methods, which are the definition method, the method based on the next-generation matrix, and method based on SI and epidemic curve, to evaluate transmissibility of the diseases under various circumstances.

FIGURE 1

Figure 1. The framework of this study.

Data collection

In this study, data were collected on four COVID-19 outbreaks in China from corresponding studies and open accesses, which comprised of transmission caused by the Delta variant in region H (22 July−15 August 2021) (19), region P (4–23 September 2021) and region X (8–30 September 2021) (1, 20), and the Omicron variant in region Z (13–21 January 2022) (18). The data collected included basic information about the patients (age, sex, occupation, address, whether vaccinated, and doses), clinical typing (asymptomatic infection, minor, ordinary, heavy, and severe), key time points of transmission (date of exposure, date of onset, date of the positive test, and date of confirmation), and transmission chains between cases. Population information for each site was obtained from the 7th National Census data in 2020.

Calculation of COVID-19 reproduction number

The definition method

Reproduction number, R, is defined as the average number of infections acquired by an infected person during an infected period in a susceptible population. Thus, the definition method means that the transmission chain of an outbreak is obtained and directly calculates the reproduction number, which is denoted as the basic reproduction number, R₀. Specifically, first, we collect the number of second-generation cases caused by the first-generation cases and divide the number of second-generation cases by the number of first-generation cases to calculate the R₀.

\begin{array}{l} R_{0} = \frac{S u m o f t h e n u m b e r o f s e c o n d - g e n e r a t i o n c a s e s t r a n s m i t t e d p e r f i r s t - g e n e r a t i o n c a s e}{S u m o f t h e n u m b e r o f c a s e s i n f i r s t g e n e r a t i o n} & (1) \end{array}

The next-generation matrix-based method

Model development

According to the natural history of COVID-19 and the epidemiological characteristics of the four regions, we grouped the total population N into Susceptible, S; Exposed, E; Infected, I; Asymptomatic, A; and Recovered/Removed, R. Thereby, a SEIAR (Susceptible—Exposed—Infected—Asymptomatic—Recovered/ Removed) transmission dynamics model was constructed.

The SEIAR model was built under these assumptions:

1) We set the infection coefficient after effective contact between a susceptible person S and an infection with symptoms I as β, also assuming that the transmissibility of an asymptomatic infection A is κ times that of a symptomatic infection I (0 < κ < 1), then the number of new infections at time t is βS(I + κA).

2) At time t, there would be two results for exposed population E, they either become symptomatically infected I or asymptomatically infected A. Assume that a proportion p of E is converted to A, and the proportion of E to I is (1–p). It is generally understood that after a person is exposed to pathogens, there is a time interval between when he/she becomes invaded by the pathogen and when it is emitted, known as the latent period. The rate of transformation from E to A is proportional to the amount of E with a scale factor of pω′E, and ω′ is the latent period coefficient.

3) At time t, the number of transfers to the recovering population R is γI if the time interval between onset and diagnosis from a symptomatic infection I is γ; the number of transfers to R from A, identified as an asymptomatic infection, is γ′I.

4) Since there is a possibility of death in symptomatic infections, the mortality rate is taken as f .

The functions of the SEIAR transmission dynamics model are as follows:

\begin{array}{l} {\begin{matrix} \frac{d S}{d t} = - β S (I + κ A) / N \\ \frac{d E}{d t} = - β S (I + κ A) / N - p ω^{'} E - (1 - p) ω E \\ \frac{d I}{d t} = p ω^{'} E - γ I - f I \\ \frac{d A}{d t} = (1 - p) ω E - γ^{'} A \\ \frac{d R}{d t} = γ I + γ^{'} A \\ N = S + E + I + A + R \end{matrix} & (2) \end{array}

Parameter estimation

In this SEIAR model, there are various of parameters need to be estimated before modeling. The infection coefficient β is obtained by fitting the actual data in the model. The incubation period of the Delta variant is 3–7 days (4, 21) and that of the Omicron variant is 2 days (18, 22); therefore, we set the incubation period coefficient for the Delta variant as 1/ω = 0.33 and the Omicron variant as 1/ω = 0.4. As there were not any asymptomatic infections during the outbreaks in regions P, X, and Z, we set the proportion of asymptomatic infections as p = 0. While for region H, where asymptomatic infections were reported, we set its proportion of asymptomatic infections as p = 0.15 (23, 24), and we set the latent period coefficient in region H as 1/ω′ = 0.2. Simultaneously, according to the previous studies which illustrate that the infection coefficient of asymptomatic infections compared to symptomatic infections is 0.7 (21), we set κ = 0.7. It is widely accepted that the disease duration of the Delta variant is ~5 days, while it is 4–5 days for the Omicron variant (22), so we set the recovery rate of symptomatic infections as 1/γ = 0.2 and 1/γ = 0.22, respectively. As for the recovery rate for asymptomatic infections, we set 1/γ′ = 0.1 (24, 25). No region reported a fatal case, so the fatality of the disease is f = 0 (Table 1).

TABLE 1

Table 1. The definition and values of parameters in the SEIAR model of COVID-19.

This method is an indirect way to calculate the reproduction numbers that use a transmission dynamics model, denoted as effective reproduction number R_eff. This refers to the expected number of second-generation cases that can be infected by the first-generation cases with certain effective interventions implemented (1, 11).

The formula for calculating the effective reproduction number R_eff is as follows: Refer to Supplementary material for detailed calculations (11).

\begin{array}{l} R_{e f f} = \frac{β S / N}{p ω^{'} + (1 - p) ω} \times [\frac{κ p ω^{'}}{γ^{'}} + \frac{(1 - p) ω}{γ + f}] & (3) \end{array}

SI and epidemic curve-based method

Through the transmission chains, which are obtained from the collected COVID-19 data, we count the standard deviations and means of serial intervals (SI) of onset dates of outbreaks. Then, we calculate the reproduction number, denoted as R_t, by the EpiEstim package in R software. R_t represents the average number of second-generation infections from first-generation cases per unit time in a susceptible population. Refer to Supplementary material for a detailed calculation.

\begin{array}{l} S I = o n s e t t i m e o f s e c o n d - g e n e r a t i o n c a s e s \\ - o n s e t t i m e o f f i r s t - g e n e r a t i o n c a s e s & (4) \end{array}

Statistical analysis

Data entry and organization related to this study were performed in Excel 2019. Continuous quantitative variables were described by median ± interquartile range (IQR) and categorical qualitative variables by percentages. Statistical analysis was performed using SPSS version 22.0, and differences were statistically significant at p ≤ 0.05. Graphs were plotted using Graph Prism 7.0; transmission chains were plotted using OrignPro version 2022; models were fitted using Berkeley Madonna 8.3.18; differential equations were solved using the fourth-order Runge Kutta method; and model convergence was based on the root least mean square (LRMS), further using the coefficient of determination (R²) to determine the goodness of fit. The R_t was calculated using the EpiEstim package in R software.

Results

Epidemiological characteristics of COVID-19 in the four regions

From 22 July to 15 August 2021, there were 129 COVID-19 cases caused by the Delta variant reported in region H, with the highest number of new cases in a single day being 15 on 1 August. In 2021, from 4 to 23 September, in region P, a total of 209 COVID-19 cases caused by the Delta variant were reported. On 10 September, the highest number of new cases in a single day, 42 cases, was reached. At the same time, there was another outbreak in region X from 8 to 30 September, which was an outflow of region P's outbreak. The COVID-19 outbreak in region Z in 2022 was a consequence of the Omicron variant, which had basically replaced the Delta variant and become the dominant variant throughout the world. This outbreak in region Z started on 13 January and lasted for 8 days, with a total of 38 cases.

The spatial distributions of cases in each of those four outbreaks give clues about how the outbreaks occurred. The outbreak in region H was scattered in seven districts in the region, with 58.9% of cases reported in area A of region H. During the outbreak in region P, 96.2% of total cases were from area A of region P, and in region X's outbreak, 88.1% of cases were found in a factory in area A of region X. In the outbreak in region Z, 97.4% of cases were in area A of region Z (Figure 2).

FIGURE 2

Figure 2. The temporal distribution for COVID-19 outbreaks in four regions studied in this research. The horizontal axis is the dates when the outbreaks happened, and we used different shades of color for the bars in each figure to show the number of cases in different areas in these regions. (A) Region H, (B) Region P, (C) Region X, and (D) Region Z.

As for the population distribution, it shows that there are no significant differences among gender groups during the outbreaks in all four regions which was 58:71 (χ² = 1.310, p = 0.252) in region H, 88:121 (χ² = 3.211, p = 0.22) in region P, 116:120(χ² = 0.068, p = 0.795) in region X, and 12:25 (χ² = 3.789, p = 0.052) in region Z. But there are some differences among age groups. The median ± interquartile range (IQR) of regions H, P, X, and Z are 34 ± (15–48.5), 32 ± (9–46), 39 ± (31–47), and 21.5 ± (4–35), respectively. The differences in the age distribution of disease occurrence in the four regions were found to be statistically significant through rank-sum tests of multiple groups of samples (H = 40.9, p < 0.05). Vaccination also varied by region (χ² = 28.907, p < 0.05). The proportions of the unvaccinated group were 46.5, 45.9, 12.7, and 13.2% in regions H, P, X, and Z, respectively. Disease severity varied in regions as well (χ² = 10.907, p < 0.05). Nevertheless, it is worth noting that most cases reported in regions comprised of individuals with minor or ordinary symptoms, with fewer asymptomatic, heavy, or critical symptoms occurring (Table 2).

TABLE 2

Table 2. Population distribution of COVID-19 outbreaks in four regions.

Calculating COVID-19 reproduction number

The definition method

After clarifying the transmission chains in the four outbreaks (Figure 3), we directly calculated the reproduction numbers of COVID-19 in those four regions. However, there would have been interventions implemented as soon as the authorities discovered a COVID-19 case, so the effective reproduction number R_eff has been used in this case rather than the basic reproduction number R₀. Consequently, the effective reproduction number R_eff in regions H, P, X, and Z is R_eff = 1.20, R_eff = 1.14, R_eff = 1.66, and R_eff = 1.12, respectively. Notably, the reproduction numbers were far less than previous studies on the Delta and Omicron variants, which showed that the reproduction number is R = 5 for the Delta variant and R = 5–7 for the Omicron variant. Reproduction numbers of COVID-19 obtained from the definition method in the four regions were over 1, and although this means there were possibilities that the variants would cause outbreaks or epidemics, the speed of transmission was far lower than a super-spreading scenario. This is significant because the government and other departments would have taken prevention and control measures after detecting the cases for the first time, which ultimately presents a gamma distribution in the transmission chain (Figure 4).

FIGURE 3

Figure 3. The transmission chains for COVID-19 outbreaks in four regions studied in this research. The dots in each figure represent the COVID-19 cases during each outbreak, while the bars between dots mean that these cases were on the same transmission chain. (A) Region H, (B) Region P, (C) Region X, and (D) Region Z.

FIGURE 4

Figure 4. The gamma distribution for the transmission chains for COVID-19 outbreaks in four regions studied in this research. The horizontal axis in each figure shows the number of transmissions caused by a single case, while the vertical axis is the frequency of the transmission number. The bars present the actual frequency in each outbreak, and the red curves are the curves of fitting, which follow a gamma distribution. (A) Region H, (B) Region P, (C) Region X, and (D) Region Z.

The next-generation matrix-based method

We built a SEIAR transmission dynamics model and used the next-generation matrix to calculate the R_eff of COVID-19 in each of the four regions (Figure 5). The model fitted well for all four COVID-19 outbreaks in region H (R² = 0.782, p < 0.001), P (R² = 0.712, p < 0.001), X (R² = 0.837, p < 0.001), and Z (R² = 0.634, p < 0.001).

FIGURE 5

Figure 5. The simulation results of epidemic trends based on the different values of R_eff in each region. (A) Region H, (B) Region P, (C) Region X, and (D) Region Z.

For region H, the COVID-19 outbreak by the Delta variant in July 2021 could be divided into the natural transmission stage (July 15–August 1) whose R_eff = 4.30 and the effective control stage (August 2–August 15) was R_eff = 0.44. We have predicted that without any proper interventions, there would have been a total of 2,226 cases (95% CI: 1,871–2,586), while in fact, region H reported 129 cases, which means that taking effective prevention and control measures helped decrease 94.2% of the potential infections (Figure 5A).

The epidemic curves of outbreaks in regions P and X, which were caused by the Delta variant, could be divided into three stages. The first stage is denoted as the natural transmission stage. In region P, it was from 4 to 10 September, with R_eff = 6.5; in region X, it was from 8 to 13 September, with R_eff = 6.82. The second stage is the effective containment stage. In region P, it was from 11 to 15 September, with R_eff = 1.39; in region X, it was from 14 to 20 September, with R_eff = 1.51. The last stage is the effective control stage. In region P, it was from 16 to 23 September; in region X, it was from 21 to 30 September, both with R_eff = 0. Our prediction illustrated the significance of effective intervention measures via different effective reproduction numbers of three stages in both regions P and X. In the case of region P, if the public health departments had failed to implement interventions by 10 September and the outbreak had continued to develop with R_eff = 6.5, there might have been 14,076 (95% CI: 13,345–14,809) cases by 23 September, or if they had not strengthened the intervention measures at the effective containment stage, there would have been 400 (95% CI: 236–588) cases by 23 September. The actual number of COVID-19 cases in region P was 208, thus, after strict interventions on the disease transmission, 98.51 and 49.02% of potential infections were prevented (Figure 5B). Similarly, in the case of region X, if the government had not taken any measures for prevention and control, there might have been 71,930 (95%CI: 70,301–73,562) cases by 30 September, with R_eff = 6.82 for the virus, or if they had not maintained the strict intervention measures, by 30 September, the total number of cases would have been 518 (95%CI: 308–730). However, in fact, there were 236 cases in this COVID-19 outbreak in region X, thus, the effective reproduction numbers show that the prevention and control measures prevented 99.67 and 54.44% of potential cases in the first and second stages of this outbreak, respectively (Figure 5C).

For the COVID-19 outbreak in region Z, which was caused by the Omicron variant, similar to that of region H, we have divided its epidemic curve into two stages, with the first stage being the natural transmission stage from 13 to 15 January, where R_eff = 3.99, and the second stage denoted as the effective control stage from 16 to 21 January, where R_eff = 0.65. If no cases had been detected in region Z before 15 January and no measures had been taken, there would have been a total of 138 (95% CI: 66–213) cases by 21 January. The actual number of cases reported was 38, which shows that the timely measures taken reduced potential cases by 72.5% (Figure 5D).

The epidemic curve and SI-based method

According to the statistical analysis, we have found that for the COVID-19 outbreaks by Delta variant in regions H, P, and X, the average SI of onset date is 2.3, and the standard deviation is 3.4. For the COVID-19 outbreak by the Omicron variant in region Z, the average SI and its standard deviation are 2.9 and 2.4, respectively. Then, with the combination of the epidemic curve and SI, we obtained R_t for the COVID-19 outbreaks in regions H, P, X, and Z (Figure 6).

FIGURE 6

Figure 6. The simulation results of epidemic trends based on the different values of R_t in each region. (A) Region H, (B) Region P, (C) Region X, and (D) Region Z.

In region H, at the beginning of the epidemic (before 29 July), when there were no interventions, and R_t varies from 1.5 to 3, showing a rapidly increasing trend. After 1 August, when prevention and control measures were taken, R_twas declining, it was below 1 after 3 August, and continued until the end of the outbreak. Though the curve approached R_t = 1 on 8 and 11 August, it did not exceed it, due to the strict policies in place (Figure 6A).

In region P, according to the curve, R_t was between 5 and 6 at the beginning of the epidemic. Interventions had been implemented after detecting the first case on 10 September, which contributed to the rapid decline of the transmissibility of the variant, and it is consistent with the effective containment stage in the epidemic curve. After 14 September, R_t < 1, and it lasted until the end of this outbreak (Figure 6B).

In region X, which experienced the first factory cluster outbreak by the Delta variant of COVID-19 in China, the trend of its R_t curve fluctuates, yet it did not exceed 1.5. R_tand appeared to be declining after the public health department took prevention and control measures. R_t was <1 after 23 September, and it was consistent with the effective control stage (Figure 6C).

R_t of COVID-19 caused by the Omicron variant in January 2022 in region Z was over 1.5 before the administrations took effective measures on 15 January, which is a sign of a potential epidemic. After strict interventions, R_t decreased gradually, and it was below 1 on 19 January, and this outbreak was contained (Figure 6D).

Discussion

Evaluating the transmissibility of COVID-19 is of significance in the prevention and control of the pandemic. In this study, we obtained the reproduction numbers of COVID-19 through the definition method, the next-generation matrix-based method, and the SI and epidemic curve-based method, with outbreak data collected from four different regions of Delta and Omicron variants of SARS-CoV-2. Then, we made a comparison of the reproduction numbers obtained from different methods and looked into the merits and demerits of those calculation methods and their reproduction numbers in actual situations, so as to provide public health departments with a valid and reliable index for evaluating the transmissibility of COVID-19 and a theoretical basis for implementing intervention measures.

In selecting the outbreak data, we have chosen the outbreaks in region H in July 2022, in region P and X in September 2021, and in region Z in January 2022. For the past 2 years, SARS-CoV-2 has undergone mutations, where transmissibility varies with the various strains. Before November 2021, most of the outbreaks in China were caused by the Delta variant. After this, it was the Omicron variant that became the dominant variant worldwide (2, 26). In this study, outbreaks in regions H, P, and X were caused by the Delta variant, while the outbreak in region Z was caused by the Omicron variant. On the scale of the epidemic, outbreaks in regions H and P were observed to have a similar scale as the previous epidemics in China. As for the outbreak in region X in September 2021, this was the first factory cluster outbreak caused by the Delta variant in China. However, as for the outbreak in region Z in January 2022, though it was on a small scale, it is informative in relation to small outbreaks that may occur.

For the time distribution, it is concluded that outbreaks in all four regions showed a multi-phased characteristic, which is similar to most of the epidemics previously. As for the population and spatial distribution of the outbreaks in the four regions, it is confirmed that the population is susceptible to all COVID-19 variants, with no differences between Delta and Omicron variants. There was no difference in gender, although the age of the infected patients was mainly in the range of 20–39 years old. As for the disease severity, it is noteworthy that during the outbreaks caused by the Delta variant, there were cases of heavy symptoms, while in the outbreak of the Omicron variant, there were no heavy symptoms or serious cases; however, this may not be significant considering that there was only one data set of the Omicron variant in this study.

By obtaining the reproduction numbers through three different approaches, namely, the definition method, the next generation matrix-based method, and the SI and epidemic curve-based method, which is the main purpose of this study, we made comparisons between the different reproduction numbers (Table 3).

TABLE 3

Table 3. Reproduction numbers in four regions.

For the effective reproduction number, R_eff, obtained from the definition method, it is calculated that R_eff = 1.20, 1.14, 1.66, and 1.12 in regions H, P, X, and Z, respectively. The first three R_effs were caused by the Delta variant, and via the R_eff, which is relatively low, it showed that one infection could infect at least one susceptible individual. In previous studies, it was indicated that R₀ for the Delta variant was usually 5, and for the Omicron variant it was 5–7 (22, 27), which is significantly different from that in this study. Although we obtained R_eff that is consistent with the condition of transmission of the virus through the definition method, it is much slower than the real scenario. According to our analysis, it was found that the transmission chain in the cases was incomplete; in addition, as soon as the first case was reported, the government would have taken strict interventions against the epidemic, which may have resulted in bias in the results. Thus, it is concluded that the reproduction number obtained through the definition method cannot perform well in illustrating the transmissibility of the disease after interventions have been implemented; that is, it is considered that the definition method for calculating the reproduction number should be conducted in the early stages of the epidemic when there are clear transmission chains to predict the trend of the infectious disease (28).

Driven by the Chinese policy of preventing and controlling the transmission of SARS-CoV-2, appropriate interventions would have been implemented by the local government soon as a case was reported. Therefore, calculating the reproduction number, which is denoted as R_eff, through the next generation matrix method by using the transmission dynamics models is effective, because it can provide us with references for the trend of the epidemic and evaluate the effects of the conducted measures (8). The reproduction numbers in the outbreak of region P are taken as an example. According to the fitting curve, the outbreak could be divided into three phases. The first phase was the natural transmission phase, where R_eff = 6.5, and it explains the rapid increase of cases in the early stage of the outbreaks. When relevant interventions were taken, this was the second phase, where R_eff = 1.39, and although it shows that the transmissibility of the virus is slower than phase I, the virus was still able to transmit in region P and the outbreak continued. With the implementation of intervention measures were strengthen, this was the third phase, where R_eff = 0, and it showed that the outbreak was under control and there was no way that the disease would still spread in the city. In conclusion, it was found that the reproduction number from the next-generation matrix method was able to forecast and simulate the trend of the epidemic to evaluate the prevention and control measures more validly and offer significant references for future epidemics.

The time-varying reproduction number, obtained via epidemic curve and SI, presents the dynamic change of the epidemic and is now used more frequently (27, 29). However, the condition required for calculating R_t is more complicated: we have to recognize a clear transmission chain and obtain the SI between the first-generation cases and second-generation cases. In this study, we found that the SI of onset date for the Delta variant of the outbreaks in regions H, P, and X was 2.4 ± 3.4, which is smaller than that in Guangdong Province. The reason for this bias may be due to the lack of transmission chain in this study. While for the Omicron variant, whose onset date SI in region Z is 2.9 ± 2.4, in this outbreak, the transmissibility of the variant is less strong. However, because the total number of cases in region Z was much smaller, it is unrealistic to generalize the result in this case. As R_t is a real-time index, it is more instructive to use it as a reference when assessing the trend of epidemics to focus on prevention and controls. However, lacking transmission chains or losing the partial case data would lead to biases in calculating R_t, and, worse, could mislead decision-makers in connection to implementing intervention measures.

According to the above discussion, we have summarized with a decision tree to assist researchers to choose an optimal method for calculating reproduction numbers when there is a COVID-19 epidemic and to provide references for intervention implementation (Figure 7).

FIGURE 7

Figure 7. The decision tree for choosing the optimal reproduction number as an index for the transmissibility of SARS-CoV-2 during a COVID-19 epidemic.

Limitations

Due to technical conditions and data limitations, this study has the following limitations. First, the data used in this study are one provincial Delta variant outbreak, two municipal Delta variant outbreaks, and one municipal Omicron variant outbreak. As a comparative study, data information on the corresponding scale of the two types of variants should be guaranteed. Second, there are several ways to calculate the reproduction number, but in this study, only three methods were used for the calculation of the reproduction number, which can be further calculated by different methods and further compared to draw more reliable conclusions.

Conclusion

In this study, we calculated the reproduction numbers of four Chinese COVID-19 outbreaks by using the definition method, the next-generation matrix-based method, and the epidemic curve and SI-based method, then analyzed and compared the best applicable scenarios for each of these methods. The definition method is generally used in the early stages of an epidemic, when the number of cases is small, to be able to assess the virus transmissibility and the effectiveness of initial prevention and control measures and to be able to predict the trend of the epidemic, but the results may be biased when the transmission chain is unclear and the number of cases is too large. The method based on the next-generation matrix is mostly used in situations where there are more cases, and the transmission chains are unclear. The index incorporates a variety of factors to control transmission, which can well evaluate the measures taken for epidemic prevention and control, to make a judgment on the scale of the epidemic, but for larger-scale epidemic outbreaks, this method has some limitations. For the calculation of the reproduction number obtained using the method based on the epidemic curve and SI, R_t can give the real-time spread of COVID-19, making it easier and more efficient for decision-makers to take intervention measures, but the method is more complex and requires harsh conditions, and it needs enough clear generational relationships to calculate the SI so that the reproduction number can be calculated. Through the rational use of different reproduction number methods, the benefits of prevention and control measures can be maximized, and a good basis for further clarifying the epidemiological significance of disease transmission can be created.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.

Author contributions

TC, BA, WL, ZG, and CS designed research. BA, WL, and ZZ analyzed data. TC, CS, ZG, WL, BA, ZZ, JR, WS, YW, QC, and RF conducted the research and analyzed the results. TC, CS, WL, ZG, and BA wrote the manuscript. All authors read and approved the final manuscript.

Funding

This work was partly supported by the Bill & Melinda Gates Foundation (INV-005834), the National Key Research and Development Program of China (2021YFC2301604), and Guiding projects of the Science and Technology Program in Fujian Province (2019D014).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2022.1079842/full#supplementary-material

References

1. Liu W, Guo Z, Abudunaibi B, Ouyang X, Wang D, Yang T, et al. Model-Based evaluation of transmissibility and intervention measures for a COVID-19 outbreak in Xiamen City, China. Front Public Health. (2022) 10:887146. doi: 10.3389/fpubh.2022.887146

PubMed Abstract | CrossRef Full Text | Google Scholar

2. World Health Organization. WHO Coronavirus Disease (COVID-19) Dashboard. Available online at: https://www.who.int/emergencies/diseases/novel-coronavirus-2019 (accessed September 26, 2022).

Google Scholar

3. Sharun K, Tiwari R, Dhama K, Emran TB, Rabaan AA, Al Mutair A. Emerging SARS-CoV-2 variants: impact on vaccine efficacy and neutralizing antibodies. Hum Vaccin Immunother. (2021) 17:3491–4. doi: 10.1080/21645515.2021.1923350

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Deng B, Liu W, Guo Z, Luo L, Yang T, Huang J, et al. Natural history and cycle threshold values analysis of COVID-19 in Xiamen City, China. Infect Dis Model. (2022) 7:486–97. doi: 10.1016/j.idm.2022.07.007

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Pan A, Liu L, Wang C, Guo H, Hao X, Wang Q, et al. Association of public health interventions with the epidemiology of the COVID-19 outbreak in Wuhan, China. JAMA. (2020) 323:1915–23. doi: 10.1001/jama.2020.6130

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Flaxman S, Mishra S, Gandy A, Unwin HJT, Mellan TA, Coupland H, et al. Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature. (2020) 584:257–61. doi: 10.1038/s41586-020-2405-7

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Hens N, Vranck P, Molenberghs G. The COVID-19 epidemic, its mortality, and the role of non-pharmaceutical interventions. Eur Heart J Acute Cardiovasc Care. (2020) 9:204–8. doi: 10.1177/2048872620924922

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Linka K, Peirlinck M, Kuhl E. The reproduction number of COVID-19 and its correlation with public health interventions. Comput Mech. (2020) 66:1035–1050. doi: 10.1007/s00466-020-01880-8

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N Engl J Med. (2020) 382:1199–207. doi: 10.1056/NEJMoa2001316

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Shen M, Peng Z, Xiao Y, Zhang L. Modeling the epidemic trend of the 2019 novel coronavirus outbreak in China. Innovation. (2020) 1:100048. doi: 10.1016/j.xinn.2020.100048

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Guo X, Guo Y, Zhao Z, Yang S, Su Y, Zhao B, et al. Computing R 0 of dynamic models by a definition-based method. Infect Dis Model. (2022) 7:196–210. doi: 10.1016/j.idm.2022.05.004

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Cazelles B, Champagne C, Nguyen-Van-Yen B, Comiskey C, Vergu E, Roche B. A mechanistic and data-driven reconstruction of the time-varying reproduction number: application to the COVID-19 epidemic. PLoS Comput Biol. (2021) 17:e1009211. doi: 10.1371/journal.pcbi.1009211

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Gunzler D, Sehgal AR. Time-Varying COVID-19 reproduction number in the United States. medRxiv. [Preprint]. (2020). doi: 10.1101/2020.04.10.20060863

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Andreasen V. The final size of an epidemic and its relation to the basic reproduction number. Bull Math Biol. (2011) 73:2305–21. doi: 10.1007/s11538-010-9623-3

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Viceconte G, Petrosillo N. COVID-19 R0: magic number or conundrum? Infect Dis Rep. (2020) 12:8516. doi: 10.4081/idr.2020.8516

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Xu C, Dong Y, Yu X, Wang H, Tsamlag L, Zhang S, et al. Estimation of reproduction numbers of COVID-19 in typical countries and epidemic trends under different prevention and control scenarios. Front Med. (2020) 14:613–22. doi: 10.1007/s11684-020-0787-4

PubMed Abstract | CrossRef Full Text | Google Scholar

17. O'Driscoll M, Harry C, Donnelly CA, Cori A, Dorigatti I. A comparative analysis of statistical methods to estimate the reproduction number in emerging epidemics, with implications for the current coronavirus disease 2019 (COVID-19) pandemic. Clin Infect Dis. (2021) 73:e215–23. doi: 10.1093/cid/ciaa1599

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Li K, Ruan F, Zhao Z, Guo Z, Yang Z, Yu S, et al. Comparative analysis of transmission and vaccine effectiveness in omicron and delta variant outbreaks in China. J Infect. (2022). doi: 10.1016/j.jinf.2022.08.018

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Sun K, Wang W, Gao L, Wang Y, Luo K, Ren L, et al. Transmission heterogeneities, kinetics, and controllability of SARS-CoV-2. Science. (2021) 371:eabe2424. doi: 10.1126/science.abe2424

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Chen T, Zeng Y, Yang D, Ye W, Zhang J, Lin C, et al. Nomogram model for prediction of SARS-CoV-2 breakthrough infection in fujian: a case-control real-world study. Front Cell Infect Microbiol. (2022) 12:932204. doi: 10.3389/fcimb.2022.932204

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Chen TM, Rui J, Wang QP, Zhao ZY, Cui JA, Yin L. A mathematical model for simulating the phase-based transmissibility of a novel coronavirus. Infect Dis Poverty. (2020) 9:24. doi: 10.1186/s40249-020-00640-3

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Nishiura H, Ito K, Anzai A, Kobayashi T, Piantham C, Rodriguez-Morales AJ. Relative reproduction number of SARS-CoV-2 omicron (B.1.1.529) compared with delta variant in South Africa. J Clin Med. (2021) 11:30. doi: 10.3390/jcm11010030

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Zhao Q, Yang M, Wang Y, Yao L, Qiao J, Cheng Z, et al. Effectiveness of interventions to control transmission of reemergent cases of COVID-19—Jilin Province, China, 2020. China CDC Wkly. (2020) 2:651–4. doi: 10.46234/ccdcw2020.181

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Liu W, Ye W, Zhao Z, Liu C, Deng B, Luo L, et al. Modelling the emerging COVID-19 epidemic and estimating intervention effectiveness - Taiwan, China, 2021. China CDC Wkly. (2021) 3:716–9. doi: 10.46234/ccdcw2021.177

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Cheng X, Hu J, Luo L, Zhao Z, Zhang N, Hannah MN, et al. Impact of interventions on the incidence of natural focal diseases during the outbreak of COVID-19 in Jiangsu Province, China. Parasit Vect. (2021) 14:483. doi: 10.1186/s13071-021-04986-x

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Niu Y, Rui J, Wang Q, Zhang W, Chen Z, Xie F, et al. Containing the transmission of COVID-19: a modeling study in 160 countries. Front Med. (2021) 8:701836. doi: 10.3389/fmed.2021.701836

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Alene M, Yismaw L, Assemie MA, Ketema DB, Gietaneh W, Birhan TY. Serial interval and incubation period of COVID-19: a systematic review and meta-analysis. BMC Infect Dis. (2021) 21:257. doi: 10.1186/s12879-021-05950-x

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Shaw CL, Kennedy DA. What the reproductive number R0 can and cannot tell us about COVID-19 dynamics. Theor Popul Biol. (2021) 137:2–9. doi: 10.1016/j.tpb.2020.12.003

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Gautret P, Million M, Jarrot PA, Camoin-Jau L, Colson P, Fenollar F, et al. Natural history of COVID-19 and therapeutic options. Expert Rev Clin Immunol. (2020) 16:1159–84. doi: 10.1080/1744666X.2021.1847640

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: COVID-19, reproduction number (R), definition methods, next generation matrix, serial interval (SI)

Citation: Abudunaibi B, Liu W, Guo Z, Zhao Z, Rui J, Song W, Wang Y, Chen Q, Frutos R, Su C and Chen T (2023) A comparative study on the three calculation methods for reproduction numbers of COVID-19. Front. Med. 9:1079842. doi: 10.3389/fmed.2022.1079842

Received: 25 October 2022; Accepted: 05 December 2022;
Published: 05 January 2023.

Edited by:

Reza Lashgari, Shahid Beheshti University, Iran

Reviewed by:

Jin-ren Pan, Zhejiang Center for Disease Control and Prevention (Zhejiang CDC), China
Ugo Avila-Ponce De León, National Autonomous University of Mexico, Mexico

Copyright © 2023 Abudunaibi, Liu, Guo, Zhao, Rui, Song, Wang, Chen, Frutos, Su and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tianmu Chen, yes MTM2OTg2NjVAcXEuY29t; Chenghao Su, yes MTI3MjIwODM3MkBxcS5jb20=

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

A comparative study on the three calculation methods for reproduction numbers of COVID-19

Introduction

Methods

Study design

Data collection

Calculation of COVID-19 reproduction number

The definition method

The next-generation matrix-based method

Model development

Parameter estimation

SI and epidemic curve-based method

Statistical analysis

Results

Epidemiological characteristics of COVID-19 in the four regions

Calculating COVID-19 reproduction number

The definition method

The next-generation matrix-based method

The epidemic curve and SI-based method

Discussion

Limitations

Conclusion

Data availability statement

Author contributions

Funding

Conflict of interest

Publisher's note

Supplementary material

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good