Skip to main content

COMMUNITY CASE STUDY article

Front. Big Data, 27 April 2022
Sec. Medicine and Public Health
This article is part of the Research Topic COVID-19: Integrating Artificial Intelligence, Data Science, Mathematics, Medicine and Public Health, Epidemiology, Neuroscience, and Biomedical Science in Pandemic Management View all 95 articles

Research on Spatial-temporal Spread and Risk Profile of the COVID-19 Epidemic Based on Mobile Phone Trajectory Data

\nQi Zuo
Qi Zuo1*Jiaman DuJiaman Du2Baofeng DiBaofeng Di1Junrong ZhouJunrong Zhou3Lixia ZhangLixia Zhang4Hongxia LiuHongxia Liu5Xiaoyu HouXiaoyu Hou6
  • 1Institute for Disaster Management and Reconstruction, Sichuan University, Chengdu, China
  • 2The School of International Studies, Sichuan University, Chengdu, China
  • 3Chengdu Fangwei Technology Co., Ltd., Chengdu, China
  • 4Sichuan Wisesoft System Integration Co., Ltd., Chengdu, China
  • 5West China Hospital, Sichuan University, Chengdu, China
  • 6SinoMaps Press Co., Ltd., Beijing, China

The COVID-19 epidemic poses a significant challenge to the operation of society and the resumption of work and production. How to quickly track the resident location and activity trajectory of the population, and identify the spread risk of the COVID-19 in geospatial space has important theoretical and practical significance for controlling the spread of the virus on a large scale. In this study, we take the geographical community as the research object, and use the mobile phone trajectory data to construct the spatiotemporal profile of the potential high-risk population. First, by using the spatiotemporal data collision method, identify, and recover the trajectories of the people who were in the same area with the confirmed patients during the same time. Then, based on the range of activities of both cohorts (the confirmed cases and the potentially infected groups), we analyze the risk level of the relevant places and evaluate the scale of potential spread. Finally, we calculate the probability of infection for different communities and construct the spatiotemporal profile for the transmission to help guide the distribution of preventive materials and human resources. The proposed method is verified using survey data of 10 confirmed cases and statistical data of 96 high-risk neighborhoods in Chengdu, China, between 15 January 2020 and 15 February 2020. The analysis finds that the method accurately simulates the spatiotemporal spread of the epidemic in Chengdu and measures the risk level in specific areas, which provides an objective basis for the government and relevant parties to plan and manage the prevention and control of the epidemic.

Introduction

Since the first reported case of COVID-19 in the world in December 2019, and until 30 September 2020, 208 countries and regions on six continents have successively reported confirmed cases. The mortality rate of the European countries is peaking at more than 10% at the end of March 2020 (Niu et al., 2020). Studies have found that the spread of the COVID-19 virus is affected by population density and population mobility, and has obvious temporal and spatial aggregation (Huang and He, 2010; Ahmadi et al., 2020; Laborde et al., 2020; Li et al., 2020). Therefore, tracking and isolation of close contacts and suspected cases is the key to epidemic prevention and control.

Mobile phone trajectory data is a kind of spatiotemporal trajectory big data. Based on the position of the cellular base stations, the data can be extrapolated into the spatiotemporal distribution and flow matrix of the entire population in a highly anonymized manner. The rapid development and popularization of information technology make mobile phone trajectory data analysis become one of the most effective ways to quickly track the movement of the population. However, mobile phone trajectory data is complex and diverse, and the analysis workload is heavy. Fortunately, the rapid collision method based on time–space trajectory greatly improves the analysis efficiency. In the scene of large data volume and sizeable crowd, this method can still quickly analyze the data with high performance and obtain information about the number of encounters or contacts between individuals and the location of the contacts (Tang et al., 2017; Wang, 2017). Therefore, based on the characteristics of mobile phone trajectory data, by combining the communication signaling and geographic information, we simulate the full trajectory data of more than 16 million people in Chengdu to form the trajectory basis for spatiotemporal collisions. Process continuous trajectory data, find out potential high-risk populations and their distribution, classify and evaluate the activity area and risk size formed by the re-diffusion of potential high-risk populations, and analyze the spatial activity and spatial relationship of potential high-risk populations. Finally, through the risk assessment of these regions, the spatiotemporal spread and risk profiles of the epidemic are constructed, in order to provide a methodological basis for guiding the allocation of anti-epidemic resources and manpower in specific geographic regions and the formulation of epidemic prevention and control policies.

The potential risk population referred to in this article is the group of people who have had “time and space” intersections with the confirmed patients within 14 days.

Literature Review

Studies have found that the spread of the COVID-19 virus, affected by population density and population mobility, displays obvious Space-time aggregation (Ahmadi et al., 2020; Laborde et al., 2020; Li et al., 2020; Huang and He, 2010). At present, the research on the prevention and control measures of COVID-19 mainly focuses on: rapidly tracing the movement of contacts within 4 days of the onset of the COVID-19 symptoms is critical for epidemic prevention and control (Willem et al., 2020), but the ultimate success of epidemic prevention and control depends on the speed and effectiveness of controlling suspected infected persons. Only when the time from primary case infection to contact tracing is shorter than the incubation period can the spread of the outbreak be prevented to the greatest extent possible (Hsiang et al., 2020; Keeling et al., 2020). Besides appropriate social isolation can reduce the COVID-19 infection rate, mortality rate, and the need for medical resources (Aaa, 2021). For example, in socially intensive environments such as schools or workplaces (Ahmed et al., 2018). However, with the advancement of current vaccine research and development technology, contact tracing and follow-up control measures have become particularly important in the early stages of epidemic control (Müller et al., 2000; Kwok et al., 2019; Read et al., 2020; Walker et al., 2020). Rapid and effective contact tracing can reduce the initial number of cases, which would make the outbreak easier to control overall (Jin et al., 2020). Effective contact tracing and isolation could contribute to reducing the overall size of an outbreak or bringing it under control over a longer period (Hellewell et al., 2020; Xu et al., 2020).

They used mobile-phone-data to develop a spatio-temporal “risk source” model that leverages population flow data to forecast confirmed cases, and identify high-transmission-risk locales at an early stage (Jia et al., 2020). Or they used the location data of the subscriber identity module (SIM) card from the largest mobile phone company in Haiti, to predict the scale and trend of population migration in Haiti following the 2010 earthquake and cholera outbreak. His studies have shown that it is potentially efficient to use this method to estimate population mobility in areas with high mobile phone usage (Bengtsson et al., 2011). Since the outbreak of the COVID-19 epidemic, many scholars have also done relevant theoretical and applied research on the protection of the epidemic and the resumption of work and production based on mobile phone signaling and spatiotemporal data. For example, based on epidemiological survey data of COVID-19 infection cases and mobile phone trajectory data of 15 million users during the epidemic, the spatiotemporal characteristics of COVID-19 and changes in personnel mobility were analyzed (Li et al., 2020; Liu et al., 2021). By aggregating and analyzing the big data and the epidemic data, it suggests cutting off the connection between different human mobility network communities and blocking the local transmission inside the community (Dong et al., 2020). The Johns Hopkins University in the United States has developed a real-time updated global spatiotemporal database of COVID-19 for epidemic prevention and research. The Chongqing has also formed a COVID-19 epidemic query system based on natural resources, a spatiotemporal geographic database, and a spatiotemporal big data service platform, and developed a COVID-19 epidemics map to report the progress of the epidemic in real-time. In addition, the research also builds a population mobile network model based on mobile phone location data to assess the possibility of infection in different regions. And with the support of spatiotemporal data, a comprehensive analysis of the spatiotemporal distribution of patient behavior trajectories was carried out. In addition, combining official data onto city, district, and county governments. Based on official case data, explore the temporal and spatial evolution of the COVID-19 epidemic and assess the epidemic risk level of key cities (Liu et al., 2020; Wang et al., 2020).

Previous research results have effectively supported COVID-19 epidemic protection and control of the epidemic at the national, social, and urban levels, but it needs to be in-depth, which has played an important role in blocking the spread of the epidemic, ensuring the needs of residents, and maintaining social stability community's research. In addition, most of the discussions on COVID-19 epidemic protection and control measures focus on the isolation and control of two types of people, confirmed patients and contacts, and how to quickly identify and locate the activity trajectories of these two groups of people, and take accurate and timely follow-up prevention and control measures. There are some shortcomings, and these two points are the main content that this article wants to explore.

Methods

The Research Data

The data used in this article come from the epidemic community data released by the Sichuan Provincial Health and Health Commission from 15 January to 15 February 2020, and questionnaire data of 10 confirmed cases in a hospital in Chengdu. Interviews include age, gender, residence, working place, time of diagnosis, time of isolation, travel, transportation, and relationship with other existing confirmed cases, etc. It also includes statistics on the population of more than 16 million in Chengdu.

The Research Methods

Method to Generate the Population Activities

1) Signaling location data integration: According to the time and space distribution characteristics of the signaling trajectory data of a single user entity, integrate the adjacent signaling data in time and space to correct the error data and eliminate invalid data. Determine the characteristics of the signaling trace data of the user in a behavior scenario;

2) Space block entity feature trajectory calculation: According to the base station cell engineering parameters, including network standard, coverage type, coverage scene, direction angle, latitude and longitude coordinate position, etc., combined with the base station coverage gravity model and cluster analysis in the actual scene, the trajectory data is processed. Calculate the user's actual geographic location; use the actual location and coverage coordinate distribution of the space block provided by China Map Publishing Group to calculate the user's characteristic trajectory on the space block entity;

3) User space behavior recognition: According to the user's behavior characteristic trajectory on the spatial entity, combined with the attribute characteristics of the context space block entity in the trajectory, a user behavior trajectory chain is formed;

4) User base location tags and behavior feature tags: According to the user behavior tag trajectory chain, the multi-dimensional statistical results of user behavior characteristics are formed, such as users, blocks, behavior frequency within a period, average residence time, number of days of stay in working days, non-work analysis indicators such as the number of days of stay in a day, the number of similar behavior blocks; At the same time, a variety of model algorithms including Monte Carlo tree search are introduced to label users with basic location—workplace, residence, etc.; combined with analysis indicators to characterize Various feature tags form a clear user portrait.

Model Structure

We suppose the operator's population trajectory data: Pi, {lxli}t, where “Pi” is a person, “li” is a location, “t” is time and time span. Finally, the spatial-temporal collision forms the population trajectory data.

Step 1: Conduct basic evaluation of location-based on population spatial activity behavior.

1) Based on the trajectory of human activities, the stationary points of people is deduced, where “pi” is a person, “pi,{lxli}t” is the trajectory of “pi”. The effective stationary position of “pi” is “pi{L1Li}”, where “Li” represents the effective lingering position obtained based on the statistical cycle of “{lxli}t”, and the lingering time is more than 30 min.

2) According to “pisb, {LliveLi }”, “pi” is deduced, where “ai” is the activity of “pi” and “di” is the dispersion of lingering position of “pi” (“ai” is the locations that people have visited and lingered in the target cycle; “di” is the calculation of the degree of location dispersion involved in “pi,{LliveLi}”, i.e., the degree of lingering points dispersion of “pi”).

3) Table of spatial behavior data of a new person —TAB1;

4) Taking “Li” as the statistical objective, “Li, P, Plive, A, D, FTAB2” is derived, where “P” is the population size: the number of people effectively linger at “Pall” is the de-duplication count of “pi”, and “Plive” is the resident population: when the number of people effectively resident in “Llive” is the de-duplication count “pi” etc.; “A” is population activity: a¯=1ni=1nai, “D” is the diffusance: what other locations have been visited by people who has stayed at this location, and then set this location as the center to calculate the dispersion of the two locations, “F” is fluidity: “PlivePall * 100%”.

Step 2: Assuming that the specific patient is known, the risk factor is determined at the increased position.

Define “S” as the identified risk, “si” as the specific patient, and “TAB1=pi=pi, {lxli}t”, based on “si”.

1) In TAB1, based on “S(pi)”, get “pi,{LliveLi },ai,di—TAB3.”

2) “{LliveLi}” in TAB3, use “Li” as the statistical target to calculate “Li,P,Plive,A,D–TAB4”, where “P′” is the total number of patients living in this location, “A′” is the patient's spatial activity, and “D′” is the patient's spatial spread.

Step 3: By the rapid collision method based on the trajectory of patients passing by at the same time, increase the potential risk factors at the location.

1) Based on “t” and “li” in “pi,{lxli}t” of “si”, map “pi,{lxli}t” to get “pi” (excluding “pi”).

2) In TAB1, get “pi,{LliveLi},ai,di, —TAB5” based on “B(pi)

3) In TAB5, “{LliveLi}” takes “Li” as the statistical target to get “Li,P,A,D–TAB6”, where “{LliveLi}” (excluding “{LliveLi}” in TAB3), “P″” are the number of people who have been at the same time and space, “Plive” are the number of people who have been at the same time and space living here, “A″” is the spatial activity of people who have been at the same time and space, and “D″” is the spatial diffusion of people who have been the same time and space.

Step4: Combine TAB2, TAB4, TAB6 to get TAB7, and comprehensively evaluate the risk category of the specific location based on TAB7.

The location risk is preliminarily classified by using the numerical value, as shown in Table 1.

TABLE 1
www.frontiersin.org

Table 1. Risk categories for a specific location.

Step 5: Based on Moran's I Statistics, conduct an overall evaluation of the risk distribution in the target area.

Moran's I was used to measure the degree of Spatial autocorrelation (similar dimensional attributes), it means to evaluate whether the distribution of elements was aggregated, discrete or random. Moran's I index closer to “1” indicates aggregation, and closer to “-1” indicates dispersion. As shown in Table 2.

TABLE 2
www.frontiersin.org

Table 2. Spatial autocorrelation instruction.

Step 6: Based on “Pi{L1…Li}”, get the specific spatial field correlation.

Information entropy is used to measure the correlation of spatial fields. The conditions for defining the correlation index “r” between two discrete space fields are as follows: when the two space fields are independent of each other, there is “rmin=0”. When the correlation degree of the two spatial fields increases gradually, “r” also increases gradually. There is “rmax=1” when the two space fields are completely correlated. According to the nature of information entropy, the expression of “r” can be established.

When the discrete state sets “A” and “B” are completely related, each type area of the two layers must have a one-to-one mapping relationship in space. According to the layer properties of the index “A” space, it is divided into “m” types. The proportion of the area of the type “ai” to the total area of the area is P(ai), and the layer properties of the index “B” space are divided into “n” types, and the proportion of the area of the type “bj” to the total area of the area is “P(bj).” “aibj” represents the category after the type “ai” in the “A” layer and the space “bj” in the “B” layer are superimposed (up to “m*n” categories appear after the two layers are superimposed), and the area ratio is “P (aibj),” and the information entropy is H(A), H(B) and H(A,B). Finally, the correlation index between thematic layers “A” and “B” is calculated. The higher the entropy is, the more information can be transmitted; the lower the entropy is, the less information can be transmitted. Thereby, the linkage cascading relationships among indicators of different dimensions can be classified. As shown in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1. Information entropy relationship.

Results

Conduct Basic Location Evaluation Based on Population Spatial Activity Behavior

Analysis of the Number of Population Resident Places

By using the trajectory activity data of 10 confirmed patients collected, the random distribution and activity spaces of more than 16 million Chengdu are simulated based on the model, and the number of residents places in Chengdu within 14 days is obtained. Whether it is a transient population or temporary population, the more effective residences there are, the higher the transmission of infection will be. As shown in Table 3.

TABLE 3
www.frontiersin.org

Table 3. The number of effective population resident places.

To classify the temporal and spatial activity of the population and set the risk coefficient. As shown in Table 4.

TABLE 4
www.frontiersin.org

Table 4. Classify the temporal and spatial activity of the population and the risk factor.

Analysis of the Scope of Population Activities

The analysis of the scope of activity is mainly aimed at the spatial distance of the “location,” that is, the description and analysis of the dispersion of the stationary points of people, which represents the direction of spatial diffusion. Combined with the number of resident positions, the spatial dispersion can be measured as whether it is concentrated or dispersed. The higher the value is, the higher its spatial propagation is. The setting of each propagation coefficient is shown in Table 5.

TABLE 5
www.frontiersin.org

Table 5. The degree of spatial transmission of people.

Distribution of Track Resident Point of People-Based on Resident Places

Based on the simulation data of population activity trajectory, the distribution of resident heat, residential heat, activity, and spatial diffusion of the population can be obtained. As shown in Figure 2.

FIGURE 2
www.frontiersin.org

Figure 2. Distribution of track resident point of people (A) population resident heat distribution map (B) population residential heat distribution map (C) distribution of population spatial activity (D) distribution of population spatial. The darker color presents more people in the location, and the lighter color presents fewer people stay in the location, and correspondingly the heat is lower.

It can be seen from the Population Resident Heat Distribution Map that the darker color presents more people stay in the place, and the lighter color presents less people stay in the place, and the resident heat are lower. Residential heat distribution is different from resident heat distribution, once a confirmed case is found in the place of residence, the size of the population at that location can be quickly known and the epidemic situation in the place can be protected in time. In addition, combining with “P,” location “P/Plive” with high-population mobility can be quickly obtained. With high mobility (the greater the ratio), the higher the risk coefficient of transmission or infection in this location will be. In the Population Activity Distribution Map, the darker the color, the greater the number of related locations between this location and other locations. From the Distribution of Population Spatial Diffusion Map, the spatial diffusion of a single person is mainly spread out in the way of one point and more lines.

Analysis of Spatial Activity of Confirmed Patients

Distribution of Residence and Track Resident Point of Confirmed Patients

According to the specific locations of the 94 closed communities with confirmed cases as of 14 February 2020 issued by the Health Commission of Sichuan Province, the specific location distribution map of the confirmed patients can be drawn, as shown in Figure 3.

FIGURE 3
www.frontiersin.org

Figure 3. Distribution of residence and track resident point of confirmed patients (A) stationary place distribution (B) trajectory distribution.

As can be seen from the figure, the residents of the confirmed patients are mainly distributed downtown. Resident location distribution is related to regional population density and socio-economic vitality. The downtown or places with convenient transportation (subway, bus) lines, large shopping malls, and shopping centers are places where confirmed patients stay frequently.

Analysis of Activity and Spatial Diffusion of Confirmed Patients

The spatial information such as the patient's address and activity location is converted into spatial and temporal data to form the spatial transmission trajectory of the confirmed cases. According to the second step of the analysis method, the activity and spatial diffusion of the confirmed patients can be analyzed. However, in order to protect personal privacy, the trajectory of confirmed patients will not be analyzed graphically.

Distribution of Track Resident Point of Trajectories of People, Who Have Been at the Same Time and Space With the Confirmed Patient

According to Figure 4 the trajectories of people who have been at the same time and space with the confirmed patient, it can conclude that the trajectories of such people are almost all over the districts and counties of Chengdu. Among them, the densest track resident points are the downtown.

FIGURE 4
www.frontiersin.org

Figure 4. Distribution of track resident point.

Risk Profiling of COVID-19 Epidemic for Specific Location

As shown in Table 6.

TABLE 6
www.frontiersin.org

Table 6. Calculation methods for different risk categories.

Calculation of Risk Factors for a Specific Location

Assuming that the risk value of each category is the same, the risk value of epidemic transmission is divided into segments, and the level is set. As shown in Table 5.

Analysis of Risk Categories at Specific Locations

From 0:00 to 6:00 on 8 December 2020, 3 new cases of new coronary pneumonia were confirmed in Chengdu. The students of the Chengdu University of Technology have had contact with this confirmed case. Therefore, this article taking the Chengdu University of Technology as an example analyzes the comprehensive information of its location according to the aforementioned method as shown in Figure 5.

FIGURE 5
www.frontiersin.org

Figure 5. Comprehensive location evaluation of the Chengdu University of Technology (A) The Chengdu University of Technology geographical distribution. Based on the spatial activity of the population to realize the time and space traceability and time and space tracking: Be able to trace and track the source of time and space for residents (including residence only, work only, job, and residence at the same address/same street/same district), visits (non-resident population in the district); It can analyze the overall situation in the administrative area, each street, and the specific community. (B) Analysis of diagnosed patients and intermediate communicators of the Chengdu University of Technology (C) Several elements of risk assessment: Risk category: There are 3 categories (A, B, B). Risk factor: It is mainly obtained based on the comprehensive analysis of population number and regional population density. The larger the value, the higher the risk; Resident population's out-of-home activity coefficient: A factor of the risk coefficient, which describes the outdoor activities of the permanent population in the location. The larger the number, the more active the residents in the location are out-going activities; Inbound visitor activity coefficient: a factor of the risk coefficient, indicating the liquidity of the location, the higher the coefficient, the stronger the liquidity.

Combining calculations with specific models can be obtained, the Chengdu University of Technology, epidemic risk category: A, epidemic risk coefficient: 5, of which, the resident population's active coefficient of going out: 10, and the active coefficient of foreign visitors: 8.

Risk Profiling of COVID-19 Epidemic in Chengdu

Distribution of Regional Risks in Chengdu

According to the standardized statistics of the Local Moran's I index test, the spatial correlation values of different risk categories can be calculated, and the overall spatial distribution type can be determined based on this correlation, as shown in Table 7.

TABLE 7
www.frontiersin.org

Table 7. Description of the spatial correlation distribution of different risk categories.

According to the aforementioned formula, the spatial correlation distribution value of the identified risk area can be calculated. For areas with positive spatial autocorrelation, centralized protection deployment can be adopted. For areas with negative spatial autocorrelation, decentralized protection methods can be implemented. In areas with weak spatial correlation, the distribution among communities is generally random and the COVID-19 epidemic protection is complex. Therefore, the actual situation of the epidemic in the community should be investigated and corresponding protection measures can be taken.

Portrait of Epidemic Risk in Chengdu

The risk value is segmented and the risk level is set. Assuming that the risk value of each category is the same, based on the comprehensive distribution, a geographic profile of the risk of the epidemic community in Chengdu can be obtained, as shown in Figure 6.

FIGURE 6
www.frontiersin.org

Figure 6. The COVID-19 epidemic risk map of Chengdu.

According to the different colors shown in the figure aforementioned, the risk of the COVID-19 epidemics in Chengdu can be clearly identified, and the sequence of nucleic acid testing for COVID-19 can also be accurately guided in each region. For example, in areas with high-spatial population activity, high diffusion, and high-transmission coefficient, priority will be given to arranging local professional epidemic prevention forces to the station in the community, and residents in the whole region should conduct nucleic acid testing. In the potential risk areas, nucleic acid testing and identification of suspected cases should be strengthened, and the movement of suspected cases should be closely tracked, which can help to cut off the transmission chain of the epidemic in the shortest time. In areas where the risk assessment shows normal, nucleic acid testing can be done voluntarily by residents, in order to promptly guide and relieve residents' nervousness.

Analysis of COVID-19 Epidemic Spread in Chengdu

The analysis shows that 36% of the affected communities in Chengdu are within 1 km next to each other, and 9% of the communities are within 500 m. There are four main distribution characteristics: “A-B-C-A” triangular shape, “A-B-C” broken line, “A-B” segment type and scatter type, as shown in Table 8.

TABLE 8
www.frontiersin.org

Table 8. Specific distribution characteristics of the affected communities.

The distribution characteristics of the affected communities aforementioned in Chengdu: “triangle,” “broken line” and “line segment.” The causes and rules of the spread of the COVID-19 epidemic can be preliminarily summarized, so that different protection measures can be taken according to the distribution shape among communities.

Spatial Distribution of Epidemic Communities in Chengdu

Based on the confirmed cases published by the Health Commission of Sichuan Province, the spatial distribution of the outbreak communities in Chengdu is described, as shown in Figure 7.

FIGURE 7
www.frontiersin.org

Figure 7. The spatial distribution of epidemic communities.

The aforementioned picture depicts the spatial distribution of the epidemic communities in Chengdu, which can guide relevant government departments to formulate appropriate plans to block the spread of the COVID-19 epidemics. In areas where epidemic communities are densely distributed, protection departments must implement strict epidemic surveillance to ensure early detection, quarantine, and treatment of infected people to reduce the spread of the COVID-19 epidemic. For non-infected communities that are far away from the affected communities, the protection strategy should be adjusted to prevent the importation of outbreaks from other places, so that the resumption of work and production can be carried out to the maximum extent.

Discussion

Based on the mobile phone trajectory data, using the research methods and steps in this article, combined with the relevant data of the epidemic in Chengdu, the verification found that: according to the established model, based on basic population data and spatiotemporal collision methods, it accurately described people who have close contacts. The epidemic risk coefficient and risk level of the Chengdu University of Technology, and also the epidemic risk map of the entire area of Chengdu and the overall distribution characteristics of the epidemic community. This distribution feature can effectively guide different urban areas and risk communities to carry out nucleic acid inspections, putting anti-epidemic supplies, and formulating plans for resuming work and production.

In addition, this method can also be used for the research on spatial-temporal diffusion and profiling of various risks in the future. There are also aspects that need to be improved and optimized in the research: first, the activity trajectory of confirmed cases, medical information, and other data are relatively fragmented and not detailed enough. The overall profiling of the development of the COVID-19 epidemic in Chengdu may be biased. Future analysis of similar disaster events still needs to rely on more detailed spatial-temporal trajectory data and further study the micro-scale population activities and risk communication. Second, considering personal privacy and the difficulty of obtaining certain data, the analysis of the spatial diffusion of risk factors and the overall diffusion situation of the region is relatively rough. Therefore, the analysis method should be improved in the future, which requires a higher requirement for desensitization of spatial-temporal data.

Data Availability Statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.

Ethics Statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

QZ, BD, and JZ substantial contributions to the conception or design of the work and the acquisition and analysis. JZ, XH, HL, and LZ contributions to the acquisition and analysis of data for the work. JZ, LZ, and JD drafting the work or revising it critically for important intellectual content. QZ and BD final approval of the version to be published. LZ and JD agreement to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. All authors contributed to the article and approved the submitted version.

Funding

National Key Research and Development Program of China, No. 2020YFD1100701.

Conflict of Interest

JZ was employed by Chengdu Fangwei Technology Co., Ltd. LZ was employed by Sichuan Chuanda Zhisheng System Integration Co., Ltd. XH was employed by SinoMaps Press Co., Ltd.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We gratefully appreciate the Foundation item: National Key Research and Development Program of China, No.2020YFD1100701 and Novel Coronavirus (2019-NCoV) Emergency Project of Sichuan University.

References

Aaa, B.. (2021). Role of social distancing in tackling covid-19 during the first wave of pandemic in nordic region: evidence from daily deaths, infections and needed hospital resources. Int. J. Nurs. Sci. 8, 145–151. doi: 10.1016/j.ijnss.2021.03.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Ahmadi, M., Sharifi, A., Dorosti, S., Ghoushchi, S. J., and Ghanbari, N. (2020). Investigation of effective climatology parameters on covid-19 outbreak in Iran. Sci. Total Environ. 729, 138705. doi: 10.1016/j.scitotenv.2020.138705

PubMed Abstract | CrossRef Full Text | Google Scholar

Ahmed, F., Zviedrite, N., and Uzicanin, A. (2018). Effectiveness of workplace social distancing measures in reducing influenza transmission: a systematic review. BMC Public Health 18, 518. doi: 10.1186/s12889-018-5446-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Bengtsson, L., Lu, X., Thorson, A., Garfield, R., Schreeb, J. V., and Gething, P. W. (2011). Improved response to disasters and outbreaks by tracking population movements with mobile phone network data: a post-earthquake geospatial study in haiti. PLoS Med. 8, e1001083. doi: 10.1371/journal.pmed.1001083

PubMed Abstract | CrossRef Full Text | Google Scholar

Dong, E., Du, H., and Gardner, L. M. (2020). An interactive web-based dashboard to track covid-19 in real time. Lancet Infect. Dis. 20, 533–534. doi: 10.1016/S1473-3099(20)30120-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Hellewell, J., Abbott, S., Gimma, A., Bosse, N. I., and Zandvoort, K. V. (2020). Feasibility of controlling covid-19 outbreaks by isolation of cases and contacts. Lancet Global Health 8, 4. doi: 10.1016/S2214-109X(20)30074-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Hsiang, S., Allen, D., Annan-Phan, S., Bell, K., and Wu, T. (2020). The effect of large-scale anti-contagion policies on the covid-19 pandemic. Nature 584, 262–267. doi: 10.1038/s41586-020-2404-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, K. S., and He, D. (2010). Research on population flow trajectory based on big data. Population Health Sci. 2, 14–17.

Google Scholar

Jia, J. S., Lu, X., Yuan, Y., Xu, G., and Christakis, N. A. (2020). Population flow drives spatio-temporal distribution of covid-19 in china. Nature 582, 1–11. doi: 10.1038/s41586-020-2284-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, A. N., Li, G., and Wang, J. B. (2020). Spatio-temporal evolution and control strategies of COVID-19 epidemic in Shenzhen China. J. Shaanxi Normal University 48, 18–32. doi: 10.15983/j.cnki.jsnu.2020.03.017

CrossRef Full Text | Google Scholar

Keeling, M. J., Hollingsworth, T. D., and Read, J. M. (2020). The efficacy of contact tracing for the containment of the 2019 novel coronavirus (COVID-19). J. Epidemiol. Community Health. 74, 861–866. doi: 10.1101/2020.02.14.20023036

PubMed Abstract | CrossRef Full Text | Google Scholar

Kwok, K. O., Tang, A., Wei, V., Park, W. H., Yeoh, E. K., and Riley, S. (2019). Epidemic models of contact tracing: systematic review of transmission studies of severe acute respiratory syndrome and middle east respiratory syndrome. Comput. Struct. Biotechnol. J. 17, 186–194. doi: 10.1016/j.csbj.2019.01.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Laborde, D., Martin, W., Swinnen, J., and Vos, R. (2020). Covid-19 risks to global food security. Science 369, 500–502. doi: 10.1126/science.abc4765

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, D. R., Shao, Z. F., and Yu, W. B. (2020). Public epidemic prevention and control services based on big data of spatiotemporal location make cities more smart. Geomatics Inform. Sci. Wuhan University 45, 475–487. doi: 10.13203/j.whugis20200145

CrossRef Full Text | Google Scholar

Liu, Y. X., Song, C., Liu, Q. Y., Zhang, Z. X., Wang, X., Ma, J., et al. (2021). Spatial-temporal characteristics of COVID-19 in Chongqing and its relationship with human mobility. J. Geo-inform. Sci. 2, 222–235. doi: 10.13284/j.cnki.rddl.003228

CrossRef Full Text | Google Scholar

Liu, Z. Q., Ye, Y. Y., Zhang, H. O., Guo, H. X., Yang, J., and Wang, C. J. (2020). Spatio-temporal characteristics and transmission path of COVID-19 cluster cases in Zhuhai. Trop. Geography 40, 422–431.

Google Scholar

Müller, J., Kretzschmar, M., and Dietz, K. (2000). Contact tracing in stochastic and deterministic epidemic models. Health Care Manag. Sci. 10, 341–355. doi: 10.1016/S0025-5564(99)00061-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Niu, B., Liang, R., Zhang, S., Zhang, H., and Chen, Q. (2020). Epidemic analysis of covid in italy based on spatiotemporal geographic information and google trends. Transboundary Emerg. Dis. 68, 2384–2400. doi: 10.1111/tbed.13902

PubMed Abstract | CrossRef Full Text | Google Scholar

Read, J. M., Bridgen, J. R., Cummings, D. A., Ho, A., and Jewell, C. P. (2020). Novel coronavirus 2019-nCoV: early estimation of epidemiological parameters and epidemic predictions. Philo. Transact. R. Soc. B: Biol. Sci. 2020, 18549. doi: 10.1101/2020.01.23.20018549

CrossRef Full Text | Google Scholar

Tang, X. Y., Zhou, T., and Lu, B. C. (2017). Analysis of wide range population flow analysis based on mobile phone signaling. J, Chongqing Jiaotong University 36, 87.

Google Scholar

Walker, P., Whittaker, C., Watson, O. J., Baguelin, M., Winskill, P., Hamlet, A., et al. (2020). The impact of covid-19 and strategies for mitigation and suppression in low- and middle-income countries. Science 369, eabc0035. doi: 10.1126/science.abc0035

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J. B., Li, G., Wang, J. P., Qiang, J. Q., and Zhu, D. D. (2020). Spatiotemporal evolution and risk profiling of the COVID-19 epidemic in Shaanxi Province. Trop. Geography 40, 432–445.

Google Scholar

Wang, R.. (2017). Big Data Investigation. Tsinghua: Tsinghua University Press.

Google Scholar

Willem, L., Abrams, S., Petrof, O., Coletti, P., and Hens, N. (2020). The impact of contact tracing and household bubbles on deconfinement strategies for COVID-19: an individual-based modelling study. Nat. Commun. 12, 1524. doi: 10.1101/2020.07.01.20144444

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, X. K., Wen, C., Zhang, G. Y., Sun, H. C., Liu, B., and Wang, X. W. (2020). The geographical destination distribution and effect of outflow population of wuhan when the outbreak of COVID-19. J. University Electr. Sci. Technol. China 49, 324–329.

Google Scholar

Keywords: COVID-19, community, risk profiling, mobile phone trajectory data, the spatiotemporal spread

Citation: Zuo Q, Du J, Di B, Zhou J, Zhang L, Liu H and Hou X (2022) Research on Spatial-temporal Spread and Risk Profile of the COVID-19 Epidemic Based on Mobile Phone Trajectory Data. Front. Big Data 5:705698. doi: 10.3389/fdata.2022.705698

Received: 06 May 2021; Accepted: 23 March 2022;
Published: 27 April 2022.

Edited by:

Reza Lashgari, Shahid Beheshti University, Iran

Reviewed by:

Ariful Islam, EcoHealth Alliance, United States
Gregory R. Hart, Yale University, United States

Copyright © 2022 Zuo, Du, Di, Zhou, Zhang, Liu and Hou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qi Zuo, 1818688@qq.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.