Skip to main content

REVIEW article

Front. Digit. Health, 06 August 2021
Sec. Connected Health
This article is part of the Research Topic Health Technologies and Innovations to Effectively Respond to the COVID-19 Pandemic View all 21 articles

Data and Digital Solutions to Support Surveillance Strategies in the Context of the COVID-19 Pandemic

\nPatty KostkovaPatty Kostkova1Francesc Saigí-Rubi,Francesc Saigí-Rubió2,3Hans Eguia,Hans Eguia2,4Damian BorbollaDamian Borbolla5Marieke VerschuurenMarieke Verschuuren6Clayton HamiltonClayton Hamilton6Natasha Azzopardi-MuscatNatasha Azzopardi-Muscat6David Novillo-Ortiz
David Novillo-Ortiz6*
  • 1UCL Centre for Digital Public Health in Emergencies (dPHE), Institute for Risk and Disaster Reduction, University College London, London, United Kingdom
  • 2Faculty of Health Sciences, Universitat Oberta de Catalunya, Barcelona, Spain
  • 3Interdisciplinary Research Group on ICTs, Barcelona, Spain
  • 4SEMERGEN New Technologies Working Group, Madrid, Spain
  • 5Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, United States
  • 6Division of Country Health Policies and Systems, Regional Office for Europe, World Health Organization, Copenhagen, Denmark

Background: In order to prevent spread and improve control of infectious diseases, public health experts need to closely monitor human and animal populations. Infectious disease surveillance is an established, routine data collection process essential for early warning, rapid response, and disease control. The quantity of data potentially useful for early warning and surveillance has increased exponentially due to social media and other big data streams. Digital epidemiology is a novel discipline that includes harvesting, analysing, and interpreting data that were not initially collected for healthcare needs to enhance traditional surveillance. During the current COVID-19 pandemic, the importance of digital epidemiology complementing traditional public health approaches has been highlighted.

Objective: The aim of this paper is to provide a comprehensive overview for the application of data and digital solutions to support surveillance strategies and draw implications for surveillance in the context of the COVID-19 pandemic and beyond.

Methods: A search was conducted in PubMed databases. Articles published between January 2005 and May 2020 on the use of digital solutions to support surveillance strategies in pandemic settings and health emergencies were evaluated.

Results: In this paper, we provide a comprehensive overview of digital epidemiology, available data sources, and components of 21st-century digital surveillance, early warning and response, outbreak management and control, and digital interventions.

Conclusions: Our main purpose was to highlight the plausible use of new surveillance strategies, with implications for the COVID-19 pandemic strategies and then to identify opportunities and challenges for the successful development and implementation of digital solutions during non-emergency times of routine surveillance, with readiness for early-warning and response for future pandemics. The enhancement of traditional surveillance systems with novel digital surveillance methods opens a direction for the most effective framework for preparedness and response to future pandemics.

Introduction

In London's Soho district in 1854, the father of public health John Snow removed the handle of the local community water pump to stop the spread of the famous cholera outbreak (1). Snow proved cholera is a water-borne disease rather than an airborne “bad air” infection by manually mapping data of citizens infected and deaths onto a map (2). This began what we today call epidemiology and surveillance.

Public health surveillance is a regular routine process of collecting data on diseases, cases and public health interventions (such as vaccination) to inform public health authorities about the situation in order to respond with appropriate public health measures. Public health surveillance also includes early warning systems alerting about upcoming outbreaks and emergencies. In order to enable rapid response, inform public health policy and strategies (3). The recent increase of big data and digital and mobile technology has enabled the rapid growth of “digital epidemiology.”

Digital epidemiology provides numerous opportunities and challenges and now is an indispensable part of infectious disease surveillance systems. Epidemic intelligence is understood as the systematic collection and analysis of traditional (indicator-based epidemiological) and new data sources (event-based surveillance), which are used to identify new infection threats to provide early warning and rapid assessment of risk (4). Once a threat or an outbreak is detected and an event verified and risk assessed, a rapid response must be implemented to control the outbreak, including diagnosis, testing, contact tracing, and risk communication with the public.

These novel data sources arise from new digital solutions such as tracking devices, mobile applications (apps), and social media interventions; they can also contribute to infectious disease outbreak management. Digital epidemiology uses these devices and the data gathered to find new solutions to minimise disease spread as well as determine the population's behaviour and insights, e.g., behavioural reactions to public health interventions, contact tracing, and others (5). However, considerable computational and technical challenges arise from the rapid increase in relevant data from digital data sources: “Extracting meaningful information from this data deluge is challenging, but holds the unparalleled potential for epidemiology” (6). The general term for analysing and disseminating real-time health information from news and social media is also referred to as “infodemiology” (7, 8).

With the amount of data that can be collected through digital epidemiology, making sense of the data and determining whether it will adequately support epidemiological surveillance can pose difficulty. Applications and digital tools are being developed to process large volumes of unstructured data (big data) to help uncover useful information for problem solving. The term “big data” refers to the use and analysis of verified information that has been collected. This includes complex data that is rapidly collected in massive amounts, as long as the data is real and verifiable (9, 10). Digital sources of big data in healthcare include electronic medical records, genomics, imaging data, data from social networks, and sensor data (11).

Big data can be extracted from diverse real-time or static information sources that are often underutilised or not accessible, which could potentially increase the acquisition of new knowledge contributing to a better understanding of disease epidemiology. Algorithmic analysis to “train” data for classification or predictions for decision-making is a rapidly growing computer science domain called machine learning.

Digital epidemiological surveillance involves sources that are not typically used in traditional epidemiology, generating larger amounts of information that should be incorporated into public health systems as part of the response to traditional diseases, new emerging pathogens such as the COVID-19 virus we are currently fighting.

The main aim of this paper is to provide a comprehensive overview for the literature digital solutions and big data to support surveillance strategies in the context of the COVID-19 pandemic and beyond.

Methods

This research is a review of original research on digital surveillance from January 2015 until May 2020 with implications for opportunities and strategies for new outbreaks such as the COVID-19 pandemics and future emergencies.

A review of literature going back 15 years was conducted as 2005 was chosen as the year of the dawn of wide spread mobile technology and big data. We focused on syntesis of aproaches in order to draw implications for surveillance opportunities for COVID-19 and beyond (we are aware this study is not presenting a review of COVID-19 strategies as it is too soon to conduct such an excercise).

The search was conducted in the electronic database MEDLINE (accessed by PubMed) for articles published between January 2005 and May 2020 using combinations of the following free terms and Boolean operators (AND and OR):

COVID-19”[Title/Abstract] OR “COVID-19 diagnostic testing”[Supplementary Concept] OR “surveillance”[Title/Abstract] OR “Pandemics”[MeSH Terms] OR “epidemic control”[Title/Abstract] OR “self-diagnosis”[Title/Abstract] OR “self-evaluation”[Title/Abstract] OR “contact tracing”[Title/Abstract] AND (“digital health”[Title/Abstract] OR “information system*[Title/Abstract] OR “apps”[Title/Abstract] OR “eHealth”[Title/Abstract] OR “e-Health”[Title/Abstract] OR “electronic health record*[Title/Abstract] OR “big data”[Title/Abstract] OR “machine learning”[Title/Abstract] OR “data science”[Title/Abstract] OR “artificial intelligence”[Title/Abstract] OR “mHealth”[Title/Abstract] OR “m-Health”[Title/Abstract] OR “social media”[Title/Abstract] OR “IoT”[Title/Abstract] OR “smartphone”[Title/Abstract] OR “Internet of things”[Title/Abstract]).

The search was limited to English-, Portuguese-, and Spanish-language publications and was complemented using the snowballing technique to identify relevant articles in the reference lists of articles returned by our search (12). Additional search for grey literature was conducted regarding digital surveillance. Expanded grey literature searching included internet search engine, targeted websites and social media. The search is subject to a selection bias as publications were limited to the three major languages, however, as the majority of scientific literature is published in the three major languages this bios is minimal.

Diverse studies covering the use of digital data sources for surveillance during a health emergency were included. Initial screening was based on titles and abstracts, and articles were independently evaluated. Abstracts lacking sufficient information to identify their inclusion or exclusion, were retrieved for full-text evaluation. Subsequently, two investigators independently evaluated the full-text articles and determined eligibility for inclusion or exclusion. Authorship, journal, or years were not blinded.

Study Selection

The initial research included complete publications and abstracts that were reviewed to determine whether they met the inclusion or exclusion criteria. Abstracts lacking information were retrieved for full-text evaluation. The inclusion criteria were (1) original research articles, (2) studies conducted during outbreaks or pandemic situations that measured the use of digital tools for contact tracing, (3) studies on the application of data and digital solutions to support surveillance strategies, and (4) studies covering the use of digital data sources for surveillance during a health emergency. The exclusion criteria were studies that described the use of technology outside an epidemic, big data studies that were not focused on epidemiological problems, and other surveillance interventions that were not related to the use of digital health solutions.

After the first review of the titles and abstracts, 280 studies were selected. For the grey literature, 11 electronic notes were reviewed. After reading the full texts of these studies, 130 were deemed to have met the search selection criteria, one of them was grey literature. Two authors (DB and FSR) screened all articles individually. Discrepancies were resolved through discussion with a third author (DN) when necessary. All the data were analysed qualitatively and quantitatively. The search and selection processes are summarised in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1. Flow diagram of studies through the review.

Based on the surveillance and early-warning & response processes established at WHO, ECDC and member states, four core components of digital epidemiology were identified in previous research and expanded1:

• digital surveillance, supporting public health routine surveillance;

• early warning and epidemic intelligence, striving to improve early warning tools that alert public health professionals to upcoming threats. After a threat has been verified, big data analysis might support public health experts in:

• rapid response, outbreak control, and increasing the impact of public health measures through digital interventions; and

• risk communication and public communication being advanced through mobile apps and social media, while the improvement in epidemic modelling that leverages real-time heterogeneous data improves public health policy through better assessment of how control measures impact healthcare and society.

While our systematic review illustrates how digital technology has been contributing to these four subdisciplines for over a decade. In Table 1, we map them to the objectives of COVID-19 public health surveillance defined by World Health Organization (WHO) (13)—requiring a predominantly public health and healthcare sector response—complemented by opportunities provided by digital epidemiology leveraging novel big data streams.

TABLE 1
www.frontiersin.org

Table 1. Mapping of four digital epidemiology disciplines to WHO objectives of COVID-19 surveillance.

While reviewing the articles using an iterative process (with regular meetings), the digital data and innovation systems were described in line with the four digital epidemiology categories.

Results

Our results provide an overview of the literature identified throughout our search process and highlight the existing opportunities and applications for the found disciplines in digital epidemiology.

Digital Surveillance

Surveillance is a core component of public health and preventive medicine and can be categorised as “active” (where health authorities make direct contact with the population or care providers to measure the actual conditions) or “passive” (when the health authorities get pre-designed reports about specific conditions, typically with the care providers reporting) (14).

With the use of new technology—for example social networks (15)—the traditional surveillance process can be not only enhanced but also reach a wider population (16). Passive surveillance, drawing from a wide range of novel data sources, is less accurate and subject to elevated noise in the data, either by proactively searching related information and or by gathering it passively, making it less resource intensive (17).

Before an infectious disease is confirmed by a laboratory, infected persons may exhibit symptoms, signs, behavioural patterns, or laboratory findings that can be tracked through a variety of data sources (18). This relates to what is called “syndromic surveillance,” which may be boosted by the use of digital technologies.

Integrated or enhanced surveillance combines both systems (19): traditional and digital. The strength of syndromic surveillance systems is in early warning systems for emerging disease threats, such as dengue (20), while the strength of indicator-based (active) surveillance is the production of regular, robust, reliable surveillance reports (21). Syndromic surveillance threats could also capture undiagnosed infections (22, 23). A good example is metagenomics, as infected (and infectious) people who are asymptomatic may unwittingly spread the infection to others (24).

The use of online, mobile, and social media data streams for routine surveillance enhances traditional methods (3, 25). Also called “participatory surveillance,” (26)—disease and symptoms reported directly by citizens themselves rather than by health authorities—the new generation of health trackers and sensors brings new opportunities (5). In this type of surveillance, the structured data collection across all the participating countries enable study and research of subgroups, e.g., the health status of the population outside the health care system (27, 28) or vaccine effectiveness in vaccinated groups and attitudes towards vaccination (such as influenza vaccination) (29, 30). The desirable benefits for participants, improving the long-term engagement in standard reporting, include real-time information on disease rates at local or neighbourhood levels (31).

Participatory systems have also been successfully implemented for mapping tropical diseases such as Zika and dengue and during mass gatherings like the 2016 Olympics in Rio de Janeiro, Brazil (32). Such a “hybrid” system (3, 33) (for example, using online surveys, communicating events, or using websites and applications) was successfully explored by combining Zika-related Google searches, Twitter microblogs, and the HealthMap digital surveillance system (34, 35) and cross-validating them with traditional disease surveillance data (36). Combining diverse data sources, search queries, social media data, digital data from internet-based sources (25), and website visits have proven effective for digital surveillance systems (37). With a new generation of trackers and sensors, this kind of individual surveillance will soon increase in scope, intensity, and significance (5), especially for emerging diseases and epidemic outbreaks, helping to monitor populations and potential security threats (5). Online surveillance-mapping tools have the potential to improve the early detection of infectious diseases compared to traditional epidemiological tools (38).

Online surveillance-mapping tools used in tandem with traditional epidemiological tools can improve the early detection of infectious diseases as was accomplished with Ebola (38) and Zika with critical results. Mobile phone data was used in Kenya to help identify the dynamics of human carriers that drive malaria parasite importation between regions (39); Google Trends was used for epidemiologic searching for Mayaro virus (40). All of these technologies are being employed for the COVID-19 epidemic (41). Mobile phone data and apps where users can report cases are also valuable tools for surveillance purposes (42). Due to the universal ownership of mobile devices with Bluetooth, these types of applications can be used all over the world, including low-connectivity and low-resource environments like rural Sierra Leone (43).

Many other instances of the digital surveillance of emerging infectious diseases have successfully used digital media for public health management (44, 45), including dengue (46), chikungunya (47), Ebola (48), monkeypox (49), and influenza (3, 50). The most common disease surveillance that utilises social media analytics for early detection and surveillance is influenza (51). For example, digital surveillance in influenza was used in Canada to demonstrate the correlation between influenza incidence and Google Ads click rates (52). It was also used to find the correlation between influenza incidence and Yahoo search trends (53). It is not only used at a national level but also in regional and local areas (54, 55).

Many examples show digital surveillance and epidemic intelligence noticeably using social networks, indicating the essential role social media plays, even according to the World Health Organization (WHO), for which more than 60% of initial disease epidemic reports derive from unofficial sources (56). Twitter, for instance, is used for surveillance because it can be used in outbreaks or emergencies, in monitoring diseases, in prediction, in gauging public reactions, in lifestyle analysis, in geolocation, and other general applications (57). Twitter was also used for Middle East respiratory syndrome surveillance in Korea (58), to analyse the H1N1 pandemic in 2009 (59), and to broadcast information about Ebola (60). In the literature, infectious disease surveillance, predicting disease spread, dissemination of public health information, and assessment of the public's views on public health outbreaks are some of the roles suggested for Twitter (34, 45, 51, 5760).

Studies on digital information communication on social media sites are on the rise, as they have played an important role in real-time analysis and have been used for faster trend prediction (61); however, these studies are often small pilots that need more methodological rigour and scalability (62). More studies should be conducted using appropriate technologies such as web-based systems to help support data quality improvements and future reporting (63).

Early Warning and Epidemic Intelligence

Digital surveillance expands traditional epidemiology by adding information that previously did not exist. Important new sources of data include social networks, geographical location measured by GPS, wish lists, and consumption of mobile data, etc. These new types of information are not medically related (the classic epidemiological domain) but allow the expansion of early warning and prevention systems to prevent avoidable exposure to public health threats (64). Systematic review of digital data source their implication for health could be found in Li et al. (65)—some examples of novel data sources for the news of surveillance and epidemic intelligence are:

Online Media: Screening online news for mentions of specific diseases or conditions could be helpful to identify, for example, local food poisoning outbreaks. Special systems such as GPHIN (66), WHO EOIS (67), and HealthMap (68) screen all global media in multiple languages and were developed specifically to monitor epidemic events.

Online Searches: Search terms provide an invaluable geo-located monitoring tool for public information that could reveal public sentiment, shopping panics, or disease outbreaks, as demonstrated by Google Flu Trends in 2008 (40). While the opportunities are vast, commercial ownership of the search engines by Google and other tech giants prevents researchers and public health experts from exploring this resource for public health purposes. Online searches were also analysed on public medical websites such as the National Electronic Library of Infection and National Resource for Infection Control (69) to identify spikes in information needs resulting from the publication of major government guidelines (70).

Sensors, Digital Traces, and Internet of Things (IoT) Devices: Monitoring of population movements through citizens' digital traces via GPS-enabled phones, sensor networks, and credit/store cards seamlessly collects information about our moves, physical locations, purchases, online preferences, and payments and could provide early warnings of upcoming outbreaks. However, like Google, these datasets are mostly owned by commercial companies (supermarkets, pharmacies, etc.) and are not available for research or public health benefits.

Internet of things (IoT)-enabled devices and sensors allow real-time data streams from readings and measurements of environmental or smart home devices and could create opportunities for digital epidemiology such as mapping the spread of infection (71).

Mobile data and Mobility GPS data: Medical apps and games and health condition tracking devices support citizens in activities such as managing long-term conditions, increasing their physical activities, or losing weight (72) but create ethical challenges (64). GPS location and mobility data also play a part in locations and directions (navigation and mapping apps), or contact tracing (e.g., COVID-19 contacts) using Bluetooth, GPS, cellular location tracking and QR codes (73). For example, geo-coded electronic health records (EHR) were successfully mapped to better visualise the spread of methicillin-resistant Staphylococcus aureus (CA-MRSA), identifying risks for CA-MRSA in children that would not have been uncovered using traditional EHR analyses (74). For accuracy and privacy, Bluetooth technology with a decentralised server architecture is recommended (75). Bluetooth, however, is not precise enough to avoid false positive contacts (10).

Social Media Streams: Unlike digital traces, collected seamlessly, the increase of Web 2.0, user-generated content actively shared via social networking tools, has seen an unprecedented explosion. The privacy settings of Facebook, Instagram, and other social networks allow users to restrict their profile content and activity. Consequently, the most important social media channel for research has undoubtedly become Twitter due to a relatively open data policy that allows researchers and IT developers access to tweets through an open, free API, returning a 1% random sample of raw tweets free of charge.

The use of social network data can also provide early warning because analysis can detect a peak in an outbreak up to 2 weeks before the official public health authorities (as occurred in 2009 for swine flu) (76). In 2019, earlier than the official reports, Twitter reported almost one-third of the total notifications related to avian influenza outbreak (77). This corroborates that social networks can serve as a method to obtain valuable information on a disease's behaviour or spread, even a week in advance of what the general practitioners in a particular locality could report (78). The evaluation of commonly used drugs for seasonal influenza on Twitter also provides surveillance ahead of the flu season (79).

Social media data allow disease tracking (80, 81) and can help make predictions that could prevent danger to the population (3). Social media analytics in correlation with traditional laboratory data can predict an outbreak; examples of this include the cases of influenza and cholera (82) or Ebola and Marburg filoviruses (83). Some researchers also conducted a content analysis to identify key trends during the 2009 H1N1outbreak that could also correlate with outbreak incidence data (84). Even concerning the coronavirus, Kogan et al. found that Twitter could be used as a kind of barometer, showing potential growth 2–3 weeks before the growth of coronavirus cases (by region) (85) and in other cases even 8–12 days before the outbreak (86). This is not new. An analysis of more than 500 million tweets worldwide found a significant association between the geographic locations of HIV-related tweets and HIV prevalence (87).

Twitter data in 2010 accurately tracked the spread of cholera, but researchers advised that this type of information is not always reliable and must not replace traditional epidemiological methods, as information and guidance is missed (88). Moreover, social network analyses need to be challenged and scrutinised by the “ground truth.” In 2016, Mowery et al. conducted a study describing how epidemiological surveillance of influenza using Twitter incorrectly predicted the 2011–2012 flu season 3 months early (89).

Improvements in services and cost reductions in the health sector coupled with the need for early warnings for the onset of adverse health conditions are the main drivers of these developments and new sources of data (90). These novel data sources collect a massive amount of new information, and it can be difficult for big data tools to review the data obtained and present it as new useful knowledge for the prevention of a pandemic outbreak. However, the use of big data is proving crucial for the COVID-19 pandemic (91).

Rapid Response, Outbreak Control, and Digital Interventions

Once a threat or an outbreak is detected, an event verified, and risk assessed, a rapid response must be implemented to control the outbreak, which ranges from diagnosis, testing, and contact tracing to risk communication with citizens.

This iterative process includes discovering patterns and generating new information (92) that can be used to control the outbreak. Digital information and new technologies are providing a quick response and coordinated control and management of possible outbreaks. Through monitoring cases using mobile technology, contact tracing infected citizens, following up with patients, and providing medical advice, digital and mobile technology can successfully complement the efforts of medical and public health experts (93).

With social networks, big data enables the early epidemiological storey of an outbreak to be reconstructed (94). Data streams and non-classical datasets in the early stages of the outbreak can inform the design and implementation of effective public health measures (95).

Big data could effectively support the rapid response to and better control of an outbreak but could do so more quickly and generate more accurate predictions. The algorithms that accomplish this are known as artificial intelligence (AI). For example, Toronto's surveillance system was first to detect the COVID-19 epidemic outbreak in the first reported epicentre of Wuhan (96). Also, scientists from the John Hopkins University visualised the spread of the coronavirus in real-time (95). AI is changing the landscape in public health and clinical management with very promising results (97). For example, a project seeking information related to the COVID-19 called Evidence Navigator provides computer-generated evidence maps of scientific publications on the pandemic, which is updated daily in PubMed (98).

Machine learning (ML) is a dramatically growing computer science AI discipline investigating algorithms to find new results or predictions without looking for specific solutions. ML was used to analyse several projects using the internet to enhance epidemiological surveillance and disease prediction [e.g., malaria (99), dengue (100), and influenza (101)]. The research demonstrated a positive predictive value of the incidence of infectious diseases (101) or little predictive value (102). ML was successfully used to create an evidence-based guideline from the information gathered from the Ebola virus epidemic and turn it into an application called the Ebola Care Guidelines app (103) to inform the general population and healthcare providers of updated, evidence-based guidelines in real time during a global pandemic (41).

In Africa, the rapid recognition of localised areas of higher transmission of Ebola and the resulting quantitative assessment could support the optimal deployment of public health resources (104). In Latin America and the Caribbean, the rapid integration of Zika virus prevention recommendations into sexual and reproductive health services made it possible to reduce the incidence of the Zika virus (105). In 2012, digital pens were used by the New Hampshire Department of Health and Human Services to rapidly acquire epidemiologic data during a gastrointestinal illness outbreak (106), providing rapid assessment, response, and control measures before the problem multiplied.

The novel digital information gathered about the coronavirus is being used in Taiwan in conjunction with their immigration database to classify travellers according to different risk types and to issue alerts in real time to prevent infections (107). This is often called a “digital fence.” Russia, China, and Poland have used facial recognition software to monitor population compliance with government policies (42). Applications exist for self-testing, quarantine monitoring, and contact tracing that are being already used in many parts of the world such as India to support the rapid response to COVID-19 (108).

Online news surveillance and mapping tools have successfully provided early warnings, but their potential to improve response and risk assessment as the outbreak progresses over time has led to the adoption by WHO operations of the most robust systems: e.g., GPHIN, Medisys, SORMAS (Surveillance and Outbreak Response Management and Analysis System) (109) and HealthMap (68). News surveillance also means that real-time feedback and effective responses should function as an intervention. Locally appropriate technologies, such as web-based systems and mobile phones, can help support data quality improvements and reporting timeliness (63).

In response to the COVID-19 outbreak, the rapid development and deployment of point-of-care (POC) diagnostics for screening have shown to help slow the spread of the disease (110), demonstrating that telemedicine can be used as a means of surveillance. For example, Botswana is using telemedicine to control patients remotely (111), and obstetric departments in the US are monitoring coronavirus patients (112). Rapid deployment of an in-patient telemedicine response is feasible across many settings in response to the COVID-19 pandemic (113).

Public Communication and Evaluation of the Impact on Healthcare and Society

An essential piece at the centre of outbreak response success is public communication: a clear and concise message communicating the risk, measures, and policies taken by the government. Digital technologies complement traditional mass media and play an increasingly important role for sharing reliable, evidence-based information with the public and gaining citizens' buy-in (114).

While traditional media channels—newspapers and television—are still actively used for mass communication, the role of social media has grown dramatically. In particular, Twitter has demonstrated great potential to be used not only for tracking epidemics but also to inform citizens about the risks of pandemics in real time (57). Social media can be also used to study the public's risk perceptions (115). Mobile apps and wearable devices have further been used to monitor patient behaviour to provide personalised service and advice (116, 117). Twitter was analysed for risk communication potential during the epidemics of the Middle East respiratory syndrome (58), SARS (45), Ebola virus (60), Zika virus (34), H1N1 (“swine flu”) (59), and H7N9 (“avian flu”) (62). A study in Vietnam includes social media and science journalist for COVID-19 public policy2. The studies involving Twitter and other social networks summarise how they could be useful tools for disseminating information to the population about how to avoid the spread of the outbreak (118).

Specifically, role-play social media, a channel for gauging public attitudes, could effectively be used to disseminate risk communication in real time (78) but could be a double-edged sword in that also it is prone to misuse, misinformation, and the spread of fake news. How online channels inform healthcare professionals and the public and their genuine information needs to be part of any comprehensive government communication plan (70).

Discussion

The results have shown many opportunities ranging from the use of social networks to the use of AI and big data for digital surveillance and reference early warning and epidemic intelligence, rapid response, outbreak control, risk communication, and public communication. The results present solutions that have been implemented in many countries with differing results because of their diverse socio-cultural realities. In August 2020, WHO updated the objectives for COVID-19 surveillance: (i) enable rapid detection, isolation, testing, and management of cases; (ii) monitor trends in COVID-19 deaths; (iii) identify, follow-up, and quarantine of contacts; (iv) detect and contain clusters and outbreaks, especially among vulnerable populations; (v) guide the implementation and adjustment of targeted control measures while enabling safe resumption of economic and social activities; (vi) evaluate the impact of the pandemic on healthcare systems and society; (vii) monitor longer term epidemiologic trends and evolution of SARS-CoV-2 virus; and (viii) contribute to the understanding of the co-circulation of SARS-CoV-2 virus, influenza and other respiratory viruses, and other pathogens (119).

With digital epidemiology's subdisciplines already assessed and ready for use, the population must be made aware of the measures to be followed to control the outbreak using (ix) public communication and evaluation of the impact on healthcare and society as culture might shape how people think about privacy and surveillance Risk communication and public communication are now taken to a new level through mobile apps and social media, while the advances in epidemic modelling that leverage real-time heterogeneous data improve public health policy through better assessment of the impact of control measures on healthcare and society.

Mobile applications and mHealth approaches could be used for public health surveillance due to their multiple benefits as an efficient, almost universal presence (120), a contributor to high digital literacy, and a source that creates wide availability of data. With the number of mobile phones around 14 billion units worldwide and expecting an increase to almost 17 billion by 2023, the options available with this type of device must be considered. The use of mobile phones and Bluetooth to assess the follow-up of potential patients has always been a strong alternative (121). Apps with tracing functionalities using Bluetooth technology can support health authorities in the contact tracing process, identifying the possible contacts (known or unknown) of a confirmed/positive case, creating a network that will function as epidemic control if it is used by enough people (122). Some examples exist of mobile apps and mHealth being used already. For example, the Chinese government required citizens in 200 cities to use the Alipay app, assigning a risk code (green, yellow, red) to each person indicating to what extent they were allowed to move around the community. An algorithm incorporated information about the time they spent in risky locations and the frequency of contact with other people (42).

Although these technologies only use monitoring data temporarily, many people are reluctant to use them because they think that their data will be sold to private companies or they will continue to be monitored once the pandemic is over (123). Therefore, notification is necessary that the data obtained will be used only for monitoring purposes, that it will not be transferred to any public or private company, and that it will only be used in periods of a pandemic or used as anonymised data (124).

Mobile devices (including tablets, wearables, etc.) also can connect to social networks, which allows their users to be informed of the latest news about the pandemic. At the same time, they can be used to provide measures to avoid contagion. Social media's power of influence is vast, becoming an important tool public health during a pandemic, which is precisely a difference between digital and traditional epidemiology. Social networks can provide a different perspective than the traditional approach that relies on health reports by providing important correlations with abnormal disease trends that could indicate a potential outbreak.

The use of social media also offers significant opportunities to encourage citizen engagement in crisis management (125). However, not all the most-read information on social networks is true. Many celebrity “influencers” (people with many followers, such as actors, athletes, etc.) publish controversial posts such as false cures or prophylactics for the coronavirus or indicate that it is simply an invention (126). This is due to voluntary submission and a lack of gatekeeping (127). The advances in AI could help in this facet (128). This kind of digital technology, correctly applied, could benefit the healthcare landscape in public health and clinical management with promising results (97). It has already been used for some time to handle data related to the coronavirus (96).

Big data use for surveillance seems to be a beneficial tool for public health but it must follow a rigorous statistical analysis. Qin et al. used social media search indexes (SMSI) to predict the new number of COVID-19 cases that could be detected 6–9 days in advance (129), but the prediction is generated after an algorithm has evaluated the information. With all this generation of digital information, there is a risk of losing what is impactful to the epidemiological study. To accomplish an “understanding of all data” created by social networks, the use of big data has also been implemented.

While useful in digital epidemiology, big data can also serve as a standard to control possible adverse effects and can support traditional epidemiology by discovering additional facts from social media behaviour that may complement other data from the population. Furthermore, it can help overcome some challenges, such as geographic heterogeneity, insufficient representation in developing countries, and spatial/temporal uncertainty in the information obtained. On the other hand, a disadvantage of using internet search data or data from social networks for surveillance purposes is its—eventual—lack of representativeness, as well as possible fake results. In order to create big data, information must be collected and processed before it can be used; therefore, the WHO has called for requiring the detection and management of suspicious cases at entry points worldwide to generate this information to avoid the spread of COVID-19 (130). Information from symptoms checkers and laboratory data could help to estimate the predictive value of the respiratory symptoms on the community as well as present facts on the level of virus circulation (93).

The use of this technology would allow epidemiologists to evaluate millions of digital trails of people who constantly use their digital equipment such as mobile phones for social networking. However, an ideal prediction using machine learning and AI is not yet realised; the tracking and prediction of how COVID-19 will spread are not yet completely reliable. This could be due to two reasons: (1) the availability of COVID-19-related clinical data is a key barrier; and (2) AI requires data on COVID-19 to train itself (131). Therefore, the WHO recommends being cautious with the implementation of digital solutions until the utility of public health policies are better understood. Planned solutions should cover and allow (i) quality monitoring, transparency, and accountability, (ii) resource allocation optimisation, (iii) citizen participation and inclusion, and (iv) resilience and adaptation to exogenous events.

Several promising initiatives have been launched to gather and share both existing and new data using new AI models, including WHO's Global Research on Coronavirus Disease Database, along with the GISAID Initiative (Global Initiative on Sharing All Influenza Data); the COVID-19 Open Research Dataset Challenge of Kaggle data science platform; the around 20,000 related articles in ScienceDirect in its Novel Coronavirus Information Centre early-stage and peer-reviewed research on COVID-19; etc. Finally, the initiative formed by Microsoft, Facebook, Semantic Scholar, the Allen Institute for AI, and five other collaborators to make the COVID-19 Open Research Dataset (CORD-19) openly available, which contains about 44,000 scholarly articles for data mining, is worth mentioning.

Finally, there are a number of related subjects that are beyond the scope of this paper. Firstly, cost issues and culture aspects of surveillance were not part of our initial search strategies, however, it is important to note the cost to setting up and conducting digital surveillance, especially in low and middle income countries4,5. Culture attitudes (positive or negative) towards digital surveillance are also essential to consider for a successful deployment of a digital solution6,7 as demonstrated in fight against COVID-19 in Vietnam8. Final limitation of this study includes our search strategy focusing on PubMed and therefore not covering studies published in computer science outlets such as conferences published in ACM and IEEE libraries, such as pioneering Twitter research for epidemic intelligence and eastly warning (see text footnote 1)—this is a subject for a future research.

Conclusions

We have highlighted the opportunities and challenges for digital epidemiology as a growing discipline that have contributed to surveillance of the COVID-19 pandemic and highlighted how digital epidemiology will become indispensable for fighting future public health and natural disasters and pandemics.

Prevention and control of the COVID-19 pandemic requires public health and epidemiology measures. The use of digital technology enhances traditional epidemiological means to contain outbreak and supports prevention, early warning, rapid response, and digital interventions such as remote care for patients or providing reliable information to the public. In addition, it is crucial that the technology is inclusive and user friendly (for example, social networks and specially designed apps). However, additional support strategies are required for vulnerable groups who are not active technology users.

The key opportunities and challenges for effective digital epidemiology systems for the 21st century lie in front of us. To improve capacity and preparedness for later epidemics, the repurposed and emerging systems for COVID-19 need to be fully developed and evaluated after the crisis with the goal of creating fully integrated, interoperable digital epidemiology solutions at national and international levels.

Author Contributions

All authors contributed sufficiently and meaningfully to the conception, design, drafting, editing, and revising the manuscript.

Author Disclaimer

DN-O, NA-M, and CH are staff members of the WHO. The authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy, or views of the WHO.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

References

1. Fine P, Victora CG, Rothman KJ, Moore PS, Chang Y, Curtis V, et al. John Snow's legacy: epidemiology without borders. Lancet. (2013) 381:1302–11. doi: 10.1016/S0140-6736(13)60771-0

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Mapping A London Epidemic | National Geographic Society. (n.d.). Available online at: https://www.nationalgeographic.org/activity/mapping-london-epidemic/ (accessed July 9, 2021).

3. Aiello AE, Renson A, Zivich PN. Social media– and internet-based disease surveillance for public health. Annu Rev Public Health. (2020) 41:101–18. doi: 10.1146/annurev-publhealth-040119-094402

PubMed Abstract | CrossRef Full Text | Google Scholar

4. St Louis C, Zorlu G. Can Twitter predict disease outbreaks? BMJ. (2012) 344:e2353. doi: 10.1136/bmj.e2353

CrossRef Full Text | Google Scholar

5. Samerski S. Individuals on alert: digital epidemiology and the individualization of surveillance. Life Sci Soc Pol. (2018) 14:13. doi: 10.1186/s40504-018-0076-z

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Salathé M, Bengtsson L, Bodnar TJ, Brewer DD, Brownstein JS. Digital epidemiology. PLoS Comput Biol. (2012) 8:e1002616. doi: 10.1371/journal.pcbi.1002616

CrossRef Full Text | Google Scholar

7. Barros JM, Duggan J, Rebholz-Schuhmann D. The application of internet-based sources for public health surveillance (infoveillance): systematic review. J Med Internet Res. (2020) 22:e13680. doi: 10.2196/13680

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Park HW, Park S, Chong M. Conversations and medical news frames on twitter: infodemiological study on COVID-19 in South Korea. J Med Internet Res. (2020) 22:e18897. doi: 10.2196/18897

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Wyber R, Vaillancourt S, Perry W, Mannava P, Folaranmi T, Celi LA. Big data in global health: improving health in low- and middle-income countries. Bull World Health Organ. (2015) 93:203–8. doi: 10.2471/BLT.14.139022

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Ward JS, Barker A. Undefined By Data: A Survey of Big Data Definitions. (2013). https://arxiv.org/abs/1309.5821v1 (accessed July 9, 2021).

Google Scholar

11. Gaitanou P, Garoufallou E, Balatsoukas P. The effectiveness of big data in health care: a systematic review. Commun Comput Inform Sci. (2014) 478:141–53. doi: 10.1007/978-3-319-13674-5_14

CrossRef Full Text | Google Scholar

12. Greenhalgh T, Peacock R. Effectiveness and efficiency of search methods in systematic reviews of complex evidence: audit of primary sources. BMJ. (2005) 331:1064–5. doi: 10.1136/bmj.38636.593461.68

PubMed Abstract | CrossRef Full Text | Google Scholar

13. World Health Organization (WHO). Management of Ill Travellers at Points of Entry - International Airports, Seaports and Ground Crossings - in the Context of COVID-19 Outbreak (2020). Available online at: https://www.who.int/publications/i/item/10665-331512 (accessed July 9, 2021).

Google Scholar

14. Bergquist T, Pejaver V, Hammarlund N, Mooney SD, Mooney SJ. Evaluation of the secondary use of electronic health records to detect seasonal, holiday-related, and rare events related to traumatic injury and poisoning. BMC Public Health. (2020) 20:46. doi: 10.1186/s12889-020-8153-7

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Hartley DM. Using social media and internet data for public health surveillance: the importance of talking. Milbank Q. (2014) 92:34–9. doi: 10.1111/1468-0009.12039

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Salathé M. Digital epidemiology: what is it, and where is it going? Life Sci Soc Policy. (2018) 14:1. doi: 10.1186/s40504-017-0065-7

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Yan SJ, Chughtai AA, Macintyre CR. Utility potential of rapid epidemic intelligence from internet-based sources. Int J Infect Dis. (2017) 63:77–87. doi: 10.1016/j.ijid.2017.07.020

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Mandl KD, Overhage JM, Wagner MM, Lober WB, Sebastiani P, Mostashari F, et al. Implementing syndromic surveillance: a practical guide informed by the early experience. J Am Med Inform Assoc. (2004) 11:141–50. doi: 10.1197/jamia.M1356

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Nsubuga P, White ME, Thacker SB, Anderson MA, Blount SB, Broome CV, et al. Public Health Surveillance: A Tool for Targeting and Monitoring Interventions. (2006). Disease Control Priorities in Developing Countries. The International Bank for Reconstruction and Development/The World Bank.

PubMed Abstract | Google Scholar

20. Bringay S, Flamand C, Quenel P, Ardillon V, Carvalho L, Teisseire M. The Epidemiologic Surveillance of Dengue-Fever in French Guiana: When Achievements Trigger Higher Goals Influenza Surveillance View Project Air Pollution View Project Claude Flamand, Institut Pasteur International Network (2016).

Google Scholar

21. Brownstein JS, Freifeld CC, Reis BY, Mandl KD. Surveillance sans frontières: internet-based emerging infectious disease intelligence and the healthmap project. PLoS Med. (2008) 5:e151. doi: 10.1371/journal.pmed.0050151

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Henning KJ. Overview of syndromic surveillance what is syndromic surveillance? September. (2004) 24:2004. doi: 10.1037/e307182005-001

CrossRef Full Text | Google Scholar

23. Muscatello DJ, Churches T, Kaldor J, Zheng W, Chiu C, Correll P, et al. An automated, broad-Based, near real-Time public health surveillance system using presentations to hospital emergency departments in new south wales, australia. BMC Public Health. (2005) 5:141. doi: 10.1186/1471-2458-5-141

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Gilbert GL, Degeling C, Johnson J. Communicable disease surveillance ethics in the age of big data and new technology. Asian Bioethics Rev. (2019) 11:173–87. doi: 10.1007/s41649-019-00087-1

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Milinovich GJ, Avril SMR, Clements ACA, Brownstein JS, Tong S, Hu W. Using internet search queries for infectious disease surveillance: screening diseases for suitability. BMC Infect Dis. (2014) 14:690. doi: 10.1186/s12879-014-0690-1

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Fernandez-Luque L, Karlsen R, Melton GB. HealthTrust: a social network approach for retrieving online health videos. J Med Internet Res. (2012) 14:e22. doi: 10.2196/jmir.1985

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Guerrisi C, Turbelin C, Blanchon T, Hanslik T, Bonmarin I, Levy-Bruhl D, et al. Participatory syndromic surveillance of influenza in Europe. J Infect Dis. (2016) 214: S386–92. doi: 10.1093/infdis/jiw280

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Debin M, Turbelin C, Blanchon T, Bonmarin I, Falchi A, Hanslik T. Evaluating the feasibility participants' representativeness of an online nationwide surveillance system for influenza in france. PLoS ONE. (2013) 8:e73675. doi: 10.1371/journal.pone.0073675

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Debin M, Colizza V, Blanchon T, Hanslik T, Turbelin C, Falchi A. Effectiveness of 2012-2013 influenza vaccine against influenza-like illness in general population: estimation in a french web-Based cohort. Hum Vaccines Immunother. (2014) 10:536–43. doi: 10.4161/hv.27439

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Boiron K, Sarazin M, Debin M, Raude J, Rossignol L, Guerrisi C, et al. Opinion about seasonal influenza vaccination among the general population 3 years after the a(H1N1)Pdm2009 influenza pandemic. Vaccine. (2015) 33:6849–54. doi: 10.1016/j.vaccine.2015.08.067

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Koppeschaar CE, Colizza V, Guerrisi C, Turbelin C, Duggan J, Edmunds WJ, et al. Influenzanet: citizens among 10 countries collaborating to monitor influenza in europe. JMIR Public Health Surveill. (2017) 3:e66. doi: 10.2196/publichealth.7429

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Leal Neto O, Cruz O, Albuquerque J, Nacarato de Sousa M, Smolinski M, Pessoa Cesse EÂ, et al. Participatory surveillance based on crowdsourcing during the rio 2016 olympic games using the guardians of health platform: descriptive study. JMIR Public Health Surveill. (2020) 6:e16119. doi: 10.2196/16119

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Bansal S, Chowell G, Simonsen L, Vespignani A, Viboud C. Big data for infectious disease surveillance and modeling. J Infect Dis. (2016) 214:S375–9. doi: 10.1093/infdis/jiw400

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Masri S, Jia J, Li C, Zhou G, Lee MC, Yan G, et al. Use of Twitter data to improve zika virus surveillance in the United States during the 2016 epidemic. BMC Public Health. (2019) 19:761. doi: 10.1186/s12889-019-7103-8

PubMed Abstract | CrossRef Full Text | Google Scholar

35. McGough SF, Brownstein JS, Hawkins JB, Santillana M. Forecasting zika incidence in the 2016 Latin America outbreak combining traditional disease surveillance with search, social media, and news report data. PLoS Neglect Trop Dis. (2017) 11:e0005295. doi: 10.1371/journal.pntd.0005295

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Li J, Xu Q, Cuomo R, Purushothaman V, Mackey T. Data mining and content analysis of the chinese social media platform weibo during the early cOVID-19 outbreak: retrospective observational infoveillance study. JMIR Public Health Surveill. (2020) 6:e18700. doi: 10.2196/18700

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Seo DW, Shin SY. Methods using social media and search queries to predict infectious disease outbreaks. Healthc Inform Res. (2017) 23:343–8. doi: 10.4258/hir.2017.23.4.343

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Bempong NE, De Castañeda RR, Schütte S, Bolon I, Keiser O, Escher G, et al. Precision global health - the case of ebola: a scoping review. J Glob Health. (2019) 9:010404. doi: 10.7189/jogh.09.010404

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Wesolowski A, Eagle N, Tatem AJ, Smith DL, Noor AM, Snow RW, et al. Quantifying the impact of human mobility on malaria. Science. (2012) 338:267–70. doi: 10.1126/science.1223467

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Adawi M, Bragazzi NL, Watad A, Sharif K, Amital H, Mahroum N. Discrepancies between classic and digital epidemiology in searching for the mayaro virus: preliminary qualitative and quantitative analysis of google trends. JMIR Public Health Surveill. (2017) 3:e93. doi: 10.2196/publichealth.9136

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Alwashmi MF. The use of digital health in the detection and management of COVID-19. Int J Environ Res Public Health. (2020) 17:2906. doi: 10.3390/ijerph17082906

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Mello MM, Wang CJ. Ethics and governance for digital disease surveillance. Science. (2020) 368:951–4. doi: 10.1126/science.abb9045

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Danquah LO, Hasham N, MacFarlane M, Conteh FE, Momoh F, Tedesco AA, et al. Use of a mobile application for Ebola contact tracing and monitoring in northern Sierra Leone: a proof-of-concept study. BMC Infect Dis. (2019) 19:810. doi: 10.1186/s12879-019-4354-z

PubMed Abstract | CrossRef Full Text

44. Geneviève LD, Martani A, Wangmo T, Paolotti D, Koppeschaar C, Kjelsø C, et al. Participatory disease surveillance systems: ethical framework. J Med Internet Res. (2019) 21:e12273. doi: 10.2196/12273

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Carrion M, Madoff LC. ProMED-mail: 22 years of digital surveillance of emerging infectious diseases. Int Health. (2017) 9:177–83. doi: 10.1093/inthealth/ihx014

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Buczak AL, Baugher B, Moniz LJ, Bagley T, Babin SM, Guven E. Ensemble method for dengue prediction. PLoS ONE. (2018) 13:e0189988. doi: 10.1371/journal.pone.0189988

CrossRef Full Text | Google Scholar

47. Del Valle SY, McMahon BH, Asher J, Hatchett R, Lega JC, Brown HE, et al. Summary results of the 2014-2015 DARPA Chikungunya challenge. BMC Infect Dis. (2018) 18:245. doi: 10.1186/s12879-018-3124-7

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Viboud C, Sun K, Gaffey R, Ajelli M, Fumanelli L, Merler S, et al. The rAPIDD ebola forecasting challenge: synthesis and lessons learnt. Epidemics. (2018) 22:13–21. doi: 10.1016/j.epidem.2017.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Silenou BC, Tom-Aba D, Adeoye O, Arinze CC, Oyiri F, Suleman AK, et al. Use of surveillance outbreak response management and analysis system for human monkeypox outbreak, nigeria, 2017-2019. Emerg Infect Dis. (2020) 26:345–9. doi: 10.3201/eid2602.191139

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Santillana M, Nguyen AT, Dredze M, Paul MJ, Nsoesie EO, Brownstein JS. (2015). Combining search, social media, and traditional data sources to improve influenza surveillance. PLoS Comput Biol. 11:E1004513. doi: 10.1371/journal.pcbi.1004513

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Bernardo TM, Rajic A, Young I, Robiadek K, Pham MT, Funk JA. Scoping review on search queries and social media for disease surveillance: a Chronology of innovation. J Med Internet Res. (2013) 15:e147. doi: 10.2196/jmir.2740

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Eysenbach G. Infodemiology and infoveillance: framework for an emerging set of public health informatics methods to analyze search, communication and publication behavior on the Internet. J Med Internet Res. (2009) 11:e11. doi: 10.2196/jmir.1157

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Porta M. Incidence (Syn: Incidence Number). In: Porta M, editor. A Dictionary of Epidemiology. New York, NY: Oxford University Press (2015). p. 144.

54. Carneiro HA, Mylonakis E. Google trends: a web-based tool for real-time surveillance of disease outbreaks. Clin Infect Dis. (2009) 49:1557–64. doi: 10.1086/630200

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Chunara R, Aman S, Smolinski M, Brownstein JS. Flu near you: an online self-reported influenza surveillance system in the USA. Online J Public health Inform. (2013) 5:4456. doi: 10.5210/ojphi.v5i1.4456

CrossRef Full Text | Google Scholar

56. WHO | Epidemic Intelligence - Systematic Event Detection. WHO (2020).

57. Aramburu MJ, Berlanga R, Lanza I. Social Media Multidimensional Analysis for Intelligent Health Surveillance. Int J Environ Res Public Health. (2020) 17:2289. doi: 10.3390/ijerph17072289

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Shin SY, Seo DW, An J, Kwak H, Kim SH, Gwack J, et al. High Correlation of middle east respiratory syndrome spread with google search and twitter trends in Korea. Scie Rep. (2016) 6:32920. doi: 10.1038/srep32920

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Ahmed W, Bath PA, Sbaffi L, Demartini G. Novel insights into views towards h1N1 during the 2009 pandemic: a Thematic analysis of twitter data. Health Info Libr J. (2019) 36:60–72. doi: 10.1111/hir.12247

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Liang H, Fung ICH, Tse ZTH, Yin J, Chan CH, Pechta LE, et al. How did Ebola information spread on Twitter: broadcasting or viral spreading? BMC Public Health. (2019) 19:438. doi: 10.1186/s12889-019-6747-8

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Alessa A, Faezipour M. A review of influenza detection and prediction through social networking sites. Theor Biol Med Model. (2018) 15:2. doi: 10.1186/s12976-017-0074-5

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Tang L, Bie B, Park SE, Zhi D. Social media and outbreaks of emerging infectious diseases: a systematic review of literature. Am J Infect Control. (2018) 46:962–72. doi: 10.1016/j.ajic.2018.02.010

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Ohrt C, Roberts KW, Sturrock HJW, Wegbreit J, Lee BY, Gosling RD. Information systems to support surveillance for malaria elimination. Am J Trop Med Hygiene. (2015) 93:145–52. doi: 10.4269/ajtmh.14-0257

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Kostkova P. Disease surveillance data sharing for public health: the next ethical frontiers. Life Sci Soc Policy. (2018) 14:16. doi: 10.1186/s40504-018-0078-x

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Li L, Novillo-Ortiz D, Azzopardi-Muscat N, Kostkova P. Digital data sources and their impact on people's health: a systematic review of systematic reviews. Front Public Health. (2021) 9:645260. doi: 10.3389/fpubh.2021.645260

PubMed Abstract | CrossRef Full Text | Google Scholar

66. About - GPHIN - Canada.Ca. (n.d.). Available online at: https://gphin.canada.ca/cepr/aboutgphin-rmispenbref.jsp?language=en_CA (accessed June 18, 2021).

67. World Health Organization (WHO). (n.d.). Early Detection, Verification, Assessment and Communication. Epidemic Intelligence from Open Sources (EIOS). Available online at: https://www.who.int/initiatives/eios (accessed July 9, 2021).

68. Flu and Ebola Map | Virus and Contagious Disease Surveillance. (n.d.). Available online at: https://healthmap.org/pt/ (accessed July 9, 2021).

69. Welcome to National Resource for Infection Control (NRIC) | National Resource for Infection Control (NRIC). London: University College London (n.d.).

70. Kostkova P, Fowler D, Wiseman S, Weinberg JR. Major infection events over 5 years: how is media coverage influencing online information needs of health care professionals and the public? J Med Int Res. (2013) 15:e107. doi: 10.2196/jmir.2146

PubMed Abstract | CrossRef Full Text | Google Scholar

71. Peeri NC, Shrestha N, Rahman MS, Zaki R, Tan Z, Bibi S, et al. The sARS, mERS and novel coronavirus (COVID-19) epidemics, the newest and biggest global health threats: what lessons have we learned? Int J Epidemiol. (2020) 49:717–26. doi: 10.1093/ije/dyaa033

PubMed Abstract | CrossRef Full Text | Google Scholar

72. Kostkova P. Grand challenges in digital health. Front Public Health. (2015) 3:1. doi: 10.3389/fpubh.2015.00134

CrossRef Full Text

73. Soontornpipit P, Viwatwongkasem C, Taratep C, Teerawat W, Vanitchatchavan P. Development of the electronic surveillance monitoring system on web applications. Proc Comput Sci. (2016) 86:244–7. doi: 10.1016/j.procs.2016.05.110

PubMed Abstract | CrossRef Full Text | Google Scholar

74. Immergluck LC, Leong T, Matthews K, Malhotra K, Parker TC, Ali F, et al. Geographic surveillance of community associated MRSA infections in children using electronic health record data. BMC Infect Dis. (2019) 19170. doi: 10.1186/s12879-019-3972-9

PubMed Abstract | CrossRef Full Text | Google Scholar

75. How Apple Google Are Enabling Covid-19 Bluetooth Contact-Tracing | WIRED (2020). Available online at: https://www.wired.com/story/apple-google-bluetooth-contact-tracing-covid-19/ (accessed July 9, 2021).

76. De Quincey E, Kostkova P. Early warning and outbreak detection using social networking websites: the potential of twitter. In: Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, 27 LNICST:21–24. Berlin, Heidelberg: Springer (2010). doi: 10.1007/978-3-642-11745-9_4

CrossRef Full Text | Google Scholar

77. Yousefinaghani S, Dara R, Poljak Z, Bernardo TM, Sharif S. The assessment of twitter's potential for outbreak detection: avian influenza case study. Sci Rep. (2019) 9:18147. doi: 10.1038/s41598-019-54388-4

PubMed Abstract | CrossRef Full Text | Google Scholar

78. Szomszor M, Kostkova P, De Quincey. E. #Swineflu: twitter predicts swine flu outbreak in 2009. In: Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, 69 LNICST:18–26. Berlin, Heidelberg: Springer (2011). doi: 10.1007/978-3-642-23635-8_3

CrossRef Full Text

79. Kagashe I, Yan Z, Suheryani I. Enhancing seasonal influenza surveillance: topic analysis of widely used medicinal drugs using twitter data. J Med Internet Res. (2017) 19:e315. doi: 10.2196/jmir.7393

PubMed Abstract | CrossRef Full Text | Google Scholar

80. Lampos V, De Bie T, Cristianini N. Flu detector-tracking epidemics on twitter. In: Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 6323 LNAI:599–602. Berlin, Heidelberg: Springer (2010). doi: 10.1007/978-3-642-15939-8_42

CrossRef Full Text | Google Scholar

81. Lampos V, Cristianini N. Nowcasting events from the social web with statistical learning. ACM Trans Intell Syst Technol. (2012) 3:1–22. doi: 10.1145/2337542.2337557

CrossRef Full Text | Google Scholar

82. Hossain L, Kam D, Kong F, Wigand RT, Bossomaier T. Social media in ebola outbreak. Epidemiol Infect. (2016) 144:2136–43. doi: 10.1017/S095026881600039X

CrossRef Full Text | Google Scholar

83. Natesan M, Wu SW, Chen CI, Jensen SMR, Karlovac N, Dyas BK, et al. A smartphone-Based rapid telemonitoring system for ebola and marburg disease surveillance. ACS Sensors. (2019) 4:61–8. doi: 10.1021/acssensors.8b00842

PubMed Abstract | CrossRef Full Text | Google Scholar

84. Chew C, Eysenbach G. Pandemics in the age of twitter: content analysis of tweets during the 2009 h1N1 outbreak. PLoS ONE. (2010) 5:e14118. doi: 10.1371/journal.pone.0014118

PubMed Abstract | CrossRef Full Text | Google Scholar

85. Kogan NE, Clemente L, Liautaud P, Kaashoek J, Link NB, Nguyen AT, et al. An early warning approach to monitor COVID-19 activity with multiple digital traces in near real time. Sci Adv. (2021) 7:eabd6989. doi: 10.1126/sciadv.abd6989

PubMed Abstract | CrossRef Full Text | Google Scholar

86. Li C, Chen LJ, Chen X, Zhang M, Pang CP, Chen H. Retrospective analysis of the possibility of predicting the COVID-19 outbreak from Internet searches and social media data, China, 2020. Euro Surveill. (2020) 25:2000199. doi: 10.2807/1560-7917.ES.2020.25.10.2000199

PubMed Abstract | CrossRef Full Text | Google Scholar

87. Young SD, Rivers C, Lewis B. Methods of using real-Time social media technologies for detection and remote monitoring of hIV outcomes. Prev Med. (2014) 63:112–5. doi: 10.1016/j.ypmed.2014.01.024

PubMed Abstract | CrossRef Full Text | Google Scholar

88. Bates M. Tracking disease: digital epidemiology offers new promise in predicting outbreaks. IEEE Pulse. (2017) 8:18–22. doi: 10.1109/MPUL.2016.2627238

PubMed Abstract | CrossRef Full Text | Google Scholar

89. Mowery J. Twitter influenza surveillance: quantifying seasonal misdiagnosis patterns. Online J Public Health Inform. (2016) 8:7011. doi: 10.5210/ojphi.v8i3.7011

PubMed Abstract | CrossRef Full Text | Google Scholar

90. Page T. A forecast of the adoption of wearable technology. Int J Technol Diff. (2015) 6:12–29. doi: 10.4018/IJTD.2015040102

CrossRef Full Text | Google Scholar

91. Qiu HJ LX, Yuan XK, Huang YQ, Zhou QW, Wu R, et al. Using the big data of internet to understand the characteristics of coronavirus disease 2019: a Big data study. Zhonghua Er Bi Yan Hou Tou Jing Wai Ke Za Zhi. (2020) 55:569–75. doi: 10.1016/j.wjorl.2020.05.003

PubMed Abstract | CrossRef Full Text | Google Scholar

92. Garattini C, Raffle J, Aisyah DN, Sartain F, Kozlakidis Z. Big data analytics, infectious diseases and associated ethical impacts. Philos Technol. (2019) 32:69–85. doi: 10.1007/s13347-017-0278-y

PubMed Abstract | CrossRef Full Text | Google Scholar

93. European Commission (2020). Mobile Applications to Support Contact Tracing in the EU's Fight against COVID-19 - Common EU Toolbox for Member States. Brussels. Available online at: https://ec.europa.eu/health/sites/default/files/ehealth/docs/covid-19_apps_en.pdf (accessed July 9, 2021).

94. Sun K, Chen J, Viboud C. Early epidemiological analysis of the coronavirus disease 2019 outbreak based on crowdsourced data: a population-level observational study. Lancet Digital Health. (2020) 2:e201–8. doi: 10.1016/S2589-7500(20)30026-1

PubMed Abstract | CrossRef Full Text | Google Scholar

95. Bragazzi NL, Dai H, Damiani G, Behzadifar M, Martini M, Wu J. (2020). How big data and artificial intelligence can help better manage the covid-19 pandemic. Int J Environ Res Public Health. 17:3176. doi: 10.3390/ijerph17093176

PubMed Abstract | CrossRef Full Text | Google Scholar

96. McCall B. COVID-19 artificial intelligence: protecting health-care workers and curbing the spread. Lancet Digital Health. (2020) 2:e166–7. doi: 10.1016/S2589-7500(20)30054-6

PubMed Abstract | CrossRef Full Text | Google Scholar

97. Shaban-Nejad A, Michalowski M, Buckeridge DL. Health intelligence: how artificial intelligence transforms population and personalized health. Npj Digit Med. (2018) 1:53. doi: 10.1038/s41746-018-0058-9

PubMed Abstract | CrossRef Full Text | Google Scholar

98. Antons D, Grünwald E, Cichy P, Salge TO. The application of text mining methods in innovation research: current state, evolution patterns, development priorities. RandD Manage. (2020) 50:329–51. doi: 10.1111/radm.12408

CrossRef Full Text | Google Scholar

99. Ocampo AJ, Chunara R, Brownstein JS. Using search queries for malaria surveillance, thailand. Malar J. (2013) 12:390. doi: 10.1186/1475-2875-12-390

PubMed Abstract | CrossRef Full Text | Google Scholar

100. Chan EH, Sahai V, Conrad C, Brownstein JS. Using web search query data to monitor dengue epidemics: a New model for neglected tropical disease surveillance. PLoS Neglect Trop Dis. (2011) 5:e1206. doi: 10.1371/journal.pntd.0001206

PubMed Abstract | CrossRef Full Text | Google Scholar

101. Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L. Detecting influenza epidemics using search engine query data. Nature. (2009) 457:1012–4. doi: 10.1038/nature07634

PubMed Abstract | CrossRef Full Text | Google Scholar

102. Daughton AR, Chunara R, Paul MJ. Comparison of social media, syndromic surveillance, and microbiologic acute respiratory infection data: observational study. JMIR Public Health Surveill. (2020) 6:e14986. doi: 10.2196/14986

PubMed Abstract | CrossRef Full Text | Google Scholar

103. Colubri A, Hartley MA, Siakor M, Wolfman V, Felix A, Sesay T, et al. Machine-learning prognostic models from the 2014–16 ebola outbreak: data-harmonization challenges, validation strategies, mHealth applications. EClinicalMed. (2019) 11:54–64. doi: 10.1016/j.eclinm.2019.06.003

PubMed Abstract | CrossRef Full Text | Google Scholar

104. Santermans E, Robesyn E, Ganyani T, Sudre B, Faes C, Quinten C, et al. Spatiotemporal evolution of ebola virus disease at sub-national level during the 2014 West Africa epidemic: model scrutiny and data meagreness. PLoS ONE. (2016) 11:147172. doi: 10.1371/journal.pone.0147172

PubMed Abstract | CrossRef Full Text | Google Scholar

105. Beare S, Simpson E, Gray K, Andjelic D. Rapid integration of zika virus prevention within sexual and reproductive health services and beyond: programmatic lessons from latin america and the caribbean. Global Health Sci Pract. (2019) 7:116–27. doi: 10.9745/GHSP-D-18-00356

PubMed Abstract | CrossRef Full Text | Google Scholar

106. Mathewson AA, Daly ER, Cavallo SJ, Alic A. Use of digital pens for rapid epidemiologic data collection during a foodborne outbreak investigation. Disaster Med Public Health Prep. (2015) 9:349–53. doi: 10.1017/dmp.2015.43

PubMed Abstract | CrossRef Full Text | Google Scholar

107. Wang CJ, Ng CY, Brook RH. Response to COVID-19 in Taiwan: big data analytics, new technology, and proactive testing. JAMA. (2020) 323:1341–2. doi: 10.1001/jama.2020.3151

PubMed Abstract | CrossRef Full Text | Google Scholar

108. Bassi A, Arfin S, John O, Jha V. An Overview of Mobile Applications (Apps) to Support the Coronavirus Disease 2019 Response in India. Mumbai: Indian Journal of Medical Research (2020). Wolters Kluwer Medknow Publications.

PubMed Abstract | Google Scholar

109. Official Website of SORMAS. (n.d.). Available online at: https://sormas.org/ (accessed July 9, 2021).

110. Pang J, Wang MX, Ang IYH, Tan SHX, Lewis RF, Chen I-P, et al. Potential rapid diagnostics, vaccine and therapeutics for 2019 novel coronavirus (2019-NCoV): a systematic review. J Clin Med. (2020) 9:623. doi: 10.3390/jcm9030623

PubMed Abstract | CrossRef Full Text | Google Scholar

111. Ncube B, Mars M, Scott RE. The need for a telemedicine strategy for botswana? A scoping review and situational assessment. BMC Health Serv Res. (2020) 20:794. doi: 10.1186/s12913-020-05653-0

PubMed Abstract | CrossRef Full Text | Google Scholar

112. Reforma LG, Duffy C, Collier AY, Wylie BJ, Shainker SA, Golen TH, et al. A multidisciplinary telemedicine model for management of COVID-19 in obstetric patients. Am J Obstetr Gynecol. (2020) 2:100180. doi: 10.1016/j.ajogmf.2020.100180

CrossRef Full Text | Google Scholar

113. Vilendrer S, Patel B, Chadwick W, Hwa M, Asch S, Pageler N, et al. Rapid deployment of inpatient telemedicine in response to cOVID-19 across three health systems. JAMIA. (2020) 27:1102–9. doi: 10.1093/jamia/ocaa077

PubMed Abstract | CrossRef Full Text | Google Scholar

114. Mahmood S, Hasan K, Colder Carras M, Labrique A. Global preparedness against COVID-19: we must leverage the power of digital health. JMIR Public Health Surveill. (2020) 6:e18980. doi: 10.2196/preprints.18980

PubMed Abstract | CrossRef Full Text | Google Scholar

115. Lohiniva AL, Sane J, Sibenberg K, Puumalainen T, Salminen M. Understanding coronavirus disease (COVID-19) risk perceptions among the public to enhance risk communication efforts: a practical approach for outbreaks, Finland, February 2020. Euro Surveill. (2020) 25:2000317. doi: 10.2807/1560-7917.ES.2020.25.13.2000317

PubMed Abstract | CrossRef Full Text | Google Scholar

116. Bayham J, Kuminoff NV, Gunn Q, Fenichel EP. Measured voluntary avoidance behaviour during the 2009 A/H1N1 epidemic. Proc Royal Soc B Biol Sci. (2015) 282:1818. doi: 10.1098/rspb.2015.0814

PubMed Abstract | CrossRef Full Text | Google Scholar

117. Chapman GB, Coups EJ. Emotions and preventive health behavior: worry, regret, influenza vaccination. Health Psychol. (2006) 25:82–90. doi: 10.1037/0278-6133.25.1.82

PubMed Abstract | CrossRef Full Text | Google Scholar

118. Rufai SR, Bunce C. World leaders' usage of Twitter in response to the COVID-19 pandemic: a content analysis. J Public Health. (2020) 42:510–6. doi: 10.1093/pubmed/fdaa049

PubMed Abstract | CrossRef Full Text | Google Scholar

119. World Health Organization (WHO). Management of Ill Travellers at Points of Entry - International Airports, Seaports and Ground Crossings - in the Context of COVID-19 Outbreak (2020). Available online at: https://www.who.int/publications/i/item/10665-331512 (accessed July 9, 2021).

Google Scholar

120. Budd J, Miller BS, Manning EM, Lampos V, Zhuang M, Edelstein M, et al. Digital technologies in the public-health response to COVID-19. Nat Med. (2020) 26:1183–92. doi: 10.1038/s41591-020-1011-4

PubMed Abstract | CrossRef Full Text | Google Scholar

121. Can a Coronavirus Tracking App Be Both Effective and Privacy-Centric? | VentureBeat. (2020). Available online at: https://venturebeat.com/2020/04/09/can-a-coronavirus-tracking-app-be-both-effective-and-privacy-centric/ (accessed July 9, 2021).

122. Ferretti L, Wymant C, Kendall M, Zhao L, Nurtay A, Abeler-Dörner L, et al. Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science. (2020) 368:6491. doi: 10.1126/science.abb6936

PubMed Abstract | CrossRef Full Text | Google Scholar

123. Vaughan A. The problems with contact-tracing apps. New Sci. (2020) 246:9. doi: 10.1016/S0262-4079(20)30787-9

CrossRef Full Text | Google Scholar

124. Coronavirus: An EU Approach for Efficient Contact Tracing Apps to Support Gradual Lifting of Confinement Measures | Shaping Europe's Digital Future (2020). Available online at: https://ec.europa.eu/digital-single-market/en/news/coronavirus-eu-approach-efficient-contact-tracing-apps-support-gradual-lifting-confinement (accessed July 9, 2021).

125. Chen Q, Min C, Zhang W, Wang G, Ma X, Evans R. Unpacking the black box: how to promote citizen engagement through government social media during the COVID-19 crisis. Comput Human Behav. (2020) 110:106380. doi: 10.1016/j.chb.2020.106380

PubMed Abstract | CrossRef Full Text | Google Scholar

126. Bridgman A, Merkley E, Loewen PJ, Owen T, Ruths D, Teichmann L, et al. The causes and consequences of COVID-19 misperceptions: understanding the role of news and social media. Harvard Kennedy Sch Misinform Rev. (2020) 1. doi: 10.37016/mr-2020-028

CrossRef Full Text | Google Scholar

127. Hasan K, Carras MC, Labrique A. Global preparedness against cOVID-19: we must leverage the power of digital health. JMIR Public Health Surveill. (2020) 6:18980. doi: 10.2196/18980

PubMed Abstract | CrossRef Full Text | Google Scholar

128. Arora VS, McKee M, Stuckler D. Google trends: opportunities and limitations in health and health policy research. Health Policy. (2019) 123:338–41. doi: 10.1016/j.healthpol.2019.01.001

PubMed Abstract | CrossRef Full Text | Google Scholar

129. Qin L, Sun Q, Wang Y, Wu KF, Chen M, Shia BC, et al. Prediction of number of cases of 2019 novel coronavirus (COVID-19) using social media search index. Int J Environ Res Public Health. (2020) 17:2365. doi: 10.3390/ijerph17072365

PubMed Abstract | CrossRef Full Text | Google Scholar

130. Management of Ill Travellers at Points of Entry – International Airports Seaports and Ground Crossings – in the Context of COVID-19 Outbreak. (n.d.)

Google Scholar

131. Akhtar M, Kraemer MUG, Gardner LM. A dynamic neural network model for predicting risk of zika in real time. BMC Med. (2019) 17:1–16. doi: 10.1186/s12916-019-1389-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: digital surveillance, digital epidemiology, data sources, outbreak, COVID-19

Citation: Kostkova P, Saigí-Rubió F, Eguia H, Borbolla D, Verschuuren M, Hamilton C, Azzopardi-Muscat N and Novillo-Ortiz D (2021) Data and Digital Solutions to Support Surveillance Strategies in the Context of the COVID-19 Pandemic. Front. Digit. Health 3:707902. doi: 10.3389/fdgth.2021.707902

Received: 10 May 2021; Accepted: 30 June 2021;
Published: 06 August 2021.

Edited by:

Pradeep Nair, Central University of Himachal Pradesh, India

Reviewed by:

Manh-Toan Ho, Phenikaa University, Vietnam
Mohamed Elhakim, World Health Organization, Djibouti

Copyright © 2021 Kostkova, Saigí-Rubió, Eguia, Borbolla, Verschuuren, Hamilton, Azzopardi-Muscat and Novillo-Ortiz. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: David Novillo-Ortiz, ZG5vdmlsbG8mI3gwMDA0MDt3aG8uaW50

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.