DATA REPORT article

Front. Public Health

Sec. Infectious Diseases: Epidemiology and Prevention

Volume 13 - 2025 | doi: 10.3389/fpubh.2025.1535315

Data report: Epidemiology, Immunology and Clinical Characteristics of COVID-19 (EPIC 3 ) -Database of A Prospective Longitudinal Observational Study within the Veterans Health Administration

Provisionally accepted
Liuye  HuangLiuye Huang1,2Xumin  LiXumin Li1,2Samin  I KamalSamin I Kamal1,2Jonathan  SugimotoJonathan Sugimoto1,3,4Cindy  H LiuCindy H Liu2Tracy  WangTracy Wang2Daniel  K MorelliDaniel K Morelli2Jordanna  B MidthunJordanna B Midthun2Vivek  R PakanatiVivek R Pakanati2Katrina  V DeardorffKatrina V Deardorff2Jennifer  L SporlederJennifer L Sporleder2Jude  LopezJude Lopez5Mark  HolodniyMark Holodniy5,6,7Nicholas  L SmithNicholas L Smith1,2*Jennifer  S LeeJennifer S Lee6,7Javeed  Ali ShahJaveed Ali Shah2,8,9Jennifer  RossJennifer Ross2,8
  • 1Department of Epidemiology, University of Washington, Seattle, United States
  • 2VA Puget Sound Health Care System, Veterans Health Administration, United States Department of Veterans Affairs, Seattle, Washington, United States
  • 3International Vaccine Institute, Seoul, Republic of Korea
  • 4Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States
  • 5Veterans Health Administration, United States Department of Veterans Affairs, Washington D.C., District of Columbia, United States
  • 6VA Palo Alto Health Care System, Veterans Health Administration, United States Department of Veterans Affairs, Palo Alto, California, United States
  • 7School of Medicine, Stanford University, Stanford, California, United States
  • 8Division of Allergy and Infectious Diseases, Department of Medicine, School of Medicine, University of Washington, Seattle, Washington, United States
  • 9Department of Global Health, School of Public Health, University of Washington, Seattle, Washington, United States

The final, formatted version of the article will be published soon.

The health burden caused by SARS-CoV-2 infection has been devastating nationally and internationally (1,2). Over the past 4 years, much research has been conducted to characterize the clinical trajectory of acute SARS-CoV-2 infection and to identify risk/prognostic factors for infection and disease severity, respectively (3)(4)(5)(6). Despite the end of the COVID-19 public health emergency declaration in the U.S. (7) and globally (8) in May 2023, research into COVID-19 continues. Priorities have shifted from understanding acute manifestations to long-term sequelae, clinical phenotypes, persistence of vaccine immunity, and identification/validation of clinically relevant biomarkers, among others (9)(10)(11)(12)(13)(14)(15)(16)(17)(18)(19)(20). Such shifts underscore the need for prospective and longitudinal data, collected using externally validated and widely adopted instruments across participants with heterogenous sociodemographic and clinical characteristics to allow for research across various domains of interest.Therefore, the Cooperative Studies Program (CSP) of the Veterans Health Administration (VHA), U.S. Department of Veterans Affairs (VA) authorized the Epidemiology, Immunology, and Clinical Characteristics of COVID-19 (EPIC 3 ) study -a prospective, multi-site, cohort study conducted within the nation's largest integrated healthcare system. The design of the EPIC 3 study is based on that of the Department of Defense (DoD)-sponsored Epidemiology, Immunology, and Clinical Characteristics of Emerging Infectious Diseases with Pandemic Potential (EPICC-EID) study (21). The EPIC 3 study collects information regarding patient reported outcome measures (PROMs) and biospecimens across pre-defined visit time windows, over a follow-up period of 24 months. These data are augmented with real-world data (RWD) sourced from participants' VA electronic health records (EHRs) to address the following aims:(1) To identify sociodemographic, clinical, virologic, and immunologic factors which influence the trajectory of SARS-CoV-2 infection, with a particular emphasis on understanding medium and long-term sequelae, and prognostic factors associated with adverse outcomes.(2) To characterize factors associated with and protective against initial infection, reinfection, and breakthrough infection after vaccination.(3) To identify both individual and facility-level risk factors for SARS-CoV-2 infection, among Veterans living in community living centers (CLCs) -long-term skilled nursing facilities managed by the VA.The EPIC 3 The study enrolled U.S. military Veterans with and without SARS-CoV-2 infection who utilized services at VHA facilities. For the inpatient and outpatient sub-cohorts, infection status was confirmed by the record of a positive reverse transcriptase polymerase chain reaction (RT-PCR), antibody, or antigen test for SARS-CoV-2 in the VHA EHR. Self-reported RT-PCR tests conducted outside the VA were permitted as well, but this contingency was not often used. In the inpatient and outpatient sub-cohort, 430 participants (58.4%) and 1,280 participants (66.9%) had confirmed SARS-CoV-2 positive status, respectively. For each positive participant enrolled in the inpatient and outpatient sub-cohorts, the study attempted to enroll a Veteran who sought the same type of care (inpatient versus outpatient) at the same site but without the record of a positive SARS-CoV-2 test within the 21-days preceding the date of seeking care. All residents of participating CLCs were eligible to enroll regardless of their SARS-CoV-2 infection status at enrollment.Recruitment Recruitment procedures differed among the inpatient, outpatient, and CLC sub-cohorts. There was no predetermined sample size for any sub-cohort; the recruitment goal was to consent as many participants as possible given the site staffing capabilities and the prevalence of SARS CoV-2 infection.Inpatient sub-cohort: Every 24 hours, the research team used VHA EHRs to find newly eligible inpatient participants. Research staff were also able to identify eligible participants. Clinical staff approached inpatients with confirmed or suspected SARS-CoV-2 infection to discuss the study's goals and to determine if the patient was capable of providing informed consent. Clinical staff referred interested inpatients to the site's research team, who then scheduled a recruitment and consent discussion.Outpatient sub-cohort: At VHA SARS-CoV-2 testing locations, study flyers were posted or distributed by VHA staff. Every 24 hours, the research team used the nightly-updated VHA EHR to find newly eligible outpatient participants who sought care or had a test ordered at participating locations. The research team then reached out to potential outpatient participants by phone and mailed a recruitment package that included an invitation letter and a study description.Interested outpatients were also invited to contact the research team directly using the information on the recruitment flyers posted at public locations at VHA facilities.Lists of eligible participants were then loaded into a study database that all study sites used for recruitment and enrollment. The date on which an eligible participant was identified from the VHA EHR was defined as their index date. In both the inpatient and outpatient sub-cohorts, study staff were allotted 72 hours from index date to make first contact with eligible SARS-CoV-2 positive participants, with eligible participants then being allowed an indefinite amount of time to make their decision regarding participation.For each sub-cohort except for CLC, the study sought to recruit a 1:1 ratio of SARS-CoV-2 positive and negative participants. To achieve the 1:1 ratio, study sites were instructed to consult the study database for eligible SARS-CoV-2 negative patients for each enrolled SARS-CoV-2 positive counterpart, matching on study site and sub-cohort. Study sites were also instructed to prioritize enrolling SARS-CoV-2 negative matches who had the closest duration of elapsed time from the time of seeking care to the time of recruitment to their positive counterparts. For enrolled SARS-CoV-2 positive participants, the time since seeking care was defined as the number of hours that have elapsed between the start of their eligibility period and their recruitment into the study. While for SARS-CoV-2 negative participants, it indicated the number of hours that have elapsed between the start of their eligibility period and when the study database was consulted to find a matched SARS-CoV-2 negative participant.CLC sub-cohort: For residents at CLC sites, the local research team, in collaboration with CLC staff, informed all residents and their family members about the study through letters. Interested Veterans or their legally authorized representative were approached for informed consent.Definition for SARS-CoV-2 status in CLC was same for other two cohorts, although it was not part of the recruitment criteria.Written informed consent was required and obtained prior to recruitment for each participant.Paper and secure digital documentation of written informed consent were permitted. As part of the informed consent process, the site principal investigators or their designee provided a plainlanguage description of the study's aims, procedures, likely risks, and benefits, as well as describing rights of the participant and which study components are optional. Participation in the study's data and biospecimen repositories were required; participation in the participant registry was optional. In cases where a Veteran had impaired capacity to make informed decisions, the outlined process remained the same, incorporating a legal authorized representative (LAR) or personal representative (PR) to provide consent and a PR to provide HIPAA authorization.This study was approved by the VHA Central IRB (VHA Central IRB Study .Given the observational nature of the study, participants were determined to be at minimal risk for any study-related adverse outcomes. All the electronic data, written information, and materials are stored securely behind the VHA security firewall.After enrollment, baseline data were collected via questionnaires and biological sampling.Inpatient and outpatient participants were asked to complete follow-up questionnaires and provide biologic samples for testing during follow-up visits on days 3, 7, 14, 21, and 28, as well as months 3, 6, 12, 18, and 24 after enrollment. CLC participants were asked to complete questionnaires and provide biospecimens every 3 months until 24 months after enrollment. CLC participants who were discharged from a CLC were offered the option of continuing to participate in the study by continuing with the inpatient/outpatient data collection schedule, starting with the next scheduled visit relative to their enrollment date. EHR data were abstracted for each participant in both a retrospective and prospective manner (Table 1).A modified Million Veteran Program (MVP) COVID-19 survey was administered at baseline (22). This survey assessed SARS-CoV-2 and COVID-19 risk factors related to sociodemographic characteristics, substance use and lifestyle behaviors, family history, sources of exposures to infection, and medical and mental health comorbidities. The survey also assessed the impact of COVID-19 on health-related quality of life. A wide array of information can be captured from EHRs at the VHA Corporate Data Warehouse (CDW), including medical history of conditions relevant to SARS-CoV-2 and COVID-19 risk, SARS-CoV-2 vaccinations received, vital signs, diagnosis codes for medical comorbidities, hospitalization records, and other data relevant to clinical care and treatment. Clinical indices calculated using EHR data include the Charlson Comorbidity Index (CCI) (32) and the Veterans Affairs Severity Index for COVID-19 (VASIC) (33). The VASIC is a 4-category (mild, moderate, severe, or death) measure of the maximum illness severity within 30 days of enrollment that was calculated for SARS-CoV-2-positive inpatients.Biospecimens were collected at in-person visits and, if in-person was not feasible, through participant self-collection. At in-person visits, the study collected peripheral blood in whole blood ethylenediaminetetraacetic acid (EDTA) tubes or sodium citrate tubes, serum separator tubes, RNA-stabilizing PAXGene ® tubes, and CPT â„¢ mononuclear cell preparation tubes. Saliva and the eluant from nasal or nasopharyngeal swabs was collected at in-person visits for inpatient and outpatient participants. Residual clinical samples from inpatients were also collected as available. Specimens were then shipped to the biorepository and laboratory at PHRC within 24hours at 3-4 degrees Celsius. Most study sites were also able to have an additional set of blood specimens assayed by on-site Pathology and Laboratory Medicine Service (PLMS) laboratories.Where an in-person visit was not feasible, devices for self-collecting saliva (Spectrum Solutions SDNA-1000) and TASSO-SST ® or Tasso ® plus kits for collecting a small amount of capillary blood were shipped to the participant, which they were asked to self-administer and mail free-ofcharge back to PHRC. Study staff were available to provide real-time, telephone-based coaching to help interested participants through the self-collection process. Specimens collected by these self-administered kits were stable at ambient temperature for at least a week. ✔ - - - - - - - - - - - - - Symptoms questionnaire (Flu-related) ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ - ✔ - ✔ - ✔ Vaccination questionnaire ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ - ✔ - ✔ - ✔ Long-Term Symptoms questionnaire - ✔ ✔ ✔ ✔ ✔ ✔ ✔ - ✔ - ✔ - ✔ Nasopharyngeal and oropharyngeal swabs a ✔ ✔ ✔ ✔ ✔ ✔ - - - - - - - - Blood b and saliva c ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ - ✔ - ✔ - ✔ Outpatient sub-cohort Baseline questionnaire (modified MVP) ✔ - - - - - - - - - - - - - Symptoms questionnaire (Flu-related) ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ - ✔ - ✔ - ✔ Vaccination questionnaire ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ - ✔ - ✔ - ✔ Long-Term Symptoms questionnaire - ✔ ✔ ✔ ✔ ✔ ✔ ✔ - ✔ - ✔ - ✔ Nasopharyngeal and oropharyngeal swabs a ✔ ✔ ✔ ✔ ✔ ✔ - - - - - - - - Blood b and saliva c ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ - ✔ - ✔ - ✔ CLC sub-cohort Baseline questionnaire (modified MVP) ✔ - - - - - - - - - - - - - Symptoms questionnaire (Flu-related) ✔ - - - - - ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ Vaccination questionnaire ✔ - - - - - ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ Long-Term Symptoms questionnaire - - - - - - ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ Blood b ✔ - - - - - ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔Around 88.5% of participants have completed the study, 7.8% participants deceased and 3.7% withdrawn during follow-up (as of March 17 th , 2025). Participants in the outpatient sub-cohort were younger than members of the inpatient and CLC sub-cohorts. The percentage of participants identifying as Black or African American in the inpatient, outpatient, and CLC subcohorts were approximately 39%, 18%, and 36%, respectively. Less than 10% of participants in the inpatient and CLC sub-cohorts identified as being Hispanic or Latino. A higher proportion of participants in the SARS-CoV-2 positive group were unvaccinated (< 1 dose) prior to their index date starting one month after a SARS-CoV-2 vaccine was first available to VHA users, compared with their negative counterparts (Table 2). Among the 131 CLC participants without SARS-CoV-2 infection at enrollment, 80 (61.1%) of them tested positive at least once during follow-up.Regarding the completion of specimen collections and questionnaires, the proportions of participants completing baseline questionnaires were 71.3%, 94.6%, and 84.9% in the inpatient, outpatient, and CLC sub-cohorts, respectively. The completion rate of vaccination questionnaire and symptoms questionnaire (flu-related) at baseline was similar among the three sub-cohorts.However, the completion rate of questionnaires in the outpatient sub-cohort became much higher than the other two sub-cohorts during follow-up. For biospecimens, the completion rate of blood collection was 67.8%, 15%, and 57.5% at baseline in the inpatient, outpatient, and CLC subcohorts, respectively. But the completion rate of blood in the outpatient sub-cohort exceeded the other sub-cohorts during follow-up. The trends remained similar for saliva and swabs. For either inpatient or outpatient sub-cohort, the completion rates of specimen collection and questionnaire were a little bit higher in the SARS-CoV-2 negative group compared with the positive group during follow-up (Supplementary Table 1). More information regarding the completion rates of biospecimen collection and questionnaire and their availability can be found on the official website with the page indicating research progress (https://www.seattle.eric.research.va.gov/research/CSP-2028-EPIC3/Study-progress.asp). There are several limitations we would like to note for utilizing EPIC 3 resources. First, there are potential issues with loss-to-follow-up (LTFU) or data missingness at certain time periods.However, LTFU is likely not a serious threat to internal validity since the rate of withdrawal from the study was minimal (Table 2). Issues with data missingness at various time periods may be dealt with by design. For example, missing questionnaire data may be supplemented with comparable EHR records. In addition, the indefinite extraction of EHR data allows participants to be longitudinally followed-up well past the initial 24-month study period. Similarly, specimens banked at the biorepository may be assayed for markers not initially specified at the time of enrollment, rate limited by collected sample quantity, allowing for flexibility to respond to emerging research questions. We would also like to note that Veterans may receive care at other institutions and if they were over 65 years old, some of their data may be available through Centers for Medicare & Medicaid Services instead of EHRs at VHA.Second, SARS-CoV-2 positive participants were not required to be symptomatic with COVIDlike illness for recruitment purposes, resulting in heterogeneity in their health conditions and the associated indication for testing. But given the rich diverse data repositories in this study, investigators are welcome to manipulate the dataset and generate indicators (i.e., new definitions of ascertaining SARS-CoV-2 status) according to their research goals.Third, the allotted recruitment timeframe differed between SARS-CoV-2 positive and negative participants. Positives were required to be recruited within 72 hours of identification from CDW records, while only an approximately similar timeframe was recommended for negatives. The amount of time between initial recruitment and final enrollment varied among participants as well.Lastly, issues of potential selection bias and generalizability should be considered. In EPIC 3 , there is a much higher proportion of biological men than women, making it challenging to study the effect of sex. In addition, the patterns of receiving services at VHA medical centers differ in various phases of the pandemic/recruitment period, due to transitions in hospital access restrictions, quarantine mandates, conversion of in-person appointments to telehealth visits, and availability of COVID-19 testing (34). This may indicate systematic differences between enrolled participants and the corresponding population to whom we desire to generalize results.However, given the relatively long recruitment period and that many participants were recruited during later phases of the pandemic, potential selection bias issues should be minimal.

Keywords: SARS-CoV-2, Veterans, Epidemiology, Longitudinal cohort, database

Received: 27 Nov 2024; Accepted: 10 Apr 2025.

Copyright: © 2025 Huang, Li, Kamal, Sugimoto, Liu, Wang, Morelli, Midthun, Pakanati, Deardorff, Sporleder, Lopez, Holodniy, Smith, Lee, Shah and Ross. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Nicholas L Smith, Department of Epidemiology, University of Washington, Seattle, United States

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

94% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more