Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol.
Sec. Food Microbiology
Volume 16 - 2025 | doi: 10.3389/fmicb.2025.1519189

Source attribution of human Campylobacter infection: A multi-country model in the European Union

Provisionally accepted
  • 1 Technical University of Denmark, Kongens Lyngby, Denmark
  • 2 National Institute for Public Health and the Environment (Netherlands), Bilthoven, Utrecht, Netherlands
  • 3 Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Utrecht, Netherlands
  • 4 National Veterinary Research Institute (NVRI), Pulawy, Poland
  • 5 Department of Agriculture Food and the Marine (Ireland), Celbridge, County Kildare, Ireland
  • 6 Teagasc Environment Research Centre, Wexford, Ireland
  • 7 Teagasc Food Research Centre (Ireland), Carlow, County Carlow, Ireland
  • 8 VISAVET Health Surveillance Centre (UCM), Madrid, Madrid, Spain
  • 9 Department of Animal Health, Faculty of Veterinary Medicine, Complutense University of Madrid, Madrid, Madrid, Spain
  • 10 Swedish Veterinary Agency, Uppsala, Uppsala, Sweden
  • 11 National Institute for Agricultural and Veterinary Research (INIAV), Oeiras, Lisboa, Portugal
  • 12 Division of Infectious Diseases and Immunology, Department Biomolecular Health Sciences, Faculty of Veterinary Medicine, Utrecht University, Utrecht, Netherlands, Netherlands
  • 13 State Serum Institute (SSI), Copenhagen, Hovedstaden, Denmark
  • 14 Department of Food Safety, Nutrition and Veterinary Public Health, National Institute of Health (ISS), Rome, Lazio, Italy
  • 15 National Health Institute Doutor Ricardo Jorge (INSA), Lisbon, Portugal
  • 16 National Veterinary Institute (Sweden), Uppsala, Uppsala, Sweden
  • 17 French Agency for Food Environmental and Occupational Health & Safety (ANSES), Ploufragan, France

The final, formatted version of the article will be published soon.

    Infections caused by Campylobacter spp. represent a severe threat to public health worldwide. National action plans have included source attribution studies as a way to quantify the contribution of specific sources and understand the dynamic of transmission of foodborne pathogens like Salmonella and Campylobacter. Such information is crucial for implementing targeted intervention. The aim of this study was to predict the sources of human campylobacteriosis cases across multiple countries using available whole-genome sequencing (WGS) data and explore the impact of data availability and sample size distribution in a multi-country source attribution model. We constructed a machine-learning model using k-mer frequency patterns as input data to predict human campylobacteriosis cases per source. We then constructed a multi-country model based on data from all countries. Results using different sampling strategies were compared to assess the impact of unbalanced datasets on the prediction of the cases.The results showed that the variety of sources sampled and the quantity of samples from each source impacted the performance of the model. Most cases were attributed to broilers or cattle for the individual and multi-country models. The proportion of cases that could be attributed with 70% probability to a source decreased when using the down-sampled data set (535 vs. 273 of 2627 cases). The baseline model showed a higher sensitivity compared to the down-sampled model, where samples per source were more evenly distributed. The proportion of cases attributed to non-domestic source was higher but varied depending on the sampling strategy. Both models showed that most cases could be attributed to domestic sources in each country (baseline: 248/273 cases, 91%; down-sampled: 361/535 cases, 67%;). The sample sizes per source and the variety of sources included in the model influence the accuracy of the model and consequently the uncertainty of the predicted estimates. The attribution estimates for sources with a high number of samples available tend to be overestimated, whereas the estimates for source with only a few samples tend to be underestimated. Reccomendations for future sampling strategies include to aim for a more balanced sample distribution to improve the overall accuracy and utility of source attribution efforts.

    Keywords: source attribution, Foodborne disease, Campylobacteriosis, machine learning, European union Only k-mers with a standard deviation greater than

    Received: 29 Oct 2024; Accepted: 20 Jan 2025.

    Copyright: © 2025 Thystrup, Lykke Brinch, Henri, Mughini-Gras, Franz, Wieczorek, Gutierrez, Prendergast, Duffy, Burgess, Bolton, Alvarez, Lopez-Chavarrias, Rosendal, Clemente, Amaro, Zomer, Grimstrup Joensen, Nielsen, Scavia, Skarżyńska, Pinto, Oleastro, Cha, Thépault, Rivoal, Denis, Chemaly and Hald. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Cecilie Thystrup, Technical University of Denmark, Kongens Lyngby, Denmark

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.