AUTHOR=Gilbert Sappho Z. , Morrison Conor L. , Chen Qiuyu J. , Punian Jesman , Bernstein Jodi T. , Jessri Mahsa TITLE=Algorithm-based mapping of products in a branded Canadian food and beverage database to their equivalents in Health Canada’s Canadian Nutrient File JOURNAL=Frontiers in Nutrition VOLUME=9 YEAR=2023 URL=https://www.frontiersin.org/journals/nutrition/articles/10.3389/fnut.2022.1013516 DOI=10.3389/fnut.2022.1013516 ISSN=2296-861X ABSTRACT=Introduction

There is increasing recognition of the value of linking food sales databases to national food composition tables for population nutrition research.

Objectives

Expanding upon automated and manual database mapping approaches in the literature, our aim was to match 1,179 food products in the Canadian data subset of Euromonitor International’s Passport Nutrition to their closest respective equivalents in Health Canada’s Canadian Nutrient File (CNF).

Methods

Matching took place in two major steps. First, an algorithm based on thresholds of maximal nutrient difference (between Euromonitor and CNF foods) and fuzzy matching was executed to offer match options. If a nutritionally appropriate match was available among the algorithm suggestions, it was selected. When the suggested set contained no nutritionally sound matches, the Euromonitor product was instead manually matched to a CNF food or deemed unmatchable, with the unique addition of expert validation to maximize meticulousness in matching. Both steps were independently performed by at least two team members with dietetics expertise.

Results

Of 1,111 Euromonitor products run through the algorithm, an accurate CNF match was offered for 65% of them; missing or zero-calorie data precluded 68 products from being run in the algorithm. Products with 2 or more algorithm-suggested CNF matches had higher match accuracy than those with one (71 vs. 50%, respectively). Overall, inter-rater agreement (reliability) rates were robust for matches chosen among algorithm options (51%) and even higher regarding whether manual selection would be required (71%); among manually selected CNF matches, reliability was 33%. Ultimately, 1,152 (98%) Euromonitor products were matched to a CNF equivalent.

Conclusion

Our reported matching process successfully bridged a food sales database’s products to their respective CNF matches for use in future nutritional epidemiological studies of branded foods sold in Canada. Our team’s novel utilization of dietetics expertise aided in match validation at both steps, ensuring rigor and quality of resulting match selections.