- 1Clinical and Experimental Endocrinology, Department of Chronic Diseases and Metabolism, KU Leuven, Leuven, Belgium
- 2Department of Endocrinology, University Hospitals Leuven, Leuven, Belgium
Background: Assessing dietary intake is valuable both in clinical practice and in research. In research and in clinical practice, long-term habitual dietary intake is most often of interest. Therefore, a web-based semi-quantitative Food Frequency Questionnaire (FFQ) was developed to measure habitual intake of nutrients and foods.
Aim: This study aimed to assess content validity, convergent validity, and reliability of a 32-item semi-quantitative FFQ for adults.
Methods: A total of three different cohorts of Flemish adults were recruited in the past 10 years. The first cross-sectional validation study took place in 2013, consequently in 2019 and 2021. Content validity was assessed in 2019 through a semi-structured cognitive interview. Convergent validity was assessed by examining mean differences, Wilcoxon signed-rank test, Spearman's correlation coefficients (SCC), and Bland–Altman analysis for energy, nutrient, and food group intake compared with a 3-day food record (FR). Additionally, consumers-only analysis was performed together with cross-classification analysis by assessing the ranking capabilities of the FFQ into quartiles and weighted kappa. Reliability was assessed through the evaluation of SCC and intra-class correlation (ICC) of test–retest assessment of the FFQ.
Results: Spearman's correlation coefficient (SCC) for energy and absolute nutrient intake between the FFQ and the FR ranged from 0.02 to 0.54. Compared with absolute macronutrients, higher SCC was found for the majority of the relative macronutrient intake and most food groups. Bland–Altman plots showed improved agreement and decreasing bias between the FFQ and the FR over time. Misclassification of the FFQ for nutrients was acceptable and decreased over time (7.4, 7.5, and 6.8% in 2013, 2019, and 2021, respectively), but weighted kappa remained mostly fair (κ ≤ 0.20). The reliability of the FFQ was good and improved over time (mean SCC of 0.65 and 0.66 p <0.001 in 2013 and 2019).
Conclusion: The short web-based FFQ is an easy, low-cost, and feasible tool with good reliability, low misclassification, and acceptable validity to compare nutrient densities and food group intake at the population level. The measurement of absolute intake remains debatable.
1. Introduction
Assessing dietary intake is valuable both in clinical practice and in research. However, measuring an individual's dietary intake accurately is challenging. The dietary intake of an individual is highly variable due to day-to-day variation, seasonal variation, and the context an individual is in at the time of eating (1, 2). To assure high-quality data, measuring dietary intake quantitatively can only be performed for short period of time. However, in research and in clinical practice, the long-term habitual dietary intake is most often of interest. Additionally, dietary intake is multidimensional and can be described by macro- and micronutrient intake, food group consumption, diet quality, or dietary pattern, as a whole. Consequently, diet is a complex exposure variable existing of different components which are all interrelated (2). Nevertheless, accurate measurements of dietary intake are crucial to research the role of diet in relation to diseases (3). Moreover, in clinical practice, evaluating dietary intake allows to assess nutritional status and provide dietary advice as part of the prevention or treatment of disease. Currently, no method is available to measure true dietary intake, and research is relying on methods approximating true dietary intake. For this reason, dietary assessment methods are known to have inherent measurement error (4).
Food Frequency Questionnaires (FFQs) are amongst the most used methods to measure dietary intake in large epidemiological studies (5). While classic food diaries or 24-h recalls measure short-term dietary intake, FFQs assess habitual intake over a longer period of time (5). The length and extensiveness of an FFQ can differ between a couple of questions on specific food items, such as a “screener,” or can cover an extensive list of food items to assess the overall dietary pattern. Typically, an FFQ consists of questions both on the amount and the frequency of consumption of specific foods over a predefined period of time. However, FFQs are often characterized by an extensive enumeration of questions on more than 100 food items resulting in long questionnaires (6). Although widely used in large-scale epidemiological studies and clinical practice, the length of the questionnaire still holds a substantial burden and limits feasibility. Therefore, the aim of this study was to develop a short FFQ, intended to be concise yet accurate.
The self-administered semi-quantitative web-based FFQ was developed in Dutch and allows to measure dietary habits, dietary quality, and habitual intake of foods and nutrients during the past month in Belgian adults in both the clinical setting and nutritional research. This study describes the validation of this specific FFQ during the past decade in different cohorts and the improvement of the FFQ accordingly.
2. Materials and methods
This study applied the STROBE-NUT guidelines (7).
2.1. Study design and study population
This research is a sequential cross-sectional validation study of a web-based semi-quantitative Food Frequency Questionnaire (FFQ) in which data were collected in 2013, 2019, and 2021 including different cohorts of the Belgian population. Content validity, reliability, and convergent validity compared with a 3-day food record were assessed at the three different time periods. Convergent validity was assessed in all three studies by comparison with a 3-day food record (2013, 2019, and 2021), reliability was assessed by test–retest assessment in the cohorts of 2013 and 2019, and content validity was solely assessed in 2019 in a sample of 15 participants through a semi-cognitive interview.
The studies were approved by the Medical Ethical Committee UZ Leuven (2013) or the Social and Societal Ethics Committee (SMEC) review board (2019, 2021) and received registration numbers S54908, G-2017 12, and G-2021-2956, respectively. All participants provided their written informed consent prior to the start of the study. The study protocol of the three cohorts in the different time periods during the past decade is shown in Figure 1. Participants in all three cohorts filled out a generic sociodemographic questionnaire at the beginning of the study as well. Participants were allocated an identification number so that the analysis of all questionnaires and food records was anonymised. The different sub-studies took place in Flanders, the Northern Dutch-speaking part of Belgium.
Figure 1. Study design of the validation studies in the years 2013, 2019, and 2021. (A) 2013. (B) 2019. (C) 2021.
To assess the validity and reliability of the FFQ, a sample size of each time of 100 participants was aimed for. The sample size was based on the recommendations by Cade et al. and Willet et al. stating that a sample size of minimum of 50 participants, preferably at least 100, is sufficient to assess the convergent validity of Food Frequency Questionnaires (5, 8). The cohorts of the validation studies were imbedded in different cross-sectional studies. Therefore, different inclusion criteria were applied to the different cohorts. For the cohort of 2013, participants were recruited by students through word of mouth and by contacting communities through e-mail. Inclusion criteria for the 2013 sample were having an age of 40–70 years old, having no cardiovascular diseases, not following a specific diet for medical reasons, and not having other family members enrolled. The 2019 sample was recruited through calls on social media together with the snowball sampling method. Inclusion criteria for the 2019 sample were being of reproductive age (i.e., 18–42 years old), speaking Dutch, and not following a specific diet for medical reasons. Recruitment of the 2021 cohort took place through word of mouth or calls on social media together with the snowball sampling method. Inclusion criteria were having knowledge of Dutch, not being pregnant, or lactating. Exclusion criteria were individuals living in a country other than Belgium and individuals following a specific diet because of a medical indication including but not limited to persons with Coeliac disease, type 2 diabetes mellitus, food intolerances or allergies and persons who underwent bariatric surgery previously.
2.2. Development and design of FFQ
The Food Frequency Questionnaire was developed to acquire standardized data on habitual dietary intake in adults during the past month. It is aimed to use a simple, quick, and low-cost method to assess dietary intake in epidemiological and nutritional studies within the general population in Flanders as well as in a clinical setting, hence its concise format. The FFQ was developed in Dutch (Belgium). The first version of the FFQ consisted of questions on the consumption of 27 food items, where frequency and amount consumed are administered in close-ended questions, based on a previous study (9–14). Table 1 provides an overview of the different food groups that are part of the initial FFQ. The FFQ development was in line with the recommendations of Cade et al. (5). Food items included in the questionnaire were selected based on the Belgian dietary habits and assessed during the 2004 Belgian Food Consumption Survey and the food categories as described in the Flemish Food-Based Dietary Guidelines (15, 16). The frequency of consumption of each food item can be indicated on the FFQ as never, 1 to 3 days per month, 1 day per week, 2 to 4 days per week, 5 to 6 days per week, or every day. The amount of consumption of each food item can be indicated in the multiple-choice answers, where different amounts of the food item are given in grams, milliliters, or pieces, in case of eggs, to choose from. Questions on the amount of consumption usually provide examples of typical amounts of the questioned food item expressed in grams or milliliters, such as “1 teaspoon = 10 grams” or “1 glass = 150 ml,” to reduce the risk of incorrect estimated amounts by the participants. Additional open-ended or close-ended questions on the specific type or brand were included for some of the food items. The average individual daily intake of energy and nutrients was calculated by multiplying the consumed amount and frequency of consumption indicated in the FFQ and the nutrient content per food item. The nutrient content of the different food items was based on the weighted average of the foods included in the food category. The weighted average is based on the consumption data of the 2004 Belgian Food Consumption Survey. The FFQ was first developed in 2012. In 2019, the FFQ was adapted to the newly introduced Flemish Food-Based Dietary Guidelines of 2019. This included the addition of questions on new food group items such as “nuts and seeds,” “legumes,” “oats and wholegrain cereals,” “coffee,” “soup,” “sweetened dairy products,” and “savory snacks” and the elimination of the question on salt intake. This resulted in an FFQ of 33 food items (Table 1).
2.3. Semi-cognitive interview
To assess the content validity of the FFQ, a semi-structured cognitive interview was performed in 2019, as described by Patrick et al. (17, 18) (Figure 1). During the interview, the interviewer went over every question of the questionnaire with the participant while the participants filled in the questions and explained how they interpret the questions and its components in order to evaluate the comprehensibility and understandability of the questions included in the FFQ. To evaluate the content validity of the questions regarding the amount of the consumed food items, participants were requested to point out the amount of the food item that corresponds best with their amount of consumption in a booklet with typical portion sizes of different foods visualized (19). Additionally, participants were asked to select the glass, cup, and/or mug that corresponded with the described quantities given as an example in the questions of the FFQ. Food groups of which portion sizes were evaluated in comparison with pictures of standard portion sizes included “meat,” “fish,” “sauce,” “cooking fat,” “pasta,” “potatoes,” “vegetables,” “legumes,” “beans,” “chick peas,” “meat spread,” “fish spread,” “sweet spreads,” and “nuts and seeds.”
2.4. Food record
In all three cohorts, participants were asked to keep a food record for 3 non-consecutive days including 1 day during the weekend (Figure 1). In the cohort in 2013, paper-based food diaries were used. Participants received instructions on the correct registration of all food consumed in the food diaries prior to the study. In the cohorts of 2019 and 2021, participants were requested to keep a food record through the mobile application or web-based version of “MyFitnessPal.” Participants were requested to register the type of food or recipe and amount consumed in the mobile application of all consumed food items over 3 days. Before registration of the dietary record, an instruction booklet was provided to the participants through e-mail, with guidelines on how to use and register their daily dietary intake correctly in “MyFitnessPal.” Daily consumption of energy and nutrients was calculated as the average of the daily nutrient intake for the 3 days registered in the dietary record. Nutrients of the consumed food items were calculated using the Belgian Food Composition Table (NUBEL, fifth edition, 2009) (20). Participants with an average daily energy intake of <500 kcal per day for women or <800 kcal per day for men and >3,500 kcal per day for women or >4,000 kcal per day for men were excluded from the study (21).
2.5. Statistical analysis
Only complete FFQs and food diaries consisting of 3 days with at least two main meals (breakfast, lunch, or dinner) registered daily were considered complete and included for analysis. Convergent validity was assessed by comparing mean differences and Spearman's correlation coefficients for energy, macronutrients, micronutrients, and food groups obtained from the FFQ with the mean intake obtained from the food records (FRs). Protein, total fatty acids, saturated fatty acids (SFA), monounsaturated fatty acids (MUFAs), polyunsaturated fatty acids (PUFAs), total carbohydrates, monosaccharides/disaccharides, and polysaccharides were expressed as energy densities (i.e., percentage of total energy intake that the nutrient is accountable for), next to the absolute values, to adjust for energy intake. Similarly, micronutrients were expressed as relative values (i.e., intake per 100 kcal), next to the absolute values. Mean and standard error (SE) for intake of each nutrient and food group measured by the FFQ and the food record were calculated (Supplementary Tables S1, S2). Since correlation coefficients only assess relatedness between the two methods, the Bland–Altman (BA) method was applied, as well, to assess agreement between the two methods (5). Limits of agreement were calculated as mean difference ± 1.96 * SD. Good agreement was interpreted as at least 95% of the data points lying within the limits of agreement in the BA plot. Reliability was examined by test–retest assessment of two administrations of the FFQ. Spearman's correlation coefficients and intra-class correlation coefficients (ICC) were calculated to assess the reliability of the FFQ, next to a Wilcoxon signed-rank test to examine the statistically significant difference between the mean of the two administrations of the FFQ. ICC was assessed in a two-way mixed-effects model considering absolute agreement for a single measurement, as recommended by Koo et al. for test–retest reliability assessment (22). Next, consumers-only analysis (i.e., taking only consumers for each specific food group into account) was performed similarly to assess convergent validity for each food group (Supplementary Table S3). Finally, cross-classification analysis was performed to assess the percentage of misclassification by the FFQ compared with the 3-day food record. Spearman's correlation coefficients were interpreted according to Cohen's cutoff values, where r = ±0.5 was considered to be strong, r = ±0.30 moderate, and r = ±0.10 weak (23). ICCs were interpreted as ICC ≤ 0.40 is “poor,” 0.41 < ICC < 0.59 is “fair,” 0.60 < ICC < 0.74 is “good,” and 0.75 < ICC < 1.00 is “excellent” (24). A P-value of < 0.05 indicated statistical significance. The statistical program IBM SPSS Statistics 27 was used for all analyses.
3. Results
3.1. Population characteristics
Initially, 111, 114, and 44 participants were recruited in the cohorts of 2013, 2019, and 2021, respectively. Participants with an incomplete or missing FFQ or 3-day food record were excluded from the analysis [n = 11 (2013), n = 8 (2019), and n = 2 (2021)]. As previously described, participants with a self-reported energy intake lower than 500 or 800 kcal per day for women or men, respectively, or higher than 3,500 or 4,000 kcal per day for women and men, respectively, were excluded as well [n = 0 (2013 and 2021), n = 6 (2019)]. The final samples for analysis included 100, 100, and 42 participants in the years 2013, 2019, and 2021, respectively. Only 37 out of 42 participants of the cohort of 2021 responded to the sociodemographic questionnaire. Characteristics of the study population are shown in Table 2. All three cohorts included more women than men (64.4, 58.0, and 81.1% for 2013, 2019, and 2021, respectively). The sample of the year 2019 included younger participants (mean age 25.8 ± 0.52 (SE) years old), including most students, compared with the participants in the samples of the years 2013 and 2021 (mean age 53.5 ±0.65 (SE) years old and 40.9 ±2.18 (SE), respectively). Mean body mass index (BMI), based on self-reported weight and height, was 25.2 ±0.38 (SD) kg/m2 for the study population in 2013 and 23.7 ± 0.46 (SD) kg/m2 for the study population in 2021. Length and weight were not reported in the study population of 2019. All three cohorts included proportionally more participants with higher education of at least college or university level degree (54.0, 58.0, and 100.0% for 2013, 2019, and 2021, respectively).
3.2. Content validity
Content validity was assessed in 2019 in a subsample of 15 participants with a mean age of 29 ±1.45 (SE) years old including eight women and seven men. According to this evaluation, correctly estimated portion sizes mentioned in the FFQ by participants ranged from 53.3 to 92.3%. The portion sizes of the food groups “vegetables” and “potatoes” were least often estimated correctly by the participants (estimated correctly in 53.3% of participants for both the food groups). The portion sizes of the food groups “sweet spreads” and “sauce” were most often estimated correctly by 92.3 and 86.7% of the participants, respectively. Overestimation of the portion sizes mentioned in the FFQ was lowest for the food group “sweet spreads” (by 0% of participants) and highest for the food groups “meat” and “vegetables” (both by 33.3% of participants). The portion sizes of the food groups “sauce” and “meat spread” were never underestimated by participants, while the portion sizes of “chick peas,” “fish spread,” and “potatoes” were most often underestimated (by 36.4, 33.3, and 33.3% of participants, respectively). The content validity study disclosed further that specific words used in the phrasing of questions in the FFQ are open to multiple interpretations. Afterwards, the phrasing of questions and usage of words were changed according to the feedback received during the interviews in the content validity study.
3.3. Convergent validity
3.3.1. Macronutrients and micronutrients
In general, the validation studies in 2013, 2019, and 2021 show that the FFQ underestimates energy and absolute macronutrient intake. Table 3 presents the convergent validity parameters for absolute macronutrients and macronutrient densities and Table 4 for micronutrients and micronutrient densities for the validation studies. A mean difference in energy intake shows an underestimation of the FFQ compared with the food record (FR) of 468.02 kcal/day (24.0%), 599.81 kcal/day (30.8%), and 350.87 kcal/day (20.6%) in 2013, 2019, and 2021, respectively (Table 3). As for energy intake, the mean difference for absolute protein, total fat, saturated fat, total carbohydrate, and fiber intake decreases over time (2013 vs. 2021) (Table 3). Spearman's correlations between the FFQ and FR for energy and absolute macronutrient intake ranged from 0.05 to 0.35 in 2013, from 0.16 to 0.44 in 2019, and from 0.02 to 0.54 in 2021, indicating moderate correlation (Table 3). Bland–Altman plots indicate good agreement between the FFQ and the FR for absolute macronutrient intake except for energy intake (Figures 2, 3). The Bland–Altman plots confirm an underestimation of the FFQ compared with the FR and show a downward regression trend in the data points indicating bias increases with increasing intake (Figure 3). The results of 2019 and 2021 show an overestimation of all macronutrient density intakes except for saturated fatty acid (SFA) intake (Table 3). Mean differences in density intakes are highest for MUFA and polysaccharide densities in 2013 and 2019 (10.13 and 20.41% in 2013 and 10.76 and 20.09% in 2019, respectively). Spearman's correlations between the FFQ and the FR for macronutrient density intake indicate moderate correlation and ranged from 0.02 to 0.47 in 2013, from 0.05 to 0.42 in 2019, and from 0.17 to 0.49 in 2021 (Table 3). Bland–Altman plots show good agreement for all macronutrient density intakes of the FFQ compared with the FR (figures not shown). Mean differences in absolute micronutrient intake between the FFQ and the FR show an underestimation of the FFQ compared with the FR (Table 4). Low-to-moderate correlations were found for absolute micronutrient intake with the highest correlations found for calcium intake (r2 = 0.44 p = 0.001, r2 = 0.53 p = 0.000, r2 = 0.53 p = 0.001 in 2013, 2019, and 2021, respectively) (Table 4).
Table 3. Parameters of convergent validity for absolute intake of macronutrients according to the FFQ and food record.
Table 4. Parameters of convergent validity for absolute intake of micronutrients according to the FFQ and food record.
Figure 2. Bland–Altman plots for absolute protein intake measured by the FFQ compared with the 3-day food record in 2013 (A), 2019 (B), and 2021 (C).
Figure 3. Bland–Altman plots with regression lines for energy intake measured by the FFQ compared with the 3-day food record in 2013 (A), 2019 (B), and 2021 (C).
3.3.2. Food groups
The results of the validity analysis for food group intake are presented in Table 5. Mostly moderate-to-good correlations were found. The highest correlation was found for alcoholic beverages (r2 = 0.81 p < 0.001) in 2013, coffee (r2 = 0.84 p = < 0.001) in 2019, and juices and soda (r2 = 0.67 p = 0.001) in 2021. The FFQ both underestimates and overestimates the intake of certain food groups. The results show that meat intake was underestimated by the FFQ by 19.42 g/day in 2013, by 25.98 g/day in 2019, and by 6.20 g/day in 2021. Full-fat dairy product intake, in contrast, was overestimated by the FFQ by 1.29 g/day in 2013, by 18.20 g/day in 2019, and by 18.46 g/day in 2021. As represented in Figure 4, correlations for intake of some food groups were similar across the three validation studies in 2013, 2019, and 2021 which was the case for the food groups :“fruit,” “cheese,” “white bread,” and “sweet spreads.” For the food group “margarine,” a decreasing correlation was found across the 3 years. Correlations of other food groups are lower in the validation study of 2019 compared with 2013 and 2021, such as for “potatoes,” “wholegrain bread,” and “juices and soda” (Figure 4). The results for convergent validity analysis after the exclusion of non-consumers are shown in Table 5. Consumers-only analysis shows lower correlations for most food groups measured by the FFQ compared with the food record (Supplementary Table S3). Consumers-only analysis shows a decrease in mean difference for most food groups compared analyses including all participants. The food groups “vegetarian meat substitute” and “wholegrain pasta and rice” had the lowest number of consumers, while the food groups “vegetables,” “meat,” and “sweet snacks” had the highest number of consumers across the 3 years.
3.4. Ranking and misclassification
Cross-classification analysis by quartiles is presented in Tables 6, 7. Correct classification of absolute macronutrient intake by the FFQ into the same or adjacent quartile ranged from 60 to 78% in 2013, from 68 to 80% in 2019, and from 68.7 to 83.3% in 2021 (Table 6). Misclassification of absolute macronutrient intake was < 10%, except for monosaccharides and disaccharides (11.0%) in 2013, for PUFA (10.0%) in 2019, and for energy (11.9 %) in 2021. The validation study of 2013 showed acceptable agreement (κ ≥ 0.20) for absolute intake of total fat, SFA, MUFA, polysaccharides, and fiber. In 2019, an acceptable agreement was found for the absolute intake of protein and fiber, while in 2021, absolute intake of PUFA and all carbohydrates showed acceptable agreement between the FFQ and the food record (Table 6). Improvement in misclassification was seen between 2013 and 2021 for all macronutrient density intakes, except for SFA, MUFA, and PUFA densities. Agreement between ranking participants according to their intake between the methods increases for macronutrient densities compared with absolute macronutrient intake across the 3 years for total fat, SFA, and total carbohydrate. The highest weighted kappa coefficients for micronutrient intake were found for calcium in 2013 (κ = 0.24, p = 0.001) and 2019 (κ = 0.18, p = 0.000) and for calcium, phosphorus, and iron in 2021 (κ = 0.34, p = 0.002) for all three micronutrients (Table 7).
Table 6. Cross-classification analysis of Food Frequency Questionnaire compared with a 3-day food record for macronutrients.
Table 7. Cross-classification analysis of Food Frequency Questionnaire compared with a 3-day food record for micronutrients.
3.5. Reliability
The reliability of the FFQ was assessed in 2013 and 2019 by test–retest assessment of 84 participants in 2013 and 81 participants in 2019 (Tables 8, 9). Wilcoxon signed-rank tests show that there was no statistically significant difference between the two administrations of the FFQ in 2013 for macronutrient intake, macronutrient densities, and micronutrient intake, except for energy intake. Spearman's correlation coefficients between the two administrations show good reliability and range from 0.51 to 0.75 (p < 0.005) in 2013 and from 0.58 to 0.81 (p < 0.005) in 2019 (Table 7). Intra-class correlation coefficients show a similar trend. Both Spearman's correlation coefficients and intra-class correlations improve further between 2013 and 2019 for energy intake and intake of most absolute macronutrients. However, for all macronutrient density intakes, the correlation coefficients between the two administrations of the FFQ decreases in 2019 compared with 2013, except for protein, MUFA, and PUFA density (Table 7).
Table 8. Reliability of the Food Frequency Questionnaire assessed by test–retest in 2013 and 2019 for macronutrients.
Table 9. Reliability of the Food Frequency Questionnaire assessed by test–retest in 2013 and 2019 for micronutrients.
4. Discussion
4.1. Summary
In general, the FFQ underestimates energy intake and absolute macro- and micronutrient intake but shows good reliability and acceptable agreement compared with the food record. The estimation of macronutrient density by the FFQ resulted in a small overestimation compared with the food record. However, the overestimation is < 5% for most nutrient densities and is therefore considered acceptable. Low-to-moderate correlations were found for convergent validity of the FFQ compared with the food record for both macro- and micronutrients. However, the Bland–Altman plots provide better insight and showed good agreement between the FFQ and the food record, indicating acceptable accuracy of the FFQ taking the systematic bias into account. Interestingly, the Bland–Altman plots show a clear improvement in the FFQ over the years. In the first validation study in 2013, a strong downward regression trend was present revealing that underestimation increases with higher intake. In 2019, the FFQ was validated for its content and adapted accordingly, next to adding more specific food groups based on the new Flemish Food-Based Dietary Guidelines published in 2019. This could have contributed to the decreased downward regression between intake and underestimation seen on the 2019 Bland–Altman plots until the plot of data in 2021 shows a complete disappearance of downward regression trend and smaller limits of agreement. However, the cohort of 2021 had a considerably smaller sample size (n = 42) compared with the cohorts of 2013 and 2019 (n =1 00 and n =1 00, respectively), and therefore, the regression trend should be interpreted with caution. For assessing food group intake, the FFQ shows acceptable to good convergent validity but differs greatly between the food groups. Consumers-only analysis results in smaller correlation coefficients compared with validity analysis for all consumers. However, FFQs are not developed to assess absolute nutrient or food group intake but are rather used for ranking populations according to their intake, and misclassification is an important aspect to assess. Generally, misclassification of 10% or less is preferred, which has been achieved by the FFQ for most nutrients and food groups. However, Cohen's kappa values were generally low, which could be caused by chance agreement. Finally, the reliability of the FFQ was shown to be good indicating great precision of the FFQ.
4.2. Comparison with the literature
The results of the validation studies of our FFQ are in line with other similar validation studies of FFQs. However, comparison of validity results requires caution as these studies differ greatly depending on the questionnaire length, reference method used, number of administrations of the reference method, and study population, for example (6, 25). Moreover, validity is not always assessed together with reproducibility, cross-classification, or misclassification analysis. While some studies show that FFQs overestimate measurements, others show an underestimation, similar to our study (26–28). A meta-analysis by Cui et al. examined the validity of FFQs compared with both 24-h recalls and food records and concluded that the number of administrations of the reference method, gender, sample size, and reference period impacts the validity correlation (6). Steineman et al. assessed the convergent validity of a paper-based FFQ compared with a 4-day food record (26). The FFQ of Steineman et al. consisted of 127 items assessing the intake of 25 food groups in the past 4 weeks. This FFQ was considerably more extensive compared with our FFQ and differed by the use of three standard portion size options in their FFQ (small, medium, and large) and the use of nine answer options to indicate the frequency of consumption (26). However, correlation coefficients were similar to our results for nutrients and food intake and ranged from 0.27 to 0.55 and from 0.09 to 0.92, respectively (26). Bland–Altman plots presented by Steineman et al. showed a similar trend as presented in our Bland–Altman plots and the study of Mumu et al., where bias increases with increasing intake for both men and women (29). Moreover, it was shown that energy-dense foods are underestimated more and women underestimate more than men (26). The study population by Mumu et al. consisted of ~60% women which is similar to our study samples of 2013 and 2019 (65 and 58%, respectively) (26). Women were represented in the same ratio in the study population of the FFQ validation study of Eysteinsdottir et al., in which a short FFQ with 30 questions was validated for food group intake against a 3-day weighed food record in people of advanced age (mean age = 74 years) (28). Similar correlation coefficients were found for food group intake between the FFQ and the weighed food record and ranged from 0.05 to 0.71 for men and from 0.01 to 0.61 for women. Although generally lower correlation coefficients were found compared with our study, the highest correlation coefficients were found for a beverage, specifically tea, as well. Remarkably, beverages including coffee, juice, and alcoholic beverages showed the highest correlation coefficients between the FFQ and the food record in our validation study as well. This was shown in other studies as well and may be explained that portion sizes of beverages (i.e., a glass, a mug, and a bottle) are easier to estimate compared with solid foods such as potatoes or meat, indicating that the ability to estimate portion size impacts validation results significantly (30). However, correlation coefficients for water intake were high at first in 2013 but very low in 2019 and 2021 (Table 3). This is possibly due to the different registration methods of the food record. In 2013, a paper-based food record, including probing questions toward specific food categories, was used while in 2019 and 2021, the MyFitnessPal application was used as a dietary record. Although MyFitnessPal can be used as a food record to calculate nutrient intake, the MyFitnessPal app in our validation study was solely used to capture the type of foods consumed while nutrient calculations were based on the Belgian Food Composition Table (NUBEL 2009) (20, 31). However, the use of MyFitnessPal to register the type of consumed foods only yielded some shortcomings. Water intake and also cooking fat use were mostly forgotten to be registered in the mobile application used as food record, leading to unrealistically low consumptions of water intake in the food record and low correlation coefficients between the FFQ and the food record for those food groups. Additionally, MyFitnessPal has a default option to register complete meals (i.e., Caesar salad or pasta dishes) which lead to larger error in estimated portions sizes compared to registering the individual foods of which the meal consisted.
Mumu et al. investigated the validity of a 126-item FFQ compared with 24-h recalls and biomarkers (29). While correlation coefficients were higher for nutrient intake, misclassification was found to be 18% on average, which is significantly higher compared with our findings of 7.7, 7.8, and 5.9% misclassification on average in 2013, 2019, and 2021, respectively.
Cui et al. performed a meta-analysis on the reproducibility of Food Frequency Questionnaires and identified factors influencing reproducibility of Food Frequency Questionnaires (25). Pooled ICC of macronutrients ranged from 0.51 for starch to 0.78 for fiber, which are similar to our findings with ICCs in 2013 ranging from 0.53 for protein to 0.71 for monosaccharides and disaccharides. Remarkably, fiber was also the nutrient to have amongst the highest ICC (ICC = 0.70) in our study of 2013, and starch was the nutrient with lowest ICC of 0.64 in our study of 2019 as well.
4.3. Factors influencing the validity
However, most validation studies use 24-h recalls as a reference method and therefore cannot be compared directly. As 24-h recalls and FFQs are known to have correlated errors, this can explain higher correlation coefficients found in validation studies, using 24-h recalls as the reference method for the FFQ compared with food records. The meta-analysis of Cui et al. showed that correlation coefficients in validation studies using 24-h recalls are higher compared with studies using food records as a reference method. Moreover, Molag et al. showed that correlation coefficients increased when the reference method was used for longer periods, specifically 8 to 14 days (32). The food records in our validation studies consisted of 3-day records only. However, this choice is based on research showing that the quality and completeness of the food records decrease significantly after 4 days while the burden increases to register dietary intake for more days (1).
Another factor influencing the validity of the FFQ is the length of the questionnaire. Most validation studies include FFQs with more than 100 food items, while our FFQ is considerably shorter consisting of 33 food items. Molag et al. confirm that FFQs with more than 100 food items found generally higher correlation coefficients and perform better at ranking subjects, according to their intake (32). Block et al. highlight that reproducibility is higher too for shorter questionnaires (33). Moreover, the use of portion size questions (quantitative vs. qualitative) in FFQs seems to result in higher correlation coefficients as well compared with the use of predefined portions (32). Finally, categorization of mixed dishes in the food record into different food groups influences obtained energy and macronutrient intakes and may influence the outcome of convergent validity accordingly (34).
Marks et al. investigated how gender, age, and BMI influenced the convergent validity of the FFQ compared with a 12-day weighed food record (30). The bias of the FFQ compared with the food record was different for men and women for different foods. Moreover, the study by Marks et al. showed that age was associated with differences between the FFQ and the food record for specific foods including vegetables and cereals (product). Additionally, BMI was shown to be not significantly associated with such differences. However, difference between intakes measured by the FFQ compared with the food record increases with increasing BMI or age, which is a possible explanation for the higher difference between the FFQ and the food record seen in the Bland–Altman plots.
4.4. Changing validity over time
It is difficult to assess whether validation has improved over the years as the differences in sample size, study population characteristics (i.e., age), update of the FFQ, different registration method food record (paper-based FR vs. MyFitnessPal), and the changing dietary pattern during the past decades could explain different outcomes in validation as well. Although Spearman's correlation coefficients seem not to improve significantly for macronutrient intake, Bland–Altman plots do show less bias with increasing intake and smaller limits of agreement, indicating improved accuracy. On the contrary, convergent validity of the FFQ seemed to be improved for micronutrient intake, except for sodium, vitamin C, and vitamin D. Convergent validity of food group intake has decreased over the years, showing lower correlation coefficients and intake of most food groups being statistically differently measured by the FFQ compared with the food record. However, these results are probably influenced greatly by the changing dietary pattern and the different age groups, having different and more irregular dietary patterns with more inter-person and between-person variability in the young study population of 2019. Moreover, it can be argued that the food groups questioned in the FFQ are better resembling the “traditional” dietary pattern of the older study population in 2013 and less suitable for study populations of 2019 and 2021 having different dietary patterns and habits including more diverse and “new” foods. The ranking capability of the FFQ seemed similar across the years with good misclassification rates but fair- to low-weighed kappa values. Additionally, comparison between the different cohorts is difficult due to differences in sample size as misclassification in smaller sample sizes can cause large differences in kappa values (35). Reproducibility, on the contrary, seemed to be improved between 2013 and 2019, showing good reproducibility of the FFQ. However, the period between the two administrations is rather short (1 and 3 weeks, respectively) and can have contributed to higher Spearman's correlations and ICCs.
4.5. Strengths and limitations
The strength of this research is the ability to assess the validation in different sample populations while both convergent validity, reliability together with consumers-only analysis, and cross-classification have been evaluated extensively in different time periods. As diet is culture-specific, diet changes also over time. In total, 10 years ago, potatoes and bread were food groups consumed at almost every meal in a typical Belgian diet. However, lately, more “new” foods have been introduced into the Belgian dietary pattern and gained popularity. This was reflected in the dietary records, as well, with food records of 2013 including mostly “traditional” meals with potatoes or bread as the major food group while the latest food records showed a shifted dietary pattern including considerably less bread but more “on-trend” foods, such as oatmeal, avocado, quinoa, and chia seeds. This trend can also be reinforced by the difference in age of the cohorts in the three different validation studies of 2013, 2019, and 2021. In 2013, the cohort was older while the sample population of 2019 consisted mainly of young students and the age of the sample population of 2021 was more generally representative but had a small sample size and was overall highly educated (n = 42). The food records of the students showed greater irregularities in the dietary pattern including skipping meals (i.e., breakfast), higher intake of alcoholic beverages, fast food, sweet snacks, and ready-to-eat meals. This is also represented as this study population has the largest mean difference for energy intake between the FFQ and the food record, which is mainly contributed to the discrepancies for mono- and disaccharides. Nevertheless, the data of the different cohorts should be compared with caution. The cohort of 2019 and 2021 used an updated version of the FFQ including 32 food items compared with the original FFQ used in 2013 with 27 items.
Additionally, a different method as food record was used in 2013 compared with 2019 and 2021. While a paper-based food record was used in 2013, the studies of 2019 and 2021 have used the MyFitnessPal application as a food record. The use of MyFitnessPal introduced serious pitfalls mainly because the application is developed originally for the United States (U.S.) while our study participants are Belgian. Although the MyFitnessPal application is used in Dutch, it is still embedded with US culture-specific features including US foods or recipes (i.e., Caesar Salad) and measuring units (i.e., cups). Moreover, study participants used US measuring units such as cups to indicate portion size in MyFitnessPal while participants are most likely unfamiliar with US units as metric units are the standard in Belgium. The main advantage of MyFitnessPal is being feasible and easy-to-use for participants, while the disadvantage includes the appeal to register meals as a whole (i.e., “one Caesar salad” for lunch) instead of registering the amount of the separate foods contained in one meal (i.e., 100g of chicken, 100g of lettuce, and 30 g of Caesar dressing). This introduces additional measurement errors resulting in a less accurate reference method. Lastly, a paper-based FFQ was used in 2013, while a web-based version was used in the 2019 and 2021 validation studies. However, the research by Lai et al. and Al-Shaar et al. shows that paper-based versus web-based FFQs show comparable results (36, 37).
4.6. Conclusion
In conclusion, the current FFQ shows low validity for estimating absolute nutrient intake but acceptable validity for estimating nutrient density intake and food group intake. Moreover, the reproducibility of the FFQ is good and misclassification is low. Therefore, it is recommended to use the FFQ only to compare nutrient density intakes or food group intake of populations and should not be used to measure actual or absolute intake. The length of the FFQ is rather short which comprises its validity. Nevertheless, this does not outweigh its great feasibility for the researcher and low burden for the participant while being a quick and low-cost dietary assessment tool that can be used on a larger scale.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving human participants were reviewed and approved by Social and Societal Ethics Committee and Medical Ethical Committee UZ Leuven. The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author contributions
JV wrote the manuscript, analyzed and interpreted the data of 2013, 2019, and 2021, and set up the study of 2021. TB set up the study of 2019, interpreted the data of 2013, 2019, and 2021, and reviewed the manuscript. CM set up the study of 2013, 2019, and 2021, interpreted the data of 2013, 2019, and 2021, and reviewed the manuscript. All authors contributed to the article and approved the submitted version.
Funding
This research was funded by a Ph.D. fellowship Strategic Basic research grant (1S96721N) of Research Foundation Flanders (FWO). The data of 2019 were collected as part of a study supported by an FWO-TBM grant (T005417N).
Acknowledgments
The authors want to thank all participants of the study for contributing to this research. The master students Astrid Vermeulen and Gaëlle Buysschaert are acknowledged for their dedicated contribution in aiding in the recruitment of participants and collecting and cleaning the data of this research.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2023.1073559/full#supplementary-material
References
1. Rutishauser H. Dietary intake measurements. Public Health Nutr. (2005) 8:1100–7. doi: 10.1079/PHN2005798
2. Willet W. Nutritional epidemiology: issues and challenges. Int J Epidemiol. (1987) 16:312–7. doi: 10.1093/ije/16.2.312
3. Paeratakul S, Popkin B, Kohlmeier L, Hertz-Picciotto I, Guo X, Edwards L. Measurement error in dietary data: implications for the epidemiologic study of the diet-disease relationship. Eur J Clin Nutr. (1998) 52:722–7. doi: 10.1038/sj.ejcn.1600633
4. M. N. The Validation of Dietary Assessment. Oxford: Oxford University Press (1997). Available online at: https://www.oxfordscholarship.com/view/10.1093/acprof:oso/9780192627391.001.0001/acprof-9780192627391-chapter-8 (accessed January 07, 2020).
5. Cade J, Thompson R, Burley V, Warm D. Development, validation and utilisation of food-frequency questionnaires - a review. Public Health Nutr. (2002) 5:567–87. doi: 10.1079/PHN2001318
6. Cui Q, Xia Y, Wu Q, Chang Q, Niu K, Zhao Y. Validity of the food frequency questionnaire for adults in nutritional epidemiological studies: a systematic review and meta-analysis. Crit Rev Food Sci Nutr. (2021) 14:1–19. doi: 10.1080/10408398.2021.1966737
7. Lachat C, Hawwash D, Ocké MC, Berg C, Forsum E, Hörnell A, et al. Strengthening the reporting of observational studies in epidemiology-nutritional epidemiology (STROBE-nut): an extension of the STROBE statement. PLoS Med. (2016) 13:e1002036. doi: 10.1371/journal.pmed.1002036
8. Willett WC, Sampson L, Browne ML, Stampfer MJ, Rosner B, Hennekens CH, et al. The use of a self-administered questionnaire to assess diet four years in the past. Am J Epidemiol. (1988) 127:188–99. doi: 10.1093/oxfordjournals.aje.a114780
9. Matthys C, Pynaert I, De Keyzer W, De Henauw S. Validity and reproducibility of an adolescent web-based food frequency questionnaire. J Am Diet Assoc. (2007) 107:605–10. doi: 10.1016/j.jada.2007.01.005
10. Huybrechts I, De Bacquer D, Matthys C, De Backer G, De Henauw S. Validity and reproducibility of a semi-quantitative food-frequency questionnaire for estimating calcium intake in Belgian preschool children. Br J Nutr. (2006) 95:802–16. doi: 10.1079/BJN20051671
11. Bilau M, Matthys C, Bellemans M, De Neve M, Willems JL, De Henauw S. Reproducibility and relative validity of a semi-quantitative food frequency questionnaire designed for assessing the intake of dioxin-like contaminants. Environ Res. (2008) 108:327–33. doi: 10.1016/j.envres.2008.06.003
12. Beck KL, Kruger R, Conlon CA, Heath AL, Coad J, Matthys C, et al. The relative validity and reproducibility of an iron food frequency questionnaire for identifying iron-related dietary patterns in young women. J Acad Nutr Diet. (2012) 112:1177–87. doi: 10.1016/j.jand.2012.05.012
13. Matthys C, Pynaert I, Roe M, Fairweather-Tait SJ, Heath AL, et al. Validity and reproducibility of a computerised tool for assessing the iron, calcium and vitamin C intake of Belgian women. Eur J Clin Nutr. (2004) 58:1297–305. doi: 10.1038/sj.ejcn.1601965
14. Vandelanotte C, Matthys C, De Bourdeaudhuij I. Reliability and validity of a computerized questionnaire to measure fat intake in Belgium. Nutr Res. (2004) 24:621–31. doi: 10.1016/j.nutres.2003.12.008
15. Vlaams Instituut Gezond Leven Onderbouwing inhoudelijke visie voeding en gezondheid. Achtergronddocument bij vernieuwde richtlijnen en visuele voorstelling van de voedingsdriehoek. Laken (Brussel): Online: gezondleven.be (2017).
17. Patrick DL, Burke LB, Gwaltney CJ, Leidy NK, Martin ML, Molsen E, et al. Content validity–establishing and reporting the evidence in newly developed patient-reported outcomes (PRO) instruments for medical product evaluation: ISPOR PRO Good Research Practices Task Force report: part 2–assessing respondent understanding. Value Health. (2011) 14:978–88. doi: 10.1016/j.jval.2011.06.013
18. Gaëlle B. De betrouwbaarheid en validiteit van een voedselgeletterdheid en voedselfrequentie vragenlijst voor de Belgische populatie van reproductieve leeftijd. Leuven: KU Leuven (2019).
21. Willett WC. Chapter 13: issues in analysis and interpretation of dietary data. In:Willett WC, , editor. Nutritional Epidemiology. 3rd ed. New York, NY: Oxford University Press (2013).
22. Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. (2016) 15:155–63. doi: 10.1016/j.jcm.2016.02.012
23. Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. New York, NY: Routledge (1988).
24. Cicchetti DV. Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychol Assess. (1994) 6:284–90. doi: 10.1037/1040-3590.6.4.284
25. Cui Q, Xia Y, Wu Q, Chang Q, Niu K, Zhao Y. A meta-analysis of the reproducibility of food frequency questionnaires in nutritional epidemiological studies. Int J Behav Nutr Phys Act. (2021) 18:12. doi: 10.1186/s12966-020-01078-4
26. Steinemann J, Grize L, Ziesemera K, Kaufe P, Probst-Hensch N, Brombacha C. Relative validation of a food frequency questionnaire to estimate food intake in an adult population. Food Nutr Res. (2017) 61:1305193. doi: 10.1080/16546628.2017.1305193
27. Paalanen L, Männistö S, Virtanen M, Knekt P, Räsänen L, Montonen J, et al. Validity of a food frequency questionnaire varied by age and body mass index. J Clin Epidemiol. (2006) 59:994–1001. doi: 10.1016/j.jclinepi.2006.01.002
28. Eysteinsdottir T, Thorsdottir I, Gunnarsdottir I, Steingrimsdottir L. Assessing validity of a short food frequency questionnaire on present dietary intake of elderly Icelanders. Nutr J. (2012) 11:12. doi: 10.1186/1475-2891-11-12
29. Mumu S, Merom D, Ali L, Fahey P, Hossain I, Fazlur Rahman A, et al. Validation of a food frequency questionnaire as a tool for assessing dietary intake in cardiovascular disease research and surveillance in Bangladesh. Nutr J. (2020) 19:42. doi: 10.1186/s12937-020-00563-7
30. Marks G, Hughes M, van der Pols J. Relative validity of food intake estimates using a food frequency questionnaire is associated with sex, age, and other personal characteristics. J Nutr. (2006) 136:459–65. doi: 10.1093/jn/136.2.459
31. Evenepoel C, Clevers E, Deroover L, Van Loo W, Matthys C, Verbeke K. Accuracy of nutrient calculations using the consumer-focused online app myfitnesspal: validation study. J Med Internet Res. (2020) 22:e18237. doi: 10.2196/18237
32. Molag M, de Vries J, Ocké M, Dagnelie P, van den Brandt P, Jansen M. Design Characteristics of food frequency questionnaires in relation to their validity. Am J Epidemiol. (2007) 166:1468–78. doi: 10.1093/aje/kwm236
33. Block G, Hartman A. Issues in reproducibility and validity of dietary studies. Am J Clin Nutr. (1989) 50:1133–8. doi: 10.1093/ajcn/50.5.1133
34. Krebs-Smith SM, Cronin FJ, Haytowitz DB, Cook DA. Contributions of food groups to intakes of energy, nutrients, cholesterol, and fiber in women's diets: effect of method of classifying food mixtures. J Am Diet Assoc. (1990) 90:1541–6. doi: 10.1016/S0002-8223(21)01837-X
35. Maclure M, Willett WC. Misinterpretation and misuse of the kappa statistic. Am J Epidemiol. (1987) 126:161–9. doi: 10.1093/aje/126.2.161
36. Al-Shaar L, Yuan C, Rosner B, Dean SB, Ivey KL, Clowry CM, et al. Reproducibility and validity of a semiquantitative food frequency questionnaire in men assessed by multiple methods. Am J Epidemiol. (2021) 190:1122–32. doi: 10.1093/aje/kwaa280
Keywords: nutrition assessment, dietary assessment, methodology, reliability, nutrition survey, food intake
Citation: Verbeke J, Boedt T and Matthys C (2023) Development and validity of a short web-based semi-quantitative Food Frequency Questionnaire applicable in both clinical and research setting: an evolution over time. Front. Nutr. 10:1073559. doi: 10.3389/fnut.2023.1073559
Received: 18 October 2022; Accepted: 18 April 2023;
Published: 17 May 2023.
Edited by:
Angela M. Zivkovic, University of California, Davis, United StatesReviewed by:
José Camolas, Centro Hospitalar Lisboa Norte (CHLN), PortugalNarendra Kumar Arora, INCLEN Trust, India
Copyright © 2023 Verbeke, Boedt and Matthys. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Christophe Matthys, christophe.matthys@uzleuven.be