- 1Departamento de Física y Matemáticas, Universidad Iberoamericana Ciudad de México, Ciudad de México, Mexico
- 2Departamento de Salud, Universidad Iberoamericana Ciudad de México, Ciudad de México, Mexico
- 3Laboratorio de Envejecimiento Saludable, Centro de Investigación Sobre Envejecimiento (CIE-CINVESTAV Sur), Instituto Nacional de Medicina Genómica, Ciudad de México, Mexico
Introduction: Food Exchange Lists (FELs) are a user-friendly tool developed to help individuals aid healthy eating habits and follow a specific diet plan. Given the rapidly increasing number of new products or access to new foods, one of the biggest challenges for FELs is being outdated. Supervised machine learning algorithms could be a tool that facilitates this process and allows for updated FELs—the present study aimed to generate an algorithm to predict food classification and calculate the equivalent portion.
Methods: Data mining techniques were used to generate the algorithm, which consists of processing and analyzing the information to find patterns, trends, or repetitive rules that explain the behavior of the data in a food database after performing this task. It was decided to approach the problem from a vector formulation (through 9 nutrient dimensions) that led to proposals for classifiers such as Spherical K-Means (SKM), and by developing this idea, it was possible to smooth the limits of the classifier with the help of a Multilayer Perceptron (MLP) which were compared with two other algorithms of machine learning, these being Random Forest and XGBoost.
Results: The algorithm proposed in this study could classify and calculate the equivalent portion of a single or a list of foods. The algorithm allows the categorization of more than one thousand foods with a confidence level of 97% at the first three places. Also, the algorithm indicates which foods exceed the limits established in sodium, sugar, and/or fat content and show their equivalents.
Discussion: Accurate and robust FELs could improve implementation and adherence to the recommended diet. Compared with manual categorization and calculation, machine learning approaches have several advantages. Machine learning reduces the time needed for manual food categorization and equivalent portion calculation of many food products. Since it is possible to access food composition databases of various populations, our algorithm could be adapted and applied in other databases, offering an even greater diversity of regional products and foods. In conclusion, machine learning is a promising method for automation in generating FELs. This study provides evidence of a large-scale, accurate real-time processing algorithm that can be useful for designing meal plans tailored to the foods consumed by the population. Our model allowed us not only to distinguish and classify foods within a group or subgroup but also to perform the calculation of an equivalent food. As a neural network, this model could be trained with other food bases and thus improve its predictive capacity. Although the performance of the SKM model was lower compared to other types of classifiers, our model allows selecting an equivalent food not from a group previously classified by machine learning but with a fully interpretable algorithm such as cosine similarity for comparing food.
Highlights
• ML is a method for automation to generate FELs.
• ML algorithm allows correct food categorization to FELs.
• ML algorithm reduces the time needed for food categorization.
• ML algorithm allows the calculation of an equivalent food.
1. Introduction
The Food Exchange List (FEL) arises from the need to offer a simple didactic tool to give food variety to the individual diet of patients with type 2 diabetes (T2D) (1). Because of their usefulness, practitioners in nutrition and dietetics have been using them in menu planning and nutritional education of patients, especially those with metabolic diseases such as diabetes, obesity, and cancer, among others (2). The first edition was published in 1950 and was developed by the American Dietetic Association, the American Diabetes Association, and the United States Public Health Service (3). In Mexico, it began to be used in the 1970s as a translation of the American list. In 1988, the FEL was adapted to include typical foods of the culinary and gastronomic customs of the Mexican population [Mexican System of Equivalent Foods (SMAE)] (4, 5).
The SMAE is based on the grouping of foods proposed in the Mexican Official Standard “NOM-043-SSA2-2012, Basic health services. Food safety promotion and education. Criteria for the provision of guidance” and the concept of “Equivalent Food” (6, 7). An “Equivalent Food” is a portion that approximately contributes the same macronutrient (energy, protein, carbohydrate, and fat) value to those of its same group of foods in quality and quantity, which allows them to be interchangeable without significant differences in dietary intakes of patients (8, 9). The SMAE is grouped into the usual exchange categories, but some subgroups are proposed based on the secondary macronutrient contents (e.g., different sugar, fat, and protein amounts) (5).
Given the changes in the population’s eating patterns, changes in the food marketplace, the permanent innovation of new products, and globalization, it is considered a latent need to analyze and update the FELs, verify the information, and add greater detail to the foods included (10, 11). Distinguishing and classifying food within a group or subgroup to finally offer the user the equivalent portion can be a challenging and resource-intensive task since it is, at this time, a manual process, albeit necessary for a better understanding of foods and diets.
Machine learning (ML) has become dominant, especially when text datasets are on large scales, such as in computer science, medical science, healthcare, and agriculture (12, 13). The typical description of these methods is that they exploit the amount of data available due to their ability to model non-linear relationships and high-level interactions (14, 15). Neural Networks are among the most powerful (and popular) algorithms used for classification. They take inputs as vectors, perform some computations, and then produce an output vector. The output vector is then compared with the ground truth labels, and the weights are tweaked (i.e., trained) to yield better results. To train the Neural Network, we feed our input data in feature vectors representing the data’s important gist. Neural networks (NNs) computing systems allow computers to learn from experience and understand the world through a hierarchy of concepts, each defined by its relation to more straightforward concepts (16). By gathering knowledge from experience, this approach avoids the need for scientists to specify all the knowledge and rules that the computers previously needed. The hierarchy of concepts enables the computer to learn complicated concepts by building them out of simpler ones (17).
Most studies on classifying foods using deep learning employ pictures, images, spectroscopic, hyperspectral signals, and mass spectrometry data (18–22). An ML approach has also been used to predict added sugar and fiber content using nutrient information packages (23, 24). Recently, Ma et al. achieved up to 99% accuracy for food classification and 0.93 ~ 0.97 for calories, protein, sodium, carbohydrate, and lipids estimation using a generic deep-learning-based technique (25).
The present study aimed to generate an algorithm using artificial neural networks and how we arrived at it to predict the classification of foods, calculate the equivalent portion for the most similar foods based on cosine similarity, and indicate foods with high sodium, sugar, or fat content.
2. Materials and methods
2.1. Dataset
The Mexican food exchange list had six phases:
1. Obtain the nutritional composition per 100 g from food composition databases (26, 27).
2. Calculate the portion of foods using household measurements (240 mL cup, 15 mL tablespoon, 5 mL teaspoon, 16 tablespoon cup, three teaspoon tablespoon, medium piece).
3. Classification of foods into groups and subgroups.
4. Definition of the key nutritional component and its quantity for each group (Table 1).
5. Calculate the food portions of each sub-group in weight (g) and household measures like cups and others.
The portion needs to be adequate to obtain the value of the key nutritional component (on average ± 2 standard deviations) and reasonable for use as a household measure: rounding the weight in grams to the nearest five or zero for high-moisture foods such as fruits, vegetables, animal foods, cooked cereals, and cooked legumes.
The weight is indicated in grams for dry or raw foods such as cereals and legumes without rounding (6). Classification of foods in other sub-groups according to the content of other nutritional components, for example, cereals, meats, and milk with different fat content; milk with high sugar content; oils and fats with different protein content and sugars with different gauze content in Table 2, nine variables are presented, including the macronutrients, which are contained in most food categories. Also, the units are included in this table, being all the variables positive scalars. Is important to mention that the total amount of data with which the study was carried out was 2,877 data. Using 2,201 data for training and 576 data for testing, which corresponds to 80 and 20%, respectively. In the case of the XGBoost algorithm 50% was used of the whole dataset for training, 25% for the testing, to find the best parameters and the rest, 25%, used for the validation.
An important consideration regarding the code is that variables such as sodium and cholesterol were not defined in the database for some foods because measurements of these two variables were irrelevant to the nutrition label of those foods: therefore, this equivalent to filling in the missing data with zero.
At the algorithm input, the carbohydrate and lipid variables were broken into more nutrients, respectively:
and
3. Dimensionless
The first step in considering the input data was to dimensionless it as follows:
Where is the number of the foods and is the nutrient of the food .
3.1. Similarity and equivalence factor
After dimensionless, normalization is one of the fundamental processes before running the code. It is already known that the nutrient of the database is referred to as 100 g, but this does not mean that foods should be immediately compared with each other, and a mean square error is not enough for that. It is necessary to build equivalent foods, starting from the following rule:
Where and are food vectors in the space of nutrients, where each of the nutrients considered is one dimension, is the optimal equivalence factor that multiplies the components of , bringing the food as close as possible to according to the definition of an equivalent food.
Given the vector origin of the food, the following equivalence function that satisfied the properties mentioned before is proposed by the standard household similarity measure.
Is given by,
The similitude between and is equal to the projection of the unit vector of on the unit vector . Thus, the formulation of the similarity between foods was based on the nutrient concentration and not on distance, as presented in other works (25). This formula has advantages since it does not depend on the amount of both foods but on the concentration; the amount needed to scale from food B to A in terms of units of B is defined as follows:
The most optimal scalar is calculated in the dimensionless space, and this does not imply that it is the same, nor that it leads to the closest point of food A in the nutrient space since having different units causes distributions in different ranges, as an example, a difference of 50 kcal has not the same impact to the consumer as a difference of 50 g of saturated fat for a 100-gram food.
3.2. Classification
Once the tool has been created to compare the similarity between foods, the representative centroids of each food group were placed using the Spherical K-means (SKM) algorithm which is the most common algorithm of aggrupation that is used in this kind of analysis (28). This algorithm has been studied for many decades, and the main objective of SKM is to minimize the differences between each group and maximize the differences between clusters. The following formula gives the classification:
Where is the centroid which represents the category.
The above is analog to a single layer neural network with no activation function and with the proviso that the weights will be normalized, so you can remove all these constraints and treat it as a multilayer perceptron (MLP) neural network, see Figure 1.
Figure 1. Map of the Spherical K-means algorithm represented in nodes and the map of the neural networks.
As a result of this process and the normalization of the input data, Figure 2 was obtained, where the distributions of the nourishment components are displayed given the segmentation proposed by SMAE. This result was also the input for the training of the MLP.
Figure 2. Color map of the distribution of nourishment components for color segmentation of the different food groups.
Figure 2 shows that each nutrition group has a well-defined and different distribution for each nutritional component, indicating a good segmentation. For example, in the sugar nutrient, the sugars groups with and without fat stand out for the high concentration of their distribution, the same for protein, where the foods of animal origin stand out, mainly Meat: Fish and Poultry have very low-fat content. Some information was lost during the dimensionless food vector, so its magnitude was introduced as a new variable in the input vector that could help the neural network classify; otherwise, it will be discarded during the learning process.
The results of MLP model have been compared with two algorithms of machine learning to evaluate which of multiple options can be used to obtain better results; these two algorithms are Random Forest and XGBoost. On the one hand, Random Forest, several numbers of estimators, called trees, were tried. Also, different maximum tree depth was tried, which is a limit to stop the splitting of nodes when the specified tree depth has been reached during the creation of the initial decision tree. On the other hand, For XGBoost, a search for the best parameters was performed using the test dataset.
3.3. Model
Once data treatment, the MLP model was selected according to the conditioning of the variables; the chosen approach was SKM and MLP for interpretability purposes (25) and several topologies were tried to build the MLP model (Figure 3).
Figure 3. Diagram of the MLP model, being layer 1 and 2 the hidden layers of the neural network. For the SKM layer 1 and 2 were eliminated and the transfer functions were modified between each layer.
3.4. Cut points
Additional to the prediction of the nutritional group of each food, a function was added to compute the cut points of the sodium, sugar and/or fat food content (Table 3), using the methodology according to the Mexican Official Standard, NOM-051-SCFI/SSA1-2010 (29).
4. Results
To evaluate the performance of different algorithms, three models were tested using stratified cross-validation with 5 splits. The algorithms under consideration were MLP, Random Forest, and XGBoost. By employing this validation technique, the models were assessed and compared based on their respective accuracy percentages (Figure 4B). The most important thing is accuracy that the three algorithms bring which one could be more useful for this kind of study. The mean accuracy for the algorithms is reported in Table 4.
Figure 4. (A) Accuracy distribution of the MLP model for training and validation sets for all 19 categories. (B) The comparison between each model.
Upon analyzing these results, it is evident that both Random Forest and XGBoost exhibit higher performances compared to MLP. The Random Forest algorithm achieves an accuracy of 86.03%, while XGBoost surpasses it slightly with an accuracy of 86.12%. These findings suggest that both Random Forest and XGBoost are more effective in addressing the problem at hand, outperforming MLP in terms of accuracy.
It is important to note that while XGBoost and Random Forest showcase similar accuracy levels, the slight advantage of XGBoost implies that it may be a preferable choice when accuracy is the primary evaluation metric. However, further analysis is required to determine if the difference in accuracy between these two models is statistically significant.
These updated results highlight the potential of ensemble-based methods, such as Random Forest and XGBoost, in achieving higher accuracies in this type of problem. It is worth noting that the specific characteristics and requirements of the dataset should also be considered when selecting the most suitable algorithm for practical applications.
For the MLP model was necessary to observe the model’s performance during the training phase (Figure 5B). And get the graph of the assigned place of the valid group (Figure 5A).
Figure 5. MLP Model (A) assigned place of the true group. (B) Loss of the training and validation datasets.
The prediction model (MLP) for the food category was trained with 80% of the database, which considers 2,877 registers of food of different categories. This model gets 97.83% accuracy during the training. Meanwhile, during the validation process, we get 97.05% accuracy (Table 5).
Regarding the food classification (Table 6), algorithm performance is shown for each category of foods and their size during the training and validation process.
In Figure 4A, the distribution precision of the algorithm for the training and validation sets is shown in Figure 4B, the accuracy of three algorithms is shown being Random Forest and XGBoost the best algorithms to deal with this case presented in this manuscript.
5. Discussion
According to statistics, 671 million and 439 million of the global population suffer from obesity and T2D, respectively (30). Who also estimated that 4.2 million deaths were related to these diseases, expecting the numbers to increase constantly. Many cross-sectional, prospective, and retrospective studies have found significant associations between nutrients, foods, and dietary patterns in preventing and managing T2D and obesity (31). Thus, innovative healthcare for them is rapidly drawing public attention. For instance, those sensitive to food intake and weight changes may want to keep track of the calories and amounts of carbohydrates, proteins, fats, and other nutrients (20). One approach used to provide that information in a simple and accessible way is by FELs (32). However, distinguishing and classifying food within a group and later calculating the equivalent portion can be challenging and resource-intensive since it is manual. Besides the growing number of food composition datasets and the changes in the eating patterns of the population and the permanent appearance of new foods, it is considered a latent need to analyze and update the FELs, adding more detail about the foods included (33). The preceding makes evident the need for advanced mathematical analysis in this field, which classifies food and offers us the equivalent portion (34). Thus, a methodology that is accurate for classification and interpretable can help create artificial intelligence models for designing healthy diets that meet specific nutritional requirements.
ML is how computers learn to do specific things without being specifically programmed. This happens through algorithms, sets of rules a computer follows to reach a goal: prediction, classification, or clustering (35). A subfield of artificial intelligence, supervised machine learning algorithms can learn from training data to develop a function that can model the relation between input variables (e.g., nutrients) and an outputs variable (e.g., classification into groups, equivalent portion) (36).
Therefore, we generate an algorithm using artificial neural networks to classify and calculate the equivalent portion. In the present study, once the tool has been created to compare the similarity between foods, we obtained the interpretability of each of the weights by using the flexibility of the SKM model to make a simple tool for weighting variables to predict the group category. Subsequently, using the vector formulation and the similarity equation, the foods with the most significant similarity with the target food were found within the database, and the equivalence factor of each one was calculated. This equivalence would be more exact in terms of nutrients if the factors of more than two foods could be optimized to reach a single target,
Where the foods , and belong to the same category with a previous classification.
The main reason for the idea of use of Spherical K-means, SKM, is the implementation of centroids and the cosine similarity. In this case the centroids were obtained as the means of the nutrients in each group from the initially defined set.
As observed, the performance of SKM model and even neural networks was not as high as the performance of XGBoost and Random Forest classifiers. However, the objective of these exchangeable lists is to provide an equivalent food, so it is proposed to omit the classification step since the nature of these algorithms allow little interpretability. This opens the alternative of selecting an equivalent food not from a group previously classified by machine learning, but with a completely interpretable algorithm such as cosine similarity for comparing foods.
On the other hand, multilayer perceptron (MLP) is a complement to feed forward the neural network. MLP consists of three layers: input, output (Food groups in our case), and hidden layer. Being the hidden layers, the ones that carry out the computational work of the multiple perceptrons. MLP is usually used for recognition, approximation, prediction, and pattern classification, which is used in our case. However, the information it has received to be trained is limited. In effect, it does not have the food’s origin so it can make an incorrect classification, but it was evident to humans.
FELs must be updated to introduce new foods or adapt them to specific countries or populations (4) since one of the challenges of the FELs is the scarcity of regional food and the need for up-to-updating. Using up to date FELs with local/regional foods will allow the designed diet to gain greater acceptance with a better chance of successful implementation and avoid adherence-related issues due to foods being limited to the food exchange list (37, 38). The model proposed in this study can classify and calculate the equivalent portion of a single or a list of foods. Due to the similarity formula, our model has enough flexibility to quickly compare a complete set of foods from a previously classified category. Therefore, it offers a helpful, agile, and versatile tool for the dietitian and the patient. Also, our model uses normalized input data, often used in neural networks, to speed up the training and improve the neural networks (39), thus allowing, even with a 3% error in the prediction of the food category, to give us a group of foods that offer a similar nutritional composition.
On the other hand, FELs should not only focus on macronutrients and the energy content of food; they should also consider the content of other nutrients such as sodium, added sugars, and cholesterol, among others (40). These nutrients are relevant because high intakes of sodium, cholesterol, and added sugar have been associated with an increased risk of developing hypertension, cardiovascular disease, obesity, insulin resistance, fatty liver, and type 2 diabetes (41). In addition to being a tool for designing eating plans, the SMAE indicates when the contribution of a nutrient in an equivalent portion is considered high in sodium, cholesterol, or sugar (29). It has been observed that a low-sodium diet prepared using an exchange list was more effective than the one designed using food composition tables (4).
Interestingly, the algorithm’s prediction had the worst performance during the validation with the legume group, and the second one with the worst performance during the training was the sugar with the fat groups. This could be because the characteristics of nutrients are very similar between each food analyzed, and this means that the neural network needs to have adequate training and that at the time of doing the test, it could perform better.
Finally, calculating the similarity coefficient, which is a much more flexible method compared with others (42), in the dimensionless space (43) results in the closeness in this space but not in the nutritional component space; working in this space has the advantage of eliminating the dependence on units, such as kcal, g, and mg, being able to calculate a very close approximation to what is expected and not the nearest position.
Ultimately, more is needed to have robust FELs generated with ML; it is also necessary that the information be transmitted in a didactic way to the patients. Bawadi et al. points out that using a human-friendly food exchange list can be implemented for people with low literacy, as it is based on visual techniques (32).
Is necessary to mention that this technique that is proposed in this manuscript has limitations. Being one of the most important that has to do with the database, which could have nutritional elements that are not known or have not been measured in new foods, this could cause a predisposition in the classification of the food. Although is a problem, it is not something common to see eating processed foods, which are regularly very well studied from the nutritional point of view. Something interesting that was developed in this work is the matter of dimensioning and normalizing the data, although it is additional process that takes a little more time, the advantage lies in comparing the nutritional concentration and not in distances for being a vector.
In conclusion, machine learning is a promising method for automation in generating FELs. This study provides evidence of a large-scale, accurate real-time processing algorithm that can be useful for designing meal plans tailored to the foods consumed by the population. Our model allowed us not only to distinguish and classify foods within a group or subgroup but also to perform the calculation of an equivalent food. As a neural network, this model could be trained with other food bases and thus improve its predictive capacity. Although the performance of the SKM model was lower compared to other types of classifiers, our model allows selecting an equivalent food not from a group previously classified by machine learning but with a fully interpretable algorithm such as cosine similarity for comparing food.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://github.com/jovanidhh/SMAE-NN/tree/main.
Author contributions
DH-H: formal analysis, data curation, and writing—original draft preparation. AP-L: investigation, methodology, and writing—review. BP-G and GM-L: conceptualization, methodology, and writing—review and editing. All authors contributed to the article and approved the submitted version.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Caso, EK, Calculation of Diabetic Diets . Report of the committee on diabetic diet calculations, American dietetic Association.1 prepared cooperatively with the committee on education, American Diabetes Association, and the diabetes branch, U.S. Public Health Service. J Am Diet Assoc. (1950) 26:575–83. doi: 10.1016/S0002-8223(21)30353-4
2. Sidahmed, E, Cornellier, ML, Ren, J, Askew, LM, Li, Y, Talaat, N, et al. Development of exchange lists for Mediterranean and healthy eating diets: implementation in an intervention trial. J Hum Nutr Diet. (2014) 27:413–25. doi: 10.1111/jhn.12158
3. Martínez-Sanz, JM, Menal-Puey, S, Sospedra, I, Russolillo, G, Norte, A, and Marques-Lopes, I. Development of a sport food exchange list for dietetic practice in sport nutrition. Nutrients. (2020) 12:2403. doi: 10.3390/nu12082403
4. Khan, MN, Kalsoom, S, and Khan, AA. Food exchange list and dietary management of non-Communicable Diseases in cultural perspective. Pak J Med Sci. (2017) 33:1273–8. doi: 10.12669/pjms.335.13330
5. Pérez-Lizaur, AB, and Palacios-González, B. Sistema Mexicano de Alimentos Equivalentes. 4th ed. Mexico: FNS (2014). 160 p.
6. Norma Oficial Mexicana DOdlF . Norma Oficial Mexicana, NOM-043-SSA2-2012. Servicios básicos de salud. Promoción y educación para la salud en materia alimentaria. Criterios para brindar orientación. Mexico: Diario Oficial de la Federación. (2012).
7. Wheeler, ML . Nutrient database for the 2003 exchange lists for meal planning. J Am Diet Assoc. (2003) 103:894–920. doi: 10.1016/S0002-8223(03)00376-6
8. Marques-Lopes, I, Menal-Puey, S, Martínez, JA, and Russolillo, G. Development of a Spanish food exchange list: application of statistical criteria to a rationale procedure. J Acad Nutr Diet. (2018) 118:1161–70. doi: 10.1016/j.jand.2017.04.010
9. Benezra, LM, Nieman, DC, Nieman, CM, Melby, C, Cureton, K, Schmidt, DAN, et al. Intakes of most nutrients remain at acceptable levels during a weight management program using the food exchange system. J Am Diet Assoc. (2001) 101:554–61. doi: 10.1016/S0002-8223(01)00138-9
10. Ma, P, Li, A, Yu, N, Li, Y, Bahadur, R, Wang, Q, et al. Application of machine learning for estimating label nutrients using USDA global branded food products database, (BFPD). J Food Compos Anal. (2021) 100:103857. doi: 10.1016/j.jfca.2021.103857
11. Cáceres, P, Lataste, C, Uribe, D, Herrera, J, and Basfi-Fer, K. Sistema de porciones de intercambio de alimentos en Chile y el mundo: Historia, usos y consideraciones. Revista chilena de nutrición. (2020) 47:484–92. doi: 10.4067/S0717-75182020000300484
12. Kiourt, C, Pavlidis, G, and Markantonatou, S. Deep learning approaches in food recognition. (2020). arXiv:2004.03357. doi: 10.48550/arXiv.2004.03357
13. Bhatia, M, Sharma, S, Hooda, M, and Debnath, NC. A machine learning approach toward meal classification and assessment of nutrients value based on weather conditions In: P Tanwar, V Jain, C-M Liu, and V Goyal, editors. Big data analytics and intelligence: a perspective for health care. Bingley: Emerald Publishing Limited (2020). i–xviii.
14. Sak, J, and Suchodolska, M. Artificial intelligence in nutrients science research: a review. Nutrients. (2021) 13:322. doi: 10.3390/nu13020322
15. Rajkomar, A, Dean, J, and Kohane, I. Machine learning in medicine. N Engl J Med. (2019) 380:1347–58. doi: 10.1056/NEJMra1814259
16. DeGregory, KW, Kuiper, P, DeSilvio, T, Pleuss, JD, Miller, R, Roginski, JW, et al. A review of machine learning in obesity. Obes Rev. (2018) 19:668–85. doi: 10.1111/obr.12667
17. Cui, XR, Abbod, MF, Liu, Q, Shieh, J-S, Chao, TY, Hsieh, CY, et al. Ensembled artificial neural networks to predict the fitness score for body composition analysis. J Nutr Health Aging. (2011) 15:341–8. doi: 10.1007/s12603-010-0260-1
18. Ma, P, Lau, CP, Yu, N, Li, A, and Sheng, J. Application of deep learning for image-based Chinese market food nutrients estimation. Food Chem. (2022) 373:130994. doi: 10.1016/j.foodchem.2021.130994
19. Sowah, RA, Bampoe-Addo, AA, Armoo, SK, Saalia, FK, Gatsi, F, and Sarkodie-Mensah, B. Design and development of diabetes management system using machine learning. Int J Telemed Appl. (2020) 2020:1–17. doi: 10.1155/2020/8870141
20. Ahn, D, Choi, J-Y, Kim, H-C, Cho, J-S, Moon, K-D, and Park, T. Estimating the composition of food nutrients from hyperspectral signals based on deep neural networks. Sensors. (2019) 19:1560. doi: 10.3390/s19071560
21. VijayaKumari, G, Vutkur, P, and Vishwanat, P. Food classification using transfer learning technique. Glob Transit Proc. (2022) 3:225–9. doi: 10.1016/j.gltp.2022.03.027
22. Lo, FPW, Sun, Y, Qiu, J, and Lo, B. Image-based food classification and volume estimation for dietary assessment: a review. IEEE J Biomed Health Inform. (2020) 24:1926–39. doi: 10.1109/JBHI.2020.2987943
23. Davies, T, Louie, JCY, Ndanuko, R, Barbieri, S, Perez-Concha, O, and Wu, JHY. A machine learning approach to predict the added-sugar content of packaged foods. J Nutr. (2022) 152:343–9. doi: 10.1093/jn/nxab341
24. Davies, T, Louie, JCY, Scapin, T, Pettigrew, S, Wu, JH, Marklund, M, et al. An innovative machine learning approach to predict the dietary Fiber content of packaged foods. Nutrients. (2021) 13:3195. doi: 10.3390/nu13093195
25. Ma, P, Zhang, Z, Li, Y, Yu, N, Sheng, J, Küçük McGinty, H, et al. Deep learning accurately predicts food categories and nutrients based on ingredient statements. Food Chem. (2022) 391:133243. doi: 10.1016/j.foodchem.2022.133243
26. Morales de León, JC, Bourges-Rodríguez, H, and Camacho-Parra, ME. Tables of composition of Mexican foods and food products (condensed version 2015). Mexico: National Institute of Medical Sciences and Nutrition Salvador Zubirán, Direction of Nutrition, Depatment of Food Science and Technology (2016).
27. US Department of Agriculture (USDA), Agricultural Research Service, Nutrient Data Laboratory . USDA National Nutrient Database for Standard Reference, Legacy [Internet]. (2022). Available from: http://www.ars.usda.gov/nutrientdata (Acessed May 3, 2022).
28. Schubert, E, Lang, A, and Feher, G. Accelerating spherical k-means. Dortmund, Germany: Springer International Publishing (2021). 217–231.
29. Norma Oficial Mexicana, NOM-051-SCFI/SSA1-2010 . Especificaciones generales de etiquetado para alimentos y bebidas no alcohólicas preenvasados-información comercial y sanitaria. Mexico: Diario Oficial de la Federación (2010).
30. Wang, W, Liu, Y, Li, Y, Luo, B, Lin, Z, Chen, K, et al. Dietary patterns and cardiometabolic health: clinical evidence and mechanism. MedComm. (2023) 4:e212. doi: 10.1002/mco2.212
31. Sami, W, Ansari, T, Butt, NS, and Hamid, MRA. Effect of diet on type 2 diabetes mellitus: a review. Int J Health Sci. (2017) 11:65–71.
32. Bawadi, H, Al-Jayyousi, GF, Shabana, H, Boutefnouchet, S, Eljazzar, S, and Ismail, S. Innovative nutrition education: a color-coded tool for individuals with low literacy level. Healthcare. (2022) 10:272. doi: 10.3390/healthcare10020272
33. Marconi, S, Durazzo, A, Camilli, E, Lisciani, S, Gabrielli, P, Aguzzi, A, et al. Food composition databases: considerations about complex food matrices. Foods. (2018) 7:2. doi: 10.3390/foods7010002
34. Banerjee, S, and Mondal, AC, editors. Nutrient food prediction through deep learning. 2021 Asian conference on innovation in technology (ASIANCON). (2021), 27–29
35. Awad, M, and Khanna, R. Machine learning in action: examples. Eff Learn Mach. (2015) 1–18. doi: 10.1007/978-1-4302-5990-9_1
36. Sarker, IH . Machine learning: algorithms, real-world applications and research directions. SN Comp Sci. (2021) 2. doi: 10.1007/s42979-021-00592-x
37. Centrone Stefani, M, and Humphries, DL. Exploring culture in the world of international nutrition and nutrition sciences. Adv Nutr. (2013) 4:536–8. doi: 10.3945/an.113.004218
38. Goldschmidt, J, Sankavaram, K, and Udahogora, M. Advancing cultural competencies: applying the dietary exchange list system to Jamaican foods. Health Sci J. (2018) 12:12. doi: 10.21767/1791-809X.1000563
39. Huang, L, Qin, J, Zhou, YI, Zhu, F, Liu, L, and Shao, L. Normalization Techiniques un training DNNs: methodology, analysis and application. IEEE Trans Pattern Anal Mach Intell. (2023) 45:10173–96. doi: 10.1109/TPAMI.2023.3250241
40. Russolillo-Femenías, G, Menal-Puey, S, Martínez, JA, and Marques-Lopes, I. A practical approach to the Management of Micronutrients and Other Nutrients of concern in food exchange lists for meal planning. J Acad Nutr Diet. (2018) 118:2029–41. doi: 10.1016/j.jand.2017.07.020
41. Li, M, and Shi, Z. Ultra-processed food consumption associated with incident hypertension among Chinese adults—results from China health and nutrition survey 1997–2015. Nutrients. (2022) 14:4783. doi: 10.3390/nu14224783
42. Yin, Y, and Yasuda, K. Similarity coefficient methods applied to the cell formation problem: a taxonomy and review. Int J Prod Econ. (2006) 101:329–52. doi: 10.1016/j.ijpe.2005.01.014
Keywords: artificial neural networks, food classification, exchangeable portion, nutrient composition, equivalent portion
Citation: Hernández-Hernández DJ, Perez-Lizaur AB, Palacios-González B and Morales-Luna G (2023) Machine learning accurately predicts food exchange list and the exchangeable portion. Front. Nutr. 10:1231873. doi: 10.3389/fnut.2023.1231873
Edited by:
Han Shi Jocelyn Chew, National University of Singapore, SingaporeReviewed by:
Azam Doustmohammadian, Iran University of Medical Sciences, IranM. Raihan, North Western University, Bangladesh
Copyright © 2023 Hernández-Hernández, Perez-Lizaur, Palacios-González and Morales-Luna. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Berenice Palacios-González, bpalacios@inmegen.gob.mx; Gesuri Morales-Luna, gesuri.morales@ibero.mx