Skip to main content

ORIGINAL RESEARCH article

Front. Nutr., 06 February 2024
Sec. Food Chemistry

Exploring the use of NIR and Raman spectroscopy for the prediction of quality traits in PDO cheeses

  • 1Department of Veterinary Science, University of Parma, Parma, Italy
  • 2Department of Food Chemistry and Technology, Teagasc Food Research Centre Moorepark, Fermoy, Ireland
  • 3Escola Universitària Salesiana de Sarrià, Barcelona, Spain
  • 4Consorzio del Formaggio Parmigiano Reggiano, Reggio Emilia, Italy
  • 5Department of Animal Medicine, Production and Health, University of Padova, Padova, Italy
  • 6Institute of Chemical Engineering, Bulgarian Academy of Sciences, Sofia, Bulgaria

The aims of this proof of principle study were to compare two different chemometric approaches using a Bayesian method, Partial Least Square (PLS) and PLS-discriminant analysis (DA), for the prediction of the chemical composition and texture properties of the Grana Padano (GP) and Parmigiano Reggiano (PR) PDO cheeses by using NIR and Raman spectra and quantify their ability to distinguish between the two PDO and among their ripening periods. For each dairy chain consortium, 9 cheese samples from 3 dairy industries were collected for a total of 18 cheese samples. Three seasoning times were chosen for each dairy industry: 12, 20, and 36 months for GP and 12, 24, and 36 months for PR. A portable NIR instrument (spectral range: 950–1,650 nm) was used on 3 selected spots on the paste of each cheese sample, for a total of 54 spectra collected. An Alpha300 R confocal Raman microscope was used to collect 10 individual spectra for each cheese sample in each spot for a total of 540 Raman spectra collected. After the detection of eventual outliers, the spectra were also concatenated together (NIR + Raman). All the cheese samples were assessed in terms of chemical composition and texture properties following the official reference methods. A Bayesian approach and PLS-DA were applied to the NIR, Raman, and fused spectra to predict the PDO type and seasoning time. The PLS-DA reached the best performances, with 100% correctly identified PDO type using Raman only. The fusion of the data improved the results in 60% of the cases with the Bayesian and of 40% with the PLS-DA approach. A Bayesian approach and a PLS procedure were applied to the NIR, Raman, and fused spectra to predict the chemical composition of the cheese samples and their texture properties. In this case, the best performance in validation was reached with the Bayesian method on Raman spectra for fat (R2VAL = 0.74). The fusion of the data was not always helpful in improving the prediction accuracy. Given the limitations associated with our sample set, future studies will expand the sample size and incorporate diverse PDO cheeses.

1 Introduction

European PDO (Protected Designation of Origin) cheeses are outstanding examples of historic inheritance, which is characterized by diversity and tradition in the world of dairy production. These cheeses, which include renowned types like Parmigiano Reggiano (PR), Grana Padano (GP), Roquefort, and Manchego, are praised for their flavors that are closely connected to their geographic origins. Behind the artisanal craftsmanship and centuries-old traditions of PDO cheese production lies a complex interplay of chemical and physical processes. Spectroscopic techniques have emerged as indispensable tools in the scientific field, providing invaluable insights into the composition, structure, and quality of these dairy products (1). Among these techniques, Near Infrared (NIR) and Raman spectroscopy are the most used in the food industry for the quantification of crucial cheese components such as moisture, fat, and protein content (24), facilitating precise quality control, and for the identification and quantification of specific compounds responsible for flavor and aroma (5). The top selling PDO cheeses in the world are the Italian PR and GP, with total export value exceeding $1.2 billion and $650 million USD, respectively (6). Despite their reputation in the global market, monitoring their quality is a complex task involving several challenges. Spectroscopic techniques, while valuable, face specific issues when applied to these cheeses, in part because obtaining representative samples from the cheese wheels is not always easy and because developing accurate calibration models for spectroscopic analysis requires extensive data collection and validation (7). Moreover, as both cheeses are vulnerable to counterfeiting and fraudulent replication, spectroscopic techniques used for authentication must be robust to detect subtle differences between authentic and imitation products.

When spectroscopy is used, the instrument is also an important factor, and its choice depends, apart from available resources, on the specific goals of the analysis, the cheese type, and the desired level of detail in terms of composition prediction. For example, NIR is well-suited for macronutrients, whereas Raman is more suitable for lipids, such as ester linkages in triglycerides and phosphodiester bonds in phospholipids. The presence of these functional groups results in unique Raman peaks, enabling the differentiation of the type of lipid (810). For these reasons, a combination of spectroscopic techniques has been proposed to improve the accuracy of composition trait predictions from cheese spectra (1113). The performance of the models can be evaluated in different ways: use of latent variables, cumulative variance, standard error of calibration, standard error of cross validation, coefficient of determination, similarity map and salience dimension of common space, limit of detection, limit of quantification, linearity, model fit and uncertainties. The spectroscopic instrument and the chemometric approach used (and related setup of parameters and features) for predicting cheese composition or other traits of interest can impact the prediction accuracy. For example, while chemometric methods like Partial Least Squares Regression (PLSR), Principal Component Analysis (PCA), or Linear Regression models are more commonly used for the prediction of food (including cheese) composition and other traits (14), Bayesian approaches are less common but have gained attention in recent years due to their ability to handle complex data structures and to provide deeper insights into uncertainty and variable selection (15). However, the choice of chemometric method often depends on several factors, including also the expertise of the researchers.

Therefore, the aims of this study were to compare two different chemometric approaches using a Bayesian method and PLSR and PLS- PLS-Discriminant Analysis (PLS-DA) (i) for the prediction of the chemical composition and texture properties of the GP and PR PDO cheeses by using NIR and Raman spectra, (ii) and for their ability to distinguish between the two PDO and among their ripening periods.

2 Materials and methods

2.1 Experimental design

A total of 18 cheese samples were collected from 6 dairy plants comprised within the consortia of GP and PR dairy chain. Three dairy plants belonged to GP and three to PR PDO chains. For each dairy, three seasoning times were selected, and were 12, 20, and 36 months for GP and 12, 24, and 36 months for PR.

2.2 Collection of the spectra

A portable NIR instrument (Alba GraiNit, Padova, Italy), working within the spectral range 950–1,650 nm, was used on 3 selected spots of each cheese samplealong the radius of the cheese wheel (as indicated in Figure 1), for a total of 54 spectra collected. The spectra (average of 27 spectra within PDO type) are plotted in Figure 2A for Grana Padano and (Figure 2B) for Parmigiano Reggiano.

Figure 1
www.frontiersin.org

Figure 1. Example of a cheese sample analyzed and related spots (1: paste near the crust; 2: middle sample paste; 3: core of the entirecheese wheel).

Figure 2
www.frontiersin.org

Figure 2. Absorbance spectra (the solid lines represent the average absorbance and the broken lines the mean ± 1 SD) of NIR portable instrument for Grana Padano (GP) (A); Parmigiano Reggiano (PR) (B).

Figure 3
www.frontiersin.org

Figure 3. Absorbance spectra (the solid lines represent the average absorbance and the broken lines the mean ± 1 SD) of the Raman instrument for Grana Padano (GP) (A); Parmigiano Reggiano (PR) (B).

An Alpha300 R confocal Raman microscope (WITec, Ulm, Germany) was used to acquire the Raman spectra using a 532 nm laser at 40 mW and a 10x/0.25 objective. The integration time was 1 s and the number of accumulations 20. In this case, 10 individual spectra were collected for each cheese sample in each location (3 spots) for a total of 540 Raman spectra collected. The spectra (average of 270 spectra within PDO type) are plotted in Figure 3A for Grana Padano and (Figure 3B) for Parmigiano Reggiano.

2.3 Composition analyses

The chemical composition (moisture, protein, lipids) was analyzed on all the cheese samples. Briefly, the cheese samples were grinded (Thermomix TM6, Vorwerk), and approximately 2 g of grinded cheese sample was weighed and used for moisture and fat analysis using the microwave moisture analyzer (Smart 5 Turbo) and Nuclear Magnetic Resonance (NMR) fat analyzer (CEM Corporation). The microwave moisture analyzer applies microwave radiation to the sample, causing the water molecules to heat up and evaporate. The moisture content is then determined by measuring the weight loss of the sample before and after the microwave treatment. For the determination of fat, the NMR can distinguish the signal produced by hydrogen protons present in fats from that produced by all other sources of protons in food matrices, such as carbohydrates and proteins. The instrument is validated according to the AOAC regulations [Peer-Verified Method 1:2004 according to (16)]. The protein content of cheese samples was determined by Kjeldahl method (17). The steps typically involve digestion of the sample with concentrated sulfuric acid, conversion of nitrogen to ammonium sulfate, and subsequent measurement of the ammonia produced. The protein content is then determined using the total nitrogen content.

2.4 Texture properties

Texture traits of all the cheese samples were determined using a Texture Analyzer (XT2i, Stable Micro Systems, Ltd., Godalming, Surrey, UK) with a Warner–Bratzler shear device [50 Newton (N) load cell; 2 mm/s crosshead speed]. For each cheese, 1 cylinder-shaped core sample was taken (1 cm2 cross sectional area; 3 cm long). Texture data were reported as hardness (defined as the maximum shear force, expressed in N), adhesiveness (describes the work needed to overcome attractive force between food and other surfaces, expressed in N/s), resilience (which referred to the degree to which the cheese regains its original shape during the biting process, expressed in %), cohesiveness (the tendency of cheese to remain together, and resist breaking into several pieces, during compression), springiness (a measure of ability that the deformed cheese returned to the initial position after the removal of the force, expressed in %), gumminess (the energy required to disintegrate a semi-solid food to a state ready for swallowing, expressed in N), chewiness (the work needed to masticate a solid food to a state ready for swallowing, expressed in N/s), respectively.

2.5 Chemometric analyses

2.5.1 Editing of the spectra, data fusion and Bayesian models

Before data fusion and spectra analysis, each instrument’s raw absorbance values of each wavelength of the spectra were centered and scaled to a null mean and a unit variance. Then, samples having a large spectral distance (i.e., Mahalanobis distance >3) were considered outliers and removed from the calibration dataset. After scaling, the NIR and Raman spectra were concatenated and stored as single matrix to be used for the spectra analysis. This is called low-level data fusion (18, 19). No other mathematical preprocessing was applied to the spectra. The matrix comprises m-rows (number of individual samples) and n columns (measurement variables from each source). In this study, fusion data comprised a total of 2,000 variables with the Raman and NIR instrument contributing for 1,600 and 400 wavelengths, respectively.

The calibration models built using the Bayesian approach (15) were developed by using the Bayesian Generalized Linear Regression [BGLR; (20)] package, available in R software (21). Each trait was regressed to a new pool of wavelengths using the following equation:

y = μ + j t x i j β j + e i
Where μ is the overall mean, xij are the NIR, Raman or fused spectra’s x axis, i is the sample (from 1 to 54), t is the number of wavelengths of the studied spectra (400 for NIR and 1,600 for Raman), and j the wavelengths (950 to 1,650 nm for NIR, and from −50 to 4,000 rel cm−1 for Raman), βj are the regression coefficients, and ei is the residual assumed to be independently and identically distributed with a normal distribution with mean equal to 0 and variance equal to s2e. The Bayes B model implemented in the package we used incorporates prior information about the model parameters and updates this information based on the observed data to estimate the effects of wavelengths on the phenotypes (15).

2.5.2 Editing of the spectra and PLSR models

PLSR models were performed with the software Unscrambler (Aspen Tech, MA, USA). For the classification of the seasoning and PDO type of each cheese sample, PLS-DA models have been performed by Solo, version 9.2.1 (Eigenvector Research, Inc., Wenatchee, WA, USA). The different parameters were obtained from the different reference methods and were used as reference values for the model.

For the PLSR and PLS-DA model construction, from the total 54 spectra, 36 were selected by the Kennard-Stone (21) algorithm and used as calibration samples and the following 18 were kept as validation set. By doing this, the final model will have the maximum spectral variability to reach the best prediction capability.

The spectral pretreatments used to construct the PLSR and PLS-DA models included standard normal variate (SNV) (22), used to correct baseline shifts and variations in intensity across spectra; Savitzky–Golay Derivatives with second-order polynomial fitting (1st D and 2nd D) (23), to reduce high-frequency noise in a signal due to its smoothing properties and to reduce low-frequency signal (e.g., due to offsets and slopes) using differentiation and the second-order derivatives to highlight spectral features; smoothing (moving average, MA) (24); linear baseline correction (BLC) (25) and orthogonal signal correction (OSC), used to remove unwanted variations in the X- data that are unrelated to interest response (Y) (26). The selected pretreatments allowed the reduction of multiplicative effects derived from the physical characteristics of the samples and allowed the enhancement of the differences between spectra that will allow the performance of the desired classifications and quantitative models. The MA and BLC were necessary only for Raman spectra, as MA was useful to reduce the noise, whereas BLC corrected the curvature caused by the fluorescence effect.

2.5.3 Cross-validation models

2.5.3.1 Bayesian approach

For the classification of the seasoning and PDO type of each cheese sample and the prediction of the chemical composition and texture traits, a random cross-validation was applied, in which 80% of the total records were randomly selected and used to build the equation (calibration set; CAL), and the remaining 20% of records were used to test the model (validation set; VAL). To account for sample variability, the procedure was repeated 10 times for the classification and 5 times for the quantification. The results were averaged over the replicates. The standard deviation (SD) across the replicates was also calculated. The coefficient of determination (R2VAL), the root mean squared error of validation (RMSEVAL) and relative error of prediction (RSEP%) were used to assess the models’ performances. As the Bayesian approach uses a linear regression model, it was necessary to establish a decision criterion for interpreting the predicted values. For the prediction of the PDO, the label were switched to numerical value (0 for GP and 1 for PR) and a critical threshold was set up at 0.5. All predicted value <0.5 are attributed to GP while values >0.5 are classified as PR. For the prediction of the seasoning time, the model used the numerical value in months and the predicted values were divided in 3 classes: “young” (<17 months), “mid” (17–28.5 months) and “old” (>28.5 months). The ranges were decided in order to distinguish the 3 seasoning times and allowing the comparison between PDO (as their seasoning time was not the same). The percentage of attribution of each class was also calculated to assess the model’s accuracy.

2.5.3.2 PLSR and PLS-DA approaches, and data fusion

Previously to any calibration or classification model an exploratory analysis was done to find the combination of pretreatments that provided better discrimination between PDO and ripening times classes.

For each instrument, prior to data fusion and spectra analysis, the raw absorbance values of each wavelength of the spectra were centered and scaled to a null mean and a unit variance. Then, samples having a large spectral distance (i.e., Mahalanobis distance >3) were considered outliers and removed from the calibration dataset. After scaling, the NIR and Raman spectra were concatenated and stored as a single spectra matrix that could be used for the spectra analysis. The matrix comprises m-rows (number of individual samples) and n columns (measurement variables from each source). Fusion data comprised a total of 2,000 variables, with the Raman and NIR instruments contributing for 1,600 and 400 wavelengths, respectively. Calibration models were constructed using the PLSR algorithm and internally validated by cross-validation (the leave-one-out method for PLSR and venetian blinds for PLS-DA). The coefficient of determination (R2VAL), the root mean squared error of validation (RMSEVAL) and relative error of prediction (RSEP%) were used to assess the models’ performances.

RSEP % = i = 1 n y i y ̂ i 2 i = 1 n y i 2 × 100
RMSEVAL = i = 1 n y i y ̂ i 2 n
Where yi is the reference value for validation set sample i, ŷi is the predicted value for validation set sample i, and n is the number of samples in the validation set.

The optimum number of latent variables was determined from a plot of the explained variance against the number of factors. Then, the initial model was refined by selecting those factors resulting in the lowest relative standard error for prediction (RSEP%), bias, and SD for the validation set.

For the classification, the percentage of attribution of each class was also calculated to assess the model’s accuracy.

Then, a permutation tests were conducted on the Bayesian and PLS models to assess their predictive performance and rule out potential overfitting to the spectral data’s inherent structure. The tests involved randomly shuffling the class labels (PDO and age) 500 times and refitting the models to each shuffled dataset. From this test we constructed a null distribution representing the range of outcomes expected under the assumption of no true relationships between the spectral data and the class labels. A key indicator of a model’s genuine predictive ability is its performance relative to this null distribution. If only a small percentage of the permuted datasets yields results comparable to or better than those of the original model, it suggests that the original model’s performance is unlikely to be attributable to chance associations or overfitting. Instead, it provides evidence that the model has successfully captured meaningful patterns in the spectral data that are genuinely predictive of the class labels (27). Since only 2% of the permutations yielded results comparable to or better than those of the original Bayesian models for all spectral matrices, and none of the permutations achieved results similar to the developed PLS model (data not shown), the results will not be further discussed, as the model actually predicted the class label (PDO or age) from the spectral data, without overfitting to a casual structure of the data.

3 Results and discussion

3.1 Prediction of seasoning and PDO type: Bayesian vs. PLS-DA across different type of spectra

Table 1 reports the descriptive statistics for composition traits and texture properties of the two types of PDO cheeses: PR and GP PDO cheeses. As regards to the chemical composition, the PR had a slightly higher moisture content (27.5%) compared to GP (26.7%), with similar CV (9 and 8%, respectively, for PR and GP). Moreover, PR contained more fat (33.7%) and less protein (34.3%) contents, compared to GP (30.1 and 32.4%, respectively, for fat and protein). Regarding texture properties, the hardness of both cheeses was similar, with PR having a mean hardness of 18.38 N and GP of 18.33 N. The CV values were relatively high, indicating some variability within each cheese type (15 and 19%, respectively) probably due to the wide range of seasoning of the analyzed samples. Adhesiveness refers to the ability of a cheese sample to stick or adhere to surfaces. Parmigiano Reggiano showed higher adhesiveness (−0.76 N/s) compared to GP (−0.47 N/s). Both had negative values, indicating that the cheese sample exhibited a low tendency to stick or adhere to surfaces. This can be a desirable characteristic for some types of cheese, like the hard ones. The CV was high (30 and 66%, respectively), which could be due to the different ripening times within each PDO cheese. Both cheeses exhibited similar resilience values (9.30 and 9.20%, respectively, for PR and GP) with higher CV for GP (9 and 14%, respectively). Cohesiveness quantifies the degree to which a cheese sample resists falling apart or fragmenting when it is bitten, chewed, or compressed. It provides insights into the cheese’s ability to maintain its integrity and internal structure during consumption, and from this point of view the two PDO had similar values (0.27 and 0.26, respectively, for PR and GP). As regards to springiness, this was higher in GP (54.8%) compared to PR (47.3%), with also higher CV (27 and 20%, respectively). Both cheeses had similar gumminess values (4.98 N and 4.87 N, respectively, for PR and GP), with higher CV for GP (23 and 28%, respectively). Grana Padano was also slightly chewier (2.82 N/s) than PR (2.45 N/s), with also higher CV (51 and 42%, respectively).

Table 1
www.frontiersin.org

Table 1. Descriptive statistics (Mean ± SD, and CV) of composition and texture properties of Parmigiano Reggiano and Grana Padano cheese samples.

3.2 Prediction of seasoning and PDO type: Bayesian vs. PLS-DA across different type of spectra

Tables 2, 3 show the numbers of correctly and wrongly identified samples for the seasoning and PDO type by using the Bayes B model (Table 2) and the PLS-DA model (Table 3) with NIR, Raman and fused (NIR + Raman) spectra, respectively. Regarding the seasoning and PDO identification (Table 2), the NIR technique performed relatively well with accuracy ranging from 75% (Young) to 90% (Mid) and with a correct % of identification of 69% for GP and 59% for PR. The Raman performed poorly on the seasoning time, with a correct identification of 12 and 17%, respectively, for young and old ripening times, but achieved better results for mid seasoning (83%). The exact reasons for the observed performances can be multifaceted. However, some potential factors contributing to the poor results for young and old ripening times, when using a Bayesian approach, could include the dataset size, as in a small dataset, the observed information may not be comprehensive enough to fully capture the complexity of the relationship between Raman spectra and the ripening times of cheese. For the PDO identification, Raman achieved 83% accuracy for GP and 78% for PR. Regarding the results from the data fusion, this is the first work the authors are aware of, presenting results about data fusion on the prediction of PDO type and seasoning time. The spectra fusion provided consistent results with accuracies from 64 to 77% for the seasoning time, and high accuracy for both GP and PR (89%). As regards to PLS-DA used for the seasoning identification (Table 3), NIR achieved relatively high accuracy, ranging from 66 to 88%. For the PDO identification, NIR achieved perfect accuracy (100%) for GP and high accuracy (88%) for PR. Raman exhibited moderate accuracy, ranging from 50 to 75% for the seasoning time, but it achieved 100% accuracy for both PDO cheeses. The fusion of the spectra provided good accuracy, from 66 to 83% for the identification of the seasoning time, and was excellent for GP (100%) and good (77%) for PR. Comparing the two chemometric approaches, the Bayes B Model achieved very good accuracy for some categories within seasoning and PDO type but overall, the PLS-DA models consistently outperformed the Bayes B in terms of accuracy across all the instruments (NIR, Raman, and fused spectra). In particular, PDO identification using the PLS-DA models showed very high accuracy, with GP being correctly identified at 100% in all three type of spectra. As reported by Silva et al. (28), the chemometric methods most commonly used for the differentiation between cheese samples by origin, were mainly linear discriminant analysis, principal component analysis and the PLS-DA methods. For example, Karoui et al. (29) reached a 91% classification rate to determine the geographic origin of Gruyère PDO and L’Etivaz PDO Swiss cheeses. Yet again, in the study by Ottavian et al. (30), the use of PLS-DA demonstrated its value in differentiating NIR spectra in relation to the production period of Asiago d’Allevo cheeses, yielding 100% correct classification. A result that is comparable to that attained with the direct assessment of chemical properties. On the contrary, the Bayesian methods have not yet been applied to the infrared spectra of dairy products for the authentication of cheeses or discrimination purposes. However, Li Vigni et al. (31) used a class-modeling applied to Raman spectra of Parmigiano Reggiano PDO cheese samples to discriminate it from other cheese types. Their preliminary results were promising, showing sensitivity and specificity of 100% for the test set. In the case of data fusion, comparing the results from the integration of the spectra from NIR and Raman and the individual techniques, it comes that, when using the Bayesian model, the fusion of the data increased the performances in the prediction of the early seasoning (<12 months) and the identification of the PDO type. For the mid and old seasoning, the data fusion did not improve (mid seasoning, 77% vs. 90 and 83%, respectively, for fused vs. NIR and Raman) or improved only partially (old seasoning, 17% for Raman) the correct % of identification (Table 2). When using PLS-DA, the data fusion had the best prediction accuracy for the mid and old seasoning, improved the prediction over Raman for the early seasoning, did not change for the GP, and reduced the correct % of identification for PR (Table 3). It is interesting to note that in some cases, the fusion was not helpful to the improvement of the prediction accuracy, and this could be due to the fact that NIR and Raman may capture similar information about the sample that can be redundant, which in turn does not provide additional information (32). Other studies applied NIR and Raman spectra fusion for food authentication in other matrices: Márquez et al. (33) tested two data fusion strategies (mid and high level) combined with a multivariate classification approach for the identification of adulteration of hazelnut paste with almond. Their sensitivity and specificity values were between 96–100% and 88–100% for the mid- and high-level data fusion strategies, respectively. Wang et al. (34) developed chemometric models using Vis–NIR and Raman spectral data fusion for the assessment of commercial infant formula storage temperature and time, with a RMSEVAL of 0.7 for the mid-level data fusion to predict storage time at both 20° and 37°C. Again, Bragolusi et al. (19) in their preliminary study on the differentiation of Italian and Greek extra virgin olive oil, they merged NIR and Raman spectra by low and mid-level fusion and submitted to PLS-DA analysis. In cross-validation the accuracy was 94% for the low-level and 97% for the mid-level fusion.

Table 2
www.frontiersin.org

Table 2. Identification of seasoning and PDO using the Bayes B Model with NIR, Raman, and Fused Spectra.

Table 3
www.frontiersin.org

Table 3. Identification of seasoning and PDO using the PLS-DA Models with NIR, Raman, and Fused Spectra.

3.3 Prediction of chemical composition and texture properties: Bayesian vs. PLSR for NIR

Tables 4, 5 report the prediction statistics of the composition and texture properties of cheese deriving from the Bayesian and PLSR procedures using NIR spectra from the cheese samples. As regards the Bayesian approach (Table 4), the R2CAL values for fat, protein, and moisture ranged from 0.11 (fat) to 0.50 (moisture), with quite high SD (from ±0.05 to ±0.13), suggesting substantial variability in the model’s performance. Indeed, the errors of calibration were also high (from 1.57 to 2.91% for RMSECAL and from 5.8 to 9.2% for RSEC%). Obviously, the results in the validation set were not accurate enough to be used at the dairy industry level, with R2VAL < 0.50 within composition traits and high RSEP% (from 8 to 9%). Results from the texture properties were even worse, especially if we consider that the R2CAL values were lower than the R2VAL values. This fact could be due to the randomness in the data split (35) and/or the small sample size and/or to overfitting due to the complexity of the model (36). It is worth mentioning that the effectiveness of a chosen approach (Bayesian of PLSR in this study) depends on the specific characteristics of the data and the nature of the trait to be predicted. In some cases, Bayesian modeling with informative priors can provide valuable insights even with small datasets, as well as PLSR may be a more practical and less computationally demanding option. Indeed, results from the PLSR procedure (Table 5) generally showed better results than the Bayesian approach. Overall, the R2CAL ranged from 0.37 (chewiness) to 0.89 (protein) with lower RSEC% (from 2.4 to 39%) compared to the Bayesian approach (RSEC% from 5.8 to 43%; Table 4). The R2VAL values were good for moisture (0.69) and fat (0.63), with generally higher values compared to the Bayesian model, although for some traits, the R2VAL was higher than the R2CAL (adhesiveness). Comparing the two chemometric approaches, the Bayes B model using NIR spectra appeared to have some limitations in predicting both the composition and texture properties of cheese. The model’s performance varied across different traits, but with generally low R2 values and relatively high errors. This suggests that the model may require further refinement or the inclusion of additional features to improve its predictive accuracy for these cheese properties. In contrast, the PLSR procedure using NIR spectra appeared to be more effective in predicting both the composition and texture traits of cheese. However, these results may, in part, be attributed to the inherent constraints associated with the small sample size, underscoring the importance of considering the potential impact of limited data in interpreting the predictive capabilities of the model. There is an extensive amount of literature on the application of PLSR and NIR spectroscopy to the prediction of cheese chemical composition. In diverse kinds of cheeses and curds, all predictions of fat and the majority of predictions of protein and moisture/dry matter can be deemed excellent [R2 > 0.90; (3739)]. NIR spectroscopy is, therefore highly suitable for predicting main chemical components like fat, protein, and moisture, and it has also been proved to be effective for monitoring the compliance of nutritional labels with EU tolerance limits of a wide varety of food products (40). However, it is not stable for predicting chemicals with small amounts in cheese products [e.g., minerals, fatty acids; (41)] since the amount of chemicals can alter the accuracy of prediction. NIR has a great degree of spectrum stability, which has made it quite popular at the industry level (42). However, literature related to NIR spectroscopy for cheese quality evaluation has become less in recent years (43) because applications for evaluating other attributes are inadequate, and innovative chemometric methods are not extensively used in research.

Table 4
www.frontiersin.org

Table 4. Prediction statistics (Mean ± SD of the 5 replicates) of composition and texture properties of cheese deriving from the Bayesian model Cross-Validation procedure using NIR spectra from paste cheese samples.

Table 5
www.frontiersin.org

Table 5. Prediction statistics of composition and texture properties of cheese deriving from the PLS regression procedure using NIR spectra from paste cheese samples.

3.4 Prediction of chemical composition and texture properties: Bayesian vs. PLSR for Raman

Tables 6, 7 report the prediction statistics of composition and texture properties of cheese deriving from the Bayesian and PLSR procedures, respectively, using Raman spectra from the cheese samples. As regards the Bayesian approach (Table 6), the R2CAL values for composition traits ranged from 0.28 (protein) to 0.72 (fat) and were higher compared to the R2CAL values obtained with the same procedure using NIR spectra (Table 4). The errors of calibration were still high (from 1.58 to 2.58% for RMSECAL and from 5.8 to 7.7% for RSEC%), but in the case of fat, the error was-3% lower than RSEC% of fat predicted with the same procedure applied to the NIR spectra. The results in the validation set were good, with R2VAL ranging from 0.46 (moisture) to 0.74 (fat) within composition traits, although RSEP% were still high (from 5 to 9%). Fat percentage was the trait predicted at best with the highest R2VAL (0.72) and the lowest RSEP% (5.3%). Indeed, it is acknowledged that fat molecules, which are primarily composed of carbon-hydrogen (C-H) bonds and carbon–carbon (C-C) bonds, exhibit characteristic vibrational modes that are readily detected by Raman spectroscopy. The vibrational frequencies of these bonds are unique to fats, providing a distinct Raman signature (44, 45). On the contrary, fitting statistics for the texture properties were poor, with R2CAL values ranging from 0.19 to 0.43 and high calibration errors. Consequently, results in validation were not satisfactory, with R2VAL values from 0.08 to 0.45. As aforementioned, this could be due to the randomness in the data split, the small sample size, and/or to overfitting due to the complexity of the model (35, 36). Moreover, compared to the same approach applied to the NIR spectra (Table 4), texture properties were worse predicted using the Raman spectra.

Table 6
www.frontiersin.org

Table 6. Prediction statistics (Mean ± SD of the 5 replicates) of composition and texture properties of cheese deriving from the Cross-Validation procedure using Raman spectra from paste cheese samples.

Table 7
www.frontiersin.org

Table 7. Prediction statistics of composition and texture properties of cheese deriving from the PLS regression procedure using Raman spectra from paste cheese samples.

Results from the PLSR procedure (Table 7) generally showed better results than the Bayesian approach. The R2CAL ranged from 0.59 (protein) to 0.93 (moisture) with lower RSEC% (from 1.1 to 4.8%) compared to the Bayesian approach, as well as for texture properties, whose R2CAL ranged from 0.64 to 0.99. Although the calibration statistics were satisfactory, the R2VAL values were > 0.50, only for half of the traits examined. Compared to the fitting statistics using the same approach applied to the NIR spectra, the use of Raman spectra showed better results, both in terms of composition and texture properties and in general, also compared to the use of NIR and Raman spectra with the Bayesian models. Raman spectroscopy is less used than NIR spectroscopy, but it has gradually become a new analytical method for the structure identification of substances (46). In the dairy cheese industry field, Raman spectra were used to discriminate adulterated spreadable cheese with 100% accuracy and 95% confidence and to predict starch content with R2VAL of 0.98 (12). Recently, Zhang et al. (47) combined Raman spectroscopy with a machine learning model for cheese product identification, with an average identification accuracy rate of 98%.

3.5 Prediction of chemical composition and texture properties: Bayesian vs. PLSR for fused spectra

There are no studies in the literature on integrating spectrum signals derived from various spectroscopic techniques or comparing different chemometric procedures at the dairy sector level. However, data fusion is a developing practice since collecting spectral data is becoming easier. The primary goal is to optimize the information gathered in order to take advantage of the synergies between the various types of information provided by diverse approaches (48). Data fusion can be done using low, mid, or high-level fusion techniques. Low-level data fusion involves the direct fusion (e.g., simple concatenation) of raw data from several sources. Tables 8, 9 report the prediction statistics of composition and texture properties of cheese deriving from the Bayesian and PLSR procedures, respectively, using the fused spectra (NIR + Raman) from the cheese samples. As also proved by the results above, different spectroscopic techniques can provide complementary information about a sample (e.g., chemical composition, structure). Therefore, integrating data sources from different spectroscopic techniques can provide a more comprehensive understanding of the cheese quality, and also increase the robustness of a prediction model (33, 49). As regards the Bayesian approach (Table 8), the R2CAL values for composition traits ranged from 0.59 (protein) to 0.93 (moisture) and were much higher compared to the R2CAL values obtained with the same procedure using NIR and Raman spectra (Tables 4, 6). The R2VAL of composition traits improved with respect to the NIR prediction (0.41, 0.30, and 0.53 of fused vs. 0.12, 0.25, and 0.47 of NIR, respectively, for fat, protein, and moisture) and with respect to the moisture content predicted by using Raman spectra (R2VAL = 0.46). Results in calibration for texture properties were also much more improved (R2CAL ranging from 0.18 to 0.49) with also lower errors (RSEC% from 10 to 41%) compared to NIR spectra, whereas if compared to the Raman, the R2CAL values were lower, with similar errors. In validation, some texture traits benefited from the fusion of the spectra from the two instruments (e.g., hardness and resilience), whereas others were better predicted with NIR (e.g., adhesiveness, cohesiveness, springiness) or Raman (e.g., chewiness, gumminess) only. So, the fusion of the two techniques (NIR and Raman) with the Bayesian model tended to have moderate calibration and prediction performance across most attributes, with some variation in the quality of results. This might be the result of the complementing information that NIR and Raman provide regarding the structure and composition of the cheese, as well as the possibility that the performance of the model is impacted by the variability in the cheese samples and the quality of the spectroscopic data. However, Raman spectra generally outperformed both fused and NIR spectra, with higher R2CAL and R2VAL values and lower RMSEVAL and RSEP% for several attributes. The NIR spectra exhibited the lowest calibration and prediction performance compared to the other two types of spectra.

Table 8
www.frontiersin.org

Table 8. Prediction statistics (Mean ± SD of the 5 replicates) of composition and texture properties of cheese deriving from the Cross-Validation procedure using fused NIR and Raman spectra from paste cheese samples.

Table 9
www.frontiersin.org

Table 9. Prediction statistics of composition and texture properties of cheese deriving from the PLS procedure using Fused Spectra from paste cheese samples.

Results from the PLSR procedure (Table 9) generally showed good results but worse results compared to the Bayesian approach using the fused spectra. The R2CAL values for composition traits ranged from 0.54 (moisture) to 0.86 (protein), although the RSEC% were better (from 3 to 5%) compared to the Bayesian approach. In validation, the accuracy of the PLSR was still worse than the Bayesian (R2VAL from 0.15 to 0.42). The model for texture properties was better in calibration (R2CAL from 0.19 to 0.77) than the Bayesian but with scanty predictive accuracy (R2VAL from 0.02 to 0.23). Comparing the three techniques using the PLSR method, the results from the Raman spectra generally exhibited higher R2CAL and R2VAL values compared to fused and NIR spectra, indicating better calibration and predictive ability for most attributes. Raman spectra also had lower RSEC% and RSEP% values, suggesting better precision and accuracy in calibration and prediction. The NIR spectra showed moderate performance, with R2CAL and R2VAL values falling between those of Raman and fused spectra. For some attributes (e.g., fat, protein, moisture), NIR spectra outperform fused spectra regarding R2 and R2VAL values. Adhesiveness, chewiness, and cohesiveness had relatively low R2CAL and R2VAL values in all three type of data, suggesting that they might be challenging to predict using these spectroscopic/chemometric techniques. In the scientific literature, the highest number of publications incorporating data fusion techniques is alcoholic beverages (27%) [(e.g., 18)] followed by fruit and vegetables (17%) [(e.g., 50)] and oils (13%) [(e.g., 19)]. Milk and dairy products cover the smallest percentage (only 5%) (49), therefore, it is difficult to compare our results with the current literature. However, it is important to mention that simply combining data from various instruments (in this case, NIR and Raman) does not automatically improve prediction accuracy. For example, data fusion may not be advantageous if the single instruments produce extremely comparable information or if one of them introduces excessive noise. It is critical to examine the uniqueness of the information provided by each instrument and carefully consider whether the benefits of fusion outweigh the obstacles and potential drawbacks, such as redundancy and noise amplification (51, 52). This consideration becomes particularly pertinent in the context of a small sample size, where the potential benefits of data fusion must be carefully interpreted.

So, the most effective method was the PLS-DA approach applied to NIR for classifying seasoning time, and to Raman spectra for classifying PDO type. The NIR, coupled with the PLS technique, was the best one for chemical traits. With the exception of cohesiveness (Raman) and resilience (NIR/Raman) with a RSEP% lower than 10%. On the other hand, no chemometric strategy nor spectroscopic technique for predicting texture achieved good prediction levels to be employed in the dairy industry.

4 Conclusion

This proof of principle study provides new insights into the application of chemometric approaches for predicting the characteristics of GP and PR PDO cheeses. Specifically, it focused on the use of NIR and Raman spectra and their integration to achieve these predictions, as well as the potential to distinguish between the two PDO and their various ripening stages. As regards the classification models, the PLS-DA achieved the greatest results, correctly identifying the PDO type at 100%. The findings were enhanced by the data fusion in 60% of the cases using the Bayesian approach and 40% using the PLS-DA approach. As regards the prediction of chemical composition and texture traits, the Bayesian technique using Raman spectra for fat provided the greatest performance in validation. It is important to highlight that the accuracy of predictions did not consistently improve with data fusion. This suggests that the effectiveness of data fusion may vary depending on the specific analysis and the methods employed. It may be necessary to carefully consider when and how to apply data fusion in such studies. Mathematical spectra treatment before and after fusion may enhance prediction accuracy, especially when dealing with inherently distinct techniques. Moreover, larger dataset of high quality data is needed to improve the statistical power of the analysis. This can also help to validate the models and reduce the risk of overfitting.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.

Author contributions

GS: Data curation, Investigation, Supervision, Writing – original draft, Writing – review & editing. LG-M: Formal analysis, Investigation, Resources, Writing – original draft. GD: Formal analysis, Writing – original draft. JS: Formal analysis, Funding acquisition, Writing – original draft. AM: Formal analysis, Investigation, Writing – original draft. VP: Resources, Writing – original draft. PB: Resources, Writing – original draft. GG: Formal analysis, Writing – original draft. CC-G: Conceptualization, Project administration, Writing – original draft.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was funded by E-COST-GRANT-CA19145-e90f5549 of the Short-Term Scientific Mission included within the WG2 of the CA19145 COST Action: Innovation related to the integration of several NDSS signals for critical issues in food integrity.

Acknowledgments

The study is part of the Short-Term Scientific Mission included within the WG2 of the CA19145 COST Action: Innovation related to the integration of several NDSS signals for critical issues in food integrity. The authors would like to thank the Parmigiano Reggiano and Grana Padano PDO Consortia for their provision of the cheese samples used in this study. For the Raman spectra acquisition, the research was conducted using resources of the National Food Imaging Centre (NFIC) at Teagasc.

Conflict of interest

PB co-author of this manuscript, is also administrator of GraiNit srl, which provided the portable NIR instrument.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Karoui, R, Mazerolles, G, and Dufour, E. Spectroscopic techniques coupled with chemometric tools for structure and texture determinations in dairy products. Int Dairy J. (2003) 13:607–20. doi: 10.1016/S0958-6946(03)00076-1

Crossref Full Text | Google Scholar

2. Bittante, G, Patel, N, Cecchinato, A, and Berzaghi, P. Invited review: a comprehensive review of visible and near-infrared spectroscopy for predicting the chemical composition of cheese. J Dairy Sci. (2022) 105:1817–36. doi: 10.3168/jds.2021-20640

PubMed Abstract | Crossref Full Text | Google Scholar

3. Čurda, L, and Kukačková, O. NIR spectroscopy: a useful tool for rapid monitoring of processed cheeses manufacture. J Food Eng. (2004) 61:557–60. doi: 10.1016/S0260-8774(03)00215-2

Crossref Full Text | Google Scholar

4. Silveira, L, Motta, E, Zângaro, R, Pacheco, M, Lima, C, and Moreira, L. Characterization of nutritional parameters in bovine milk by Raman spectroscopy with least squares modeling. Instrum Sci Technol. (2016) 44:85–97. doi: 10.1080/10739149.2015.1055578

Crossref Full Text | Google Scholar

5. González-Martín, M, Vivar-Quintana, A, Revilla, I, and Salvador-Esteban, J. The determination of fatty acids in cheeses of variable composition (cow, ewe's, and goat) by means of near infrared spectroscopy. Microchem J. (2020) 156:104854. doi: 10.1016/j.microc.2020.104854

Crossref Full Text | Google Scholar

6. CLAL. (2022). Available at: https://www.clal.it/en/?section=prod_export (Accessed October 19, 2023).

Google Scholar

7. Filho, P, and Volery, P. Broad-based versus specific NIRS calibration: determination of total solids in fresh cheese. Anal Chim Acta. (2005) 544:82–8. doi: 10.1016/J.ACA.2005.03.022

Crossref Full Text | Google Scholar

8. Baeten, V. Raman spectroscopy in lipid analysis. Lipid Technol. (2010) 22:36–8. doi: 10.1002/lite.200900082

Crossref Full Text | Google Scholar

9. Beć, KB, Grabska, J, and Huck, CW. Miniaturized NIR spectroscopy in food analysis and quality control: promises, challenges, and perspectives. Foods. (2022) 11:1465. doi: 10.3390/foods11101465

PubMed Abstract | Crossref Full Text | Google Scholar

10. Gómez-Mascaraque, LG, Kilcawley, K, Hennessy, D, Tobin, JT, and O'Callaghan, TF. Raman spectroscopy: a rapid method to assess the effects of pasture feeding on the nutritional quality of butter. J Dairy Sci. (2020) 103:8721–31. doi: 10.3168/jds.2020-18716

Crossref Full Text | Google Scholar

11. Boubellouta, T, Karoui, R, Lebecque, A, and Dufour, E. Utilisation of attenuated total reflectance MIR and front-face fluorescence spectroscopies for the identification of saint-Nectaire cheeses varying by manufacturing conditions. Eur Food Res Technol. (2010) 231:873–82. doi: 10.1007/S00217-010-1336-1

Crossref Full Text | Google Scholar

12. de Sá Oliveira, K, de Souza Callegaro, L, Stephani, R, Almeida, MR, and de Oliveira, LFC. Analysis of spreadable cheese by Raman spectroscopy and chemometric tools. Food Chem. (2016) 194:441–6. doi: 10.1016/j.foodchem.2015.08.039

PubMed Abstract | Crossref Full Text | Google Scholar

13. Rodriguez-Saona, L, Koca, N, Harper, W, and Alvárez, V. Rapid determination of Swiss cheese composition by Fourier transform infrared/attenuated total reflectance spectroscopy. J Dairy Sci. (2006) 89:1407–12. doi: 10.3168/JDS.S0022-0302(06)72209-3

PubMed Abstract | Crossref Full Text | Google Scholar

14. Wold, S, Sjöström, M, and Eriksson, L. PLS-regression: a basic tool of chemometrics. Chemom Intell Lab Syst. (2001) 58:109–30. doi: 10.1016/S0169-7439(01)00155-1

Crossref Full Text | Google Scholar

15. Ferragina, A, de Los Campos, G, Vazquez, AI, Cecchinato, A, and Bittante, G. Bayesian regression models outperform partial least squares methods for predicting milk components and technological properties using infrared spectral data. J Dairy Sci. (2015) 98:8133–51. doi: 10.3168/jds.2014-9143

PubMed Abstract | Crossref Full Text | Google Scholar

16. Cartwright, G, McManus, BH, Leffler, TP, and Moser, CR. Rapid determination of moisture/solids and fat in dairy products by microwave and nuclear magnetic resonance analysis: PVM 1:2004. J AOAC Int. (2005) 88:107–20. doi: 10.1093/jaoac/88.1.107

PubMed Abstract | Crossref Full Text | Google Scholar

17. AOAC ed. Association of Official Analytical Chemists. 16th ed. Arlington, VA: AOAC INTERNATIONAL (1995).

Google Scholar

18. Biancolillo, A, Bucci, R, Magrì, AL, Magrì, AD, and Marini, F. Data-fusion for multiplatform characterization of an Italian craft beer aimed at its authentication. Anal Chim Acta. (2014) 820:23–31. doi: 10.1016/j.aca.2014.02.024

PubMed Abstract | Crossref Full Text | Google Scholar

19. Bragolusi, M, Massaro, A, Zacometti, C, Tata, A, and Piro, R. Geographical identification of Italian extra virgin olive oil by the combination of near infrared and Raman spectroscopy: a feasibility study. J Near Infrared Spectrosc. (2021) 29:359–65. doi: 10.1177/09670335211051575

Crossref Full Text | Google Scholar

20. Pérez, P, and de Los Campos, G. Genome-wide regression and prediction with the BGLR statistical package. Genetics. (2014) 198:483–95. doi: 10.1534/genetics.114.164442

PubMed Abstract | Crossref Full Text | Google Scholar

21. Kennard, RW, and Stone, LA. Computer aided design of experiments. Technometrics. (1969) 11:137–48. doi: 10.1080/00401706.1969.10490666

Crossref Full Text | Google Scholar

22. Barnes, RJ, Dhanoa, MS, and Lister, SJ. Standard normal variate transformation and de-trending of near-infrared diffuse reflectance spectra. Appl Spectrosc. (1989) 43:772–7. doi: 10.1366/0003702894202201

Crossref Full Text | Google Scholar

23. Savitzky, A, and Golay, MJ. Smoothing and differentiation of data by simplified least squares procedures. Anal Chem. (1964) 36:1627–39. doi: 10.1021/ac60214a047

Crossref Full Text | Google Scholar

24. Hibbert, DB. Vocabulary of concepts and terms in chemometrics (IUPAC recommendations 2016). Pure Appl Chem. (2016) 88:407–43. doi: 10.1515/pac-2015-0605

Crossref Full Text | Google Scholar

25. Burke, EA. Raman microspectrometry of fluid inclusions. Lithos. (2001) 55:139–58. doi: 10.1016/S0024-4937(00)00043-8

Crossref Full Text | Google Scholar

26. Fearn, T. On orthogonal signal correction. Chemom Intell Lab Syst. (2000) 50:47–52. doi: 10.1016/S0169-7439(99)00045-3

Crossref Full Text | Google Scholar

27. Alewijn, M, van der Voet, H, and van Ruth, S. Validation of multivariate classification methods using analytical fingerprints–concept and case study on organic feed for laying hens. J Food Compos Anal. (2016) 51:15–23. doi: 10.1016/j.jfca.2016.06.003

Crossref Full Text | Google Scholar

28. da Silva Medeiros, ML, Cruz-Tirado, JP, Lima, AF, de Souza Netto, JM, Ribeiro, APB, Bassegio, D, et al. Assessment oil composition and species discrimination of brassicas seeds based on hyperspectral imaging and portable near infrared (NIR) spectroscopy tools and chemometrics. J Food Compos Anal. (2022) 107:104403. doi: 10.1016/j.jfca.2022.104403

Crossref Full Text | Google Scholar

29. Karoui, R, Mazerolles, G, Bosset, JO, De Baerdemaeker, J, and Dufour, E. Utilisation of mid-infrared spectroscopy for determination of the geographic origin of Gruyère PDO and L’Etivaz PDO Swiss cheeses. Food Chem. (2007) 105:847–54. doi: 10.1016/j.foodchem.2007.01.051

Crossref Full Text | Google Scholar

30. Ottavian, M, Facco, P, Fasolato, L, and Barolo, M. Multispectral data classification using similarity factors. Chemom Intell Lab Syst. (2012) 118:13–23. doi: 10.1016/j.chemolab.2012.07.008

Crossref Full Text | Google Scholar

31. Li Vigni, M, Durante, C, Michelini, S, Nocetti, M, and Cocchi, M. Preliminary assessment of Parmigiano Reggiano authenticity by handheld Raman spectroscopy. Foods. (2020) 9:1563. doi: 10.3390/foods9111563

PubMed Abstract | Crossref Full Text | Google Scholar

32. Wang, Z, Liu, J, Zeng, C, Bao, C, Li, Z, Zhang, D, et al. Rapid detection of protein content in rice based on Raman and near-infrared spectroscopy fusion strategy combined with characteristic wavelength selection. Infrared Phys Technol. (2023) 129:104563. doi: 10.1016/j.infrared.2023.104563

Crossref Full Text | Google Scholar

33. Márquez, C, López, MI, Ruisánchez, I, and Callao, MP. FT-Raman and NIR spectroscopy data fusion strategy for multivariate qualitative analysis of food fraud. Talanta. (2016) 161:80–6. doi: 10.1016/j.talanta.2016.08.003

PubMed Abstract | Crossref Full Text | Google Scholar

34. Wang, X, Esquerre, C, Downey, G, Henihan, L, O'Callaghan, D, and O'Donnell, C. Development of chemometric models using Vis-NIR and Raman spectral data fusion for assessment of infant formula storage temperature and time. Innovative Food Sci Emerg Technol. (2021) 67:102551. doi: 10.1016/j.ifset.2020.102551

Crossref Full Text | Google Scholar

35. Wang, Q, and Bovenhuis, H. Validation strategy can result in an overoptimistic view of the ability of milk infrared spectra to predict methane emission of dairy cattle. J Dairy Sci. (2019) 102:6288–95. doi: 10.3168/jds.2018-15684

PubMed Abstract | Crossref Full Text | Google Scholar

36. Grelet, C, Dardenne, P, Soyeurt, H, Fernandez, JA, Vanlierde, A, Stevens, F, et al. Large-scale phenotyping in dairy sector using milk MIR spectra: key factors affecting the quality of predictions. Methods. (2021) 186:97–111. doi: 10.1016/j.ymeth.2020.07.012

PubMed Abstract | Crossref Full Text | Google Scholar

37. González-Martín, I, González-Pérez, C, Hernández-Hierro, JM, and González-Cabrera, JM. Use of NIRS technology with a remote reflectance fibre-optic probe for predicting major components in cheese. Talanta. (2008) 75:351–5. doi: 10.1016/j.talanta.2007.11.028

Crossref Full Text | Google Scholar

38. Karoui, R, Mouazen, AM, Dufour, É, Pillonel, L, Schaller, E, De Baerdemaeker, J, et al. Chemical characterisation of European Emmental cheeses by near infrared spectroscopy using chemometric tools. Int Dairy J. (2006) 16:1211–7. doi: 10.1016/j.idairyj.2005.10.002

Crossref Full Text | Google Scholar

39. Oca, ML, Ortiz, MC, Sarabia, LA, Gredilla, AE, and Delgado, D. Prediction of Zamorano cheese quality by near-infrared spectroscopy assessing false non-compliance and false compliance at minimum permitted limits stated by designation of origin regulations. Talanta. (2012) 99:558–65. doi: 10.1016/j.talanta.2012.06.035

PubMed Abstract | Crossref Full Text | Google Scholar

40. Bragolusi, M, Tata, A, Massaro, A, Zacometti, C, and Piro, R. Nutritional labelling of food products purchased from online retail outlets: screening of compliance with European Union tolerance limits by near infrared spectroscopy. J Near Infrared Spectrosc. (2023) 31:89–99. doi: 10.1177/09670335231156470

Crossref Full Text | Google Scholar

41. Manuelian, CL, Ghetti, M, De Lorenzi, C, Pozza, M, Franzoi, M, and De Marchi, M. Feasibility of pocket-sized near-infrared spectrometer for the prediction of cheese quality traits. J Food Compos Anal. (2022) 105:104245. doi: 10.1016/j.jfca.2021.104245

Crossref Full Text | Google Scholar

42. Cen, H, and He, Y. Theory and application of near infrared reflectance spectroscopy in determination of food quality. Trends Food Sci Technol. (2007) 18:72–83. doi: 10.1016/j.tifs.2006.09.003

Crossref Full Text | Google Scholar

43. Lei, T, and Sun, DW. Developments of nondestructive techniques for evaluating quality attributes of cheeses: a review. Trends Food Sci Technol. (2019) 88:527–42. doi: 10.1016/j.tifs.2019.04.013

Crossref Full Text | Google Scholar

44. Czamara, K, Majzner, K, Pacia, MZ, Kochan, K, Kaczor, A, and Baranska, M. Raman spectroscopy of lipids: a review. J Raman Spectrosc. (2015) 46:4–20. doi: 10.1002/jrs.4607

Crossref Full Text | Google Scholar

45. Xu, FX, Rathbone, EG, and Fu, D. Simultaneous dual-band hyperspectral stimulated Raman scattering microscopy with femtosecond optical parametric oscillators. J Phys Chem B. (2023) 127:2187–97. doi: 10.1021/acs.jpcb.2c09105

PubMed Abstract | Crossref Full Text | Google Scholar

46. Xu, Y, Zhong, P, Jiang, A, Shen, X, Li, X, Xu, Z, et al. Raman spectroscopy coupled with chemometrics for food authentication: a review. TrAC Trends Anal Chem. (2020) 131:116017. doi: 10.1016/j.trac.2020.116017

Crossref Full Text | Google Scholar

47. Zhang, ZY, Jiang, MQ, and Xiong, HM. Optimized identification of cheese products based on Raman spectroscopy and an extreme learning machine. New J Chem. (2023) 47:6889–94. doi: 10.1039/D2NJ06350F

Crossref Full Text | Google Scholar

48. Borràs, E, Ferré, J, Boqué, R, Mestres, M, Aceña, L, and Busto, O. Data fusion methodologies for food and beverage authentication and quality assessment–a review. Anal Chim Acta. (2015) 891:1–14. doi: 10.1016/j.aca.2015.04.042

PubMed Abstract | Crossref Full Text | Google Scholar

49. Hayes, E, Greene, D, O’Donnell, C, O’Shea, N, and Fenelon, MA. Spectroscopic technologies and data fusion: applications for the dairy industry. Front Nutr. (2023) 9:1074688. doi: 10.3389/fnut.2022.1074688

PubMed Abstract | Crossref Full Text | Google Scholar

50. Galvan, D, Tanamati, AAC, Casanova, F, Danieli, E, Bona, E, and Killner, MHM. Compact low-field NMR spectroscopy and chemometrics applied to the analysis of edible oils. Food Chem. (2021) 365:130476. doi: 10.1016/j.foodchem.2021.130476

PubMed Abstract | Crossref Full Text | Google Scholar

51. Biancolillo, A. Chapter 10 – data fusion strategies in food analysis In: M Cocchi, editor. Data handling in science and technology. Amsterdam: Elsevier (2019). 271–310.

Google Scholar

52. Castanedo, F. A review of data fusion techniques. Sci World J. (2013) 2013:1–19. doi: 10.1155/2013/704504

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: Parmigiano Reggiano, Grana Padano, NIR, Raman, Bayesian methods, Partial Least Squares, data fusion

Citation: Stocco G, Gómez-Mascaraque LG, Deshwal GK, Sanchez JC, Molle A, Pizzamiglio V, Berzaghi P, Gergov G and Cipolat-Gotet C (2024) Exploring the use of NIR and Raman spectroscopy for the prediction of quality traits in PDO cheeses. Front. Nutr. 11:1327301. doi: 10.3389/fnut.2024.1327301

Received: 24 October 2023; Accepted: 04 January 2024;
Published: 06 February 2024.

Edited by:

Francesco Siano, National Research Council (CNR), Italy

Reviewed by:

Alessandra Tata, Experimental Zooprophylactic Institute of the Venezie (IZSVe), Italy
Kusumiyati Kusumiyati, Padjadjaran University, Indonesia

Copyright © 2024 Stocco, Gómez-Mascaraque, Deshwal, Sanchez, Molle, Pizzamiglio, Berzaghi, Gergov and Cipolat-Gotet. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Giorgia Stocco, Z2lvcmdpYS5zdG9jY29AdW5pcHIuaXQ=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.