- 1Department of Agronomy, Food, Natural Resources, Animals and Environment, University of Padova, Padua, Italy
- 2Department of Genetics and Biostatistics, National Autonomous University of Mexico, Mexico City, Mexico
- 3Italian Brown Breeders Association, Bussolengo, Italy
- 4Department of Animal Science, North Carolina State University, Raleigh, NC, United States
The objective of this study was to evaluate the contribution of Fourier-transformed infrared spectroscopy (FTIR) data for dairy cattle breeding through two different approaches: (i) estimating the genetic parameters for 30 measured milk traits and their FTIR predictions and investigating the additive genetic correlation between them and (ii) evaluating the effectiveness of FTIR-derived phenotyping to replicate a candidate bull’s progeny testing or breeding value prediction at birth. Records were available from 1,123 cows phenotyped using gold standard laboratory methodologies (LAB data). This included phenotypes related to fine milk composition and milk technological characteristics, milk acidity, and milk protein fractions. The dataset used to generate FTIR predictions comprised 729,202 test-day records from 51,059 Brown Swiss cows (FIELD data). A first approach consisted of estimating genetic parameters for phenotypes available from LAB and FIELD datasets. To do so, a set of bivariate animal models were run, and genetic correlations between LAB and FIELD phenotypes were estimated using FIELD information obtained at the population level. Heritability estimates were generally higher for FIELD predictions than for the corresponding LAB measures. The additive genetic correlations (ra) between LAB and FIELD phenotypes had different magnitudes across traits but were generally strong. Overall, these results demonstrated the potential of using FIELD information as indicator traits for the indirect genetic improvement of LAB measures. In the second approach, we included genotype information for 1,011 cows from the LAB dataset, 1,493 cows from the FIELD dataset, 181 sires with daughters in both LAB and FIELD datasets, and 540 sires with daughters in the FIELD dataset only. Predictions were obtained using the single-step GBLUP method. A four fold cross-validation was used to assess the predictive ability of the different models, assessed as the ability to predict masked LAB records from daughters of progeny testing bulls. The correlation between observed and predicted LAB measures in validation was averaged over the four training-validation sets. Different sets of phenotypic information were used sequentially in cross-validation schemes: (i) LAB cows from the training set; (ii) FIELD cows from the training set; and (iii) FIELD cows from the validation set. Models that included FIELD records showed an improvement for the majority of traits. This study suggests that breeding programs for difficult-to-measure traits could be implemented using FTIR information. While these programs should use progeny testing, acceptable values of accuracy can be achieved also for bulls without phenotyped progeny. Robust calibration equations are, deemed as essential.
Introduction
In the omics- era, an emerging field of research is represented by phenomics, which is the study of phenotypes on a genome-wide scale (Bilder et al., 2009; Houle et al., 2010). In animal breeding, the advance in high-throughput genomics has increased the need for simple, fast, accurate, and high-throughput phenotyping technologies. Fourier-transformed infrared spectroscopy (FTIR), including part of near- and mid-infrared (NIR and MIR) electromagnetic radiations, is a versatile and cost-effective analytical tool to collect individual data for monitoring traditional and novel milk traits in dairy cattle (Boichard and Brochard, 2012). For many years, milk composition traits such as fat and protein content, as well as lactose, urea, and casein content, have been routinely estimated by FTIR spectroscopy (Barbano and Clark, 1989). More recently, infrared technology has also been proposed as an alternative method for the quantification of difficult- or expensive-to-measure milk phenotypes including protein fractions, fatty acids, and minerals as well as milk coagulation properties (MCP), cheese yield, and curd nutrient recoveries (Soyeurt et al., 2006a, b, 2011; Ferragina et al., 2013; Cecchinato et al., 2015; Sanchez et al., 2018). In addition, FTIR data has been shown to be a potentially valuable tool for predicting health and reproductive phenotypes (Belay et al., 2017; Toledo-Alvarado et al., 2018), as well as residual feed intake, dry matter intake (DMI), and methane emissions (Bittante and Cipolat-Gotet, 2018; Dórea et al., 2018).
Within the animal breeding context, studies have shown the potential for using FTIR predictions as indicator traits of novel phenotypes like MPC, fatty acid profiles, and other milk components (Cecchinato et al., 2009; Rutten et al., 2011; Bonfatti et al., 2017). Multi-trait prediction allows simultaneous use of information from relatives and from different traits (Henderson and Quaas, 1976). It has been demonstrated that, using different databases, breeds, and traits, the effectiveness of FTIR calibrations to provide novel phenotypes exploitable in indirect selective breeding relies on the magnitude of heritability of FTIR predictions, and the additive genetic correlation between predictions and the measured traits (i.e., gold standard/breeding targets). Although the predictive ability of FTIR data is moderate for some traits, the genetic response achievable using FTIR predictions as indicator traits may be equal to or slightly lower than the response achievable from direct measurements of traits are utilized (Cecchinato et al., 2009; Rutten et al., 2010).
Besides the infrared technology, genome-wide prediction using the single-step approach has also been recognized as an important tool to predict phenotypes (Lee et al., 2008; Aguilar et al., 2010). The key principle for all these applications is the simultaneous estimation of all genome-wide marker effects based on a reference population with known phenotypes. Within this framework, the accuracy of prediction might also benefit from the added value of including genomic information in multi-trait prediction models, which have been shown to have better performances compared to single-trait models (Calus and Veerkamp, 2011; Guo et al., 2014; Karaman et al., 2018). While dairy cattle genetic improvement has historically hinged on progeny testing, genomic selection has made available high-accuracy breeding value predictions for candidate bulls at birth. This could mean that the traditional progeny testing is no longer required, provided that these early breeding value predictions are reliable. In this context, we hypothesized that multi-trait genomic prediction models applied to indicator traits estimated from routinely collected FTIR data, and their corresponding gold-standard measured traits could represent a viable option to evaluate the contribution of FTIR data collection for dairy cattle breeding. Cross-validation could be used to test the predictive ability of models that include or do not include FTIR-predicted phenotypes, to replicate a candidate bulls’ progeny testing or prediction of breeding value at birth.
Therefore, the overall objective of this study was to test the value of FTIR predictions from field data (FIELD) for the genetic improvement of difficult-to-measure traits in dairy cattle. Steps to address this objective were (i) to infer (co)variance components, heritabilities, and additive genetic correlations between 30 LAB-measured and FIELD-predicted phenotypes, divergent in terms of biological meaning, variability, and heritability, related to fine milk composition and milk technological characteristics [traditional MCP, curd firming (CF) traits, cheese yields and recoveries of nutrients, milk acidity and milk protein fractions] and (ii) to use bivariate single-step GBLUP for evaluating the predictive ability of FTIR-derived phenotyping for these traits by using different phenotyping and genotyping strategies.
Materials and Methods
Ethics Statement
This study did not require any specific ethics permit. The cows sampled belonged to commercial private herds and were not experimentally manipulated. Milk samples were collected during routine milk recording coordinated by technicians from the Breeders Federations of Trento Province (FPA, Trento, Italy) and of Alto Adige/Südtirol (Associazione Provinciale delle Organizzazioni Zootecniche Altotesine/Vereinigung der Südtiroler Tierzuchtverbände, Bolzano/Bozen, Italy).
Data Structure
In this study, we used two sets of data collected on Brown Swiss cows: (i) a LAB dataset in which laboratory measurements and spectra data for phenotypes related to milk quality and cheese-making were available to develop calibration equations and (ii) a FIELD-FTIR dataset for testing field prediction at the population level. A subset of the FIELD-FTIR dataset including only first lactation records was used for estimating (co)variance components (FIELDlact1); the whole database was instead used for the genomic analyses (FIELD). The data structure is summarized in Table 1.
Table 1. Summary of data structure for laboratory (LAB) measures, after editing, and Fourier transform infrared predictions (FIELD).
LAB Dataset
The LAB data were part of the Cowability/Cowplus projects. Individual milk samples from 1,200 Brown Swiss cows from 85 herds located in the Alpine province of Trento (Italy) were collected. Details of the animals used in this study and characteristics of the area are reported in Mele et al. (2016). Data on the cows, herds, and single test-day milk yield were provided by the Superbrown Consortium of Bolzano and Trento (Italy), and pedigree information was supplied by the Italian Brown Swiss Cattle Breeders Association (Verona, Italy).
FTIR Spectra Data
Individual milk samples were analyzed using a MilkoScan FT6000 (Foss Electric, Hillerød, Denmark). The spectrum covers from the short-wavelength infrared (SWIR, also known as NIR or IR-B), through the medium-wavelength infrared (MWIR, also known as MIR), to the long-wavelength infrared (LWIR, or LIR) regions with 1,060 spectral points from wavenumber 5,010 to 925 cm–1, which correspond to wavelengths ranging from 1.99 to 10.81 μm and frequencies ranging from 150.19 to 27.73 THz. Spectra were expressed as absorbance calculated as log(1/transmittance). Two spectral acquisitions were carried out for each sample (collected during the evening milking), and the results were averaged before data analysis (Ferragina et al., 2015).
Phenotypes
Traditional Milk Coagulation Properties
Measures of MCP were obtained using two different instruments: a Formagraph (Foss Electric A/S) and an Optigraph (Ysebaert SA, Frépillon, France) according to Cecchinato et al. (2013). Briefly, milk samples (10 mL) were heated to 35°C and 200 μL of a rennet solution (Hansen Standard 160, with 80 ± 5% chymosin and 20 ± 5% pepsin; 160 international milk clotting units/mL; Pacovis Amrein AG, Bern, Switzerland), diluted to 1.6% (wt/vol) in distilled water, was added at the beginning of the analysis. The time of analysis was extended up to 90 min after rennet addition. Rennet coagulation time (RCT) was defined as the time (min) from the addition of enzyme to the beginning of coagulation, k20 (min) was defined as the interval from RCT to the time at which a curd firmness of 20 mm was obtained, and a30 and a45 (mm) were measurements of curd firmness at 30 and 45 min after rennet addition, respectively.
Modeling the Curd Firmness
A set of parameters of CF at time t (CFt) was estimated, and details are described in Bittante et al. (2015). Estimated parameters included rennet coagulation time (RCTeq, min), estimated from the CFt equation; potential asymptotical curd firmness (CFP, mm), representing the maximum potential curd firmness after infinite time in the absence of syneresis; curd-firming rate constant (kCF, %/min), which is a measure of the rate of CF; syneresis rate constant (kSR, %/min); maximum curd firmness (CFmax, mm); and time to CFmax (tmax, min).
Milk Acidity
The milk pH was measured using a Crison Basic 25 electrode (Crison, Barcelona, Spain).
Cheese Yields and Curd Nutrient Recoveries
To assess cheese-making properties, milk samples were processed according to all the steps of the cheese-making practice used in artisanal commercial dairies for producing a traditional whole milk cheese described by Bittante et al. (2014). Briefly, 1,500 mL of milk was heated to 35°C in a stainless steel microvat, supplemented with thermophilic starter culture, and mixed with rennet. The resulting curd from each vat was double-cut and heated for 30 min to 55°C, drained, shaped in wheels, pressed, salted, and weighed. The whey was drained from the curd, weighted, and analyzed for fat, protein, lactose, and total solid content using FT2 (Foss Electric A/S, Hillerød, Denmark). Three cheese yield (CY) traits were calculated expressing the weight (wt) of fresh curd (%CYCURD), of curd DM (%CYSOLIDS), and of water retained in the curd (%CYWATER) as a percentage of the weight of milk processed. Four recovery (REC) traits were calculated as proportions of nutrients and energy of the milk retained in the curd (RECSOLIDS, RECFAT, RECPROTEIN, and RECENERGY calculated as the % ratio between the nutrient in curd and the corresponding nutrient in processed milk). The energy within the curd was calculated as the difference between energy in the milk and in the whey (NRC, 2001).
Milk Proteins
Concentrations of the major casein (CN) fractions (αS1–CN, αS2–CN, β-CN, and κ-CN) and whey proteins, β-lactoglobulin (β-LG), and alpha-lactalbumin (α-LA) were determined using a validated reversed phase high-performance liquid chromatography (RP-HPLC) method (Bonfatti et al., 2008). Each protein fraction was expressed as a percentage of the milk total nitrogen (N) content.
Data Editing
Only records with spectra and measured phenotypes available were kept. After data editing and removing observations outside three standard deviations, the final number of records and phenotypes used in subsequent analyses varied from 770 to 1,120 depending on the trait as reported in Table 2.
Table 2. Descriptive statistics for laboratory (LAB) measures and Fourier transform infrared predictions (FIELD) for the investigated traits.
Field Data
Spectra Data
The FIELD data for our study were provided by the Breeders Federations of Trento Province (FPA, Trento, Italy) and of Alto Adige/Südtirol (Associazione Provinciale delle Organizzazioni Zootecniche Altotesine/Vereinigung der Südtiroler Tierzuchtverbände, Bolzano/Bozen, Italy) as part of the Cowability/Cowplus projects. All milk samples were analyzed using a MilkoScan FT6000 (Foss Electric, Hillerød, Denmark). Spectra characteristics, in terms of wavelengths and way of expression (absorbance), were the same as for the LAB data. Spectra were collected between 2010 and 2017.
Spectra Editing
A preliminary analysis was carried out to identify the outlier spectra based on the Mahalanobis distance from the first five principal component scores. The probability level for the chi-squared distribution of a sample’s Mahalanobis distance was calculated from the incomplete gamma function with five degrees of freedom (Toledo-Alvarado et al., 2018). Samples with P < 0.01 were removed from the dataset. To overcome spectral variations, the absorbance values for every wavelength were centered to a null mean and standardized to a unit sample variance within year periods. Records without parity number, date of calving, animal ID, or pedigree information were also removed. If a cow had predictions from both LAB and FIELD datasets, the FIELD prediction records of that specific cow were deleted. This latter editing step was performed for the following reasons: (i) to avoid an overestimation of additive genetic correlations; (ii) to force the connection between LAB and FIELD data only through the additive genetic effect; and (iii) to avoid EBV inflation for cows with both LAB and FIELD records.
To detect outliers among the predicted phenotypes, a mixed model was fitted for each trait including the fixed non-genetic effects of (i) the stage of lactation (12 monthly classes), (ii) the combined effect of herd (∼2,460 levels), year of the test day (2010–2017), and two seasons of calving (April to September and October to March) (∼22,200 levels). The permanent environmental effects (∼39,600) and animal additive genetic effects were included as random terms.
The residuals of phenotypes outside 3 standard deviations were considered as outliers.
Genotypes
The pool of genotyped individuals consisted of (i) 1,011 LAB cows genotyped with the Illumina BovineSNP50 v.2 BeadChip (Illumina, Inc., San Diego, CA, United States; 54,000 SNPs); (ii) 1,463 FIELD cows which were genotyped with the BovineLD v2.0 BeadChip (Illumina, Inc., San Diego, CA, United States; 7,931 SNPs); (iii) 181 sires with both LAB and FIELD daughters genotyped with the Illumina BovineSNP50 v.2 BeadChip or the Illumina Bovine HD BeadChip (Illumina, Inc., San Diego, CA, United States; 777,000 SNPs); and (iv) 540 sires with FIELD daughters only genotyped as (v). The software FImpute (Sargolzaei et al., 2014) was used for imputation and all individuals were imputed to the BovineSNP50 v.2 BeadChip panel. A total number of 3,195 genotyped individuals were available for this study.
Marker editing was performed using the preGSf90 software (Misztal et al., 2002). Markers were excluded where the call rate was below 95%, the minor allele frequency was below 5%, and/or there was significant deviation from the Hardy–Weinberg equilibrium (P < 10–6). SNPs mapped to the sex chromosomes or with unknown position on the genome were also removed. After editing, the number of markers available for analyses was 37,093.
Statistical Analyses
Calibration Equations
Separate models were fitted for each trait considered. We used a Bayesian model (BayesB model) implemented in BGLR (de los Campos and Perez-Rodriguez, 2014), as previously described by Ferragina et al. (2015). Phenotypes in LAB (i.e., the calibration dataset) were regressed on standardized spectra covariates using the linear model:
where β0 is an intercept, {xij} are standardized FTIR spectra-derived wavelength data(j = 1,⋯1,060), βj are the effects of each of the wavelengths, and εi are model residuals assumed to be independent and identically distributed, with normal distribution centered at zero and variance . Models were applied with 100,000 iterations and 20,000 chains discarded as burn in. The editing and analysis were conducted using R software (R Core Team, 2018). To evaluate predictive performance, the coefficient of correlation for the calibration model developed on LAB measures and used for FIELD predictions was calculated for each trait (rC_LAB). The calibration equations obtained with this procedure in LAB data were then applied to the spectral population data (FIELD) in order to obtain FIELD predictions for all traits of concerns.
Genetic Analyses: (Co)Variance Components Estimation Between Measured (LAB) and Predicted (FIELD) Phenotypes
The (co)variance components were estimated using REMLF90 and AIREMLF90 (Misztal et al., 2002), considering LAB measures and FIELD predictions as distinct traits. The connection between the two datasets was guaranteed by 266 sires, 1,044 dams, 94 sires of sires, and 372 sires of dams in common between LAB and FIELD datasets (Table 1). To save time and improve convergence of models, for this first approach, the FIELD dataset has been limited to cows belonging to first lactation (FIELDlact1).
The program was run until a convergence criterion of 1e−10 was reached.
The model for LAB-measures was
where y is the vector of observations for traits of concern; b, a, and e correspond to the vector of fixed non-genetic effects, random animal additive genetic effects, and random residual effects, respectively; X and Za are the incidence matrices relating each observation in y to b and a. The non-genetic fixed effects included in the model were (i) the DIM of each cow within parity (60 levels) and (ii) herd-test day (83 levels). The random terms were animal additive genetic effects and residual effects. The pedigree file included all phenotyped animals and their ancestors (∼6,500 animals).
where and are the additive genetic and residual variances, respectively.
The model for FIELDlact1 records was
where y is the vector of observations for the traits of concern; b, a, pe, and e correspond to the vector of fixed non-genetic effects, random animal additive genetic effects, random permanent environmental effects, and random residuals effects, respectively; X, Za, and Zpe are the incidence matrices relating each observation in y, a, b, and pe, respectively. The fixed non-genetic effects were (i) the stage of lactation (12 monthly classes) and (ii) the combined effect of herd (∼2,460 levels), year of the test-day (2010–2017), and season of calving (April to September and October to March) (∼22,200 levels). The random terms were permanent environmental effects (∼39,600), animal additive genetic effects, and residual effects. The pedigree file included all phenotyped animals and their ancestors (∼97,900 animals). Only first lactation records were utilized to avoid convergence problems. The residuals were considered uncorrelated across the two traits (LAB and FIELDlact1 predictions).
where , , and are the additive genetic, permanent environmental, and residual variances, respectively.
For each trait, the additive genetic correlation between LAB and FIELDlact1 was estimated from the variance–covariance matrix of the random additive genetic effect as where σa1,a2 is the additive genetic covariance between two traits, and σa1 and σa2 are the additive genetic standard deviations for traits 1 and 2, respectively.
Predictive Ability Estimated Using Genomic and Infrared Information
In order to assess the ability of FIELD data to predict LAB records as phenotypes of interest, a fourfold cross-validation was used. In this setting, the LAB phenotype is considered as the breeding goal while the FIELD record is its correlated trait. The models employed for both LAB and FIELD data were the same as for the variance component estimation, except for the stage of lactation effect in the FIELD dataset that was replaced with a stage of lactation by parity effect (5 parity classes, 72 levels in total), since the whole FIELD dataset was used (not restricting the records to first lactation).
Sires with both LAB and FIELD daughters were randomly assigned to four groups, evenly sized based on the number of LAB cows for each sire. Details on the number of records and cows in each set are reported in Supplementary Table S1. Cross-validation was performed using alternatively 3 groups for training (denoted with the suffix “t”) and one group for validation (denoted with the suffix “v”), for the scenarios that only included the training sets.
The cross-validation considered different training sets, either using LAB or FIELD data alone or combined: (i) model “LAB.t” included phenotypes available are from LAB cows that were daughters of the sires in the training set; (ii) model “FIELD.t” included phenotypes available from FIELD cows that were daughters of the sires in the training set; (iii) model “LAB.t + FIELD.t” included phenotypes from LAB and FIELD cows that were daughters of the sires in the training set; (iv) model “FIELD.t + FIELD.v” included phenotypes available are from all FIELD cows (no alternate masking of phenotypes was performed here); and (v) model “LAB.t + FIELD.t + FIELD.v” included phenotypes available from LAB cows that were daughters of the sires from the training set and all FIELD cows. Model “LAB.t” assesses the predictive ability when FIELD records are not collected, i.e., no use of FTIR data. Model “FIELD.t” evaluates the impact of FIELD phenotyping the daughters of the bulls in the reference population, with no LAB phenotypes included in the genetic evaluation. Model “LAB.t + FIELD.t” evaluates the impact of including FIELD phenotypes for the daughters of the bulls in the progeny test. In model “FIELD.t + FIELD.v,” progeny testing bulls have daughters phenotyped with FTIR data while in model “LAB.t + FIELD.t + FIELD.v,” the progeny testing bulls have daughters phenotyped with LAB data as well.
Accuracy of prediction was quantified as
where accx is the accuracy in the validation set x (x = 1,2,3,4), are the masked LAB records from cows in validation set x, is the estimated genomic breeding value for LAB cows in validation set x, and cor refers to the Pearson correlation. Estimated genomic breeding values were calculated using the program BLUPf90 (Misztal et al., 2002) that combines pedigree-derived and SNP-derived genomic relationship matrices (Legarra et al., 2009). All 3,195 genotypes available for this study were included in the analysis.
For this part of the study, we did not use the variance components estimated in the previous step, but we tested different genetic correlation (covariance) values in order to find those that better maximized prediction accuracy. Firstly, estimates of variance components were obtained from single-trait models (Supplementary Table S2). Single-trait models were used in order to remove the inflation in the estimates due to genetic covariances. Then, different values of genetic correlation between LAB and FIELD phenotypes ranging from 0.1 to 1, with 0.1 step increases, were generated. Each prediction run was then repeated for the values of covariance generated by each value of assumed genetic correlation. This allowed the exploration of the predictive ability over all the potential values of genetic correlation.
Results and Discussion
Descriptive Statistics
Descriptive statistics for all investigated LAB-measured and FIELD-predicted traits are summarized in Table 2. Extensive description and discussion of the results for the LAB dataset have been reported in previous works, including the differences between MCP measured with FRM and OPT (Cecchinato et al., 2013; Bittante et al., 2015; Pegolo et al., 2018). Generally, we observed consistency between the mean values from LAB, FIELDlact1, and FIELD datasets for all the investigated traits. We only observed a slight loss of variability for all the FTIR-predicted phenotypes as demonstrated by the lower standard deviations, compared to for the observed traits.
Variance Components and Heritability of Measured and Predicted Traits
Coagulation Traits and Milk Acidity
Variance components and heritability estimates from LAB and FIELDlact1 datasets for coagulation traits and milk acidity are reported in Table 3 and Figure 1. The coefficient of correlation for the calibration models developed on LAB measures (rC_LAB) and used for FIELD predictions, and the additive genetic correlation (ra) between LAB and FIELD traits are reported in Table 3 and Figure 2. As previously reported, in order to guarantee convergence of animal models, these analyses have been restricted to a dataset with only first lactation records of FIELD data. Generally, a decrease in both genetic and residual variances was observed in FIELDlact1 with respect to LAB data. A similar pattern was reported in other studies (Bonfatti et al., 2017). In particular, residual variances for LAB traits were 1.3–6 times greater than residual plus permanent environmental variances in FIELDlact1. In the case of LAB parameters obtained from mechanical lacto-dynamographs (traditional MCP and Curd firming model), this difference increases particularly for traits recorded at increasing time intervals from milk gelation. This is caused by the decreasing repeatability of the measures recorded by the instrument with the progress of the textural test. Therefore, heritability estimates were comparable between datasets for traits recorded earlier (e.g., RCT, kCF), whereas heritability estimates decreased for later measured LAB traits, but not for FTIR predicted FIELDlact1 traits (a30, a45, CFP, CFmax). Optigraph, yielding an optical prediction and not a mechanical measurement, does not show the same decrease in repeatability and heritability, for traits describing the later part of CF pattern. Generally, heritability estimates of measured and predicted traits were comparable to other studies (Cecchinato et al., 2009; Bonfatti et al., 2017).
Table 3. Estimates of variance components and heritability of measured LAB traits, coefficient of correlation of the calibration model used for infrared prediction (rC_LAB) developed on LAB measures and used for FIELD predictions, estimates of variance components and heritability of Fourier transform infrared FIELDlact1 predictions, and additive genetic correlation (ra) between LAB and FIELD traits for the coagulation and acidity traits.
Figure 1. Estimates of heritability for LAB measures and FTIR predictions of the investigated traits using LAB and FIELDlact1 data. MCP = milk coagulation properties; RCT = rennet coagulation time; k20 = curd firming (CF) rate as the time to a curd firmness of 20 mm; a30 (45) = curd firmness at 30 (45) min from rennet addition; RCTeq = rennet coagulation time estimated using the equation; CFP = asymptotic potential curd firmness; kCF = curd firming instant rate constant; kSR = syneresis instant rate constant; CFmax = maximum curd firmness achieved within 45 min; tmax = time at achievement of CFmax; OPT = Optigraph; CY = cheese yield; %CYCURD = weight of fresh curd as percentage of weight of milk processed; %CYSOLIDS = weight of curd solids as percentage of weight of milk processed; %CYWATER = weight of water curd as percentage of weight of milk processed; REC = recoveries; RECPROTEIN = protein of the curd as percentage of the protein of the milk processed; RECFAT = fat of the curd as percentage of the fat of the milk processed; RECSOLIDS = solids of the curd as percentage of the solids of the milk processed; RECENERGY = energy of the curd as percentage of energy of the milk processed. 1True protein nitrogen (N) and milk N fractions are expressed as percentage of total milk N; β-CN (β-casein), κ-CN (κ-casein), αs1-CN (αs1-casein), αs2-CN (αs2-casein); caseins (CN): Σ(β-CN+κ-CN+ αs1-CN+αs2-CN); β-LG (β-lactoglobulin), α-LA (α-lactalbumin), whey proteins (WP): Σ(β-LG+ α-LA).
Figure 2. Coefficient of correlation of the calibration models used for infrared prediction (rC_LAB) and additive genetic correlations (ra) between LAB measures and FTIR predictions of the investigated traits using LAB and FIELDlact1 data. MCP = milk coagulation properties; RCT = rennet coagulation time; k20 = curd firming (CF) rate as the time to a curd firmness of 20 mm; a30 (45) = curd firmness at 30 (45) min from rennet addition; RCTeq = rennet coagulation time estimated using the equation; CFP = asymptotic potential curd firmness; kCF = curd firming instant rate constant; kSR = syneresis instant rate constant; CFmax = maximum curd firmness achieved within 45 min; tmax = time at achievement of CFmax; OPT = Optigraph; CY = cheese yield; %CYCURD = weight of fresh curd as percentage of weight of milk processed; %CYSOLIDS = weight of curd solids as percentage of weight of milk processed; %CYWATER = weight of water curd as percentage of weight of milk processed; REC = recoveries; RECPROTEIN = protein of the curd as percentage of the protein of the milk processed; RECFAT = fat of the curd as percentage of the fat of the milk processed; RECSOLIDS = solids of the curd as percentage of the solids of the milk processed; RECENERGY = energy of the curd as percentage of energy of the milk processed. 1True protein nitrogen (N) and milk N fractions are expressed as percentage of total milk N; β-CN (β-casein), κ-CN (κ-casein), αs1-CN (αs1-casein), αs2- CN (αs2-casein); caseins (CN): Σ(β-CN+κ-CN+ αs1-CN+αs2-CN); β-LG (β-lactoglobulin), α-LA (α-lactalbumin), whey proteins (WP): Σ(β-LG+ α-LA).
Cheese-making traits. Variance components and heritability estimates of LAB and FIELDlact1 for cheese yield and recovery traits are reported in Table 4 and Figure 1. In addition, the rC_LAB developed on LAB data and the ra between LAB and FIELD traits are reported in Table 4 and Figure 2. For CY traits, genetic variances were larger in FIELDlact1 for CYCURD (∼1.2-fold) and CYSOLIDS (∼1.6-fold) and smaller for CYWATER (∼1.2-fold). For all these traits, a ∼1.5-fold reduction was observed in residual plus permanent environmental variances. For CYCURD and CYWATER, heritability estimates were comparable between the two datasets, while the heritability estimate of CYSOLIDS was almost double in FIELDlact1 (0.358), compared to LAB (0.192). Variance components for REC traits had different behaviors: RECPROTEIN and RECFAT had smaller genetic variances for FIELDlact1 compared to LAB traits (∼1.6- and 1.2-fold, respectively), while RECSOLIDS and RECENERGY had larger genetic variances in FIELDlact1, compared to LAB (∼1.3-fold). On the other hand, we observed a reduction in residual variance for all traits except RECPROTEIN which had a slightly higher value in FIELDlact1, compared to the LAB trait. Accordingly, heritability estimates for REC traits were comparable or higher in FIELDlact1, with the exception of RECPROTEIN which had a lower value in FIELDlact1 (0.247 vs. 0.563).
Table 4. Estimates (Est, with standard error reported as SE) of variance components and heritability of measured LAB traits, coefficient of correlation of the calibration model used for infrared prediction (rC_LAB) developed on LAB measures and used for predictions using FIELD spectra, estimates of variance components and heritability of Fourier transform infrared FIELDlact1 predictions, and additive genetic correlation (ra) between LAB and FIELD for the cheese yield, curd nutrient recovery traits, and protein fractions.
Milk Protein Fractions
Genetic parameters of milk protein fractions are also presented in Table 4 and Figure 1. In the case of milk proteins, heritability estimates of LAB traits were comparable to previous studies (Schopen et al., 2009). FTIR-predicted milk proteins showed a marked decrease in genetic variance, compared to LAB measures (especially αS1-CN, from 0.873 to 0.065), while residual variances were larger for total CN, κ-CN, total whey, β-LG, and α-LA, smaller for αS1-CN and αS2-CN, and comparable for β-CN. Heritability estimates were lower in FIELDlact1 for all protein traits, ranging from high-moderate (ranging from 0.182 to 0.678) to moderate-low (ranging from 0.051 to 0.264; Figure 1). These results were in accordance with previous studies, showing a decrease in the estimated genetic variance and heritabilities for predicted milk proteins compared to measured traits (Rutten et al., 2011; Bonfatti et al., 2017), and are essentially due to the low rC_LAB values obtained for these traits.
Reliability of Calibrations and Genetic Correlation Between Measured and Predicted Traits
To assess the reliability of calibration equations in the animal breeding context, we estimated, besides the rC_LAB, the ra between LAB and FIELDlact1 traits as an indicator of the potential of FTIR calibrations to provide novel phenotypes for indirect selective breeding. Results are displayed in Figure 2. The rC_LAB ranged from 0.571 (kSR) to 0.953 (total CN). In general, estimates of ra for milk coagulation and cheese-making traits were high (>0.75), except for kCF and kSR which displayed moderate-high estimates (>0.60). On the other hand, only two milk proteins showed ra smaller than 0.6: αS1-CN (0.396) and αS2-CN (0.535). These two traits were those with the lowest rC_LAB among all traits tested (Table 4: 0.598 and 0.623, respectively). This means that expected genetic gain using these two predicted milk proteins would be moderately lower, compared to that achievable for the other traits. These results were in accordance with Bonfatti et al. (2017) which found high ra for milk protein content but lower ra for percentage traits (0.58, on average). On the other hand, Rutten et al. (2011) reported greater ra values compared to those in our study (0.62–0.97), which might be ascribed to differences in population (breed, size) and/or in the analytical technique used for milk protein investigation (capillary electrophoresis vs. HPLC). The reliability of calibration rC_LAB was associated with the decrease in genetic variance between LAB and FIELDlact1 traits (R2 = 0.45). In particular, lower rC_LAB corresponded to larger decrease in genetic variance (Figure 3A). A weaker relationship was observed between rC_LAB and the decrease in residual variance (R2 = 0.09, data not reported in Figures). A positive relationship (R2 = 0.26) was observed between rC_LAB and ra: a higher reliability of the calibration model corresponded to a higher additive genetic correlation between the LAB-measures and FIELDlact1 predictions (Figure 3B). Beyond this correlation, it is worth noting from Figure 2 that 22 out of 30 investigated traits presented genetic correlations greater than the reliability of calibration (ra > rC_LAB). So, even calibrations with a relatively small predictive ability could be exploited in selective breeding, if their ra between measured and predicted traits is sufficiently high.
Figure 3. Relationships between the coefficient of correlation of calibration models (rC_LAB) used for infrared prediction and (A) the change in additive genetic variance () and (B) the additive genetic correlation (ra) between LAB measures and FTIR predictions.
Single-Step GBLUP Prediction Accuracy by Cross-Validation
Table 5 and Supplementary Figure S1 report the results for the cross-validation performed in order to compare different phenotyping strategies. The cross-validation mimicked the progeny testing scheme, where the LAB phenotypes of testing sires were predicted based on LAB phenotypes from daughters of proven bulls and FIELD information from daughters of proven and progeny testing bulls. Results are presented as the average of the prediction accuracy for the fourfold of sires. For the scenarios that included FIELD information, results in Table 5 show the largest predictive ability obtained within the range of genetic correlation values tested, which is also reported in Table 5. The change in prediction accuracy for each trait over the values of genetic correlation is reported in Supplementary Figure S1.
Table 5. Best-performing model in terms of accuracy (acc) for each of the scenarios studied. The values in bold indicate the best performing scenario (in terms of accuracy) for each trait.
Prediction With LAB Information
In the LAB.t scenario, daughters of progeny testing bulls are not phenotyped and FIELD information is not included and the genetic evaluation is based on a single-trait model. The predictive ability for the different traits followed, to a large extent, the trait heritability (Figure 1). Among MCP traits, RCT showed the highest accuracy (0.118), with the other traits showing almost null values. For CF traits, only RCTeq, kCF, and tmax showed values of accuracy above 0.10. CFp and Cmax showed null values, which is in agreement with the low (∼0.1) heritability estimates. Optigraph traits showed a different trend, with RCT reporting the strongest prediction accuracy (0.168) but not the largest heritability. Prediction accuracies were below 0.1 for all CY traits, for which heritability estimates barely reached the value of 0.2. Prediction accuracy was largest for RECPROTEIN among recovery traits, and its heritability estimate was the largest in the group (∼0.55). A similar scenario was found among casein fractions, where whey proteins, β-LG, and β-CN showed one of the strongest prediction accuracies (0.208, 0.187, and 0.153, respectively) and the largest heritability (above 0.50). Surprisingly, αS1-CN showed a strong predictive ability (0.165), yet not the largest heritability estimate (∼0.50). Prediction with only LAB information can be considered unreliable from a genetic evaluation standpoint, given the limited size of the dataset and the relative cost needed to acquire each single record. However, this scenario was included as a reference for the other, more reliable, models.
Prediction When LAB and FIELD Information Is Included
The LAB.t+FIELD.t scenario mimics a genetic evaluation where both LAB and FIELD information of proven bulls is included. The comparison with the LAB.t scenario proves the value of including FIELD.t information, which highlights the importance of the construction of FTIR calibration equations and the collection of data at the population level on daughters of proven bulls. Still, the genetic merit of bulls can be predicted at birth under this scenario. The genetic evaluation leverages a bivariate model. Overall, prediction accuracy showed a strong increase, compared to the previous scenario. The largest increases in prediction accuracy were observed for curd firming traits, with MCP k20, a30, and a45 having prediction accuracies of 0.208, 0.190, and 0.181, respectively (compared to null accuracies in LAB.t); Optigraph a30 prediction accuracy increased from 0.037 to 0.242; and CFp increased from 0.038 to 0.235. Among protein composition traits, κ-CN increased from 0.051 to 0.330, while αS1-CN did not increase (being the only trait not showing any increase). For all the abovementioned traits that showed an increase, the optimal value of genetic correlation appeared to be 0.9, which corroborates the relevance of FIELD information for accurately predicting LAB breeding values and phenotypes. The values of correlation for the calibration model used for infrared prediction (rC_LAB) and additive genetic correlations (ra) estimated from the data were large for these traits but not the largest found.
The LAB.t+FIELD.t+FIELD.v scenario mimicked a genetic evaluation based on a progeny testing scheme, which implies the collection of FTIR spectra information on the daughters of progeny testing bulls (FIELD.v). Here, genetic merit of bulls cannot be estimated at birth but collection of FIELD phenotypes is needed as in traditional progeny testing. As compared to a previously discussed scenario, the advantage of the FIELD.v data seemed marginal, showing a maximum of 1.2-fold increase and often a decrease in prediction accuracy. Traits that benefited the most were OPT k20 (from 0.250 to 0.302), CYWATER (from 0.203 to 0.242), and casein proportion (from 0.198 to 0.230). Traits that did not show an increase were αS2-CN, whey proteins, α-LA, and β-LG.
Prediction When LAB Information Is Not Included
Scenarios FIELD.t and FIELD.t+FIELD.v mimic a breeding scheme where LAB information is not included in the genetic evaluations. Still, LAB information is used to obtain FTIR phenotype prediction equations. The underlying model is a bivariate model that produces LAB breeding values, although LAB records are not included. Comparing the LAB.t+FIELD.t scenario to the FIELD.t scenario allows the evaluation of the contribution of LAB information when FIELD information is recorded on the daughters of proven bulls. Here, differences were negligible, except for αS1-CN that showed a dramatic decrease from 0.165 to 0.055 when LAB information was removed. This is due to the low quality of the calibration model used for FTIR predictions (rC_LAB = 0.60 and ra = 0.40). The comparison of the LAB.t+FIELD.t+FIELD.v scenario to the FIELD.t+FIELD.v allows us to further prove the value of LAB phenotypes when FIELD information is also recorded in daughters of progeny testing bulls. Again, changes were negligible for most traits, confirming the low relevance of LAB information when solid FTIR calibration equations are used. The relevance of FIELD.v information is assessed in the comparison of FIELD.t and FIELD.t+FIELD.v, when LAB information is absent. While the increase was negligible in the presence of LAB information (maximum of 1.2-fold increase in accuracy), in this case the fold increase reached 1.25 with several traits being about 1.1. This supports the hypothesis that progeny testing could be beneficial for most traits, but probably the gain in accuracy would not match the cost of delaying the candidate bull’s evaluation. It should be considered that generation interval can be dramatically reduced when FIELD.v information is omitted, compromising prediction accuracy, but probably not sufficiently to justify progeny testing cost.
Best Performing Scenario in Terms of Prediction Accuracy
With the exception of αS1-CN, which showed a poor FTIR calibration equation, all traits showed an advantage in the inclusion of FIELD information, supporting the need for well-constructed calibration equations that allow us to obtain predicted phenotypes at the population level. Furthermore, an increase in prediction accuracy was found with the inclusion of FIELD.v information rather than just FIELD.t, either with or without the presence of LAB.t phenotypes. Scenarios that included all FIELD information outperformed scenarios that just included FIELD.t, with the exception of αS2-CN. This would lead to further speculation on the need for conducting progeny testing for new bulls, but the gain in accuracy of prediction is probably not translated into gain in genetic progress due to increased generation interval. It should also be noted that for some traits, i.e., k20, kCF, kSR, βCN, pH, αS1-CN, and α-LA (not necessarily those with low FTIR prediction accuracy), the trend of prediction accuracy over the values of assumed genetic correlation was sometimes null if not negative.
Despite the large number of traits involved, this study does not allow us to declare a breakeven value for any genetic parameter that would serve to predict the value of FIELD vs. LAB information. The only trait that did not benefit from FIELD information happened to be αS1-CN, for which the quality of the FTIR calibration equation was particularly low, followed by the low FIELD heritability and low genetic correlation with LAB measures. All other traits benefited from the inclusion of FIELD information and the accuracy gained with FIELD information greatly depended on the genetic correlation between LAB and FIELD traits. Nonetheless, it was not possible to declare a breakeven value of genetic correlation which deemed the use of FIELD information advantageous. Breakeven values of genetic correlations for indicator traits were found to be 0.5 by Calus and Veerkamp (2011), who studied genomic selection performance under multi-trait scenarios, and 0.7–0.8 by Mulder and Bijma (2007) who studied the impact of genotype by environment interactions in breeding programs. Given that this study based on field data, there could be more factors affecting the value of indicator traits in genomic prediction. Further research is needed in order to explore all potential contributions.
Conclusion
The present study reported two approaches for assessing the contribution of FTIR calibration equations at the population level for dairy cattle breeding. With the first approach, results indicate that FIELD predictions can be used in breeding programs for the genetic improvement of difficult-to-measure traits and that indirect selection for FIELD predictions will provide satisfactory responses. With the second approach, for the first time we highlighted the utility of FTIR predictions for breeding purposes using real data to simulate different genetic evaluation scenarios, where FTIR-derived phenotypic information is dosed into (single-step) GBLUP to predict wet-lab measured performance of daughters of progeny testing bulls. Collection of FIELD measures for progeny testing bulls appears to be advantageous for increasing the predictive ability for most of the traits studied, but the increase in generation interval due to progeny testing does not justify the increase in prediction accuracy. LAB information from proven bulls’ daughters could be included in the genetic evaluations without a detrimental effect. As there is no evidence of a clear advantage to including FIELD information for progeny testing bulls, progeny testing schemes could be replaced by the construction of robust calibration equations together with more vast collection of FIELD measures on daughters of proven bulls.
In general, the increase in predictive ability observed with the inclusion of FIELD information is very favorable and reaches moderate values for 12 traits. While further research is needed in the modeling of FTIR-predicted data, results are promising. Once calibration equations are developed, the cost of collecting FIELD is virtually null, provided that routine spectra acquisition within milk recording schemes is performed and available for breeders. Yet, the cost of developing robust calibration equations should be factored into the total cost of implementing (genomic) selection that includes FIELD data. Thus, an economic analysis should be performed before progressing its use in breeding programs for difficult-to-measure traits.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Author Contributions
AC conceived the study, contributed to set up the objectives of this study, drafted the first version of the manuscript, and supervised the project. HT-A performed the statistical analysis together with FT. SP contributed to the critical interpretation of the results and to manuscript drafting. GB and CM contributed to the critical revision of the manuscript. FT conceived the study with AC, contributed to the critical interpretation of the results, and to manuscript drafting. All authors have read and approved the final manuscript.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The authors wish to thank Italian Brown Swiss Cattle Breeders Association (ANARB, Verona, Italy), the Associazione Provinciale delle Organizzazioni Zootecniche Altoa-tesine/Vereinigung der Südtiroler Tierzuchtverbände, and the Federazione Latterie Alto Adige/Sennereiver-band Südtirol (Bolzano/Bozen, Italy) for providing the data.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2020.563393/full#supplementary-material
References
Aguilar, I., Misztal, I., Johnson, D. L., Legarra, A., Tsuruta, S., and Lawlor, T. J. (2010). Hot topic: A unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score. J. Dairy Sci. 93, 743–752. doi: 10.3168/jds.2009-2730
Barbano, D. M., and Clark, J. L. (1989). Infrared milk analysis — challenges for the future. J. Dairy Sci. 72, 1627–1636. doi: 10.3168/jds.S0022-0302(89)79275-4
Belay, T. K., Svendsen, M., Kowalski, Z. M., and Ådnøy, T. (2017). Genetic parameters of blood β-hydroxybutyrate predicted from milk infrared spectra and clinical ketosis, and their associations with milk production traits in Norwegian Red cows. J. Dairy Sci. 100, 6298–6311. doi: 10.3168/jds.2016-12458
Bilder, R. M., Sabb, F. W., Cannon, T. D., London, E. D., Jentsch, J. D., Parker, D. S., et al. (2009). Phenomics: the systematic study of phenotypes on a genome-wide scale. Neuroscience 164, 30–42. doi: 10.1016/j.neuroscience.2009.01.027
Bittante, G., and Cipolat-Gotet, C. (2018). Direct and indirect predictions of enteric methane daily production, yield, and intensity per unit of milk and cheese, from fatty acids and milk Fourier-transform infrared spectra. J. Dairy Sci. 101, 7219–7235. doi: 10.3168/jds.2017-14289
Bittante, G., Cipolat-Gotet, C., Malchiodi, F., Sturaro, E., Tagliapietra, F., Schiavon, S., et al. (2015). Effect of dairy farming system, herd, season, parity, and days in milk on modeling of the coagulation, curd firming, and syneresis of bovine milk. J. Dairy Sci. 98, 2759–2774. doi: 10.3168/jds.2014-8909
Bittante, G., Ferragina, A., Cipolat-Gotet, C., and Cecchinato, A. (2014). Comparison between genetic parameters of cheese yield and nutrient recovery or whey loss traits measured from individual model cheese-making methods or predicted from unprocessed bovine milk samples using Fourier-transform infrared spectroscopy. J. Dairy Sci. 97, 6560–6572.
Boichard, D., and Brochard, M. (2012). New phenotypes for new breeding goals in dairy cattle. Animal 6:18. doi: 10.1017/S1751731112000018
Bonfatti, V., Grigoletto, L., Cecchinato, A., Gallo, L., and Carnier, P. (2008). Validation of a new reversed-phase high-performance liquid chromatography method for separation and quantification of bovine milk protein genetic variants. J. Chromatogr. A 1195, 101–106. doi: 10.1016/j.chroma.2008.04.075
Bonfatti, V., Vicario, D., Degano, L., Lugo, A., and Carnier, P. (2017). Comparison between direct and indirect methods for exploiting Fourier transform spectral information in estimation of breeding values for fine composition and technological properties of milk. J. Dairy Sci. 100, 2057–2067. doi: 10.3168/JDS.2016-11951
Calus, M. P. L., and Veerkamp, R. F. (2011). Accuracy of multi-trait genomic selection using different methods. Genet. Sel. Evol. 43:26. doi: 10.1186/1297-9686-43-26
Cecchinato, A., Albera, A., Cipolat-Gotet, C., Ferragina, A., and Bittante, G. (2015). Genetic parameters of cheese yield and curd nutrient recovery or whey loss traits predicted using Fourier-transform infrared spectroscopy of samples collected during milk recording on Holstein, Brown Swiss, and Simmental dairy cows. J. Dairy Sci. 98, 4914–4927. doi: 10.3168/jds.2014-8599
Cecchinato, A., Cipolat-Gotet, C., Casellas, J., Penasa, M., Rossoni, A., and Bittante, G. (2013). Genetic analysis of rennet coagulation time, curd-firming rate, and curd firmness assessed over an extended testing period using mechanical and near-infrared instruments. J. Dairy Sci. 96, 50–62. doi: 10.3168/jds.2012-5784
Cecchinato, A., de Marchi, M., Gallo, L., Bittante, G., and Carnier, P. (2009). Mid-infrared spectroscopy predictions as indicator traits in breeding programs for enhanced coagulation properties of milk. J. Dairy Sci. 92, 5304–5313. doi: 10.3168/jds.2009-2246
de los Campos, G., and Perez-Rodriguez, P. (2014). Genome-wide regression and prediction with the BGLR statistical package. Genetics 198, 483–495. doi: 10.1534/genetics.114.164442
Dórea, J. R. R., Rosa, G. J. M., Weld, K. A., and Armentano, L. E. (2018). Mining data from milk infrared spectroscopy to improve feed intake predictions in lactating dairy cows. J. Dairy Sci. 101, 5878–5889. doi: 10.3168/jds.2017-13997
Ferragina, A., Cipolat-Gotet, C., Cecchinato, A., and Bittante, G. (2013). The use of Fourier-transform infrared spectroscopy to predict cheese yield and nutrient recovery or whey loss traits from unprocessed bovine milk samples. J. Dairy Sci. 96, 7980–7990. doi: 10.3168/jds.2013-7036
Ferragina, A., de los Campos, G., Vazquez, A. I, Cecchinato, A., and Bittante, G. (2015). Bayesian regression models outperform partial least squares methods for predicting milk components and technological properties using infrared spectral data. J. Dairy Sci. 98, 8133–8151. doi: 10.3168/jds.2014-9143
Guo, G., Zhao, F., Wang, Y., Zhang, Y., Du, L., and Su, G. (2014). Comparison of single-trait and multiple-trait genomic prediction models. BMC Genet. 15:30. doi: 10.1186/1471-2156-15-30
Henderson, C. R., and Quaas, R. L. (1976). Multiple trait evaluation using relatives’ records. J. Anim. Sci. 43, 1188–1197. doi: 10.2527/jas1976.4361188x
Houle, D., Govindaraju, D. R., and Omholt, S. (2010). Phenomics: the next challenge. Nat. Rev. Genet. 11, 855–866. doi: 10.1038/nrg2897
Karaman, E., Lund, M. S., Anche, M. T., Janss, L., and Su, G. (2018). Genomic prediction using multi-trait weighted GBLUP accounting for heterogeneous variances and covariances across the genome. G3 Genes Genomes Genet. 8, 3549–3558. doi: 10.1534/g3.118.200673
Lee, S. H., van der Werf, J. H. J., Hayes, B. J., Goddard, M. E., and Visscher, P. M. (2008). Predicting unobserved phenotypes for complex traits from whole-genome SNP data. PLoS Genet. 4:e1000231. doi: 10.1371/journal.pgen.1000231
Legarra, A., Aguilar, I., and Misztal, I. (2009). A relationship matrix including full pedigree and genomic information. J. Dairy Sci. 92, 4656–4663. doi: 10.3168/jds.2009-2061
Mele, M., Macciotta, N. P. P., Cecchinato, A., Conte, G., Schiavon, S., and Bittante, G. (2016). Multivariate factor analysis of detailed milk fatty acid profile: effects of dairy system, feeding, herd, parity, and stage of lactation. J. Dairy Sci. 99, 9820–9833.
Misztal, I., Tsuruta, S., Strabel, T., Auvray, B., Druet, T., and Lee, D. H. (2002). “BLUPF90 and related programs (BGF90), Commun. No. 28-07,” in Proceedings of the 7th World Congress on Genetics Applied to Livestock Production, Montpellier.
Mulder, H. A., and Bijma, P. (2007). Effects of genotype x environment interaction on genetic gain in breeding programs. J. Anim. Sci. 83, 49–61. doi: 10.2527/2005.83149x
NRC (2001). Nutrient Requirements of Dairy Cattle Subcommittee on Dairy Cattle Nutrition. Committee on Animal Nutrition Board on Agriculture and Natural Resources National Research Council. Washington, DC: National Academy Press.
Pegolo, S., Mach, N., Ramayo-Caldas, Y., Schiavon, S., Bittante, G., and Cecchinato, A. (2018). Integration of GWAS, pathway and network analyses reveals novel mechanistic insights into the synthesis of milk proteins in dairy cows. Sci. Rep. 8:566. doi: 10.1038/s41598-017-18916-4
Rutten, M. J. M., Bovenhuis, H., Heck, J. M. L., and van Arendonk, J. A. M. (2011). Predicting bovine milk protein composition based on Fourier transform infrared spectra. J. Dairy Sci. 94, 5683–5690. doi: 10.3168/JDS.2011-4520
Rutten, M. J. M., Bovenhuis, H., and van Arendonk, J. A. M. (2010). The effect of the number of observations used for Fourier transform infrared model calibration for bovine milk fat composition on the estimated genetic parameters of the predicted data. J. Dairy Sci. 93, 4872–4882.
Sanchez, M. P., El Jabri, M., Minéry, S., Wolf, V., Beuvier, E., Laithier, C., et al. (2018). Genetic parameters for cheese-making properties and milk composition predicted from mid-infrared spectra in a large data set of Montbéliarde cows. J. Dairy Sci. 101, 10048–10061. doi: 10.3168/jds.2018-14878
Sargolzaei, M., Chesnais, J. P., and Schenkel, F. S. (2014). A new approach for efficient genotype imputation using information from relatives. BMC Genom. 15:478. doi: 10.1186/1471-2164-15-478
Schopen, G. C. B., Heck, J. M. L., Bovenhuis, H., Visker, M. H. P. W., van Valenberg, H. J. F., and van Arendonk, J. A. M. (2009). Genetic parameters for major milk proteins in Dutch Holstein-Friesians. J. Dairy Sci. 92, 1182–1191. doi: 10.3168/jds.2008-1281
Soyeurt, H., Dardenne, P., Dehareng, F., Lognay, G., Veselko, D., Marlier, M., et al. (2006a). Estimating fatty acid content in cow milk using mid-infrared spectrometry. J. Dairy Sci. 89, 3690–3695. doi: 10.3168/jds.S0022-0302(06)72409-2
Soyeurt, H., Dardenne, P., Gillon, A., Croquet, C., Vanderick, S., Mayeres, P., et al. (2006b). variation in fatty acid contents of milk and milk fat within and across breeds. J. Dairy Sci. 89, 4858–4865. doi: 10.3168/JDS.S0022-0302(06)72534-6
Soyeurt, H., Dehareng, F., Gengler, N., McParland, S., Wall, E., Berry, D. P., et al. (2011). Mid-infrared prediction of bovine milk fatty acids across multiple breeds, production systems, and countries. J. Dairy Sci. 94, 1657–1667. doi: 10.3168/jds.2010-3408
Keywords: high-throughput phenotyping, Fourier-transformed infrared spectroscopy, genetic parameters, genomic predictions, dairy cattle, single-step GBLUP
Citation: Cecchinato A, Toledo-Alvarado H, Pegolo S, Rossoni A, Santus E, Maltecca C, Bittante G and Tiezzi F (2020) Integration of Wet-Lab Measures, Milk Infrared Spectra, and Genomics to Improve Difficult-to-Measure Traits in Dairy Cattle Populations. Front. Genet. 11:563393. doi: 10.3389/fgene.2020.563393
Received: 18 May 2020; Accepted: 31 August 2020;
Published: 29 September 2020.
Edited by:
Fabyano Fonseca Silva, Universidade Federal de Viçosa, BrazilReviewed by:
Rodrigo Reis Mota, University of Liège, BelgiumKathryn Tiplady, Livestock Improvement Corporation, New Zealand
Tesfaye Kebede Belay, Norwegian University of Life Sciences, Norway
Copyright © 2020 Cecchinato, Toledo-Alvarado, Pegolo, Rossoni, Santus, Maltecca, Bittante and Tiezzi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Alessio Cecchinato, YWxlc3Npby5jZWNjaGluYXRvQHVuaXBkLml0