- 1Institut de Recerca i Tecnologia Agroalimentàries (IRTA), Mas Bové, Ctra. Reus-El Morell Km 3, Constantí Tarragona, Spain
- 2Institut de Recerca i Tecnologia Agroalimentàries (IRTA), Centre de Recerca en Agrigenòmica (CRAG), CSIC-IRTA-UAB-UB, Cerdanyola del Vallès (Bellaterra), Barcelona, Spain
- 3Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Cerdanyola del Vallès (Bellaterra), Barcelona, Spain
- 4Grupo de Investigación Calidad y Seguridad Alimentaria, Centro de Investigación e Innovación Agroalimentaria y Agroambiental (CIAGRO-UMH), Universidad Miguel Hernández, Carretera de Beniel, Alicante, Spain
- 5Institut de Recerca i Tecnologia Agroalimentàries (IRTA), Fruitcentre, PCiTAL, Lleida, Spain
Almond breeding is increasingly focusing on kernel quality. However, unlike other agronomic traits, the genetic basis of physical and chemical kernel quality traits has been poorly investigated. To address this gap, we conducted a QTL mapping of these traits to enhance our understanding of their genetic control. We phenotyped fruit samples from an F1 population derived from the cross between ‘Marcona’ and ‘Marinada’ for up to four years, using conventional and image analysis methods. Additionally, the 91 individuals of the population were genotyped with the almond Axiom™ 60K SNP array, and high-density linkage maps were constructed. These analyses identified several genomic regions of breeding interest. For example, two regions on chromosome one were found to contain QTLs for kernel shape and dimension, while another region at the end of the same chromosome contained QTLs for kernel fatty acid composition. Notably, QTLs for kernel symmetry and kernel shoulder, reported for the first time in this study, were also mapped on chromosome one. These QTLs will serve as a foundation for developing molecular markers linked to kernel physical and chemical quality traits in almonds, facilitating the integration of marker-assisted selection into breeding programs.
1 Introduction
Almond [Prunus dulcis (Miller) D.A. Webb, syn. P. amygdalus (L.) Batsch] is the most important tree nut species worldwide, with a production steadily increasing over the last 15 years. In 2022, global in-shell almond production reached nearly 4 million tons, yielding 1.6 M tons of kernel. The U.S. accounted for the majority of the almond production (79%), followed by Australia (8%) and Spain (6%) (FAOSTAT, 2024). The crop exhibits remarkable adaptability to diverse climates and irrigation regimes (Alonso et al., 2017; Martínez-Gómez et al., 2017), ranging from the fully irrigation practiced in California to the traditional dryland farming in some Mediterranean and Asian countries (Gradziel et al., 2017). This wide adaptability highlights almond as a promising crop for addressing the challenges of climate change in various regions, where less resilient crops may prove unsuitable for cultivation.
Almond have a wide range of uses, including raw consumption, snacks, desserts, marzipans, cookies, ice creams, etc. Each application has specific quality requirements, and different almond varieties are better suited to particular uses. Thus, several kernel traits like shape, dimension, fatty acids, lipid content, vitamins, phytosterol content, minerals, proteins, carbohydrates, or fiber were described at cultivar levels to assess the quality, and attractiveness, and study their best industrial performance (reviewed in Yada et al., 2011; Romero, 2014; Kodad, 2017; Flankin and Mitchell, 2019). Differences between almond in physical traits such as kernel dimensions, shape, surface color, or ease of skin removal are well-characterized and serve as unique features for usage and marketing. Similarly, variation in nutritional composition among cultivars highlights the influence of their genetic makeup and its interaction with factors such as geographical origin, climatic environment, and growing conditions (Beltrán et al., 2021; Rabadán et al., 2018; Lipan et al., 2019).
Despite extensive research on the physico-chemical traits of almonds, information on their genetic basis remains limited. The first marker-based genetic studies focused on agronomic traits, such as self-compatibility (Ballester et al., 1998), flowering time (Ballester et al., 2001), productivity and ripening date (Sánchez-Pérez et al., 2007), using low-density linkage maps that identified several major genes and quantitative trait loci (QTLs). Subsequent genetics studies on kernel quality traits initially targeted monogenic features, such as the amygdalin content related to kernel bitterness/sweetness (Sánchez-Pérez et al., 2007, 2010). Later, genomic regions associated with the content of chemical compounds like tocopherol homologues, fatty acids, protein, and oil were mapped in a ‘Vivot’ × ‘Blanquerna’ almond population (Font i Forcada et al., 2012; Fernández i Martí et al., 2013). Additionally, association analyses using a wide almond germplasm collection investigated the genetic control of kernel dimensions (width, thickness, and length) and other chemical traits (Font i Forcada et al., 2015a and 2015b). In recent years, new genomic tools have been developed, and three different almond genomes have been released (Sánchez-Pérez et al., 2019; Alioto et al., 2020; D’Amico-Willman et al., 2022; Castanera et al., 2024) along with a 60K SNP array for high-throughput genotyping (Duval et al., 2023). These advances have enabled the development of highly saturated linkage maps used to search for genetic associations with traits like shell hardness (Goonetilleke et al., 2018), the volatilome of roasted kernels (Di Guardo et al., 2021), and other phenological and nut quality traits (Paizila et al., 2021; Goonetilleke et al., 2023; Sideli et al., 2023; Pérez de los Cobos et al., 2023; Sideli et al., 2024). As a result of these efforts, four major genes: self-compatibility, late-blooming, sweet kernel, and shell hardness are currently being selected with molecular markers in different breeding programs (Goonetilleke et al., 2018; Sideli et al., 2023). However, despite the development of a marker for kernel bitterness (Sánchez-Pérez et al., 2010) along with KASP markers (Ricciardi et al., 2018; Lotti et al., 2023), no markers of breeding interest are available for other physicochemical almond kernel traits.
In this study, the genetic inheritance of kernel quality traits such as kernel weight, shape-related traits, color, and chemical composition was investigated in an F1 population from the ‘Marcona’ × ‘Marinada’ cross. This population was phenotyped for four years for physical traits and one year for kernel chemical composition. Highly saturated linkage maps were developed using the genotypes obtained with the almond 60K SNP array (Duval et al., 2023) and QTL mapping was carried out.
2 Materials and methods
2.1 Plant material
An F1 population of 91 individuals derived from the cross between ‘Marcona’ and ‘Marinada’ (MC × MI) was used in this study. ‘Marcona’ is a highly valued traditional Spanish cultivar, known for its characteristic rounded kernel shape and high level of fatty acids (Martín Carratalá et al., 1998; Calle et al., 2024). ‘Marinada’ (‘Lauranne’ × ‘Glorieta’) is a self-compatible breeding cultivar released by IRTA in 2008, noted for its very sweet kernel and high percentage of soluble sugars (Vargas et al., 2008; Romero et al., 2011). Seedlings were grafted onto ‘Garnem®’ rootstock and planted at 4 × 1.8 m in 2015. The MC × MI population and its parents are maintained at the IRTA Mas Bové experimental station (41.170723 N, 1.172942 E) under standard agricultural practices.
2.2 Phenotypic data collection
The MC × MI population and both parents were evaluated for several physical and chemical traits. The physical traits included kernel weight, crack-out percentage, kernel size (length, width, and thickness), kernel shape (roundness, globosity, shoulder, and symmetry), and tegument and kernel color (L*, a*, b*). The kernel chemical traits assessed were kernel protein, fiber and fat content, and fatty acids profile (myristic, palmitic, palmitoleic, margaric, cis-10-heptadecenoic, stearic, oleic, vaccenic, linoelaidic, linoleic, arachidic, cis-11-eicosenoic, and cis-11,14-eicosadienoic).
From 2018 to 2021, fifty mature fruits were randomly collected from each individual of the F1 MC × MI population and its parents. Fruits were considered mature when the mesocarp was fully dry, split along the fruit suture, and the peduncle was near complete abscission. After dehulling, nut weight was measured, shells were cracked, and kernel weight (KWe) was recorded using an electronic scale. Crack-out percentage, calculated between 2019 and 2021, was determined as the ratio of kernel and nut weights. After that, kernel length (KLen), width (KWidth), and thickness (KThick) was measured with a digital Vernier caliper. Kernel roundness (KRound) and globosity (KGlob) were estimated using the ratios width/length and width/thickness, respectively. Kernel size and shape traits were also assessed using image analysis. For that, a standard photo of six kernels was taken per individual (Figure 1), and images were analyzed using the Shape Analyzer (SA) software (Jurado, 2024). This deep learning-based tool automatically detects almond kernels from images and measures them. The parameters obtained included kernel length (KLen_SA), width (KWidth_SA), roundness (KRound_SA), symmetry based on ‘structural similarity index measure’ (SSIM) (Sym_SSIM), and Jaccard index (Sym_Jacc). Both indices determine kernel symmetry by comparing the similarity between the two sides of the kernel along the longitudinal axes using digital images (Figure 1B). For that, the box containing the kernel (Figure 1B) was divided in half and right-flipped to compare the similarity between both sides using the SSIM and Jaccard coefficient metrics, with a value of 1 indicating perfect symmetry. These traits were phenotyped in 2020 and 2021. Additionally, in 2020, the kernel shoulder, referring to anomalies in the basal area of the kernels (Figure 1), was visually assessed using a scale from one (no shoulder) to five (marked basal anomaly as observed in Figure 1C; KShoul_Vis), and measured using the Tomato analyzer software (Gonzalo et al., 2009) to determine the angle of the kernel shoulder (KShoul_Angle) as shown in Figure 1. Tegument (the kernel skin) and kernel color were determined with a Minolta Chroma Meter tri-stimulus color analyzer (CR-3500D; Minolta, Ramsey, NJ, USA) calibrated to a white porcelain reference plate using a CIELAB scale with color space coordinates L*, a* and b* (Tcolor_L, Tcolor_a, Tcolor_b, Kcolor_L, Kcolor_a, Kcolor_b). Tegument and kernel color were measured from 2018 to 2021.
Figure 1. (A) Original and (B) processed images for ‘Marcona’ shape attributes using the Shape analyzer software. (C) Kernel images of three different individuals from the ‘MC × MI’ population where segregation of kernel shape and the shoulder trait (arrow) are shown.
Fat content was analyzed by Soxhlet method, using 5 – 6 g of ground blanched almonds and petroleum ether (boiling point 40 to 60 °C) for 7 h in Soxhlet apparatus (Romero et al., 2021; Etheridge et al., 1998). The fat content was expressed as a percentage (%), calculated based on the weight of the extracted fat relative to the total weight of the sample. This trait was measured only in 2021.
Crude protein was analyzed from 2018 to 2021 by Dumas combustion procedure using Leco FP-528 analyzer as described in Romero et al. (2021). Briefly, 0.2 g of grounded sample was weighed in a porcelain sample holder (boat) for introduction into the combustion chamber (850 ± 1 °C) utilizing an automated sample loader. The combustion process converts covalently bound 130 nitrogen into nitrogen gas (N2) that is quantified by passing the gas through a conductivity cell and converted to protein by multiplying a factor of 6.25. The results were expressed as a percentage (%) of protein content.
Crude fiber was measured from 2019 to 2021 using 1 g of the grounded sample. The sample was treated with boiling 0.26 N sulfuric acid for 30 min, followed by boiling 0.23 N potassium hydroxide for another 30 min (Romero et al., 2021). The extracted residue was dried at 103 ± 1 °C for 3 h, weighed, then placed in a furnace (550 ± 1 °C for 3 h), and finally, the ashes were weighed. These results were expressed as a percentage (%) of ash content.
Fatty acids were analyzed by gas-chromatography with a flame ionization detector (GC-FID) using a capillary column (Romero et al., 2021). The fatty acid methyl esters (FAMEs) were prepared by trans-esterification with 0.5 M potassium hydroxide, following the official method UNE-EN ISO 5509:2000. FAMEs (1 mL) were separated using a gas-chromatograph (HP 6890; Agilent Technologies, Barcelona, Spain) equipped with an FID detector and a capillary column [30 m · 0.25 mm i.d. (HP-Innowax, Agilent Technologies)]. The carrier gas was helium, with a flow rate of 1 mL/min. The injector and detector temperatures were 220 and 275°C, respectively. The FAME identification was based on retention time relative to those of a standard FAME mixture (Sigma-Aldrich, Madrid, Spain). Fatty acids were measured only in 2021. Results were expressed in mg/g as average values of three replicates.
2.3 Data analyses
In this study, LSmean, which provides a way to obtain mean trait values that are adjusted for fixed effects, was considered to predict the effect of the year, assuming that the environmental conditions of every year affect all individuals in the population in the same way. Therefore, for traits with more than one year of data, the LSmean was calculated according to the following equation:
where is the phenotypic value of the i-th individual in the j-th year, is the intercept (overall mean phenotypic value), is the effect of the i-th individual, is the effect of the j-th year, and is the error term for the i-th individual in the j-th year. The coefficient of determination (R2) was used to measure how the LSmean fits the data and how well it can predict outcomes. R2 was determined using the following formula:
where SST is the sum of squares of the residual errors, and SST is the total sum of the errors. R2 ranges between 0 and 1, with values closer to 1 indicating better LSmean prediction.
Pairwise correlation coefficients were calculated for all traits (single-year data and LSmean data) using the JMP® software (JMP®, Version 16, SAS Institute Inc.) Significance was calculated using the Spearman correlation coefficient significant (p<0.001). Normal distribution was assessed using the Shapiro-Wilk test (p<0.05) using JMP®.
2.4 Genotyping, SNP filtering, and linkage map construction
Total genomic DNA from the 91 individuals and two parents was isolated from young leaves using the CTAB method (Doyle and Doyle, 1990), adapted for 96-well plates. The DNA samples were genotyped with 33 SSRs and the almond Axiom™ 60K SNP array (Duval et al., 2023).
For the SSRs, a set of markers that were heterozygous in ‘Marcona’ or in ‘Marinada’ was selected. In genomic regions not well covered with the initial set of SSRs, new SSRs were designed using the almond reference genome (Alioto et al., 2020). Primers were designed using Primer 3 (http://primer3.ut.ee, v4.1.0) with default parameters. The list of SSR markers used is presented in Supplementary Table 1. PCR reactions were conducted in a final volume of 10 μL containing 200 ng of genomic DNA, 1x NH4 reaction buffer, 1.5 mM MgCl2, 0.2 mM dNTPs (10mM), 0.2 μM of each marker, 1 U of BIOTaq (Bioline, London, UK) and HPLC H2O to reach the final volume. PCRs were performed in a GeneAmp PCR System 9700 thermal cycler (Applied Biosystems, CA, USA) with the following conditions: initial denaturation at 94°C for 1 min, 35 cycles of denaturation at 94°C for 15 s, primer annealing at a specific temperature for each primer for 15 s, extension at 72°C for 30 s, and a final extension at 72°C for 5 min. Forward primers were designed with a generic fluorochrome sequence at the 5’ ends (FAM, VIC, NED, or PET). PCR products were added to 12 μL of deionized formamide containing 0.35 μL of GeneScan500 LIZ size standard (Applied Biosystems, CA, USA) and heated at 94°C for 3 min. Capillary electrophoresis was performed in an ABI Prism 3130xl automated sequencer (Applied Biosystems, CA, USA). GeneMapper v5.0 software (Applied Biosystems) was used for SSR allele sizing.
The almond Axiom™ 60K SNP array genotyping was performed on an Axiom GeneTitan™ (ThermoFisher Scientific) system at the INRAE Gentyane platform in Clermont-Ferrand (France). Genotypic data was retrieved using the Axiom Analysis Suite (ThermoFisher, 2017). Samples were filtered following the Axiom best practices workflow, setting an average call rate > 95. We then filtered out SNPs with the following characteristics: (i) monomorphic SNPs in the progeny; (ii) heterozygous SNPs in ‘Marcona’ and ‘Marinada’ but with only two genotypic classes in the progeny; (iii) homozygous SNPs in ‘Marcona’ and heterozygous in ‘Marinada’ with three genotypic classes in the progeny; (iv) homozygous SNPs in ‘Marinada’ and heterozygous in ‘Marcona’, but with three genotypic classes in the progeny. After filtering based on segregation, missing data, and putative genotyping error identification and imputation were performed with AlphaFamImputed software (Whalen et al., 2020) using default settings. We then ordered the SNPs based on their physical position in the ‘Texas’ almond genome v2.0 (Alioto et al., 2020) and phased them manually. Finally, a set of bins (i.e. groups of SNPs with identical genotypes for all the individuals) was established, with each bin separated from the adjacent bin by at least one recombination event.
Finally, three linkage maps were built using JoinMap 5® (van Ooijen, 2018). The MC × MI map was constructed using all the previously selected bins, the ‘Marcona’ map was built using only bins heterozygous in ‘Marcona’, and the ‘Marinada’ map was built using only bins heterozygous in ‘Marinada’. For map construction, a minimum logarithm of odds (LOD) score of 10 was selected for SNP grouping. Makers showing segregation distortion higher than 0.01 were excluded from linkage mapping unless other distorted markers with similar ratios surrounded them. The maximum likelihood algorithm with default parameters and the Kosambi mapping function were used (Kosambi, 1944).
2.5 QTL mapping
LSmean and single-year data were analyzed for QTL detection in the three linkage maps (‘Marcona’, ‘Marinada’, and MC × MI) using MapQTL 6.0® (van Ooijen, 2009). Interval mapping (Lander and Botstein, 1989) and multiple QTL mapping (Jansen and Stam, 1994) strategies were used for QTL discovery. To determine the significance threshold for each QTL, LOD was calculated for each linkage group (LG) and trait using the permutation test (1000 permutations) at a 95% significance level (p < 0.05) (Lander and Botstein, 1989; van Ooijen, 1992). As most traits had a significance level similar to 3, a final threshold of LOD 3 was used in all cases. The QTL confidence interval was defined as the LOD - 1. For traits with non-normally distributed data, we used the Kruskal-Wallis non-parametric test in MapQTL, retaining only QTLs that were significant in this test (p< 0.01) and also significant with interval mapping (LOD > 3.0). QTLs were named according to the recommendations for standard QTL nomenclature and reporting of the Genome Database for Rosaceae (Jung et al., 2019). Graphical representation of linkage maps and QTLs was created using MapChart (Voorrips, 2002). The physical positions of the QTLs are based on the almond reference genome of ‘Texas’ v3.0-F1 (Castanera et al., 2024).
3 Results
3.1 Trait distributions and correlations
We obtained phenotypic data for 22 traits, including both kernel physical and chemical characteristics. The physical traits measured were kernel weight, crack-out percentage (crack-out), kernel size (length, width, and thickness), kernel shape (roundness, globosity, shoulder, and symmetry), and tegument and kernel color (L*, a*, b*). The kernel chemical traits included kernel protein, fiber, and fat content. For the fatty acid profile, we measured myristic, palmitic, palmitoleic, margaric, cis-10-heptadecenoic, stearic, oleic, vaccenic, linoelaidic, linoleic, arachidic, cis-11-eicosenoic, and cis-11,14-eicosadienoic acids.
3.1.1 Trait distributions
‘Marcona’ and ‘Marinada’ showed intermediate values for most traits, however ‘Marcona’ displayed high kernel width and protein content compared to the population mean (Figure 2; Supplementary Table 2). In general, ‘Marcona’ had a larger kernel weight, a rounder kernel (wider but with a similar length), a lower crack-out percentage, and a higher protein content compared to ‘Marinada’.
Figure 2. Frequency distribution of phenotyped traits. For traits with more than one year of data, LSmean values are used. X axis represents phenotypic values, and Y axis represents frequency. Grey and black arrows indicate phenotypic values for ‘Marcona’ and ‘Marinada’, respectively.
Concerning fruit dimensions, individuals of the ‘Marcona’ × ‘Marinada’ (MC × MI) population exhibited weights ranging from 0.89 to 1.61 g, lengths from 16.77 to 24.87 mm, widths from 12.17 to 17.95 mm, and thicknesses from 7.21 to 11.56 mm (Figure 2; Supplementary Table 2). In terms of roundness (width/length ratio) and globosity (width/thickness ratio), the trait distributions spanned from 0.56 to 0.84 mm and 1.31 to 2.27 mm, respectively, with a larger number of individuals showing elongated shapes compared to rounded shapes (Figure 2). Similarly, there was an evident segregation for the shoulder trait, with a higher frequency of fruits with less pronounced shoulders. For symmetry, measured using Jaccard and SSIM indexes, a clear trend towards fruits with high symmetry was observed. For crack-out percentage, a distribution biased towards fruits with hard shell (<40%) was observed.
Regarding the chemical composition of the kernel, protein content varied between 18.47 and 27.72%, fiber content ranged from 2.80 to 7.56%, and fat content ranged from 46.62 to 58.83% (Figure 2). In the individuals of the population, a distribution towards higher levels of protein and fat content was observed. However, the distribution of fiber content exhibited a slightly different pattern towards lower amounts. The predominant fatty acid detected was oleic (ranging from 52.90 to 160.85 mg/g), followed by linolenic (5.33-17.12 mg/g), palmitic (6.78-12.84 mg/g), stearic (1.03-3.45 mg/g), and vaccenic (1.83-3.15 mg/g). Other identified acids, such as myristic, palmiloleic, margaric, heptadeceonic, linoelaidic, arachidic, eicosenoic, and eicosadienoic, presented concentrations lower than 1.5 mg/g (Figure 2).
Regarding the color of the tegument, a distribution towards nuts with darker pigmentation was noted. In contrast, for nuts without the tegument, minimal differences were observed among the samples.
The normality of each trait was evaluated using the Shapiro-Wilk test (Supplementary Table 2). Among the kernel dimension traits, all followed a normal distribution except for roundness, symmetry, and kernel shoulder (p< 0.01). Kernel weight also displayed a normal distribution (W = 0.987, p = 0.514). For tegument and kernel color traits, only Tegument Color L* significantly deviated from normality. In terms of chemical composition, fat content followed a normal distribution, in contrast to protein and fiber content (Supplementary Table 2). Among the fatty acids, only palmitoleic, stearic, and vaccenic acids exhibited a normal distribution pattern.
3.1.2 Trait correlations
The coefficient of determination (R2) was calculated for all traits phenotyped over multiple years to assess the predictive accuracy of LSmean values (Supplementary Table 3). Overall, a high prediction was observed for all traits. The lowest prediction values were observed for fiber content (R2 = 0.45) and tegument color L* (0.47) (Supplementary Table 3). Conversely, all the other traits presented R2 values higher than 0.64 with the highest values recorded for roundness-SA and symmetry-Jaccard-SA (0.91).
Significant correlations between years were obtained for all traits, except for tegument and kernel color parameters, which were only significant between 2020 and 2021 (Supplementary Table 4). Moreover, for all the other traits, higher correlations were observed between 2020 and 2021 compared to those between 2018 and 2019. The highest inter-annual correlations were observed for crack-out percentage (0.89) and kernel roundness (0.88) between 2020 and 2021, while the lowest correlation was for kernel thickness (0.25) between 2018 and 2019.
Several sets of correlated traits were identified in the population (Supplementary Table 4). One set focused on traits associated with kernel dimensions. Considering only comparisons between LSmeans of all traits, kernel weight demonstrated intermediate to high correlations with both kernel length and width, using measurements from both manual assessments (0.52 and 0.60, respectively) and outputs from the Shape Analyzer (SA) software (0.62 and 0.67, respectively). Conversely, a low correlation of 0.27 was reported between kernel weight and thickness. Regarding kernel length, a moderate positive correlation with weight (0.52) was observed, along with a negative correlation with roundness (-0.67). Interestingly, no significant correlation between kernel length and width was noted, in contrast to the positive correlation between width and roundness (0.60) and roundness-SA (0.52). Kernel width also exhibited a moderate correlation with thickness (0.48), which was correlated with roundness (0.48) and kernel shoulder (0.53 and 0.46 for angle and visual, respectively). As expected, high correlations were observed for kernel length (0.82; KLength vs. KLenght_SA), width (0.83; KWidth vs. KWidth_SA), and roundness (0.67; KRound vs. KRound_SA) when comparing manual measures to outputs from the Shape Analyzer software (Supplementary Table 4).
A different set of interrelated traits was documented for chemical compounds (Supplementary Table 4). Notably, a negative correlation was identified between fat and protein content (-0.61), contrasting with the consistently high positive correlations among all analyzed fatty acids (ranging from 0.33 to 0.92). There was a low and non-significant correlation observed between fat, fiber, and protein content, and the fatty acids (Supplementary Table 4). Additionally, negative correlations between certain chemical and physical parameters were noted, such as crack-out percentage and protein content (-0.34) and fat content and globosity (-0.47).
High correlations were also observed between color parameters like tegument color a* and tegument color L* (-0.44), and tegument color a* and tegument color b* (0.55) (Supplementary Table 4).
3.2 Genotyping and linkage map construction
A total of 9243 SNPs, representing 15.3% of the almond Axiom™ 60K SNP array, were obtained from the SNP filtering procedures and used for linkage mapping along with 33 simple-sequence repeat (SSR) markers. Of these SNPs, 3810 were heterozygous in ‘Marcona’ and homozygous in ‘Marinada’, 4060 were heterozygous in ‘Marinada’ and homozygous in ‘Marcona’, and the remaining 1373 SNPs were heterozygous in both parents. Additionally, 33 SSR markers were included.
For all constructed linkage maps (‘Marcona’, ‘Marinada’, and ‘MC × MI’), markers were grouped into eight LGs, each corresponding to an almond chromosome (Supplementary Table 5). The ‘Marcona’ map covered 448.0 cM and presented 370 bins (i.e., groups of markers in unique genetic positions) (Table 1). Similarly, the ‘Marinada’ map had 316 bins distributed along 447.0 cM. The ‘MC × MI’ showed higher marker saturation than parental maps, with 685 bins covering 540.8 cM (Table 1). In the three linkage maps, LG1 covered the largest genetic distance and included the highest number of markers. The average distance between consecutive bins was less than 1 cM in the ‘MC × MI’ map, whereas for the parental maps, this distance ranged from 1.15 (‘Marcona’; LG1) to 1.61 cM (‘Marinada’; LG7) (Table 1). Additionally, the linkage maps showed good genome coverage without major gaps: the average physical distance between consecutive markers was 196, 555, and 612 Kbp in the ‘MC × MI’, ‘Marcona’, and ‘Marinada’ maps, respectively.
Table 1. Summary statistics for the ‘Marcona’, ‘Marinada’ and ‘Marcona’ × ‘Marinada’ (MC × MI) genetic maps.
3.3 QTL mapping
QTL analyses were performed using single-year data for all traits, and for those traits with more than one year of data, LSmean values were also used. All QTLs are presented in Supplementary Table 6. In most cases, LSmean data identified the most consistent QTLs, which were detected in at least two different years through single-year QTL mapping. To simplify data presentation, we present QTLs using LSmean and those for which only one year of phenotypic data was available (Table 2).
Table 2. QTL mapping results using single year phenotypic data or, for traits phenotyped for more than one year, least square (LS) estimates.
In total, 51, 18, and 24 QTLs were detected using the MC × MI, ‘Marcona’, and ‘Marinada’ genetic maps, respectively. After excluding QTLs that cosegregate across different maps, a total of 53 unique QTLs were identified (Table 2). Most QTLs detected with the parental maps were also identified with the MC × MI map, except qP-KWe3.1 (only found on ‘Marcona’ map) and qP-KWidth4.1 (only found on ‘Marinada’ map). Conversely, 20 QTLs (qP-KWidth7.1, qP-KRound6.1, qP-KGlob2.1, qP-KGlob3.1, qP-KLen_SA6.1, qP-KWidth_SA1.1, qP-KWidth_SA2.1, qP-KWidth_SA4.1, qP-KWidth_SA7.1, qP-KRound_SA6.1, qP-Sym_SSIM8.1, qP-Sym_Jacc8.1, qP-Protein2.1, qP-Protein3.1, qP-Protein5.1, qP-Protein7.1, qP-Fiber1.1, qP-Fat3.1, qP-Fat5.1, qP-VaccenicAc1.1) were identified on MC × MI map but not on the parental maps. All consistent QTLs identified with the MC × MI map are presented in Figure 3.
Figure 3. Genetic position (cM) on the ‘Marcona’ × ‘Marinada’ map of detected QTLs using LSmean and those for which only one year of data was available.
At least one QTL was identified for all traits except for the color parameters Tegument_L, Tegument_a, Tegument_b, Kernel_L, and linoelaidic acid. For kernel weight, two regions were detected on LGs 3 and 4 (Table 2), with the major QTL (qP-KWe4.1) found at the beginning of LG4, explaining 29.0% of the phenotypic variation (PV) in the MC × MI map and 26.1% in the ‘Marinada’ map. This QTL showed an additive effect of the ‘Marinada’ allele (+0.11 g) increasing kernel size in the population (Table 2). A less significant weight QTL, only detected in the ‘Marcona’ map, was also identified on LG3 (qP-We3.1; PV=15.4%). For crack-out percentage, a highly significant QTL (qP-Crack2.1) located on LG2 was detected in the three maps with a LOD of 17.2 and explaining 59.7% of the PV in the MC × MI map (Table 2; Figure 3).
For kernel size and shape traits, a major QTL was detected at the beginning of LG1 (11.4-19.2 cM) for kernel length (qP-KLen1.1) using the ‘Marinada’ and the MC × MI maps, explaining up to 32.1% of the PV. Similarly, another major QTL for kernel width (qP-KWidth1.1) was identified in the central part of the same LG1 (45.3-57.4 cM) in the ‘Marcona’ (LOD=3.8; PV=18.3%) and MC × MI (LOD=4.9; PV=22.9%) maps. Overlapping with qP-KLen1.1, other consistent QTLs were identified for kernel roundness (qP-KRound1.1; LOD=4.1 to 7.8; PV=19.6 to 33.9%) and kernel length_SA (qP-KLen_SA1.1; LOD=6.3 to 6.8; PV=28.5 to 31.2%). In the same region of qP-KWidth1.1, we identified other QTLs for kernel width SA (qP-KWidth_SA1.1), kernel symmetry SSIM (qP-Sym_SSIM1.1), kernel symmetry Jaccard (qP-Sym_Jacc1.1), and kernel shoulder (qP-KShoul_Vis1.1) (Table 2; Figure 3). For kernel thickness and kernel shoulder angle, QTLs (qP-KThick1.1 and qP-KShoul_Angle1.1) were identified in positions near qP-KLen1.1; however, the LOD plots do not clearly reveal whether these represent the same QTL or two distinct QTLs located in very close proximity (Figure 3). Other stable QTLs for kernel width (LOD = 3.3 to 3.9; PV = 15.8 to 18.7%) were also identified on LGs 4, 7 and 8 (qP-KWidth4.1, qP-KWidth7.1, and qP-KWidth8.1). Overlapping with qP-KWidth4.1, a consistent QTL for kernel thickness (qP-KThick4.1) was also identified in ‘Marinada’ (LOD=4.2; PV=20.0%) and MC × MI (LOD=5.3; PV=24.6%) maps (Table 2; Figure 3). Additional regions with kernel globosity QTLs were reported on LGs 2, 3, and 4, each showing similar LOD scores (3.8-5.0) and PV percentages (18.1-23.3%). Comparison of detected QTLs for kernel length, width, and roundness using manual determinations and Shape Analyzer (SA) software measurements revealed QTLs in similar positions, with only a few QTLs detected exclusively by one method or the other (qP-KRound1.1, qP-Round_SA2.1, qP-Width_SA2.1, qP-KWidth_SA4.1, qP-KLen_SA6.1).
Regarding the kernel chemical composition, four QTLs for protein content were mapped on LGs 2, 3, 5, and 7 on the MC × MI map (Table 2; Figure 3). Among them, qP-Protein2.1 showed the highest LOD scores (4.5) and explained the largest proportion of PV (21.2%). The other three additional protein content QTLs (qP-Protein3.1, qP-Protein5.1, and qP-Protein7.1) were detected with lower significance, having LOD scores below 4 and explaining no more than 19% of the phenotypic variance (Table 2). For fat content, two QTLs were found on LGs 3 and 5 of the MC × MI map, both with the same significance (LOD=3.9) and explaining similar PV (19%) (Table 2). Notably, the protein and fat content QTLs on LG5 (qP-Protein5.1 and qP-Fat5.1) overlapped in the same region (Figure 3). For fiber content, a unique QTL mapped on LG1 (14.8-23.3 cM) of the MC × MI map was detected, associated with a LOD score of 3.4 and explained 16.2% of PV.
For the fatty acid content, QTLs were detected for all compounds except linoelaidic acid, on MC × MI map within the same region at the bottom of LG1 (70-85 cM). The QTL with the highest LOD score and PV explained for these acids was reported for margaric acid (LOD=8.2; PV=37.9%), whereas the mystiric acid QTL presented the lowest significance (LOD=3.8; PV=19.7%). On the ‘Marinada’ map, all the same QTLs for fatty acids reported in the MC × MI map were mapped to the same position of LG1. The ‘Marinada’ allele for these QTLs was associated with reductions in the mean values of all acids within the population (Table 2). On the ‘Marcona’ map, only QTLs for palmitic acid, margaric acid and oleic acid were detected, all found on the same LG1 interval as for MC × MI and ‘Marinada’ maps, with PV not exceeding the 20% (Table 2). For these QTLs, the ‘Marcona’ allele was associated with increasing in fatty acid content. This effect was notable on oleic acid, the main fatty acid in almonds, where the ‘Marcona’ allele resulted in an increase of 22 mg per gram of sample.
Finally, two QTLs (qP-KColor_a1.1 and qP-KColor_b2.1) associated with kernel color were mapped on the MC × MI and ‘Marinada’ maps (Table 2; Figure 3). qP-KColor_a1.1, mapped on LG1, explained 25.3% of the PV in MC × MI map, whereas qP-KColor_b2.1, found on LG2 was associated with PV of 27.1% in MC × MI (Table 2).
4 Discussion
The high number of seedlings produced and assessed each year in almond breeding programs highlights the importance of new genomic and phenotypic technologies to enhance breeding efficiency (Font i Forcada et al., 2017). This study examined the genetic basis of several almond quality traits related to kernel physical and chemical characteristics. This research was conducted on a population derived from the cross between the almond cultivars ‘Marcona’ and ‘Marinada’, aiming to improve phenotypic screenings and to provide genetic information that could serve as a foundation for developing DNA markers of breeding interest related to these traits.
Several QTL analyses have been reported in almond using biparental populations, most of them employing a relatively low number of markers (Ballester et al., 1998, 2001; Sánchez-Pérez et al., 2007; Font i Forcada et al., 2012; Fernández i Martí et al., 2013; Goonetilleke et al., 2018; Paizila et al., 2021; Goonetilleke et al., 2023). In this study, we used the recently developed almond Axiom™ 60K SNP array (Duval et al., 2023), which allows the development of highly saturated maps and genome-wide association analyses (Pérez de los Cobos et al., 2023). The marker saturation observed in the maps generated here greatly improved upon that reported in previous linkage maps (Sánchez-Pérez et al., 2007; Font i Forcada et al., 2012; Fernández i Martí et al., 2013; Goonetilleke et al., 2018), demonstrating that the almond 60K SNP array is an excellent tool for genetic analyses in almond. The high-quality linkage map, along with the QTLs reported in this study, will enable the implementation of efficient marker-assisted selection strategies aimed at improving kernel quality traits, which are key almond breeding objectives (Batlle et al., 2017).
High-throughput phenotyping is crucial to increase the efficiency of breeding programs. To this end, standard phenotyping protocols for almond shape traits were compared with image analysis using the newly developed artificial intelligence-based Shape Analyzer software (Jurado, 2024). The strong correlation observed between traits assessed through conventional and image analysis methods for kernel length, width, roundness, and symmetry, along with the discovery of QTLs in identical positions, underscores the reliability of these phenotyping protocols for their routine implementation in breeding programs.
4.1 Trait correlations
Most traits phenotyped over multiple years showed significant correlations across those years. This suggests that the phenotypic data obtained is of high quality and that most traits exhibit high heritability. These findings are consistent with other studies on traits like kernel shape (Martínez-Garcia et al., 2019), which facilitates inheritance studies and supports the implementation of marker-based selection approaches.
Identifying correlations between traits can provide valuable insights for more efficient breeding and selection. We found several interesting correlations between physical and chemical traits: for example, kernel weight is correlated with kernel length and width, but these two are not correlated with each other. Additionally, kernel roundness is correlated with kernel length and width, but not with kernel weight. Kernel thickness is correlated with kernel width and shape but not with kernel length, which opens the possibility of discovering kernel shapes different than those of the parents used. Our results indicate that kernel weight is more related to the size than the shape of the kernels, as reported in recent studies (Lipan et al., 2022). High correlations between morphological traits were also found previously in progenies derived from ‘Blanquerna’ × ‘Vivot’ (Fernández i Martí et al., 2013) and ‘Nonpareil’ × ‘Lauranne’ (Goonetilleke et al., 2023). Crack-out percentage and globosity are inversely correlated, and a QTL for both traits has been identified in the same region of LG2. This correlation between these two traits has not been previously reported, and further results are needed to confirm whether this is a common phenomenon in almond or specific to this population. For the chemical kernel composition, it is noteworthy that there is no correlation between total fat content and the composition of fatty acids, but there is a high correlation between most of the different fatty acids, as previously observed (Font i Forcada et al., 2012). Additionally, there is a notable negative correlation between fat and protein content. The highest correlation between physical and chemical traits was observed for the fat content, which was inversely correlated with globosity (-0.45), and for protein content, which is inversely correlated with crack-out percentage (-0.39). These correlations are particularly significant, given that the only detected QTL for these traits was mapped to the same region of chromosome 2. Other interesting correlations of chemical traits have been explored, such as the relationship between oleic acid content and shelf life recently pointed out in rapeseed (Spasibionek et al., 2020) and almond (Sideli et al., 2024).
4.2 Genetic analysis of kernel traits
Several QTLs associated with kernel traits were identified by our study, some of which co-localize QTLs previously identified by other authors. For instance, the crack-out percentage QTL (qP-Crack2.1), located on chromosome 2, has been reported by previous studies for crack-out and shell hardness (Goonetilleke et al., 2018; Pavan et al., 2021; Sánchez-Pérez et al., 2007; Sideli et al., 2023; Pérez de los Cobos et al., 2023). Both ‘Marcona’ and ‘Marinada’ exhibited hard shells, however wide variation was found in their F1 progeny. This suggests the dominance of hard shells and the presence of alleles in heterozygosity in the parental cultivars. The additive effect for crack-out percentage was 6.2% in ‘Marinada’ and 7.3% in ‘Marcona’. Meanwhile, in the MC × MI map, the difference between the highest and lowest mean values was 15.7%, closely matching the combined effect of both parents and supporting this hypothesis. Similarly, for kernel weight, a major QTL was identified at the beginning of LG4, the same chromosome where a previous QTL for this trait was identified in another F1 population derived from the cross ‘R1000’ × ‘Desmayo Largueta’ (Sánchez-Pérez et al., 2007) but at a slightly different genomic position (approx. 3 Mbp apart). QTLs for kernel weight have been identified in various genomic regions across studies (Fernández i Martí et al., 2013; Goonetilleke et al., 2023; Pérez de los Cobos et al., 2023), suggesting that kernel weight is a quantitative trait controlled by several genes in the almond germplasm. This was corroborated in our study, where in addition to the main kernel weight QTL (qP-KWe4.1), an additional QTL, detected only in the ‘Marcona’ map, indicated the presence of multiple alleles associated with weight variation in these cultivars. Additionally, a QTL for kernel width (qP-KWidth_SA4.1) was also identified, overlapping with qP-We4.1, suggesting that width may influence the observed differences in fruit weight. These interactions and the numerous QTLs reported for this trait highlight the challenges in implementing effective breeding strategies for weight gain in breeding programs.
QTLs for various kernel morphology traits, including length, thickness, width, roundness, symmetry, and shoulder angle were identified at various genomic locations. Notably, major QTLs for kernel length and width were found 25 Mbps apart on chromosome 1, suggesting that different genetic mechanisms control these traits. qP-KLen1.1, associated with kernel length, had previously been mapped in a panel of 98 almond cultivars (Font i Forcada et al., 2015a) and in an F1 population (‘Vivot’ × ‘Blanquerna’) (Fernández i Martí et al., 2013). Additionally, Goonetilleke et al. (2023) reported two overlapping QTLs for kernel width and length on LG1 of the ‘Lauranne’ map. However, this region is situated between the kernel length and width QTLs identified in the MC × MI population, suggesting that qP-KWidth1.1 is a new QTL associated with kernel width found in almonds. For roundness and symmetry, QTLs for the same traits were recently reported in the same genetic region of chromosome 1 (Goonetilleke et al., 2023). A QTL for kernel thickness was also identified on LG4 (qP-KThick4.1). Finally, another QTL for roundness was mapped on LG7 in a different position where a QTL for sphericity was reported on the same LG in a ‘Nonpareil’ × ‘Lauranne’ population (Goonetilleke et al., 2023).
In this population, we studied the segregation of the shoulder character, which is being analyzed for the first time in this study. We observed a strong correlation between kernel shoulder, width, and thickness, with a negative correlation with kernel length. Similar to the pattern seen with shape QTLs, QTLs associated with kernel shoulder exhibit a broader confidence interval compared to those for length or width, and similar to shape–related traits. This suggests that the QTL for kernel shoulder is likely influenced by both kernel length and width QTLs. To further validate QTLs for kernel length and width, it would be preferable to use a population where the kernel shoulder is not segregating.
Manual measurements for kernel shape traits and image analysis using artificial intelligence were employed in this study for QTL mapping. Despite the high correlation between both sets of phenotypic data, some QTLs identified through image analysis were not detected with manual measurements. This highlights the efficiency of using this new phenotyping method, not only in reducing data collection efforts but also in increasing the identification of new QTLs.
4.3 Genetic analysis of kernel chemical composition
A cluster of overlapping QTLs located at the lower end of LG1 was identified for the fatty acids. These results indicate that a single molecular marker could predict the content of all these fatty acids. This is particularly significant given the prevalence of these acids in almonds, their role in health benefits associated with almond consumption, and their impact on kernel rancidity, which compromises flavor and postharvest shelf life (Barreca et al., 2020; Becerra-Tomás et al., 2019; Flankin and Mitchell, 2019). Although, these fatty acids were quantified in only one year; the fact that all acids mapped to the same region of LG1, along with the high percentage of variance explained by these QTLs (over 25% for most of the acids), underscores the reliability of the results despite validation with additional year data. Moreover, this region coincides with a QTL previously documented in almonds associated with fatty acids content and located at the bottom of LG1 in a biparental population (‘Vivot’ × ‘Blanquerna’) and a panel of 98 almond accessions (Font i Forcada et al., 2012, 2015b; Sideli et al., 2024). Additionally, this region of LG1 encompasses markers recently associated with different degrees of rancidity (Sideli et al., 2024). It is known that fatty acids and phenolic compounds are related to kernel rancidity, so the overlap of these two regions highlights the impact of these acids on controlling rancidity levels. Furthermore, the additive effect observed with these acids (up to 28.7 mg/g kernel weight for oleic acid) underscores the significant potential for improvement through breeding, which could result in new cultivars with an enhanced fatty acid profile.
We identified four QTLs associated with protein content on LGs 2, 3, 5, and 7. Among these, only qP-Protein3.1 and qP-Protein7.1 co-localize with QTLs mapped by previous studies (Font i Forcada et al., 2012, 2015b). However, the primary QTL for protein content discovered in this study, qP-Protein2.1, did not align with any previously identified QTL region, suggesting that the regulation of protein content involves multiple genes. For fiber and fat content, QTLs related to these traits are reported for the first time by our study. Both qP-Fiber3.1 and qP-Fat3.1 showed low significance, which could be related to their dependence on environmental conditions, as different stresses like drought and temperature have been linked to interannual variation in these compounds (Kodad et al., 2018). This is consistent with the low proportion of phenotypic variability explained by cultivars for fiber (15.7%) and fat (28.4%) contents in previous studies (Romero et al., 2011), highlighting the low heritability of these compounds.
5 Conclusions
In this study, several kernel quality and chemical traits were investigated in a segregating almond F1 population, followed by QTL analyses. The use of the first high-density SNP array developed in almond (Axiom™ 60K SNP array), combined with traditional phenotyping protocols for kernel shape traits with image analysis using artificial intelligence, enhanced the detection power of QTLs. Notably, a region at the lower end of LG1 was mapped where QTLs for different fatty acids co-localized. Additionally, major QTLs for kernel shape and dimensions were identified in a few genomic regions. The results revealed multiple QTLs distributed across the entire genome, with QTL hotspots that can be used by breeders to further implement marker-assisted breeding in almond. These regions will be targeted for further fine mapping and in silico gene annotation to identify genes and polymorphisms associated with the phenotypic variation of these QTLs.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Author contributions
FP: Writing – review & editing, Writing – original draft, Software, Methodology, Investigation, Formal Analysis, Data curation. AR: Writing – review & editing, Methodology, Investigation, Funding acquisition, Conceptualization. LL: Writing – review & editing, Methodology, Data curation. MX: Writing – review & editing, Methodology, Investigation. PA: Writing – review & editing, Methodology, Investigation. IE: Writing – review & editing, Writing – original draft, Supervision, Methodology, Investigation, Funding acquisition, Formal Analysis, Data curation, Conceptualization. IB: Writing – review & editing, Investigation, Funding acquisition, Conceptualization. AC: Writing – review & editing, Writing – original draft, Software, Investigation, Formal Analysis, Data curation.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. We acknowledge financial support from the Spanish Ministry of Science and Innovation through the Estate Agency of Research: Project PID2020-118612RR-I00 (Better Almonds) funded by MICIU/AEI/10.13039/501100011033/and PID2022-140362OR-I00 funded by MICIU/AEI/10.13039/501100011033 and by “ERDF/EU”. Authors FP, IE, IB are grateful to CERCA Program from Generalitat of Catalonia for its support. FP. wishes to acknowledge the receipt of a FPI doctoral fellowship (PRE2018-086724) funded by MICIU/AEI/10.13039/501100011033 and by “ESF Investing in your future”. LL has been funded by “Ministerio de Universidades” and the European-Union Next Generation EU within the frame of Grants for the Requalification of the Spanish University System, modality ‘Margarita Salas’.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2024.1504198/full#supplementary-material
Supplementary Table 1 | Information on the SSR markers used in this study for map construction.
Supplementary Table 2 | Phenotypic values of ‘Marcona’ and ‘Marinada’ parentals, number of individuals considered for QTL analyses (N), and minimum (Min), maximum (Max), mean, and standard deviation (SD) values of the ‘Marcona’ × ‘Marinada’ population. The results of the Shapiro-Wilk normality tests are also included.
Supplementary Table 3 | Coefficient of determination (R2) for all traits for which LSmean values were calculated.
Supplementary Table 4 | Pairwise correlation coefficient among phenotyped traits. Significant (p<0.001) Spearman correlation coefficients are highlighted in bold.
Supplementary Table 5 | Summary of marker position on the ‘Marcona’, ‘Marinada’, and ‘Marcona’ × ‘Marinada’ linkage maps.
Supplementary Table 6 | Summary of all detected QTLs.
References
Alioto, T., Alexiou, K. G., Bardil, A., Barteri, F., Castanera, R., Cruz, F., et al. (2020). Transposons played a major role in the diversification between the closely related almond and peach genomes: results from the almond genome sequence. Plant J. 101, 455–472. doi: 10.1111/tpj.14538
Alonso, J. M., Socias i Company, R., Kodad, O. (2017). Late-blooming in almond: A controversial objective. Scientia Hortic. 224, 61–67. doi: 10.1016/j.scienta.2017.05.036
Barreca, D., Nabavi, S. M., Sureda, A., Rasekhian, M., Raciti, R., Silva, A. S., et al. (2020). Almonds (Prunus dulcis Mill. D.A. Webb): a source of nutrients and health-promoting compounds. Nutrients. 12 (3), 672. doi: 10.3390/nu12030672
Becerra-Tomás, N., Paz-Graniel, I., Kendall, C. W. C., Kahleova, H., Rahelic, D., Sievenpiper, J. L., et al. (2019). Nut consumption and incidence of cardiovascular diseases and cardiovascular disease mortality: a meta-analysis of prospective cohort studies. Nutrition Rev. 77 (10), 691–709. doi: 10.1093/nutrit/nuz042
Ballester, J., Boskovic, R., Batlle, I., Arús, P., Vargas, F., de Vicente, M. C. (1998). Location of the self-compatibility gene on the almond linkage map. Plant Breed. 117, 69–72. doi: 10.1111/j.1439-0523.1998.tb01450.x
Ballester, J., Socias i Company, R., Arús, P., De Vicente, M. C. (2001). Genetic mapping of a major gene delaying blooming time in almond. Plant Breed. 120, 268–270. doi: 10.1046/j.1439-0523.2001.00604.x
Batlle, I., Dicenta, F., Gradziel, T. M., Wirthensohn, M., Duval, H., Vargas, F. J. (2017). “Classical genetics and breeding,” in Almonds: Botany, production and uses. Eds. Socias i Company, R., Gradziel, T. M. (CABI International, USA), 111–148.
Beltrán, A., Maestre, S. E., Grané, N., Valdés, A., Prats, M. S. (2021). Variability of chemical profile in almonds (Prunus dulcis) of different cultivars and origins. Foods 10, 153. doi: 10.3390/foods10010153
Calle, A., Aparicio-Durán, L., Batlle, I., Eduardo, I., Miarnau, X. (2024). ) Review of agronomical and kernel quality traits of 273 almond cultivars. Genet. Resour. Crop Evol. doi: 10.1007/s10722-024-02183-0
Castanera, R., de Tomás, C., Ruggieri, V., Vicient, C., Eduardo, I., Aranzana, M. J., et al. (2024). A phased genome of the highly heterozygous ‘Texas’ almond uncovers patterns of allele-specific expression linked to heterozygous structural variants. Horticulture Res. 11, uhae106. doi: 10.1093/hr/uhae106
D’Amico-Willman, K. M., Ouma, W. Z., Meulia, T., Sideli, G. M., Gradziel, T. M., Fresnedo-Ramírez, J. (2022). Whole-genome sequence and methylome profiling of the almond [Prunus dulcis (Mill.) D.A. Webb] cultivar ‘Nonpareil’. G3 Genes Genomes Genet. 12, jkac065. doi: 10.1093/g3journal/jkac065
Di Guardo, M., Farneti, B., Khomenko, I., Modica, G., Mosca, A., Distefano, G., et al. (2021). Genetic characterization of an almond germplasm collection and volatilome profiling of raw and roasted kernels. Horticulture Res. 8, 27. doi: 10.1038/s41438-021-00465-7
Doyle, J. J., Doyle, J. L. (1990). A rapid Total DNA preparation procedure for fresh plant tissue. Focus 12, 13–15.
Duval, H., Coindre, E., Ramos-Onsins, S. E., Alexiou, K. G., Rubio-Cabetas, M. J., Martínez-García, P. J., et al. (2023). Development and evaluation of an Axiom™ 60K SNP array for almond (Prunus dulcis). Plants 12, 242. doi: 10.3390/plants12020242
Etheridge, R. D., Pesti, G. M., Foster, E. H. (1998). A comparison of nitrogen values obtained utilizing the Kjeldahl nitrogen and Dumas combustion methodologies (Leco CNS 2000) on samples typical of an animal nutrition analytical laboratory. Anim. Feed Sci. Technol. 73, (1–2). doi: 10.1016/S0377-8401(98)00136-9
FAOSTAT (2024). Food and Agriculture Organization of the United Nations (Rome, Italy: FAOSTAT database). Available at: https://www.fao.org.
Fernández i Martí, A., Font i Forcada, C., Socias i Company, R. (2013). Genetic analysis for physical nut traits in almond. Tree Genet. Genomes 9, 455–465. doi: 10.1007/s11295-012-0566-8
Flankin, L. M., Mitchell, A. E. (2019). Review of the sensory and chemical characteristics of almond (Prunus dulcis) flavor. J. Agric. Food Chem. 67, 2743–2753. doi: 10.1021/acs.jafc.8b06606
Font i Forcada, C., Fernández i Martí, A., Socias i Company, R. (2012). Mapping quantitative trait loci for kernel composition in almond. BMC Genet. 13, 1–9. doi: 10.1186/1471-2156-13-47/TABLES/3
Font i Forcada, C., Oraguzie, N., Reyes-Chin-Wo, S., Espiau, M. T., Socias i Company, R., Fernández i Martí, A. (2015a). Identification of genetic loci associated with quality traits in almond via association mapping. PloS One 10. doi: 10.1371/journal.pone.0127656
Font i Forcada, C., Sánchez-Pérez, R., Eduardo, I., Wu, S. B., Fernández i Martí, A. (2017). “Molecular breeding and genomics,” in Almonds: Botany, production and uses. Eds. Socias i Company, R., Gradziel, T. M. (CABI International, USA), 149–167.
Font i Forcada, C., Velasco, L., Socias i Company, R. (2015b). Association mapping for kernel phytosterol content in almond. Front. Plant Sci. 6. doi: 10.3389/fpls.2015.00530
Gonzalo, M. J., Brewer, M. T., Anderson, C., Sullivan, D., Gray, S., Van Der Knaap, E. (2009). Tomato fruit shape analysis using morphometric and morphology attributes implemented in tomato analyzer software program. J. Am. Soc. Hortic. Sci. 134, 77–87. doi: 10.21273/jashs.134.1.77
Goonetilleke, S. N., March, T. J., Wirthensohn, M. G., Arús, P., Walker, A. R., Mather, D. E. (2018). Genotyping by sequencing in almond: SNP discovery, linkage mapping, and marker design. G3: Genes Genomes Genet. 8, 161–172. doi: 10.1534/g3.117.300376
Goonetilleke, S., Wirthesohn, M., Mather, D. (2023). Genetic analysis of quantitative variation in almond nut traits. Tree Genet. Genomes 19, 55. doi: 10.1007/s11295-023-01630-w
Gradziel, T. M., Curtis, R., Socias i Company, R. (2017). “Production and growing regions,” in Almonds: Botany, production and uses. Eds. Socias i Company, R., Gradziel, T. M. (CABI International, USA), 70–86.
Jansen, R. C., Stam, P. (1994). High resolution of quantitative traits into multiple loci via interval mapping. Genetics 136, 1447–1455. doi: 10.1093/genetics/136.4.1447
Jung, S., Lee, T., Cheng, C. H., Buble, K., Zheng, P., Yu, J., et al. (2019). 15 years of GDR: New data and functionality in the Genome Database for Rosaceae. Nucleic Acids Res. 47, 1137–1145. doi: 10.1093/nar/gky1000
Jurado, F. (2024). Applications of artificial intelligence for predictive genomics and phenotyping. Barcelona, Spain: Universitat Autònoma de Barcelona.
Kodad, O. (2017). “Chemical composition of almond nuts,” in Almonds: Botany, production and uses. Eds. Socias i Company, R., Gradziel, T. M. (CABI International, USA), 428–448.
Kodad, O., Socias i Company, R., Alonso, J. M. (2018). Genotyping and environmental effects on tocopherol content in almond. Antioxidants 7, 6. doi: 10.3390/antiox7010006
Kosambi, D. D. (1944). The estimation of map distances from recombination values. Ann. Eugenics 12, 172–175. doi: 10.1111/j.1469-1809.1943.tb02321.x
Lander, E. S., Botstein, D. (1989). Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121, 185–199. doi: 10.1093/genetics/121.1.185
Lipan, L., Martín-Palomo, M. J., Sánchez-Rodríguez, L., Cano-Lamadrid, M., Sendra, E., Hernández, F., et al. (2019). Almond fruit quality can be improved by means of deficit irrigation strategies. Agric. Water Manage. 217, 236–242. doi: 10.1016/j.agwat.2019.02.041
Lipan, L., Romero, A., Echeverria, G., Maldonado, M., Company, T., Escalona, J. M., et al. (2022). Native versus modern almond cultivars of mallorca island: from biodiversity to industrial aptitude and fruit quality. Agronomy 12, 1933. doi: 10.3390/agronomy12081933
Lotti, C., Minervini, A. P., Delvento, C., Losciale, P., Gaeta, L., Sánchez-Pérez, R., et al. (2023). Detection and distribution of two dominant alleles associated with the sweet kernel phenotype in almond cultivated germplasm. Front. Plant Sci. 14, 1171195. doi: 10.3389/fpls.2023.1171195
Martín Carratalá, M. L., García-López, C., Berenguer-Navarro, V., Grané-Teruel, N. (1998). New contribution to the chemometric characterization of almond cultivars on the basis of their fatty acid profiles. J. Agric. Food Chem. 46, 963–967. doi: 10.1021/jf970672h
Martínez-Garcia, P. J., Rubio, M., Cremades, T., Dicenta, F. (2019). Inheritance of shell and kernel shape in almond (Prunus dulcis). Scientia Hortic. 244, 330–338. doi: 10.1016/j.scienta.2018.09.041
Martínez-Gómez, P., Prudencio, A. S., Gradziel, T. M., Dicenta, F. (2017). The delay of flowering time in almond: a review of the combined effect of adaptation, mutation and breeding. Euphytica 213, 197. doi: 10.1007/s10681-017-1974-5
Paizila, A., Karci, H., Motalebipour, E., Guney, M., Kafkas, S. (2021). Quantitative trait loci analysis for flower-related traits in almond (Prunus dulcis). Plant Breed. 141, 119–132. doi: 10.1111/pbr.12986
Pavan, S., Delvento, C., Mazzeo, R., Ricciardi, F., Losciale, P., Gaeta, L., et al. (2021). Almond diversity and homozygosity define structure, kinship, inbreeding, and linkage disequilibrium in cultivated germplasm, and reveal genomic associations with nut and seed weight. Horticulture Res. 8, 15. doi: 10.1038/s41438-020-00447-1
Pérez de los Cobos, F., Coindre, E., Dlalah, N., Quilot-Turion, B., Batlle, I., Arús, P., et al. (2023). Almond population genomics and non-additive GWAS reveal new insights into almond dissemination history and candidate gene for nut traits and blooming time. Horticulture Res. 10, uhad193. doi: 10.1093/hr/uhad193
Rabadán, A., Álvarez-Ortí, M., Gómez, R., de Miguel, C., Pardo, J. E. (2018). Influence of genotype and crop year in the chemometrics of almond and pistachio oils. J. Sci. Food Agric. 98, 2402–2410. doi: 10.1002/jsfa.8732
Ricciardi, F., Del Cueto, J., Bardaro, N., Mazzeo, R., Ricciardi, L., Dicenta, F., et al. (2018). Synteny-based development of CAPS markers linked to the sweet kernel LOCUS, controlling amygdalin accumulation in almond [Prunus dulcis (Mill.) D.A. Webb. Genes 9, 385. doi: 10.3390/genes9080385
Romero, A. (2014). Almond quality requirements for industrial purposes - Its relevance for the future acceptance of new cultivars from breeding programs. Acta Hortic. 1028, 213–220. doi: 10.17660/ActaHortic.2014.1028.34
Romero, A., Rovira, M., Cristofori, V., Silvestri, C. (2021). Hazelnut kernel size and industrial aptitude. Agriculture 11, 1115. doi: 10.3390/agriculture11111115
Romero, A., Vargas, F. J., Tous, J., Ninot, A., Miarnau, X. (2011). New almond varieties from IRTA’s breeding programme: (1) chemical composition. Acta Hortic. 912, 477–484. doi: 10.17660/ActaHortic.2011.912.73
Sánchez-Pérez, R., Howad, W., Dicenta, F., Arús, P., Martínez-Gómez, P. (2007). Mapping major genes and quantitative trait loci controlling agronomic traits in almond. Plant Breed. 126, 310–318. doi: 10.1111/j.1439-0523.2007.01329.x
Sánchez-Pérez, R., Howad, W., Garcia-Mas, J., Arús, P., Martínez-Gómez, P., Dicenta, F. (2010). Molecular markers for kernel bitterness in almond. Tree Genet. Genomes 6, 237–245. doi: 10.1007/s11295-009-0244-7
Sánchez-Pérez, R., Pavan, S., Mazzeo, R., Moldovan, C., Aiese Cigliano, R., Del Cueto, J., et al. (2019). Mutation of a bHLH transcription factor allowed almond domestication. Science 364, 1095–1098. doi: 10.1126/science.aav8197
Sideli, G. M., Mather, D., Wirthensohn, M., Dicenta, F., Goonetilleke, S. N., Martínez-García, P. J., et al. (2023). Genome-wide association analysis and validation with KASP markers for nut and shell traits in almond (Prunus dulcis [Mill.] D.A.Webb). Tree Genet. Genomes 19, 13. doi: 10.1007/s11295-023-01588-9
Sideli, G., Wang, S., Allen, B., Grilo, F., Gradziel, T. (2024). A novel detection method for lipid stability in almond leads to the discovery of marker-trait associations. Fruit Res. doi: 10.48130/frures-0024-0012. in press.
Spasibionek, S., Mikołajczyk, K., Ćwiek–Kupczyńska, H., Piętka, T., Krótka, K., Matuszczak, M., et al. (2020). Marker assisted selection of new high oleic and low linolenic winter oilseed rape (Brassica napus L.) inbred lines revealing good agricultural value. PloS One 15, e0233959. doi: 10.1371/journal.pone.0233959
van Ooijen, J. W. (1992). Accuracy of mapping quantitative trait loci in autogamous species. Theor. Appl. Genet. 84, 803–811. doi: 10.1007/BF00227388
van Ooijen, J. W. (2009). MapQTL® 6, Software for the mapping of quantitative trait loci in experimental populations of diploid species (Wageningen, Netherlands: Kyazma B.V.).
van Ooijen, J. W. (2018). JoinMap® 5, Software for the calculation of the genetic linkage maps in experimental populations of diploid species (Wageningen, Netherlands: Kyazma B.V.).
Vargas, F., Romero, M., Clavé, J., Vergés, J., Santos, J., Batlle, I. (2008). 'Vairo', 'Marinada', 'Constantí', and 'Tarraco' almonds. Hortscience 43, 535–537. doi: 10.21273/HORTSCI.43.2.535
Voorrips, R. E. (2002). MapChart: software for the graphical presentation of linkage maps and QTLs. J. Heredity 93, 77–78. doi: 10.1093/jhered/93.1.77
Whalen, A., Gorjanc, G., Hickey, J. M. (2020). AlphaFamImpute: High-accuracy imputation in full-sib families from genotype-by-sequencing data. Bioinformatics 36, 4369–4371. doi: 10.1093/bioinformatics/btaa499
Keywords: Prunus dulcis, breeding, kernel traits, linkage mapping, quantitative trait loci
Citation: Pérez de los Cobos F, Romero A, Lipan L, Miarnau X, Arús P, Eduardo I, Batlle I and Calle A (2024) QTL mapping of almond kernel quality traits in the F1 progeny of ‘Marcona’ × ‘Marinada’. Front. Plant Sci. 15:1504198. doi: 10.3389/fpls.2024.1504198
Received: 30 September 2024; Accepted: 11 November 2024;
Published: 27 November 2024.
Edited by:
Pedro José Martínez-García, Spanish National Research Council (CSIC), SpainReviewed by:
Stefano Pavan, University of Bari Aldo Moro, ItalyMario Di Guardo, University of Catania, Italy
Copyright © 2024 Pérez de los Cobos, Romero, Lipan, Miarnau, Arús, Eduardo, Batlle and Calle. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Alejandro Calle, YWxlamFuZHJvLmNhbGxlQGlydGEuY2F0