- 1Crop Development Centre, Department of Plant Sciences, University of Saskatchewan, Saskatoon, SK, Canada
- 2Department of Food and Human Nutritional Sciences, University of Manitoba, Winnipeg, MB, Canada
- 3Department of Food and Bioproduct Science, University of Saskatchewan, Saskatoon, SK, Canada
- 4Richardson Centre for Food Technology and Research, University of Manitoba, Winnipeg, MB, Canada
With the expanding interest in plant-based proteins in the food industry, increasing emphasis is being placed on breeding for protein concentration and quality. Two protein quality traits i.e., amino acid profile and protein digestibility, were assessed in replicated, multi-location field trials from 2019 to 2021 in pea recombinant inbred line population PR-25. This RIL population was targeted specifically for the research of protein related traits and its parents, CDC Amarillo and CDC Limerick, had distinct variations in the concentration of several amino acids. Amino acid profile was determined using near infrared reflectance analysis, and protein digestibility was through an in vitro method. Several essential amino acids were selected for QTL analysis, including lysine, one of the most abundant essential amino acids in pea, and methionine, cysteine, and tryptophan, the limiting amino acids in pea. Based on phenotypic data of amino acid profiles and in vitro protein digestibility of PR-25 harvested in seven location-years, three QTLs were associated with methionine + cysteine concentration, among which, one was located on chromosome 2 (R2 = 17%, indicates this QTL explained 17% phenotypic variation of methionine + cysteine concentration within PR-25), and two were located on chromosome 5 (R2 = 11% and 16%). Four QTLs were associated with tryptophan concentration and are located on chromosome 1 (R2 = 9%), chromosome 3 (R2 = 9%), and chromosome 5 (R2 = 8% and 13%). Three QTLs were associated with lysine concentration, among which, one was located on chromosome 3 (R2 = 10%), the other two were located on chromosome 4 (R2 = 15% and 21%). Two QTLs were associated with in vitro protein digestibility, one each located on chromosomes 1 (R2 = 11%) and 2 (R2 = 10%). QTLs associated with in vitro protein digestibility, and methionine + cysteine concentration on chromosome 2 were identified to be co-localized with known QTL for total seed protein concentration in PR-25. QTLs associated with tryptophan and methionine + cysteine concentration co-localized on chromosome 5. The identification of QTLs associated with pea seed quality is an important step towards marker-assisted selection of breeding lines with improved nutritional quality, which will further boost the competitiveness of pea in plant-based protein markets.
Introduction
Despite the currently relatively high sales price of the plant-based protein products, their popularity is increasing in recent years (Van Loo et al., 2020; Estell et al., 2021). Trends toward sustainable agriculture, healthy diets, food security, expanding global population, animal welfare, and religious beliefs are contributing to the expansion of plant-based protein product launches to meet the diverse needs (Sabate & Soret, 2014; Alcorta et al., 2021; Aschemann-Witzel et al., 2021).
Pea protein, as one of the common ingredients in the fabrication of meat alternatives, is rich in lysine, but relatively low in tryptophan and sulfur containing amino acids, such as methionine and cysteine (Leterme et al., 1990; Pilorge et al., 2021). To provide a complete and balanced set of amino acids, food companies need to complement pea protein with cereal protein, since cereals tend to be rich in the sulfur amino acids, but limiting in lysine (Srivastava et al., 2015). Compared to soybean protein, which is also widely used in plant-based protein products, pea protein is less allergenic, and contains less offensive off-flavors, such as beany, chalky flavors, and hence require less processing (Hoffmann et al., 2017; Taylor et al., 2021). These advantages could contribute to the wider use of pea protein in the future, but the current limitation of sulfur amino acids could also be an obstacle to wider use. Until now, most of the emphasis related to pea breeding for protein has been related to protein quantity instead of quality. But as the demands of the plant-based protein industries is continuously growing, a more profound study of pea protein quality traits is necessary.
Protein digestibility corrected amino acid score (PDCAAS) is an index that has been widely adopted when evaluating the quality of a specific protein. PDCAAS is based on both the amino acids profile and the digestibility of the proteins (Schaafsma, 2005). Amino acid profile is typically assessed by HPLC analysis, (Dołowy & Pyka, 2014; Sharma et al., 2014). HPLC analysis produces accurate results but is expensive and low throughput. Amino acids have distinct attributes, which increases the complexity of enzyme digestion steps. Three different digestion steps are required in HPLC analysis to assess the complete amino acid profile, and long incubation periods makes it a cumbersome method for amino acid profiling on large numbers of samples typical of plant breeding programs (Kambhampati et al., 2019). The use of near-infrared spectroscopy (NIR) has been widely used to assess the protein concentration in different crops. Recently, this non-destructive approach has been extended to assess the amino acid profile of legume seed samples (Hang et al., 2022; Shi et al., 2022). Shi et al. (2022) developed a calibration curve to predict the concentration of individual amino acids in soybean based on the NIR scans. The development and enhancement of the curve relies on data derived from established and validated wet chemistry reference methods, including the use of liquid chromatographic methods. For most amino acids, relatively high correlations were detected between the reference chemistry method, namely HPLC, and the NIR analysis. The correlation coefficients fell within a range of 0.76 to 0.91. Even for the least abundant amino acids, for instance, methionine, the correlation coefficient was 0.77. The benefits of using NIR relate to high throughput, low cost of analysis, and the potential to use intact seeds for non-destructive analysis (Jiang, 2020). The latter is particularly important when dealing with precious seed quantities such as those derived from breeding programs.
Protein digestibility reflects the extent to which a protein is broken down into its constituent amino acids and the latter made available for absorption within the gastrointestinal tract. In vivo and in vitro methods have been used to measure protein digestibility. The in vivo method assesses true fecal digestion in a rodent model and is accurate, but it has the limitations of high cost and low throughput, due to the significant involvement of animals in the analysis (Tavano et al., 2016). In contrast, in vitro methods simulate conditions within the digestive tract, including enzyme digestion, with protein digestibility assessed by various methods, including the change of pH of samples over a defined time course. This reproducible approach provides a method with relatively high throughput compared to the in vivo method and has been adopted to assess digestibility of different plant proteins (Duodu et al., 2002; Bessada et al., 2019; Ketnawa & Ogawa, 2021).
With more diverse needs from both consumers and the food industry, it has become necessary to develop crop varieties with improved protein quality. Understanding the underlying genetic control of protein quality traits is critical to aid in breeding such varieties. In the current study, we used PR-25, a recombinant inbred line population, to understand the genetic basis of protein quality traits in pea. PR-25 is derived from the cross of a medium protein parent CDC Amarillo and a high protein parent CDC Limerick. These two genotypes differed in individual amino acid concentrations from 10% to 27%, which makes PR-25 as a valuable source for the identification of QTL regions associated with protein quality traits. These QTL regions have the potential to be developed into breeder-friendly DNA markers, which could be used to improve the protein quality of pea and hence, increase the competitiveness of pea in plant-based protein markets.
Materials and methods
PR-25 mapping population
The PR-25 mapping population used in this study is a recombinant inbred line (RIL) population with 110 lines derived from the cross of CDC Amarillo x CDC Limerick. The mapping population was grown in seven locations during 2019 to 2021 in microplots (1 m2) and the harvested seed from individual plots was used in the current study. Information about PR-25, including its seeding date, harvest date and crop management in all locations were detailed in Zhou et al. (2022). Briefly, PR-25 was grown at Sutherland nursery in 2019 with 2 biological replicates. It was grown at Sutherland, Rosthern and Lucky Lake nurseries in 2020 and Floral, Rosthern and Lucky Lake nurseries in 2021 with 3 biological replicates per nursery. All nurseries were in Saskatchewan, Canada and among which, Sutherland and Floral are located in the Dark Brown soil zone, Rosthern is located in the Black soil zone, and Lucky Lake is located in the Brown soil zone. Best management practices for field pea production in western Canada were used at each location.
Amino acid profiling and NIR calibration development
A total of 2359 whole seed samples, including 159 samples from 2017 GWAS (Gali et al., 2019) and 2200 samples from 2019-2021 PR-25, were used for calibration development and amino acid profile assessment (Table 1). They were stored at -20°C before any analysis. All samples were scanned in Dr. James House’s lab (University of Manitoba, Winnipeg, MB, Canada) via NIR spectroscopy on a PerkinElmer DA 7250 diode array NIR system (PerkinElmer Health Sciences Canada Inc.) to obtain spectral data. A sub-sample of 339, including all 159 samples from 2017 GWAS and 180 samples from 2019-2021 PR-25, were selected from 2359 samples for development and improvement of the calibration formula. These sub-samples were analyzed via HPLC for amino acid concentration and the protocol of HPLC was detailed by Shi et al. (2022). A prediction model was created using the whole seed spectrum and chemical data of 2017 GWAS samples, the calibration was then applied to all remaining samples to estimate chemical compositions (Shi et al., 2022). Annual maintenance and improvement of prediction model were made based on the HPLC data of selected samples from PR-25. Sixty samples each were chosen from year 2019 to 2021 to improve the accuracy of calibration formula. Samples in each year were divided into quartiles of predicted crude protein content in descending order and within each quartile, 15 samples were drawn randomly.
Table 1 Information of whole seed pea samples used for near infrared reflectance (NIR) calibration development and amino acid profile assessment.
All seed samples were stored in a 4°C walk-in cooler prior to NIR scanning to avoid protein denaturation. After scanning with NIRS, approximately 20 grams of each selected sample was ground with a Retsch ZM-200 grinder (Retsch, Haan, Germany) using a 0.75mm sieve and then stored at -20 °C before further analysis.
Detailed information about NIR analysis of whole seeds, as well as wet chemistry analysis of protein and amino acid concentrations was reported by Shi (2021). Detailed information about the development of an NIR calibration model for pea amino acid concentration was reported by Hang et al. (2022).
In vitro protein digestibility assessment
Sample preparations were conducted in the Grain Innovation Lab (Crop Development Centre, University of Saskatchewan). Five grams of pea seeds per line from 2019 PR-25 yield trials were ground into homogeneous powder using a cyclone sample mill model 3010-030 (UDY Corporation, USA). Protein digestibility was evaluated using an in vitro method. Detailed protocol of in vitro protein digestibility (IVPD) determination was described by Cabuk et al. (2018).
Measurement of in vitro protein digestibility-corrected amino acid score
In vitro PDCAAS was calculated as follows based on Nosworthy et al. (2017):
Statistical analysis
Two sample t-tests with equal variance were conducted on protein related traits, including the concentration of protein and each of 18 amino acids, for the parents of PR-25, CDC Amarillo and CDC Limerick (Table 2). Data of CDC Amarillo and CDC Limerick from all 7 station-years were used for the t-test. A Pearson correlation analysis was performed among protein quality traits, including total protein, protein digestibility (in vitro), PDCAAS and 18 amino acids found in pea seeds (Table 3), based on the average of 7 station-years data (2 biological replicates in 2019 Sutherland, 3 biological replicates in 2020 Sutherland, 2020 Rosthern, 2020 Lucky Lake, 2021 Floral, 2021 Rosthern, 2021 Lucky Lake) for each trait. Analysis of variance (ANOVA) was conducted for amino acids of interest, including methionine, cystine, tryptophan and lysine using 7 station-years data; ANOVA of multi-environment function was conducted in QTL IciMapping software (Table 4). A comparison of amino acid profiles was made among years 2019, 2020, 2021 and the average of all 7 station-years (Figure 1). A comparison of amino acid profiles was also made between CDC Amarillo and CDC Limerick based on their average of 7 station-years (Figure 2). Each amino acid profile represented the average percentage of each amino acid in pea seeds in the given year. Frequency distribution of methionine + cysteine concentration, tryptophan concentration, lysine concentration, protein digestibility (in vitro) and PDCAAS was made based on the average of biological replicates for each line in the given year-location (Figures 3–7).
Table 2 Two sample t-tests with equal variance between the parents of PR-25, CDC Amarillo and CDC Limerick, across 7 station-years.
Table 3 Correlation analysis among total protein, protein digestibility (in vitro), PDCAAS and 18 amino acids found in pea seed.
Figure 1 Amino acids profile of PR-25 from field trials conducted in Sutherland nursery in 2019, Sutherland, Rosthern, Lucky Lake nurseries in 2020 and Floral, Rosthern and Lucky Lake nurseries in 2021, and the average amino acid profiles of PR-25 across the seven station-years.
Figure 3 Frequency distribution of methionine + cysteine concentration of PR-25 across seven station-years based on the average of biological replicates for each line in each station-year (two biological replicates in 2019 Sutherland and three biological replicates for the rest of the station-years), and the averaged methionine + cysteine concentration for the parents of PR-25 in each station-year.
Figure 4 Frequency distribution of tryptophan concentration of PR-25 across seven station-years based on the average of biological replicates for each line in each station-year (two biological replicates in 2019 Sutherland and three biological replicates for the rest of the station-years), and the averaged tryptophan concentration for the parents of PR-25 in each station-year.
Figure 5 Frequency distribution of lysine concentration of PR-25 across seven station-years based on the average of biological replicates for each line in each station-year (two biological replicates in 2019 Sutherland and three biological replicates for the rest of the station-years), and the averaged lysine concentration for the parents of PR-25 in each station-year.
Figure 6 Frequency distribution of protein digestibility (in vitro) of PR-25 across seven station-years based on the average of biological replicates for each line in each station-year (two biological replicates in 2019 Sutherland and three biological replicates for the rest of the station-years), and the averaged protein digestibility (in vitro) for the parents of PR-25 in each station-year.
Figure 7 Frequency distribution of PDCAAS of PR-25 across seven station-years based on the average of biological replicates for each line in each station-year (two biological replicates in 2019 Sutherland and three biological replicates for the rest of the station-years), and the averaged PDCAAS for the parents of PR-25 in each station-year.
Genotyping, linkage map construction and QTL identification
PR-25 population was genotyped using Axiom® 90K SNP array and a linkage map representing 901 unique loci and measuring 855.4 cM was reported by Zhou et al. (2022). This linkage map was used for QTL analysis in the current study. The nomenclature of the markers was in accordance with Axiom® 90K SNP array. To increase the accuracy of QTL identification, QTL analyses were first performed individually in each station-year. Data of some station-years, in which their individual analysis didn’t include any significant QTL, were excluded and the rest were averaged and used for combined QTL analysis. Phenotypic traits, including methionine + cysteine concentration, tryptophan concentration, lysine concentration and in vitro protein digestibility, were assessed. QTL analysis was conducted via Windows QTL Cartographer (Wang et al., 2012), where cross type was set as Ri1, map function was set as Kosambi, analysis type was set as composite interval mapping, permutation time was set as 1000, significance level was set as 0.05, and walk speed was set as 1.0 Cm. Other detailed information of the QTL identification approach was described in Zhou et al., 2022.
Results
The results of two sample t-test between the parents of PR-25 population, CDC Amarillo and CDC Limerick, showed that they differed significantly in their concentration of seed protein and most of the amino acids, except for alanine (Table 2). The average proportion of each amino acid from field trials conducted over three seasons is summarized in Figure 1. The average of each amino acid content from the 7 station-years can be ranked from top to bottom as: Glutamine (18.3%), aspartic acid (12.4%), lysine (7.7%), leucine (7.5%), arginine (7.5%), phenylalanine (5.2%), serine (5.1%), valine (4.9%), glycine (4.5%), isoleucine (4.4%), proline (4.4%), alanine (4.3%), threonine (3.9%), tyrosine (3.3%), histidine (2.9%), cysteine (1.4%), methionine (1.2%), tryptophan (1.1%). The average proportion of each amino acid of CDC Amarillo and CDC Limerick from 7 station-years is summarized in Figure 2. There were positive correlations among most amino acids (Table 3). Variation in amino acid profile was detected across the station-years. Analysis of variance of methionine, cysteine, tryptophan, and lysine showed that the phenotypic variation can be attributed to the effects of genotype, station-year, and the interaction of genotype and station-year (Table 4). Frequency distribution of methionine + cysteine concentration, tryptophan concentration, lysine concentration, protein digestibility and PDCAAS presented the ranges, the distribution of phenotypic scores of PR-25 in each station-year (Figures 3–7).
As the primary interest of this research was related to both the abundant and limiting amino acids, hence, lysine, as the abundant amino acid, and tryptophan, cysteine and methionine, as the limiting amino acids, were selected for QTL analysis (Table 4). Cysteine and methionine were combined in QTL analysis since they belong to same metabolic pathway (Ravanel et al, 1998). ANOVA showed that environment and genotype × environment interactions had significant contributions to the variation in amino acid concentrations. Therefore, QTL analysis was based on averaged data from selected station-years from which their individual analysis showed significant QTLs. A total of 14 QTLs were identified associated with protein quality traits of interest and they were significant in at least 4 of 7 station-years when compared individually. Four QTLs were identified associated with the methionine + cysteine concentration. Met+Cys-QTL-1 was found on chromosome 2 and it explained 15% of the phenotypic variation, its flanking markers were Chr2LG1_244771437/Chr2LG1_287501555 and there were 22 markers in between; Met+Cys-QTL-2 was found on chromosome 5 and it explained 11% of the phenotypic variation, its flanking markers were Chr5LG3_4173823/Chr5LG3_15801800 and there were 10 markers in between; Met+Cys-QTL-3 was also found on chromosome 5 and it explained 16% of the phenotypic variation, its flanking markers were Chr5LG3_101924498/Chr5LG3_137457380 and there were 10 markers in between; Met+Cys-QTL-4 was found on chromosome 3 and it explained 10% of the phenotypic variation, its flanking markers were Chr3LG5_120117355/Chr3LG5_408080154 and there were 8 markers in between (Figure 8). Chr2LG1_259006623, Chr5LG3_5113345, Chr5LG3_137457380 were the loci within the peak region of each of these three methionine + cysteine associated QTLs. Five QTLs were identified associated with tryptophan concentration, Trp-QTL-1 was found on chromosome 5 and it explained 8% of the phenotypic variation, its flanking markers were Chr5LG3_5127342/Chr5LG3_7509381 and there were 2 markers in between; Trp-QTL-2 was also found on chromosome 5 and it explained 13% of the phenotypic variation, its flanking markers were Chr5LG3_67663653/Chr5LG3_112710798 and there were 10 markers in between; Trp-QTL-3 was found on chromosome 3 and it explained 9% of the phenotypic variation, its flanking markers were Chr3LG5_185794949/Chr3LG5_198663551 and there were 4 markers in between; Trp-QTL-4 was also found on chromosome 3 and it explained 8% of the phenotypic variation, its flanking markers were Chr3LG5_424086163/Chr3LG5_455814220 and there were 2 markers in between; Trp-QTL-5 was found on chromosome 1 and it explained 9% of the phenotypic variation, its flanking markers were Chr1LG6_132953926/Chr1LG6_233117537 and there were 2 markers in between. Chr5LG3_5381756, Chr5LG3_84309239, Chr3LG5_197202364, Chr1LG6_233117537 were the loci within the peak region of each of these four tryptophan associated QTLs. Three QTLs were identified associated with lysine concentration. Lys-QTL-1 was found on chromosome 4 and it explained 21% of the phenotypic variation, its flanking markers were Chr4LG4_185310109/Chr4LG4_218381712 and there were 23 markers in between; Lys-QTL-2 was also found on chromosome 4 and it explained 15% of the phenotypic variation, its flanking markers were Sc02659_148875/Chr4LG4_417303831 and there were 2 markers in between; Lys-QTL-3 was found on chromosome 1 and it explained 10% of the phenotypic variation, its flanking markers were Chr3LG5_424086163/Chr3LG5_437233435 and there were 2 markers in between. Chr4LG4_206035753, Chr4LG4_326486541, Chr3LG5_437233435 were the loci within the peak region of each of these three lysine associated QTLs. Two QTLs were identified associated with IVPD. IVPD-QTL-1 was found on chromosome 2 and it explained 10% of the phenotypic variation, its flanking markers were Chr2LG1_285985643/Chr2LG1_290867919 and there were 6 markers in between; IVPD-QTL-2 was found on chromosome 4 and it explained 11% of the phenotypic variation, its flanking markers were Chr1LG6_36689547/Chr1LG6_71617678 and there were 12 markers in between. Chr2LG1_291265214, Chr1LG6_41413580 were the loci within the peak region of each of these two in vitro protein digestibility associated QTLs. In addition, a set of QTL analyses were conducted in each station-year for methionine + cysteine concentration, tryptophan concentration, lysine concentration and protein digestibility, as shown in Table 5. The positions of identified QTLs, based on the individual QTL analysis from each station-year, were presented in Supplementary Figures 1–4.
Figure 8 Combined QTL analysis in PR-25 reveals the QTLs associated for protein concentration (PC) (Zhou et al., 2022), in vitro protein digestibility (IVPD), lysine concentration (K), tryptophan concentration (W) and methionine + cysteine concentration (M+C). Combined analysis was based on averaged phenotypic data from station-years in which the QTL was significant (minimum of 3 of 7 station-years).
Table 5 Detected quantitative trait loci (QTL) for methionine + cysteine, tryptophan, lysine concentration, and in vitro protein digestibility in PR-25 using composite interval mapping (CIM).
Discussion
Development of pea varieties with high seed protein concentration and quality is necessary to fulfill the growing plant-based protein demand. To accomplish this goal, a better understanding of the underlying genetic control of the protein related traits is required.
Pea recombinant inbred line population PR-25, derived from the cross of CDC Amarillo and CDC Limerick, was developed specifically for the study of protein related traits. The variation in concentration of individual amino acids between CDC Amarillo and CDC Limerick ranged from 10 to 27%. They were significantly different in the concentration of almost all amino acids, except for alanine. These ensured sufficient diversity within PR-25 population to identify amino acid related QTLs. CDC Amarillo and CDC Limerick are widely grown in western Canada for their good yield and protein concentration, therefore, PR-25 is an ideal population for the research of protein-quality traits and could potentially avoid tradeoff between favorable traits.
HPLC analysis of individual amino acids is a destructive method which requires protein digestion steps. However, the unique attributes of some amino acids increase the complexity of the digestion steps. In comparison to HPLC analysis, NIR analysis is a non-destructive, high throughput and cost-efficient method to assess amino acid profile for pea protein. Amino acid profile of a thousand samples could be assessed in a week by using NIR analysis, while it may take months if using HPLC analysis. In the current study, the calibration formula developed based on HPLC quantification of amino acids in pea genotypes was used for NIR based prediction of amino acids. For most amino acids, their correlation coefficient values (r) were above 0.9. For the limiting amino acids, such as methionine, cystine and tryptophan, their r values were also acceptable, which were 0.733, 0.833 and 0.855, respectively.
Pea protein is limited in sulfur amino acids and is abundant in lysine, thus complements cereal protein to provide complete plant protein. There is a debate on whether the improvements of limiting amino acids in certain crops are necessary since the other option would always be paired up with a different crop to provide complete nutrition. Though it seems less cost-effective to enhance the nutritional attributes of a single crop, this approach has several benefits. When introducing more ingredients into a food product, there is an increased risk of increasing the allergenicity of the product. Secondly, having more ingredients in a food product often means more food processing steps, including masking of unpleasant flavors, or more addition of food coloring which increases the cost. Meanwhile, over-processing is an issue that causes a decline in consumer acceptance. There is increased willingness among consumers to opt for products with fewer ingredients, driven by their consciousness for more healthy diets. Furthermore, climate change has different impacts on crops, and some are more severely influenced (Raza et al., 2019). The improvement of protein quality in single crop would contribute to food security and provide strength for agricultural businesses.
In PR-25, the averaged in vitro PDCAAS of 7 station-years ranged from 0.73 to 0.94. The PDCAAS of pea protein isolates was reported as 0.86 for children and 0.93 for adults by the FDA (2019). Some lines in PR-25 had lower PDCAAS than what was reported by the FDA. However, the measurements were conducted on whole pea seeds in the current study, while FDA measured PDCAAS on protein isolates. Since the fractionation process had positive effects on PDCAAS via improving the protein digestibility, whole seed samples were expected to have a lower PDCAAS than protein concentrates or isolates (Rivera Del Rio et al., 2022).
Some variation in amino acid profile was detected among station-years. These variations can be attributed to the effects of environments, and the interaction between genotype and environment. These abiotic factors impact individual amino acids differently and some amino acids, for instance, arginine and phenylalanine, had larger variations across station-years compared to others, which would lead to a change of their proportion in pea protein. However, these proportional changes in some amino acids only had limited impact on the overall amino acid profile as the profile didn’t change significantly across station-years.
All 18 amino acids assessed in this study were found positively correlated with total protein concentration, in most cases with correlation coefficients above 0.8. Meanwhile, close proximities or overlaps were found on several protein-related traits. Met+Cys-QTL-1 was found adjacent to PC-QTL-1 on chromosome 2, where IVPD-QTL-1 was located in between. Trp-QTL-1 was found within Met+Cys-QTL-2 on one end of chromosome 5 linkage group 3a and Trp-QTL-2 overlapped with Met+Cys-QTL-3 on another end. Met+Cys-QTL-4, PC-QTL-3, Lys-QTL-3 and Trp-QTL-4 were found overlapped on one end of Chromosome 3. Though overlaps were found among these protein-related traits, it didn’t necessarily lead to strong correlations. For instance, the correlation between protein concentration and methionine + cysteine concentration was 0.55, between IVPD and protein concentrations was 0.10, between IVPD and methionine + cysteine concentration was 0.17, between tryptophan concentration and methionine + cysteine concentration was 0.69. The main reason was that all protein associated traits assessed in this study are quantitative traits that are regulated by multiple loci. The identified QTL regions only explained small portions of the phenotypic variations and hence, though some of the QTL regions of these traits were in close proximity, their correlations were not very strong. Yet, within these overlapped QTL regions, there are potentials to find pleiotropic QTLs that control multiple protein quality traits of pea, as the research by Li et al. (2018) had identified several pleiotropic QTLs that associated with multiple amino acids concentrations in soybeans.
QTL analyses on pea seed traits including protein and mineral concentration, seed yield, thousand seed weight, seed number per plant, have been conducted by several research programs in the past two decades and numerous QTL had been identified associated with these traits (Tar'an et al., 2004; Ma et al., 2017; Dissanayaka et al., 2020; Klein et al., 2020; Zhou et al., 2022). Protein digestibility and amino acid profile have also been studied for deeper understanding of the functional attributes of pea protein (Nosworthy & House, 2017; Cabuk et al., 2018). However, none of the previous publications was related to QTL analysis for pea protein digestibility and amino acids concentration, and the QTLs identified in this study provide valuable information of the underlying genetic control of these traits. Twelve loci were found within the peak regions of the QTLs identified in this study, three were associated with methionine + cysteine concentration, four were associated with tryptophan concentration, three were associated with lysine concentration, and two were associated with in vitro protein digestibility. The information of these loci will be beneficial for developing markers to facilitate the selection of high protein-quality varieties in pea breeding. Several lines in PR-25, including PR-25-2, PR-25-46-PR-25-53, PR-25-86 and PR-25-96, had high concentrations of methionine, cysteine and tryptophan, as well as good protein concentration and grain yield. These lines had consistently good performance in protein quality traits across 7 station-years without compromise on other valued traits. These lines could be the potential high protein quality varieties or could be used as parental materials to develop varieties with better performance.
Conclusion
The effect of genotype x environment interaction on the amino acid concentrations of pea cultivars is significant. The segregation pattern of amino acid concentrations in PR-25 population combined with the NIR-based predictions offers a possibility for high throughput selection of breeding lines for amino acid concentrations. Three QTLs were found associated with methionine + cysteine concentration, four QTLs were found associated with the tryptophan concentration, three QTLs were found associated with lysine concentration and two QTLs were found associated with in vitro protein digestibility. Overlaps were found among protein-related traits on chromosome 2 and chromosome 5. These identified QTL regions have a potential for use in marker-assisted selection of protein quality traits.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Author contributions
TW, BT and KG conceptualized the study and secured funding for this study through a research grant from the Saskatchewan Ministry of Agriculture. KG developed the RIL population and conducted genotyping and linkage map construction. JZ conducted field trials, collected phenotypic data, conducted IVPD experiments, data curation, QTL analysis and wrote the manuscript. MN provided expert advice on IVPD experiments. JH designed methodology for amino acid profiling via NIR analysis. ZW conducted amino acid profiling. All authors contributed to the article and approved the submitted version.
Funding
This research was part of the research project Pea Protein ‘Omics Determination (P-POD) supported by the Saskatchewan Ministry of Agriculture, Strategic Research Initiative (project # 20180436).
Acknowledgments
We thank Brent Barlow and the rest of the pulse crop breeding staff at the Crop Development Centre for planting, maintaining, and harvesting PR-25 from 2019 to 2021. We thank Perkin Elmer for providing access to the DA 7250 NIR spectrophotometer used in the current study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1083086/full#supplementary-material
Supplementary Figure 1 | Individual QTL analysis from each station-year in PR-25 reveals the QTLs associated for methionine + cysteine concentration. Individual QTL analysis was based on average phenotypic data of biological replicates in each station-year.
Supplementary Figure 2 | Individual QTL analysis from each station-year in PR-25 reveals the QTLs associated for tryptophan concentration. Individual QTL analysis was based on average phenotypic data of biological replicates in each station-year.
Supplementary Figure 3 | Individual QTL analysis from each station-year in PR-25 reveals the QTLs associated for lysine concentration. Individual QTL analysis was based on average phenotypic data of biological replicates in each station-year.
Supplementary Figure 4 | Individual QTL analysis from each station-year in PR-25 reveals the QTLs associated for in vitro protein digestibility. Individual QTL analysis was based on average phenotypic data of biological replicates in each station-year.
References
Alcorta, A., Porta, A., Tárrega, A., Alvarez, M. D., Vaquero, M. P. (2021). Foods for plant-based diets: Challenges and innovations. Foods 10, 293. doi: 10.3390/foods10020293
Aschemann-Witzel, J., Gantriis, R. F., Fraga, P., Perez-Cueto, F. J. A. (2021). Plant-based food and protein trend from a business perspective: Markets, consumers, and the challenges and opportunities in the future. Crit. Rev. Food Sci. Nutr. 61 (18), 3119–3128. doi: 10.1080/10408398.2020.1793730
Bessada, S. M. F., Barreira, J. C. M., Oliveira, B. P. P. (2019). Pulses and food security: Dietary protein, digestibility, bioactive and functional properties. Trends Food Sci. Technol. 93, 53–68. doi: 10.1016/j.tifs.2019.08.022
Cabuk, B., Nosworthy, M. G., Stone, A. K., Korber, D. R., Tanaka, T., House, J. D., et al. (2018). Effect of fermentation on the protein digestibility and levels of non-nutritive compounds of pea protein concentrate. Food Technol. Biotechnol. 56 (2), 257–264. doi: 10.17113/ftb.56.02.18.5450
Dissanayaka, D. N., Gali, K. K., Jha, A. B., Lachagari, V. B. R., Warkentin, T. D. (2020). Genome-wide association study to identify single nucleotide polymorphisms associated with fe, zn, and Se concentration in field pea. Crop Sci. 60, 2070–2084. doi: 10.1002/csc2.20161
Dołowy, M., Pyka, A. (2014). Application of TLC, HPLC and GC methods to the study of amino acid and peptide enantiomers: A review. Biomed. Chromatogr. 28 (1), 84–101. doi: 10.1002/bmc.3016
Duodu, K. G., Nunes, A., Delgadillo, I., Parker, M. L., Mills, E. N. C., Belton, P. S., et al. (2002). Effect of grain structure and cooking on sorghum and maize in vitro protein digestibility. J. Cereal Sci. 35 (2), 161–174. doi: 10.1006/jcrs.2001.0411
Estell, M., Hughes, J., Grafenauer, S. (2021). Plant protein and plant-based meat alternatives: Consumer and nutrition professional attitudes and perceptions. Sustainability. 13, 1478. doi: 10.3390/su13031478
FDA (2019) GRAS notification for roquette freres pea protein isolate. Available at: https://www.fda.gov/media/134207/download#:~:text=Pea%20Protein%20Isolate%20is%20a,and%2093%25%20PDCAAS%20for%20adults (Accessed 24 October 2022).
Gali, K. K., Sackville, A., Tafesse, E. G., Lachagari, V. B. R., McPhee, K., Hybl, M., et al. (2019). Genome-wide association mapping for agronomic and seed quality traits of field pea (Pisum sativum l.). Front. Plant Sci. 10. doi: 10.3389/fpls.2019.01538
Hang, J., Shi, D., Neufeld, J., Bett, K. E., House, J. D. (2022). Prediction of protein and amino acid contents in whole and ground lentils using near-infrared reflectance spectroscopy. LWT. 165, 113669. doi: 10.1016/j.lwt.2022.113669
Hoffmann, B., Munch, S., Schwagele, F., Neusu, C., Jira, W. (2017). A sensitive HPLC-MS/MS screening method for the simultaneous detection of lupine, pea, and soy proteins in meat products. Food Control 71, 200–209. doi: 10.1016/j.foodcont.2016.06.021
Jiang, G. (2020). Comparison and application of non-destructive NIR evaluations of seed protein and oil content in soybean breeding. Agron. 10 (1), 77. doi: 10.3390/agronomy10010077
Kambhampati, S., Li, J., Evans, B. S., Allen, D. K. (2019). Accurate and efficient amino acid analysis for protein quantification using hydrophilic interaction chromatography coupled tandem mass spectrometry. Plant Methods 15, 46. doi: 10.1186/s13007-019-0430-z
Ketnawa, S., Ogawa, Y. (2021). In vitro protein digestibility and biochemical characteristics of soaked, boiled and fermented soybeans. Sci. Rep. 11, 14257. doi: 10.1038/s41598-021-93451-x
Klein, A., Houtin, H., Rond-Coissieux, C., Naudet-Huart, M., Touratier, M., Marget, P., et al. (2020). Meta-analysis of QTL reveals the genetic control of yield-related traits and seed protein content in pea. Sci. Rep. 10, 15925. doi: 10.1038/s41598-020-72548-9
Leterme, P., Monmart, T., Baudart, E. (1990). Amino acid composition of pea (Pisum sativum) proteins and protein profile of pea flour. J. Sci. Food. Agric. 53, 107–110. doi: 10.1002/jsfa.2740530112
Li, X., Tian, R., Kamala, S., Du, H., Li, W., Kong, Y., et al. (2018). Identification and verification of pleiotropic QTL controlling multiple amino acid contents in soybean seed. Euphytica. 214, 93. doi: 10.1007/s10681-018-2170-y
Ma, Y., Coyne, C. J., Grusak, M. A., Mazourek, M., Cheng, P., Main, D., et al. (2017). Genome-wide SNP identification, linkage map construction and QTL mapping for seed mineral concentrations and contents in pea (Pisum sativum l.). BMC Plant Biol. 17, 43. doi: 10.1186/s12870-016-0956-4
Nosworthy, M. G., Franczyk, A., Zimoch-Korzycka, A., Appah, P., Utioh, A., Neufeld, J., et al. (2017). Impact of processing on the protein quality of Pinto bean (Phaseolus vulgaris) and buckwheat (Fagopyrum esculentum moench) flours and blends, as determined by in vitro and in vivo methodologies. J. Agric. Food Chem. 65 (19), 3919–3925. doi: 10.1021/acs.jafc.7b00697
Nosworthy, M. G., House, J. D. (2017). Factors influencing the quality of dietary proteins: Implications for pulses. Cereal Chem. 94 (1), 49–57. doi: 10.1094/CCHEM-04-16-0104-FI
Pilorge, E., Kezeya, B., Stauss, W., Muel, F., Mergenthaler, M. (2021). Pea and rapeseed acreage and land use for plant-based meat alternatives in the EU. OCL. 28, 54. doi: 10.1051/ocl/2021037
Ravanel, S., Gakiere, B., Job, D., Douce, R. (1998). The specific features of methionine biosynthesis and metabolism in plants. PNAS. 95 (13), 7805–7812. doi: 10.1073/pnas.95.13.7805
Raza, A., Razzaq, A., Mehmood, S. S., Zou, X., Zhang, X., Lv, Y., et al. (2019). Impact of climate change on crops adaptation and strategies to tackle its outcome: A review. Plants 8, 34. doi: 10.3390/plants8020034
Rivera Del Rio, A., Boom, R. M., Janssen, A. E. M. (2022). Effect of fractionation and processing conditions on the digestibility of plant proteins as food ingredients. Foods 11 (6), 870. doi: 10.3390/foods11060870
Sabate, J., Soret, S. (2014). Sustainability of plant-based diets: Back to the future. Am. J. Clin. Nutr. 100, 476–482. doi: 10.3945/ajcn.113.071522
Schaafsma, G. (2005). The protein digestibility-corrected amino acid score (PDCAAS)–a concept for describing protein quality in foods and food ingredients: A critical review. J. AOAC Int. 88 (3), 988–994. doi: 10.1093/jaoac/88.3.988
Sharma, G., Attri, S. V., Behra, B., Bhisikar, S., Kumar, P., Tageja, M., et al. (2014). Analysis of 26 amino acids in human plasma by HPLC using AQC as derivatizing agent and its application in metabolic laboratory. Amino Acids 46, 1253–1263. doi: 10.1007/s00726-014-1682-6
Shi, D. (2021) Effects of genotype, environment and genotype by environment interaction on soybean protein and amino acid content using near-infrared reflectance spectroscopy (Winnipeg: University of Manitoba). Available at: https://mspace.lib.umanitoba.ca/bitstream/handle/1993/35965/Shi_Da.pdf?sequence=1&isAllowed=y (Accessed July 28th, 2022).
Shi, D., Hang, J., Neufeld, J., Zhao, S., House, J. D. (2022). Estimation of crude protein and amino acid contents in whole, ground and defatted ground soybeans by different types of near-infrared (NIR) reflectance spectroscopy. J. Food Compos. Anal. 111, 104601. doi: 10.1016/j.jfca.2022.104601
Srivastava, S., Neerubala, Singh, S., Shamim, M. Z. (2015). Nutritional composition of weaning food using malted cereal and pulses flour for infants. Int. J. Pure App. Biosci. 3 (1), 171–185.
Tar'an, B., Warkentin, T., Somers, D., Miranda, D., Vandenberg, A., Blade, S., et al. (2004). Identification of quantitative trait loci for grain yield, seed protein concentration and maturity in field pea (Pisum sativum l.). Euphytica. 136, 297–306. doi: 10.1023/B:EUPH.0000032721.03075.a0
Tavano, O. L., Neves, V. A., Junior, S. I. S. (2016). In vitro versus in vivo protein digestibility techniques for calculating PDCAAS (protein digestibility-corrected amino acid score) applied to chickpea fractions. Int. Food Res. J. 89 (1), 756–763. doi: 10.1016/j.foodres.2016.10.005
Taylor, S. L., Marsh, J. T., Koppelman, S. J., Kabourek, J. L., Johnson, P. E., Baumert, J. L. (2021). A perspective on pea allergy and pea allergens. Trends Food Sci. Technol. 116, 186–198. doi: 10.1016/j.tifs.2021.07.017
Van Loo, E. J., Caputo, V., Jayson, L., Lusk, J. L. (2020). Consumer preferences for farm-raised meat, lab-grown meat, and plant-based meat alternatives: Does information or brand matter? Food Policy 95) 101931. doi: 10.1016/j.foodpol.2020.101931
Wang, S., Basten, C. J., Zeng, Z. B. (2012). Windows QTL cartographer 2.5. department of statistics (Raleigh, NC: North Carolina State University).
Keywords: pea, methionine, cysteine, tryptophan, lysine, protein digestibility, QTLs
Citation: Zhou J, Wan Z, Gali KK, Jha AB, Nickerson MT, House JD, Tar’an B and Warkentin TD (2023) Quantitative trait loci associated with amino acid concentration and in vitro protein digestibility in pea (Pisum sativum L.). Front. Plant Sci. 14:1083086. doi: 10.3389/fpls.2023.1083086
Received: 28 October 2022; Accepted: 14 February 2023;
Published: 10 March 2023.
Edited by:
Karam B. Singh, Commonwealth Scientific and Industrial Research Organisation (CSIRO), AustraliaReviewed by:
R. Varma Penmetsa, University of California, Davis, CA, United StatesLars Kamphuis, Curtin University, Australia
Copyright © 2023 Zhou, Wan, Gali, Jha, Nickerson, House, Tar’an and Warkentin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Thomas D. Warkentin, dG9tLndhcmtlbnRpbkB1c2Fzay5jYQ==