Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci., 31 July 2024
Sec. Technical Advances in Plant Science
This article is part of the Research Topic Physiology and Breeding of Cereals View all 11 articles

Modeling QTL-by-environment interactions for multi-parent populations

  • 1Biometris, Wageningen University and Research Center, Wageningen, Netherlands
  • 2Rijk Zwaan Breeding B.V., De Lier, Netherlands
  • 3Plant Genetics, NIAB, Cambridge, United Kingdom

Multi-parent populations (MPPs) are attractive for genetic and breeding studies because they combine genetic diversity with an easy-to-control population structure. Most methods for mapping QTLs in MPPs focus on the detection of QTLs in single environments. Little attention has been given to mapping QTLs in multienvironment trials (METs) and to detecting and modeling QTL-by-environment interactions (QEIs). We present mixed model approaches for the detection and modeling of consistent versus environment-dependent QTLs, i.e., QTL-by-environment interaction (QEI). QTL effects are assumed to be normally distributed with variances expressing consistency or dependence on environments and families. The entries of the corresponding design matrices are functions of identity-by-descent (IBD) probabilities between parents and offspring and follow from the parental origin of offspring DNA. A polygenic effect is added to the models to account for background genetic variation. We illustrate the wide applicability of our method by analyzing several public MPP datasets with observations from METs. The examples include diallel, nested association mapping (NAM), and multi-parent advanced inter-cross (MAGIC) populations. The results of our approach compare favorably with those of previous studies that used tailored methods.

1 Introduction

Genotype-by-environment interaction (GEI) implies the differential behavior of genotypes across a range of environmental conditions. Broadly adapted genotypes show stable performance across environmental conditions, whereas narrowly adapted genotypes do well under specific conditions. Adequate description and understanding of GEI patterns is fundamental to the creation of better adapted varieties that comply with environmental and societal challenges. Obviously, adaptation to conditions imposed by climate change will be a major target for many breeding programs.

GEI is a common phenomenon in many crops, and cereals are no exception to that. Typically, a diversity panel is evaluated in a multi-environment trial (MET) or a number of managed stress trials and the GEI is explained in terms of marker or QTL effects that are environment dependent. Some recent examples of this approach using a QTL analysis in the form of a genome-wide association analysis are Bustos-Korts et al. (2019) and Bretani et al. (2022) in barley and Millet et al. (2016) and Shu et al. (2023) in maize. Examples of the same approach with an emphasis on genomic prediction are Burgueño et al. (2012); Lopez-Cruz et al. (2015), and Jarquín et al. (2017) in wheat and Millet et al. (2019) and Barreto et al. (2024) in maize, whereas Cuevas et al. (2017) give examples in wheat and maize. It is less common to use biparental populations for the investigation of the genetic bases of GEI in the form of QTL by environment interactions (QEI). Two somewhat older examples are Boer et al. (2007) for maize and Mathews et al. (2008) for wheat. All the above papers use a linear mixed model approach or a Bayesian equivalent to model the data. For a description of such models in relation to the modeling of GEI and QEI, see van Eeuwijk et al. (2010); Malosetti et al. (2013), and Van Eeuwijk et al. (2016).

The utilization of multi-parent populations (MPPs) in the investigation of QEI offers several advantages. Compared with the use of biparental populations in METs, the higher genetic diversity of MPPs will increase the chance of displaying polymorphisms at QTLs that interact with environments. In comparison with diversity panels, the known pedigree of MPPs alleviates the problems of population structure and minor allele frequencies. A few papers have attempted to analyze multi-environment trials for multi-parent populations (MET&MPP) with often QTL analyses per environment and subsequent comparison of QTL test statistics or −log10(p-value) profiles. An attractive variation on this single-environment single-response QTL analysis was given by Coles et al. (2010) for a maize diallel design where QTLs for the photoperiod sensitivity were mapped using an integrated response variable, i.e., the difference between observations under long-day and short-day conditions was analyzed (Coles et al., 2010). In a maize NAM design, QTL effects for yield were assessed between two weakly correlated trial locations with consideration of the genetic covariance between environments (Garin et al., 2020). Other examples of maize NAM and diallel MET&MPPs (Buckler et al., 2009; Giraud et al., 2014) demonstrated QTL detection for the average response across environments, ignoring QEI. For MAGIC populations, Verbyla et al. (2014) present mixed model technology to study QEI. Puglisi et al. (2021) analyzed a barley MAGIC population in a MET for genomic prediction.

A generic statistical approach for studying QEI for any type of MPP seems to be missing. We propose such an approach in the form of an IBD-based mixed model framework with random QTL effects whose stability depends on both the environment and the family. A background polygenic effect is added that itself can be structured again by family and environment. We illustrate our approach in several MET&MPP cereal datasets: a maize diallel (Coles et al., 2010), two maize NAM designs (Bauer et al., 2013; Giraud et al., 2014; Garin et al., 2020), and a wheat MAGIC population (Scott et al., 2021).

2 Materials and methods

2.1 General framework

Figure 1 illustrates the framework of MET&MPP analysis for investigating QEI. The process begins with the computation of IBD probabilities using pedigree and genome information of parents and progenies. Next, the best linear unbiased estimates (BLUEs) for genotypic means are obtained through single-trial analysis, which includes corrections for block and spatial effects. Subsequently, a MET&MPP analysis is conducted using mixed model approaches to study QEI. An important issue in the building of mixed models for GEI and QEI in METs is the formulation of the variance–covariance (VCOV) structure between trials (Smith et al., 2005; Boer et al., 2007). An informal exploration of this structure is possible via the fitting of a genotype-plus-genotype-by-environment interaction (GGE) model and the corresponding visualization by a GGE biplot (Yan et al., 2000; Yan and Kang, 2002). Details of each step in Figure 1 are described below, and descriptions of the proposed IBD-based MET&MPP analysis using mixed models for QEI are elaborated in section 2.2 Mixed models for IBD-based QTL analysis of METs for MPPs.

Figure 1
www.frontiersin.org

Figure 1 Description of framework for Identity By Descent (IBD)-based QTL analysis of Multi-Environment Trial (MET) data collected for a Multi-Parent Population (MPP) allowing the study of QTL by environment interactions (QEI).

The entries of design matrices, representing genetic predictors for testing QTL main effects and QEI, are derived from marker information and have the form of expected numbers of alleles originating from individual parents. These expected allele numbers are simple functions of IBD probabilities between parents and offspring. IBD probabilities can be calculated using the R package statgenIBD (Boer and van Rossum, 2021) for a wide range of MPP designs, whereas a Mathematica tool called RABBIT can be used for more complex designs (Zheng, 2019). Both RABBIT and statgenIBD employ hidden Markov models (HMM) and inheritance vectors to calculate IBD probabilities based on the pedigree and genome information of parents and offspring (Zheng et al., 2014, 2015; Boer and van Rossum, 2021). In the current paper, we chose to compute IBD probabilities at a grid of 5 cM along the genome for the empirical MET&MPP datasets as a compromise between mapping resolution and computation time. The exception was the analysis of a MAGIC wheat data set for which we defined a grid at 0.5 cM.

The multi-environment phenotypes used as inputs to our MET&MPP models are typically genotypic means for offspring belonging to segregating families evaluated in multiple field trials. Trial-specific genotypic means, which have been corrected for block, row, column, and spatial effects, are calculated as best linear unbiased estimators (BLUEs). The R package SpATS (Rodriguez-Alvarez et al., 2018) offers a convenient and efficient way to convert the trial data at plot level into vectors of adjusted genotypic means. Similarly, the R package statgenSTA (Rossum et al., 2021b) can be employed to compute genotypic BLUEs per trial together with corresponding standard errors.

To explore the heterogeneity of genotypic variances and correlations across trials, we performed a GGE biplot analysis (Yan et al., 2000). The GGE biplot is a rank 2 principal component fit to the genotype-by-trial table of BLUEs in which the first axis is closely related to the genotypic main effects whereas the second axis shows GEI effects. The R package statgenGxE (Rossum et al., 2021a) was used to create the GGE biplots. These plots can be helpful to investigate whether the contributions of individual QTLs to the genetic correlations between trials align with the overall correlations.

2.2 Mixed models for IBD-based QTL analysis of METs for MPPs

2.2.1 Models for QTL effects

Four QTL models are proposed differing in the effect types at a putative QTL in terms of stability across families and environments: consistent across both environments and families (environment-consistent and family-consistent, EC&FC QTL), environment-specific and family-consistent (ES&FC QTL), environment-consistent and family-specific (EC&FS QTL), or environment-specific and family-specific (ES&FS QTL). For MAGIC populations, we consider the consistency and specificity of QTL effects only in relation to the environments.

We elaborate these four QTL models for a MET&MPP with C crosses (or families) derived from P parents across J environments or trials; ncj denotes the number of observations (genotypic BLUEs) for the c-th family in the j-th environment; the total number of observations from the c-th family across all J environments is jncj=nc·; the total number of observations in the j-th environment across all C families is cncj=n·j; so the total number of observations in the MET&MPP is jcncj=cnc·=jn·j=N. In all model descriptions, we present later, matrices (and vectors) are presented in bold font and random terms are underlined. The general mixed model for a single-locus QTL model at the q-th genomic position can be expressed as:

Y_=Xβ+Zqu_q+g_+ϵ_

Y_ is the N×1 column vector for all N observations in a MET&MPP. The fixed part, Xβ, models effects for families and environments and their interactions. The structure of the design matrix for the random QTL effects, Zq, and the vector of QTL effects, u_q, is determined by the effect type at the QTL. The four QTL models have different design matrices Zq and QTL effect vectors u_q and will be described with superscripts to indicate the matrix dimensions to help distinguish the model structures.

In the EC&FC QTL model, the QTL effect at the q-th genomic position is assumed to be stable across environments and families, which is comparable with the generic IBD-based mixed model approach to map additive QTLs in previous studies (Li et al., 2021, 2022):

Zq(N×P)=[πq,1(N×1)πq,k(N×1)πq,P(N×1)],
u_q(P×1)=[a_q,1a_q,ka_q,P]T MVN(0, IPΣq2)
(EC&FC QTL model)

Zq(N×P) is a N×P design matrix that can be partitioned into P column vectors. Each of the P column vectors is a N×1 column vector, πq,k(N×1), containing expected numbers of allelic copies associated with the k-th (k=1,2,, P) parent for all N observations. The P×1 column vector u_q(P×1) contains element a_q,k representing the QTL effect associated with the k-th parent. The variance of u_q(P×1) is IPΣq2, where IP is the P-dimensional identity matrix and Σq2 is the genetic variance for the putative QTL, implying a homogeneous VCOV structure for the QTL effect across all environments and families.

In the ES&FC QTL model, the QTL effect is defined as being unstable across J environments due to QEI but consistent across C families. The design matrix and the QTL effects become:

Zq(N×PJ)=j=1JZq,j(n·j×P)
u_q(PJ×1)=[(u_q,1(P×1))T(u_q,j(P×1))T(u_q,J(P×1))T]TMVN(0,j=1JIPΣq,j2)
(ES&FC QTL model)

Zq(N×PJ) is a N×PJ design matrix that can be split into J diagonal components. Each of these J components is a n·j×P design matrix, Zq,j(n·j×P), whose entries correspond to an n·j×P sub matrix of Zq(N×P), i.e., the entries corresponding to the j-th environment in Zq(N×P) are copied to Zq,j(n·j×P), with n·j being the number of observations in the j-th environment. The vector of effects u_q(PJ×1) is a PJ×1 column vector that can be partitioned into J sub vectors. Each of the J sub vectors is a P×1 column vector u_q,j(P×1) that contains the QTL effects of the P parents in the j-th environment with IPΣq,j2 being the environment-specific QTL variance. The symbol stands for a direct sum (Schott, 2016).

In the EC&FS QTL model, parental QTL effects are specified within each of C biparental families and therefore vary between families, while being stable across all J environments. The design matrix for the QTL effects and the vector of QTL effects are defined as follows:

Zq(N×2C)=c=1C[πq,P1c(nc·×1) πq,P2c(nc·×1)]
u_q(2C×1)=[a_q,P11 a_q,P21a_q,P1c a_q,P2ca_q,P1C a_q,P2C]T~MVN(0,c=1CI2Σq,c2)
(EC&FS QTL model)

Zq(N×2C) is a N×2C design matrix that can be partitioned into C diagonal components. Each of these components contains two nc.×1 column vectors, πq,P1c(nc·×1) and πq,P2c(nc·×1), whose elements indicate the expected numbers of allele copies associated with the two parents of that family, P1c and P2c. The random QTL effects u_q(2C×1) form a 2C×1 column vector associated with the pairs of parents across the families, and for the c-th family, the variance of the QTL effects is I2Σq,c2.

In the ES&FS QTL model, the QTL effects are stable across neither environments nor families, but both are environment and family specific. The ES&FS QTL model is established by merging the ES&FC and EC&FS QTL models. The design matrix for the QTL effects and the corresponding vector of QTL effects is:

Zq(N×2CJ)=j=1Jc=1C[πq,P1c(ncj×1) πq,P2c(ncj×1)],
u_q(2JC×1)=[(u_q,1(2C×1))T(u_q,j(2C×1))T(u_q,J(2C×1))T]TMVN(0,j=1Jc=1CI2Σq,j,c2)
(ES&FS QTL model)

Zq(N×2CJ) is a N×2CJ design matrix with JC diagonal components. Each of these components possesses a pair of ncj×1 column vectors, πq,P1c(ncj×1) and πq,P2c(ncj×1), whose structure resembles that in the EC&FS QTL model, but now the vectors are designated for the j-th environment, where ncj is the number of observations in the c-th family and j-th environment. Remember that j=1Jc=1Cncj=N. The 2JC×1 column vector u_q(2JC×1) can be partitioned into J column vectors with each column vector having the form u_q,j(2C×1), comparable with that in the EC&FS QTL model. For the j-th environment, the variance of the QTL effects across the C families can be written as c=1CI2Σq,j,c2. QTL variances are heterogeneous and depend on environment and family simultaneously.

2.2.2 Models for polygenic effect

The QTL models above can be combined in single- or multi-QTL models with a structured polygenic effect, g_. We write for the VCOV of g_, Σg=ΣMETΣMPP, with ΣMPP defined by the relations between the families and ΣMET by the relations between the environments. For ΣMPP, we allowed for identity, i.e., homogeneity across families (ΣMPPid), heterogeneity across families (ΣMPPidh), or a population structure as a marker-based kinship matrix (ΣMPPkin). For ΣMET, we assumed identity or homogeneity of genetic variance across environments and no genetic correlation between environments (ΣMETid), heterogeneity of genetic variance across environments and no genetic correlation (ΣMETidh), and an unstructured model with environment-specific genetic variances and correlations (ΣMETus). The combination of three structure models for ΣMPP and three structure models for ΣMET generates nine VCOV structures for the polygenic background effect. However, in practice for our data, the polygenic model that worked best in all cases was Σg=ΣMETusΣMPPidh.

2.2.3 Model selection and genome-wide QTL scans

To conduct genome-wide QTL scans, several strategies can be employed. We adopted the following protocol. Based on preliminary analyses, we first chose the VCOV model for the polygenic effects in all genome-wide scans Σg=ΣMETusΣMPPidh. We then conducted four series of full genome scans, one for each type of QTL (EC&FC, ES&FC, EC&FS, and ES&FS). Within each series of scans, we performed significance tests for QTLs using likelihood ratio tests (LRT) for single-variance components. The corresponding p-value followed from an approximation of the test statistic by a mixture of χ2 distributions with 0 and 1 degrees of freedom (Self and Liang, 1987). A conservative Bonferroni-corrected significance threshold was used at a genome-wide level of 0.05. Within a series of scans for a particular QTL type, in the first round, we selected the most significant QTL with the highest −log10(p-value) as a cofactor for a second round. In this second round, an exclusion window of 20 cM around the cofactor was defined in which no tests for QTLs were performed. This is to avoid collinearity problems. The process of identifying cofactors was repeated in subsequent rounds of scans until no further QTLs were found and the −log10(p-value) profile stabilized. This procedure can be thought of as a forward regression variable selection procedure. For each particular type of QTL (EC&FC, ES&FC, EC&FS, and ES&FS), a multi-QTL model was produced that comprised all of the QTLs identified in the series of scans. The QTLs as appearing in the four final multi-QTL models corresponding to the four specific QTL types were then combined within one model. When in this combined model QTLs of different types (EC&FC, ES&FC, EC&FS, and ES&FS) coincided with respect to their position, we assessed for that position the nature of the QTL by comparing AIC values (Akaike, 1974) for models with different formulations for the QTL effect. The model with the smallest AIC value then determined the nature of the QTL effect.

2.3 Data

To test our approach, we collected several empirical MET&MPP datasets (Table 1) including a wheat MAGIC population, a maize diallel, and two maize NAM designs. The wheat MAGIC population was phenotyped in two field trials in the UK in 2017 and 2018. The maize diallel designs were screened in managed stress trials, whereas the two maize NAM designs were screened across several geographic locations in the EU. In this paper, we did not distinguish the two types of trials and use the term “environments” to describe the trial conditions.

Table 1
www.frontiersin.org

Table 1 Overview of MET&MPP data sets. Structure, size, traits, nature of phenotypic data, and reference to earlier data presentations and analyses.

2.3.1 Maize diallel

In the maize diallel design, four biparental families were created by crossing each of two temperate inbred parents to each of two photoperiod-sensitive tropical parents (Coles et al., 2010). Recombinant inbred lines (RILs) were screened under long-day (summer seasons) and short-day (winter seasons) conditions for photoperiod-related traits, e.g., days to silking (DTS) and anthesis (DTA), whose measurements were converted to growing degree days (GDDTS and GDDTA) to account for the influence of temperature. In the previous study (Coles et al., 2010), QTL mapping for photoperiodic responses was performed by calculating the difference in responses between summer and winter conditions. In addition, QTL mapping for the separate conditions was performed by the tool MCQTL 4.0 (Jourjon et al., 2005).

2.3.2 Maize NAM, dent and flint panels

Two maize NAM designs, a flint panel and a dent panel, were taken from the maize EU-NAM project (Bauer et al., 2013; Giraud et al., 2014; Lehermeier et al., 2014). In the flint panel, 11 doubled haploid (DH) families were derived from the central parent UH007 crossed with 11 peripheral parents. Multiple traits such as dry matter yield (DMY), days to silking (DtSILK), and plant height (PH) were measured across six locations in the EU, namely, Wadersloh (Germany), Ploudaniel (France), La Coruña (Spain), Einbeck (Germany), Roggenstein (Germany), and Eckartseier (Germany). In the dent panel, 10 families were derived from the central parent F353 which was crossed with 10 peripheral parents. The same traits as measured in the flint panel were measured again but across four locations only, namely, Wadersloh (Germany), Mons (Germany), Einbeck (Germany), and Roggenstein (Germany).

In previous work by Giraud et al. (2014), combined linkage and linkage disequilibrium mapping was performed for both NAM panels, whereas Garin et al. (2020) used the NAM flint panel as an example for a QEI study using only the Roggenstein and La Coruña locations.

2.3.3 Wheat MAGIC

Construction, genotyping, and phenotyping of the bread wheat (Triticum aestivum L.) cultivars in the NIAB Diverse MAGIC population was previously described (Scott et al., 2021). Briefly, 16 northwestern wheat cultivars were selected based on genetic diversity as founders. These were inter-crossed using a funnel design scheme across four generations, with the outputs of the crossing funnel selfed over six generations to produce 504 recombinant inbred lines (RILs). The founders were genotyped via exome capture and the RILs via whole-genome low-coverage sequencing, allowing 1.1 M high-quality single-nucleotide polymorphisms (SNPs) to be called in the RILs via imputation (Scott et al., 2021). The populations were phenotyped for agronomic traits in two field trials conducted in the United Kingdom in 2017 and 2018, as described by Scott et al. (2021). Although genome wide scans for QEI were performed for all traits recorded in both years, in this paper we will focus on QTLs close to the long-day photoperiod response locus Photoperiod-B1 (Ppd-B1) on chromosome 2B for the traits grain yield (GY), grain protein content (GPC), flag leaf to ear distance (FLED), and height to flag leaf base (HFLB). Earlier QTL analyses on the same data were performed by Scott et al. (2021) for all trait-by-year combinations, where neither QEI issues nor allelic differences between parents were addressed.

3 Results

3.1 Detecting QEI from QTL models

We present the results of QEI analysis for the maize diallel, two maize NAM MET&MPPs, and for the NIAB Diverse wheat MAGIC population. For the maize diallel and NAM populations, we emphasize genome-wide QTL analyses. We also examined the GGE biplot to explore the genotypic correlations between the environments. Environments with weak correlations and therefore high GEI will reveal underlying QEI effects. We inspected environment-specific QTLs to see whether their effect profiles corresponded to the genotypic correlations observed between the environments. For the wheat MAGIC, we concentrated on the interpretation of allele effect profiles for a pleiotropic QTL on chromosome 2B, more specifically the region around the Ppd-B1 locus. We compared our findings with previous studies that employed different methods to investigate QEI.

3.2 Maize diallel design for days to silking and anthesis

Two flowering-related traits, growing degree days to silking (GDDTS) and growing degree days to anthesis (GDDTA), were measured in the summer and winter seasons to evaluate the photoperiod sensitivity under long-day and short-day conditions. For both traits, the correlations between summer and winter conditions were weak (Figures 2A, B).

Figure 2
www.frontiersin.org

Figure 2 Results of QEI analysis of maize diallel. (A) Analysis for growing degree days to anthesis (GDDTA). (B) Analysis for growing degree days to silk (GDDTS). On the left, GGE biplot for exploring genetic correlations between environments. Top right shows the −log10(p) profile for QTL detection with superimposition of QTL allele effect types indicating consistency of allele effects across environments and families. Bottom right shows allele effect profiles with alleles being either consistent or inconsistent across environments and/or families. Environment-consistent QTLs show the same color and intensity across environments. Environment-specific QTLs show environment specific colors and/or intensities. Family consistent QTLs show a parent specific color and intensity that does not depend on the family and that is either consistent across environments or particular to each environment. Family-specific QTLs show colors and/or intensities for parents that change across families.

In the QEI analysis, we found that nearly half of the identified QTLs were environment-specific QTLs for both GDDTS and GDDTA. For the trait GDDTA (Figure 2A), in total 10 QTLs were detected, among which six QTLs were environment specific. Five of the six environment-specific QTLs were comparable with QTLs for photoperiodic responses (calculated as differences of responses between the two seasons) from a previous study (Coles et al., 2010). Particularly strong environment-specific QTLs on chromosomes 8, 9, and 10 were identified with contrasting parental effects in especially the summer season. The same summer season, QTLs were identified by Coles et al. (2010) using single-environment analysis. As for the trait GDDTS (Figure 2B), we identified six environment-specific QTLs out of a total of 13 QTLs, which were in accordance with the QTLs for photoperiodic responses identified by Coles et al. (2010) with strong effects under summer conditions.

For both traits GDDTA and GDDTS, a few environment-consistent QTLs were estimated with specific effects within families (EC&FS QTLs), which implied QTL-by-family background interactions. We did not identify ES&FS QTLs for GDDTA and GDDTS, but analysis of other traits, such as plant height (PH), ear height (EH), and total leaf number (TLN) in this maize diallel MET&MPP, revealed some ES&FS QTLs (results not shown here).

3.3 Maize NAM design (dent panel) for dry matter yield, days to silking, and plant height

The NAM dent panel of maize was screened for important traits including dry matter yield (DMY), plant height (PH), and days to silking (DtSILK) across four locations in the EU. The GGE plots (Figure 3) depict varying levels of genetic correlations between the environments. For DMY, the environments Roggenstein and Mons exhibited a weak correlation, whereas the environments Einbeck and Wadersloh displayed a highly positive correlation. For PH, the Roggenstein environment stood apart with a weak correlation to the remaining environments. Regarding DtSILK, the environments Wadersloh, Mons, and Einbeck formed a cluster with a weak correlation to Roggenstein.

Figure 3
www.frontiersin.org

Figure 3 Results of QEI analysis of maize NAM dent design. (A) Dry matter yield (DMY), (B) Plant height (PH). (C) Days to silk (DtSILK). Interpretation of plots as in Figure 2.

In our QEI analysis, we successfully identified several environment-specific QTLs for each trait. For instance, our methodology detected five environment-specific QTLs out of a total of 13 QTLs for the DMY trait (Figure 3A). The effect profiles of these five environment-specific QTLs exhibited strong variations between the two weakly correlated environments, Mons and Roggenstein. In a separate study, Giraud et al. (2014) conducted combined linkage disequilibrium linkage analysis (LDLA) using adjusted means across all environments. The environment-consistent QTLs identified through our QEI analysis were comparable with those QTLs on chromosome 1, 3, 6, and 7 reported by Giraud et al. (2014). However, it is worth noting that our study revealed environment-specific QTLs on chromosomes 1, 2, and 4 that were not previously reported by Giraud et al. (2014). For the PH trait, we found two environment-specific QTLs out of a total of 13 QTLs (Figure 3B), and these QTLs displayed differential effect profiles between the two weakly correlated environments, Roggenstein and Wadersloh. Regarding the DtSILK trait, our analysis identified five environment-specific QTLs out of a total of 16 QTLs (Figure 3C), which exhibited distinct effect profiles between Roggenstein and the other environments.

For all traits, we detected QTLs with consistent effects across environments that could be either consistent across families as well as specific to families (EC&FC and EC&FS QTLs), but no environment-specific and family-specific (ES&FS) QTLs were found for any of the traits.

3.4 Maize NAM design (flint panel) for dry matter yield, days to silking and plant height

The flint panel of the maize NAM design was screened for DMY, PH, and DtSILK across six locations in the EU. For the DMY trait (Figure 4A), the GGE biplot reveals three clusters: La Coruña, Roggenstein, and the remaining environments. Among the detected nine QTLs, two QTLs on chromosomes 2 and 6 were found to be specific to certain environments. These environment-specific QTLs were previously studied by Garin et al. (2020), who assessed QEI between the weakly correlated environments La Coruña and Roggenstein. Our results corroborate Garin’s work by confirming an environment-specific QTL on chromosome 6 with significant effects in the Roggenstein environment. The QEI signal identified by Garin et al. (2020) on chromosome 5, displaying a relatively weak signal, was not detected in our analysis across the six environments. We discovered an additional environment-specific QTL on chromosome 2, which exhibited different effect profiles between Eckartseier and the other environments. This particular QTL was not reported in the study by Garin et al. (2020), which only considered the locations of Roggenstein and La Coruña.

Figure 4
www.frontiersin.org

Figure 4 Results of QEI analysis of maize NAM flint design. (A) dry matter yield (DMY), (B) plant height (PH), and (C) days to silk (DtSILK). Interpretation of plots as in Figure 2.

For PH (Figure 4B), two QTLs were family-specific while being environment-consistent (EC&FS). For DtSILK (Figure 4C), we even found four such EC&FS QTLs. Furthermore, for this trait, we also identified two QTLs that were both environment- and family-specific (ES&FS).

3.5 NIAB diverse wheat MAGIC

Outcomes of genome wide QTL scans for two grain traits (GY, GPC) and two plant height related traits (HFLB, FLED) are presented in Figure 5. The most remarkable result of these scans is that on chromosome 2B, around 62 Mb, QEI was detected for all four traits with the direction of the allelic effect tending to reverse for the majority of founder alleles between the two seasons (Table 2). Notably, founder Kloka showed the most extreme QEI at this locus for all four traits. With respect to the two grain-related traits, the Kloka allele showed a strong negative effect for GY in 2016–2017, and a light negative effect in 2017–2018, whereas for GPC, the Kloka allele effect was strongly positive in 2016–2017 and lightly positive in 2017–2018. Therefore, a strong negative correlation could be observed between the Kloka allele effects for GY and GPC at this locus, as would be expected from well-established trade-offs between yield and protein content in wheat (e.g., Scott et al., 2021; White et al., 2022). For the plant height traits, the QTL effect from Kloka at this locus on chromosome 2B reduced HLFB across both seasons, but very strongly in 2016–2017 and somewhat in 2017–2018. For FLED, the Kloka allele effect was moderately positive in 2016–2017 and somewhat negative in 2017–2018. Previous studies show that the long-day photoperiod response locus Photoperiod-B1 (Ppd-B1) is located at this approximate genome position and that it affects flowering time in the NIAB Diverse MAGIC population (Scott et al., 2021). Allelic effects at Ppd-B1 are known to be controlled by copy number variation (CNV) in the underlying gene, whereby wild-type photoperiod-sensitive Ppd-B1b alleles are associated with a haploid CNV of 1, whereas photoperiod-sensitive Ppd-B1a alleles that result in earlier flowering are associated with elevated copy number variation (e.g., CNV = 2 in the cultivar Récital, CNV = 3 in Sonora64, and CN0V = 4 in Chinese Spring) (Díaz et al., 2012). To further determine whether allelic variation at Ppd-B1 may underpin the environment-specific interactions at this genomic region for GY, GPD, HFLB, and FLED, we assessed Ppd-B1 CNV in the 16 NIAB Diverse MAGIC founders. Using quantitative TaqMan® assay, we found that whereas 14 founders carried one copy (indicative of wild-type Ppd-B1b alleles), two founders had increased CNV indicative of photoperiod insensitive Ppd-B1a alleles associated with early flowering: Kloka (CNV = 3) and Maris Fundin (CNV = 4).

Figure 5
www.frontiersin.org

Figure 5 Results of QEI analysis in wheat MAGIC for the traits grain yield (GY), grain protein content (GPC), flag leaf to ear distance (FLED), and height to flag leaf base (HFLB).

Table 2
www.frontiersin.org

Table 2 Heat plot for parental allele effects estimated for four traits in wheat MAGIC in two environments (2016–2017, 2017–2018) at chromosome 2B close to PpdB1 locus.

4 Discussion

4.1 QEI analysis by assessing the stability of QTL effects across families and environments

The MET&MPP analysis employs IBD-based mixed model approaches to detect QEI by testing various types of QTL effects across diverse environments and families. Previous studies have discussed the advantages of utilizing multiallelic IBD markers compared with biallelic identity-by-state (IBS) markers (Jurcic et al., 2021; Li et al., 2021), as well as the distinction between modeling QTL terms as random or fixed effects (Boer et al., 2007; Wang et al., 2022). In our current study, we propose fitting functions of IBD probabilities as genetic predictors within the MET&MPP analysis. This approach enables the estimation of multiallelic QTL effects with respect to parent origins, considering the expected numbers of allele copies from each parent (Wei and Xu, 2016; Li et al., 2021). In contrast to modeling fixed QTL effects in the MET&MPP analysis, treating QTL effects as random terms addresses the issue of overfitting the number of genetic parameters, particularly in complex MET&MPPs involving a large number of parents, families, and environments. Our QTL modeling approach allows for defining the nature of random QTL effects, such as EC&FC, ES&FC, EC&FS, and ES&FS, by modeling combinations of homogeneity or heterogeneity of genetic structures corresponding to both environments and families.

EC&FC QTLs can be regarded as the most stable QTLs in the MET&MPP analysis. EC&FC QTLs are comparable with those QTLs identified by an analysis on genotypic means across environments (Giraud et al., 2014; Garin et al., 2020). EC&FC QTLs identified in MET&MPP analyses at particular genomic positions will in general agree with QTLs for the same trait identified in single-environment analyses, like those in Coles et al. (2010), but there is no guarantee that all the QTLs identified in simpler types of analyses will be reproduced by a more complex MET&MPP analysis. EC&FC QTLs tend to be detected with higher power and resolution in MET&MPP analysis than when analyzing separate environments or a genotypic main effect across environments, as the joint analysis of all environments increases the sample size for mapping stable QTLs.

QTLs detected in separate environments with differential mapping profiles are likely to be detected as ES&FC QTLs by MET&MPP analysis. Parental effect profiles of ES&FC QTLs across environments tend to show differences between weakly correlated environments, whereas highly and positively correlated environments rarely convey QEI signals. Positions for ES&FC QTLs in our MET&MPP analyses coincided with QTLs found by a simpler single trait/environment QTL procedure in which the response variable was the difference between two environments, such as in the maize diallel MPP (Coles et al., 2010). However, we remark that with small population sizes, weak QEI signals that occur in a limited number of environments may not be detected as these signals are diluted by the noise in all those environments in which the QTL did not show an effect. For example, in the maize NAM flint panel (811 genotypes across 6 environments), the QEI signal on chromosome 5 specific to the environments La Coruña and Roggenstein (Garin et al., 2020) disappeared in our MET&MPP analysis on the full set of six environments, but this QTL was indeed identified when the MET&MPP analysis was restricted to La Coruña and Roggenstein.

Defining QTL effects to vary across families allows the evaluation of QTL-by-family interactions (QFI) (Jannink and Jansen, 2001; Blanc et al., 2006; Han et al., 2016). For family-specific QTLs that are stable across environments (EC&FS), simple digenic interaction can be investigated for pairs of markers by two-dimensional genome scans in families. Such two-dimensional QTL scans for epistatic interactions require large family sizes for sufficient detection power. For the detection of ES&FS QTLs with family-specific effects that are unstable across environments, even larger families are required. As an illustration of this point, we detected very few ES&FS QTLs in the maize diallel design, with around 140 genotypes per family over two environments, and hardly any ES&FS QTL in the maize NAM design, with around 80 genotypes per family.

4.2 Pleiotropic effects of QTL at Ppd-B1 in the NIAB Diverse MAGIC population

In a previous paper (Scott et al., 2021), it was shown that the photoperiod response locus Ppd-B1 affects the time taken to reach key developmental stages in the NIAB Diverse MAGIC population, including flag leaf emergence (growth stage 39, GS39), ear emergence (GS55), and flowering time (GS65). Under field conditions, the increasing day length across spring and summer help trigger transition from vegetative to reproductive plant growth and rate of progress across all subsequent developmental stages. Thus, allelic variation at Ppd-B1 helps define the timepoints that key development stages are exposed to the prevailing environmental conditions. Here, we identified QEI at a pleiotropic QTL controlling multiple plant height and grain traits at the Ppd-B1 locus, finding QTL effects from the founder Kloka to be both notably strong, and affected by environment. Kloka was unique among the NIAB Diverse MAGIC founders in that it carried three copies of the Ppd-B1 gene, predictive of an early flowering photoperiod insensitive Ppd-B1a allele. Interestingly, no QTL for total plant height was identified in the population at Ppd-B1 (Scott et al., 2021). Together, these results indicate that Ppd-B1a alleles promote flowering and that whereas overall height is not affected, under some growth environments, the Kloka photoperiod-insensitive Ppd-B1a allele results in a shift in plant height ratios between the upper and lower stems and is associated with effects on final grain yield and protein content. In environment 1, height to flag leaf base (HFLB) showed a strong decrease whereas height from flag leaf to ear base (FLEB) exhibited a marked increase, whereas in environment 2 the effects of the Kloka Ppd-B1a allele were much reduced. While the two field trials investigated here were located on the same farm in the UK, analysis of weather data has shown that both years were unusual compared with historical data across the preceding 56 years: environment 1 was characterized by extreme high temperatures and drought in the developmental stages before anthesis (March and April), and extreme precipitation during grain filling, whereas environment 2 was characterized by extreme terminal heat and drought during the grain filling stage (June and July) (Fradgley et al., 2023). Thus, the combined effects of the early flowering Kloka Ppd-B1a allele in environment 1 would have led to earlier development of the reproductive meristem, stem extension, and mid-canopy emergence during extreme temperatures and low precipitation. As temperature was unusually high throughout anthesis and start of grain filling (May–June) in both test environments, and subsequent grain filling stages in environment 1 took place under average temperatures, it is not likely that the known negative impact of high temperatures around anthesis and grain filling on wheat yield (e.g., Djanaguiraman et al., 2020) are the cause of reduced yield and high grain protein content in environment 1. Interestingly, analysis of CNV at Ppd-B1 genes in tetraploid wheat show that the photoperiod-insensitive Ppd-B1b allele results in increased spikelet number per ear (Arjona et al., 2018). This suggests that the pronounced negative effect of allelic variation at Ppd-B1 on yield in environment 1 may have been due to earlier exposure of each developmental stage to the extreme heat and drought conditions across the stem extension phases across which final spikelet number is determined in the shoot apical meristem as it reaches the terminal spikelet phase (as well as possible impacts of changed stem length ratios on later development during grain filling).

In addition to Kloka, the only other founder to have increased Ppd-B1 CNV was Maris Fundin, predicted to carry four copies via TaqMan®. While increased CNV is associated with early flowering photoperiod-insensitive Ppd-B1a alleles in wheat (Díaz et al., 2012; Würschum et al., 2015), other factors may influence the allelic effect. For instance, previous characterization of the photoperiod-insensitive Ppd-B1a allele in Chinese Spring found that it carries four tandemly duplicated copies of the gene: three intact and one truncated (Díaz et al., 2012). Indeed, analysis of three sets of near isogenic lines (NILs) in which photosensitive Ppd-B1a alleles from either Chinese Spring (CNV = 4), Sonora64 (CNV = 3), or Récital (CNV = 2) had been introgressed into a Paragon genetic background found the insensitive allele from Sonora64 to be significantly earlier flowering than that from Chinese Spring (Díaz et al., 2012). This highlights that whereas increased copy number results in an early flowering allele, the allelic effect is not wholly dependent on the number of copies of the genes present and that functionality of the copies present also plays a role. Indeed, whereas we found the NIAB Diverse MAGIC founder Maris Fundin to have four copies of Ppd-B1, analysis of allelic effects found that it did not have as strong an effect on the four target grain and plant height traits compared with the Kloka allele (CNV = 3). Surveys of >1,100 global wheat varieties (Díaz et al., 2012; Kane et al., 2013; Würschum et al., 2015) finds that the occurrence of cultivars with four copies of Ppd-B1 is very rare, having previously been identified in only a limited number of Australian cultivars with pedigree links to Chinese Spring (Kane et al., 2013). Comparison of the pedigree of Maris Fundin (Capelle Desprez × [Vilmorin-29 × Vogel-8058]) × TJB-16 (Fradgley et al., 2019), with cultivars previously assayed for Ppd-B1 CNV, did not enable insight into the parental donor of the Maris Fundin CNV. However, collectively our results indicate that while the Maris Fundin allele characterized by the presence of four Ppd-D1 copies likely confers a photoperiod-insensitive Ppd-B1a allele, its effect is not as strong as the CNV = 3 Kloka allele identified here.

4.3 Conclusion

We introduced a general framework for studying QEI in METs for MPPs. The framework creates design matrices for QTL effects starting from IBD probabilities between parents and offspring for any kind of MPP. The IBD probabilities are combined with a standard procedure for creating factorial interactions between quantitative (IBD for genomic position) and qualitative (environment, family) covariates. In this way, four types of design matrices are produced that correspond to four types of QTL effects: EC&FC, ES&FC, EC&FS, and ES&FS. The effects are assumed to come from normal distributions. Following up on the definition of the QTL models, a relatively simple stepwise protocol is followed to arrive at multi-QTL models with mixtures of QTL effect types at the different loci. We illustrated with various examples the power of our approach to dissect the genetic basis of GEI in any kind of MPP.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: https://git.wur.nl/li178/QxE_MPP.

Author contributions

WL: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft. MB: Conceptualization, Formal analysis, Investigation, Methodology, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. RJ: Conceptualization, Funding acquisition, Resources, Supervision, Writing – review & editing. CZ: Conceptualization, Formal analysis, Investigation, Methodology, Software, Supervision, Writing – review & editing. LPA: Data curation, Investigation, Validation, Writing – original draft. JC: Data curation, Investigation, Validation, Writing – original draft. FV: Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. Rijk Zwaan B.V. provided the funding for WL to work on this topic. JC and LP-A were supported by Biotechnology and Biological Sciences Research Council (BBSRC) grant BB/Y514081/1.

Conflict of interest

Author RJ was employed by Rijk Zwaan Breeding B.V.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Akaike, H. (1974). A new look at the statistical model identification. IEEE Trans. automatic control 19, 716–723. doi: 10.1109/TAC.1974.1100705

CrossRef Full Text | Google Scholar

Arjona, J. M., Royo, C., Dreisigacker, S., Ammar, K., Villegas, D. (2018). Effect of Ppd-A1 and Ppd-B1 allelic variants on grain number and thousand kernel weight of Durum wheat and their impact on final grain yield. Front. Plant Sci. 9, 888. doi: 10.3389/fpls.2018.00888

PubMed Abstract | CrossRef Full Text | Google Scholar

Barreto, C. A. V., das Graças Dias, K. O., de Sousa, I. C., Azevedo, C. F., Nascimento, A. C. C., Guimarães, L. J. M., et al. (2024). Genomic prediction in multi-environment trials in maize using statistical and machine learning methods. Sci. Rep. 14, 1062. doi: 10.1038/s41598-024-51792-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Bauer, E., Falque, M., Walter, H., Bauland, C., Camisan, C., Campo, L., et al. (2013). Intraspecific variation of recombination rate in maize. Genome Biol. 14, R103. doi: 10.1186/gb-2013-14-9-r103

PubMed Abstract | CrossRef Full Text | Google Scholar

Blanc, G., Charcosset, A., Mangin, B., Gallais, A., Moreau, L. (2006). Connected populations for detecting quantitative trait loci and testing for epistasis: An application in maize. Theor. Appl. Genet. 113, 206–224. doi: 10.1007/s00122-006-0287-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Boer, M. P., van Rossum, B. (2021). statgenIBD: Calculation of IBD Probabilities. doi: 10.32614/CRAN.packages

CrossRef Full Text | Google Scholar

Boer, M. P., Wright, D., Feng, L., Podlich, D. W., Luo, L., Cooper, M., et al. (2007). A mixed-model quantitative trait loci (QTL) analysis for multiple-environment trial data using environmental covariables for QTL-by-environment interactions, with an example in maize. Genetics 177, 1801–1813. doi: 10.1534/genetics.107.071068

PubMed Abstract | CrossRef Full Text | Google Scholar

Bretani, G., Shaaf, S., Tondelli, A., Cattivelli, L., Delbono, S., Waugh, R., et al. (2022). Multi-environment genome-wide association mapping of culm morphology traits in barley. Front. Plant Sci. 13, 926277. doi: 10.3389/fpls.2022.926277

PubMed Abstract | CrossRef Full Text | Google Scholar

Buckler, E. S., Holland, J. B., Bradbury, P. J., Acharya, C. B., Brown, P. J., Browne, C., et al. (2009). The genetic architecture of maize flowering time. Science 1979 325, 714–718. doi: 10.1126/science.1174276

CrossRef Full Text | Google Scholar

Burgueño, J., de los Campos, G., Weigel, K., Crossa, J. (2012). Genomic prediction of breeding values when modeling genotype× environment interaction using pedigree and dense molecular markers. Crop Sci. 52, 707–719. doi: 10.2135/cropsci2011.06.0299

CrossRef Full Text | Google Scholar

Bustos-Korts, D., Dawson, I. K., Russell, J., Tondelli, A., Guerra, D., Ferrandi, C., et al. (2019). Exome sequences and multi-environment field trials elucidate the genetic basis of adaptation in barley. Plant J. 99, 1172–1191. doi: 10.1111/tpj.14414

PubMed Abstract | CrossRef Full Text | Google Scholar

Coles, N. D., McMullen, M. D., Balint-Kurti, P. J., Pratt, R. C., Holland, J. B. (2010). Genetic control of photoperiod sensitivity in maize revealed by joint multiple population analysis. Genetics 184, 799–812. doi: 10.1534/genetics.109.110304

PubMed Abstract | CrossRef Full Text | Google Scholar

Cuevas, J., Crossa, J., Montesinos-López, O. A., Burgueño, J., Pérez-Rodríguez, P., de Los Campos, G. (2017). Bayesian genomic prediction with genotype× environment interaction kernel models. G3: Genes Genomes Genet. 7, 41–53. doi: 10.1534/g3.116.035584

CrossRef Full Text | Google Scholar

Díaz, A., Zikhali, M., Turner, A. S., Isaac, P., Laurie, D. A. (2012). Copy number variation affecting the Photoperiod-B1 and Vernalization-A1 genes is associated with altered flowering time in wheat (Triticum aestivum). PloS One 7, e33234. doi: 10.1371/journal.pone.0033234

PubMed Abstract | CrossRef Full Text | Google Scholar

Djanaguiraman, M., Narayanan, S., Erdayani, E., Prasad, P. V. V. (2020). Effects of high temperature stress during anthesis and grain filling periods on photosynthesis, lipids and grain yield in wheat. BMC Plant Biol. 20, 268. doi: 10.1186/s12870-020-02479-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Fradgley, N., Gardner, K. A., Bentley, A. R., Howell, P., Mackay, I. J., Scott, M. F., et al. (2023). Multi-trait ensemble genomic prediction and simulations of recurrent selection highlight importance of complex trait genetic architecture for long-term genetic gains in wheat. silico Plants 5, diad002. doi: 10.1093/insilicoplants/diad002

CrossRef Full Text | Google Scholar

Fradgley, N., Gardner, K. A., Cockram, J., Elderfield, J., Hickey, J. M., Howell, P., et al. (2019). A large-scale pedigree resource of wheat reveals evidence for adaptation and selection by breeders. PloS Biol. 17, e3000071. doi: 10.1371/journal.pbio.3000071

PubMed Abstract | CrossRef Full Text | Google Scholar

Garin, V., Malosetti, M., van Eeuwijk, F. (2020). Multi-parent multi-environment QTL analysis: an illustration with the EU-NAM Flint population. Theor. Appl. Genet. 133, 2627–2638. doi: 10.1007/s00122-020-03621-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Giraud, H., Lehermeier, C., Bauer, E., Falque, M., Segura, V., Bauland, C., et al. (2014). Linkage disequilibrium with linkage analysis of multiline crosses reveals different multiallelic QTL for hybrid performance in the flint and dent heterotic groups of maize. Genetics 198, 1717–1734. doi: 10.1534/genetics.114.169367

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, S., Utz, H. F., Liu, W., Schrag, T. A., Stange, M., Würschum, T., et al. (2016). Choice of models for QTL mapping with multiple families and design of the training set for prediction of Fusarium resistance traits in maize. Theor. Appl. Genet. 129, 431–444. doi: 10.1007/s00122-015-2637-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Jannink, J. L., Jansen, R. (2001). Mapping epistatic quantitative trait loci with one-dimensional genome searches. Genetics 157, 445–454. doi: 10.1093/genetics/157.1.445

PubMed Abstract | CrossRef Full Text | Google Scholar

Jarquín, D., Lemes da Silva, C., Gaynor, R. C., Poland, J., Fritz, A., Howard, R., et al. (2017). Increasing genomic-enabled prediction accuracy by modeling genotype× environment interactions in Kansas wheat. Plant Genome 10, pp.plantgenome2016–12. doi: 10.3835/plantgenome2016.12.013

CrossRef Full Text | Google Scholar

Jourjon, M. F., Jasson, S., Marcel, J., Ngom, B., Mangin, B. (2005). MCQTL: Multi-allelic QTL mapping in multi-cross design. Bioinformatics 21, 128–130. doi: 10.1093/bioinformatics/bth481

PubMed Abstract | CrossRef Full Text | Google Scholar

Jurcic, E. J., Villalba, P. V., Pathauer, P. S., Palazzini, D. A., Oberschelp, G. P. J., Harrand, L., et al. (2021). Single-step genomic prediction of Eucalyptus dunnii using different identity-by-descent and identity-by-state relationship matrices. Heredity 127, 176–189. doi: 10.1038/s41437-021-00450-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Kane, H., Eagles, H. A., Laurie, D. A., Trevaskis, B., Vallance, N., Eastwood, R. F., et al. (2013). Ppd-B1 and Ppd-D1 and their effects in southern Australian wheat. Crop Pasture Sci. 64, 100–114. doi: 10.1071/CP13086

CrossRef Full Text | Google Scholar

Lehermeier, C., Krämer, N., Bauer, E., Bauland, C., Camisan, C., Campo, L., et al. (2014). Usefulness of multiparental populations of maize (Zea mays L.) for genome-based prediction. Genetics 198, 3–16. doi: 10.1534/genetics.114.161943

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, W., Boer, M. P., van Rossum, B.-J., Zheng, C., Joosen, R. V. L., Van Eeuwijk, F. A. (2022). statgenMPP: an R package implementing an IBD-based mixed model approach for QTL mapping in a wide range of multi-parent populations. Bioinformatics 38(22):5134–5136. doi: 10.1093/bioinformatics/btac662

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, W., Boer, M. P., Zheng, C., Joosen, R. V. L., van Eeuwijk, F. A. (2021). An IBD-based mixed model approach for QTL mapping in multiparental populations. Theor. Appl. Genet. 1, 1–18. doi: 10.1007/s00122-021-03919-7

CrossRef Full Text | Google Scholar

Lopez-Cruz, M., Crossa, J., Bonnett, D., Dreisigacker, S., Poland, J., Jannink, J. L., et al. (2015). Increased prediction accuracy in wheat breeding trials using a marker× environment interaction genomic selection model. G3: Genes Genomes Genet. 5, 569–582. doi: 10.1534/g3.114.016097

CrossRef Full Text | Google Scholar

Malosetti, M., Ribaut, J. M., van Eeuwijk, F. A. (2013). The statistical analysis of multi-environment data: Modeling genotype-by-environment interaction and its genetic basis. Front. Physiol. 44. doi: 10.3389/fphys.2013.00044

CrossRef Full Text | Google Scholar

Mathews, K. L., Malosetti, M., Chapman, S., McIntyre, L., Reynolds, M., Shorter, R., et al. (2008). Multi-environment QTL mixed models for drought stress adaptation in wheat. Theor. Appl. Genet. 117, 1077–1091. doi: 10.1007/s00122-008-0846-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Millet, E. J., Kruijer, W., Coupel-Ledru, A., Alvarez Prado, S., Cabrera-Bosquet, L., Lacube, S., et al. (2019). Genomic prediction of maize yield across European environmental conditions. Nat. Genet. 51, 952–956. doi: 10.1038/s41588-019-0414-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Millet, E. J., Welcker, C., Kruijer, W., Negro, S., Coupel-Ledru, A., Nicolas, S. D., et al. (2016). Genome-wide analysis of yield in Europe: allelic effects vary with drought and heat scenarios. Plant Physiol. 172, 749–764. doi: 10.1104/pp.16.00621

PubMed Abstract | CrossRef Full Text | Google Scholar

Puglisi, D., Visioni, A., Ozkan, H., Casas, A. M., Igartua, E., Valè, G., et al. (2021). Genomic prediction of grain yield in a barley MAGIC population modeling genotype per environment interaction. Front. Plant Sci. 12, 664148. doi: 10.3389/fpls.2021.664148

PubMed Abstract | CrossRef Full Text | Google Scholar

Rodriguez-Alvarez, M. X., Boer, M. P., van Eeuwijk, F. A., Eilers, P. H. (2018). Correcting for spatial heterogeneity in plant breeding experiments with P-splines. Spatial Stat 23, 52–71. doi: 10.1016/j.spasta.2017.10.003

CrossRef Full Text | Google Scholar

Rossum, B.-J., van, Eeuwijk, F., van, Boer, M., Malosetti, M., Bustos-Korts, D., Millet, E. J., et al. (2021a). statgenGxE: Genotype by Environment (GxE) Analysis. https://CRAN.R-project.org/package=statgenGxE.

Google Scholar

Rossum, B.-J., van, Eeuwijk, F., van, Boer, M. P., Malosetti, M., Bustos-Korts, D., Millet, E. J., et al. (2021b). statgenSTA: Single Trial Analysis (STA) of Field Trials. https://CRAN.R-project.org/package=statgenSTA.

Google Scholar

Schott, J. R. (2016). Matrix analysis for statistics (John Wiley & Sons).

Google Scholar

Scott, M. F., Fradgley, N., Bentley, A. R., Brabbs, T., Corke, F., Gardner, K. A., et al. (2021). Limited haplotype diversity underlies polygenic trait architecture across 70 years of wheat breeding. Genome Biol. 22, 137. doi: 10.1186/s13059-021-02354-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Self, S. G., Liang, K. Y. (1987). Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. J. Am. Stat. Assoc. 82, 605–610. doi: 10.1080/01621459.1987.10478472

CrossRef Full Text | Google Scholar

Shu, G., Wang, A., Wang, X., Chen, R., Gao, F., Wang, A., et al. (2023). Identification of QTNs, QTN-by-environment interactions for plant height and ear height in maize multi-environment GWAS. Front. Plant Sci. 14, 1284403. doi: 10.3389/fpls.2023.1284403

PubMed Abstract | CrossRef Full Text | Google Scholar

Smith, A. B., Cullis, B. R., Thompson, R. (2005). The analysis of crop cultivar breeding and evaluation trials: An overview of current mixed model approaches. J. Agric. Sci. 143, 449–462. doi: 10.1017/S0021859605005587

CrossRef Full Text | Google Scholar

van Eeuwijk, F. A., Bink, M. C., Chenu, K., Chapman, S. C. (2010). Detection and use of QTL for complex traits in multiple environments. Curr. Opin. Plant Biol. 13, 193–205. doi: 10.1016/j.pbi.2010.01.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Eeuwijk, F. A., Bustos-Korts, D. V., Malosetti, M. (2016). What should students in plant breeding know about the statistical aspects of genotype × Environment interactions? Crop Sci. 56, 2119–2140. doi: 10.2135/cropsci2015.06.0375

CrossRef Full Text | Google Scholar

Verbyla, A. P., Cavanagh, C. R., Verbyla, K. L. (2014). Whole-genome analysis of multienvironment or multitrait QTL in MAGIC. G3: Genes Genomes Genet. 4, 1569–1584. doi: 10.1534/g3.114.012971

CrossRef Full Text | Google Scholar

Wang, S., Xie, F., Xu, S. (2022). Estimating genetic variance contributed by a quantitative trait locus: A random model approach. PloS Comput. Biol. 18, e1009923. doi: 10.1371/journal.pcbi.1009923

PubMed Abstract | CrossRef Full Text | Google Scholar

Wei, J., Xu, S. (2016). A random-model approach to QTL mapping in multiparent advanced generation intercross (MAGIC) populations. Genetics 202 (2), 471–486.

PubMed Abstract | Google Scholar

White, J., Sharma, R., Balding, D., Cockram, J., Mackay, I. J. (2022). Genome-wide association mapping of Hagberg falling number, protein content, test weight, and grain yield in U.K. wheat. Crop Sci. 62, 965–981. doi: 10.1002/csc2.20692

PubMed Abstract | CrossRef Full Text | Google Scholar

Würschum, T., Boeven, P. H., Langer, S. M., Longin, C. F., Leiser, W. L. (2015). Multiply to conquer: Copy number variations at Ppd-B1 and Vrn-A1 facilitate global adaptation in wheat. BMC Genet. 16, 96. doi: 10.1186/s12863-015-0258-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, W., Hunt, L. A., Sheng, Q., Szlavnics, Z. (2000). Cultivar evaluation and mega-environment investigation based on the GGE biplot. Crop Sci. 40, 597–605. doi: 10.2135/cropsci2000.403597x

CrossRef Full Text | Google Scholar

Yan, W., Kang, M. S. (2002). GGE biplot analysis: A graphical tool for breeders, geneticists, and agronomists (CRC press). doi: 10.1201/9781420040371

CrossRef Full Text | Google Scholar

Zheng, C. (2019). RABBIT (v3.2) Manual.

Google Scholar

Zheng, C., Boer, M. P., Van Eeuwijk, F. A. (2014). A general modeling framework for genome ancestral origins in multiparental populations. Genetics 198, 87–101. doi: 10.1534/genetics.114.163006

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng, C., Boer, M. P., van Eeuwijk, F. A. (2015). Reconstruction of genome ancestry blocks in multiparental populations. Genetics 200, 1073–1087. doi: 10.1534/genetics.115.177873

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: multi-parent population, diallel, NAM, MAGIC, QTL-by-environment interaction, multienvironment trial, maize, wheat

Citation: Li W, Boer MP, Joosen RVL, Zheng C, Percival-Alwyn L, Cockram J and Van Eeuwijk FA (2024) Modeling QTL-by-environment interactions for multi-parent populations. Front. Plant Sci. 15:1410851. doi: 10.3389/fpls.2024.1410851

Received: 01 April 2024; Accepted: 27 June 2024;
Published: 31 July 2024.

Edited by:

Roxana Savin, Universitat de Lleida, Spain

Reviewed by:

Ernesto Igartua, Spanish National Research Council (CSIC), Spain
Salar Shaaf, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Germany

Copyright © 2024 Li, Boer, Joosen, Zheng, Percival-Alwyn, Cockram and Van Eeuwijk. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Fred A. Van Eeuwijk, fred.vaneeuwijk@wur.nl

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.