Skip to main content

ORIGINAL RESEARCH article

Front. Genet. , 03 March 2025

Sec. Genomics of Plants and the Phytoecosystem

Volume 16 - 2025 | https://doi.org/10.3389/fgene.2025.1484146

Identification of QTL for branch traits in soybean (Glycine max L.) and its application in genomic selection

Qichao Yang&#x;Qichao Yang1Jing Wang,&#x;Jing Wang1,2Yajun Xiong,&#x;Yajun Xiong1,2Alu Mao,Alu Mao1,2Zhiqing Zhang,Zhiqing Zhang1,2Yijie Chen,Yijie Chen1,2Shirui Teng,Shirui Teng1,2Zhiyu LiuZhiyu Liu1Jun Wang,
Jun Wang1,2*Jian Song
Jian Song1*Lijuan Qiu,,,
Lijuan Qiu2,3,4,5*
  • 1MARA Key Laboratory of Sustainable Crop Production in the Middle Reaches of the Yangtze River (Co-construction by Ministry and Province), College of Agriculture, Yangtze University, Jingzhou, China
  • 2The Shennong Laboratory, Zhengzhou, Henan, China
  • 3State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  • 4National Key Facility for Gene Resources and Genetic Improvement / Key Laboratory of Crop Germplasm Utilization, Ministry of Agriculture and Rural Affairs / Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
  • 5National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China

Introduction: Branches are important for soybean yield, and previous studies examining branch traits have primarily focused on branch number (BN), while research assessing branch internode number (BIN), branch length (BL), and branch internode length (BIL) remains insufficient.

Methods: A recombinant inbred line (RIL) population consisting of 364 lines was constructed by crossing ZD41 and ZYD02878. Based on the RIL population, we genetically analyzed four branch traits using four different GWAS methods including efficient mixed-model association expedited, restricted two-stage multi-locus genome-wide association analysis, trait analysis by association, evolution and linkage, and three-variance-component multi-locus random-SNP-effect mixed linear model analyses. Additionally, we screened candidate genes for the major QTL and constructed a genomic selection (GS) model to assess the prediction accuracy of the four branch traits.

Results and Discussion: In this study, four branch traits (BN, BIN, BL, and BIL) were phenotypically analyzed using the F6-F9 generations of a RIL population consisting of 364 lines. Among these four traits, BL exhibited the strongest correlation with BIN (0.92), and BIN exhibited the strongest broad-sense heritability (0.89). Furthermore, 99, 43, 50, and 59 QTL were associated with BN, BIN, BL, and BIL, respectively, based on four different methods, and a major QTL region (Chr10:45,050,047..46,781,943) was strongly and simultaneously associated with all four branch traits. For the 207 genes within this region, nine genes were retained as candidates after SNP variation analysis, fixation index (FST), spatial and temporal expression analyses and functionality assessment that involved the regulation of phytohormones, transcription factors, cell wall and cell wall cellulose synthesis. Genomic selection (GS) prediction accuracies for BN, BIN, BL, and BIL in the different environments were 0.59, 0.49, 0.48, and 0.56, respectively, according to GBLUP. This study lays the genetic foundation for BN, BIN, BL, and BIL and provides a reference for functional validation of regulatory genes in the future.

1 Introduction

Soybeans (Glycine max L. Merrill) are rich in protein and oil and are one of the major global crops in the world, and they play a key role in the food and feed industries. The soybean branch is an important component of the soybean plant and affects soybean yield (Guo and Guo, 2021; Li et al., 2021). Furthermore, branch yield was much higher than was main stem yield in certain soybean cultivars (Board, 1987). Soybean branch traits affect soybean yield by influencing photosynthesis and assimilation product distribution in the aboveground canopy of soybeans and are positively correlated with soybean yield per plant (Li et al., 2011; Qiao et al., 2016). The effective branch number (BN), branch internode number (BIN), branch length (BL), and branch internode length (BIL) were the four subtraits of soybean branches (Guo et al., 2015; Qiao et al., 2016; Carpenter and Board, 1997). Modern cultivars have been domesticated from wild soybeans (Glycine soja Sieb. and Zucc.); however, many traits were attenuated during this process. For example, high protein content and stress resistance due to a reduction in genetic diversity and the utilization of elite traits from wild soybeans have both become valid approaches to improve modern cultivars (Jin et al., 2003; He et al., 2014; Nawaz et al., 2018; Wang et al., 2019).

Genetically, BN, BIN, BL, and BIL are controlled by multiple loci, and 21 quantitative trait locis (QTL) associated with branching have been identified on chromosomes 4, 5, 6, 10, 11, 12, 14, 15, 17, 18, and 19 (https://www.soybase.org/). The branching-related QTL qBN-1 on chromosome 6 (R2 = 22.69%) was identified in the segregating populations of high-branching and low-branching cultivars (Lamlom et al., 2020). Using cultivars from different genetic backgrounds and 7,189 single nucleotide polymorphisms (SNPs), the branch number-related QTL qBR6_1 was mapped to chromosome 6 near the E1 gene, which controls flowering time, thus suggesting a possible pleiotropic effect for E1 (Yang et al., 2017). Among the ten QTL associated with branch number that were identified in the three populations, QTL-qBN.C2 (R2 = 33.3%) was identified as the major QTL. Glyma.06G188400 within this QTL may participate in branching development by regulating the axillary meristematic tissue (Yang et al., 2021). The gene Glyma.06G210600 located on qBR6-1 was believed to be a candidate gene for branch number, as it encodes a TEOSINTE-BRANCHED1/CYCLOIDEA/PROLIFERATING CELL FACTORS 1 and 2 (TCP) transcription factor that may be involved in the regulatory network of branching and growth (Shim et al., 2017; Shim et al., 2019). Overexpression of GmmiR156b increased the number of branches and resulted in decreased expression of GmSPL9, thus indicating that the GmmiR156-SPL module is involved in branching regulation (Bao et al., 2019; Sun et al., 2019). The ectopic expression of GmMYB181, a soybean R2R3-MYB transcription factor, in Arabidopsis increases branch number, possibly through shoot development or hormone signaling pathways (Yang et al., 2018). Additionally, SoyZH13_18g242900, also known as Dt2, was highly expressed in the lateral shoots and shoot tips (Liang et al., 2022). Abr11 was localized as a QTL associated with the average BL (La, 2018). Nonetheless, seldom studies have been focused on BIN and BIL.

Genomic selection (GS) is an effective tool for improving breeding selection efficiency and shortening the breeding cycle, and it has been widely applied in the context of quantitative traits of both animals and crops (Duhnen et al., 2017). Genomic prediction accuracy is affected by additive effects, model selection, population type, marker density, gene effects, heritability, genetic architecture, and the extent and distribution of linkage disequilibrium (LD) between markers and QTL (Desta and Ortiz, 2014). Among these GS methods, the random regression best linear unbiased predictor (rrBLUP) and genomic best linear unbiased predictor (GBLUP) are important and popular (Rabier et al., 2016). In soybeans, the phenotypic variations in seed protein content and yield were predicted using the GBLUP model, and it was observed that the modeling of additive-by-additive epistasis possessed a higher prediction accuracy than did the modeling of additive effects (Duhnen et al., 2017). Additionally, the genomic selction of soybean proteins using different SNPs revealed that the prediction accuracy was higher when using major SNPs or when increasing the density of SNPs (Qin et al., 2022).

In this study, a recombinant inbred line (RIL) population derived from a cross between Zhongdou 41 (ZD41) and ZYD02878 was used to explore the genetic mechanisms of four branch traits, e.g., branch number (BN), branch internode number (BIN), branch length (BL), branch internode length (BIL), in soybeans. A total of 251 different QTL related to those four branch traits were identified using four different methods, and possible candidate genes were analyzed. Based on these results, the prediction accuracies of these traits were estimated and compared. Our results provide a solid foundation for elucidating these genetic mechanisms.

2 Materials and methods

2.1 Plant materials

In the summer of 2015, ZD41 (♀) and ZYD02878 (♂) were crossed and yielded a hybrid population at the Crop Science Experimental Base of Yangtze University in Jingzhou City, Hubei Province (112.06°E, 30.37°N), and a RIL population consisting of 364 lines was constructed by single seed descent (SSD) from the F2-F5 generations (Chen et al., 2023).

The F6-F8 RIL population was planted during 2019–2020 in Jingzhou (JZ) and Sanya (SY) according to a previous report (Chen et al., 2023), and F9 was planted in 2021 at Ajian Farm, Zhengji Township, Shangqiu, Henan Province (115.98°E,34.41°N) from June to October. Hereafter, the four different environments are designated 19JZ, 19SY, 20JZ, and 21SQ.

2.2 Phenotyping and statistics

The effective BN of each plant was scored according to the Descriptors and Data Standards for Soybeans (Glycine spp.) (Qiu and Chang, 2006). The BIN and BL values of each effective branch were measured. BIL data are obtained from the raw data of the BL and BIN that are presented as BL/BIN. BN was scored in four environments (19JZ, 19SY, 20JZ, and 21SQ), and three environments (19JZ, 19SY, and 20JZ) were used for BL, BIN, and BIL scoring. The data for each environment consisted of two parents and a RIL populations consisting of 364 lines.

The 1.5×interquartile range (IQR) and 3-σ principle were applied to exclude the outliers when the mean values for BL, BIN, and BIL of an individual plant, technical replicates of each RIL line, and biological replicates of different environments were calculated.

Descriptive statistics, correlation, and normality analyses were performed using the R package lme4, and the broad-sense heritability (h2) was calculated as described by Chen et al. (2023).

2.3 Genotyping and genetic analysis

Genomic DNA was isolated from young fresh leaves using cetyltrimethylammonium bromide (CTAB) method (Rogers and Bendich, 1988), and then “ZDX1” (an Illumina soybean 200 K gene chip) was adopted for genotyping (Sun et al., 2022). Raw genotype was filtered according to MAF <0.05 and integrity >0.8 using PLINK2.0 (Wen et al., 2014), and resulted in a total of 117,772 high quality SNPs. Then, 6,098 bin markers were constructed using SNPbinner and high-quality SNPs (Gonda et al., 2019).

To identify QTL associated with BN, BIN, BL, and BIL, four different methods were used. These included Efficient Mixed-Model Association expedited (EMMAX) (Xue et al., 2013), Restricted Two-stage Multi-locus Genome-Wide association analysis (RTM-GWAS) (He and Gai, 2020), Trait Analysis by association, Evolution and Linkage (TASSEL), and three-variance-component multi-locus random-SNP-effect mixed linear model analyses (3VmrMLM) (Leamy et al., 2017; Zuo et al., 2022).

The kinship matrix (relatedness) was calculated using emmax-intel64 based on 6,098 bin markers and was applied to correct for population structure and relatedness in the mixed linear models (Zhao et al., 2020). For GWAS analysis using TASSEL, the first 20 principal components (PCs) was calculated using GCTA, kinship was calculated using TASSEL, and both the 20 PCs and kinship were introduced as covariates in the association study using the Mixed Linear Model (MLM). Based on the Bonferroni method, the significance threshold for EMMAX and RTM-GWAS was determined to be 1/m, where m is the number of SNPs. To perform RTM-GWAS, 4,715 SNP linkage disequilibrium blocks (SNPLDBs) were first inferred with a MAF of 0.01 and a maximum block length of 10,000. The GSC matrix was then calculated for QTL detection. In order to balance false positives and false negatives in hypothesis testing, RTM-GWAS commonly use 0.01 as the statistical significance level to control the whole test error rate. Based on suggestion of 3VmrMLM method developer, single-environment analysis was performed using a 3 variance-component multi-locus random SNP effects mixed linear model constructed using 3VmrMLM with CriLOD = 3 to identify significantly associated bins. Besides, QTL regions of EMMAX and TASSEL are defined by 100 kb up and downstream of the significantly associated SNPs.

Based on the identified QTL, the LD between QTL was calculated using PLINK and R, and QTL with an average >0.9 in the region were recognized as co-localized QTL. Unique QTL are contiguous regions of co-localized QTL that are strung together by multiple adjacent or overlapping co-localized QTL.

2.4 Candidate gene identification

The QTL regions most likely associated with branch traits were identified by analyzing the frequency and effects of the mapped QTL. The significance of EMMAX was subsequently set to 0.01 to narrow the QTL region. Genes within associated regions were first annotated by SoyBase (https://soybase.org) and phytozome (https://phytozome-next.jgi.doe.gov/info/Gmax_Wm82_a2_v1). Single nucleotide polymorphism (SNP) variation, genetic differentiation analysis, and temporal and spatial tissue-specific gene expression pattern analysis were performed to narrow the candidate gene list. For genetic differentiation analysis, the fixation index (FST) was calculated from published genome sequence data using vcftools (0.1.13) with a 100 bp window size, and potentially domesticated genes were defined if the FST within coding regions was larger than 0.6 (Danecek et al., 2011; Song et al., 2013; Li et al., 2023). Gene expression data regarding axillary meristem, flower, leaf, meristem, nodule, pod, root, root hair, shoot apex and shoot apical meristem (SAM) were extracted from the PPRD database, and heatmaps were constructed using the R package ‘pheatmap’ (Shim et al., 2017; Shim et al., 2019).

2.5 Genomic prediction

Based on the bin markers and associated markers using four different methods, three different marker sets were constructed. These included (1) 6,098 genome-wide markers (GWM), (2) 400 trait-associated markers (TAM), and (3) 5,698 genome-wide unassociated markers (GWUM) (Chen et al., 2023). The trait-associated markers are SNP markers mapped genome-wide by the QTL for the branch-associated traits that we have located, and because of the overlap and size of the regions between the QTL that we have located, these QTL cover a total of 400 SNP regions. To compare the effects of trait-associated regions on GS precision, three marker sets in the GBLUP model were used as random effects, and a five-fold cross-validation and 20 replications were performed to measure the average prediction precision of each GS model. To ensure the reproducibility of the results, the same random seeds were used in 20 replicates. The GBLUP model was constructed using the R package rrBLUP.

3 Results

3.1 Phenotype analysis of BN, BIN, BL and BIL

Based on the 3-σ principle and 1.5×interquartile range (IQR), the outliers of BN, BIN, BL, and BIL and the mean in each environment were removed and analyzed with descriptive statistics, and it was observed that normal or nearly normal distribution was observed in the RIL population regarding the four traits in different environments (Table 1; Figure 1; Supplementary Figure S1). For each trait, BN, BIN, BL, and BIL values ranged between 0-28.33, 2.00-19.78, 2.30–152.66 cm, and 0.78–10.69 cm, respectively. Transgressive segregation occurs universally for different traits under different environments. In general, considerably more individuals exceeded the higher-value parent than they did the lower-value parent in BN, BIN, and BL; however, an opposite trend was observed for 19JZ and 19SY of BIL and 21SQ of BN. Correlation analysis revealed significant correlations among all four traits (BN, BIN, BL, and BIL). BIN and BL exhibited the highest correlation (Pearson’s coefficient r = 0.92, P < 0.001), and this was followed by BL and BIL (r = 0.80, P < 0.001) and BN and BL (r = 0.64, P < 0.001). The lowest correlation was observed between BN and BIL (Figure 1E). Additionally, all four traits were significantly correlated in each environment; however, the four traits in 19SY were less correlated with other environments (Figures 1A–D). The BIN in different environments exhibited the highest correlation with BN, BL, and BIL (Figures 1A–D). The broad-sense heritability of BN, BIN, BL, and BIL was 0.88, 0.89, 0.75, and 0.85 (Table 2), respectively, thus indicating that the environment exerted greater impact on BL than it did on the other traits.

Table 1
www.frontiersin.org

Table 1. Descriptive statistics of BN, BIN, BL, and BIL in different environments.

Figure 1
www.frontiersin.org

Figure 1. Correlation analysis of BN, BIN, BL, and BIL in different environments. (A–E) correlation of BN, BIN, BL, BIL, and mean value of different environments. *, **, *** represents significance level of 0.05, 0.01, and 0.001 respectively.

Table 2
www.frontiersin.org

Table 2. Variance components and broad sense heritability of different traits.

3.2 Genetic analysis of branch related traits

A total of 12, 3, 3, and 4 unique QTL regions that were significantly associated with BN, BIN, BL, and BIL, respectively, were identified by EMMAX using the phenotypes of the different environments (19JZ, 19SY, 20JZ, and 21SQ, respectively) and the means (Supplementary Table S4). These QTL were distributed on eight different chromosomes (2, 3, 7, 10, 11, 12, 13, and 19) and were predominantly concentrated on chromosomes 10 and 19, and they accounted for 64.10% of the total significant regions (Supplementary Table S1). Of all QTL identified in the different environments, the peak SNP of the most significant QTL region Chr12:2,621,904..5,817,926 for BN was Chr12: 5,396,475 with a -log10(P) of 6.59 (Supplementary Figure S2; Supplementary Table S2; Supplementary Table S4). Another QTL on chromosome 10 (Chr10:43,366,437..46,985,577) was determined to be not only significantly associated with BIN but also with BL, and both shared the same peak SNP of Chr10:45, 290, 023 (Supplementary Figure S2; Supplementary Table S2; Supplementary Table S4). This region for BIN and BL was repeatedly identified at 19 JZ, 20 JZ, and at the mean (Supplementary Table S1). Among the QTL associated with BIL, the major QTL was mapped to the region of Chr13:37,175,077..39,002,308, within which the peak SNP was Chr13:38, 260, 409 with a -log10(P) of 5.65 (Supplementary Figure S2; Supplementary Table S2; Supplementary Table S4).

A total of 11 significant QTL regions (six, two, two, and one QTL for BN, BIN, BL, and BIL, respectively) as detected by TASSEL were distributed on six chromosomes (3, 10, 11, 12, 13, and 19) (Supplementary Table S4). Similar to the QTL identified by EMMAX, most of the QTL identified by TASSEL were located on chromosomes 10 and 19. Among the six QTL associated with BN, the most significant was Chr12:3,633,162..6,216,535 that was identified from the mean and yielded a phenotypic variance explained (PVE) of 7.18% (Supplementary Figure S3; Supplementary Table S1; Supplementary Table S4). The QTL regions for BL were located on chromosomes 10 and 11, and the region on chromosome 10 (Chr10:42,222,333..45,658,686) was identified by 20JZ (Supplementary Table S1; Supplementary Table S4). The mean was also mapped by BIN (Supplementary Table S1; Supplementary Table S4). For BIL, only one region of Chr13:37,360,709..39,002,308 was identified by TASSEL from 20 JZ and the mean; however, this region was not identified in BN, BIN, or BL (Supplementary Table S1; Supplementary Table S4).

Based on 4,715 SNPLDBs, 51, 22, 30, and 39 unique QTL associated with BN, BIN, BL, and BIL, respectively, were identified on all 20 chromosomes (Supplementary Table S4). The PVE of each QTL ranged from 1.54% to 21.06%, and this explained 26.02%–58.29% of the total phenotypic variance in the different environments (Supplementary Table S1). Among these four traits, the QTL of BN exhibited the highest total average PVE of 47.57%, whereas those of BIN, BL, and BIL were 29.61%, 38.45%, and 43.50%, respectively (Supplementary Table S1). For the 53 BN-associated QTL, there was a major QTL region on Chr10:45,257,940..45,419,307 that displayed the highest PVE of 21.06% in 20JZ and 18.38% for the mean (Supplementary Figure S4; Supplementary Table S1). This region was identified as a major QTL associated with BIN (19 JZ, 20 JZ, and mean) and BL (20 JZ and mean) with a PVE >10% (Supplementary Figure S4; Supplementary Table S1). Regarding BIL, two major QTL regions with PVE > 10% were mapped to chromosome 13 (Chr13:38,262,556..38,389,319 and Chr13:41,863,394..41,887,491) and were separated by a distance of 3.47 Mb (Supplementary Table S1); however these two QTL regions were only detected from 20JZ and the mean, respectively (Supplementary Table S1).

Eight, five, three, and six QTL were identified for BN, BIN, BL, and BIL, respectively, using 3VmrMLM (Supplementary Table S4). These QTL were distributed across seven distinct chromosomes. The PVE of these four traits in the different environments (19JZ, 19SY, 20JZ, and 21SQ) ranged from 3.78% to 22.46% (Supplementary Table S1). Similar to the RTM-GWAS, 3VmrMLM identified more QTL for BN and yielded the highest total average PVE of 19.67% compared to that of BIN, BL, and BIL (Supplementary Table S1). Among all QTL identified by 3VmrMLM, a major QTL region associated with chromosome 10 (Chr10:45,257,940..45,322,107) explained 22.46%, 13.32%, 10.90%, and 5.78% of phenotypic variance in BN, BIN, BL, and BIL, respectively (Supplementary Figure S5; Supplementary Table S1). This region exhibited the highest PVE for BN, BIN, and BL but not for BIL. The QTL that explained the highest PVE for BIL (10.49%) was Chr11:24,394,808..24,461,847 that was mapped only from the 19SY (Supplementary Table S1).

3.3 Co-localized QTL and pleiotropic QTL

LD between 251 QTL regions related to those four branch traits revealed that 166 QTL were co-localized (LD > 0.9), and these accounted for 66.14% of all QTL (Supplementary Table S3). Among these co-localized QTL regions, two major QTL regions (Chr10:43,366,437..47,047,085 and Chr11:10,609,377..11,279,201) were identified using four different methods and were associated with all four branch traits (Supplementary Table S3). Region Chr10:43,366,437..47,047,085 was co-localized with QTL identified using four different methods for three traits (BN, BIN, and BL) (Figures 2A–C; Supplementary Table S3). Furthermore, this region exhibited pleiotropic effects on the four branch traits according to two different methods (RTM-GWAS and 3VmrMLM) (Figures 2G, H). The other co-localization regions exerted a major effect. Chr11:10,609,377..11,279,201 was co-localized by four different methods in the BL, and this region was also pleiotropic for the four branch traits in EMMAX (Figures 2C, E). Additionally, there were another two QTL regions (Chr03:36,518,347..37,236,411 and Chr13:41,619,891..42,577,532) that revealed the co-localization of BN and BIL according to three different methods, thus indicating a pleiotropic effect (Figures 2A, D, G).

Figure 2
www.frontiersin.org

Figure 2. Co-localized QTL of BN, BIN, BL, and BIL in different environments (19JZ, 19SY, 20JZ, 21SQ) and mean. (A–D) QTL co-localized for the same trait in different methods (E–H): Pleiotropy QTL for the same method.

3.4 Candidate gene selection for the major QTL

The major QTL (Chr10:42,222,333..47,047, 085) with the highest PVE associated with the four branch traits was narrowed down to the interval Chr10:45,050,047..46,781,943,207 genes were identified, and 16 of them were observed to contain SNP variations resulting in stopgain and non-synonyms. Among these, 10 genes exhibited a genetic differentiation index (FST) of greater than 0.6, thus indicating that they may have been subjected to domestication (Supplementary Table S6; Supplementary Table S7; Supplementary Figure S6). A spatial and temporal expression pattern analysis of the 10 domesticated genes revealed that nine genes were expressed either constitutively or specifically in certain tissues, and only one gene was not expressed (Figure 3). Two of the nine genes were annotated to encode unknown proteins (Glyma.10G229200 and Glyma.10G233600), and one encoded a protein relevant to disease resistance (Glyma.10G228000) (Supplementary Table S8). Additionally, a region overlapped by all QTL mapped to this locus (Chr10:45,257,940..45,322,107) consisted of three genes that included Glyma.10G221300, Glyma.10G221400, and Glyma.10G221500. These nine genes are potential candidates for branch traits (Table 3).

Figure 3
www.frontiersin.org

Figure 3. Spatial and temporal expression patterns of 13 candidate genes. The heat map illustrating the spatial and temporal expression profiles of 13 candidate genes, as derived from publicly available RNA-seq data. The data was normalized by log10(FPKM+1), where FPKM stands for Fragments Per Kilobase of exon model per Million mapped fragments.

Table 3
www.frontiersin.org

Table 3. Gene function annotation of 9 candidate genes associated with four branch traits.

3.5 Prediction accuracy of GS

The prediction accuracy for BN, BIN, BL and BIL was calculated using GBLUP, and prediction accuracies of 0.18–0.59, 0.14-0.49, 0.10-0.48, and 0.24-0.56 were identified, respectively, in the different environments (19JZ, 19SY, 20JZ, and 21SQ) of the three datasets. The values were 0.33–0.57, 0.21-0.48, 0.19-0.47, and 0.27-0.45, respectively, when the mean value of those traits was used as the phenotype (Figure 4). Comparatively, the prediction accuracy of the three traits with the exception of BIL was lower than the mean value in different environments. In terms of different marker sets, the prediction accuracy of TAM (Total associated markers) was the highest in either the mean or different environments with an average prediction accuracy of 0.46. The prediction accuracy of GWUM (Genome wide marker without associated markers) was the lowest, with an average prediction accuracy of 0.24. Compared to that of the TAM and GWM, the prediction accuracy of the TAM was improved by 15.28%–109.35%.

Figure 4
www.frontiersin.org

Figure 4. Prediction accuracy of genomic selection on BN, BIN, BL, and BIL. GWUM: 5698 Genome-wide uncorrelated markers, GWM: 6098 Genome-wide markers, TAM: 400 Trait associated markers.

4 Discussion

4.1 QTL reliability was improved by using multiple QTL mapping methods

In different QTL mapping procedures, the number and effects of QTL are closely related to the population and model of QTL mapping (Fu et al., 2017). Localizing QTL using multiple QTL mapping methods can effectively reduce false-positive QTL and increase the confidence in major QTL (Sonah et al., 2015). In this study, 39, 17, 164, and 31 QTL related to those four branch traits were identified using EMMAX, TASSEL, RTM-GWAS, and 3VmrMLM, respectively, and 34, 17, 58 and 27 QTL identified by these four methods were co-localized with QTL identified by the other three methods, indicating a high repeatability of those methods. It worth noticing that all QTL detected by TASSEL were repeated in different environments, and although RTM-GWAS identified the most QTL, and very limited (35.4%) were repeated across environments, suggested TASSEL performed the best in our study and RTM-GWAS probably detected most of the missing heritability in different environments. The QTL identified by RTM-GWAS is much higher than those identified by other methods. This may be due to RTM-GWAS used multiple allelic variant markers and multi-locus models to comprehensively analyze the genetic composition of QTL in the population, which has been reported in previous studies (Gai and He, 2020). In addition, a total of 83 QTL overlapped in 25 QTL regions among all the QTL located by these methods, which may indicate that these regions are pleiotropic or one of the significant loci influencing the branch traits.

The 251 QTL mapped by those four different methods constitutes 143 unique regions of different branch traits (Supplementary Table S5). Among these unique QTL, Chr10:42,222,333..47,047,085 was the major QTL region that not only existed in all environments but also predominantly exhibited the highest PVE (Supplementary Table S1; Supplementary Table S5). Additionally, certain branching studies have localized this QTL in previous studies, thus indicating its high reliability (Ding, 2011; Li et al., 2008). Another major QTL region (Chr11:10,609,377..11,279,201) was co-localized by four different methods and was associated with four branch traits, thus suggesting that it is a reliable QTL (Supplementary Table S1; Supplementary Table S3; Supplementary Table S5).

4.2 Candidate gene for branch traits

Among the 207 genes in the region of Chr10:45,050,047..46,781,943 that were analyzed by SNP variant analysis, FST, and spatial temporal specific expression analysis, nine genes may be candidate genes for soybean branch number, including seven genes encoding known proteins and two genes encoding proteins of unknown function. Of the seven genes encoding known proteins, six may be associated with soybean branch traits, with the exception of a disease-associated gene. Glyma.10G222900 encodes a DHHC-type zinc finger family protein that may control plant growth and development, including that of branches (Xiang et al., 2010). The integrase-type DNA-binding superfamily protein encoded by Glyma.10G223200 may exhibit high homology to AtERF114 and regulate plant branches and structures in an auxin-dependent manner (Lyu et al., 2022). Glyma.10G223500 regulates cellulose content by encoding cellulose synthase 6 (Endler et al., 2016). Glyma.10G227100 encodes the RING/FYVE/PHD zinc-finger superfamily protein that is associated with plant growth, development, and abiotic stress responses (Han et al., 2022). The pectin lyase-like superfamily protein is encoded by Glyma.10G231000 and plays a key role in cell wall biosynthesis and organization (Han et al., 2018; Yu et al., 2021). The PPPDE putative thiol peptidase family protein is encoded by Glyma.10G231400 and regulates the ubiquitin signaling pathway (Liu et al., 2020). Additionally, three genes in the region Chr10:45,257,940..45,322,107 were also correlated with branch traits with a high probability. The Family S-adenosylmethionine carrier 1 encoded by Glyma.10G221300 (S-adenosylmethionine) is associated with the synthesis of ethylene and polyamines (Yang et al., 2023). Glyma.10G221400 encodes a putative protein known as carboxypeptidase D. The PLASTOCHRON3 (PLA3)/GOLIATH (GO) gene encoding this protein in rice is capable of converting rachis branches into shoots (Kawakatsu et al., 2009). The Gigantea protein encoded by Glyma.10G221500 regulates the growth and development of plants and is involved in the regulation of flowering time in soybeans (Wong et al., 2009; Patnaik et al., 2023; Watanabe et al., 2011). Interestingly, some genes related to other traits, e.g., flowering time, maturation time, were reported to be colocalized within those regions (Xia et al., 2021). It is reported that the tillering in rice was influenced by light signal via strigolactone pathway (Xie et al., 2020), suggested a possible role of gene involved in the light receiving participated in branching in soybean.

4.3 Genomic prediction

Genomic selection is a powerful tool for plant and animal improvement, however, it has not yet been applied to soybean branch traits. The identification of 99, 43, 50, and 59 QTL associated with BN, BIN, BL, and BIL, respectively, suggests that these four traits are quantitative traits controlled by multiple loci. These four branch traits exhibited high heritability (>0.75), thus suggesting the possibility of higher prediction accuracy for GS (Lan et al., 2020). Based on the three genomic marker sets, GS studies examining BN, BIN, BL, and BIL revealed that the prediction accuracy of GWUM was lower than that of GWM, whereas the prediction accuracy of TAM was higher than that of GWM, thus indicating that marker density does not always improve prediction accuracy (Xavier et al., 2016). Retaining 400 SNPs relevant to the four branch traits was an effective approach to improve the prediction accuracy of different traits, possibly due to the observation that the reduction in SNPs decreased the background noise with no or low effect. This result is consistent with those of previous studies, thus indicating that most of the QTL identified in this study were effective (Xiong et al., 2023).

In this study, a batch of different QTL related to branch traits was identified using different methods, which assisted the marker development and marker assisted selection. At the meanwhile, genomic selection model was constructed based on the QTL mapped as well, which could be applied in the branch traits selection in the breeding populations and accelerate the breeding process. And candidate genes identified in this study provided solid foundation towards the gene cloning and molecular mechanism clarification of branch related traits.

5 Conclusion

A total of 99, 43, 50, and 59 QTL were identified for BN, BIN, BL and BIL respectively, based on which genomic selection model was constructed with prediction accuracy of larger than 0.48. For the major QTL with highest PVE, nine candidate genes were screened out for further study. This study provided a deep view of genetic mechanism underlying branch related traits, provided a beneficial trial of genomic selection on soybean branch related traits, which could be applied in breeding programs.

Data availability statement

The data presented in the study are deposited in the Soybean Functional Genomics and Breeding repository, direct access to data through link https://sfgb.rmbreeding.cn/about/data.

Author contributions

QY: Formal Analysis, Investigation, Writing - original draft, Writing - review and editing. JiW: Data curation, Investigation, Writing - review and editing. YX: Data curation, Investigation, Writing - review and editing. AM: Investigation, Writing - review and editing. ZZ: Investigation, Writing - review and editing. YC: Investigation, Writing - review and editing. ST: Investigation, Writing - review and editing. ZL: Investigation, Writing - review and editing. JuW: Conceptualization, Funding acquisition, Writing - original draft, Writing - review and editing. JS: Conceptualization, Writing - review and editing. LQ: Conceptualization, Funding acquisition, Writing - review and editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was supported by grants from Key Research Project of the Shennong Laboratory (SN01-2024-01), the Agricultural Science and Technology Innovation Program (ASTIP) of Chinese Academy of Agricultural Sciences, National Natural Science Foundation of China (32072016).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2025.1484146/full#supplementary-material

Abbreviations

BIL, Branch Internode Length; BIN, Branch Internode Number; BL, Branch Length; BN, Branch Number; CTAB, Cetyltrimethylammonium Bromide; EMMAX, Efficient Mixed-Model Association Expedited; FST, Fixation Index; GBLUP, Genomic Best Linear Unbiased Predictor; GS, Genomic Selection; GWM, Genome-Wide Markers; GWUM, Genome-Wide Unassociated Markers; IQR, Interquartile Range; LD, Linkage Disequilibrium; MLM, Mixed Linear Model; PVE, Phenotypic Variance Explained; QTL, Quantitative Trait Loci; RIL, Recombinant Inbred Line; RTM-GWAS, Restricted Two-Stage Multi-Locus Genome-Wide Association; SNP, Single Nucleotide Polymorphism; SNPLDBs, SNP Linkage Disequilibrium Blocks; TAM, Trait-Associated Markers; TASSEL, Trait Analysis by Association, Evolution and Linkage; ZD 41, Zhongdou 41; 3VmrMLM, Three-Variance Component Multi-Locus Random SNP Effect Mixed Linear Model.

References

Bao, A. L., Chen, H. F., Chen, L. M., Chen, S. L., Hao, Q. N., Guo, W., et al. (2019). CRISPR/Cas9-mediated targeted mutagenesis of GmSPL9 genes alters plant architecture in soybean. BMC Plant Biol. 19, 131. doi:10.1186/s12870-019-1746-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Board, J. E. (1987). Yield components related to seed yield in determinate soybean. Crop Sci. 27, 1296–1297. doi:10.2135/cropsci1987.0011183X002700060041x

CrossRef Full Text | Google Scholar

Carpenter, A. C., and Board, J. E. (1997). Branch yield components controlling soybean yield stability across plant populations. Crop Sci. 37, 885–891. doi:10.2135/cropsci1997.0011183X003700030031x

CrossRef Full Text | Google Scholar

Chen, Y. J., Xiong, Y. J., Hong, H. L., Li, G., Gao, J., Guo, Q. Y., et al. (2023). Genetic dissection of and genomic selection for seed weight, pod length, and pod width in soybean. Crop J. 11, 832–841. doi:10.1016/j.cj.2022.11.006

CrossRef Full Text | Google Scholar

Danecek, P., Auton, A., Abecasis, G., Albers, C. A., Banks, E., DePristo, M. A., et al. (2011). The variant call format and VCFtools. Bioinformatics 27, 2156–2158. doi:10.1093/bioinformatics/btr330

PubMed Abstract | CrossRef Full Text | Google Scholar

Desta, Z. A., and Ortiz, R. (2014). Genomic selection: genome-wide prediction in plant improvement. Trends Plant Sci. 19, 592–601. doi:10.1016/j.tplants.2014.05.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Ding, H. (2011). The natural selection effect on resistance to SCN and major agronomic characters of soybean by SSR analysis. Nanjing Agric. Univ. 63.

Google Scholar

Duhnen, A., Gras, A., Teyssèdre, S., Romestant, M., Claustres, B., Daydé, J., et al. (2017). Genomic selection for yield and seed protein content in soybean: a study of breeding program data and assessment of prediction accuracy. Crop Sci. 57, 1325–1337. doi:10.2135/cropsci2016.06.0496

CrossRef Full Text | Google Scholar

Endler, A., Schneider, R., Kesten, C., Lampugnani, E. R., and Persson, S. (2016). The cellulose synthase companion proteins act non-redundantly with cellulose synthase interacting1/POM2 and cellulose synthase 6. Plant Signal Behav. 11, e1135281. doi:10.1080/15592324.2015.1135281

PubMed Abstract | CrossRef Full Text | Google Scholar

Fu, Y., Zhang, D. Q., Gleeson, M., Zhang, Y. F., Lin, B. G., Hua, S. J., et al. (2017). Analysis of QTL for seed oil content in Brassica napus by association mapping and QTL mapping. Euphytica 213, 17. doi:10.1007/s10681-016-1817-9

CrossRef Full Text | Google Scholar

Gai, J. Y., and He, J. B. (2020). Major characteristics, often-raised queries and potential usefulness of the restricted two-stage multi-locus genome-wide association analysis. Sci. Agric. Sin. 53, 1699–1703. doi:10.3864/j.issn.0578-1752.2020.09.001

CrossRef Full Text | Google Scholar

Gonda, I., Ashrafi, H., Lyon, D. A., Strickler, S. R., Hulse-Kemp, A. M., Ma, Q. Y., et al. (2019). Sequencing-based bin map construction of a tomato mapping population, facilitating high-resolution quantitative trait loci detection. Plant Genome 12, 180010. doi:10.3835/plantgenome2018.02.0010

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, R., and Guo, S. J. (2021). Study on the ideal plant type of soybean hybrids with high yield and good quality. Shanxi Agric. Univ. 55.

Google Scholar

Guo, S. J., Yang, K. M., Huo, J., Zhou, Y. H., Wang, H. Y., and Li, G. Q. (2015). Correlationand principal component analysis between traits and yield in soybean. J. Shanxi Agric. Sci. 43, 505–508. doi:10.3969/j.issn.1002-2481.2015.05.01

CrossRef Full Text | Google Scholar

Han, G. L., Qiao, Z. Q., Li, Y. X., Yang, Z. R., Wang, C. F., Zhang, Y. Y., et al. (2022). RING zinc finger proteins in plant abiotic stress tolerance. Front. Plant Sci. 13, 877011. doi:10.3389/fpls.2022.877011

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, Y. G., Ma, L. G., Zhao, L., Feng, W. S., and Zheng, X. K. (2018). Bioinformatics analysis of BAN gene in Arabidopsis thaliana. IOP Conf. Ser. Earth Environ. Sci. 170, 052022. doi:10.1088/1755-1315/170/5/052022

CrossRef Full Text | Google Scholar

He, J. B., and Gai, J. Y. (2020). QTL-allele matrix detected from RTM-GWAS is a powerful tool for studies in genetics, evolution, and breeding by design of crops. J. Integr. Agric. 19, 1407–1410. doi:10.1016/S2095-3119(20)63199-9

CrossRef Full Text | Google Scholar

He, Q. Y., Yang, H. Y., Xiang, S. H., Wang, W. B., Xing, G. N., Zhao, T. J., et al. (2014). QTL mapping for the number of branches and pods using wild chromosome segment substitution lines in soybean Glycine max (L.) Merr. Plant Genet. Resour. 12, S172–S177. doi:10.1017/S1479262114000495

CrossRef Full Text | Google Scholar

Jin, Y., He, T. H., and Lu, B. R. (2003). Fine scale genetic structure in a wild soybean (Glycine soja) population and the implications for conservation. New Phytol. 159, 513–519. doi:10.1046/j.1469-8137.2003.00824.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Kawakatsu, T., Taramino, G., Itoh, J. I., Allen, J., Sato, Y., Hong, S. K., et al. (2009). PLASTOCHRON3/GOLIATH encodes a glutamate carboxypeptidase required for proper development in rice. Plant J. 58, 1028–1040. doi:10.1111/j.1365-313X.2009.03841.x

PubMed Abstract | CrossRef Full Text | Google Scholar

La, T. C. (2018). Characterization of a diverse USDA collection of wild soybean (Glycine soja siebold and zucc.) accessions and subsequent mapping for seed composition and agronomic traits in a RIL population. Columbia, MO: University of Missouri-Columbia.

Google Scholar

Lamlom, S. F., Zhang, Y., Su, B. S., Wu, H. T., Zhang, X., Fu, J. D., et al. (2020). Map-based cloning of a novel QTL qBN-1 influencing branch number in soybean [Glycine max (L.) Merr.]. Crop J. 8, 793–801. doi:10.1016/j.cj.2020.03.006

CrossRef Full Text | Google Scholar

Lan, S., Zheng, C. F., Hauck, K., McCausland, M., Duguid, S. D., Booker, H. M., et al. (2020). Genomic prediction accuracy of seven breeding selection traits improved by QTL identification in flax. Int. J. Mol. Sci. 21, 1577. doi:10.3390/ijms21051577

PubMed Abstract | CrossRef Full Text | Google Scholar

Leamy, L. J., Zhang, H. Y., Li, C. B., Chen, C. Y., and Song, B. H. (2017). A genome-wide association study of seed composition traits in wild soybean (Glycine soja). BMC Genomics 18, 18. doi:10.1186/s12864-016-3397-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, R. D., Xu, C. L., Yin, Y. Y., Song, W. W., Sun, S., Han, T. F., et al. (2021). Effects of close planting on photosynthetic characteristics and yield of less branched soybean varieties. Soybean Sci. 40, 633–642. doi:10.11861/j.issn.1000-9841.2021.05.0633

CrossRef Full Text | Google Scholar

Li, W. X., Zheng, D. H., Van, K. J., and Lee, S. H. (2008). QTL Mapping for major agronomic traits across two years in soybean (Glycine max L. Merr.). J. Crop Sci. Biotechnol. 11, 171–176.

Google Scholar

Li, Y. D., Zhu, X. C., Tang, L., Cao, W. X., and Zhu, Y. (2011). Simulation of canopy photosynthetic production based on plant type in rice. Acta Agron. Sin. 37, 868–875. doi:10.3724/SP.J.1006.2011.00868

CrossRef Full Text | Google Scholar

Li, Y. H., Qin, C., Wang, L., Jiao, C. Z., Hong, H. L., Tian, Y., et al. (2023). Genome-wide signatures of the geographic expansion and breeding of soybean. Sci. China Life Sci. 66, 350–365. doi:10.1007/s11427-022-2158-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Liang, Q. J., Chen, L. Y., Yang, X., Yang, H., Liu, S. L., Kou, K., et al. (2022). Natural variation of Dt2 determines branching in soybean. Nat. Commun. 13, 6429. doi:10.1038/s41467-022-34153-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, H. R., Li, G., Yang, X. J., Kuijer, H. N. J., Liang, W. Q., and Zhang, D. B. (2020). Transcriptome profiling reveals phase-specific gene expression in the developing barley inflorescence. Crop J. 8, 71–86. doi:10.1016/j.cj.2019.04.005

CrossRef Full Text | Google Scholar

Lyu, J., Guo, Y., Du, C., Yu, H., Guo, L., Liu, L., et al. (2022). BnERF114.A1, a rapeseed gene encoding APETALA2/ETHYLENE RESPONSE FACTOR, regulates plant architecture through auxin accumulation in the apex in Arabidopsis. Int. J. Mol. Sci. 23, 2210. doi:10.3390/ijms23042210

PubMed Abstract | CrossRef Full Text | Google Scholar

Nawaz, M. A., Yang, S. H., and Chung, G. (2018). “Wild soybeans: an opportunistic resource for soybean improvement,” in Rediscovery of landraces as a resource for the future. Editor O. Grillo (London: InTechOpen). doi:10.5772/intechopen.74973

CrossRef Full Text | Google Scholar

Patnaik, A., Kumar, A., Behera, A., Mishra, G., Dehery, S. K., Panigrahy, M., et al. (2023). GIGANTEA supresses wilt disease resistance by down-regulating the jasmonate signaling in Arabidopsis thaliana. Front. Plant Sci. 14, 1091644. doi:10.3389/fpls.2023.1091644

PubMed Abstract | CrossRef Full Text | Google Scholar

Qiao, H. T., Liu, X., and Gao, L. (2016). Correlation and grey relation analysis of soybean branch and yield、agronomic traits. Liaoning Agric. Sci., 36–40. doi:10.3969/j.issn.1002-1728.2016.01.009

CrossRef Full Text | Google Scholar

Qin, J., Wang, F. M., Zhao, Q. S., Shi, A. N., Zhao, T. T., Song, Q. J., et al. (2022). Identification of candidate genes and genomic selection for seed protein in soybean breeding pipeline. Front. Plant Sci. 13, 882732. doi:10.3389/fpls.2022.882732

PubMed Abstract | CrossRef Full Text | Google Scholar

Qiu, L. J., and Chang, R. Z. (2006). Descriptors and data standard for soybean (Glycine spp). Beijing: Chinese Agriculture Press.

Google Scholar

Rabier, C. E., Barre, P., Asp, T., Charmet, G., and Mangin, B. (2016). On the accuracy of genomic selection. PLoS ONE 11, e0156086. doi:10.1371/journal.pone.0156086

PubMed Abstract | CrossRef Full Text | Google Scholar

Rogers, S. O., and Bendich, A. J. (1988). Extraction of DNA from plant tissues. Plant Mol. Biol. Man. A6, 89–99. doi:10.1007/978-94-017-5294-7_6

CrossRef Full Text | Google Scholar

Shim, S., Ha, J., Kim, M. Y., Choi, M. S., Kang, S. T., Jeong, S. C., et al. (2019). GmBRC1 is a candidate gene for branching in soybean (Glycine max (L.) Merrill). Int. J. Mol. Sci. 20, 135. doi:10.3390/ijms20010135

PubMed Abstract | CrossRef Full Text | Google Scholar

Shim, S., Kim, M. Y., Ha, J., Lee, Y. H., and Lee, S. H. (2017). Identification of QTLs for branching in soybean (Glycine max (L.) Merrill). Euphytica 213, 225. doi:10.1007/s10681-017-2016-z

CrossRef Full Text | Google Scholar

Sonah, H., O’Donoughue, L., Cober, E., Rajcan, I., and Belzile, F. (2015). Identification of loci governing eight agronomic traits using a GBS-GWAS approach and validation by QTL mapping in soya bean. Plant Biotechnol. J. 13, 211–221. doi:10.1111/pbi.12249

PubMed Abstract | CrossRef Full Text | Google Scholar

Song, Q. J., Hyten, D. L., Jia, G. F., Quigley, C. V., Fickus, E. W., Nelson, R. L., et al. (2013). Development and evaluation of SoySNP50K, a high-density genotyping array for soybean. PLoS ONE 8, e54985. doi:10.1371/journal.pone.0054985

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, R. J., Sun, B. C., Tian, Y., Su, S. S., Zhang, Y., Zhang, W. H., et al. (2022). Dissection of the practical soybean breeding pipeline by developing ZDX1, a high-throughput functional array. Theor. Appl. Genet. 135, 1413–1427. doi:10.1007/s00122-022-04043-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, Z. X., Su, C., Yun, J. X., Jiang, Q., Wang, L. X., Wang, Y. N., et al. (2019). Genetic improvement of the shoot architecture and yield in soya bean plants via the manipulation of GmmiR156b. Plant Biotechnol. J. 17, 50–62. doi:10.1111/pbi.12946

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X. T., Chen, L. Y., and Ma, J. X. (2019). Genomic introgression through interspecific hybridization counteracts genetic bottleneck during soybean domestication. Genome Biol. 20, 22–15. doi:10.1186/s13059-019-1631-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Watanabe, S., Xia, Z. J., Hideshima, R., Tsubokura, Y., Sato, S., Yamanaka, N., et al. (2011). A map-based cloning strategy employing a residual heterozygous line reveals that the GIGANTEA gene is involved in soybean maturity and flowering. Genetics 188, 395–407. doi:10.1534/genetics.110.125062

PubMed Abstract | CrossRef Full Text | Google Scholar

Wen, Z. X., Tan, R. J., Yuan, J. Z., Bales, C., Du, W. Y., Zhang, S. C., et al. (2014). Genome-wide association mapping of quantitative resistance to sudden death syndrome in soybean. BMC Genomics 15, 809. doi:10.1186/1471-2164-15-809

PubMed Abstract | CrossRef Full Text | Google Scholar

Wong, C. E., Singh, M. B., and Bhalla, P. L. (2009). Molecular processes underlying the floral transition in the soybean shoot apical meristem. Plant J. 57, 832–845. doi:10.1111/j.1365-313X.2008.03730.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Xavier, A., Muir, W. M., and Rainey, K. M. (2016). Assessing predictive properties of genome-wide selection in soybeans. G3 Genes Genomics Genet. 6, 2611–2616. doi:10.1534/g3.116.032268

PubMed Abstract | CrossRef Full Text | Google Scholar

Xia, Z., Zhai, H., Wu, H., Xu, K., Watanabe, S., and Harada, K. (2021). The synchronized efforts to decipher the molecular basis for soybean maturity loci E1, E2, and E3 that regulate flowering and maturity. Front. Plant Sci. 12, 632754. doi:10.3389/fpls.2021.632754

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiang, J., Lin, J. Z., Tang, D. Y., Zhou, B., Guo, M., He, R. Q., et al. (2010). A DHHC-type zinc finger protein gene regulates shoot branching in Arabidopsis. Afr. J. Biotechnol. 9, 7759–7766. doi:10.5897/AJB10.650

CrossRef Full Text | Google Scholar

Xie, Y., Liu, Y., Ma, M., Zhou, Q., Zhao, Y., Zhao, B., et al. (2020). Arabidopsis FHY3 and FAR1 integrate light and strigolactone signaling to regulate branching. Nat. Commun. 11, 1955. doi:10.1038/s41467-020-15893-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiong, H. Z., Chen, Y. L., Pan, Y. B., Wang, J. S., Lu, W. G., and Shi, A. N. (2023). A genome-wide association study and genomic prediction for Phakopsora pachyrhizi resistance in soybean. Front. Plant Sci. 14, 1179357. doi:10.3389/fpls.2023.1179357

PubMed Abstract | CrossRef Full Text | Google Scholar

Xue, Y., Warburton, M. L., Sawkins, M., Zhang, X. H., Setter, T., Xu, Y. B., et al. (2013). Genome-wide association analysis for nine agronomic traits in maize under well-watered and water-stressed conditions. Theor. Appl. Genet. 126, 2587–2596. doi:10.1007/s00122-013-2158-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, G., Zhai, H., Wu, H. Y., Zhang, X. Z., Lu, S. X., Wang, Y. Y., et al. (2017). QTL effects and epistatic interaction for flowering time and branch number in a soybean mapping population of Japanese×Chinese cultivars. J. Integr. Agric. 16, 1900–1912. doi:10.1016/S2095-3119(16)61539-3

CrossRef Full Text | Google Scholar

Yang, H., Xue, Q., Zhang, Z. Z., Du, J. Y., Yu, D. Y., and Huang, F. (2018). GmMYB181, a soybean R2R3-MYB protein, increases branch number in transgenic Arabidopsis. Front. Plant Sci. 9, 1027. doi:10.3389/fpls.2018.01027

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, L., Wang, X. X., Zhao, F. Y., Zhang, X. L., Li, W., Huang, J. S., et al. (2023). Roles of S-adenosylmethionine and its derivatives in salt tolerance of cotton. Int. J. Mol. Sci. 24, 9517. doi:10.3390/ijms24119517

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Y. H., Lei, Y., Bai, Z. Y., Wei, B. G., Zhang, H. P., and Zhang, R. J. (2021). Physical mapping and candidate gene prediction of branch number on the main stem in soybean [Glycine max (L.) Merr.]. Genet. Resour. Crop Evol. 68, 2907–2921. doi:10.1007/s10722-021-01163-y

CrossRef Full Text | Google Scholar

Yu, A. D., Liu, L., Long, R. C., Kang, J. M., Chen, L., Yang, Q. C., et al. (2021). Function and application prospect of plant UDP-glycosyltransferase (UGT). Plant Physiology J. 54, 631–642. doi:10.13592/j.cnki.ppj.2021.0128

CrossRef Full Text | Google Scholar

Zhao, Y. H., Li, X. X., Chen, Z., Lu, H. W., Liu, Y. C., Zhang, Z. F., et al. (2020). An overview of genome-wide association studies in plants. Chin. Bull. Bot. 55, 715–732. doi:10.11983/CBB20091

CrossRef Full Text | Google Scholar

Zuo, J. F., Chen, Y., Ge, C., Liu, J. Y., and Zhang, Y. M. (2022). Identification of QTN-by-environment interactions and their candidate genes for soybean seed oil-related traits using 3VmrMLM. Front. Plant Sci. 13, 1096457. doi:10.3389/fpls.2022.1096457

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: soybean, branch, QTL, candidate gene, genomic selection

Citation: Yang Q, Wang J, Xiong Y, Mao A, Zhang Z, Chen Y, Teng S, Liu Z, Wang J, Song J and Qiu L (2025) Identification of QTL for branch traits in soybean (Glycine max L.) and its application in genomic selection. Front. Genet. 16:1484146. doi: 10.3389/fgene.2025.1484146

Received: 21 August 2024; Accepted: 06 February 2025;
Published: 03 March 2025.

Edited by:

Yingpeng Han, Northeast Agricultural University, China

Reviewed by:

Jindong Liu, Chinese Academy of Agricultural Sciences, China
Tri D. Vuong, University of Missouri, United States
Lirong Zhu, Michigan Technological University, United States

Copyright © 2025 Yang, Wang, Xiong, Mao, Zhang, Chen, Teng, Liu, Wang, Song and Qiu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jun Wang, d2FuZ2phZ3JpQHlhbmd0emV1LmVkdS5jbg==; Jian Song, c29uZ2ppYW5AeWFuZ3R6ZXUuZWR1LmNu; Lijuan Qiu, cWl1bGlqdWFuQGNhYXMuY24=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

Man ultramarathon runner in the mountains he trains at sunset

94% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more