Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci., 18 January 2023
Sec. Functional and Applied Plant Genomics
This article is part of the Research Topic Genetic Characterization of Yield- and Quality-Related Traits in Legumes View all 14 articles

Fine mapping and candidate gene analysis of proportion of four-seed pods by soybean CSSLs

Fubin Cao&#x;Fubin Cao1†Ruru Wei&#x;Ruru Wei1†Jianguo Xie&#x;Jianguo Xie2†Lilong HouLilong Hou1Chaorui KangChaorui Kang1Tianyu ZhaoTianyu Zhao1Chengcheng SunChengcheng Sun1Mingliang YangMingliang Yang1Ying ZhaoYing Zhao1Candong LiCandong Li3Nannan WangNannan Wang3Xiaoxia WuXiaoxia Wu1Chunyan Liu*Chunyan Liu1*Hongwei Jiang*Hongwei Jiang2*Qingshan Chen*Qingshan Chen1*
  • 1College of Agriculture, Key Laboratory of Soybean Biology in Chinese Ministry of Education, Northeast Agricultural University, Harbin, China
  • 2Jilin Academy of Agricultural Sciences, Soybean Research Institute, Changchun, Jilin, China
  • 3Jiamusi Branch Institute, Heilongjiang Academy of Agricultural Sciences, Jiamusi, Heilongjiang, China

Soybean yield, as one of the most important and consistent breeding goals, can be greatly affected by the proportion of four-seed pods (PoFSP). In this study, QTL mapping was performed by PoFSP data and BLUE (Best Linear Unbiased Estimator) value of the chromosome segment substitution line population (CSSLs) constructed previously by the laboratory from 2016 to 2018, and phenotype-based bulked segregant analysis (BSA) was performed using the plant lines with PoFSP extreme phenotype. Totally, 5 ICIM QTLs were repeatedly detected, and 6 BSA QTLs were identified in CSSLs. For QTL (qPoFSP13-1) repeated in ICIM and BSA results, the secondary segregation populations were constructed for fine mapping and the interval was reduced to 100Kb. The mapping results showed that the QTL had an additive effect of gain from wild parents. A total of 14 genes were annotated in the delimited interval by fine mapping. Sequence analysis showed that all 14 genes had genetic variation in promoter region or CDS region. The qRT−PCR results showed that a total of 5 candidate genes were differentially expressed between the plant lines having antagonistic extreme phenotype (High PoFSP > 35.92%, low PoFSP< 17.56%). The results of haplotype analysis showed that all five genes had two or more major haplotypes in the resource population. Significant analysis of phenotypic differences between major haplotypes showed all five candidate genes had haplotype differences. And the genotypes of the major haplotypes with relatively high PoFSP of each gene were similar to those of wild soybean. The results of this study were of great significance to the study of candidate genes affecting soybean PoFSP, and provided a basis for the study of molecular marker-assisted selection (MAS) breeding and four-seed pods domestication.

1 Introduction

Soybean originated in China and then spread around the world. As an important food crop and oil crop, it has a cultivation history of more than 5000 years (Guo, 2004; Barik et al., 2018; Zhang et al., 2018). China is the world’s largest soybean consumer. In the past 10 years, China’s soybean consumption accounted for more than 30% of the world’s total, ranking first in the world (Liu and Fan, 2021). However, China’s soybean yield per unit is low, only 70% of the international average (Si and Han, 2021), and the domestic soybean supply is weak, with a supply rate of only approximately 15% (Liu and Fan, 2021). Improving soybean yield is an urgent problem to be solved in the world’s soybean industry.

The yield of soybean is affected by multiple factors, such as 100-seed weight, the number of pods per plant, the number of seeds per plant and the number of seeds per pod. These are all quantitative traits controlled by multiple genes and are easily affected by environmental factors (Xie et al., 2021). The total number of soybean pods per plant is one of the main factors limiting soybean yield. When the total number of pods varies little or the potential for increasing the number of pods is limited, PoFSP becomes the main limiting factor for the seed number per plant of soybean. Studies have shown that varieties with a high PoFSP have higher yield and productivity, and it is an effective way to increase the yield by increasing PoFSP of varieties, which has great practical value for increasing the yield. (Peng et al., 1994).

Compared with traditional breeding, the application of molecular marker technology has mostly overcome the difficulties introduced by environmental factors into phenotype identification (Song et al., 2012), and the emergence of MAS and QTL mapping has significantly shortened the breeding years (Hasan et al., 2021). MAS indirectly realizes the selection of QTL controlling a certain trait through the selection of genetic markers, so as to achieve the purpose of selecting the trait (Wakchaure et al., 2015). In 1988, Paterson mapped QTLs by using molecular markers in tomato for the first time (Paterson et al., 1988), and this method has been widely used in different crops (Song et al., 2005; Jiang et al., 2011; Yao et al., 2015; Botero-Ramírez et al., 2020). At present, a large number of studies have been carried out on QTL mapping of pod traits in soybean, and many QTLs related to four-seed pods have been mapped. Wang et al. mapped the QTL related to the number of pods and compared the heritability of one, two, three, and four seed pods by using soybean RILs (recombinant inbred line), and they found that the heritability of four-seed pods was higher and the main reason for the variation in seed pods number was the shedding of flower pods. In 2007, Wang et al. mapped two four-seed pods QTLs detected in two years by using CIM (composite interval mapping) method in soybean RILs (Wang et al., 2007). In 2008, Li et al. mapped three four-seed pods QTLs by using CIM method in soybean CSSLs (Li et al., 2008). In 2009, Zhou et al. mapped two four-seed pods QTLs by using CIM method in soybean RILs (Zhuo et al., 2009). In 2012, Gao et al. mapped 8 QTLs associated with four-seed pods by using CIM method in soybean RILs (Gao et al., 2012). In 2013, Yang et al. mapped 4 four-seed pods QTLs by using the multi-environment joint analysis method in soybean RILs (Yang et al., 2013b). In 2018, Ning et al. mapped 36 and 12 four-seed pods QTLs in soybean by using single marker analysis, CIM and multiple interval mapping methods RIL6013 and RIL3613, respectively (Ning et al., 2018). In 2021, Li et al. constructed 3 RHL s and used single marker analysis to fine-map a four-seed pods QTL, narrowing the QTL interval to 321 kb (Li et al., 2021).

In 1991, BSA method was first proposed by Michelmore et al. (Michelmore et al., 1991). Its core purpose is to use extreme material of the target trait in the population to construct two mixed pools and to locate the target gene by analyzing the degree of association between the polymorphic molecular markers and the phenotype (Zou et al., 2016). Compared to using near-isogenic lines (NILs) to identify markers in specific regions of the genome, BSA is faster and less labor. And BSA significantly reduces the cost of sequencing and analysis because only a few extreme individuals in a population are needed (Yang et al., 2013a).

The quality of the genetic population directly affects the effect, difficulty and application scope of the constructed genetic map s. If the genome differences between the parents are greater, the effect of exchange and recombination between their chromosomes will be more obvious, and the DNA polymorphism and phenotypic variation will be more abundant (Xi et al., 2005). Cultivated soybean was domesticated from annual wild soybean (Broich and Palmer, 1980), and there are significant differences between them. In this study, CSSLs were constructed using cultivated soybeans and wild soybeans. There are abundant variations in this population, and the ability to detect QTLs is strong. Although the genetic variation information is rich, the diversity of their genetic background will affect the accuracy of QTL mapping by using traditional mapping population such as F2, BC, DH, RIL and others (Zhao et al., 2009; Qiao et al., 2016). Compared with the primary mapping population, the genetic background of CSSLs is purer, which not only eliminates genetic noise but also improves the accuracy of QTL mapping (Li, 2014). At present, CSSLs have been applied to 20 crops, and researchers have used these CSSLs to map a large number of QTLs (He et al., 2014; Li et al., 2016; Alyr et al., 2020; Yuan et al., 2020; Xu et al., 2022).

In this study, CSSLs constructed previously in the laboratory was used as the material. Combined with the results of ICIM method and BSA method in CSSLs, PoFSP QTL in soybean was initially mapped, and the secondary segregated populations were constructed for fine mapping. The genes were screened out in the fine mapping interval, and the candidate genes for PoFSP were analyzed and predicted by expression and haplotype analysis.

2 Materials and methods

2.1 Plant material and field management

The population orientation material for this study was CSSLs containing 208 lines constructed from SN14 (the recurrent parent) and ZYD00006 (Jiang et al., 2020), which were planted in Xiangyang Farm, Harbin, China, from 2016 to 2018 (45°45′N, 126°38′E). A completely random block design was adopted, and each line was planted in one row, repeated three times. The row length was 5 m, the row spacing was 65 cm, and the plant spacing in each single-row plot was 6 cm, with approximately 80 plants. Fine mapping population was constructed by backcrossing R92 with extremely high PoFSP in CSSLs with recurrent parent SN14. The R92-F1 seeds obtained in the same year were propagated in Nannan, Hainan, and the R92-F2 was planted in Xiangyang Farm the following year, and 121 R92-F2 individuals were obtained. RHLs(H1) was constructed from an individual in R92-F2 whose genotype was completely heterozygous in the target interval and whose remaining background was relatively pure. The resource population was also planted on the Sunshine Farm, and the planting method was the same as that of CSSLs. The field management of the above materials followed general agricultural practices.

2.2 Phenotypic identification and statistical analysis

For CSSLs, 5 complete plants were selected for each material at the mature stage in the field to test the number of pods and the number of seeds per pod at the end of September every year before harvesting. Each plant of R92-F2 and H1 s was also investigated in the field before harvesting for the number of pods and the number of seeds per pod, and PoFSP was calculated. The measurement standard referred to the “Soybean Germplasm Description Specification and Data Standard” (Zhou et al., 2015).

The phenotypic data were sorted using Microsoft® Excel 2016, the descriptive analysis was performed using SPSS 17.0, and the significance analysis was performed using one-way ANOVA. BLUE was calculated by the BLUE calculation option in AOV module of ICIMapping 4.2 software. The parameters used were each line of CSSLs as a fixed factor and the years and repetition as random factors.

2.3 Bulk segregant analysis

In this study, phenotype-based BSA was performed by 30 materials with high PoFSP (PoFSP > 17.14%) and 30 materials with low PoFSP (PoFSP< 1.46%) in 208 CSSLs. A total of 3,716,818 SNPs were detected, and 3,105,246 high-quality SNPs were obtained after screening out. The Euclidean Distance method was used to analyze the association between the markers and PoFSP in the sequencing results:

ED=

(A1A2)2+(C1C2)2+(G1G2)2+(T1T2)2

A1 and A2 was in the same position. A1 represented the occurrence frequency of A in the high phenotype pool, and A2 represented the occurrence frequency of A in the low phenotype pool. C, G, and T were the same as A. The mean ± 3 times the standard deviation (0.02238) was taken as the threshold value to judge whether the marker and the trait were closely linked, and if it exceeded the specified threshold, it was considered to be related to the trait.

2.4 QTL mapping for PoFSP

The whole genome resequencing data of CSSLs (Zheng et al., 2022) constructed in the laboratory and PoFSP data and BLUE values from 2016-2018 were used for QTL mapping. QTL mapping employed the RSTEP-LRT-ADD model with the 1,000 permutations calculation of ICIMapping 4.2 software. Parameters setting was “By condition number” = −1,000 (equivalent to deleting duplicate markers), “PIN” = 0.001 (PIN: the largest-value for entering variables in stepwise regression of residual phenotype on marker variables). DNA from fresh leaves was extracted using the cetyltrimethylammonium ammonium bromide (CTAB) plant tissue DNA extraction method, and the DNA concentration and purity were determined using a NanoDrop 2000C (Sunnyvale, California, USA) ultradifferential photometer and 1.5% agarose gel electrophoresis. The SSR markers were used as molecular markers. After the polymerase chain reaction (PCR) of Panaud, the genotype of the secondary segregated population was detected by 8% PAGE separation (Doyle, 1990). The genetic maps of the secondary segregating populations were constructed by the MAP module in ICIMapping 4.2 software. QTL mapping employed ICIM-ADD model with 1,000 permutations calculation of ICIMapping 4.2 software. And the threshold of ICIMapping for QTL detection was chose p ≤ 0.05.

2.5 qRT−PCR analysis of candidate genes

According to different stages of flower bud differentiation, the flower tissues of soybean were sampled three days before flowering, two days before, one day before, on the day of flowering, three days after flowering and five days after flowering and stored in liquid nitrogen. The sampling materials were selected from the R92-F2. L-9 and L-14 which had the genotype of ZYD00006 had a high PoFSP (PoFSP > 35.92%). L-72 and L-93 which had the genotype of SN14 had a low PoFSP (PoFSP< 17.56%). Three biological replicates were sampled per material. After grinding the samples in liquid nitrogen, RNA from the plant tissues was extracted using the TRIzol method. The RNA concentration and purity were determined using a NanoDrop 2000C (Sunnyvale, California, USA) ultradifferential photometer and 1.5% agarose gel electrophoresis. Reverse transcription of the extracted RNA into cDNA using the Tianhe Real-time quantitative PCR (RT−qPCR) kit was performed using SYBR qPCR Mix (Vazyme, Q711, Vazyme Biotech, Nanjing, China) on the Light Cycler 480 System (Roche, Roche Diagnostics, Basel, Switzerland). qRT−PCR primer sequences for the candidate genes were designed using Premier 5.0. Using GmACTIN as an internal reference, the average of three biological replicates was taken, and the relative expression levels of the candidate genes were calculated by the 2-ΔCt method.

2.6 Haplotype analysis

Haplotype analysis of PoFSP candidate genes was conducted using 527 soybean germplasm resources. Combined with the sequence information of the promoter region and CDS region of the candidate gene in the reference genome, local sequence alignment was performed on the resequencing data of the candidate gene between the parents of CSSLs to find the difference sites and analyze the variation in the promoter elements and CDS region translation products. The haplotype distribution of the candidate genes in the resource population was counted by DnaSP 5.0 software. The major haplotypes were divided by the number of varieties exceeding 5% of in resource population, and Haploview 4.2 software was used to perform linkage disequilibrium (LD) analysis on the mutation sites among the major haplotypes. Using one-way AOV(analysis of variance) in IBM SPSS Statistics 22 software, combined with the phenotypic data of PoFSP in the resource population in 2021, the significance of the difference between the phenotypes of each major haplotype material was compared.

3 Results

3.1 QTL analysis in CSSLs

QTL analysis was performed based on the phenotypic data (Zheng et al., 2022) and BLUE value data of PoFSP in CSSLs from 2016 to 2018 (Figure 1). A total of 17 QTLs were detected, distributed on 13 chromosomes (Supplementary Table 1), of which 5 were repeatedly detected in two or more years. qPoFSP07-1 and qPoFSP07-2 on Chr07 were detected in 2017 and BLUE, with maximum PVE (phenotypic variance explained) of 3.87% and 8.76%, respectively; qPoFSP13-1 on Chr13 was detected in 2018 and BLUE, with a maximum PVE of 9.18%; qPoFSP17-1 on Chr17 was detected in 2018 and BLUE, with a maximum PVE of 5.14 %; qPoFSP20-2 on Chr20 was detected in 2017, 2018 and BLUE, with a maximum PVE of 30.69 % (Table 1; Figure 2). The above 5 QTLs were repeatedly detected in two or more years, which was more reliable than the QTLs that appeared in a single year. And the five QTLs would be the focus of follow-up research.

FIGURE 1
www.frontiersin.org

Figure 1 Phenotypic identification of PoFSP in CSSLs. (A): PoFSP of CSSLs in 2016. (B): PoFSP of CSSLs in 2017. (C): PoFSP of CSSLs in 2018. (D): PoFSP of CSSLs in BLUE.

TABLE 1
www.frontiersin.org

Table 1 PoFSP QTLs detected in CSSLs for two or more years.

FIGURE 2
www.frontiersin.org

Figure 2 Distribution of PoFSP QTLs on chromosomes in CSSLs. The unit of each position in the figure is bp.

3.2 Analysis of the BSA sequencing results

BSA-seq was performed on the extreme materials of PoFSP in CSSLs. A total of 6 PoFSP QTLs were detected, which were distributed on Chr01, Chr03, Chr08, Chr12, Chr13 and Chr18, and the interval size was 0.09 Mb, 3.67 Mb, 0.10 Mb, 2.57 Mb, 3.61 Mb, and 27.4 Mb, respectively (Supplementary Table 2; Figure 3A). The QTL located on chromosome 13 coincides with the qPoFSP13-1 in CSSLs mapping results, with a size of 3.61 Mb (Figure 3B). qPoFSP13-1 was repeated twice in ICIM results and overlapped with BSA result s, showing a strong association with PoFSP phenotype. Therefore, qPoFSP13-1 was further fine-mapped as an important candidate QTL for PoFSP.

FIGURE 3
www.frontiersin.org

Figure 3 BSA results. (A): The distribution of Euclidean distance (ED)-associated values on chromosomes. (B): Preliminary mapping process of QTLs for PoFSP. a: The abscissa is the chromosome name. The color points represent the ED value of each single nucleotide polymorphism (SNP) locus. The black line is the fitted ED value, and the red dotted line represents the significantly associated threshold. The higher the ED value is, the better the correlation.

3.3 Construction and fine mapping of secondary separation populations

According to the location information of the candidate QTL (qPoFSP13-1) on the chromosome and the whole genome resequencing information of CSSLs, combined with the introduction of the ZYD00006 segment in the whole genome, a total of 16 lines from 208 CSSLs were screened out for the ZYD00006 homozygous genotype introduction segment completely covered the candidate QTL on chromosome 13. The results of phenotypic identification showed that PoFSP in Line R92 was significantly higher than that of its parents and reached an extremely significant level (P ≤ 0.01), which was a superparental material, and its target introduction segment size was 4.9 Mb (Figures 4A, B). Therefore, R92 was selected as the follow-up research material.

FIGURE 4
www.frontiersin.org

Figure 4 Genotypic background and phenotypic identification of R92. (A): The distribution of the ZYD00006 segment in the R92 genome. (B): PoFSP in R92 and the parents. b: “*” means the significant difference at the 0.05 level, and “**” means the significant difference at the 0.01 level.

The secondary segregation population R92-F2, including 121 plants, was constructed by crossing R92 with the recurrent parent. PoFSP was calculated for the R92-F2. The phenotypic data showed that the highest and lowest PoFSP in R92-F2 were 56.00% and 2.38%, respectively. The R92-F2 was rich in phenotypic variation with obvious differences and was suitable for QTL analysis (Figure 5A; Table 2). Eight pairs of SSR markers evenly distributed on the R92 target introduction segment and polymorphic between parents were screened out, linkage analysis was performed on the R92-F2, and the QTL interval was located between markers 13-755 and 13-769. The length of the candidate interval was 1.2 Mb, and PVE was 11.06%. The ADD results showed that the additive effect of the gain came from ZYD00006 (Figure 5B; Table 3).

FIGURE 5
www.frontiersin.org

Figure 5 Phenotypic identification and QTL results of fine-mapping populations. (A): Frequency histogram of the phenotypic distribution of PoFSP in the R92-F2. (B): QTL mapping results of ICIM method in R92-F2. (C): Frequency histogram of the phenotypic distribution of PoFSP in the H1. (D): QTL mapping results of ICIM method in H1.

TABLE 2
www.frontiersin.org

Table 2 Phenotypic statistics of PoFSP in the secondary segregated population.

TABLE 3
www.frontiersin.org

Table 3 Mapping of PoFSP QTLs by secondary segregated population ICIM.

RHLs(H1) is constructed from an individual in R92-F2 whose genotype is completely heterozygous in the target interval and whose remaining background is relatively pure. In H1 containing 126 plants, the highest PoFSP was 47.22%, and the lowest PoFSP was 0.00. The H1 was rich in phenotypic variation with obvious differences and was suitable for QTL analysis (Figure 5C; Table 2). Ten pairs of SSR markers evenly distributed in the initial mapping QTL interval and polymorphic between parents were screened out, and H1 was genotyped. A total of one PoFSP QTL with PVE of 19.47% was detected. The ADD results showed that the additive effect of gain still came from ZYD00006 (Figure 5D; Table 3). Finally, qPoFSP13-1 was fine-mapped between the two flanking markers 13-766 and 13-769, and the candidate interval length was 100 kb (Figure 6).

FIGURE 6
www.frontiersin.org

Figure 6 Schematic diagram fine mapping of qPoFSP13-1. www.frontiersin.orgrepresents the ZYD00006 genotype, www.frontiersin.orgrepresents the SN14 genotype, and www.frontiersin.orgrepresents the heterozygous genotype. R92, L1 and L2 were the same genotypes as ZYD00006; L3, L4 and L5 were heterozygous genotypes of ZYD00006, and L6 and L7 were the same genotypes as SN14. Different letters indicate the significant differences, while the same letters indicate the non-significant differences.

3.4 Gene screening and prediction within the QTL interval

According to the Williams 82.a2.v1 reference genome sequence information in SoyBase(https://www.soybase.org/) and Phytozome(https://phytozome-next.jgi.doe.gov/), a total of 14 PoFSP candidate genes were annotated in qPoFSP13-1 (Supplementary Table 3). Combined with the resequencing data of the parents of CSSLs, the sequence analysis results showed 13 candidate genes had InDel mutations and 14 candidate genes had SNP mutations in the promoter region, 8 candidate genes had nonsynonymous mutations and 2 candidate genes had InDel mutations in the coding region (Supplementary Table 4; Supplementary Table 5; Supplementary Table 6). The expression levels of 14 candidate genes were analyzed in 6 periods before and after flowering. qRT-PCR results showed that Glyma.13G126000 was expressed at low levels before flowering, on the day of flowering and after flowering in the high-extreme materials L-9 and L-14. However, its expression level gradually increased before flowering, peaked on the day of flowering, and then gradually decreased in the low-extreme materials L-72 and L-93. Glyma.13G126100 was also expressed at low levels during each period of in the high-extreme materials L-9 and L-14. However, its expression in low-extreme materials L-72 and L-93 was higher than that in high-extreme materials as a whole, and its expression also increased gradually before flowering and peaked on the day of flowering, and then gradually decreased in L-72 and L-93. The expression levels of Glyma.13G126300, Glyma.13G126400 and Glyma.13G126700 gradually increased before flowering, peaked on the day of flowering, and then gradually decreased in the high-extreme materials L-9 and L-14. However, it showed low expression in each period in the low-extreme materials L-72 and L-93 (Figure 7). According to the above results, it is predicted that Glyma.13G126000 and Glyma.13G126100 might be negative regulators of PoFSP, while Glyma.13G126300, Glyma.13G126400 and Glyma.13G126700 might be positive regulators of PoFSP. In conclusion, Glyma.13G126000, Glyma.13G126100, Glyma.13G126300, Glyma.13G126400, and Glyma.13G126700 were screened out as candidate genes affecting PoFSP in soybean.

FIGURE 7
www.frontiersin.org

Figure 7 Expression analysis of candidate genes. (A): Phenotypic identification of extreme materials. (B–F): Real-time PCR results of the candidate genes. a: Different letters indicate the significant differences at the 0.05 level, while the same letters indicate the nonsignificant differences. b-f: -3, -2, -1, 0, + 3, +5 represent 3 days before flowering, 2 days before flowering, 1 day before flowering, the day of flowering, 3 days after flowering, 5 days after flowering, respectively.

3.5 Candidate gene haplotype analysis

Using DnaSP 5.0 software, haplotype analysis was performed on the above five candidate genes for PoFSP in the resource population contained 527 varieties. AOV was used to analyze the significance of the phenotypic differences among the major haplotypes. The major haplotypes were divided according to the standard that the number of varieties exceeded 5% of the resource population. All five candidate genes had two or more major haplotypes in the resource population.

Glyma.13G126000 had 2 major haplotypes in the resource population, Hap_2 had 367 resource varieties, and Hap_6 has 84 resource varieties. The results of AOV showed that there was an extremely significant difference in PoFSP between Hap_2 and Hap_6. Through SNP analysis, Glyma.13G126000 had a total of 8 differential SNPs in the resource population, of which 7 SNPs were between Hap_2 and Hap_6. SNP-1009 was located on a motif sequence in the promoter region of Glyma.13G126000, and its mutation might affect the binding of transcription factors to promoters. SNP-1270 mutation did not cause changes in promoter elements, but it was tightly linked to SNP-1009 according to the LD analysis. SNP-2667, SNP-2019, SNP-1560 and SNP-806 were also located in the promoter region, but none of their mutations caused changes in promoter elements. In the coding region, SNP3498 was A in Hap_2, which was consistent with ZYD00006, while it was G in Hap_6, which was consistent with SN14 (Figures 8A, B). This site mutation caused the corresponding translation product to change from Glu to Arg, which might affect the changes in gene structure or function.

FIGURE 8
www.frontiersin.org

Figure 8 Haplotype analysis results of Glyma.13G126000 and Glyma.13G126100.(A): Correlation analysis of the major haplotype phenotype in Glyma.13G126000. (B): LD analysis of SNPs among major haplotypes ofGlyma.13G126000. (C): Correlation analysis of the major haplotype phenotype in Glyma.13G126100. (D): LD analysis of SNPs among major haplotypes ofGlyma.13G126100. a,c: “**” indicates a significant difference at the 0.01 level.

Glyma.13G126100 had a total of 2 major haplotypes in the resource population, Hap_2 had 114 resource varieties, and Hap_3 had 362 resource varieties. The results of AOV showed that there was an extremely significant difference in PoFSP between Hap_2 and Hap_3. Through SNP analysis, Glyma.13G126100 had a total of 6 differential SNPs in the resource population. SNP-1991 was located on the promoter of Glyma.13G126100, and its mutation resulted in an alteration of an MYB transcription factor-related element that might affect gene transcription. SNP-1907 did not cause changes in promoter elements, but it was tightly linked to SNP-1991 by LD analysis. SNP-2773, SNP-2719, and SNP-1227 were also located in the promoter region, but none of their mutations caused changes in the promoter elements. In the coding region, SNP23 was C in Hap_2, which was consistent with SN14 and encoded Pro, while it was G in Hap_3, which was a dominant mutation type different from its parents and encoded Gly (Figures 8C, D). Variations in the translation products caused by the mutations at this site might affect the gene structure or function.

Glyma.13G126300 had a total of 2 major haplotypes in the resource population, Hap_2 had 78 resource varieties, and Hap_3 had 364 resource varieties. The results of AOV showed that there was an extremely significant difference in PoFSP between Hap_2 and Hap_3. Through SNP analysis, Glyma.13G126300 had a total of 15 differential SNPs in the resource population, of which 5 SNPs were between Hap_2 and Hap_3, and all of them were located in the promoter region. There was a motif sequence on SNP-443 and SNP-284 respectively, and thier mutations caused changes in the motif sequence, which might affect transcription factor binding. According to LD analysis, SNP-592 was closely linked with SNP-443 and SNP-284, but SNP-592 did not cause the promoter element to change (Figures 9A, B).

FIGURE 9
www.frontiersin.org

Figure 9 Haplotype analysis results of Glyma.13G126300. (A): Correlation analysis of the major haplotype phenotype in Glyma.13G126300. (B): LD analysis of SNPs among major haplotypes ofGlyma.13G126300. a: “**” indicates a significant difference at the 0.01 level.

Glyma.13G126400 has a total of 2 major haplotypes in the resource population, Hap_1 has 128 resource varieties, and Hap_3 has 364 resource varieties. The results of AOV showed that there was an extremely significant difference in PoFSP between Hap_1 and Hap_3. Through SNP analysis, Glyma.13G126400 has 5 differential SNPs in the resource population, of which 3 SNPs were between Hap_1 and Hap_3, and all of them are located in the promoter region. According to LD analysis, SNP-1785 and SNP-1499 are closely linked with SNP-1723, but mutation of them will not cause changes in the promoter elements, and there is a TATA-box sequence at SNP-1723. Mutations in this sequence may affect the production of TATA-box-binding proteins, possibly resulting in the inhibition of RNA polymerase transcription factor synthesis (Figures 10A, B).

FIGURE 10
www.frontiersin.org

Figure 10 Haplotype analysis results of Glyma.13G126400 and Glyma.13G126700. (A): Correlation analysis of the major haplotype phenotype in Glyma.13G126400. (B): LD analysis of SNPs among major haplotypes ofGlyma.13G126400. (C): Correlation analysis of the major haplotype phenotype in Glyma.13G126700. (D): LD analysis of SNPs among major haplotypes ofGlyma.13G126700. a,c: “**” indicates a significant difference at the 0.01 level.

Glyma.13G126700 had a total of 4 major haplotypes in the resource population. Hap_1 had 75 resource varieties, Hap_3 had 361 resource varieties, Hap_4 had 28 resource varieties, and Hap_6 had 28 resource varieties. AOV and multiple comparisons showed that only Hap_3 and Hap_4 had a significant difference in PoFSP. Through SNP analysis, Glyma.13G126700 had a total of 5 SNPs in the resource population, of which 2 SNPs were between Hap_3 and Hap_4. SNP-2213 was located in a CAAT-box sequence on the promoter, and its mutation might lead to changes in the CAAT-box sequence, which might affect the normal initiation of gene transcription. SNP1410 was located in the 3’UTR, and its mutation might affect the stability of the gene structure. According to LD analysis, although SNP1571 had no mutation in Hap_3 and Hap_4, it was closely linked with SNP1410. In Hap_4, the linkage between the two was broken, which might also destroy the 3’UTR structure and affect the stability of the gene structure (Figures 10C, D).

4 Discussion

Soybean originated in China and is an important oil and cash crop (He et al., 2022). The yield of soybean is affected by a variety of yield factors, such as the number of pods per plant, the number of seeds per pod, 100-seed weight and the number of seeds per plant. It is a quantitative trait susceptible to environmental factors and controlled by multiple genes (Sun et al., 2012). As one of the important factors affecting yield, it is of great significance to improve PoFSP for yield research.

According to the data published on the SoyBase, 425 QTLs related to soybean four-seed pods have been reported to date. In this study, 12 QTLs detected in a single year and 5 QTLs repeatedly detected in two years or more were mapped by PoFSP data and BLUE value in CSSLs from 2016 to 2018. Compared with previous research results, qPoFSP20-2 detected in this study in 2017, 2018 and BLUE value highly overlapped with the Ln loci mapped by Fang et al. (Fang et al., 2013) in 2013, reflecting the reliability of CSSLs mapping results. qPoFSP20-2 was detected three times and its LOD and PVE were the most prominent. But qPoFSP13-1 was detected two times and its LOD and PVE were also outstanding, and coincided with BSA results. In this study, the latter QTL was selected as the research object more convincingly.

In previous studies, the Ln gene named after a narrow leaflet has been frequently studied, and it was an important gene affecting the number of four-seed pods (Takahashi, 1934; Domingo, 1945; Peng et al., 1994). A large number of studies have proven that soybean leaf type and four-seed pods were controlled by pleiotropic genes (Johnson and Bernard, 1962; Weiss, 1970; Dinkins et al., 2002). Fang et al. successfully cloned the Ln gene and found through functional analysis that the Ln gene could increase the number of four-seed pods (Fang et al., 2013). Moreover, while the Ln gene has been studied in depth, other genes related to PoFSP have also been discovered one after another. Wang et al. successfully cloned the soybean GmCYP78A10 gene in 2015 and found that GmCYP78A10 can increase PoFSP (Wang et al., 2015). Wang et al. analyzed transcriptome sequencing to obtain differentially expressed genes in 2022 and found that the Glyma.19G240800 gene was homologous to the Arabidopsis cytochrome P450 gene Klu (Wang et al., 2022). Poretska et al. found that the Klu gene was required for female meiosis during ovule development (Poretska et al., 2020). In this study, CSSLs and constructed RHLs were used to fine-map a QTL for PoFSP (qPoFSP13-1). In addition, qPoFSP13-1 has not been reported in previous studies, and no previously reported PoFSP genes were found in qPoFSP13-1, including the genes mentioned above. Therefore, this study might predict a new gene affecting PoFSP in soybean.

China’s wild soybean resources are rich and widely distributed. Compared with cultivated soybeans, wild soybeans have more pods and a higher protein content (Jin et al., 2017). The application of wild soybean varieties can effectively solve the problem of genetic simplification in the breeding of cultivated soybean. This study used PoFSP phenotype data and BLUE values from 2016-2018 of the wild soybean CSSLs and BSA results to comap the QTL(qPoFSP13-1), and the genes identified by fine mapping with high reliability. The mapping results showed that the QTL had an additive effect of gain from the wild parent. Moreover, the genotypes of the five candidate genes in the resource population with higher haplotypes of PoFSP were similar to those of wild soybean. This indicated that the original gene haplotype affecting PoFSP in wild soybean has been preserved during the process of soybean domestication. These results are of great significance for the study of four-seed pods inheritance.

In this study, five candidate genes (Glyma.13G126000, Glyma.13G126100, Glyma.13G126300, Glyma.13G126400, and Glyma.13G126700) were analyzed and predicted in the fine-mapping interval. All these genes mutated between cultivated and wild parents. According to the Williams 82 reference genome gene annotation information published by the SoyBase and Phytozome websites, Glyma.13G126000, Glyma.13G126100 and Glyma.13G126300 had no annotated gene functions. This study preliminarily predicted that Glyma.13G126000 and Glyma.13G126100 might be negative regulators of PoFSP in soybean through the analysis of gene expression in plant lines having antagonistic extreme phenotype, but Glyma.13G126300 might be a positive regulator of PoFSP in soybean. Additional research is needed to verify its regulatory mechanism on soybean four-seed pods. Glyma.13G126400 belongs to the L14b domain of ribosomal proteins. Ribosomal proteins are an important part of the ribosome and have an important impact on the reproductive process of cells (Jin et al., 2018). Wu et al. (2012) found that the mitochondrial ribosomal protein GCD1 was required for the maturation of Arabidopsis female gametes (Wu et al., 2012). In 2014, Agustin et al. found that the Arabidopsis ribosomal protein L27a promoted female gametophyte development in a dose-dependent manner (Zsögön et al., 2014). Zhang et al. found in 2015 that the Arabidopsis mitochondrial ribosomal protein gene HEART STOPPER was required for seed development (Zhang et al., 2015). Yan et al. found in 2016 that the Arabidopsis ribosomal protein L18aB was necessary for male gametophyte and embryonic development, and Xie et al. found in 2018 that L18aB was involved in the formation of embryonic stalks in early embryonic development, further confirming that L18aB was required for reproductive development (Yan et al., 2016; Xie et al., 2018). In 2017, Lu et al. found that the Arabidopsis mitochondrial ribosomal protein S9 M was required for central cell maturation and endosperm development, and in 2020, they found that S9 M was also involved in male gametogenesis and seed development (Lu et al., 2017; Lu et al., 2020). In 2020, Xiong et al. found that the Arabidopsis ribosomal protein L27a participated in the formation of gametophytes by interacting with its molecular chaperone KETCH1 (Xiong et al., 2020). Luo et al. found in 2020 that the Arabidopsis cytoplasmic ribosomal protein L14B was required for Arabidopsis fertilization (Luo et al., 2020). Therefore, it was speculated that the Glyma.13G126400 gene might perform important functions during the process of ovule differentiation, thereby affecting the differentiation of seed pods in soybean. The Glyma.13G126700 gene expresses the homology domain of Gnk2. Gnk2 is a small-molecule antibacterial peptide (Sawano et al., 2007). Its role in the formation of four-seed pods and ovule differentiation needs further research and verification. qRT-PCR results showed that both Glyma.13G126400 and Glyma.13G126700 might be positive regulators of PoFSP in soybean.

Haplotype analysis is an effective method to study the genetic diversity of genes using soybean germplasm resources. In this study, the resource population contained 527 varieties were used for haplotype analysis of the above five candidate genes for PoFSP. In 2021, the number of one, two, three and four-seed pods at the maturity stage of the resource population were investigated in the field, and PoFSP was calculated. In 2021, the maximum PoFSP in the resource population was 52.78%, the minimum value was 0, and the average value was 9.5%. Skewness and kurtosis were 1.4 and 1.2, respectively, and CV was 1.27 (Table 4; Figure 11). In more than half of the resource population contained 527 varieties, PoFSP was between 0 and 5%. Haplotype analysis of candidate genes for PoFSP in the resource population was helpful for the discovery of rare variation materials in PoFSP. The five candidate genes all had two or more major haplotypes in the resource population. Through significance analysis of phenotypic differences between major haplotypes, it was found that the five candidate genes all had haplotype differences in the resource population. In this study, the number of cultivars exceeded 5% of the total resource population to classify the major haplotypes. Although haplotypes with extreme phenotypes might be missed, major haplotypes could be mined more precisely for allelic diversity. According to the present results, the effects of the above five genes on soybean PoFSP were possible, but the specific effects and mechanisms of each gene on PoFSP need further study.

TABLE 4
www.frontiersin.org

Table 4 Statistics analysis of PoFSP in the resource population.

FIGURE 11
www.frontiersin.org

Figure 11 Frequency histogram of the phenotypic distribution of PoFSP in the resource population.

Conclusion

In this study, PoFSP was comapped to a QTL(qPoFSP13-1) on chromosome 13 in CSSLs by ICIM and BSA methods. F2 and RHL fine mapping populations were constructed for this QTL. The QTL was fine mapped between 23.91 Mb and 24.01 Mb on chromosome 13, and the interval size was 100 Kb. A total of 14 candidate genes were included in this QTL. Through sequence analysis, expression analysis and haplotype analysis, 5 candidate genes and their effects on PoFSP were analyzed and predicted in soybean.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Materials. Further inquiries can be directed to the corresponding authors.

Author contributions

ChL, HJ, and QC conceived the study and designed and managed the experiments. MY,XW and YZ, provided plant lines. FC, LH, CK, TZ and CS performed trials and collected data. FC, CaL and NW completed statistical analyses of phenotypic data. FC, RW and JX contributed to writing the paper. All authors contributed to the article and approved the submitted version.

Funding

This study was financially supported by the National Key Research and Development Program of the “14th Five Year Plan” (Accurate identification of yield traits in soybean)(2021YFD120160204), National Nature Science Foundation of China (U20A2027 and 31771882), Funding Scheme for Introducing High-level Scientific and Technological Innovation Talents by Jilin Research Institutes in 2019.China Agriculture Research System of MOF and MARA (CARS-04-PS07 and CARS-04-PS11), National Key R&D Program of China (2021YFF1001202, 2021YFD1201104-02-02), Natural Science Foundation of Heilongjiang (TD2022C003, JJ2022YX0475), Hundred-thousand and million project of Heilongjiang province for engineering and technology science’ soybean breeding technology innovation and new cultivar breeding (2019ZX16B01-1).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.1104022/full#supplementary-material

Abbreviations

CSSLs, Chromosome segment substitution lines; BSA, Bulked Segregant Analysis; RHL, Residual Heterozygous Lines; BLUE, Best Linear Unbiased Estimator; MAS, Marker-assisted selection; SNP, Single Nucleotide Polymorphisms; InDel, Insertion-Deletion; ANOVA, Analysis of Variance; ICIM, Inclusive Composite Interval Mapping; CIM, Composite Interval Mapping; PoFSP, The proportion of four-seed pods; QTL, Quantitative trait loci; LOD, Logarithm of odds; PVE, Phenotypic Variance Explained; qRT-PCR, Quantitative Real-time PCR; RILs, Recombinant Inbred Line.

References

Alyr, M. H., Pallu, J., Sambou, A., Nguepjop, J. R., Seye, M., Tossim, H.-A., et al. (2020). Fine-mapping of a wild genomic region involved in pod and seed size reduction on chromosome A07 in peanut (Arachis hypogaea l.). Genes 11 (12), 1402. doi: 10.3390/genes11121402

PubMed Abstract | CrossRef Full Text | Google Scholar

Barik, S., Pandit, E., Mohanty, S., Pradhan, S. (2018). QTL mapping for traits at reproductive stage drought stress in rice using single marker analysis. ORYZA-An Int. J. Rice 55 (1), 134–140. doi: 10.5958/2249-5266.2018.00016.4

CrossRef Full Text | Google Scholar

Botero-Ramírez, A., Laperche, A., Guichard, S., Jubault, M., Gravot, A., Strelkov, S. E., et al. (2020). Clubroot symptoms and resting spore production in a doubled haploid population of oilseed rape (Brassica napus) are controlled by four main QTLs. Front. Plant Sci. 11, 604527. doi: 10.3389/fpls.2020.604527

PubMed Abstract | CrossRef Full Text | Google Scholar

Broich, S. L., Palmer, R. G. (1980). A cluster analysis of wild and domesticated soybean phenotypes. Euphytica 29 (1), 23–32. doi: 10.1007/BF00037246

CrossRef Full Text | Google Scholar

Dinkins, R., Keim, K., Farno, L., Edwards, L. (2002). Expression of the narrow leaflet gene for yield and agronomic traits in soybean. J. Heredity 93 (5), 346–351. doi: 10.1093/jhered/93.5.346

CrossRef Full Text | Google Scholar

Domingo, W. E. (1945). Leaflet shape in the soybean. J. Agric. Res. 70, 251.

Google Scholar

Doyle, J. J. (1990). Isolation of plant DNA from fresh tissue. Focus 12, 13–15.

Google Scholar

Fang, C., Li, W., Li, G., Wang, Z., Zhou, Z., Ma, Y., et al. (2013). Cloning of ln gene through combined approach of map-based cloning and association study in soybean. J. Genet. Genomics 40, 93–96. doi: 10.1016/j.jgg.2013.01.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, J., Liu, C., Jiang, H., Hu, G., Chen, Q. (2012). QTL analysis of pod number per plant in soybean under multiple locations. Chin. J. Oil Crop Sci. 34 (1), 1.

Google Scholar

Guo, W. (2004). Origins of soybean cultivation in China. J. Nanjing Agric. Univ. (Social Sci. Edition) 4 (1), 60–69.

Google Scholar

Hasan, N., Choudhary, S., Naaz, N., Sharma, N., Laskar, R. A. (2021). Recent advancements in molecular marker-assisted selection and applications in plant breeding programmes. J. Genet. Eng. Biotechnol. 19 (1), 1–26. doi: 10.1186/s43141-021-00231-1

PubMed Abstract | CrossRef Full Text | Google Scholar

He, R., Shi, Y., Zhang, J., Liang, Y., Zhang, B., Li, J., et al. (2014). QTL mapping for plant height using chromosome segment substitution lines in upland cotton. Acta Agronomica Sin. 40 (3), 457–465. doi: 10.3724/SP.J.1006.2014.00457

CrossRef Full Text | Google Scholar

He, J., Yao, L., Pecoraro, L., Liu, C., Wang, J., Huang, L., et al. (2022). Cold stress regulates accumulation of flavonoids and terpenoids in plants by phytohormone, transcription process, functional enzyme, and epigenetics. Crit. Rev. Biotechnol. 18:1–18. doi: 10.1080/07388551.2022.2053056

CrossRef Full Text | Google Scholar

Jiang, W., Lee, J., Jin, Y.-M., Qiao, Y., Piao, R., Jang, S. M., et al. (2011). Identification of QTLs for seed germination capability after various storage periods using two RIL populations in rice. Molecules Cells 31 (4), 385–392. doi: 10.1007/s10059-011-0049-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, H., Li, C., Li, R., Li, Y., Yin, Y., Ma, Z., et al. (2020). Construction of wild soybean backcross introgression lines. Chin. J. Oil Crop Sci. 42 (01), 8–16. doi: 10.19802/j.issn.1007-9084.2020014

CrossRef Full Text | Google Scholar

Jin, X., Cao, F., Xu, L., Yin, D., Wang, H., Piao, L. (2017). A brief discussion on use of wild soybean to create breeding resources and varieties. J. Northeast Agric. Sci. 42 (01), 12–15. doi: 10.16423/j.cnki.1003-8701.2017.01.004

CrossRef Full Text | Google Scholar

Jin, C., Hou, M., Pan, Y. (2018). Research progress of ribosomal protein function in arabidopsis thaliana. Plant Physiol. J. lant Physiol. J. 54 (02), 203–212. doi: 10.13592/j.cnki.ppj.2017.0319

CrossRef Full Text | Google Scholar

Johnson, H. W., Bernard, R. L. (1962). Soybean genetics and breeding. Adv. Agron. 14, 149–221. doi: 10.1016/S0065-2113(08)60438-1

CrossRef Full Text | Google Scholar

Li, C. (2014). Research progress on population mapping and localization method of soybean QTL. Soybean Sci. Technol. 03), 20–22.

Google Scholar

Li, C., Jiang, H., Zhang, W., Qiu, P., Liu, C., Li, W., et al. (2008). QTL analysis of seed and pod traits in soybean. Mol. Plant Breed. 6 (6), 1091–1100.

Google Scholar

Li, Y., Liu, C., Wang, N., Zhang, Z., Hou, L., Xin, D., et al. (2021). Fine mapping of a QTL locus (QNFSP07-1) and analysis of candidate genes for four-seeded pods in soybean. Mol. Breed. 41 (11), 1–16. doi: 10.1007/s11032-021-01265-6

CrossRef Full Text | Google Scholar

Liu, M., Fan, Q. (2021). Study on the current situation and problems of soybean consumption, production and import in China. Grain Sci. And Technol. And Economy 46 (06), 28–35. doi: 10.16465/j.gste.cn431252ts.20210606

CrossRef Full Text | Google Scholar

Li, Y., Wang, X., Huang, S., Chen, L., Ni, Z., Wang, Y., et al. (2016). QTL mapping and epistatic interaction analysis of canopy parameters in soybean. Mol. Plant Breed. 14 (12), 3414–3429. doi: 10.13271/j.mpb.014.003414

CrossRef Full Text | Google Scholar

Luo, A., Zhan, H., Zhang, X., Du, H., Zhang, Y., Peng, X. (2020). Cytoplasmic ribosomal protein L14B is essential for fertilization in arabidopsis. Plant Sci. 292, 110394. doi: 10.1016/j.plantsci.2019.110394

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, C., Xie, Z., Yu, F., Tian, L., Hao, X., Wang, X., et al. (2020). Mitochondrial ribosomal protein S9M is involved in male gametogenesis and seed development in arabidopsis. Plant Biol. 22 (4), 655–667. doi: 10.1111/plb.13108

CrossRef Full Text | Google Scholar

Lu, C., Yu, F., Tian, L., Huang, X., Tan, H., Xie, Z., et al. (2017). RPS9M, a mitochondrial ribosomal protein, is essential for central cell maturation and endosperm development in arabidopsis. Front. Plant Sci. 8, 2171. doi: 10.3389/fpls.2017.02171

PubMed Abstract | CrossRef Full Text | Google Scholar

Michelmore, R. W., Paran, I., Kesseli, R. (1991). Identification of markers linked to disease-resistance genes by bulked segregant analysis: a rapid method to detect markers in specific genomic regions by using segregating populations. Proc. Natl. Acad. Sci. 88 (21), 9828–9832. doi: 10.1073/pnas.88.21.9828

CrossRef Full Text | Google Scholar

Ning, H., Yuan, J., Dong, Q., Li, W., Xue, H., Wang, Y., et al. (2018). Identification of QTLs related to the vertical distribution and seed-set of pod number in soybean [Glycine max (L.) merri]. PLoS One 13 (4), e0195830. doi: 10.1371/journal.pone.0195830

PubMed Abstract | CrossRef Full Text | Google Scholar

Paterson, A. H., Lander, E. S., Hewitt, J. D., Peterson, S., Lincoln, S. E., Tanksley, S. D. (1988). Resolution of quantitative traits into mendelian factors by using a complete linkage map of restriction fragment length polymorphisms. Nature 335 (6192), 721–726. doi: 10.1038/335721a0

PubMed Abstract | CrossRef Full Text | Google Scholar

Peng, Y., Zhu, J., Yang, G., Yuan, J. (1994). Relation of soybean leaf shape distribution to 4-seeded pods. Acta Agronomica Sin. 20 (4), 501–503.

Google Scholar

Poretska, O., Yang, S., Pitorre, D., Poppenberger, B., Sieberer, T. (2020). AMP1 and CYP78A5/7 act through a common pathway to govern cell fate maintenance in arabidopsis thaliana. PLoS Genet. 16 (9), e1009043. doi: 10.1371/journal.pgen.1009043

PubMed Abstract | CrossRef Full Text | Google Scholar

Qiao, W., Qi, L., Cheng, Z., Su, L., Li, J., Sun, Y., et al. (2016). Development and characterization of chromosome segment substitution lines derived from oryza rufipogon in the genetic background of o. sativa spp. indica cultivar 9311. BMC Genomics 17 (1), 1–12. doi: 10.1186/s12864-016-2987-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Sawano, Y., Miyakawa, T., Yamazaki, H., Tanokura, M., Hatano, K.-i. (2007). Purification, characterization, and molecular gene cloning of an antifungal protein from ginkgo biloba seeds. 388 (3), 273–280. doi: 10.2210/pdb2e79/pdb

PubMed Abstract | CrossRef Full Text | Google Scholar

Si, W., Han, T. (2021). China’s soybean yield increase potential and realization path during the “14th Five−Year Plan” Period. Agric. Economic Issues 07, 17–24. doi: 10.13246/j.cnki.iae.2021.07.003

CrossRef Full Text | Google Scholar

Song, H., Cui, X., Ma, H., Zhu, Z., Luan, F. (2012). Construction of DNA fingerprint database based on SSR Marker for Varieties(Lines) of cucumis melo l. Scientia Agricultura Sin. 45 (13), 2676–2689.

Google Scholar

Song, X., Wang, K., Guo, W., Zhang, J., Zhang, T. (2005). A comparison of genetic maps constructed from haploid and BC1 mapping populations from the same crossing between gossypium hirsutum l. and gossypium barbadense l. Genome 48 (3), 378–390. doi: 10.1139/g04-126

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, Y., Shi, X., Jiang, H., Sun, D., Xin, D., Liu, C., et al. (2012). Epistatic effects and qE interaction effects of QTLs for 100 − seed weight in soybean. Chin. J. Oil Crop Sci. 34 (06), 598–603.

Google Scholar

Takahashi, N. (1934). Linkage relation between the genes for the forms of leaves and the number of seeds per pod of soybeans. Jpn. J. Genet. 9, 208–225. doi: 10.1266/jjg.9.208

CrossRef Full Text | Google Scholar

Wakchaure, R., Ganguly, S., Praveen, P., Kumar, A., Sharma, S., Mahajan, T. (2015). Marker assisted selection (MAS) in animal breeding: a review. J. Drug Metab. Toxicol. 6 (5), e127. doi: 10.4172/2157-7609.1000e127

CrossRef Full Text | Google Scholar

Wang, N., Jiang, H., Zhang, Z., Li, Y., Li, C., Song, Y., et al. (2022). Gene mining of 100-grain weight and the number of four-seed pods in soybean (Glycine max). Plant Breed. 141 (2), 143–158. doi: 10.1111/pbr.12993

CrossRef Full Text | Google Scholar

Wang, X., Li, Y., Zhang, H., Sun, G., Zhang, W., Qiu, L. (2015). Evolution and association analysis of GmCYP78A10 gene with seed size/weight and pod number in soybean. Mol. Biol. Rep. 42 (2), 489–496. doi: 10.1007/s11033-014-3792-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Zhang, X., Zhou, R., Sha, A., Wu, X., Cai, S., et al. (2007). QTL analysis of seed and pod traits in soybean RIL population. Acta AGRONOMICA Sin. 33 (3), 441–448.

Google Scholar

Weiss, M. G. (1970). Genetic linkage in soybeans: linkage group I 1. Crop Sci. 10 (1), 69–72. doi: 10.2135/cropsci1970.0011183X001000010027x

CrossRef Full Text | Google Scholar

Wu, J.-J., Peng, X.-B., Li, W.-W., He, R., Xin, H.-P., Sun, M.-X. (2012). Mitochondrial GCD1 dysfunction reveals reciprocal cell-to-cell signaling during the maturation of arabidopsis female gametes. Dev. Cell 23 (5), 1043–1058. doi: 10.1016/j.devcel.2012.09.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Xie, J., Wang, Q., Zhang, Z., Xiong, X., Yang, M., Qi, Z., et al. (2021). QTL-seq identified QTLs and candidate genes for two-seed pod length and width in soybean (Glycine max). Plant Breed. 140 (3), 453–463. doi: 10.1111/pbr.12920

CrossRef Full Text | Google Scholar

Xie, F., Yan, H., Sun, Y., Wang, Y., Chen, H., Mao, W., et al. (2018). RPL18aB helps maintain suspensor identity during early embryogenesis. J. Integr. Plant Biol. 60 (4), 266–269. doi: 10.1111/jipb.12616

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiong, F., Duan, C.-Y., Liu, H.-H., Wu, J.-H., Zhang, Z.-H., Li, S., et al. (2020). Arabidopsis KETCH1 is critical for the nuclear accumulation of ribosomal proteins and gametogenesis. Plant Cell 32 (4), 1270–1284. doi: 10.1105/tpc.19.00791

PubMed Abstract | CrossRef Full Text | Google Scholar

Xi, Z., Zhu, F., Tai, G., Li, Z. (2005). Principle and methodology of QTL analysis in crop. Chin. Agric. Sci. Bull. 01), 88–92+99.

Google Scholar

Xu, Z., Wang, F., Zhou, Z., Meng, Q., Chen, Y., Han, X., et al. (2022). Identification and fine-mapping of a Novel QTL, qMrdd2, that confers resistance to maize rough dwarf disease. Plant Dis. 106 (1), 65–72. doi: 10.1094/PDIS-03-20-0495-RE

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, H., Chen, D., Wang, Y., Sun, Y., Zhao, J., Sun, M., et al. (2016). Ribosomal protein L18aB is required for both male gametophyte function and embryo development in arabidopsis. Sci. Rep. 6 (1), 1–12. doi: 10.1038/srep31195

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Z., Huang, D., Tang, W., Zheng, Y., Liang, K., Cutler, A. J., et al. (2013a). Mapping of quantitative trait loci underlying cold tolerance in rice seedlings via high-throughput sequencing of pooled extremes. PLoS One 8 (7), e68433. doi: 10.1371/journal.pone.0068433

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Z., Sun, Y., Qi, Z., Xin, D., Jiang, H., He, L., et al. (2013b). Analysis of additive effect,epistatic and QE interaction effect for QTL of pod number traits in soybean. J. Agric. Univ. 3), 1–13.

Google Scholar

Yao, D., Liu, Z., Zhang, J., Liu, S., Qu, J., Guan, S., et al. (2015). Analysis of quantitative trait loci for main plant traits in soybean. Genet. Mol. Res. 14 (2), 6101–6109. doi: 10.4238/2015.June.8.8

PubMed Abstract | CrossRef Full Text | Google Scholar

Yuan, R., Huang, Z., Luo, L., Zhao, N., Chen, Y., Liang, Y., et al. (2020). Identification of grain-shattering QTL and preliminary mapping of a related major QTL based on chromosome segment substitution lines (CSSLs) of guangxi common wild rice (Oryza rufipogon griff.). J. South. Agric. 51 (5), 1004–1012. doi: 10.3969/j.issn.2095-1191.2020.05.003

CrossRef Full Text | Google Scholar

Zhang, H., Luo, M., Day, R. C., Talbot, M. J., Ivanova, A., Ashton, A. R., et al. (2015). Developmentally regulated HEART STOPPER, a mitochondrially targeted L18 ribosomal protein gene, is required for cell division, differentiation, and seed development in arabidopsis. J. Exp. Bot. 66 (19), 5867–5880. doi: 10.1093/jxb/erv296

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X., Wang, W., Guo, N., Zhang, Y., Bu, Y., Zhao, J., et al. (2018). Combining QTL-seq and linkage mapping to fine map a wild soybean allele characteristic of greater plant height. BMC Genomics 19 (1), 1–12. doi: 10.1186/s12864-018-4582-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, L., Zhou, H., Lu, L., Liu, L., Li, X., Lin, Y., et al. (2009). Identification of quantitative trait loci controlling rice mature seed culturability using chromosomal segment substitution lines. Plant Cell Rep. 28 (2), 247–256. doi: 10.1007/s00299-008-0641-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng, H., Hou, L., Xie, J., Cao, F., Wei, R., Yang, M., et al. (2022). Construction of chromosome segment substitution lines and inheritance of seed-pod characteristics in wild soybean. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.869455

CrossRef Full Text | Google Scholar

Zhou, L., Wang, S.-B., Jian, J., Geng, Q.-C., Wen, J., Song, Q., et al. (2015). Identification of domestication-related loci associated with flowering time and seed size in soybean with the RAD-seq genotyping method. Sci. Rep. 5 (1), 1–8. doi: 10.1038/srep09350

CrossRef Full Text | Google Scholar

Zhuo, R., Chen, H., Wang, X., Zhang, X., Shan, Z., Wu, X., et al. (2009). QTL analysis of yield, yield components, and lodging in soybean. Acta AGRONOMICA Sin. 35 (5), 821–830. doi: 10.3724/SP.J.1006.2009.00821

CrossRef Full Text | Google Scholar

Zou, C., Wang, P., Xu, Y. (2016). Bulked sample analysis in genetics, genomics and crop improvement. Plant Biotechnol. J. 14 (10), 1941–1955. doi: 10.1111/pbi.12559

PubMed Abstract | CrossRef Full Text | Google Scholar

Zsögön, A., Szakonyi, D., Shi, X., Byrne, M. E. (2014). Ribosomal protein RPL27a promotes female gametophyte development in a dose-dependent manner. Plant Physiol. 165 (3), 1133–1143. doi: 10.1104/pp.114.241778

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: wild soybean, CSSLs, BSA, secondary segregation population, QTL for the proportion of four-seed pods, candidate gene

Citation: Cao F, Wei R, Xie J, Hou L, Kang C, Zhao T, Sun C, Yang M, Zhao Y, Li C, Wang N, Wu X, Liu C, Jiang H and Chen Q (2023) Fine mapping and candidate gene analysis of proportion of four-seed pods by soybean CSSLs. Front. Plant Sci. 13:1104022. doi: 10.3389/fpls.2022.1104022

Received: 21 November 2022; Accepted: 23 December 2022;
Published: 18 January 2023.

Edited by:

Zhengjun Xia, Chinese Academy of Sciences (CAS), China

Reviewed by:

Ying Huang, UMR9213 Institut des Sciences des Plantes de Paris Saclay (IPS2), France
Yuefeng Guan, Fujian Agriculture and Forestry University, China
Hai Nian, South China Agricultural University, China

Copyright © 2023 Cao, Wei, Xie, Hou, Kang, Zhao, Sun, Yang, Zhao, Li, Wang, Wu, Liu, Jiang and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chunyan Liu, Y3lsaXVjbkAxMjYuY29t; Hongwei Jiang, ajM5OTQxMDJAMTI2LmNvbQ==; Qingshan Chen, cXNoY2hlbkAxMjYuY29t

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.