- 1Division of Genomic Resources, Indian Council of Agricultural Research (ICAR)-National Bureau of Plant Genetic Resources, New Delhi, India
- 2Amity Institute of Biotechnology, Amity University, Noida, India
- 3ICAR-National Bureau of Plant Genetic Resources, Regional Station Akola, Maharashtra, India
- 4Division of Germplasm Evaluation, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
- 5Division of Germplasm Conservation, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
Early flowering, maturity, and plant height are important traits for linseed to fit in rice fallows, for rainfed agriculture, and for economically viable cultivation. Here, Multi-Locus Genome-Wide Association Study (ML-GWAS) was undertaken in an association mapping panel of 131 accessions, genotyped using 68,925 SNPs identified by genotyping by sequencing approach. Phenotypic evaluation data of five environments comprising 3 years and two locations were used. GWAS was performed for three flowering time traits including days to 5%, 50%, and 95% flowering, days to maturity, and plant height by employing five ML-GWAS methods: FASTmrEMMA, FASTmrMLM, ISIS EM-BLASSO, mrMLM, and pLARmEB. A total of 335 unique QTNs have been identified for five traits across five environments. 109 QTNs were stable as observed in ≥2 methods and/or environments, explaining up to 36.6% phenotypic variance. For three flowering time traits, days to maturity, and plant height, 53, 30, and 27 stable QTNs, respectively, were identified. Candidate genes having roles in flower, pollen, embryo, seed and fruit development, and xylem/phloem histogenesis have been identified. Gene expression of candidate genes for flowering and plant height were studied using transcriptome of an early maturing variety Sharda (IC0523807). The present study unravels QTNs/candidate genes underlying complex flowering, days to maturity, and plant height traits in linseed.
Introduction
Linseed (Linum usitatissimum L.) is one of the earliest domesticated crops of the world and has been used for seed oil and fiber purposes since ancient times. It is a self-pollinated annual crop with a genome size of 373 Mb (2n = 2× = 30) (Wang et al., 2012). Linseed is considered to have originated in the central Asiatic center (northwest India), the near-eastern center, the Mediterranean center, and the Abyssinian Centre (Vavilov 1951). There are two morphotypes of linseed/flaxseed, flax type and linseed type, which differ substantially in terms of growth, development, and agronomic performance (Diederichsen and Ulrich, 2009; Soto-Cerda et al., 2013). For linseed type, short height, high branching, and high number of capsules and seed weight are desirable traits.
Linseed is one of the richest sources of omega-3 alpha linolenic acid (55–57%), which has tremendous cardiovascular benefits. Additionally, seeds of linseed have plenty of bioactive compounds such as lignans and soluble fibers, which are known for anticancer properties and reducing blood cholesterols, respectively (Green et al., 2008). In recent times, linseed is gaining popularity as a nutraceutical and functional food (Bassett et al., 2009; Goyal et al., 2014). India is one of the major linseed-producing countries (ranks 6th in the world) with a production of 174,000 tonne from 320,000 ha. area (FAOSTAT 2018). However, linseed productivity in India is far below (0.543 tonne/ha) the world average (1.053 tonne/ha) since it is mainly grown as a rainfed crop with limited or no additional resources such as irrigation and fertilizers (Kaur et al., 2017). High temperature during flowering even for a brief spell has significant negative impact on the seed set, which contributes to reduced seed yield (Ford and Zimmerman 1964; Wheeler et al., 2000; Jagadish et al., 2008). Early flowering and maturity are desirable traits for linseed cultivation in different countries (Miller et al., 2001; Hall et al., 2016; Sun et al., 2019) as it helps avoid frost, drought, and terminal heat and are suited for rainfed and Utera cultivation (Singh and Satapathy, 2019).
Flowering regulation in linseed is complex and affected by photoperiod as well as temperature. Linseed is a facultative long-day plant, and flowering initiation in short days is essential for the onset of the reproductive stage to facilitate early maturity (Domantovich et al., 2012). There also exist genotypic differences for photoperiod response (Zhang 2013; Sirohi and Wasnik 2018; Sun et al., 2019; Saroha et al., 2022). Genetic dissection of flowering time, maturity, and complex agro-morphological traits could enable tailoring locally adaptable high-yielding varieties in linseed.
For identifying genes/genomic regions underlying the complex traits, biparental quantitative trait loci (QTLs) and genome-wide association study (GWAS) are being used (Mackay et al., 2009; Burghardt et al., 2017). GWAS has the advantage over biparental QTL mapping on two fronts. The latter assays genetic variation limited to two parents and fewer recombination events. GWAS assays a wide swathe of natural variation through population-scale samples and also takes into account the historic recombination events across lineages, enabling a finer resolution of QTL (Burghardt et al., 2017). GWAS has been widely used in a wide range of plants such as rice, wheat, foxtail millet, chickpea, and greengram (Kumar et al., 2015; Jaiswal et al., 2019; Kumar et al., 2020; Reddy et al., 2020; Chaurasia et al., 2021). In flax, this approach was recently used for genetic dissection of fatty acid biosynthesis and agro-morphological traits (such as plant height, fiber percentage, 50% flowering, seed weight, and capsule and branch numbers), mucilage, and seed hull content (Xie et al., 2018a; Xie et al., 2018b, Singh et al., 2019; Soto-Cerda et al., 2018).
Advances in next-generation sequencing technologies have revolutionized research in agriculture and medical sciences. SNPs identified in the high-throughput manner have enabled association between phenotype and genotype (Wickland et al., 2017). Genotyping by Sequencing (GBS), a reduced representation sequencing approach (Hirsch et al., 2014), has become popular in agriculture owing to low cost and high-throughput genotyping of large number of accessions (Elshire et al., 2011; Wickland et al., 2017). GBS has been employed successfully in several crops such as barley, chickpea, and greengram (Kujur et al., 2015; Milner et al., 2019; Reddy et al., 2020).
In linseed/flaxseed, 340 QTLs have been identified by making use of SSR or SNP markers for 31 traits including 24 seed yield, seed quality, and agronomic traits (You and Cloutier, 2020; Soto-Cerda et al., 2021). For days to flowering, days to maturity, and plant height, there are a total of 28, 2, and 30 known QTLs, respectively, from previous studies (Soto-Cerda et al., 2014, 2021; Wu et al., 2018; You et al., 2018; You and Cloutier, 2020). Most of these studies have been undertaken on the Canadian flax core collection (Diederichsen et al., 2013; Soto-Cerda et al., 2014; 2021), or core collections from China and other countries (Xie et al., 2018a; Xie et al., 2018b) which have no or limited representation of linseed genetic resources from the Indian sub-continent. The natural genetic variation underlying the linseed accessions of the National Gene Bank (NGB), India, is yet to be tapped and utilized for dissection of complex agro-morphologically important traits. The present study was conducted with the aim of genetic dissection of flowering time, days to maturity, and plant height using genome-wide association strategy in a diverse panel of linseed accessions from NGB, India.
Materials and Methods
Plant Materials, Field Evaluation, and Statistical Analysis
A total of 220 diverse linseed germplasm accessions (53 exotic collections and 167 indigenous collections) from NGB, Indian Council of Agricultural Research-National Bureau of Plant Genetic Resources, were evaluated at New Delhi (28°38′53.7″N 77°09′05.4″E) for 2 years, 2017–18 (DL17-18) and 2018–19 (DL18-19), as reported earlier by our group (Saroha et al., 2022). The same set was also evaluated in three more environments including 2019–20 at New Delhi (DL19-20) and at Akola (20°42′03.2″N 77°01′53.6″E) for 2 years, 2018–19 (AK18-19) and 2019–20 (AK19-20). The evaluation was carried out following Augmented Block Design (ABD) in six blocks with three checks (T-397, Shekhar, and Kartika), replicating twice in each block. Each accession was grown in 3-m single rows with a distance of 45 cm between two rows. ICAR-NBPGR descriptors (Mahajan et al., 2000) were followed for recording the agro-morphological data. Plant height (PH) was recorded on three randomly selected plants from the middle of the row. For flowering time (days to 5% flowering: DF5, days to 50% flowering: DF50, and days to 95% flowering: DF95) and days to maturity (days to 80% physiological maturity: DM) traits, the entire row/accession was considered for recording the observations. Data of each individual environment was analyzed using augmentedRCBD, R package version 0.1.5 (Aravind et al., 2021), following which the adjusted means were calculated. An association mapping (AM) panel of 131 diverse accessions was constituted following standard procedure to include maximum trait variation for the studied traits. For descriptive statistics, adjusted means of selected 131 accessions were filtered from a total of 220 accessions. Descriptive statistics for the AM panel were calculated using PAST software (v4.04) (Hammer et al., 2001). The AM panel consisted of 85 indigenous accessions and 46 exotic accessions (Supplementary Table S1).
Genotyping, Population Structure, and Linkage Disequilibrium
Genomic DNA was isolated from two-week-old seedlings grown from seeds of the single plant progeny of each accession using a DNeasy Plant Mini Kit (QIAGEN). The ApeKI-digested, adapter-ligated, amplified, and purified 131-plex final DNA library was quantified using a Bioanalyzer (Agilent Technologies) and were sequenced on a single lane of the Illumina HiSeqTM X10 platform (Illumina® Inc., San Diego, CA, United States). Cleaned reads were mapped to reference genome Linum usitatissimum (You et al., 2018) downloaded from NCBI (Pseudomolecule level) using the MEM algorithm of BWA (v0.7.5) (Li, 2013). Variant calling was done using the GATK pipeline (v3.6), (Geraldine and Brian, 2020). Variants were filtered and indels were removed using vcftools (v0.1.17) (Danecek et al., 2011), keeping only biallelic SNPs. For excluding the rare alleles which may also arise due to genotyping errors, all SNPs were filtered at read-depth 10, stringent MAF of 8%, and missing data <20%. Population structure was estimated using an admixture-based model in STRUCTURE software (v2.3.4) (Pritchard et al., 2000). Three runs were performed for each number of the population (K) set from 3 to 9. Burn-in time and MCMC replication number were set to 100,000 and 300,000, respectively, for each run. The most probable K-value was determined using Structure Harvester, using the log probability of the data [LnP(D)] and delta K (ΔK) based on the rate of change in [LnP(D)] between successive K-values. PCA was calculated using PLINK (version 1.9) and then plotted using ‘R.’ The dendrogram was constructed using TASSEL (v4.0) (Bradbury et al., 2007) with the Neighbour-Joining method and then plotted with the Structure Q matrix using iTOL (Letunic and Bork, 2021). Genome-wide LD was estimated using the squared correlations of allele frequency (r2), using TASSEL (v5.2.73) with a sliding window size of 50. LD decay distance for the genome was estimated by plotting the scatterplot of LD r2 values between marker pairs and physical distance.
Genome-Wide Association Study (GWAS)
Six multi-locus models, FASTmrEMMA (Wen et al., 2018), FASTmrMLM (Zhang and Tamba, 2018), ISIS EM-BLASSO (Tamba et al., 2017), mrMLM (Wang et al., 2016), pLARmEB (Zhang et al., 2017), and pKWmEB (Ren et al., 2018) implemented in the mrMLM package v4.0.2 (Zhang et al., 2020) of ‘R,’ have been used in this study. Default values were used on all the parameters for the analysis. SNP genotyping information of the AM panel of 131 accessions and phenotyping information of five traits (DF5, DF50, DF95, DM, and PH) were used for the five individual environments independently for GWAS. QTNs with a threshold of Logarithm of Odds (LOD) score ≥3.0 were considered to be significantly associated with the trait. Manhattan and QQ plots were generated using the mrMLM package v4.0.2 of ‘R’ (Zhang et al., 2020).
Identification of Candidate Genes and Gene Expression Study
Genes around the 30 kb sequence (30 kb upstream and downstream, total 60 kb) of the stable QTNs were extracted following the flax genome annotation (You et al., 2018; You and Cloutier, 2020). Putative candidate genes were filtered based on functional annotation and homology with the Arabidopsis ortholog. Functional annotation of the genes was performed using the PANNZER2 tool (Törönen and Holm, 2022).
Differential gene expression of the candidate genes for flowering time (DF5, DF50, and DF95) and PH was studied using transcriptome (BioProject ID: PRJNA773597) of the floral bud at two developmental stages, flower, leaf, and stem of the early flowering variety Sharda (IC0523807). Reads per kilobase of transcript per million fragments mapped (RPKM) values were estimated to get the expression levels of candidate genes. For DM, expression of the candidate genes was performed in silico. Gene expression data and protein sequences of the rice variety Nipponbare were downloaded from the Rice Genome Annotation Project database (http://rice.uga.edu/index.shtml). Putative candidate genes of the DM trait were aligned against the protein sequences of Nopponbare, and the best hit was considered for expression analysis. Expression analysis was performed for the following developmental stages: Embryo—25 Days After Pollination (DAP) (SRX100753), Endosperm—25 DAP (SRX100754) and Seed—5 DAP (SRX100749) and 10 DAP (SRX100755). Heatmap plots of gene expression were generated using the ComplexHeatmap package v2.10.0 (Gu et al., 2016) of ‘R.’
Results
Phenotypic Variation
All the studied traits, DF5, DF50, DF95, DM, and PH, were heterogeneous for error variances; therefore, analysis for the respective environments was done independently. The extent of variation among the germplasm accessions was very high for flowering time traits. Out of the five environments, the lowest value for flowering time initiation (DF5) was 39.86 days, whereas the highest value was 123.81 days, with a maximum coefficient of variation (CV) of 19.23 (Table 1). For DF50, the minimum and maximum values were 48.14 and 128.5 days, respectively, with a maximum CV of 17.96. For completion of flowering (DF95), 53.56 and 134.89 days were minimum and maximum, respectively, with the highest CV being 18.07. In case of days to maturity, the extent of variation was relatively less than that of the flowering time traits. The minimum and maximum days to attain the physiological maturity were 102.11 and 158.17, respectively, with a maximum CV of 5.41. For the plant height trait, the range was 32.90–105.23 cm, with a maximum CV of 18.71 from all five environments. The variation among the accessions for the individual environment was also very high (Table 1; Figure 1) For all five traits, there were conspicuous differences in trait expression between the Akola and New Delhi regions, the former being the 2nd and the latter the 3rd zone of linseed growing areas of India (Figure 1). Linseed accessions showed earlier flowering, fewer days to maturity, and relatively shorter height in Akola than in New Delhi. Overall, for the AM panel, analysis of variance in all five environments showed significant variation for all five traits (Supplementary Table S2).
TABLE 1. Descriptive statistics of association mapping panel of 131 accessions for flowering time, days to maturity, and plant height in five environments.
FIGURE 1. Phenotypic variation depicted in bean plots in the association mapping panel of linseed accession for days to 5% flowering, DF5 (A); 50%, DF50 (B); 95%, DF95 (C); days to maturity, DM (D); and plant height, PH (E) in five environments. AK and DL stand for geographical locations Akola and New Delhi, respectively, and the following figures indicate years. The length of the bean depicts extent of variation and width along with blue lines depict relative number of accessions representing corresponding values on the Y-axis. The horizontal black lines represent the mean value.
Genome-Wide Identification of SNPs by Genotyping by Sequencing Approach
A total of 19.5 million quality reads were obtained by sequencing of ApeKI digested GBS libraries of 131 linseed germplasm accessions. A total of 68,925 high-quality SNPs were obtained with a read depth of 10, <20% missing data, and 8% minor allele frequency, which were mapped on the 15 chromosomes. The highest number of SNPs, 6,342, were mapped on chromosome 3, and the lowest number, 2,759, were mapped on chromosome 10 (Figure 2A). The distribution of SNPs across 15 chromosomes is shown in Figure 2B. To understand the pattern of the population structure in the AM panel of linseed accessions, 68,925 SNPs were employed using an admixture-based model in STRUCTURE software. The number of distinct sub-populations in the AM panel was determined as 4 per the maximum Delta K value plotted against the K using the ‘Structure Harvester’ program (Figure 3A). The bar plot clearly shows four sub-populations (Figure 3B). Sub-population-I consists of a maximum of 73 accessions of which 10 were exotic (EC) and 63 were indigenous (IC). Sub-population-II showed 21 accessions (IC: 11, EC: 10), sub-population-III: 30 accessions (IC: 15, EC: 15), and sub-population-IV showed only 7 accessions (IC: 2, EC: 5). In principal component analysis (PCA), plotting of the first two components against each other shows 4 clusters (Figure 3C). Furthermore, the phylogenetic tree using 68,925 SNPs grouped 131 accessions into four major clusters (Figure 3D). The estimates of r2 for 68,925 SNP loci were used to assess the rate of LD decay with distance. The genome-wide LD declined to 50% of its initial value at about 30 kb (Figure 4).
FIGURE 2. Distribution of SNPs on 15 chromosomes of linseed (A) and density of SNPs across individual chromosomes (B).
FIGURE 3. (A) ΔK plot showing best peak at K = 4. (B) Population structure of the linseed association mapping panel showing 4 sub-populations indicated by distinct colors. (C) Principal component analysis plot of the first two components showing 131 accessions into four clusters. (D) Dendrogram based on the Neighbour-Joining method. Accessions with different colors belong to respective sub-populations as shown in population structure.
FIGURE 4. Genome-wide linkage disequilibrium decay of r2 values (red line) against the physical distance in a panel of 131 accessions. The dotted line indicates a cut-off value of 50%.
Genome-Wide Association Study
Using five ML-GWAS methods, a total of 620 significant QTNs (LOD score of ≥3.0) have been identified for five traits across five environments, of which 335 were unique QTNs. With respect to methods, 26, 94, 144, 115, and 149 QTNs were identified using FASTmrEMMA, FASTmrMLM, ISIS EM-BLASSO, mrMLM, and pLARmEB, respectively; however, there were some common QTNs across methods (Supplementary Table S3). The multi-locus method, pKWmEB, could not identify any significant QTN in any of the environments. Among the significant QTNs, 68, 80, and 76 were for DF5, DF50, and DF95, respectively, whereas 84 and 95 QTNs were identified for DM and PH, respectively (Supplementary Table S3). In order to select the robust QTNs, the QTNs which have been identified either using ≥ two methods and/or in ≥ two environments were considered stable QTNs. From the stable QTNs, those identified in ≥ two methods or in ≥ two environments were considered strong QTNs, while those identified in ≥ two methods along with ≥ two environments were considered very strong QTNs. Accordingly, 21 very strong and 88 strong unique QTNs have been identified for the five traits (Supplementary Table S3). The rest of the significant QTNs were considered as potential QTNs. The total number of significant QTNs for all the studied traits in five environments using five ML-GWAS methods is shown (Supplementary Table S4).
In DF5, there were 21 stable QTNs, of which 5 and 16 were very strong and strong, respectively (Table 2). For DF50, a total of 28 stable QTNs comprised 4 very strong and 24 strong QTNs (Table 3), whereas for DF95, from 24 stable QTNs, 2 were very strong and 22 were strong QTNs (Table 4). Moreover, there were 10, 13, and 12 QTNs for DF5, DF50, and DF95, respectively, which were identified in ≥3 methods and/or environments. From the total of 73 stable QTNs for three flowering time traits, 15 QTNs were co-identified either in all or two of the three flowering time traits. One QTN (Lu07_3538758) showed pleiotropic effect and was identified for DF50 as well as DM. For three flowering time traits, the range of the LOD score and corresponding −log10(p) value ranged from 3.0 to 13.76 and 3.70 to 14.77, respectively, explaining up to 35.28% of flowering time variation (Tables 2−4).
For DM, 30 stable QTNs have been identified, of which 6 and 24 were very strong and strong, respectively (Table 5). For PH, there were 27 stable QTNs comprising 5 very strong and 22 strong QTNs (Table 6). The LOD and −log10(p) score of stable QTNs for DM were 3.03–9.17 and 3.73–10.09, respectively, explaining up to 28.78% of variation in days to maturity. For PH, the LOD and -log10(p) values of stable QTNs were 3.06–12.15 and 3.76–13.13, respectively. The stable QTNs accounted for phenotypic variation up to 36.6% for PH (Table 6). Manhattan plots showing significant QTNs and respective QQ-plots of ML-GWAS for five traits in five environments are shown (Figure 5; Supplementary Figures S1–S4). Positions of stable QTNs identified for flowering time (DF5, DF50, and DF95), DM, and PH have been depicted on the 15 chromosomes of linseed (Figure 6).
FIGURE 5. Manhattan plots and quantile-quantile plots for days to 5% (DF5), 50% (DF50), and 95% flowering (DF95), days to maturity (DM), and plant height (PH) using five ML-GWAS methods for environment DL19-20. The dotted lines in Manhattan plots show a threshold at LOD score of ≥3.0. The dots above the threshold depict significant QTNs in the respective chromosome. The pink dots depict significant QTNs identified by ≥2 methods.
FIGURE 6. Chromosomal positions of stable QTNs for flowering time (DF5, DF50, and DF95) (blue dots), days to maturity (DM) (red dots), and plant height (PH) (green dots) traits in linseed.
Identification of Candidate Genes
Genes around the 30 kb region (up and downstream, total 60 kb) of the 109 stable and unique QTNs were considered as the putative candidate genes based on their function, homology with the Arabidopsis ortholog, and pathway analysis. From a total of 201, 273, and 240 genes in the 30 kb region (Supplementary Table S5) of the stable QTNs, 35, 24, and 45 putative candidate genes (Supplementary Table S6) were identified for DF5, DF50, and DF95 traits, respectively. Meanwhile, for DM and PH, 128 and 46 candidate genes were considered as putative candidate genes from the total of 280 and 253 genes, respectively (Supplementary Tables S5, S6). Functional annotation of the putative candidate genes and respective GO prediction to enzyme classes (EC) and pathways (KEGG) have been shown in Supplementary Tables S7–S11. In addition, 70 of the stable QTNs were found within the genic regions (Table 7).
For flowering time traits (DF5, DF50, and DF95), the potential candidate genes included Lus10002500 (Glycerol-3-phosphate 2-O-acyltransferase 6), Lus10023256 (Protein POLLENLESS 3), Lus10024180 (Protein JINGUBANG), Lus10023257 (Small RNA 2′-O-methyltransferase), Lus10022584 (A PUTATIVE RECOMBINATION INITIATION DEFECT 1, PRD1), Lus10033883 (Protein Kinesin Light Chain-Related 2), Lus10042078 (KDEL-tailed cysteine endopeptidase CEP1), Lus10040256 (BEL1-like homeodomain protein 9), Lus10013910 (G-type lectin S-receptor-like serine/threonine-protein kinase RKS1), Lus10026767 (FAR1-related sequence 3), Lus10026770 (F-box protein 2), Lus10006489 (ABA responsive elements-binding factor 2), Lus10006587 (bHLH protein), Lus10026766 (C2H2 zinc-finger protein SERRATE), Lus10010121 (SET domain protein 14), Lus10019086 (MYB domain protein 106), Lus10010119 (Cytochrome P450 family 703, subfamily A, polypeptide 2), and Lus10000989 (maternal effect embryo arrest 18) (Table 7, Supplementary Table S6).
For DM, important candidate genes included Lus10037719 (Two-component response regulator ARR1), Lus10039906 (3-Ketoacyl-CoA synthase 19), Lus10010681 (Protein LURP-one-related 8), Lus10004130 (Bifunctional 3-dehydroquinate dehydratase/shikimate dehydrogenase), Lus10001717 (Protein pleiotropic regulatory locus 1), Lus10002133 (CSC1-like protein), Lus10001717 (Protein pleiotropic regulatory locus 1), Lus10002492 (AGAMOUS-like), Lus10004131 (FRIGIDA-like protein), and Lus10004132 (FRIGIDA-like protein) (Table 7; Supplementary Table S6).
For PH, the potential candidate genes included Lus10021899 (LRR receptor-like serine/threonine-protein Kinase GSO2), Lus10026287 (5′-methylthioadenosine/S-adenosylhomocysteine nucleosidase), Lus10005957 (Protein MAINTENANCE OF MERISTEMS), Lus10000613 and Lus10014757 (lateral organ boundaries domain protein), Lus10018946 (RH39), Lus10035590 (GDSL-like Lipase), and Lus10038763 (Phototropic-responsive NPH3 family protein) (Table 7; Supplementary Table S6).
Expression of Candidate Genes
Putative candidate genes for flowering time (DF5, DF50, and DF95) and PH traits were validated using transcriptome data of floral buds at two developmental stages, flower, leaf, and stem tissues in early flowering and maturing variety of Sharda (IC0523807) from the AM panel. Most of the candidate genes for flowering time showed higher expression in at least one of the three floral tissues than in the leaf and/or stem (Figure 7). In case of plant height, 26 of the 46 putative candidate genes showed expression in stem tissue. For DM, expression of candidate genes was studied using in silico expression making use of rice RNA-seq data of seeds at 5 and 10 DAP, embryo, and endosperms at 25 DAP. All but 10 of the total candidate genes showed expression in at least one of the four studied tissues (Supplementary Figure S5).
FIGURE 7. Heat map of expression level of candidate genes for three flowering time traits (DF5, DF50, and DF95) and PH using RNA-seq data of three reproductive tissues (floral buds at two stages of development and flower) and two vegetative tissues (leaf and stem) of early flowering variety Sharda (IC0523807).
Discussion
Trait Variation in Linseed
For the present study, evaluation data of five environments comprising two latitudinally distant geographical locations, New Delhi (28°38′53.7″N 77°09′05.4″E) (years: 2017–18, 2018–19 and 2019–20) and Akola (20°42′03.2″N77°01′53.6″E) (years: 2018–19 and 2019–20), were used. There was significant trait variation recorded within the AM panel for all five traits as evident in the range values (Table 1; Figure 1). Additionally, there was significantly high error variance (environmental factor) for all the studied traits, which impelled us to perform the statistical analysis independently for each individual environment (Table 1). A similar approach has been followed by other researchers for field phenotyping in GWAS studies in linseed (Xie et al., 2018b; Soto-Cerda et al., 2021). The difference in the trait values of both the locations was conspicuous not only for flowering and days to maturity traits but also for plant height (Figure 1). Overall, in Akola, relative to New Delhi, plants were shorter, with earlier flowering and fewer days to maturity. Similar differences in the life cycle of linseed across the latitude in the western Canadian provinces and in southern Chile have also been reported (Soto-Cerda et al., 2021). This could be due to the effect of the photoperiod and temperature on growth and development in linseed (Davidson and Yermanos 1965; Sirohi and Wasnik 2018).
Importance of Flowering and Maturity Time in Linseed
Different parts of linseed-growing countries have different suitability for early or late flowering and maturity based on prevailing short or long seasons (Sasaki et al., 2018; Raman et al., 2019). Early flowering and maturity are desirable traits in flax-growing regions of Canada and for its expansion in the Canadian Prairies (Miller et al., 2001; Sun et al., 2019). Early maturity is also desirable in south Asian countries, where water is a limiting factor and flowering or maturity coincides with higher temperature so as to avoid adverse effect on seed set and yield (Ford and Zimmerman 1964; Wheeler et al., 2000; Jagadish et al., 2008; Hall et al., 2016). Linseed grown in rice fallows to utilize residual moisture (Kaur et al., 2017; Kaur et al., 2019) also requires early maturing short duration varieties. Cultivation of linseed in rice fallows would not only make efficient use of natural resources but would also bring economic gain for farmers with meager inputs (Singh and Satapathy, 2019). This entails the genetic dissection of flowering time and days to maturity traits to accelerate utilization of genetic and genomic resources for rapid varietal development adapted to different niche areas. Previous study on GWAS in linseed for flowering time trait relied either on days to 5% or 50% flowering (Singh et al., 2019; Soto-Cerda et al., 2021). For further dissection of flowering time, in the present study, GWAS was performed for three flowering time traits, that is, initiation of flowering (days to 5% flowering), days to accomplish 50% flowering, and days to achieve complete flowering (days to 95% flowering). This probably facilitated identification of a higher number of QTNs for flowering time traits than previously reported (You and Cloutier, 2020; Soto-Cerda et al., 2021).
Comparison of QTNs Identified in Present Study With Previously Identified QTNs/QTLs
In flax/linseed, 14 studies have reported a total of 340 QTLs for 31 quantitative traits (You and Cloutier, 2020; Soto-Cerda et al., 2021). 200 of the QTLs were identified based on genetic maps, the scaffold sequences, or the pre-released chromosome-scale pseudomolecules. The work of You and Cloutier (2020) facilitated the mapping of the SSR and SNP markers from different references onto the recently released chromosome-scale pseudomolecules.
For flowering time, there had been very limited QTLs/associated markers (1 QTL and 2 SNPs) identified until 2020 (Soto-Cerda et al., 2014; Singh et al., 2019; You and Cloutier, 2020). Recently, Soto-Cerda et al. (2021) have reported 27 significant QTLs explaining 2.44%–14.71% of the average phenotypic variation for flowering time in flax in a panel of 200 accessions of the flax core collection using multi-locus GWAS methods. In our study, combining three traits of flowering time (DF5, DF50, and DF95), 53 stable (unique) QTNs on a total of 14 of 15 chromosomes were observed, with the highest number of eight QTNs on chromosomes 1 and 3, followed by seven QTNs on chromosome 9. Comparison of QTNs identified in our study with earlier studies revealed close physical proximity of a number of QTNs with previously reported QTLs. SSR marker Lu943 associated with first flowering time QTL (Soto-Cerda et al., 2014; You and Cloutier, 2020) could be located to 1.02 Mb proximity of QTN Lu01_27777416 identified for DF50 and DF95 in this study. The nearest QTL to Lu943 reported in previous studies was 16 Mb apart on chromosome-1 (Soto-Cerda et al., 2021).
Similarly, a detailed comparison of QTLs identified recently (Soto-Cerda et al., 2021) has revealed that 12 QTNs from the present study were within 2.5 Mb physical distance of 12 of the 27 reported QTLs. A few QTNs, Lu13_15811472, Lu03_715991, Lu05_16229944, and Lu02_22483108, were as close as 0.038 Mb (QTL: Lu13_15849708), 0.07 Mb (QTL: Lu3_637116), 0.6 Mb (QTL: Lu5_16839509), and 0.7 Mb (QTL: Lu2_21773820), respectively, to that of corresponding QTLs. This indicates that these QTNs could be co-located to the respective QTLs. Moreover, additional QTNs have been identified for flowering time traits which could be considered as novel QTNs.
For DM, there have been only three QTLs identified previously, two on chromosome 4, QDTM-Lu4.1 (Coordinates: 13170489–15040682) and QDm.BM.crc-LG4 (Coordinates: 14489225–14489333), and one on chromosome 11, QDTM-Lu11.2 (Coordinates: 14767787) (Kumar et al., 2015; You and Cloutier, 2020). In the present study, for DM, a total of 30 stable QTNs have been identified, of which five were on chromosome 4 (Lu04_16826508, Lu04_726,720, Lu04_12997862, Lu04_17812996, and Lu04_19832989) and two on chromosome 11 (Lu11_2812683 and Lu11_3277859). As could be seen above, two of the identified QTNs appear near to the reported QTL, QDTM-Lu4.1 and QDm.BM.crc-LG4 on chromosome 4, which suggests that they are possibly a part of the respective QTL. Other additional QTNs for days to maturity from our study are novel QTNs for DM.
For PH, there are 30 unique QTLs reported so far on 12 of 15 chromosomes (except on 2, 9, and 10) with the highest number of 9 QTLs on chromosome 1, followed by 4 QTLs on chromosome 3 (Zhang et al., 2018, Soto-Cerda et al., 2014; Wu et al., 2018, Xie et al., 2018b; You and Cloutier, 2020). In our study, a total of 27 stable QTNs have been identified on a total of 11 of 15 chromosomes (except on 2, 4, 10, and 12). On chromosome 1, five QTNs (Table 6) have been identified, of which four QTNs were in close proximity to previously identified QTLs. QTN Lu01_10278370 was localized 0.22, 0.72, and 3.6 Mb close to QTLs scaffold59_572553, scaffold344_309662, and QPLH-Lu1.1, respectively, on the pseudomolecule (You and Cloutier, 2020). Other QTNs, Lu01_8023806, Lu01_28491950, and Lu01_25520648, were close to earlier reported QTLs uq.C1–1, Lu943, and QPLH-Lu1.2 with a distance of 1.4 Mb, 0.3 Mb, and 5.5 Mb, respectively. Similarly, 2 of the 4 QTNs identified in our study were in close proximity to earlier reported QTLs/markers. That includes QTNs Lu03_5476022 (scafold156_641874, 0.43 Mb; Marker4371, 0.54 Mb; scaffold31_1800846, 1.5 Mb) and Lu03_21996588 (uq.C3–1, 3.2 Mb), which suggest that the identified QTNs could be part of earlier reported QTL regions of the chromosomes. So, for PH, the present study has identified QTNs which possibly are part of earlier reported QTLs/markers and some novel QTNs as well; however, we also missed some of the earlier reported QTLs. This could be due to different AM populations and number of SNP markers used.
Overall, in the present study, some of the earlier identified QTLs could be co-located as well as novel QTNs identified for the studied traits. High number of QTNs identified in this study shows the power of ML-GWAS methods for genetic dissection of complex traits over the single locus methods. The previous GWAS studies using single locus methods in linseed had revealed fewer associated loci than GWAS using multi-locus methods (Xie et al., 2018a, b; Singh et al., 2019; Soto-Cerda et al., 2014).
Identification of Candidate Genes
From the putative candidate genes for flowering time traits, notable genes included Lus10023257 (small RNA 2′-O-methyltransferase, HEN1, HUA ENHANCER 1) which carried QTN Lu12_2201590 within the gene sequence and functions like AGAMOUS in organ identity specifications in the flower. It also functions in controlling floral determinacy (Chen et al., 2002). Another important gene near the locus was Lus10023256 (Protein POLLENLESS 3) which is essential for male fertility, especially for microspore and pollen grain production in Arabidopsis (Glover et al., 1998). It is specifically involved in the regulation of cell division after male meiosis I and II to facilitate exit from meiosis and transition to G1 (Bulankova et al., 2010).
Lus10022584 (A PUTATIVE RECOMBINATION INITIATION DEFECT 1, PRD1) with QTN, Lu01_10588059 in its genic region, and its homologue in Arabidopsis have roles in initiating meiotic recombination by the mechanism of DNA cleavage that forms the double-strand breaks (DSB) to facilitate recombination (De Muyt et al., 2007).
The ortholog of Lus10026770 (Photo-responsive gene F-box of flowering 2, FOF2) in Arabidopsis negatively regulates flowering as its overexpression results in late flowering both in long-day and short-day photoperiods whereas mutants show early flowering phenotypes (He et al., 2017).
Interestingly, there were several candidate genes which showed potential function related to pollen or pollen tube development, such as Lus10033883 (KINESIN LIGHT CHAIN-RELATED-2, QTN locus: Lu03_19174892), which showed a role in pollen tube growth, and Lus10024180 (Protein JINGUBANG, QTN locus: Lu05_16229944), a negative regulator of pollen germination functions in stabilizing pollen tube growth that also plays a role in preventing pollination in moist environments by inhibiting jasmonic acid synthesis (Ju et al., 2016). Lus10042078 (KDEL-tailed cysteine endopeptidase—CEP1, QTN locus: Lu11_3283122) plays a role in anther wall tapetum and pollen development (Zhang et al., 2014). Other candidate genes which showed roles related to pollen/pollen tube development were Ras-related protein RABA4d (Szumlanski and Nielsen, 2009), F-box protein (Gusti et al., 2009), E3 ubiquitin-protein ligase APD2 (Luo et al., 2012) and among others LOB domain-containing protein 27, Beclin-1-like protein, Delta-1-pyrroline-5-carboxylate synthase A, and G-type lectin S-receptor-like serine/threonine-protein kinase RKS1.
Among the other notable candidate genes for flowering time, Lus10042079 (POOR HOMOLOGOUS SYNAPSIS 1) (QTN locus: Lu11_3283122) plays a role in pairing between homologous chromosomes and accurate chromosome segregation in meiosis (Ronceret et al., 2009). Lus10002500, which harbored the QTN Lu08_31911 within the gene, encodes for glycerol-3-phosphate 2-O-acyltransferase 6. In Arabdopsis, the glycerol-3-phosphate acyltransferase 6 genes are expressed in flowering tissues and have a role in the synthesis of the cutin and floral nano-ridges (Li-Beisson et al., 2009).
Interestingly, some other known flowering-related genes such as FRIGIDA-like protein, AGAMOUS-like, Protein ULTRAPETALA 1, and Chromatin remodeling protein SHL were observed near the QTNs associated with DM. Since the flowering time and days to maturity show positive phenotypic correlation (Saroha et al., 2022), it is possible that the QTLs also harbor underlying genes for both the traits.
For DM, important candidate genes include 2-component response regulator ARR1 which showed a role in the regulation of seed growth (Hill et al., 2013) and Pumilio homolog 24 which is involved in embryogenesis and embryo development ending in seed dormancy (Shanmugam et al., 2017). Bifunctional 3-dehydroquinate dehydratase/shikimate functions in embryo development ending in seed dormancy and shikimate pathways (Pagnussat et al., 2005). Soto-Cerda et al. (2021) have reported candidate genes associated with flowering time such as SUMO activating enzymes, GEM like protein 5, and Mannose-6-phosphate isomerase 1 which were involved in embryo development ending in seed dormancy. It is possible that these genes have pleiotropic effect on both the traits. Protein pleiotropic regulatory locus 1 has a role in cotyledon development and fruit development (Lee et al., 2008). Genes involved in amino acid biosynthesis (3-phosphoshikimate 1-carboxyvinyltransferase), fatty acid biosynthesis (3-Ketoacyl-CoA synthase 19; Probable fructokinase-6), and carbohydrate metabolism (UDP-glucuronate 4-epimerase 3) were also the notable candidate genes for DM.
For PH, candidate genes having a role in phloem or xylem histogenesis (gene: Lus10021899 - LRR receptor-like serine/threonine-protein Kinase GSO2), meristem development and regulation of growth (gene: Lus10026287–5-methylthioadenosine/S-adenosylhomocysteine nucleosidase), plant-type cell wall biogenesis and xylem and phloem pattern formation (gene: Lus10003908 - Microtubule-associated protein 70–5), and cellulose microfibril organization and plant-type secondary cell wall biogenesis (gene: Lus10031972 - COBRA-like protein 4) have been identified. Also, a gene, Lus10015710 (FT-interacting protein 1), involved in long-day photoperiodism and positive regulation of flower development has been found near to QTN, Lu14_6293660 associated with PH, indicating a possible pleiotropic effect on both the traits. Interestingly, the two traits showed a positive correction in linseed, as taller plants showed relatively delayed maturity (Saroha et al., 2022). It has been observed that the FT ortholog in tomato (SFT) can also target growth and termination of vegetative apical meristems. The constitutive expression of the 35S:SFT gene had resulted in shorter internodes, thinner stems, and arrested apices (Lifschitz et al., 2006). The overexpression of SFT had also shown attenuation of intercalary meristems of the stems even before and independent of flower formation. Since SFT/FT orthologs have pleiotropic effect on flowering and growth, it was suggested that the floral transition and growth attenuation were the two facets of the same cellular responses (Lifschitz et al., 2006). The pleotropic effect of major QTL (comprising a cluster of FT genes) for flowering time also had a strong association with growth habit in chickpea (Ortega et al., 2019).
Gene expression profile study showed that most of the candidate genes for flowering time traits and plant height were expressing/showed up-regulation in either the flowering tissues or stem, respectively. Although, the expression profile of candidate genes in the studied tissues gives additional evidence of their role in respective traits, the low or no expression of the candidate genes should not be restrictive at this juncture in terms of their possible role in respective traits as the studied tissues could capture only a limited range of flower/shoot development.
Conclusion
In the present study, GWAS was performed for three flowering time (DF5, DF50, and DF95) traits, days to maturity, and plant height in linseed using 68,925 SNPs and field evaluation data of five environments by employing five ML-GWAS methods. A total of 335 unique QTNs have been identified, of which 109 were stable QTNs comprising 88 strong and 21 very strong QTNs. For three flowering time traits (DF5, DF50, and DF95), a total of 53 stable QTNs have been identified, whereas for days to maturity and plant height, 30 and 27 stable QTNs have been identified, respectively. Several candidate genes having a role in flower/reproductive system development, seed and fruit development, phloem/xylem histogenesis and pattern formation, regulation of growth, and embryo development ending in seed dormancy have been identified. The study could co-locate the known QTLs as well as identify novel QTNs associated with the studied traits. The present study helps in elucidating QTNs and candidate genes underlying flowering time, days to maturity, and plant height in linseed and improves our understanding of genetic associations of these traits in linseed.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/, PRJNA706105.
Author Contributions
DW, VK, and SR conceptualized the project and designed experiments. AS, DP, SG, DC, and SU have done investigation, data recording, and curation. AS and JA have done statistical and GWAS analysis. A performed transcriptome analysis. JA and JR provided germplasm resources. DW, RK, and ASe supervised the project. DW provided resources and funding acquisition and wrote the original draft.
Funding
This research was funded by the Science and Engineering Research Board (SERB), Government of India, in the form of the Early Career Research Award (ECR/2017/001280). Funds for part of the work were also received from the ICAR-NBPGR institutional project (PGR/ DGR-BUR-DEL482 01.01).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
Authors acknowledge funding for the project (ECR/2017/001280) from the Science and Engineering Research Board (SERB), Government of India, and the Indian Council of Agricultural Research-National Bureau of Plant Genetic Resources (ICAR-NBPGR), New Delhi. Authors thank former and present Director, ICAR-NBPGR and Heads, DGR, DGE, for research facilities and facilitation.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.811924/full#supplementary-material
Supplementary Figure S1 | Manhattan plots and quantile-quantile plots for five traits using five ML-GWA methods for environment AK18-19.
Supplementary Figure S2 | Manhattan plots and quantile-quantile plots for five traits using five ML-GWA methods for environment AK19-20.
Supplementary Figure S3 | Manhattan plots and quantile-quantile plots for five traits using five ML-GWA methods for environment DL17-18.
Supplementary Figure S4 | Manhattan plots and quantile-quantile plots for five traits using five ML-GWA methods for environment DL18-19.
Supplementary Figure S5 | Heatmap showing in silico gene expression of putative candidate genes for DM in flax and their rice (Oryza sativa) orthologs based on RNAseq data of four seed developmental stages/tissues of rice obtained from The Rice Genome Annotation Project. DAP, days after pollination.
References
Aravind, J., Mukesh, S. S., Wankhede, D. P., and Kaur, V. (2021). augmentedRCBD: Analysis of Augmented Randomised Complete Block Designs. R package version 0.1.5. 2021Available at: https://cran.r-project.org/package=augmentedRCBD.
Bassett, C. M. C., Rodriguez-Leyva, D., and Pierce, G. N. (2009). Experimental and Clinical Research Findings on the Cardiovascular Benefits of Consuming Flaxseed. Appl. Physiol. Nutr. Metab. 34 (5), 965–974. doi:10.1139/h09-087
Bradbury, P. J., Zhang, Z., Kroon, D. E., Casstevens, T. M., Ramdoss, Y., and Buckler, E. S. (2007). TASSEL: Software for Association Mapping of Complex Traits in Diverse Samples. Bioinformatics 23, 2633–2635. doi:10.1093/bioinformatics/btm308
Bulankova, P., Riehs-Kearnan, N., Nowack, M. K., Schnittger, A., and Riha, K. (2010). Meiotic Progression in Arabidopsis is Governed by Complex Regulatory Interactions between SMG7, TDM1, and the Meiosis I-specific Cyclin TAM. Plant Cell. Nov 22 (11), 3791–3803. doi:10.1105/tpc.110.078378
Burghardt, L. T., Young, N. D., and Tiffin, P. (2017). A Guide to Genome-wide Association Mapping in Plants. Curr. Protoc. Plant Biol. 2 (1), 22–38. doi:10.1002/cppb.20041
Chaurasia, S., Singh, A. K., Kumar, A., Songachan, L. S., Yadav, M. C., Kumar, S., et al. (2021). Genome-wide Association Mapping Reveals Key Genomic Regions for Physiological and Yield-Related Traits under Salinity Stress in Wheat (Triticum aestivum L.). Genomics 113 (5), 3198–3215. doi:10.1016/j.ygeno.2021.07.014
Chen, X., Liu, J., Cheng, Y., and Jia, D. (2002). HEN1functions Pleiotropically in Arabidopsis development and Acts in C Function in the Flower. Dev. Mar. 129 (5), 1085–1094. doi:10.1242/dev.129.5.1085
Danecek, P., Auton, A., Abecasis, G., Albers, C. A., Banks, E., DePristo, M. A., et al. (2011). The Variant Call Format and VCFtools. Bioinformatics 27 (15), 2156–2158. doi:10.1093/bioinformatics/btr330
Davidson, J., and Yermanos, D. M. (1965). Flowering Pattern of Flax (Linum usitatissimum L.) 1. Crop Sci. 5, 23–28. doi:10.2135/cropsci1965.0011183x000500010010x
De Muyt, A., Vezon, D., Gendrot, G., Gallois, J.-L., Stevens, R., and Grelon, M. (2007). AtPRD1 Is Required for Meiotic Double Strand Break Formation in Arabidopsis thaliana. Embo J. 26 (18), 4126–4137. doi:10.1038/sj.emboj.7601815
Diederichsen, A., Kusters, P. M., Kessler, D., Bainas, Z., and Gugel, R. K. (2013). Assembling a Core Collection from the Flax World Collection Maintained by Plant Gene Resources of Canada. Genet. Resour. Crop Evol. 60, 1479–1485. doi:10.1007/s10722-012-9936-1
Diederichsen, A., and Ulrich, A. (2009). Variability in Stem Fibre Content and its Association with Other Characteristics in 1177 Flax (Linum usitatissimum L.) Genebank Accessions. Industrial Crops Prod. 30 (1), 33–39. doi:10.1016/j.indcrop.2009.01.002
Domantovich, A. V., Koshkin, V. A., Brutch, N. B., and Matvienko, I. I. (2012). Investigation of Photoperiod Sensitivity of Linum usitatissimum L. Lines and Effect of Short-Day Conditions on Their Economically Valuable Traits. Russ. Agric. Sci. 38, 173–177. doi:10.3103/s1068367412030056
Elshire, R. J., Glaubitz, J. C., Sun, Q., Poland, J. A., Kawamoto, K., Buckler, E. S., et al. (2011). A Robust, Simple Genotyping-By-Sequencing (GBS) Approach for High Diversity Species. PLoS ONE 6, e19379. doi:10.1371/journal.pone.0019379
FAOSTAT (2018). Food and Agriculture Data. Available at: http://www.fao.org/faostat/en/#data/QC. Accessed 18 May 2020.
Ford, J. H., and Zimmerman, D. C. (1964). Influence of Time of Flowering on Oil Content and Oil Quality of Flaxseed 1. Crop Sci. 4, 653–656. doi:10.2135/cropsci1964.0011183x000400060031x
Geraldine, A. V. D. A., and Brian, D. O. C. (2020). Genomics in the Cloud: Using Docker, GATK, and WDL in Terra. 1st Edition.
Glover, J., Grelon, M., Craig, S., Chaudhury, A., and Dennis, E. (1998). Cloning and Characterization of MS5 from Arabidopsis: a Gene Critical in Male Meiosis. Plant J. 15 (3), 345–356. doi:10.1046/j.1365-313x.1998.00216.x
Goyal, A., Sharma, V., Upadhyay, N., Gill, S., and Sihag, M. (2014). Flax and Flaxseed Oil: an Ancient Medicine & Modern Functional Food. J. Food Sci. Technol. 51 (9), 1633–1653. doi:10.1007/s13197-013-1247-9
Green, A. G., Singh, S. P., Chen, Y., and Dribnenki, J. C. P. (2008). “Flax,” in Compendium Transgenic Crop Plants: Transgenic Oilseed Crops. Editors C. Kole, and T. C. Hall (Oxford, UK: Blackwell Publishing Ltd.), 199–226. doi:10.1002/9781405181099.k0205
Gu, Z., Eils, R., and Schlesner, M. (2016). Complex Heatmaps Reveal Patterns and Correlations in Multidimensional Genomic Data. Bioinformatics 32 (18), 2847–2849. doi:10.1093/bioinformatics/btw313
Gusti, A., Baumberger, N., Nowack, M., Pusch, S., Eisler, H., Potuschak, T., et al. (2009). The Arabidopsis thaliana F-Box Protein FBL17 Is Essential for Progression through the Second Mitosis during Pollen Development. PLoS One 4 (3), e4780. doi:10.1371/journal.pone.0004780
Hall, L. M., Booker, H., Siloto, M. P. R., Jhala, A. J., and Weselake, R. J. (2016). Flax (Linum usitatissimum L.). Ind. oil crops 2016, 157–194. doi:10.1016/b978-1-893997-98-1.00006-3
Hammer, O., Harper, D. A. T., and Ryan, P. D. (2001). PAST: Paleontological Statistics Software Package for Education and Data Analysis. Palaeontol. Electron. 4 (1), 9.
He, L., Xiao, J., Rashid, K., Jia, G., Li, P., Yao, Z., et al. (2019). Evaluation of Genomic Prediction for Pasmo Resistance in Flax. Ijms 20 (2), 359. doi:10.3390/ijms20020359
He, R., Li, X., Zhong, M., Yan, J., Ji, R., Li, X., et al. (2017). A Photo-Responsive F-Box Protein FOF2 Regulates Floral Initiation by promoting FLC expression in Arabidopsis. Plant J. 91 (5), 788–801. doi:10.1111/tpj.13607
Hill, K., Mathews, D. E., Kim, H. J., Street, I. H., Wildes, S. L., Chiang, Y.-H., et al. (2013). Functional Characterization of Type-B Response Regulators in the Arabidopsis Cytokinin Response. Plant Physiol. May 162 (1), 212–224. doi:10.1104/pp.112.208736
Hirsch, C. D., Evans, J., Buell, C. R., and Hirsch, C. N. (2014). Reduced Representation Approaches to Interrogate Genome Diversity in Large Repetitive Plant Genomes. Briefings Funct. Genomics 13 (4), 257–267. doi:10.1093/bfgp/elt051
Jagadish, S. V. K., Craufurd, P. Q., and Wheeler, T. R. (2008). Phenotyping Parents of Mapping Populations of Rice for Heat Tolerance during Anthesis. Crop Sci. 48, 1140–1146. doi:10.2135/cropsci2007.10.0559
Jaiswal, V., Gupta, S., Gahlaut, V., Muthamilarasan, M., Bandyopadhyay, T., Ramchiary, N., et al. (2019). Genome-Wide Association Study of Major Agronomic Traits in Foxtail Millet (Setaria Italica L.) Using ddRAD Sequencing. Sci. Rep. 9, 5020. doi:10.1038/s41598-019-41602-6
Ju, Y., Guo, L., Cai, Q., Ma, F., Zhu, Q.-Y., and Zhang, Q. (2016). Arabidopsis JINGUBANG Is a Negative Regulator of Pollen Germination that Prevents Pollination in Moist Environments. Plant CellSep 28 (9), 2131–2146. doi:10.1105/tpc.16.00401
Kaur, V., Wankhede, D. P., Yadav, R., Aravind, J., and Kumar, A. K. (2019). Linseed Genetic Resources: Present Status and Future Prospects for Enhanced Utilization. New York: Nova Science Publishers.
Kaur, V., Yadav, R., and Wankhede, D. P. (2017). Linseed (Linum usitatissimum L.) Genetic Resources for Climate Change Intervention and its Future Breeding. Jans 9 (2), 1112–1118. doi:10.31018/jans.v9i2.1331
Kujur, A., Bajaj, D., Upadhyaya, H. D., Das, S., Ranjan, R., Shree, T., et al. (2015). A Genome-wide SNP Scan Accelerates Trait-Regulatory Genomic Loci Identification in Chickpea. Sci. Rep. 5, 11166. doi:10.1038/srep11166
Kumar, S., Kumari, J., Bhusal, N., Pradhan, A. K., Budhlakoti, N., Mishra, D. C., et al. (2020). Genome-Wide Association Study Reveals Genomic Regions Associated with Ten Agronomical Traits in Wheat under Late-Sown Conditions. Front. Plant Sci. 11, 549743. doi:10.3389/fpls.2020.549743
Kumar, S., You, F. M., Duguid, S., Booker, H., Rowland, G., and Cloutier, S. (2015). QTL for Fatty Acid Composition and Yield in Linseed (Linum usitatissimum L.). Theor. Appl. Genet. 128, 965–984. doi:10.1007/s00122-015-2483-3
Lee, J.-H., Terzaghi, W., Gusmaroli, G., Charron, J.-B. F., Yoon, H.-J., Chen, H., et al. (2008). Characterization of Arabidopsis and Rice DWD Proteins and Their Roles as Substrate Receptors for CUL4-RING E3 Ubiquitin Ligases. Plant Cell. Jan 20 (1), 152–167. doi:10.1105/tpc.107.055418
Letunic, I., and Bork, P. (2021). Interactive Tree of Life (iTOL) V5: an Online Tool for Phylogenetic Tree Display and Annotation. Nucleic Acids Res. 49 (W1), W293–W296. doi:10.1093/nar/gkab301
Li, H. (2013). Aligning Sequence Reads, Clone Sequences and Assembly Contigs with BWA-MEM, 13033997. arXiv. 3.
Li-Beisson, Y., Pollard, M., Sauveplane, V., Pinot, F., Ohlrogge, J., and Beisson, F. (2009). Nanoridges that Characterize the Surface Morphology of Flowers Require the Synthesis of Cutin Polyester. Proc. Natl. Acad. Sci. U.S.A. 106, 22008–22013. doi:10.1073/pnas.0909090106
Lifschitz, E., Eviatar, T., Rozman, A., Shalit, A., Goldshmidt, A., Amsellem, Z., et al. (2006). The Tomato FT Ortholog Triggers Systemic Signals that Regulate Growth and Flowering and Substitute for Diverse Environmental Stimuli. Proc. Natl. Acad. Sci. U.S.A. 103 (16), 6398–6403. doi:10.1073/pnas.0601620103
Luo, G., Gu, H., Liu, J., and Qu, L.-J. (2012). Four Closely-Related RING-type E3 Ligases, APD1-4, Are Involved in Pollen Mitosis II Regulation in Arabidopsis. J. Integr. Plant Biol. 54 (10), 814–827. doi:10.1111/j.1744-7909.2012.01152.x
Mackay, T. F. C., Stone, E. A., and Ayroles, J. F. (2009). The Genetics of Quantitative Traits: Challenges and Prospects. Nat. Rev. Genet. 10, 565–577. doi:10.1038/nrg2612
Mahajan, R. K., Sapra, R. L., Srivastava, U., Singh, M., and Sharma, G. D. (2000). Minimal Descriptors (For Characterization and Evaluation) of Agri-Horticultural Crops (Part I). New Delhi: National Bureau of Plant Genetic Resources.
Miller, P. R., McDonald, C. L., Derksen, D. A., and Waddington, J. (2001). The Adaptation of Seven Broadleaf Crops to the Dry Semiarid Prairie. Can. J. Plant Sci. 81, 29–43. doi:10.4141/p00-028
Milner, S. G., Jost, M., Taketa, S., Mazón, E. R., Himmelbach, A., Oppermann, M., et al. (2019). Genebank Genomics Highlights the Diversity of a Global Barley Collection. Nat. Genet. 51, 319–326. doi:10.1038/s41588-018-0266-x
Ortega, R., Hecht, V. F. G., Freeman, J. S., Rubio, J., Carrasquilla-Garcia, N., Mir, R. R., et al. (2019). Altered Expression of an FT Cluster Underlies a Major Locus Controlling Domestication-Related Changes to Chickpea Phenology and Growth Habit. Front. Plant Sci. 10, 824. doi:10.3389/fpls.2019.00824
Pagnussat, G. C., Yu, H.-J., Ngo, Q. A., Rajani, S., Mayalagu, S., Johnson, C. S., et al. (2005). Genetic and Molecular Identification of Genes Required for Female Gametophyte Development and Function inArabidopsis. Feb 132 (3), 603–614. doi:10.1242/dev.01595
Pritchard, J. K., Stephens, M., Rosenberg, N. A., and Donnelly, P. (2000). Association Mapping in Structured Populations. Am. J. Hum. Genet. 67, 170–181. doi:10.1086/302959
Reddy, V. R. P., Das, S., Dikshit, H. K., Mishra, G. P., Aski, M., Meena, S. K., et al. (2020). Genome-Wide Association Analysis for Phosphorus Use Efficiency Traits in Mungbean (Vigna Radiata L. Wilczek) Using Genotyping by Sequencing Approach. Front. Plant Sci. 11, 537766. doi:10.3389/fpls.2020.537766
Ren, W.-L., Wen, Y.-J., Dunwell, J. M., and Zhang, Y.-M. (2018). pKWmEB: Integration of Kruskal-Wallis Test with Empirical Bayes under Polygenic Background Control for Multi-Locus Genome-wide Association Study. Heredity 120, 208–218. doi:10.1038/s41437-017-0007-4
Ronceret, A., Doutriaux, M.-P., Golubovskaya, I. N., and Pawlowski, W. P. (2009). PHS1 Regulates Meiotic Recombination and Homologous Chromosome Pairing by Controlling the Transport of RAD50 to the Nucleus. Proc. Natl. Acad. Sci. U.S.A. 106 (47), 20121–20126. doi:10.1073/pnas.0906273106
Saroha, A., Pal, D., Kaur, V., Kumar, S., Bartwal, A., Aravind, J., et al. (2022). Agro-morphological Variability and Genetic Diversity in Linseed (Linum usitatissimum L.) Germplasm Accessions with Emphasis on Flowering and Maturity Time. Genet. Resour. Crop Evol. 69, 315–333. doi:10.1007/s10722-021-01231-3
Shanmugam, T., Abbasi, N., Kim, H. S., Kim, H. B., Park, N. i., Park, G. T., et al. (2017). An Arabidopsis Divergent Pumilio Protein, APUM 24, Is Essential for Embryogenesis and Required for Faithful Pre‐ rRNA Processing. Plant J. 92 (6), 1092–1105. doi:10.1111/tpj.13745
Singh, N., Agarwal, N., and Yadav, H. K. (2019). Genome-wide SNP-Based Diversity Analysis and Association Mapping in Linseed (Linum usitatissimum L.). Euphytica 215, 139. doi:10.1007/s10681-019-2462-x
Singh, T., and Satapathy, B. S. (2019). Intensification of Pulses and Oilseeds Intensification of Pulses and Oilseeds in Rice Fallows. Indian Farm 69 (10), 31–34.
Sirohi, J. S., and Wasnik, K. G. (2018). Photosensitivity at Varietal Level in Linseed. Indian J. Plant Physiol. 21, 127–132.
Soto-Cerda, B. J., Duguid, S., Booker, H., Rowland, G., Diederichsen, A., and Cloutier, S. (2014). Genomic Regions Underlying Agronomic Traits in Linseed (Linum usitatissimum L.) as Revealed by Association Mapping. J. Integr. Plant Biol. 56, 75–87. doi:10.1111/jipb.12118
Soto-Cerda, B. J., Aravena, G., and Cloutier, S. (2021). Genetic Dissection of Flowering Time in Flax (Linum usitatissimum L.) through Single- and Multi-Locus Genome-wide Association Studies. Mol. Genet. Genomics 296 (4), 877–891. doi:10.1007/s00438-021-01785-y
Soto-Cerda, B. J., Diederichsen, A., Ragupathy, R., and Cloutier, S. (2013). Genetic Characterization of a Core Collection of Flax (Linum usitatissimum L.) Suitable for Association Mapping Studies and Evidence of Divergent Selection between Fiber and Linseed Types. BMC Plant Biol. 13, 13–78. doi:10.1186/1471-2229-13-78
Sun, J., Young, L. W., House, M. A., Daba, K., and Booker, H. M. (2019). Photoperiod Sensitivity of Canadian Flax Cultivars and 5-azacytidine Treated Early Flowering Derivative Lines. BMC Plant Biol. 19, 177. doi:10.1186/s12870-019-1763-5
Szumlanski, A. L., and Nielsen, E. (2009). The Rab GTPase RabA4d Regulates Pollen Tube Tip Growth in Arabidopsis thaliana. Plant Cell 21 (2), 526–544. doi:10.1105/tpc.108.060277
Tamba, C. L., Ni, Y.-L., and Zhang, Y.-M. (2017). Iterative Sure Independence Screening EM-Bayesian LASSO Algorithm for Multi-Locus Genome-wide Association Studies. PLoS Comput. Biol. 13, e1005357. doi:10.1371/journal.pcbi.1005357
Tamba, C. L., and Zhang, Y.-M. (2018). A Fast mrMLM Algorithm for Multi-Locus Genome-wide Association Studies. bioRxiv [Preprint]. doi:10.1101/341784
Törönen, P., and Holm, L. (2022). PANNZER -A Practical Tool for Protein Function Prediction. Protein Sci. 31 (1), 118–128. doi:10.1002/pro.4193
Vavilov, N. I. (1951). The Origin, Variation, Immunity and Breeding of Cultivated Plants. Chron. Bot. 13, 1–366. doi:10.1097/00010694-195112000-00018
Wang, S.-B., Feng, J.-Y., Ren, W.-L., Huang, B., Zhou, L., Wen, Y.-J., et al. (2016). Improving Power and Accuracy of Genome-wide Association Studies via a Multi-Locus Mixed Linear Model Methodology. Sci. Rep. 6, 19444. doi:10.1038/srep19444
Wang, Z., Hobson, N., Galindo, L., Zhu, S., Shi, D., McDill, J., et al. (2012). The Genome of Flax (Linum usitatissimum) Assembled De Novo from Short Shotgun Sequence Reads. Plant J. 72, 461–473. doi:10.1111/j.1365-313X.2012.05093.x
Wen, Y.-J., Zhang, H., Ni, Y.-L., Huang, B., Zhang, J., Feng, J.-Y., et al. (2018). Methodological Implementation of Mixed Linear Models in Multi-Locus Genome-wide Association Studies. Brief. Bioinform 19, 700–712. doi:10.1093/bib/bbw145
Wheeler, T. R., Craufurd, P. Q., Ellis, R. H., Porter, J. R., and Vara Prasad, P. V. (2000). Temperature Variability and the Yield of Annual Crops. Agric. Ecosyst. Environ. 82, 159–167. doi:10.1016/s0167-8809(00)00224-3
Wickland, D. P., Battu, G., Hudson, K. A., Diers, B. W., and Hudson, M. E. (2017). A Comparison of Genotyping-By-Sequencing Analysis Methods on Low-Coverage Crop Datasets Shows Advantages of a New Workflow, GB-eaSy. BMC Bioinforma. 18, 586. doi:10.1186/s12859-017-2000-6
Wu, J., Zhao, Q., Zhang, L., Li, S., Ma, Y., Pan, L., et al. (2018). QTL Mapping of Fiber-Related Traits Based on a High-Density Genetic Map in Flax (Linum usitatissimum L.). Front. Plant Sci. 9, 885. doi:10.3389/fpls.2018.00885
Xie, D., Dai, Z., Yang, Z., Sun, J., Zhao, D., Yang, X., et al. (2018b). Genome-wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-Seq. Front. Plant Sci. 8, 2232. doi:10.3389/fpls.2017.02232
Xie, D., Dai, Z., Yang, Z., Tang, Q., Sun, J., Yang, X., et al. (2018a). Genomic Variations and Association Study of Agronomic Traits in Flax. BMC Genomics 19, 512. doi:10.1186/s12864-018-4899-z
You, F. M., and Cloutier, S. (2020). Mapping Quantitative Trait Loci onto Chromosome-Scale Pseudomolecules in Flax. MPs 3, 28. doi:10.3390/mps3020028
You, F. M., Xiao, J., Li, P., Yao, Z., Jia, G., He, L., et al. (2018). Chromosome-scale Pseudomolecules Refined by Optical, Physical and Genetic Maps in Flax. Plant J. 95, 371–384. doi:10.1111/tpj.13944
Zhang, D., Liu, D., Lv, X., Wang, Y., Xun, Z., Liu, Z., et al. (2014). The Cysteine Protease CEP1, a Key Executor Involved in Tapetal Programmed Cell Death, Regulates Pollen Development in Arabidopsis. Plant Cell 26, 2939–2961. doi:10.1105/tpc.114.127282
Zhang, J., Feng, J.-Y., Ni, Y.-L., Wen, Y.-J., Niu, Y., Tamba, C. L., et al. (2017). pLARmEB: Integration of Least Angle Regression with Empirical Bayes for Multilocus Genome-wide Association Studies. Heredity 118, 517–524. doi:10.1038/hdy.2017.8
Zhang, J., Long, Y., Wang, L., Dang, Z., Zhang, T., Song, X., et al. (2018). Consensus Genetic Linkage Map Construction and QTL Mapping for Plant Height-Related Traits in Linseed Flax (Linum usitatissimum L.). BMC Plant Biol. 18, 160. doi:10.1186/s12870-018-1366-6
Zhang, T. (2013). Characterizing the Flax Core Collection for Earliness and Canopy Traits. Saskatoon, Saskatchewan, Canada: MSc thesis, University of Saskatchewan.
Keywords: flowering time, maturity, plant height, multi-locus GWAS, quantitative trait nucleotides, linseed, flaxseed
Citation: Saroha A, Pal D, Gomashe SS, Akash , Kaur V, Ujjainwal S, Rajkumar S, Aravind J, Radhamani J, Kumar R, Chand D, Sengupta A and Wankhede DP (2022) Identification of QTNs Associated With Flowering Time, Maturity, and Plant Height Traits in Linum usitatissimum L. Using Genome-Wide Association Study. Front. Genet. 13:811924. doi: 10.3389/fgene.2022.811924
Received: 09 November 2021; Accepted: 02 May 2022;
Published: 14 June 2022.
Edited by:
Frank M. You, Agriculture and Agri-Food Canada (AAFC), CanadaReviewed by:
Prashant Vikram, International Center for Biosaline Agriculture (ICBA), United Arab EmiratesLiqiang He, Hainan University, China
Copyright © 2022 Saroha, Pal, Gomashe, Akash, Kaur, Ujjainwal, Rajkumar, Aravind, Radhamani, Kumar, Chand, Sengupta and Wankhede. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Dhammaprakash Pandhari Wankhede, D.Wankhede@icar.gov.in